This document summarizes a study that aimed to identify auxiliary web images using a combination of analyses. The methodology involved analyzing page-level features such as image position, dimensions and the number of similarly sized images using the DOM, and analyzing image-level features such as the number of colors and faces using tools like PIL and OpenCV. An SVM classifier was trained on these features from a sample of pages and recognized auxiliary images with an average accuracy of 93.17% after grid-search optimization. While the current approach performed well, future work could involve context analysis, weighted features and adaptive page analysis to improve the system.
Identifying Auxiliary Web Images Using Combinations of Analyses
1. Identifying Auxiliary Web Images Using Combination of Analyses
Tewson Seeoun
Sirindhorn International Institute of Technology
With Guidance From
Asst. Prof. Dr. Toshiaki Kondo
Sirindhorn International Institute of Technology
Dr. Choochart Haruechaiyasak
Human Language Technology Laboratory, NECTEC, NSTDA
2. Agenda
● Introduction
● Background
● Document Object Model (DOM) in HTML
● Support Vector Machine (SVM)
● Objective
● Methodology
● Results
● Discussion / Future Work
● Conclusion
● Acknowledgement
3. Introduction
● Websites contain images.
● Some images are not necessary.
● Search Engine Indexing
● Printing
● Ignoring them is sometimes economical and green.
4. Background - DOM
● Web browsers / layout engines parse HTML / CSS / JavaScript into DOM.
● DOM represents things (elements) in a Web page.
● An element has properties (position, size, etc.).
● JavaScript sees DOM.
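As a rough illustration of how image elements and their properties can be read from parsed markup, here is a minimal sketch. It uses Python's standard `html.parser`, which is an assumption made for this example: the study itself loaded pages with PyQtWebKit, and a real layout engine also computes rendered position and size, which a plain markup parser cannot do. `ImgCollector`, `collect_images` and `SAMPLE_PAGE` are hypothetical names.

```python
from html.parser import HTMLParser


class ImgCollector(HTMLParser):
    """Collect the attributes of every <img> element in a page."""

    def __init__(self):
        super().__init__()
        self.images = []

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs; tag names arrive lowercased.
        if tag == "img":
            self.images.append(dict(attrs))


def collect_images(html):
    """Parse an HTML string and return one attribute dict per image."""
    parser = ImgCollector()
    parser.feed(html)
    parser.close()
    return parser.images


# Hypothetical page used only for this example.
SAMPLE_PAGE = """
<html><body>
  <img src="logo.png" width="120" height="40">
  <p>Article text</p>
  <img src="photo.jpg" width="640" height="480">
</body></html>
"""

images = collect_images(SAMPLE_PAGE)
```

Attribute values come back as strings and only reflect what the markup declares, so sizes set by CSS or JavaScript are invisible to this sketch.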
5. Background - SVM
● SVM is a supervised machine learning algorithm.
● SVM is used for statistical pattern recognition.
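To make "supervised" concrete, the sketch below trains a tiny linear SVM on labeled points by subgradient descent on the regularized hinge loss. This is an illustration only: the slides do not say which SVM implementation or kernel the study used, and the function names, toy data and hyperparameter values here are all assumptions.

```python
def train_linear_svm(samples, labels, lam=0.01, eta=0.1, epochs=200):
    """Fit weights w and bias b; labels must be +1 or -1."""
    w = [0.0] * len(samples[0])
    b = 0.0
    for _ in range(epochs):
        for x, y in zip(samples, labels):
            margin = y * (sum(wi * xi for wi, xi in zip(w, x)) + b)
            # Gradient step on the L2 penalty: shrink the weights slightly.
            w = [wi * (1.0 - eta * lam) for wi in w]
            if margin < 1.0:
                # The sample is inside the margin: push the boundary away from it.
                w = [wi + eta * y * xi for wi, xi in zip(w, x)]
                b += eta * y
    return w, b


def predict(w, b, x):
    """Return +1 or -1 depending on the side of the separating hyperplane."""
    score = sum(wi * xi for wi, xi in zip(w, x)) + b
    return 1 if score >= 0 else -1


# Toy, linearly separable data (hypothetical two-feature samples).
samples = [(2, 2), (3, 3), (3, 2), (-2, -2), (-3, -3), (-2, -3)]
labels = [1, 1, 1, -1, -1, -1]
w, b = train_linear_svm(samples, labels)
```

The labels play the role of the human judgments ("auxiliary" or not) and the tuples play the role of the extracted image features.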
6. Objective (for now)
To recognize patterns of auxiliary Web images quickly using DOM analysis and basic image processing.
7. Methodology
[Pipeline diagram. Inputs: HTML, CSS, JS, IMG files. Tools: PyQtWebKit, DOM, jQuery, Python, PIL, OpenCV, Tesseract, MySQL. Outputs: page-level, domain-level and image-level features, with labels.]
8. Methodology (continued)
● Image Level Features
● No. of Colors
● No. of Human Faces
● No. of Alphabetic Characters
● Page Level Features
● Position
● Dimension
● No. of Images with Similar Dimension
● Domain Level Features
● External / Internal Links
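The page-level and domain-level features listed above could be assembled into one numeric row per image roughly as follows. This is a sketch under assumptions: the dictionary keys, the pixel tolerance for "similar dimension", and the reading of "External / Internal" as "image host differs from page host" are all hypothetical choices, not details taken from the study.

```python
from urllib.parse import urljoin, urlparse


def is_external(page_url, image_src):
    """Return True when the image is served from a different host than the page."""
    image_host = urlparse(urljoin(page_url, image_src)).netloc
    return image_host != urlparse(page_url).netloc


def count_similar(images, target, tolerance=2):
    """Count the other images whose width and height are within `tolerance` pixels."""
    return sum(
        1
        for other in images
        if other is not target
        and abs(other["width"] - target["width"]) <= tolerance
        and abs(other["height"] - target["height"]) <= tolerance
    )


def page_features(page_url, images):
    """Build one feature row per image: position, dimension, similar count, external flag."""
    return [
        [
            img["x"], img["y"],
            img["width"], img["height"],
            count_similar(images, img),
            1 if is_external(page_url, img["src"]) else 0,
        ]
        for img in images
    ]


# Hypothetical page with two small icons and one banner from another host.
page_url = "http://www.example.com/news/index.html"
page_images = [
    {"src": "/icons/a.png", "x": 0, "y": 0, "width": 16, "height": 16},
    {"src": "/icons/b.png", "x": 20, "y": 0, "width": 16, "height": 16},
    {"src": "http://ads.example.net/banner.gif", "x": 0, "y": 100, "width": 468, "height": 60},
]
rows = page_features(page_url, page_images)
```

Image-level features (colors, faces, characters) would be appended to each row after the image file has been downloaded and processed.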
9. Methodology (continued)
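[Diagram: labeled samples stored in MySQL are split at random. 80% (500/626) go to SVM (Train), which produces a Model; the remaining 20% go to SVM (Predict), which uses the Model to produce Results.]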
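A random split of this kind can be sketched in a few lines. The counts come from the slide (500 of 626 samples for training, which leaves 126 for prediction); the function name, the fixed seed and the use of integer IDs in place of database rows are assumptions made for the example.

```python
import random


def split_samples(samples, n_train, seed=0):
    """Shuffle a copy of the samples and cut it into train and test parts."""
    shuffled = list(samples)
    random.Random(seed).shuffle(shuffled)
    return shuffled[:n_train], shuffled[n_train:]


# Integer IDs stand in for the 626 labeled images.
sample_ids = list(range(626))
train_ids, test_ids = split_samples(sample_ids, n_train=500)
```

Fixing the seed makes a single experiment repeatable; changing it gives a different random split for each repeated experiment.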
10. Results
10-fold Cross-Validation (10 Experiments)
Average Accuracy = 84.92%
After Applying Grid-Search Technique
Average Accuracy = 93.17%
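The two techniques named on this slide can be sketched generically. In the code below, `fit_and_score` is a hypothetical callable that trains a classifier on the training indices with the given parameters and returns its accuracy on the test indices. The slides do not say which parameters were searched, so any parameter names passed in the grid are assumptions.

```python
from itertools import product


def k_fold_indices(n_samples, k=10):
    """Yield (train_idx, test_idx) pairs; each sample is tested exactly once."""
    # Stride-based folds assume the samples were shuffled beforehand.
    folds = [list(range(i, n_samples, k)) for i in range(k)]
    for i, test_idx in enumerate(folds):
        train_idx = [j for f, fold in enumerate(folds) if f != i for j in fold]
        yield train_idx, test_idx


def cross_val_accuracy(fit_and_score, n_samples, params, k=10):
    """Average the accuracy returned by `fit_and_score` over k folds."""
    scores = [
        fit_and_score(train_idx, test_idx, **params)
        for train_idx, test_idx in k_fold_indices(n_samples, k)
    ]
    return sum(scores) / len(scores)


def grid_search(fit_and_score, n_samples, grid, k=10):
    """Try every parameter combination and keep the best average accuracy."""
    names = sorted(grid)
    best_params, best_score = None, -1.0
    for values in product(*(grid[name] for name in names)):
        params = dict(zip(names, values))
        score = cross_val_accuracy(fit_and_score, n_samples, params, k)
        if score > best_score:
            best_params, best_score = params, score
    return best_params, best_score
```

A call might look like `grid_search(my_fit_and_score, 626, {"C": [1, 10, 100], "gamma": [0.01, 0.1, 1]})`, where `my_fit_and_score` and the parameter names are placeholders for whatever classifier is being tuned.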
11. Discussion
● Some pages cannot be parsed.
● Frames and redirections
● Positions can be miscalculated.
● JavaScript used in displaying images
● CSS sprites
● Tesseract is not well-tuned.
● Small images have to be magnified, but how much?
● Downloading images for processing is a bottleneck.
● Features are not weighted.
● The definition of “auxiliary image” is subjective.
14. Acknowledgement
● NSTDA, NECTEC, and YSTP program
● Dr. Choochart Haruechaiyasak
● Dr. Toshiaki Kondo
● Mr. Krikamol Muendet
● And Many Others...