This document is a project report submitted by Jay Shah, Pooja Shah, and Tapan Desai for their Bachelor of Engineering degree. It describes their project titled "Smart Glass", which is an Android application that uses image recognition and processing to identify objects in photos taken by the user and provide additional information about the objects. The report outlines the proposed system, including its features such as document capture, visual search, visual commerce, and providing real-time information. It also discusses the project management process, including the project schedule, resources required, and risk mitigation strategy.
The document discusses Optical Character Recognition (OCR) and the IMPACT project. IMPACT is supported by the European Community under the FP7 ICT Work Programme and aims to improve access to historical text. It is coordinated by the National Library of the Netherlands and involves several other European libraries and research institutions. ABBYY provides OCR technology for the IMPACT members to recognize text in old documents.
The document presents a presentation on character recognition and conversion. It discusses the purpose of character recognition as document processing and speeding up recognition. It describes the architecture as containing templates, scanning, recognition, and coding. It details testing through sample and performance testing, showing the conversion of various images to text. It concludes by discussing applications and limitations of character recognition technology.
1. The document discusses an optical character recognition (OCR) system based on a grid infrastructure that aims to recognize characters from scanned documents and convert them to electronic format more effectively.
2. The proposed OCR system is divided into five modules: document processing, system training, document recognition, document editing, and document searching.
3. A feasibility study is conducted to analyze the technical, operational, and economic feasibility of the proposed OCR system. Training methods like supervised and unsupervised training are also discussed.
The document compares OCR (optical character recognition) software to Urjanet for processing utility bill data. OCR requires significant upfront costs, hardware, manual labor to scan, sort, and review bills for errors. It often results in incomplete, inconsistent data with high error rates. Urjanet provides automated, timely access to normalized utility bill data with minimal human intervention and no ongoing maintenance costs. Urjanet outputs data that is more accurate, detailed and arrives sooner than what organizations get from using OCR software.
The document describes a project to develop optical character recognition (OCR) software for recognizing online and offline handwritten text in multiple languages. It aims to recognize characters from scanned documents or real-time handwriting input and create a user profile. The system scope includes recognizing handwriting from multiple users and cursive script. It will store recognized characters in a text file and optionally convert words to audio for reading documents aloud. The document provides details on OCR technology, applications, literature review, user and system requirements, and the project's goal of using OCR for applications like forms processing.
The document discusses Optical Character Recognition (OCR) and describes the key steps and algorithms involved. It summarizes the main modules in an OCR system including pre-processing, feature extraction, classification, and post-processing. It then discusses two specific algorithms - Principal Component Analysis and Learning Vector Quantization - that can be used to implement OCR. The document also evaluates the feasibility and provides a high-level design for an OCR system including graphical user interface, scanner, training, and main modules.
IOSR Journal of Computer Engineering (IOSR-JCE) is a double blind peer reviewed International Journal that provides rapid publication (within a month) of articles in all areas of computer engineering and its applications. The journal welcomes publications of high quality papers on theoretical developments and practical applications in computer technology. Original research papers, state-of-the-art reviews, and high quality technical notes are invited for publications.
The document discusses Optical Character Recognition (OCR) and the IMPACT project. IMPACT is supported by the European Community under the FP7 ICT Work Programme and aims to improve access to historical text. It is coordinated by the National Library of the Netherlands and involves several other European libraries and research institutions. ABBYY provides OCR technology for the IMPACT members to recognize text in old documents.
The document presents a presentation on character recognition and conversion. It discusses the purpose of character recognition as document processing and speeding up recognition. It describes the architecture as containing templates, scanning, recognition, and coding. It details testing through sample and performance testing, showing the conversion of various images to text. It concludes by discussing applications and limitations of character recognition technology.
1. The document discusses an optical character recognition (OCR) system based on a grid infrastructure that aims to recognize characters from scanned documents and convert them to electronic format more effectively.
2. The proposed OCR system is divided into five modules: document processing, system training, document recognition, document editing, and document searching.
3. A feasibility study is conducted to analyze the technical, operational, and economic feasibility of the proposed OCR system. Training methods like supervised and unsupervised training are also discussed.
The document compares OCR (optical character recognition) software to Urjanet for processing utility bill data. OCR requires significant upfront costs, hardware, manual labor to scan, sort, and review bills for errors. It often results in incomplete, inconsistent data with high error rates. Urjanet provides automated, timely access to normalized utility bill data with minimal human intervention and no ongoing maintenance costs. Urjanet outputs data that is more accurate, detailed and arrives sooner than what organizations get from using OCR software.
The document describes a project to develop optical character recognition (OCR) software for recognizing online and offline handwritten text in multiple languages. It aims to recognize characters from scanned documents or real-time handwriting input and create a user profile. The system scope includes recognizing handwriting from multiple users and cursive script. It will store recognized characters in a text file and optionally convert words to audio for reading documents aloud. The document provides details on OCR technology, applications, literature review, user and system requirements, and the project's goal of using OCR for applications like forms processing.
The document discusses Optical Character Recognition (OCR) and describes the key steps and algorithms involved. It summarizes the main modules in an OCR system including pre-processing, feature extraction, classification, and post-processing. It then discusses two specific algorithms - Principal Component Analysis and Learning Vector Quantization - that can be used to implement OCR. The document also evaluates the feasibility and provides a high-level design for an OCR system including graphical user interface, scanner, training, and main modules.
IOSR Journal of Computer Engineering (IOSR-JCE) is a double blind peer reviewed International Journal that provides rapid publication (within a month) of articles in all areas of computer engineering and its applications. The journal welcomes publications of high quality papers on theoretical developments and practical applications in computer technology. Original research papers, state-of-the-art reviews, and high quality technical notes are invited for publications.
Voice Based Search Engine for Visually Impairment PeoplesIJASRD Journal
World Wide Web (WWW) is unexpectedly emerging because the accepted records supply for our society. The WWW is normally reachable the usage of an internet-browsing package from a networked pc. The layout of facts on the net is visually orientated. The reliance on visible presentation locations excessive cognitive demands on a person to function this sort of system. The interaction might also sometimes require the whole attention of a consumer. The design of information presentation at the web is predominately visible-oriented. This presentation technique requires most, if no longer all, of the person’s attention and imposes significant cognitive load on a user. This technique isn't always sensible, in particular for the visually impaired persons. The awareness of this challenge is to develop a prototype which supports net browsing the use of a speech-based interface, e.g. A telephone, and to degree its effectiveness. The command input and the delivery of web contents are totally in voice. Audio icons are constructed into the prototype so that users will have higher knowledge of the original shape/purpose of a web page. Navigation and manage commands are available to decorate the net browsing enjoy. The effectiveness of this prototype is evaluated in a consumer take a look at involving both generally sighted and visually impaired humans. Voice browsers allow human beings to get right of entry to the Web the usage of speech synthesis, pre-recorded audio, and speech reputation. This may be supplemented via keypads and small presentations. Voice may also be supplied as an accessory to standard computing device browsers with high resolution graphical presentations, presenting an on hand alternative to the use of the keyboard or screen, as an instance in cars in which palms/eyes unfastened operation is crucial. Voice interplay can get away the bodily obstacles on keypads and shows as cell devices turn out to be ever smaller. The browser will have an integrated textual content extraction engine that inspects the content of the page to construct a structured illustration. The inner nodes of the structure constitute diverse tiers of abstraction of the content. This enables in easy and bendy navigation of the page so that it will hastily home into gadgets of interest.
Chat-Bot for College Management System using A.IIRJET Journal
This document discusses developing a chatbot for a college management system using artificial intelligence. It would analyze user queries about college activities and provide responses. Users could ask questions through the chatbot without going to the college in person. Natural language processing and sentiment analysis techniques would be used to understand questions and determine appropriate responses from the knowledge database. The proposed system would include user registration and login, categorizing questions, using AI algorithms to analyze questions and provide answers, and interfacing with a database to retrieve information.
This document is a project report submitted as a minor project for a bachelor's degree in computer science and engineering. It describes the development of an information system called "Information for Tutor Guardian" or i-TG. The system was created to track student academic information and make it easily accessible to staff, faculty advisors, and committee members. The document includes sections on certificates, acknowledgements, table of contents, list of figures, introduction, background/literature review, process model, design, technical details, coding, screen layouts, future enhancements, and conclusion. It was submitted by three students under the guidance of a professor from the computer science department.
IRJET- College Enquiry Chatbot System(DMCE)IRJET Journal
The document describes a college enquiry chatbot system called DMCE that was developed by students and a professor at Datta Meghe College of Engineering. The chatbot uses artificial intelligence and machine learning to answer students' questions about college-related activities and events. It analyzes user queries through natural language processing and provides responses using an artificial intelligence markup language called AIML. The chatbot aims to reduce the need for students to personally visit the college to get information by providing an automated online service via a mobile application with a graphical user-friendly interface.
IRJET - Visual E-Commerce Application using Deep LearningIRJET Journal
This document discusses developing a visual e-commerce application using deep learning for object recognition. It proposes a system with three phases: 1) scanning real-time objects using an Android app and YOLO algorithm for object detection, 2) detecting the objects, and 3) recommending similar products to the user. The goal is to make e-commerce searching more user-friendly by allowing image-based searches rather than relying solely on text. This could help illiterate users and improve the shopping experience overall. The document reviews related work on object recognition techniques and visual search engines to provide context.
This document describes a project report submitted for a Bachelor of Technology degree. The report details the development of a chatbot created in Python by two students, Garvit Bajpai and Rakesh Kumar Kannaujiya, under the guidance of their professor Mr. Abhinandan Tripathi. The report provides information on the background, literature review, proposed solution, implementation, advantages and disadvantages of creating a chatbot in Python.
This document proposes a system to detect fake product reviews on e-commerce sites. The system uses sentiment analysis, content similarity analysis, and review deviation analysis to identify fake reviews. It extracts product reviews from websites, preprocesses the data, and uses three techniques to detect fake reviews. The fake reviews are then used to train a classifier to label new reviews as fake or genuine. The system was able to detect 111 fake reviews out of 300 with the classifier identifying an additional 18 fake reviews. The techniques aim to make online shopping reviews more trustworthy.
1. The document describes a proposed system called HAPPI (HAsh-based and PiPelIned) architecture for hardware-enhanced association rule mining. HAPPI aims to solve performance bottlenecks in existing Apriori-based hardware schemes by reducing the frequency of loading the database into hardware.
2. HAPPI includes three hardware modules - a systolic array to compare candidate itemsets with database items, a trimming filter to eliminate infrequent items, and a hash table to filter unnecessary candidate itemsets.
3. The proposed HAPPI system is intended to address limitations of existing Apriori-based approaches that involve repeatedly loading large candidate itemsets and databases into hardware.
The document describes the development of an online job portal system. The system allows job seekers to create profiles, upload resumes and apply for jobs posted by employers. Employers can post job listings, search resume databases and block candidates. The system aims to automate the manual job recruitment process and make it easier for job seekers and employers to connect. It was developed using PHP and MySQL on a LAMP stack with a distributed architecture and centralized database storage.
Mobile Phone Examiner Plus is forensic analysis software from AccessData that supports over 6800 mobile devices running iOS or Android. It allows extraction of physical, logical, and application data without requiring iTunes. The software automatically roots Android devices and extracts data, clones SIM cards, locks or exports data to Excel. It provides visualization of call logs, social timelines, application folders, and communication methods to analyze phone usage. The package includes the software, SIM card reader, and USB cables for $4,260.
A black-box-approach-for-response-quality-evaluation-of-conversational-agent-...Cemal Ardil
The document discusses the challenges of evaluating conversational agent systems. It proposes a black-box approach to evaluate response quality using observation, classification, and scoring. This approach is used to assess three example systems: AnswerBus, START, and AINI. The evaluation of conversational agents is difficult because existing information retrieval metrics do not apply, and there is no single correct answer for many questions.
Novel character segmentation reconstruction approach for license plate recogn...Conference Papers
The document proposes a new approach called partial character reconstruction for segmenting characters in license plate images to improve license plate recognition performance. It introduces using angular information and stroke width properties in different domains to segment characters and then reconstruct their complete shapes for recognition. Experimental results on several benchmark license plate databases and video databases show the technique is effective in handling images affected by multiple challenges.
this is VTU FINAL YEAR PROJECT REPORT full report is attached below.this alone with front pages attached Front pages report follows all the guidelines specified by vtu according to our college.
This document is a project report for a File Tracking System created by Sabana Maharjan. It includes a disclaimer, recommendation from the project supervisor, and a certificate of completion signed by external and internal examiners. The acknowledgments thank the computer science department and supervisor for their support. The abstract describes the project as an application to efficiently collect and track file information. It will use cloud technology and push notifications to alert users. The table of contents outlines the following chapters which cover introduction and background, literature review, methodology, system analysis and design, and implementation details.
Khetrabasi Rout is a software engineer with over 2 years of experience developing Java web applications using technologies like Spring, Hibernate, and Jasper Reports. He has worked on projects including an energy management system, healthcare EMR system, GPS tracking system, and biometric security system. His responsibilities included requirements gathering, coding, database development, testing, and deploying software. He is proficient in Java, HTML/CSS, JavaScript, databases like MySQL and MongoDB, and frameworks like Spring and Spring Boot.
Khetrabasi Rout is a software engineer with over 2 years of experience developing Java web applications using technologies like Spring, Hibernate, and Jasper Reports. He has worked on projects including an energy management system, healthcare EMR system, GPS tracking system, and biometrics security system. He is proficient in Java, HTML/CSS, JavaScript, databases like MySQL and MongoDB, frameworks like Spring and frameworks like Spring Boot. He has a MCA degree from Khallikote College and seeks new opportunities as a software engineer.
This document contains summaries of 26 different projects related to various domains including networking, image processing, healthcare, education, and more. The projects cover a range of modules and functionalities like user registration, file uploading/downloading, password authentication, device control, and more. The technologies used include Java, J2EE, SQL, and various other tools and platforms.
The document summarizes a student management system project developed using Java and Microsoft Access. It includes an introduction, existing manual system overview, proposed computerized system features, database tables, screenshots and a conclusion. The proposed system aims to automate the existing manual student records maintenance process through a user-friendly interface, centralized database, and reports generation capabilities. It analyzes the requirements and provides use case and data flow diagrams for the new computerized student management system.
DETECTION OF FAKE ACCOUNTS IN INSTAGRAM USING MACHINE LEARNINGijcsit
With the advent of the Internet and social media, while hundreds of people have benefitted from the vast sources of information available, there has been an enormous increase in the rise of cyber-crimes, particularly targeted towards women. According to a 2019 report in the [4] Economics Times, India has witnessed a 457% rise in cybercrime in the five year span between 2011 and 2016. Most speculate that this is due to impact of social media such as Facebook, Instagram and Twitter on our daily lives. While these definitely help in creating a sound social network, creation of user accounts in these sites usually needs just an email-id. A real life person can create multiple fake IDs and hence impostors can easily be made. Unlike the real world scenario where multiple rules and regulations are imposed to identify oneself in a unique manner (for example while issuing one’s passport or driver’s license), in the virtual world of social media, admission does not require any such checks. In this paper, we study the different accounts of Instagram, in particular and try to assess an account as fake or real using Machine Learning techniques namely Logistic Regression and Random Forest Algorithm.
7-12-13 Contech Quality Control Track SheetCharles Neel
This quality control track sheet documents the progress and completion of various installation and testing tasks for a project involving an NRU unit. It lists tasks such as conduit installation, cable tray installation, wire pulling, wire termination, instrument wire installation, wire insulation, motor winding tests, and bolt torque checks. For each task, it records the date completed or notes if a task is not applicable. Progress is tracked for a P-1 reflux pump.
Voice Based Search Engine for Visually Impairment PeoplesIJASRD Journal
World Wide Web (WWW) is unexpectedly emerging because the accepted records supply for our society. The WWW is normally reachable the usage of an internet-browsing package from a networked pc. The layout of facts on the net is visually orientated. The reliance on visible presentation locations excessive cognitive demands on a person to function this sort of system. The interaction might also sometimes require the whole attention of a consumer. The design of information presentation at the web is predominately visible-oriented. This presentation technique requires most, if no longer all, of the person’s attention and imposes significant cognitive load on a user. This technique isn't always sensible, in particular for the visually impaired persons. The awareness of this challenge is to develop a prototype which supports net browsing the use of a speech-based interface, e.g. A telephone, and to degree its effectiveness. The command input and the delivery of web contents are totally in voice. Audio icons are constructed into the prototype so that users will have higher knowledge of the original shape/purpose of a web page. Navigation and manage commands are available to decorate the net browsing enjoy. The effectiveness of this prototype is evaluated in a consumer take a look at involving both generally sighted and visually impaired humans. Voice browsers allow human beings to get right of entry to the Web the usage of speech synthesis, pre-recorded audio, and speech reputation. This may be supplemented via keypads and small presentations. Voice may also be supplied as an accessory to standard computing device browsers with high resolution graphical presentations, presenting an on hand alternative to the use of the keyboard or screen, as an instance in cars in which palms/eyes unfastened operation is crucial. Voice interplay can get away the bodily obstacles on keypads and shows as cell devices turn out to be ever smaller. The browser will have an integrated textual content extraction engine that inspects the content of the page to construct a structured illustration. The inner nodes of the structure constitute diverse tiers of abstraction of the content. This enables in easy and bendy navigation of the page so that it will hastily home into gadgets of interest.
Chat-Bot for College Management System using A.IIRJET Journal
This document discusses developing a chatbot for a college management system using artificial intelligence. It would analyze user queries about college activities and provide responses. Users could ask questions through the chatbot without going to the college in person. Natural language processing and sentiment analysis techniques would be used to understand questions and determine appropriate responses from the knowledge database. The proposed system would include user registration and login, categorizing questions, using AI algorithms to analyze questions and provide answers, and interfacing with a database to retrieve information.
This document is a project report submitted as a minor project for a bachelor's degree in computer science and engineering. It describes the development of an information system called "Information for Tutor Guardian" or i-TG. The system was created to track student academic information and make it easily accessible to staff, faculty advisors, and committee members. The document includes sections on certificates, acknowledgements, table of contents, list of figures, introduction, background/literature review, process model, design, technical details, coding, screen layouts, future enhancements, and conclusion. It was submitted by three students under the guidance of a professor from the computer science department.
IRJET- College Enquiry Chatbot System(DMCE)IRJET Journal
The document describes a college enquiry chatbot system called DMCE that was developed by students and a professor at Datta Meghe College of Engineering. The chatbot uses artificial intelligence and machine learning to answer students' questions about college-related activities and events. It analyzes user queries through natural language processing and provides responses using an artificial intelligence markup language called AIML. The chatbot aims to reduce the need for students to personally visit the college to get information by providing an automated online service via a mobile application with a graphical user-friendly interface.
IRJET - Visual E-Commerce Application using Deep LearningIRJET Journal
This document discusses developing a visual e-commerce application using deep learning for object recognition. It proposes a system with three phases: 1) scanning real-time objects using an Android app and YOLO algorithm for object detection, 2) detecting the objects, and 3) recommending similar products to the user. The goal is to make e-commerce searching more user-friendly by allowing image-based searches rather than relying solely on text. This could help illiterate users and improve the shopping experience overall. The document reviews related work on object recognition techniques and visual search engines to provide context.
This document describes a project report submitted for a Bachelor of Technology degree. The report details the development of a chatbot created in Python by two students, Garvit Bajpai and Rakesh Kumar Kannaujiya, under the guidance of their professor Mr. Abhinandan Tripathi. The report provides information on the background, literature review, proposed solution, implementation, advantages and disadvantages of creating a chatbot in Python.
This document proposes a system to detect fake product reviews on e-commerce sites. The system uses sentiment analysis, content similarity analysis, and review deviation analysis to identify fake reviews. It extracts product reviews from websites, preprocesses the data, and uses three techniques to detect fake reviews. The fake reviews are then used to train a classifier to label new reviews as fake or genuine. The system was able to detect 111 fake reviews out of 300 with the classifier identifying an additional 18 fake reviews. The techniques aim to make online shopping reviews more trustworthy.
1. The document describes a proposed system called HAPPI (HAsh-based and PiPelIned) architecture for hardware-enhanced association rule mining. HAPPI aims to solve performance bottlenecks in existing Apriori-based hardware schemes by reducing the frequency of loading the database into hardware.
2. HAPPI includes three hardware modules - a systolic array to compare candidate itemsets with database items, a trimming filter to eliminate infrequent items, and a hash table to filter unnecessary candidate itemsets.
3. The proposed HAPPI system is intended to address limitations of existing Apriori-based approaches that involve repeatedly loading large candidate itemsets and databases into hardware.
The document describes the development of an online job portal system. The system allows job seekers to create profiles, upload resumes and apply for jobs posted by employers. Employers can post job listings, search resume databases and block candidates. The system aims to automate the manual job recruitment process and make it easier for job seekers and employers to connect. It was developed using PHP and MySQL on a LAMP stack with a distributed architecture and centralized database storage.
Mobile Phone Examiner Plus is forensic analysis software from AccessData that supports over 6800 mobile devices running iOS or Android. It allows extraction of physical, logical, and application data without requiring iTunes. The software automatically roots Android devices and extracts data, clones SIM cards, locks or exports data to Excel. It provides visualization of call logs, social timelines, application folders, and communication methods to analyze phone usage. The package includes the software, SIM card reader, and USB cables for $4,260.
A black-box-approach-for-response-quality-evaluation-of-conversational-agent-...Cemal Ardil
The document discusses the challenges of evaluating conversational agent systems. It proposes a black-box approach to evaluate response quality using observation, classification, and scoring. This approach is used to assess three example systems: AnswerBus, START, and AINI. The evaluation of conversational agents is difficult because existing information retrieval metrics do not apply, and there is no single correct answer for many questions.
Novel character segmentation reconstruction approach for license plate recogn...Conference Papers
The document proposes a new approach called partial character reconstruction for segmenting characters in license plate images to improve license plate recognition performance. It introduces using angular information and stroke width properties in different domains to segment characters and then reconstruct their complete shapes for recognition. Experimental results on several benchmark license plate databases and video databases show the technique is effective in handling images affected by multiple challenges.
this is VTU FINAL YEAR PROJECT REPORT full report is attached below.this alone with front pages attached Front pages report follows all the guidelines specified by vtu according to our college.
This document is a project report for a File Tracking System created by Sabana Maharjan. It includes a disclaimer, recommendation from the project supervisor, and a certificate of completion signed by external and internal examiners. The acknowledgments thank the computer science department and supervisor for their support. The abstract describes the project as an application to efficiently collect and track file information. It will use cloud technology and push notifications to alert users. The table of contents outlines the following chapters which cover introduction and background, literature review, methodology, system analysis and design, and implementation details.
Khetrabasi Rout is a software engineer with over 2 years of experience developing Java web applications using technologies like Spring, Hibernate, and Jasper Reports. He has worked on projects including an energy management system, healthcare EMR system, GPS tracking system, and biometric security system. His responsibilities included requirements gathering, coding, database development, testing, and deploying software. He is proficient in Java, HTML/CSS, JavaScript, databases like MySQL and MongoDB, and frameworks like Spring and Spring Boot.
Khetrabasi Rout is a software engineer with over 2 years of experience developing Java web applications using technologies like Spring, Hibernate, and Jasper Reports. He has worked on projects including an energy management system, healthcare EMR system, GPS tracking system, and biometrics security system. He is proficient in Java, HTML/CSS, JavaScript, databases like MySQL and MongoDB, frameworks like Spring and frameworks like Spring Boot. He has a MCA degree from Khallikote College and seeks new opportunities as a software engineer.
This document contains summaries of 26 different projects related to various domains including networking, image processing, healthcare, education, and more. The projects cover a range of modules and functionalities like user registration, file uploading/downloading, password authentication, device control, and more. The technologies used include Java, J2EE, SQL, and various other tools and platforms.
The document summarizes a student management system project developed using Java and Microsoft Access. It includes an introduction, existing manual system overview, proposed computerized system features, database tables, screenshots and a conclusion. The proposed system aims to automate the existing manual student records maintenance process through a user-friendly interface, centralized database, and reports generation capabilities. It analyzes the requirements and provides use case and data flow diagrams for the new computerized student management system.
DETECTION OF FAKE ACCOUNTS IN INSTAGRAM USING MACHINE LEARNINGijcsit
With the advent of the Internet and social media, while hundreds of people have benefitted from the vast sources of information available, there has been an enormous increase in the rise of cyber-crimes, particularly targeted towards women. According to a 2019 report in the [4] Economics Times, India has witnessed a 457% rise in cybercrime in the five year span between 2011 and 2016. Most speculate that this is due to impact of social media such as Facebook, Instagram and Twitter on our daily lives. While these definitely help in creating a sound social network, creation of user accounts in these sites usually needs just an email-id. A real life person can create multiple fake IDs and hence impostors can easily be made. Unlike the real world scenario where multiple rules and regulations are imposed to identify oneself in a unique manner (for example while issuing one’s passport or driver’s license), in the virtual world of social media, admission does not require any such checks. In this paper, we study the different accounts of Instagram, in particular and try to assess an account as fake or real using Machine Learning techniques namely Logistic Regression and Random Forest Algorithm.
7-12-13 Contech Quality Control Track SheetCharles Neel
This quality control track sheet documents the progress and completion of various installation and testing tasks for a project involving an NRU unit. It lists tasks such as conduit installation, cable tray installation, wire pulling, wire termination, instrument wire installation, wire insulation, motor winding tests, and bolt torque checks. For each task, it records the date completed or notes if a task is not applicable. Progress is tracked for a P-1 reflux pump.
This document describes a project to build a web crawler and search engine to provide student information to students. It will scrape data like exam results, college details, and fees from other websites and provide the information to students in a searchable online interface. The system will include a desktop application for scraping data and storing it in a SQL Server database. It will also have a web application for students to search for their results or compare results with other students. The project aims to make student exam data and materials easily available from a single portal.
COPY TEXT FROM IMAGE - Optical Character Recognition (OCR) kannan1171
Step 1: Open OneNote and use the screen clipping tool to select an area of an image file containing text.
Step 2: The clipped image is inserted into OneNote. Right click the image and select "Copy Text from Picture" to extract the text.
Step 3: The text is then pasted into OneNote or another document, allowing easy copying of text from image files into Word documents or other formats for documentation generation.
A simple natural language interface application for launching applications and showing user information based on voice input processed by using natural language programming concepts
E-tendering is an online process for carrying out the entire tendering cycle, including price bid submission, to harness the efficiency, economy, and speed of the internet. The key modules of the e-tendering cycle include tender notice creation, promotion, document distribution, bid submission, payment processing, evaluation, negotiation, awarding, and storage. Benefits include a completely automated and transparent process, time and cost savings, easier vendor participation, and improved record keeping.
Smart home device system using arduino uno & X-Beepremdeshmane
The document discusses the development of a user-centric intelligent home automation system using Arduino Uno and XBee. It defines the problem as controlling home devices wirelessly using a graph system. The objectives are to allow convenient, safe, energy-efficient and secure control of devices via mobile. It describes the existing remote control methods and chooses a Windows OS-based system using .NET and C/C++ to interface with Arduino, XBee and home devices.
Artificial neural networks are commonly used in optical character recognition algorithms due to their flexibility, ability to learn, and power. ANNs work by taking an input, running it through a network of neurons arranged in layers, and producing an output. They can be trained to recognize patterns through a learning stage where they are given many examples of input and output pairs. Once trained, ANNs can accurately evaluate new inputs and recognize characters at a 98% rate with only 5% error. Common types of ANNs include feedforward, recurrent, radial basis function, and self-organizing networks.
This document discusses fraud detection in online auctions. It begins with an introduction that describes how online auctions work and the types of fraud that can occur, such as sellers not delivering purchased items or posting fake listings. It then outlines the hardware and software requirements for developing a fraud detection system, including using Java, Tomcat web server, and MySQL database. The document provides literature reviews on these technologies and describes the existing system, proposed improved system, and system design modules.
This document presents a proposal for a data leakage detection system using fake object generation. It begins with an introduction that defines data leakage and notes the increasing costs associated with data breaches. The proposed system aims to improve detection of guilty agents that leak data by having the data distributor add fake objects to the different data sets distributed to various agents. This allows the distributor to identify the source if the fake objects are later discovered in leaked data. The document outlines the system architecture, modules including database maintenance and agent monitoring, and concludes that the approach of distributing different fake objects can help assess the likelihood of agents being responsible for data leaks.
LICENSE NUMBER PLATE RECOGNITION SYSTEM USING ANDROID APPAditya Mishra
The document outlines the development of a number plate recognition system using optical character recognition, including analyzing existing approaches, designing the system architecture, specifying functional and non-functional requirements, and testing the system. It also provides integrated summaries of several research papers on topics like automatic number plate recognition, optical character recognition techniques, and license plate recognition using OCR and template matching.
This document outlines an E-Tender System project developed using ASP.Net and SQL Server. It discusses the objectives of creating a website to advertise tenders online to help connect buyers and sellers. The project aims to automate the current manual tender process to save time and resources. Key features of the E-Tender System include online registration, searching for tenders, and advertising tenders for sale. It is intended to provide fast, easy, and accurate management of tender data and information.
This document provides an overview of a text recognition software project for Android mobile devices. The project aims to develop an application that can capture an image using a mobile device, localize text regions in the image, recognize the text, and optionally integrate it with applications like translation. Key requirements include capturing images with 2-4 megapixels, localizing text regions on images with homogeneous backgrounds, recognizing printed alphanumeric text in a limited set of fonts, and developing a translation application using the recognized text. Non-functional requirements address performance, safety, security, and software quality attributes like portability, maintainability, and reliability. The document describes the intended users, scope, features, design constraints, and provides detailed functional requirements for the image
Design and implementation of students timetable management systemNnachi Isaac Onuwa
This document summarizes a student's project on designing and implementing a mobile-based timetable management system for the Department of Computer Science at Akanu Ibiam Federal Polytechnic. The project aims to address the problems with the current manual timetabling system, such as delays in producing timetables and inability to make last-minute changes. The student proposes developing a mobile application using genetic algorithms and technologies like Java, XML and PHP to automate the timetabling process and make timetables easily accessible to students and staff. The application will store timetable data in a MySQL database and be accessible via Android mobile devices for improved convenience.
The authors propose strategies for detecting data leakage when sensitive data is shared with third parties (agents). They develop a model to assess the likelihood that leaked data came from one or more agents versus being independently gathered. The strategies involve how to distribute data objects among agents in a way that improves the ability to identify leakages. Some strategies involve injecting "fake but realistic" data objects that act as watermarks without modifying real data. The strategies are evaluated based on their ability to identify leakers in different data leakage scenarios.
UML (Unified Modeling Language) is a standard modeling language used to visualize, specify, construct, and document software systems. It uses graphical notation to depict systems from initial design through detailed design. Common UML diagram types include use case diagrams, class diagrams, sequence diagrams, activity diagrams, and state machine diagrams. UML provides a standard way to communicate designs across development teams and is supported by many modeling tools.
The document describes an online shopping cart system. It includes sections on the system scope, software requirements specification, functional model using data flow diagrams, activity diagrams showing the customer workflow, and a use case diagram showing actors and use cases. The functional model section includes data flow diagrams for the overall system, product listing, administrator, and secure gateway provider. The activity diagram shows the customer's shopping, checkout, and order cancellation processes. The use case diagram identifies the customer and purchased item as actors and use cases for viewing products, purchasing an item, and cancelling an order.
Research: Developing an Interactive Web Information Retrieval and Visualizati...Roman Atachiants
The document describes developing an interactive web information retrieval and visualization system. The system aims to make information searching and presentation easier and more efficient. It does this through speech recognition, keyword extraction from text, query construction and expansion using concepts, filtering and summarizing search results, and visualization. The system architecture includes these main components and was tested with satisfactory results. However, some challenges remain in creating a smooth presentation experience.
Efficient and accurate object detection has been an important topic in the advancement of computer vision systems.
Our project aims to detect the object with the goal of achieving high accuracy with a real-time performance.
In this project, we use a completely deep learning based approach to solve the problem of object detection.
The input to the system will be a real time image, and the output will be a bounding box corresponding to all the objects in the image, along with the class of object in each box.
Objective
Develop a application that detects an object and it can be used for vehicles counting, when the object is a vehicle such as a bicycle or car, it can count how many vehicles have passed from a particular area or road and it can recognize human activity too.
The document discusses Securonix, a cybersecurity company that provides next-generation security analytics solutions. It describes Securonix's mission to help organizations detect and respond to cyber threats through its SNYPR Security Analytics Platform. The platform uses big data technologies like Apache Hadoop and Kafka to collect and analyze billions of security events per day to identify advanced persistent threats, insider threats, and other risks. It also provides log management, security information and event management, and user and entity behavior analytics capabilities on a single platform.
Organisering av digitale prosjekt: Hva har IT-bransjen lært om store prosjekter?Torgeir Dingsøyr
IT-bransjen har gjort store endringer i måten de gjennomfører prosjekter på gjennom bruk av smidige metoder. Disse metodene ble først brukt på små, samlokaliserte team men brukes nå også i store prosjekter med mange team og flere hundre utviklere. Hvordan jobber IT-bransjen for å sikre vellykkede store prosjekter?
This is my thesis part of my MTech in Data Analytics. It talks about Malware analysis using static, dymanic and memory analysis techniques. This paper also focuses on implementation algo approach etc in detail.
The document describes a dissertation submitted for a Bachelor of Science degree. It outlines the objectives of creating an interactive web tutorial to inform newly diagnosed diabetics about their condition in a fun way. It details the methodology, which involved designing page layouts, learning Flash and ActionScript for interactivity, and creating a Java game applet. The achievements were successfully meeting the objectives of an informative yet entertaining resource through the implemented system, which utilized HTML, CSS, JavaScript, Flash and Java applets.
Design and Development of a Knowledge Community SystemHuu Bang Le Phan
The document is a dissertation submitted by Le Phan Huu Bang to the Department of Computer Science at the National University of Singapore in 2008/2009 describing the design and development of a Knowledge Community System (K-Comm). The dissertation includes chapters on introducing knowledge and the need for knowledge sharing, reviewing existing literature, providing an overview of the K-Comm system and its features, and describing the implementation of K-Comm.
This document describes a thesis submitted to fulfill the requirements for a Bachelor of Technology degree in computer science and engineering. The thesis aims to improve the efficiency of identifying members of a community using seed set expansion. It explores different seed expansion algorithms and identifies opportunities to improve performance. The authors develop their own modification of the PageRank algorithm and show it achieves higher performance than existing approaches. They evaluate their methods on multiple publicly available datasets containing ground-truth communities.
Final Year Project-Gesture Based Interaction and Image ProcessingSabnam Pandey, MBA
This document summarizes a student's final year project report on developing a gesture recognition system for browsing pictures. The student aims to implement algorithms for skin and contour detection of a user's hand in real-time images from a webcam. The report includes chapters on literature review of gesture recognition and image processing techniques, methodology using the waterfall model, requirements analysis and design diagrams, implementation details using OpenCV, and testing and evaluation of the project objectives and aims.
Design and Development Of Automated Examination System.Shivakant Dubey
This document provides a synopsis for a project to design and develop an automated examination system. The current manual examination process is prone to errors. The project aims to create a computerized system to efficiently store and manage large amounts of student examination data. It will allow administrators to maintain databases and automatically calculate marks, grades, and print transcripts. The system is intended to avoid errors from manual handling of examination data.
Restaurant e-menu on iPad, Rapid Application Development (RAD), Model-View-Controller (MVC), ASP.Net, Xcode, Web services, iPad application and mobile application development.
This document provides an overview of a project report on a social media application called Eywry. The report was submitted by Hridam Dhimal and Priyanka Sharma to the Department of Computer Application at DAV College to fulfill the requirements for a Bachelor's degree in Computer Application. It includes recommendations from their supervisor, approval letters, acknowledgments, and an abstract describing the project's goal of developing a social media application focused on spreading positivity. The report is organized into chapters covering introduction and background, system analysis and design, implementation and testing, and conclusions.
E learning project report (Yashraj Nigam)Yashraj Nigam
This document presents a major project report on an E-Learning (Web Based Learning System) submitted in partial fulfillment of the requirements for a Bachelor of Engineering degree. The document includes a declaration signed by the three project team members confirming the originality of the work. It also includes certificates signed by the project supervisor and institute heads. The document provides an acknowledgement of the guidance and support received. It includes lists of figures and tables as well as the table of contents. It introduces the scope and objectives of the project to develop an E-Learning management system to automate processes like managing student, class, assignment, quiz and question details in order to increase efficiency and proper resource management.
This document provides an industrial training report on a "Quiz System" project completed at Webtek Labs Pvt. Ltd. It includes sections on the organization profile, introduction of the project, problem specification, objectives, system analysis including feasibility study and hardware/software requirements. The development environment utilized Oracle database for backend and NetBeans IDE 8.1 for frontend development. The report describes testing and implementation of the quiz system including screenshots. It provides an overview of using a computerized quiz system to overcome limitations of manual systems and enable students to take quizzes, view results, and assess learning.
The document discusses software architecture for developers. It covers topics such as what software architecture is, the software architecture role, designing software through understanding requirements, quality attributes, and constraints. It also discusses visualizing software using sketches and diagrams, documenting software architecture, and provides several case studies and examples. The key points are that software architecture is important for avoiding chaos, a good architecture enables agility, and the software architecture role involves both technical and soft skills as well as coding, coaching and collaboration.
A Usability Evaluation carried out on my second year Brunel Group project.
A.R.C. (Augmented Reality Communicator), is an augmented reality social networking application , designed and built for my second year group project at Brunel University.
Software Requirements Specification on Student Information System (SRS on SIS)Minhas Kamal
The document summarizes the inception phase of requirements engineering for developing a Student Information System for the Institute of Information Technology at the University of Dhaka. Key activities in the inception phase included identifying stakeholders such as students, faculty, administrators and alumni; eliciting their requirements through discussions; identifying common and conflicting requirements; and prioritizing requirements to develop an initial set of requirements for the system.
Sudheer Mechineni, Head of Application Frameworks, Standard Chartered Bank
Discover how Standard Chartered Bank harnessed the power of Neo4j to transform complex data access challenges into a dynamic, scalable graph database solution. This keynote will cover their journey from initial adoption to deploying a fully automated, enterprise-grade causal cluster, highlighting key strategies for modelling organisational changes and ensuring robust disaster recovery. Learn how these innovations have not only enhanced Standard Chartered Bank’s data infrastructure but also positioned them as pioneers in the banking sector’s adoption of graph technology.
“An Outlook of the Ongoing and Future Relationship between Blockchain Technologies and Process-aware Information Systems.” Invited talk at the joint workshop on Blockchain for Information Systems (BC4IS) and Blockchain for Trusted Data Sharing (B4TDS), co-located with with the 36th International Conference on Advanced Information Systems Engineering (CAiSE), 3 June 2024, Limassol, Cyprus.
In his public lecture, Christian Timmerer provides insights into the fascinating history of video streaming, starting from its humble beginnings before YouTube to the groundbreaking technologies that now dominate platforms like Netflix and ORF ON. Timmerer also presents provocative contributions of his own that have significantly influenced the industry. He concludes by looking at future challenges and invites the audience to join in a discussion.
Enchancing adoption of Open Source Libraries. A case study on Albumentations.AIVladimir Iglovikov, Ph.D.
Presented by Vladimir Iglovikov:
- https://www.linkedin.com/in/iglovikov/
- https://x.com/viglovikov
- https://www.instagram.com/ternaus/
This presentation delves into the journey of Albumentations.ai, a highly successful open-source library for data augmentation.
Created out of a necessity for superior performance in Kaggle competitions, Albumentations has grown to become a widely used tool among data scientists and machine learning practitioners.
This case study covers various aspects, including:
People: The contributors and community that have supported Albumentations.
Metrics: The success indicators such as downloads, daily active users, GitHub stars, and financial contributions.
Challenges: The hurdles in monetizing open-source projects and measuring user engagement.
Development Practices: Best practices for creating, maintaining, and scaling open-source libraries, including code hygiene, CI/CD, and fast iteration.
Community Building: Strategies for making adoption easy, iterating quickly, and fostering a vibrant, engaged community.
Marketing: Both online and offline marketing tactics, focusing on real, impactful interactions and collaborations.
Mental Health: Maintaining balance and not feeling pressured by user demands.
Key insights include the importance of automation, making the adoption process seamless, and leveraging offline interactions for marketing. The presentation also emphasizes the need for continuous small improvements and building a friendly, inclusive community that contributes to the project's growth.
Vladimir Iglovikov brings his extensive experience as a Kaggle Grandmaster, ex-Staff ML Engineer at Lyft, sharing valuable lessons and practical advice for anyone looking to enhance the adoption of their open-source projects.
Explore more about Albumentations and join the community at:
GitHub: https://github.com/albumentations-team/albumentations
Website: https://albumentations.ai/
LinkedIn: https://www.linkedin.com/company/100504475
Twitter: https://x.com/albumentations
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...Neo4j
Leonard Jayamohan, Partner & Generative AI Lead, Deloitte
This keynote will reveal how Deloitte leverages Neo4j’s graph power for groundbreaking digital twin solutions, achieving a staggering 100x performance boost. Discover the essential role knowledge graphs play in successful generative AI implementations. Plus, get an exclusive look at an innovative Neo4j + Generative AI solution Deloitte is developing in-house.
In the rapidly evolving landscape of technologies, XML continues to play a vital role in structuring, storing, and transporting data across diverse systems. The recent advancements in artificial intelligence (AI) present new methodologies for enhancing XML development workflows, introducing efficiency, automation, and intelligent capabilities. This presentation will outline the scope and perspective of utilizing AI in XML development. The potential benefits and the possible pitfalls will be highlighted, providing a balanced view of the subject.
We will explore the capabilities of AI in understanding XML markup languages and autonomously creating structured XML content. Additionally, we will examine the capacity of AI to enrich plain text with appropriate XML markup. Practical examples and methodological guidelines will be provided to elucidate how AI can be effectively prompted to interpret and generate accurate XML markup.
Further emphasis will be placed on the role of AI in developing XSLT, or schemas such as XSD and Schematron. We will address the techniques and strategies adopted to create prompts for generating code, explaining code, or refactoring the code, and the results achieved.
The discussion will extend to how AI can be used to transform XML content. In particular, the focus will be on the use of AI XPath extension functions in XSLT, Schematron, Schematron Quick Fixes, or for XML content refactoring.
The presentation aims to deliver a comprehensive overview of AI usage in XML development, providing attendees with the necessary knowledge to make informed decisions. Whether you’re at the early stages of adopting AI or considering integrating it in advanced XML development, this presentation will cover all levels of expertise.
By highlighting the potential advantages and challenges of integrating AI with XML development tools and languages, the presentation seeks to inspire thoughtful conversation around the future of XML development. We’ll not only delve into the technical aspects of AI-powered XML development but also discuss practical implications and possible future directions.
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfPaige Cruz
Monitoring and observability aren’t traditionally found in software curriculums and many of us cobble this knowledge together from whatever vendor or ecosystem we were first introduced to and whatever is a part of your current company’s observability stack.
While the dev and ops silo continues to crumble….many organizations still relegate monitoring & observability as the purview of ops, infra and SRE teams. This is a mistake - achieving a highly observable system requires collaboration up and down the stack.
I, a former op, would like to extend an invitation to all application developers to join the observability party will share these foundational concepts to build on:
How to Get CNIC Information System with Paksim Ga.pptxdanishmna97
Pakdata Cf is a groundbreaking system designed to streamline and facilitate access to CNIC information. This innovative platform leverages advanced technology to provide users with efficient and secure access to their CNIC details.
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdfMalak Abu Hammad
Discover how MongoDB Atlas and vector search technology can revolutionize your application's search capabilities. This comprehensive presentation covers:
* What is Vector Search?
* Importance and benefits of vector search
* Practical use cases across various industries
* Step-by-step implementation guide
* Live demos with code snippets
* Enhancing LLM capabilities with vector search
* Best practices and optimization strategies
Perfect for developers, AI enthusiasts, and tech leaders. Learn how to leverage MongoDB Atlas to deliver highly relevant, context-aware search results, transforming your data retrieval process. Stay ahead in tech innovation and maximize the potential of your applications.
#MongoDB #VectorSearch #AI #SemanticSearch #TechInnovation #DataScience #LLM #MachineLearning #SearchTechnology
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slackshyamraj55
Discover the seamless integration of RPA (Robotic Process Automation), COMPOSER, and APM with AWS IDP enhanced with Slack notifications. Explore how these technologies converge to streamline workflows, optimize performance, and ensure secure access, all while leveraging the power of AWS IDP and real-time communication via Slack notifications.
A tale of scale & speed: How the US Navy is enabling software delivery from l...sonjaschweigert1
Rapid and secure feature delivery is a goal across every application team and every branch of the DoD. The Navy’s DevSecOps platform, Party Barge, has achieved:
- Reduction in onboarding time from 5 weeks to 1 day
- Improved developer experience and productivity through actionable findings and reduction of false positives
- Maintenance of superior security standards and inherent policy enforcement with Authorization to Operate (ATO)
Development teams can ship efficiently and ensure applications are cyber ready for Navy Authorizing Officials (AOs). In this webinar, Sigma Defense and Anchore will give attendees a look behind the scenes and demo secure pipeline automation and security artifacts that speed up application ATO and time to production.
We will cover:
- How to remove silos in DevSecOps
- How to build efficient development pipeline roles and component templates
- How to deliver security artifacts that matter for ATO’s (SBOMs, vulnerability reports, and policy evidence)
- How to streamline operations with automated policy checks on container images
Introducing Milvus Lite: Easy-to-Install, Easy-to-Use vector database for you...Zilliz
Join us to introduce Milvus Lite, a vector database that can run on notebooks and laptops, share the same API with Milvus, and integrate with every popular GenAI framework. This webinar is perfect for developers seeking easy-to-use, well-integrated vector databases for their GenAI apps.
Full-RAG: A modern architecture for hyper-personalizationZilliz
Mike Del Balso, CEO & Co-Founder at Tecton, presents "Full RAG," a novel approach to AI recommendation systems, aiming to push beyond the limitations of traditional models through a deep integration of contextual insights and real-time data, leveraging the Retrieval-Augmented Generation architecture. This talk will outline Full RAG's potential to significantly enhance personalization, address engineering challenges such as data management and model training, and introduce data enrichment with reranking as a key solution. Attendees will gain crucial insights into the importance of hyperpersonalization in AI, the capabilities of Full RAG for advanced personalization, and strategies for managing complex data integrations for deploying cutting-edge AI solutions.
For the full video of this presentation, please visit: https://www.edge-ai-vision.com/2024/06/building-and-scaling-ai-applications-with-the-nx-ai-manager-a-presentation-from-network-optix/
Robin van Emden, Senior Director of Data Science at Network Optix, presents the “Building and Scaling AI Applications with the Nx AI Manager,” tutorial at the May 2024 Embedded Vision Summit.
In this presentation, van Emden covers the basics of scaling edge AI solutions using the Nx tool kit. He emphasizes the process of developing AI models and deploying them globally. He also showcases the conversion of AI models and the creation of effective edge AI pipelines, with a focus on pre-processing, model conversion, selecting the appropriate inference engine for the target hardware and post-processing.
van Emden shows how Nx can simplify the developer’s life and facilitate a rapid transition from concept to production-ready applications.He provides valuable insights into developing scalable and efficient edge AI solutions, with a strong focus on practical implementation.
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
Smart Snap - Report
1. Smart Glass
Submitted in partial fulfillment of the requirements
of the degree of
Bachelor of Engineering
by
Jay Shah, 60003100048
Pooja Shah, 60003100043
Tapan Desai, 60003100012
Supervisors:
Prof. Vinaya Sawant
Prof. Anuja Nagare
Information Technology
Dwarkadas J. Sanghvi College of Engineering, University Of Mumbai
2013-2014
2. Project Report Approval for B. E.
This project report entitled “Smart Glass” by Jay Shah, Pooja Shah and
Tapan Desai is approved for the degree of Information Technology.
Internal Guide
(Prof. Vinaya Sawant)
Internal Examiner External Examiner
Vice Principal (Acad) and HOD, IT Dept. Principal
(Dr. A. R. Joshi) (Dr. Hari Vasudevan)
3. Declaration
We declare that this written submission represents our ideas in our own words and
where others' ideas or words have been included, we have adequately cited and
referenced the original sources. We also declare that we have adhered to all principles
of academic honesty and integrity and have not misrepresented or fabricated or
falsified any idea/data/fact/source in our submission. We understand that any violation
of the above will be cause for disciplinary action by the Institute and can also evoke
penal action from the sources which have thus not been properly cited or from whom
proper permission has not been taken when needed.
Jay Shah, 60003100048
----------------------------------------- -----------------------------------------
Pooja Shah, 60003100043
----------------------------------------- -----------------------------------------
Tapan Desai, 60003100012
----------------------------------------- -----------------------------------------
(Name of student and Roll No) (Signature)
Date: 28/04/2014
4. ACKNOWLEDGEMENTS
We are highly indebted to Dwarkadas J. Sanghvi College of Engineering for their
guidance and constant supervision as well as for providing necessary information
regarding the project & also for their support in completing the project.
We would like to express our heartfelt gratitude towards our guide Prof. Vinaya Sawant
and our co-guide Prof. Anuja Nagare for their kind co-operation and encouragement
which help us in completion of this Synopsis.
We would like to express our special gratitude and thanks to faculty of Information
Technology Department for giving us such attention and time. Our thanks and
appreciations also go to our colleagues in developing the project and people who have
willingly helped us out with their abilities.
Jay Shah
Pooja Shah
Tapan Desai
5. Table of Contents
1. Introduction……………………………………………………………………………………1
2. Literature Review……………………………………………………………………………..2
3. Problem Definition…………………………………………………………………………….5
4. Proposed System………………………………………………………………………………7
5. Project Management………………………………………………………………………….11
5.1 Schedule……………………………………………………………………………………...11
5.2 Project Resources…………………………………………………………………………….12
5.3 Project Estimates...…………………………………………………………………………...13
5.4 Risk Mitigation Strategy……………………………………………………………………..15
6. Project Design………………………………………………………………………………..18
6.1 System Architecture. ………………………………………………………………………...18
6.2 Module/Component Description……………………………………………………………..20
6.3 User Interface Design………………………………………………………………………...27
7. Implementation……………………………………………………………………………….30
7.1 Modules/ Component Description……………………………………………………………30
7.2 Module-wise Algorithm……………………………………………………………………...32
8. Experiments and Project Testing……………………………………………………………..34
8.1 Test plan……………………………………………………………………………………...34
8.2 Test cases……………………………………………………………………………………..35
8.3 Methods used…………………………………………………………………………………36
8.4 Test Results…………………………………………………………………………………..37
9. Maintenance………………………………………………………………………………….38
9.1 User Manual………………………………………………………………………………….38
9.2 Constraints……………………………………………………………………………………38
10. Conclusion and Future Scope………………………………………………………………...40
11. References and Bibliography………………………………………………………………...41
6. List of Figures
Figure 2.1: Screenshot for existing system..........................................................................................................3
Figure 3.1: Use your mobile ................................................................................................................................6
Figure 3.2: Snap a picture…………………………………………………………………….............................6
Figure 3.3: Get related content.............................................................................................................................6
Figure 4.1: Architecture of Proposed System......................................................................................................9
Figure 5.1: Gnatt Chart for Project Schedule.....................................................................................................12
Figure 6.1: Actual System Architecture based on modules ...............................................................................20
Figure 6.2: Use Case Diagram...........................................................................................................................21
Figure 6.3: Activity Diagram.............................................................................................................................22
Figure 6.4: State transition to capture image .....................................................................................................23
Figure 6.5: State transition for text conversion..................................................................................................23
Figure 6.6: Deployment Diagram ......................................................................................................................24
Figure 6.7: Data Flow Diagram .........................................................................................................................26
Figure 6.8: Splash Screen ..................................................................................................................................26
Figure 6.9: Smart Snap of Play Store.................................................................................................................27
Figure 6.10: Main Menu ....................................................................................................................................27
Figure 6.11: Slide Menu ....................................................................................................................................27
List of Tables
Table 1: COCOMO Model………………………………………………………………...................................................14
Table 2: Risk Mitigation Strategy………………………………………………………………........................................15
Table 3: Risk Sheet………………………………………………………………..............................................................16
Table 4: Test Case………………………………………………………………...............................................................36
7. Project Report: Smart Glass
Chapter 1
1. Introduction
Mobile photo + Image Recognition = Identification
A lot of existing systems exist which provide text based results for news. There are E-commerce portals
to buy products based on text search. The thing common with both the systems is a text based search.
Our project aims on eliminating text based search and give results based on visual search.
Our project add context to images. Our image recognition products help in connecting images to
relevant information. The user snaps a picture; the application tells the user what's in it.
Now that the communication has moved from text to images, technology needs to improve. The project
is based on the latest technology of visual search. The application is an android application in which it
identifies objects from images. Once objects are identified, it is able to provide additional information
about them, e.g. related web-pages, description, present statistics etc.
The application allows the user to click an image and search for it over the web using our web engine
or just execute the actions specified by them on their own spreadsheet. As we see the application is
totally customizable as per user requirements. The main features of the application includes Document
Capture, News aggregator, Real-Time Information and Visual Commerce.
8. Project Report: Smart Glass
2
Chapter 2
2. Literature Review
This section consists of the research conducted by the team. It includes a cursory preface of the existing
system and its drawbacks. It also gives an idea about all potential frameworks that could have been
used in building this application.
2.1 Existing Systems
Google Goggles:
Scan barcodes using Goggles to get product information
Scan QR codes using Goggles to extract information
Recognize famous landmarks
Translate by taking a picture of foreign language text
Add Contacts by scanning business cards or QR codes
Scan text using Optical Character Recognition (OCR)
Recognize paintings, books, DVDs, CDs, and just about any 2D image
Solve Sudoku puzzles
Find similar products
9. Project Report: Smart Glass
3
Smart Glance:
Support for secured cloud hosted service & on-premises Installation
Supports multiple mobile platforms
View Data in Rich Graphical formats
Analyse by Charts – trend & Column, Pie & Donut
Zoom In/Out on charts for better view
Tools to compare two elements instantly
Tools to compare against target/benchmark
Support for offline access
Localization in multiple languages – Spanish, German, Italian, French, Chinese &
Japanese.
Figure 1: Screenshots of existing system
2.2 Pitfalls
Google Goggles:
Redirects to the Google search engine while scanning an image instead of giving the
results on the website.
No visual commerce present. Scans the image and provides the web link for the product.
Users cannot customize the search based on their requirements. Gives all the
information, instead of providing a link which the user actually needs.
Smart Glance:
10. Project Report: Smart Glass
4
Provides information for all the machines instead of narrowing the search down to the
machine required.
Doesn’t include any other features.
11. Project Report: Smart Glass
5
Chapter 3
3. Problem Definition
Problem Definition
Smart Glass is an android application which takes in images using mobile camera. The application uses
optical character recognition and image processing to scan the images and gives results based on the search.
The application has a number of features Visual Commerce, Visual Search, Real time object statistics,
language translator, user specified spreadsheets, form processing and document capture.
The application will run on all android operating systems. The user can get any information based on the
image they’ve searched from a dedicated database.
Document Capture:
This features converts a printed document to editable text. Scanned documents can be exported to
various formats. Prevents wastage of time and resources as the user doesn’t have to transfer and type
content from images.
Visual Commerce:
12. Project Report: Smart Glass
6
User can purchase products directly from the application from EBay and Amazon. The user just have
to click the picture of the product they want to purchase and the application will give them results.
News Aggregator:
The application will give results based on the images the user clicks. The application will retrieve
information from the dedicated databases which will get populated automatically with the latest content
every day.
Real-Time Information:
This feature will give real time results for the captured products. This information is retrieved from a user
defined spread-sheet or database. This system can be used to retrieve information from ERP systems as
well. This feature gives users complete customization over the results.
The basic idea of the application is:
Figure 2: Use your mobile Figure 3: Snap a picture Figure 4: Get related content
The user clicks a photo from his/her smart phone. The image is scanned using an OCR. The results from
the OCR are passed as a query to our database. This database is populated everyday using crawlers. The
data matched in the database is passed back to the mobile device and displayed to the user.
13. Project Report: Smart Glass
7
Chapter 4
4. Proposed System
4.1 Draft of Proposed System
Project plan is basically a proposed approach to creating the application.
The basic concept of the application is that the user won’t have to type everything they want to
search. The user can just click the image of the object they want to search and the application will
give results directly.
This software can provide information such as machine specification, maintenance statistics,
production capacity, effectiveness etc. by just placing the camera on the machine. It can also help
detect real time machine speed or the temperature of the furnace just by connecting it to the
factory’s ERP system.
The application can be used by anyone. The interface is user-friendly and the user will get all the
results directly in the application. The user won’t be redirected to any of the web browser. Just by
placing the camera on any brand name will retrieve all the information about the brand along with
its specialization, web pages and various branches.
Major technologies to be used:
14. Project Report: Smart Glass
8
• OCR Engine
• MATLAB
• Php
• MySQL
• Eclipse
Today more than texts images get uploaded on Internet. This application makes use of images and
retrieves news and important data needed as per the user’s requirement. It helps user to manage his
own spreadsheet through this app. User can specify the action that he /she wants to perform on the
click of that particular object. After capturing a particular object user can retrieve latest news on
that object. User can also change the file format using this application. Users can even buy products
using this online.
4.2 Expected Modules:
The basic project plan is dividing the application into four modules:
Document Capture: This module will allow user to capture documents and edit them. The user
would be allowed to save them in a PDF format. This is done using an OCR engine. The text which
is read in the document is then sent to the server. The server will then send back an edited form of
the document.
Visual Search: This module will again be created using an OCR engine. This module will allow
user to click photos of nearby texts and retrieve results based on the text from the database. The
project plan includes creating our own database which will be populated using crawlers.
Visual Commerce: The basic idea of this module is to allow user to shop using our application.
The application will be connected to various API’s from top E-commerce portals. Whenever the
user clicks the photo of a nearby object they want to buy, the application will display products from
e-commerce portals.
Real-Time information: This module will include giving real-time information from a connected
spreadsheet. This module will make the application user customisable.
15. Project Report: Smart Glass
9
4.3 Architecture for Proposed System:
Figure 5: Architecture of Proposed System
The proposed architecture is based on a mobile device which is the central unit of the
application. The mobile device would include the OCR engine and the required Camera
Activity. The application then connects to the services of individual modules.
Visual Search: For this module the application is connected to the search engine which is
integrated with the application database.
Document Capture: This uses the Document Translator API.
Real-Time Information: The application is connected to the web services which are then
connected to the Spreadsheet and the ERP system.
4.4 Advantages of Proposed System
The application is an all-in-one system. Each module has its own advantage for the user.
They are as follows:
Document Capture: The user doesn’t have to type or re-type an entire document. The user has just
to click an image of a printed document and get results based on the text recognized. The main
advantage will be that the user will be saving a lot of time. Also the application converts the new
document into PDF format. This document could be shared using all major social networking
platforms.
16. Project Report: Smart Glass
10
Visual Search: The user doesn’t have to login to the web browser and search for the news. The
main advantage of this module is that it allows user to retrieve news within seconds. All the news
is aggregated and stored in a database. Again saves time for the user.
Visual Commerce: This module will allow user to shop online without searching for an item they
like over all the e-commerce portals. The top e-commerce portals are aggregated in this small
application. Thus, it acts as an one stop shopping solution. The main feature will be visual search
in e-commerce instead of traditional text-based search. The user is allowed to just click a photo of
the object they want to acquire and get the product price and availability from e-commerce portals.
Real-Time Information: This module will allow user to customise the application as per their
requirement. The application can be linked to user created spreadsheets or an ERP system. The
major advantage is that the user will get real-time information from the application.
17. Project Report: Smart Glass
11
Chapter 5
5. Project Management
Project management is the discipline of planning, organizing, securing and managing resources to
achieve specific goals.
Schedule
Schedule helped us to know project's milestones, activities, and deliverables, with start and
finish dates.
Project scope is defined and the appropriate methods for completing the project are determined.
The durations for the various tasks necessary to complete the work are listed.
Gnatt chart for our project is as follows:
18. Project Report: Smart Glass
12
Figure 6: Gnatt Chart for Project Schedule
Project Resources
The project will require a limited amount of resources.
Hardware:
A mobile device running on Android Platform. The application should have a running camera.
A computer which could serve as a server for Database. The server needs to run 24x7.
Software:
Android SDK for development along with an Android IDE.
Google API
API’s from E-Commerce portals to link them to the application.
Google Spread sheets.
Other Requirements:
The mobile network should have an active internet service.
The mobile device should support Google Play Store.
19. Project Report: Smart Glass
13
Project Estimates
Estimation is basically identifying and acquiring necessary resources such as equipment’s,
materials, man-power etc. required for accomplishing the project successfully. Estimation
techniques used for our project are as follows:
Lines of Code
Lines of code (LOC) is a software metric used to measure the size of a computer program by
counting the number of lines in the text of the program's source code. It is typically used to
predict the amount of effort that will be required to develop a program, as well as to estimate
programming productivity or maintainability once the software is produced.
For our project, the estimated lines of code = 6.5 K
The above mentioned Lines of Code (LOC) include the following:
• Authentication to access the tool
• Code for adding/deleting questions and updating keywords of the questions
• Logic for highlighting the important sentences in the document
• Frequency calculation of the keywords in the document for report preparation
• Graph generation code to display the overall performance of the class
COCOMO Estimation Model
The Constructive Cost Model (COCOMO) is an algorithmic software cost estimation model
that computes software development effort and cost as a function of program size. Program
size is expressed in estimated thousands of source lines of code (SLOC). Basic COCOMO is
good for quick estimate of software cost.
COCOMO applies to three classes of software projects:
Organic projects: "small" teams with "good" experience working with "less than rigid"
requirements
20. Project Report: Smart Glass
14
Semi-detached projects: "medium" teams with mixed experience working with a mix of rigid
and less than rigid requirements
Embedded projects: developed within a set of "tight" constraints. It is also combination of
organic and semi-detached projects.
The basic COCOMO equations take the form:
Effort applied=a*(KLOC)b
[man-months]
Development time=c*(effort applied)d
[months]
People required = Effort applied [ count]
Development time
where, KLOC is the estimated number of delivered lines (expressed in thousands ) of code for project.
The co-efficient a, b, c, d is given in the following table:
Software Project A B C D
Organic 2.4 1.05 2.5 0.38
Semi-detached 3.0 1.12 2.5 0.35
Embedded 3.6 1.20 2.5 0.32
Table 1: COCOMO Model
Estimates of Effort, Cost, Duration:
E = a*(KLOC)b
= 3.0*(6.5)1.12
= 24.411
D = c*(E)d
= 2.5 *(24.411)0.35
= 7.65
21. Project Report: Smart Glass
15
P = E/D
= 24.411
7.65
= 3
Risk Mitigation Strategy
RISK CATEGORY PROBABILITY IMPACT
Insufficient Accuracy PS High 1
Inadequate knowledge about
application
BU Low 3
Table 2: Risk Mitigation Strategy
Impact Values :
1-catastropic 2-critical
3-marginal 4-negligible
A project team begins by listing all risks in first column of table. Each risk is
categorized in the second column (PS-Project Risk, DE-Development Risk, BU-Business Risk,
and TE- Technical Risk). The probability of occurrence of each risk is entered in the next
column of the table.
RMMM Plan for each risk:
Risk Information Sheet
Project Name : Smart Glass
Risk Id:- 001 Date :- 4/8/2013 Probability :- High Impact :- catastrophic
Origin :- Jay Shah Assigned To :- Tapan Desai
Description :-
The chances of loss due to this could be 65%. Lack of accuracy will lead into incorrect results
given by blurred or tilted image and it may lead to crashing of the application.
Mitigation/Monitoring :-
22. Project Report: Smart Glass
16
The user can ensure accuracy by using the proper or specified megapixel of the camera in his
android smartphone. He should try to click image where there is sufficient brightness.
Management :-
Once risk becomes active then we will provide the user with the facility to edit the text if he or
she feels that the converted image is incorrect.
Status :- Still left to implement.
Approval :- Vinaya Sawant Closing Date :- 17-10-2013
Table 3: Risk Sheet
23. Project Report: Smart Glass
17
Risk Information Sheet
Project Name : Smart Glass
Risk Id:- 002 Date :- 15/8/2013 Probability :- Low Impact :- serious
Origin :- Tapan Desai Assigned To :- Pooja Shah
Description :-
The user has 5% chances of loss. If the user does not have prior knowledge about the working
about the application then he or she may switch to some other application if they find any.
Mitigation/Monitoring :-
The GUI(Graphical user interface) of the application would be simple enough for the user to
understand the basic operations that are to be carried out for successful working of the
application.
Management :-
Once risk becomes active then we will provide the user with the Help menu where he would
find all the steps that are to be carried out for using the application efficiently.
Status :- Still left to implement.
Approval :- Vinaya Sawant Closing Date :- 17-10-2013
Table 4: Risk Sheet 2
24. Project Report: Smart Glass
18
Chapter 6
6. Project Design
System Architecture
The main aim of the system is to retrieve data from the captured image. The data is then passed
on to the servers. The servers then check for relevant information of the searched module. This
information is then passed on to the mobile device which the user can read.
The architecture is based on an OCR engine and Camera Activity running on a mobile device.
The architecture is divided into various units for each type of modules
a. Visual Search: Visual search module is based on Database Technology, Crawlers and
OCR engines. The user has to click an image of the text he/she wants to search. This text
is then read by the OCR engine. The text is then passed as an query to the server. The server
consists of MySQL database which daily populated by two crawlers. The database is then
searched for the query. Once the results are found they are sent back to the mobile device.
The response is quick and the results are displayed within a span of 3 seconds. The main
feature of Visual Search is not using any readymade search engines but aggregating news
in our own database and displaying results for the user.
b. Document Capture: The text in this module is read by the OCR Engine. The recognized
text is then passed to the server. There is another OCR present in the server to improve the
text accuracy. The retrieved text is then passed back to the user mobile device in an editable
25. Project Report: Smart Glass
19
format. After the text is edited the document is converted into PDF Format using Document
Translator. The saved document can also be shared on all the social networking platforms.
c. Real Time Information: This feature makes the application customisable. The application
can be linked to user defined spread sheets or on site ERP System. This module again uses
the OCR engine and read the text from the image. The text is then matched from the linked
spreadsheet or ERP system and the results are displayed to the user. The spreadsheet can
contain any data which can be filled by the user. The application sends a query to the
spreadsheet and retrieves information from the same. Real time information works well for
large factories with heavy machineries as the information can be retrieved quickly from the
machines.
d. Visual Commerce [6]: Visual Commerce is an unique blend of Visual Search and E-
Commerce. This module is implemented using the Voila Jhones algorithm. In this module
the user doesn’t need to click a photo of the required object. Instead they have to just hover
their camera over the object. The object is recognized from a data set using the algorithm.
Once the object is recognized it is searched in the top e-commerce portals using their API’s.
These API’s are linked with the application at first to gain easy access over the portals.
Once the object is recognized in the portals they are displayed in an aggregated form to the
user with their price and availability mentioned. The data set can be expanded over time
using various machine learning algorithms.
e. Database: The database is MySQL database being populated via crawlers. The database is
created as an aggregation for news articles. There are two crawlers running daily on the
database. The first crawler runs through RSS links and stores the URL in the database. The
other crawler then runs through those links and stores all the news articles. The search is
based on a few parameters which includes relevancy, precision, date of the article and
priority.
f. OCR Engine: The OCR Engine using is Teserract [2]
. Tesseract is an optical character
recognition engine for various operating systems. It is free software, released under the
Apache License, Version 2.0, and development has been sponsored by Google since 2006.
Tesseract is considered one of the most accurate open source OCR engines currently
available. If Tesseract is used to process right-to-left text such Arabic or Hebrew the results
are ordered as though it is left-to-right text.
Tesseract is suitable for use as a backend engine.
Module wise System Architecture:
26. Project Report: Smart Glass
20
Figure 7: Actual System Architecture based on modules
Module/Component Description
UML is de facto standard notation for software design. It can be used for drawing diagrams and
also to generate codes, apply design patterns, mine requirements and perform impact analysis.
UML is flexible and UML models are portable. UML is well known visual language that can
capture much of the information that one needs to communicate about the system.
Use Case Diagram: A use case diagram at its simplest is a representation of a user's interaction
with the system and depicting the specifications of a use case. A use case diagram can portray the
different types of users of a system and the various ways that they interact with the system. This
type of diagram is typically used in conjunction with the textual use case and will often be
accompanied by other types of diagrams as well.
27. Project Report: Smart Glass
21
Figure 8: Use Case Diagram
Activity Diagram: Activity diagrams are graphical representations of workflows of stepwise
activities and actions with support for choice, iteration and concurrency. In the Unified Modelling
Language, activity diagrams are intended to model both computational and organisational
processes (i.e. workflows). Activity diagrams show the overall flow of control.
28. Project Report: Smart Glass
22
Figure 9: Activity Diagram
State Transition Diagram: A state diagram is a type of diagram used in computer science and
related fields to describe the behavior of systems. State diagrams require that the system described
is composed of a finite number of states; sometimes, this is indeed the case, while at other times
this is a reasonable abstraction. Many forms of state diagrams exist, which differ slightly and
have different semantics.
29. Project Report: Smart Glass
23
Figure 10: State transition to capture image
Figure 11: State transition for text conversion
Deployment Diagram: Deployment diagrams are used to visualize the topology of the physical
components of a system where the software components are deployed.
So deployment diagrams are used to describe the static deployment view of a system. Deployment
diagrams consist of nodes and their relationships
30. Project Report: Smart Glass
24
Figure 12: Deployment Diagram
Class Diagram: The class diagram is a static diagram. It represents the static view of an
application. Class diagram is not only used for visualizing, describing and documenting different
aspects of a system but also for constructing executable code of the software application.
The class diagram describes the attributes and operations of a class and also the constraints
imposed on the system. The class diagrams are widely used in the modelling of object oriented
systems because they are the only UML diagrams which can be mapped directly with object
oriented languages.
Figure 15: Class Diagram
Component Diagram: Component diagrams are different in terms of nature and behaviour.
Component diagrams are used to model physical aspects of a system.
31. Project Report: Smart Glass
25
Now the question is what are these physical aspects? Physical aspects are the elements like
executables, libraries, files, documents etc which resides in a node.
So component diagrams are used to visualize the organization and relationships among
components in a system. These diagrams are also used to make executable systems.
Figure 16: Component Diagram
Data Flow Diagram: A data flow diagram (DFD) is a graphical representation of the "flow" of
data through an information system, modelling its process aspects. Often they are a preliminary
step used to create an overview of the system which can later be elaborated. DFDs can also be
used for the visualization of data processing (structured design).
A DFD shows what kinds of information will be input to and output from the system, where the
data will come from and go to, and where the data will be stored. It does not show information
about the timing of processes, or information about whether processes will operate in sequence
or in parallel (which is shown on a flowchart).
33. Project Report: Smart Glass
27
User Interface Design
Figure 14: Splash Screen Figure 15: Smart Snap of Play Store
Figure 156: Main Menu Figure 167: Slide Menu
34. Project Report: Smart Glass
28
Figure 18: Working of Visual Search. The selection screen.
Fig. 19: Visual Search results Fig. 20: News articles based on Visual search
35. Project Report: Smart Glass
29
Fig. 21 Share option in the application Fig. 22 Loading Screen for Document Capture
Fig. 23 Real-Time Information Fig. 24 Results for Real-Time Information
36. Project Report: Smart Glass
30
Chapter 7
7. Implementation
Modules/ Component Description:
Mobile Device:
This Module contains three sub modules. It consists of all the features that are present on the
mobile phone of the user. It contains GUI and OCR. GUI contains one more module.
Mobile Device is the connection between users’s input to the backend.
GUI:
The GUI is the interface with which the user interacts. The GUI is based for an high end
Android Device. It also contains a slide menu. Visual Search consists of UI which connects to
the database. Document Capture is designed in a way to connect to the server. It also includes
a load screen. Real Time information connects with the user defined spreadsheets.
OCR:
This module contains the recognition of the data from the image. This module helps in
retrieving data from the images and sends back to display it to the user. This module works on
37. Project Report: Smart Glass
31
the mobile device of the user. There is also an OCR running on the server for Document
Capture Module. The OCR Engine used in the project is Teserract.
Capture Image:
This module helps the user to capture images through the mobile device. The images captured
through this module are used for further processing. This module provides the input to the
whole system. The OCR module works on the captured image.
Image Processing [1]:
This module helps in processing and retrieval of the images from the image that is captured
from the user. The images extracted from this module helps in retrieval of the information as
per the user’s request. The application uses Voila Johnes Algorithm to process image from
capture image in Visual Commerce.
Visual Search:
In Visual Search the images extracted or captured by the user are used for buying the products
in that image from the sites like Amazon and eBay. In this module the information about those
products from the shopping sites from where the user can buy those products.
Search Engine:
This module is a search engine from where the data or the information is extracted based on
the user’s request. It is connected to a database which consists of images and information and
news about products. The database is created on MySql which is populated by two crawlers [5]
.
It consists of all the files and details about the products, websites and RSS feeds regarding
various fields and retrieve data according to the user’s request.
Web Service:
Helps to establish a link between the data source and the android application. Based on the php
framework, it fires query to the MySQL server and retrieves query results which are then
encoded in the JSON format and passed to the application.
38. Project Report: Smart Glass
32
Spreadsheets:
Instead of hard coding the actions to specific keywords, the user is provided with an ability to
specify his own actions to specific keywords through configuring his spreadsheet. The user has
to specify his web publish url provided by the Google Spreadsheet to configure it. The
spreadsheet has columns like products, action and a url belonging to that particular action.
Document Translator
This provides a convenient way to process an image and segregate it into text and images
separately. It saves the time and energy of retyping and also provides a convenient way to
change the fonts and resizing the images.
Module-wise Algorithm:
Visual Search
1. Scan the word content
2. Extract the text content from the image
3. Spell check the retreived content
4. Text sent to the server
5. Server Side Programming
a. Split the sentence into words
b. Remove stop words
c. Search for relevant news articles based on scanned words
d. JSON encode the retreived articles and pass it to the device
6. Decode the JSON response on the device
7. Display results.
Visual Commerce
1. Scan the word content
2. Extract the text content from the image
3. Spell check the retreived content
4. Text sent to the server
5. Server Side Programming
39. Project Report: Smart Glass
33
a. Split the sentence into words
b. Remove stop words
c. Aggregate deals from leading ecommerce portals
d. Standardizing the results into a uniform format.
e. JSON encode the result and pass it to the device
6. Decode the JSON response on the device
7. Display results
Real Time Information
1. Scan the word content
2. Extract the text content from the image
3. Spell check the retrieved content
4. Text sent to the server
5. Server Side Programming
a. Split the sentence into words
b. Remove stop words
c. Connect to the ERP / Excel / Spreadsheet from which the content is to
be monitored
d. Filter out the rows matching with the searched content
e. Standardizing the results into a uniform format as per the required
information
f. JSON encode the result and pass it to the device
6. Decode the JSON response on the device
7. Display results
40. Project Report: Smart Glass
34
Chapter 8
8. Experiments and Project Testing
Test plan
The objective of the plan is to break the product down into distinct areas and identify features
of the Smart Snap Application that are to be tested. The test plan approach that has been used
in our project includes the following:
1. Design verification or Compliance test:
These stages of testing have been performed during the development or approval stage of
the product, typically on a small sample of units.
2. Test Coverage
The design verification tests have been performed at the point of reaching every milestone.
Test areas include testing of various features such as line segmentation, line and edge
detection, Binary image conversion, symbol detection, palm colour detection, etc.
3. Test Methods
Testing of diverse features has been performed in “Smart Snap”. For each module,
corresponding outputs were checked. For testing each module, the output produced from
running the code was checked with the test data set.
41. Project Report: Smart Glass
35
4. Test Responsibility
The team members working on their respective features performed the testing of those
features. Test responsibilities also include, the data collected, and how that data was used
and reported.
8.2Test cases
A test case is a set of conditions or variables under which we will determine whether the Smart
Snap application is working correctly or not. We have used many test cases to determine that
the system is sufficiently scrutinized.
Test
Case
ID
Case Description Expected Result Actual Result Pass/Fail
1 User takes picture
of a document
The image should be
converted into editable
text and displayed.
The image is converted
into editable text and
displayed.
Pass
2 The user takes a
picture of a text
The OCR should read
the text and display the
related news articles to
the user.
The OCR reads the text
correctly and displays
it to the user.
Pass
3 The user clicks a
photo in portrait
mode instead of
landscape mode.
An error message is
displayed and the user
is prompted to click the
image back in
landscape mode.
The error message is
displayed prompting
user to click the photo
in landscape mode.
Pass
4 The user clicks the
share button
The application should
display all the
available platforms on
The application
displays all the
sharable platforms to
the user.
Pass
42. Project Report: Smart Glass
36
which the user can
share the results
5 The user clicks the
create PDF option.
A PDF should be
created for the user in
the Micro SD card.
A PDF is created for
the user in the memory
card.
Pass
Table 4: Test Case
8.3 Methods used
The methods used by us for testing are as given below:
1. Unit Testing
Unit testing is a method by which individual units of source code, sets of one or more
program modules together are tested to determine if they fit for use. In our application, we
considered each module as one unit and tested these units with help of test cases and test
plan developed. Unit testing was carried out on each module and on every function within
the module. Output of each unit was assessed for accuracy and if found incorrect,
appropriate corrections were made.
2. Integration Testing
Integration testing is a phase in software testing in which individual software modules are
combined and tested as a group. The purpose of integration testing is to detect any
inconsistencies between the software units that are integrated together. The modules of our
application were integrated together in order to verify that they provide the required
functionalities appropriately. The various modules were tested together to check for their
accuracy and compatibility.
3. System Testing
System testing of software or hardware is testing conducted on a complete, integrated
system to evaluate the system's compliance with its specified requirements. As a rule,
system testing takes, as its input, all of the "integrated" software components that have
successfully passed integration testing and also the software system itself integrated with
any applicable hardware system. We performed system testing after integration testing to
ensure proper functioning of the project as whole.
4. Acceptance Testing
43. Project Report: Smart Glass
37
Acceptance testing will be conducted to determine if the requirements and specifications
are met. It may involve performance tests. Acceptance testing performed by the customer
is known as user acceptance testing (UAT), end-user testing, site (acceptance) testing, or
field (acceptance) testing. Acceptance testing generally involves running a suite of tests on
the completed system. Each individual test, known as a case, exercises a particular
operating condition of the user's environment or feature of the system, and will result in a
pass or fail, or Boolean, outcome. Here the end user will use the Smart Snap Application
first in a tested environment and then in the environment of his/her own home.
5. Usability testing
Usability testing is a technique used to evaluate a product by testing it on users. Usability
testing focuses on measuring a human-made product's capacity to meet its intended
purpose. . Usability testing measures the usability, or ease of use, of a specific object or set
of objects. The results of this review will help improve the end-user interaction of the
software. The purpose of usability testing is to ensure that the Smart Snap Application will
function in a manner that is acceptable to the user.
6. Performance testing
Performance testing is testing that is performed, to determine how fast some aspect of
a system performs under a particular workload. It can also serve to validate and verify
other quality attributes of the system, such as scalability, reliability and resource usage. In
the Smart Snap Application, these tests ensure that the system provides acceptable response
times. It should not exceed 10 seconds once the user has finished loading the image and
has clicked on the ‘Output’ button.
8.4 Test Results
Preliminary test conducted showed promising results. The tests conducted included taking a
number of photographs at different angles to ensure readability of data. The information read
via the OCR engine were good. The data was even read at an angle of 30 degrees. However,
the data could not be read when the photo was blur. The angle for the OCR engine can be
improved.
Further tests were conducted for the retrieval of data from the servers. The data was retrieved
within 3 seconds for Visual Search which was astounding. The results before were quite bad
as the entire data was being loaded to the local database and then being searched. Algorithms
were improved and then the searched data only was being sent. This improved the speed
exponentially.
45. Project Report: Smart Glass
39
Chapter 9
9. Maintenance
9.1 User Manual
Prerequisites
The user must have an Android Phone
The user must have an active data plan or any sort of internet connectivity
How to use
The user must start the application.
Select one of the various modules which are Document Capture, Visual Search, Visual
Commerce and Real Time Information.
The user then must click a photo for which he wants the results
The results are displayed which can be shared on various social networking platforms
9.2 Constraints
The user must have internet connectivity.
46. Project Report: Smart Glass
40
Visual commerce does not work in portrait mode. The user must click the photo in
landscape mode only.
Document Capture works only in portrait mode. The image shouldn’t be clicked in
landscape mode.
While clicking the image the hand should be stable and the image clicked should not have
any disruptions. The environment should not be too dark.
47. Project Report: Smart Glass
41
Chapter 10
10. Conclusion and Future Scope
After working on the project for a span of 6 months we have proposed a new system of search
based on visual search instead of text based search. Text based search is a traditional outdated
search. A new way of search was required which was more precise and optimized.
Based on Visual Search, Image Recognition and Database Technology a more optimized way
of search and conversion tool was created which would help the user in saving a lot of time.
The features are closely integrated and help the user in converting image to text, buying new
products and getting the latest news.
The application has a number of future scopes. The precision of the application needs to be
improved for better results. The database can be optimized to give quicker search results.
Also the future scope includes adding a number of features. These features include adding
Sudoku Solver, Location based Navigation and Voice Based Search.
Overall we conclude, the application brings on new technology which is user-friendly and save
a lot of time for the user.
48. Project Report: Smart Glass
42
References and Bibliography
A
Android: It is an operating system for Mobile devices and tablets being developed by Google.
D
Document Capture: A feature that will allow users to convert images into editable texts. These texts
will be copied to the devices clipboard.
F
Forward-When the recipient of an email message sends it on to someone he or she thinks might find
it interesting or benefit from.
I
Image Recognition: Image recognitions is recognises relevant information from the image clicked
using various available technologies.
M
MATLAB: Developed by MathWorks, MATLAB allows matrix manipulations, plotting
of functions and data, implementation of algorithms, creation of user interfaces, and interfacing with
programs written in other languages, including C, C++, Java, and Fortran.
N
News Aggregator: News aggregator is a feature which pulls in all the relevant information about a
topic from the net using custom search.
O
Optical Character Recognition: Optical character recognition, usually abbreviated to OCR, is
the mechanical or electronic conversion of scanned images of handwritten, typewritten or printed
text into machine-encoded text.
P
Page Rank: It assigns a weight to the web page depending on the ranking of the users.
Q
QR Codes: QR code (abbreviated from Quick Response Code) is the trademark for a type of matrix
barcode
S
Spreadsheets: Spreadsheets are sheets available online by Google. The user can create tables and
files within a spredsheets and store custom information.
49. Project Report: Smart Glass
43
T
Test-An action taken to ensure an email will perform properly before it is sent. A test message is sent
to several “testing” accounts and allows marketers to identify problems such as broken links or
images and rectify them before sending the email to an entire list as well as a means of comparing
the results of different versions of an email.
V
Visual Commerce: A feature that allows users to purchase online not by typing but just by clicking
an image of the object they want to purchase.
Recommended sites, newsletters, blogs and books.
Websites
1. The website has all the details about image processing basics.
http://www.idi.ntnu.no/~blake/gbimpdet.htm
2. Google Codes-This is the place to find explanations related to Tesseract OCR.
http://code.google.com/p/tesseract-ocr/
3. http://seomojo.net/how_seo.htm
4. http://framework.zend.com/
5. http://visual.ly/google%E2%80%99s-hummingbird-algorithm-%E2%80%93-
what%E2%80%99s-it-all-about
6. http://www.mathworks.in/help/vision/ref/vision.cascadeobjectdetectorclass.html
Books
Document image analysis: A Primer
Rangachar Department of Computer Science & Engineering, The Pennsylvania State
University, University Park, PA 16802, USA
Kasturi LawrenceO’Gorman Avaya Labs, Room 1B04, 233 Mt. Airy Road, Basking Ridge, NJ
07920, USA
Venu Govindraju CEDAR, State University of New York at Buffalo, Amherst, NY 14228, USA