DESIGN, ANALYSIS AND IMPLEMENTATION OF AN INFORMATION RETRIEVAL SYSTEM (Enterprise Search) Using C++ and PHP
Maheshwaran Janarthanan, Narendran Hariparanthaman
Team i Explore
Introduction
Components of I Explore
Preprocessing Steps
Multilingual Indexes
Data Structures
Query Processor
Query Optimization and Multilingual Support
3 Tier Architectural View of I Explore
USER
Data presentation layer: PHP
Data processing layer: C++
Data storage layer: MySQL, flat files
Components shown in the diagram:
- Query
- Retrieved Results
- Corpus of Data
- Spell Checker and Auto Correction
- Indexer
- Redefined Barrels (Inverted Index)
- Query Processor
- MEGA INDEX BLOCK (Forward Index, File Meta Index, Positional Index, bi gram, N gram index, okapi++ indexes)
- Query Logs
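The diagram names an inverted index and a positional index among the storage structures. As a minimal sketch of what such a structure can look like in C++ (the type names `DocId` and `PositionalIndex` and the functions `addDocument` and `lookup` are hypothetical and not taken from I Explore; whitespace tokenisation is an assumption made only to keep the example short):

```cpp
#include <cstddef>
#include <map>
#include <sstream>
#include <string>
#include <vector>

// Hypothetical types for illustration; not the I Explore implementation.
using DocId = int;
// term -> (document id -> positions at which the term occurs in that document)
using PositionalIndex =
    std::map<std::string, std::map<DocId, std::vector<std::size_t>>>;

// Split the text on whitespace and record the position of every token.
void addDocument(PositionalIndex& index, DocId id, const std::string& text) {
    std::istringstream in(text);
    std::string token;
    std::size_t pos = 0;
    while (in >> token) {
        index[token][id].push_back(pos++);
    }
}

// Return the ids of all documents containing the term, in ascending order.
std::vector<DocId> lookup(const PositionalIndex& index, const std::string& term) {
    std::vector<DocId> docs;
    auto it = index.find(term);
    if (it != index.end()) {
        for (const auto& entry : it->second) {
            docs.push_back(entry.first);
        }
    }
    return docs;
}
```

Storing positions alongside document ids is what allows phrase and proximity queries, at the cost of a larger index than a plain document-level posting list.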
Why 3 Tier Architecture?
Okapi++
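For orientation, the standard Okapi BM25 ranking function, which the Okapi model in the later comparison refers to, can be sketched as follows. This is the textbook formula only and does not represent the Okapi++ modifications; the struct `TermStat`, the function name `bm25`, and the default parameters k1 = 1.2 and b = 0.75 are assumptions chosen for illustration.

```cpp
#include <cmath>
#include <vector>

// Statistics for one query term in one document (hypothetical struct).
struct TermStat {
    double tf;       // occurrences of the term in the document
    double docFreq;  // number of documents in the corpus containing the term
};

// Textbook Okapi BM25 score of one document for a query.
// numDocs: corpus size; docLen and avgDocLen: lengths in tokens.
double bm25(const std::vector<TermStat>& terms, double numDocs,
            double docLen, double avgDocLen,
            double k1 = 1.2, double b = 0.75) {
    // Length normalisation: longer-than-average documents are penalised.
    const double norm = k1 * (1.0 - b + b * docLen / avgDocLen);
    double score = 0.0;
    for (const TermStat& t : terms) {
        // Classic Robertson/Sparck Jones inverse document frequency.
        const double idf =
            std::log((numDocs - t.docFreq + 0.5) / (t.docFreq + 0.5));
        score += idf * (t.tf * (k1 + 1.0)) / (t.tf + norm);
    }
    return score;
}
```

With this form of the IDF, a term that appears in more than half of the documents receives a negative weight; some implementations add 1 inside the logarithm to avoid that.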
Algorithm
1. Input the signals from the Input units to MCU
2. Input signals from nearby weapon system to MCU
3. Compare the input signals with standard data in memory
4. The required signals are given to the Tran receiver of weapon matrix
5. Selection of weapons
6. Angling of weapons
7. Calculation of time
8. Input from the timer given to the weapon
Cognizance Algorithm for Query Expansion
Spelling correction
1. Input the signals from the Input units to MCU
2. Input signals from nearby weapon system to MCU
3. Compare the input signals with standard data in memory
4. The required signals are given to the Tran receiver of weapon matrix
5. Selection of weapons
6. Angling of weapons
7. Calculation of time
8. Input from the timer given to the weapon
Performance Analysis

Comparison of various models without Query Expansion techniques:

Model              MAP     R-Precision  Relevant docs returned  P@5   P@10
Boolean            0.0773  0.1477       13                      0.2   0.2
VSM                0.0587  0.0916       17                      0.12  0.12
Okapi              0.1028  0.1536       25                      0.22  0.18
Optimum (Okapi++)  0.1769  0.2215       36                      0.34  0.29

Execution time statistics:

Model    Metallica(sec) (for 10 queries)
Boolean  14
VSM      16
Okapi    15
Optimum  15
Performance Analysis

Okapi++ with Cognizance algorithm:

Query Expansion Technique        MAP     R-Precision  Relevant docs retrieved  P@5   P@10
Title + description              0.2078  0.2517       42                       0.4   0.33
Title + narration                0.2104  0.2555       47                       0.42  0.34
Title + narration + description  0.2021  0.25         43                       0.4   0.31

Disk space comparison:

Index                 Size (in bytes)
Positional Index      242,109,895
Variable block Index  139,266,798
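Taking the two index sizes at face value, the saving from the variable block index works out as follows (the symbols are introduced here only for the calculation):

```latex
\begin{align*}
S_{\text{pos}} - S_{\text{var}} &= 242{,}109{,}895 - 139{,}266{,}798 = 102{,}843{,}097 \text{ bytes} \\
\frac{S_{\text{pos}} - S_{\text{var}}}{S_{\text{pos}}} &= \frac{102{,}843{,}097}{242{,}109{,}895} \approx 0.425
\end{align*}
```

That is, the variable block index occupies roughly 57.5% of the space of the positional index, a reduction of about 42.5%.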
Okapi++ - An all rounder
Queries?
