SlideShare a Scribd company logo
Full-text & Relational search
VIJAY YADAV
072-BCT-547
2
SEARCH IS HARD
 Average no. of search per day
is over 3.5 billion on Google
alone.
 That’s one search for every two
people (including babies and
grandmothers, but excluding
zombies) in the world.
3
 That doesn’t even include the number of searches on Amazon,
LinkedIn and Facebook. We use search for everything.
 Oh, except the company data. We still use BI analysts, data
scientists, specialized tools, and SQL for that.
4
So what is full-text search ?
 It is simply a document-based
search mainly employed by
word processing applications
and various search engines.
 It often performs two tasks:
indexing and searching.
5
 The indexing stage will scan the text of all the documents and
build a list of search terms (often called as index). The indexer
will ignore stop words such as "the" and "and“. Also the word
drives, drove, driven will be recorded only as a single word
“drive”.
 In the search stage, when performing a specific query, only the
index is referenced, rather than the text of the original
documents.
6
Two ways of performance improvements
I. Improved query tools
II. Improved search algorithms
Improved search algorithms
 PageRank algorithm developed by google.
7
Improved query tools
 Keywords: Creators are asked to list the words that best
describe the text including synonyms.
 Phrase search: Will search only those documents that
contain certain phrase.
 Fuzzy search: Will search documents with even some
variations around the given term.
Some fuzzy search algorithms
Soundex
Metaphone
Double Metaphone
Soundex
In PostgreSQL, below two queries will result in same
output with soundex algorithm and hence even wrong
typed word can give right result.
1. SELECT soundex(‘elephant’);
————————————-
=> E415
2. SELECT soundex(‘elephents’);
————————————–
=> E415
Software performing full-text search
Problems with full-text search
• The results may not be 100% accurate.
• Large number of irrelevant search results due to
lack of relation among the words
Why Relational search ?
• Gives more accurate and relevant
result.
• Much useful for business analytics.
but…
Relational search is even harder because
1. Company’s data is complicated
• Search on LinkedIn probably means searching for a
person or a company.
• Search on Amazon probably means searching for a
product.
• But company’s data includes multiple databases, tables,
columns, rows with complicated relationships between
them.
2. Needs to be 100% accurate or you risk your business
What’s worse
than guessing?
Being
convinced by
bad data.
3. Needs to be faster
Relational search makes huge
difference in enterprise because it
takes deterministic input to give
deterministic output.
THANK YOU
18

More Related Content

What's hot

Jabnernako
JabnernakoJabnernako
Jabnernako
Gyram Mary Grace
 
Effective web search techniques
Effective web search techniquesEffective web search techniques
Effective web search techniques
aliciafe0215
 
Google searchpresentation2
Google searchpresentation2Google searchpresentation2
Google searchpresentation2
carolyn oldham
 
Googlesearchpresentation
GooglesearchpresentationGooglesearchpresentation
Googlesearchpresentation
carolyn oldham
 
Surfing the web
Surfing the webSurfing the web
Surfing the web
Natasha Langdown
 
Literature Searching Techniques by Nadeem Sohail
Literature Searching Techniques by Nadeem SohailLiterature Searching Techniques by Nadeem Sohail
Literature Searching Techniques by Nadeem Sohail
Nadeem Sohail
 
Internet Search Presentation
Internet Search PresentationInternet Search Presentation
Internet Search Presentation
Steve Guinan
 
Boolean Training
Boolean TrainingBoolean Training
Boolean Training
Somil Charan
 
Internet search techniques by zakir hossain
Internet search techniques by zakir hossainInternet search techniques by zakir hossain
Internet search techniques by zakir hossain
Zakir Hossain/ICS, Zurich
 
Advance searching techniques
Advance searching techniquesAdvance searching techniques
Advance searching techniques
Humayun Khan
 
Google searching techniques
Google searching techniquesGoogle searching techniques
Google searching techniques
abbas mohd
 
Week12keyconceptsgooglesearchtechniques
Week12keyconceptsgooglesearchtechniquesWeek12keyconceptsgooglesearchtechniques
Week12keyconceptsgooglesearchtechniques
carolyn oldham
 
Week13 key concepts_googlesearchtechniques
Week13 key concepts_googlesearchtechniquesWeek13 key concepts_googlesearchtechniques
Week13 key concepts_googlesearchtechniques
carolyn oldham
 
One page-tip-sheet-boolean-google-irina-shamaeva
One page-tip-sheet-boolean-google-irina-shamaevaOne page-tip-sheet-boolean-google-irina-shamaeva
One page-tip-sheet-boolean-google-irina-shamaeva
Irina Shamaeva
 
Smart Literature Searching by Susanne Noll
Smart Literature Searching by Susanne NollSmart Literature Searching by Susanne Noll
Smart Literature Searching by Susanne Noll
pvhead123
 
Searching techniques
Searching techniquesSearching techniques
Searching techniques
Jayatunga Amaraweera
 
Google search tips
Google search tipsGoogle search tips
Google search tips
Yashwant Shet Talaulikar
 
Literature searching techniques and free online resources for scholars by Nad...
Literature searching techniques and free online resources for scholars by Nad...Literature searching techniques and free online resources for scholars by Nad...
Literature searching techniques and free online resources for scholars by Nad...
Nadeem Sohail
 
Search engines
Search enginesSearch engines
Search engines
Sahiba Khurana
 
Search Engines
Search EnginesSearch Engines
Search Engines
Gajjala vivekananda
 

What's hot (20)

Jabnernako
JabnernakoJabnernako
Jabnernako
 
Effective web search techniques
Effective web search techniquesEffective web search techniques
Effective web search techniques
 
Google searchpresentation2
Google searchpresentation2Google searchpresentation2
Google searchpresentation2
 
Googlesearchpresentation
GooglesearchpresentationGooglesearchpresentation
Googlesearchpresentation
 
Surfing the web
Surfing the webSurfing the web
Surfing the web
 
Literature Searching Techniques by Nadeem Sohail
Literature Searching Techniques by Nadeem SohailLiterature Searching Techniques by Nadeem Sohail
Literature Searching Techniques by Nadeem Sohail
 
Internet Search Presentation
Internet Search PresentationInternet Search Presentation
Internet Search Presentation
 
Boolean Training
Boolean TrainingBoolean Training
Boolean Training
 
Internet search techniques by zakir hossain
Internet search techniques by zakir hossainInternet search techniques by zakir hossain
Internet search techniques by zakir hossain
 
Advance searching techniques
Advance searching techniquesAdvance searching techniques
Advance searching techniques
 
Google searching techniques
Google searching techniquesGoogle searching techniques
Google searching techniques
 
Week12keyconceptsgooglesearchtechniques
Week12keyconceptsgooglesearchtechniquesWeek12keyconceptsgooglesearchtechniques
Week12keyconceptsgooglesearchtechniques
 
Week13 key concepts_googlesearchtechniques
Week13 key concepts_googlesearchtechniquesWeek13 key concepts_googlesearchtechniques
Week13 key concepts_googlesearchtechniques
 
One page-tip-sheet-boolean-google-irina-shamaeva
One page-tip-sheet-boolean-google-irina-shamaevaOne page-tip-sheet-boolean-google-irina-shamaeva
One page-tip-sheet-boolean-google-irina-shamaeva
 
Smart Literature Searching by Susanne Noll
Smart Literature Searching by Susanne NollSmart Literature Searching by Susanne Noll
Smart Literature Searching by Susanne Noll
 
Searching techniques
Searching techniquesSearching techniques
Searching techniques
 
Google search tips
Google search tipsGoogle search tips
Google search tips
 
Literature searching techniques and free online resources for scholars by Nad...
Literature searching techniques and free online resources for scholars by Nad...Literature searching techniques and free online resources for scholars by Nad...
Literature searching techniques and free online resources for scholars by Nad...
 
Search engines
Search enginesSearch engines
Search engines
 
Search Engines
Search EnginesSearch Engines
Search Engines
 

Similar to Full text and relational search

Internet search techniques for K12
Internet search techniques for K12Internet search techniques for K12
Internet search techniques for K12
Zakir Hossain/ICS, Zurich
 
Google Advance Searching- Information System Audit & Control Report
Google Advance Searching- Information System Audit & Control ReportGoogle Advance Searching- Information System Audit & Control Report
Google Advance Searching- Information System Audit & Control Report
Shahzeb Pirzada
 
Ppt Lesson 03
Ppt Lesson 03Ppt Lesson 03
Ppt Lesson 03
almario1988
 
Cap 233 how search works
Cap 233 how search worksCap 233 how search works
Cap 233 how search works
James FitzGerald
 
Academic Skills 4
Academic Skills 4Academic Skills 4
Academic Skills 4
Hala Nur
 
SEO Basics - SEO Company in India
SEO Basics - SEO Company in IndiaSEO Basics - SEO Company in India
SEO Basics - SEO Company in India
annakoch32
 
SEO Tutorial - SEO Company in India
SEO Tutorial - SEO Company in IndiaSEO Tutorial - SEO Company in India
SEO Tutorial - SEO Company in India
annakoch32
 
Advanced Internet Searching
Advanced Internet SearchingAdvanced Internet Searching
Advanced Internet Searching
Pamela Seabolt
 
You Don't Know SEO
You Don't Know SEOYou Don't Know SEO
You Don't Know SEO
Michael King
 
Google search techniques
Google search techniquesGoogle search techniques
Google search techniques
Nirav Ranpara
 
Google search techniques
Google search techniquesGoogle search techniques
Google search techniques
Nirav Ranpara
 
SEO Code-Breaking: How Smarter Content Leads to Better Search Rankings
SEO Code-Breaking: How Smarter Content Leads to Better Search RankingsSEO Code-Breaking: How Smarter Content Leads to Better Search Rankings
SEO Code-Breaking: How Smarter Content Leads to Better Search Rankings
Chris Prendergast
 
Quality not quantity
Quality not quantityQuality not quantity
Quality not quantity
vanesz
 
Internet searching
Internet searchingInternet searching
Internet searching
muaz mustafa
 
professional fuzzy type-ahead rummage around in xml type-ahead search techni...
professional fuzzy type-ahead rummage around in xml  type-ahead search techni...professional fuzzy type-ahead rummage around in xml  type-ahead search techni...
professional fuzzy type-ahead rummage around in xml type-ahead search techni...
Kumar Goud
 
Lost in the Net: Navigating Search Engines
Lost in the Net:  Navigating Search EnginesLost in the Net:  Navigating Search Engines
Lost in the Net: Navigating Search Engines
Johan Koren
 
Search Strings
Search StringsSearch Strings
Search Strings
Erin Sees
 
Search Kwatkins
Search KwatkinsSearch Kwatkins
Search Kwatkins
Juan Pittau
 
Content Analyst - Conceptualizing LSI Based Text Analytics White Paper
Content Analyst - Conceptualizing LSI Based Text Analytics White PaperContent Analyst - Conceptualizing LSI Based Text Analytics White Paper
Content Analyst - Conceptualizing LSI Based Text Analytics White Paper
John Felahi
 
Search engine optimization
Search engine optimizationSearch engine optimization
Search engine optimization
University of peshawar
 

Similar to Full text and relational search (20)

Internet search techniques for K12
Internet search techniques for K12Internet search techniques for K12
Internet search techniques for K12
 
Google Advance Searching- Information System Audit & Control Report
Google Advance Searching- Information System Audit & Control ReportGoogle Advance Searching- Information System Audit & Control Report
Google Advance Searching- Information System Audit & Control Report
 
Ppt Lesson 03
Ppt Lesson 03Ppt Lesson 03
Ppt Lesson 03
 
Cap 233 how search works
Cap 233 how search worksCap 233 how search works
Cap 233 how search works
 
Academic Skills 4
Academic Skills 4Academic Skills 4
Academic Skills 4
 
SEO Basics - SEO Company in India
SEO Basics - SEO Company in IndiaSEO Basics - SEO Company in India
SEO Basics - SEO Company in India
 
SEO Tutorial - SEO Company in India
SEO Tutorial - SEO Company in IndiaSEO Tutorial - SEO Company in India
SEO Tutorial - SEO Company in India
 
Advanced Internet Searching
Advanced Internet SearchingAdvanced Internet Searching
Advanced Internet Searching
 
You Don't Know SEO
You Don't Know SEOYou Don't Know SEO
You Don't Know SEO
 
Google search techniques
Google search techniquesGoogle search techniques
Google search techniques
 
Google search techniques
Google search techniquesGoogle search techniques
Google search techniques
 
SEO Code-Breaking: How Smarter Content Leads to Better Search Rankings
SEO Code-Breaking: How Smarter Content Leads to Better Search RankingsSEO Code-Breaking: How Smarter Content Leads to Better Search Rankings
SEO Code-Breaking: How Smarter Content Leads to Better Search Rankings
 
Quality not quantity
Quality not quantityQuality not quantity
Quality not quantity
 
Internet searching
Internet searchingInternet searching
Internet searching
 
professional fuzzy type-ahead rummage around in xml type-ahead search techni...
professional fuzzy type-ahead rummage around in xml  type-ahead search techni...professional fuzzy type-ahead rummage around in xml  type-ahead search techni...
professional fuzzy type-ahead rummage around in xml type-ahead search techni...
 
Lost in the Net: Navigating Search Engines
Lost in the Net:  Navigating Search EnginesLost in the Net:  Navigating Search Engines
Lost in the Net: Navigating Search Engines
 
Search Strings
Search StringsSearch Strings
Search Strings
 
Search Kwatkins
Search KwatkinsSearch Kwatkins
Search Kwatkins
 
Content Analyst - Conceptualizing LSI Based Text Analytics White Paper
Content Analyst - Conceptualizing LSI Based Text Analytics White PaperContent Analyst - Conceptualizing LSI Based Text Analytics White Paper
Content Analyst - Conceptualizing LSI Based Text Analytics White Paper
 
Search engine optimization
Search engine optimizationSearch engine optimization
Search engine optimization
 

More from Vijay Yadav

Decision tree presentation
Decision tree presentationDecision tree presentation
Decision tree presentation
Vijay Yadav
 
Summary of Knowledge creating company article
Summary of Knowledge creating company articleSummary of Knowledge creating company article
Summary of Knowledge creating company article
Vijay Yadav
 
Hospital and-medical-store-management-system
Hospital and-medical-store-management-systemHospital and-medical-store-management-system
Hospital and-medical-store-management-system
Vijay Yadav
 
Software reuse slide
Software reuse slideSoftware reuse slide
Software reuse slide
Vijay Yadav
 
File system performance
File system performanceFile system performance
File system performance
Vijay Yadav
 
Client server
Client serverClient server
Client server
Vijay Yadav
 
Cricket database
Cricket databaseCricket database
Cricket database
Vijay Yadav
 
Web mining
Web miningWeb mining
Web mining
Vijay Yadav
 

More from Vijay Yadav (8)

Decision tree presentation
Decision tree presentationDecision tree presentation
Decision tree presentation
 
Summary of Knowledge creating company article
Summary of Knowledge creating company articleSummary of Knowledge creating company article
Summary of Knowledge creating company article
 
Hospital and-medical-store-management-system
Hospital and-medical-store-management-systemHospital and-medical-store-management-system
Hospital and-medical-store-management-system
 
Software reuse slide
Software reuse slideSoftware reuse slide
Software reuse slide
 
File system performance
File system performanceFile system performance
File system performance
 
Client server
Client serverClient server
Client server
 
Cricket database
Cricket databaseCricket database
Cricket database
 
Web mining
Web miningWeb mining
Web mining
 

Recently uploaded

Software Engineering and Project Management - Introduction, Modeling Concepts...
Software Engineering and Project Management - Introduction, Modeling Concepts...Software Engineering and Project Management - Introduction, Modeling Concepts...
Software Engineering and Project Management - Introduction, Modeling Concepts...
Prakhyath Rai
 
BRAIN TUMOR DETECTION for seminar ppt.pdf
BRAIN TUMOR DETECTION for seminar ppt.pdfBRAIN TUMOR DETECTION for seminar ppt.pdf
BRAIN TUMOR DETECTION for seminar ppt.pdf
LAXMAREDDY22
 
官方认证美国密歇根州立大学毕业证学位证书原版一模一样
官方认证美国密歇根州立大学毕业证学位证书原版一模一样官方认证美国密歇根州立大学毕业证学位证书原版一模一样
官方认证美国密歇根州立大学毕业证学位证书原版一模一样
171ticu
 
An improved modulation technique suitable for a three level flying capacitor ...
An improved modulation technique suitable for a three level flying capacitor ...An improved modulation technique suitable for a three level flying capacitor ...
An improved modulation technique suitable for a three level flying capacitor ...
IJECEIAES
 
IEEE Aerospace and Electronic Systems Society as a Graduate Student Member
IEEE Aerospace and Electronic Systems Society as a Graduate Student MemberIEEE Aerospace and Electronic Systems Society as a Graduate Student Member
IEEE Aerospace and Electronic Systems Society as a Graduate Student Member
VICTOR MAESTRE RAMIREZ
 
4. Mosca vol I -Fisica-Tipler-5ta-Edicion-Vol-1.pdf
4. Mosca vol I -Fisica-Tipler-5ta-Edicion-Vol-1.pdf4. Mosca vol I -Fisica-Tipler-5ta-Edicion-Vol-1.pdf
4. Mosca vol I -Fisica-Tipler-5ta-Edicion-Vol-1.pdf
Gino153088
 
Seminar on Distillation study-mafia.pptx
Seminar on Distillation study-mafia.pptxSeminar on Distillation study-mafia.pptx
Seminar on Distillation study-mafia.pptx
Madan Karki
 
spirit beverages ppt without graphics.pptx
spirit beverages ppt without graphics.pptxspirit beverages ppt without graphics.pptx
spirit beverages ppt without graphics.pptx
Madan Karki
 
AI assisted telemedicine KIOSK for Rural India.pptx
AI assisted telemedicine KIOSK for Rural India.pptxAI assisted telemedicine KIOSK for Rural India.pptx
AI assisted telemedicine KIOSK for Rural India.pptx
architagupta876
 
Curve Fitting in Numerical Methods Regression
Curve Fitting in Numerical Methods RegressionCurve Fitting in Numerical Methods Regression
Curve Fitting in Numerical Methods Regression
Nada Hikmah
 
Rainfall intensity duration frequency curve statistical analysis and modeling...
Rainfall intensity duration frequency curve statistical analysis and modeling...Rainfall intensity duration frequency curve statistical analysis and modeling...
Rainfall intensity duration frequency curve statistical analysis and modeling...
bijceesjournal
 
Mechanical Engineering on AAI Summer Training Report-003.pdf
Mechanical Engineering on AAI Summer Training Report-003.pdfMechanical Engineering on AAI Summer Training Report-003.pdf
Mechanical Engineering on AAI Summer Training Report-003.pdf
21UME003TUSHARDEB
 
Applications of artificial Intelligence in Mechanical Engineering.pdf
Applications of artificial Intelligence in Mechanical Engineering.pdfApplications of artificial Intelligence in Mechanical Engineering.pdf
Applications of artificial Intelligence in Mechanical Engineering.pdf
Atif Razi
 
哪里办理(csu毕业证书)查尔斯特大学毕业证硕士学历原版一模一样
哪里办理(csu毕业证书)查尔斯特大学毕业证硕士学历原版一模一样哪里办理(csu毕业证书)查尔斯特大学毕业证硕士学历原版一模一样
哪里办理(csu毕业证书)查尔斯特大学毕业证硕士学历原版一模一样
insn4465
 
Computational Engineering IITH Presentation
Computational Engineering IITH PresentationComputational Engineering IITH Presentation
Computational Engineering IITH Presentation
co23btech11018
 
integral complex analysis chapter 06 .pdf
integral complex analysis chapter 06 .pdfintegral complex analysis chapter 06 .pdf
integral complex analysis chapter 06 .pdf
gaafergoudaay7aga
 
22CYT12-Unit-V-E Waste and its Management.ppt
22CYT12-Unit-V-E Waste and its Management.ppt22CYT12-Unit-V-E Waste and its Management.ppt
22CYT12-Unit-V-E Waste and its Management.ppt
KrishnaveniKrishnara1
 
Manufacturing Process of molasses based distillery ppt.pptx
Manufacturing Process of molasses based distillery ppt.pptxManufacturing Process of molasses based distillery ppt.pptx
Manufacturing Process of molasses based distillery ppt.pptx
Madan Karki
 
学校原版美国波士顿大学毕业证学历学位证书原版一模一样
学校原版美国波士顿大学毕业证学历学位证书原版一模一样学校原版美国波士顿大学毕业证学历学位证书原版一模一样
学校原版美国波士顿大学毕业证学历学位证书原版一模一样
171ticu
 
ITSM Integration with MuleSoft.pptx
ITSM  Integration with MuleSoft.pptxITSM  Integration with MuleSoft.pptx
ITSM Integration with MuleSoft.pptx
VANDANAMOHANGOUDA
 

Recently uploaded (20)

Software Engineering and Project Management - Introduction, Modeling Concepts...
Software Engineering and Project Management - Introduction, Modeling Concepts...Software Engineering and Project Management - Introduction, Modeling Concepts...
Software Engineering and Project Management - Introduction, Modeling Concepts...
 
BRAIN TUMOR DETECTION for seminar ppt.pdf
BRAIN TUMOR DETECTION for seminar ppt.pdfBRAIN TUMOR DETECTION for seminar ppt.pdf
BRAIN TUMOR DETECTION for seminar ppt.pdf
 
官方认证美国密歇根州立大学毕业证学位证书原版一模一样
官方认证美国密歇根州立大学毕业证学位证书原版一模一样官方认证美国密歇根州立大学毕业证学位证书原版一模一样
官方认证美国密歇根州立大学毕业证学位证书原版一模一样
 
An improved modulation technique suitable for a three level flying capacitor ...
An improved modulation technique suitable for a three level flying capacitor ...An improved modulation technique suitable for a three level flying capacitor ...
An improved modulation technique suitable for a three level flying capacitor ...
 
IEEE Aerospace and Electronic Systems Society as a Graduate Student Member
IEEE Aerospace and Electronic Systems Society as a Graduate Student MemberIEEE Aerospace and Electronic Systems Society as a Graduate Student Member
IEEE Aerospace and Electronic Systems Society as a Graduate Student Member
 
4. Mosca vol I -Fisica-Tipler-5ta-Edicion-Vol-1.pdf
4. Mosca vol I -Fisica-Tipler-5ta-Edicion-Vol-1.pdf4. Mosca vol I -Fisica-Tipler-5ta-Edicion-Vol-1.pdf
4. Mosca vol I -Fisica-Tipler-5ta-Edicion-Vol-1.pdf
 
Seminar on Distillation study-mafia.pptx
Seminar on Distillation study-mafia.pptxSeminar on Distillation study-mafia.pptx
Seminar on Distillation study-mafia.pptx
 
spirit beverages ppt without graphics.pptx
spirit beverages ppt without graphics.pptxspirit beverages ppt without graphics.pptx
spirit beverages ppt without graphics.pptx
 
AI assisted telemedicine KIOSK for Rural India.pptx
AI assisted telemedicine KIOSK for Rural India.pptxAI assisted telemedicine KIOSK for Rural India.pptx
AI assisted telemedicine KIOSK for Rural India.pptx
 
Curve Fitting in Numerical Methods Regression
Curve Fitting in Numerical Methods RegressionCurve Fitting in Numerical Methods Regression
Curve Fitting in Numerical Methods Regression
 
Rainfall intensity duration frequency curve statistical analysis and modeling...
Rainfall intensity duration frequency curve statistical analysis and modeling...Rainfall intensity duration frequency curve statistical analysis and modeling...
Rainfall intensity duration frequency curve statistical analysis and modeling...
 
Mechanical Engineering on AAI Summer Training Report-003.pdf
Mechanical Engineering on AAI Summer Training Report-003.pdfMechanical Engineering on AAI Summer Training Report-003.pdf
Mechanical Engineering on AAI Summer Training Report-003.pdf
 
Applications of artificial Intelligence in Mechanical Engineering.pdf
Applications of artificial Intelligence in Mechanical Engineering.pdfApplications of artificial Intelligence in Mechanical Engineering.pdf
Applications of artificial Intelligence in Mechanical Engineering.pdf
 
哪里办理(csu毕业证书)查尔斯特大学毕业证硕士学历原版一模一样
哪里办理(csu毕业证书)查尔斯特大学毕业证硕士学历原版一模一样哪里办理(csu毕业证书)查尔斯特大学毕业证硕士学历原版一模一样
哪里办理(csu毕业证书)查尔斯特大学毕业证硕士学历原版一模一样
 
Computational Engineering IITH Presentation
Computational Engineering IITH PresentationComputational Engineering IITH Presentation
Computational Engineering IITH Presentation
 
integral complex analysis chapter 06 .pdf
integral complex analysis chapter 06 .pdfintegral complex analysis chapter 06 .pdf
integral complex analysis chapter 06 .pdf
 
22CYT12-Unit-V-E Waste and its Management.ppt
22CYT12-Unit-V-E Waste and its Management.ppt22CYT12-Unit-V-E Waste and its Management.ppt
22CYT12-Unit-V-E Waste and its Management.ppt
 
Manufacturing Process of molasses based distillery ppt.pptx
Manufacturing Process of molasses based distillery ppt.pptxManufacturing Process of molasses based distillery ppt.pptx
Manufacturing Process of molasses based distillery ppt.pptx
 
学校原版美国波士顿大学毕业证学历学位证书原版一模一样
学校原版美国波士顿大学毕业证学历学位证书原版一模一样学校原版美国波士顿大学毕业证学历学位证书原版一模一样
学校原版美国波士顿大学毕业证学历学位证书原版一模一样
 
ITSM Integration with MuleSoft.pptx
ITSM  Integration with MuleSoft.pptxITSM  Integration with MuleSoft.pptx
ITSM Integration with MuleSoft.pptx
 

Full text and relational search

  • 1. Full-text & Relational search VIJAY YADAV 072-BCT-547
  • 2. 2 SEARCH IS HARD  Average no. of search per day is over 3.5 billion on Google alone.  That’s one search for every two people (including babies and grandmothers, but excluding zombies) in the world.
  • 3. 3  That doesn’t even include the number of searches on Amazon, LinkedIn and Facebook. We use search for everything.  Oh, except the company data. We still use BI analysts, data scientists, specialized tools, and SQL for that.
  • 4. 4 So what is full-text search ?  It is simply a document-based search mainly employed by word processing applications and various search engines.  It often performs two tasks: indexing and searching.
  • 5. 5  The indexing stage will scan the text of all the documents and build a list of search terms (often called as index). The indexer will ignore stop words such as "the" and "and“. Also the word drives, drove, driven will be recorded only as a single word “drive”.  In the search stage, when performing a specific query, only the index is referenced, rather than the text of the original documents.
  • 6. 6 Two ways of performance improvements I. Improved query tools II. Improved search algorithms Improved search algorithms  PageRank algorithm developed by google.
  • 7. 7 Improved query tools  Keywords: Creators are asked to list the words that best describe the text including synonyms.  Phrase search: Will search only those documents that contain certain phrase.  Fuzzy search: Will search documents with even some variations around the given term.
  • 8. Some fuzzy search algorithms Soundex Metaphone Double Metaphone
  • 9. Soundex In PostgreSQL, below two queries will result in same output with soundex algorithm and hence even wrong typed word can give right result. 1. SELECT soundex(‘elephant’); ————————————- => E415 2. SELECT soundex(‘elephents’); ————————————– => E415
  • 11. Problems with full-text search • The results may not be 100% accurate. • Large number of irrelevant search results due to lack of relation among the words
  • 12.
  • 13. Why Relational search ? • Gives more accurate and relevant result. • Much useful for business analytics. but…
  • 14. Relational search is even harder because 1. Company’s data is complicated • Search on LinkedIn probably means searching for a person or a company. • Search on Amazon probably means searching for a product. • But company’s data includes multiple databases, tables, columns, rows with complicated relationships between them.
  • 15. 2. Needs to be 100% accurate or you risk your business What’s worse than guessing? Being convinced by bad data.
  • 16. 3. Needs to be faster
  • 17. Relational search makes huge difference in enterprise because it takes deterministic input to give deterministic output.