SEARCH ENGINE DEPENDENCY
   AND ITS INFLUENCE ON
      DATA QUALITY
           By Ronan CHARDONNEAU
Index

            I - Introduction to the world of search engines
                 II - Risks of search engine dependency
                      III - How to solve the equation?
           IV - Future of Google and information research
                                V - Conclusion




I-Introduction   II-Risks   III-Solutions   IV-Future   V-Conclusion
The World of Search
               engines



Market configuration
          TOP 10 Search websites in the world for August 2007




            Target: users aged 15 and over, at home and at work
                         Source: comScore qSearch 2.0

Leaders per country




                                                Source: map made using data from Alexa, "the
                                                   Web Information Company" (2008).




A win or lose market




Approximate share of content by language
          available on the Internet




                                                         Source: Internet World Stats




What has already been shown
   • Studies show that the Internet is the main
   source of information (at least in Europe and
   the Americas);
   • Search engines are the most used websites
   on the Internet;
   • People trust search engine results;
   • When searching on the Internet, people mainly
   use a single search engine;



Brief summary
   • Google is the market leader; its followers are far behind;
   • Eight leading search engines, and probably eight
   "continents" on the Internet;
   • A market defined by the adoption of standards
   (<50%) for search;
   • Content is mainly in English; Chinese is important,
   and there is quality content in Japanese, German
   and Korean;
   • Internet users cannot live without search
   engines and are loyal to a specific one;

Risks of search engine
        dependency and its influence on
                 data quality




Definition
  The habit of not questioning the results returned by a single
  search engine.

  It usually starts when you hear sentences such as:

  - "Why should I bother using other search engines when I find
  everything I want with Google?"

  - "Am I really taking any risk when I use Google?"

  - Google ranks among the top 100 websites in every country in the world;

  - Google has been recognized as the most powerful brand;


• Who is Google? Well... it is our friend;
      • We can take it everywhere; it is relevant and convenient (quick
      display, associated services);
      • But:
          – You have to know how to deal with it;
          – You have to know its limits;
          – You have to know its potential;




Consequences

      • If you don't know how to deal with it:
           - You will never use its full capabilities;
           - You will probably take the first piece of information
             displayed;
      • If you don't know its limits:
           - When you cannot find a piece of information, you may think
             that it does not exist;
           - You may even think that the technology does not
             exist elsewhere;
      • If you don't know its potential:
           - You will not get better at searching;


Advertisement
      • The business model of search engines is based on
      advertising (99% of Google's revenue comes from it);
      • However, studies show that some categories of
      adults (non-Internet generations) do not distinguish
      between commercial and non-commercial
      links;
      • Some search engines are more commercial than others;
      • The better you know a search engine (Google), the
      better you can practise Search Engine Optimization;




Google is not an isolated case
      • Baidu dependency in China and Yandex dependency in
      Russia;
      • Seznam dependency in the Czech Republic;
      • Naver dependency in South Korea;
      • Yahoo dependency in Japan and many other Asian
      countries;




Brief summary

      • Search engine dependency is comfortable and therefore
      understandable;
      • But for many reasons it leads to mass-consumption
      information (blog phenomenon, advertising…), which is
      not the best kind;
      • In our countries it is Google dependency, but keep in
      mind that Europe and the Americas are not the center of the
      world;




How to solve the equation?




First point

    •    If an answer exists... we should look for it;
    •    At the moment there is no miracle solution
         for lazy searching;
    •    But there are ways to get closer to the
         answer;



Three pillars
                       Learn how to use the technology




    Technological awareness                     Breaking the habits


Concrete case: Google
Learn how to use the technology:
• Run advanced searches:
   – Simple search operators (« », link:, define:, ?, *, ~, …);
   – Complex query: ?intitle:index.of? "" -filetype:html -filetype:asp -wiki -ringtone -filetype:htm -posts -lyrics -filetype:shtml -filetype:php -filetype:doc -filetype:pdf -filetype:txt mpeg wma avi wmv
   – Google Advanced Search;
• Use other Google services such as Google Alerts;
• Use Google's specialized search engines such as Google Scholar;
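
The operators above can be combined programmatically. The following is a minimal sketch, not an official Google API: the helper names `build_query` and `search_url` are hypothetical, and the URL pattern `https://www.google.com/search?q=...` is assumed to be the public search endpoint. It only assembles a query string from an exact phrase, an `intitle:` restriction, excluded words and excluded file types.

```python
from urllib.parse import urlencode


def build_query(terms, phrase=None, intitle=None,
                exclude_terms=(), exclude_filetypes=()):
    """Assemble an advanced query string (hypothetical helper)."""
    parts = []
    if intitle:
        parts.append(f"intitle:{intitle}")      # restrict to page titles
    if phrase:
        parts.append(f'"{phrase}"')             # exact-phrase match
    parts.extend(terms)                         # plain keywords
    parts.extend(f"-{term}" for term in exclude_terms)             # excluded words
    parts.extend(f"-filetype:{ext}" for ext in exclude_filetypes)  # excluded file types
    return " ".join(parts)


def search_url(query, base="https://www.google.com/search"):
    """URL-encode the query; the base URL is an assumption."""
    return f"{base}?{urlencode({'q': query})}"


query = build_query(
    ["search", "engines"],
    phrase="data quality",
    exclude_terms=["wiki"],
    exclude_filetypes=["pdf", "doc"],
)
print(query)
print(search_url(query))
```

Writing the query as data (lists of exclusions) rather than typing it by hand makes it easier to reuse the same filters on another search engine, provided that engine supports equivalent operators.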

  Breaking the habits:
     - Practise what you have learnt, and force
  yourself to do so;
     - Results will come, and it becomes a habit;

Concrete case: Google
  Technological awareness:
        By getting better at searching, you will discover new
   technologies that you will have to learn.

  For example: Google Alerts tells you that a new search
  engine is coming out, and then you try it;




Technological awareness: Google
                                                  Google Advanced Search




                                                                Do you know
                                                                  iGoogle?

Google Ads

   When Google promotes its own technology, chances are good that it is worthwhile


Technological awareness: How to
             select the best
•     The search engine market is a world of buzz:




•     Where every search engine wants to beat Google;
•     But are they really providing a technical
      revolution?

Start to look at what Google does not have

     •    Real time information: the Twitter example

                                            When Google starts to
                                            take an interest in
                                             someone's technology,
                                              it is probably a good one




Start to look at what Google does not have

     •    Finding similar websites: Who is like it?

                                                   Unfortunately, it
                                                    only works for
                                                   popular websites




Start to look at what Google does not have

Another way of searching for information: Social bookmarking



                                                  Advantages: you find
                                                  unindexed websites;
                                                     Disadvantages:
                                                   rubbish websites,
                                                    advertising?




Start to look at what Google does not have


Graphical display




Start to look where
                 Google is not the best
     Look for specialized search engines:
     - People: 123 People, CV gadget, Pipl…
     - Jobs: Indeed, JobiJoba…
     - Tutorials: Tutosearch…
     - Torrents: Toorgle…
     - Scientific information: Scirus…
     - Information in a specific language: Yandex
        for Russian, Baidu for Chinese…
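
One way to put this list into practice is a small lookup table that suggests where to search depending on what you are looking for. This is only an illustrative sketch: the category keys, the `suggest_engines` helper and the fallback to Google are assumptions made for the example, and the engine names are simply those listed on the slide.

```python
# Category -> specialized engines, copied from the slide.
# The category keys themselves are hypothetical labels.
SPECIALIZED_ENGINES = {
    "people": ["123 People", "CV gadget", "Pipl"],
    "jobs": ["Indeed", "JobiJoba"],
    "tutorials": ["Tutosearch"],
    "torrents": ["Toorgle"],
    "science": ["Scirus"],
    "russian": ["Yandex"],
    "chinese": ["Baidu"],
}


def suggest_engines(category, default=("Google",)):
    """Return the engines to try for a category, or the default engine."""
    key = category.strip().lower()
    return list(SPECIALIZED_ENGINES.get(key, default))


print(suggest_engines("Jobs"))
print(suggest_engines("recipes"))  # unknown category falls back to the default
```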
How to improve data quality on
                  the Internet?
     •    Triangle method: locate three independent
          sources that point to the same answer;
     •    Recent events in Tibet showed how important
          it is to look at different sources of
          information, including ones from outside your own country;
                            Source 1: Washington Post




           Source 2: Le Parisien   Source 3: AntiCnn.com
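
The triangle method can be sketched as a simple check: an answer is accepted only when at least three distinct sources report it. The function name `triangulate` and the input format are hypothetical, and treating differently named sources as independent is a simplifying assumption (in practice two outlets may repeat the same press agency).

```python
from collections import defaultdict


def triangulate(reports, required=3):
    """Return the answers backed by at least `required` distinct sources.

    `reports` is an iterable of (source_name, answer) pairs.
    Answers are compared after trimming whitespace and lowercasing.
    """
    supporters = defaultdict(set)
    for source, answer in reports:
        supporters[answer.strip().lower()].add(source)
    return {
        answer: sorted(sources)
        for answer, sources in supporters.items()
        if len(sources) >= required
    }


reports = [
    ("Source A", "Yes"),
    ("Source B", "yes "),
    ("Source C", "YES"),
    ("Source A", "yes"),   # the same source counted only once
    ("Source D", "no"),
]
print(triangulate(reports))
```

An answer reported by only one or two sources is left out of the result, which is the signal to keep searching rather than to accept the first result displayed.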
Brief summary

     •   Learn how to use the tools, change your habits, be aware;
     •   Be curious;
     •   Think about other ways to look for information;
     •   Use three independent sources of information;




Future of Google and
                 information research




Semantic search
     •    You get a feed of results instead of entering your
          query;
     •    Everyone is talking about semantic
          search;
     •    But it is not mature yet, a world of buzz again (there
          are not many suggestions);
     •    Poor results either way: a poor index if developed
          from scratch, few suggestions if developed by
          huge companies;

Some issues to fix
     • How to index pictures well? Are solutions such
       as Google Labeler the best?
     • How to index videos?
     • How to index sounds?




A Google which will have to change
•     Too much information on the Internet;
•     A Google which is breaking up into
      more and more sub search engines;
•     The development of high-bandwidth
      connections, which means more graphical
      interfaces;
•     A technological awareness which is
      difficult to pass on;

But a Google more and more
              present in our lives
     •    Forecasts point in that direction;
     •    Development of an OS for cell phones, a Web
          browser, Web software applications
          (Google slides, Google « excel »...)




The question is just how they
                  will do it?




         Google in 1998                      Google 11 years later




Brief summary

     • Google will be with us in the future and we
       have to get used to it;
     • Searching for information will be more and more
       assisted, but you will still fall behind if you
       do not run advanced searches;
     • In the near future some issues will still be
       there (indexing of pictures…)


Conclusion




What you have to keep in mind
     •    If you are dependent, at least be
          dependent in an informed way;
     •    Apply the triangle method;
     •    Reconsider the information process
          each time (think differently);




Recommendations
 Master's thesis about search engine dependency:
    - http://www.pandia.com/index.html
 List of search engines:
    - http://www.pandia.com/powersearch/index.html
    - http://www.philb.com/whichengine.htm
 To know more about search engines: Pandia search:
    - www.pandiasearch.com
 Documentaries:
    - Google: Behind the screen by IJsbrand van Veelen
 http://www.youtube.com/watch?v=TBNDYggyesc&hl=fr
    - The Great Firewall of China
 http://www.youtube.com/watch?v=IWsXhNJFj78&hl=fr
Thank you for your attention




http://moteurs-de-recherches-alternatifs.blogspot.com

Install Stable Diffusion in windows machine
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
Bluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdfBluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdf
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 

Risks of Search Engine Dependency and Data Quality

  • 1. SEARCH ENGINE DEPENDENCY AND ITS INFLUENCE ON DATA QUALITY By Ronan CHARDONNEAU
  • 2. Index
      I - Introduction to the world of search engines
      II - Risks of search engine dependency
      III - How to solve the equation?
      IV - Future of Google and information research
      V - Conclusion
      I-Introduction II-Risks III-Solutions IV-Future V-Conclusion
  • 3. The World of Search Engines
  • 4. Market configuration
      Top 10 search websites in the world for August 2007
      Target: users over 15 years old, at home and at work
      Source: comScore qSearch 2.0
  • 5. Leaders per country
      Source: map made using data from « Alexa the Web information company (2008) ».
  • 6. A win or lose market
  • 7. Approximation of language contents available on the Internet
      Source: Internet World Stats
  • 8. What has already been proved
      • Studies show that the Internet is the main information provider (at least in Europe and America);
      • When surfing the Internet, search engines are the most used websites;
      • People trust search engine results;
      • When searching on the Internet, people mainly use one single search engine;
  • 9. Brief summary
      • Google is the market leader; followers are far behind;
      • Eight search engine leaders, and probably eight "continents" on the Internet;
      • A market defined by the adoption of standards (<50%) for search;
      • Content is mainly in English; Chinese is important; there is quality content in Japanese, German and Korean;
      • Internet users cannot live without search engines and are loyal to a specific one;
  • 10. Risks of search engine dependency and its influence on data quality
  • 11. Definition
      The behaviour of not reconsidering the results coming from a single search engine. It normally starts when you hear sentences such as:
      - "Why should I bother using other search engines when I find everything I want with Google?"
      - Do I really run any risk when I use Google?
      - Every country in the world has Google in its top 100 or better;
      - Google has been recognized as the most powerful brand;
  • 12. Who is Google?
      • Well... it is our friend;
      • We can carry it everywhere; it is relevant and convenient (quick display, associated services);
      • But:
          - You have to know how to deal with it;
          - You have to know its limits;
          - You have to know its potential;
  • 13. Consequences
      • If you don't know how to deal with it:
          - You will never use its true capabilities;
          - You will probably take the first piece of information displayed;
      • If you don't know its limits:
          - When you cannot find the information, you may think that it does not exist;
          - You may even think that the technology does not exist elsewhere;
      • If you don't know its potential:
          - You will not improve at performing searches;
  • 14. Advertising
      • The search engine business model is based on advertising (99% of Google's revenue comes from it);
      • However, studies show that some categories of adults (non-Internet generations) do not distinguish between commercial and non-commercial links;
      • Some search engines are more commercial than others;
      • The better you know a search engine (Google), the more you can practise Search Engine Optimization;
  • 15. Google is not an isolated case
      • Baidu dependency in China and Yandex dependency in Russia;
      • Seznam dependency in the Czech Republic;
      • Naver dependency in South Korea;
      • Yahoo dependency in Japan and many other Asian countries;
  • 16. Brief summary
      • Search engine dependency is comfortable and therefore understandable;
      • But for many reasons it leads to mass-consumption information (blog phenomenon, advertising…), which is not the best kind;
      • In our countries it is Google dependency, but keep in mind that Europe and the Americas are not the centre of the world;
  • 17. How to solve the equation?
  • 18. First point
      • If an answer exists... we should look for it;
      • At the moment there is no miracle solution for lazy search;
      • But there are ways to get closer to the answer;
  • 19. Three pillars
      • Learn how to use the technology
      • Technological awareness
      • Breaking the habits
  • 20. Concrete case: Google
      Learn how to use the technology:
      • Perform advanced searches:
          - Simple Boolean operators (« », link:, define:, ?, *, ~,…);
          - Complex request: ?intitle:index.of? "" -filetype:html -filetype:asp -wiki -ringtone -filetype:htm -posts -lyrics -filetype:shtml -filetype:php -filetype:doc -filetype:pdf -filetype:txt mpeg wma avi wmv
          - Google Advanced Search;
      • Use other Google services such as Google Alerts;
      • Use Google sub-search engines such as Google Scholar;
      Breaking the habits:
      - Get used to practising what you have learnt, and force yourself to do so;
      - Results come, and you get used to it;
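A complex request like the one on slide 20 is easier to maintain when it is assembled from parts. The following is only a minimal sketch in Python: the helper names `build_query` and `search_url` are hypothetical, and the operators used (quoted phrase, `intitle:`, `-term`, `-filetype:`) are the ones named on the slide. How a given search engine interprets them today is not guaranteed.

```python
from urllib.parse import quote_plus


def build_query(terms, exact_phrase=None, in_title=None,
                exclude_terms=(), exclude_filetypes=()):
    """Assemble a query string from operator-style parts (hypothetical helper)."""
    parts = []
    if in_title:
        parts.append(f"intitle:{in_title}")       # restrict to page titles
    if exact_phrase:
        parts.append(f'"{exact_phrase}"')         # exact-phrase match
    parts.extend(terms)                           # plain keywords
    parts.extend(f"-{term}" for term in exclude_terms)            # excluded words
    parts.extend(f"-filetype:{ext}" for ext in exclude_filetypes)  # excluded file types
    return " ".join(parts)


def search_url(query, base="https://www.google.com/search"):
    """URL-encode the query; `base` can be swapped for another engine."""
    return f"{base}?q={quote_plus(query)}"


query = build_query(
    ["mpeg", "avi"],
    in_title="index.of",
    exclude_terms=["wiki", "lyrics"],
    exclude_filetypes=["html", "php", "pdf"],
)
print(query)
print(search_url(query))
```

Keeping the base URL as a parameter makes it easy to send the same request to a second search engine and compare the results, which fits the "breaking the habits" pillar.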
  • 21. Concrete case: Google
      Technological awareness: by performing better at search, you will discover new technologies that you will have to learn. For example: Google Alerts tells you that a new search engine is coming up, and then you try it;
  • 22. Technological awareness: Google
      • Google Advanced Search
      • Do you know iGoogle?
      • Google Ads
      When Google promotes its own technology, chances are good that it is worthwhile
  • 23. Technological awareness: How to select the best
      • The search engine market is a world of buzz,
      • where every search engine wants to beat Google;
      • But are they really providing a technical revolution?
  • 24. Start to look at what Google does not have
      • Real-time information: the Twitter example
      When Google starts to take an interest in someone's technology, it is probably a good one
  • 25. Start to look at what Google does not have
      • Finding similar websites: Who is like it?
      Unfortunately it works only for popular websites
  • 26. Start to look at what Google does not have
      Another way of searching for information: social bookmarking
      Advantages: you find unindexed websites;
      Disadvantages: rubbish websites, advertising?
  • 27. Start to look at what Google does not have
      Graphical display
  • 28. Start to look where Google is not the best
      Look for specialized search engines:
      - People: 123 People, CV gadget, Pipl…
      - Jobs: Indeed, JobiJoba…
      - Tutorials: Tutosearch…
      - Torrent: Toorgle…
      - Scientific information: Scirus…
      - Information in a specific language: Yandex for Russian, Baidu for Chinese…
  • 29. How to improve data quality on the Internet?
      • Triangle method: locating three independent sources that point to the same answer;
      • Recent events in Tibet showed how important it is to look at different sources of information, even outside your own country;
      Source 1: Washington Post
      Source 2: Le Parisien
      Source 3: AntiCnn.com
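The triangle method on slide 29 can be expressed as a small check: accept an answer only when three independent sources agree on it. The sketch below is one possible reading, not the author's procedure. It assumes that "independent" can be approximated by "published on different domains", which is a simplification, and the function name `triangulate` and the example URLs are hypothetical.

```python
from urllib.parse import urlparse


def triangulate(claims, required=3):
    """Return the answer backed by at least `required` distinct domains.

    `claims` is a list of (source_url, answer) pairs. Answers are compared
    after trimming whitespace and lowercasing, and the normalized form is
    returned. Returns None when no answer, or more than one answer,
    reaches the threshold.
    """
    domains_by_answer = {}
    for url, answer in claims:
        domain = urlparse(url).netloc.lower()   # assumption: one domain = one source
        key = answer.strip().lower()
        domains_by_answer.setdefault(key, set()).add(domain)

    confirmed = [answer for answer, domains in domains_by_answer.items()
                 if len(domains) >= required]
    return confirmed[0] if len(confirmed) == 1 else None


# Hypothetical example data
claims = [
    ("https://news-a.example/story", "Paris"),
    ("https://news-b.example/article", "paris "),
    ("https://news-c.example/report", "Paris"),
    ("https://news-a.example/other", "Lyon"),
]
print(triangulate(claims))
```

Two pages from the same domain count as one source here, so repeating a claim on a single site does not help it pass the check.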
  • 30. Brief summary
      • Learn how to use the technology, change your habits, be aware;
      • Be curious;
      • Think about other ways to look for information;
      • Three independent sources of information;
  • 31. Future of Google and information research
  • 32. Semantic search
      • You are fed information instead of entering your request;
      • Everyone is talking about semantic search;
      • But it is not mature yet; a buzzword again (there are not a lot of suggestions);
      • Poor results whether developed from scratch (poor index) or by huge companies (few suggestions);
  • 33. Some issues to fix
      • How to index pictures well? Are solutions such as Google Image Labeler the best?
      • How to index videos?
      • How to index sounds?
  • 34. A Google which will have to change
      • Too much information on the Internet;
      • A Google which is collapsing and providing more and more sub-search engines;
      • The development of high-bandwidth connections, which means graphical interfaces;
      • A technological awareness which is difficult to transmit;
  • 35. But a Google more and more present in our lives
      • Forecasts point in that direction;
      • Development of an OS for cell phones, a Web browser, Web software applications (Google slides, Google « excel »...)
  • 36. The question is just: how will they do it?
      Google in 1998
      Google 11 years later
  • 37. Brief summary
      • Google will be with us in the future and we have to get used to it;
      • Information research will be more and more assisted, but you will still lag behind if you do not perform advanced searches;
      • In the near future some issues will still be there (indexing of pictures…)
  • 38. Conclusion
  • 39. What you have to keep in mind
      • If you are dependent, at least be dependent wisely;
      • Apply the triangle method;
      • Reconsider your information-seeking process each time (think differently);
  • 40. Recommendations
      Master thesis about search engine dependency:
      - http://www.pandia.com/index.html
      List of search engines:
      - http://www.pandia.com/powersearch/index.html
      - http://www.philb.com/whichengine.htm
      To know more about search engines:
      Pandia search:
      - www.pandiasearch.com
      Documentaries:
      - Google: Behind the screen by IJsbrand van Veelen http://www.youtube.com/watch?v=TBNDYggyesc&hl=fr
      - The Great Firewall of China http://www.youtube.com/watch?v=IWsXhNJFj78&hl=fr
  • 41. Thank you for your attention http://moteurs-de-recherches-alternatifs.blogspot.com