Omar, M., Mehmood, A., Choi, G.S., Park, H.W.@ (2017 Online First). Global mapping of artificial intelligence in Google and Google Scholar. Scientometrics.
https://link.springer.com/article/10.1007%2Fs11192-017-2534-4
#인공지능 분야의 국가별 현황과 전망을 #웹보메트릭스 #빅데이터 #구글데이터 이용해서 세계지도를 그렸습니다.
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
Global mapping of artificial intelligence in Google and Google Scholar
1. Is Boom in Artificial
Intelligence a Global One?
Presented by Han Woo Park
Co-Authors
Muhammad Omar, Arif Mehmood, Gyu Sang Choi
Yeungnam University
Omar, M., Mehmood, A., Choi, G.S., Park, H.W.@ (2017 Online First). Global mapping of
artificial intelligence in Google and Google Scholar. Scientometrics.
https://link.springer.com/article/10.1007%2Fs11192-017-2534-4
#인공지능 분야의 국가별 현황과
전망을 #웹보메트릭스 #빅데이터 #구글데이터 이용해서 세계지도를 그렸습니다.
Presented at DISC 2016
http://watef.org/home/bbs/sub_disc_program.php
https://jceasia.org/about-journal/
2. The Study
Investigation of world wide prevalence of AI using
– Google Trends (GT) - https://www.google.com/trends
• Using News, Web, and YouTube Categories
– Google Books Ngram Viewer- https://books.google.com/ngrams
• Using various corpora
– URL data of Search Engine(SE) results
• Google- https://www.google.com
• Google Scholar (GS)- https://scholar.google.com/
3. Selection of Query, “artificial intelligence”
The bi-gram, artificial intelligence, umbrellas all the related terms
4. Selection of Query, “artificial intelligence”
The bi-gram, artificial intelligence, umbrellas all the related terms
5. Selection of Query, “artificial intelligence”
The bi-gram, artificial intelligence, is popular world wide
6. Results- Google Books Ngram Viewer
The bi-gram, artificial intelligence, is more popular in books published in Great
Britain and United States.
7. Results- Google Trends
Top-5 countries (a) Emphasizing a search category- You Tube, Web, and News. (b) & (c)
Emphasizing a country and its top search category.
8. URLs Data Collection
• Search Engines Choice
– Google Scholar Vs. all
• Van Noorden et al. survey paper in Nature 2014, shows
in terms of scholarly-reach GS was ranked 1st, followed
by ResearchGate 2nd, LinkedIn 3rd, Facebook 4th,
Google+ 5th, Twitter 6th, ReseacherID 7th, Mendley 8th,
ORCID 9th, Academia.edu 10th, BiomedExperts 11th,
Microsoft Academic Search 12th
– Google Vs. Bing Vs. Yahoo Vs. Naver ?
• https://www.similarweb.com/website/google.com?c
ompetitors=bing.com
10. URLs Data Collection- Search Query
https://pro.similarweb.com
– We collected data for years 2004-2016
– Search Query
• allintitle: "artificial intelligence“
• The title of a webpage is unique
• Title is an important factor in search engine
optimization (SEO)
11. URLs Data Collection- Web Scraping
https://pro.similarweb.com
– Data Miner https://data-miner.io
• Google chrome extension
• Free for moderate size data (free for 500 pages/month)
• Using data extraction rules, i.e. recipes, we extracted
URLs associated with hyperlinked tiles (query results)
• No programming background required
• Easy to use
• May be available fully free for academics
20. Findings …
To quantify a concept, like AI,
• we need different approaches
• GTs, Google Ngrams Viewer and URLs data
complements each other
• GTs doesn’t show an important country for AI
• Google Books Ngram Viewer is old (upto 2008)
and don’t provide information of end users
• We also need small and targeted local area
analysis in addition to Big Data
21. Findings …
URLs data manipulation provides Big Data
analysis easy
• We don’t need traditional hardware and
software for Big Data analytics
• We scraped/analyzed Big web data for free
22. Findings …
• Top ranked results show that AI is popular mostly
amongst professionals researchers, programmers
– Top results reveal few countries
– But low ranked results show AI dominance all over the
globe from low income countries to high income
• If AI-based systems come to dominate human
society, humans will become completely
dependent on digital information.
• We also need to study impact of AI using theories
like McDonaldization proposed by George Reitzer.
Many Google trend categories exist we exploited only three namely- News, Web, and You Tube search categories.
Books predominantly in the English language published in any country. English (2009) corpus was generated in July 2009. See section, Corpora @ https://books.google.com/ngrams/info , for further details.
Google Trends settings: Worldwide, 2004 – present (Dated 28 November 2016), All categories, Web Search.
Interest over time: Numbers represent search interest relative to the highest point on the chart for the given region and time. A value of 100 is the peak popularity for the term. A value of 50 means that the term is half as popular. Likewise a score of 0 means the term was less than 1% as popular as the peak.
Google Trends settings: Worldwide, 2004 – present (Dated 28 November 2016), All categories, Web Search.
Interest by region: See which term ranked highest in each region during the specified time frame. Values are scaled from 0 to 100, where 100 is the region with peak popularity, a value of 50 is the region where the term is half as popular, and a value of 0 means that term was less than 1% as popular as the peak.
English 2009 (eng_2009) Books predominantly in the English language published in any country.
British English 2009 (eng_gb_2009) Books predominantly in the English language that were published in Great Britain.
American English 2009 (eng_us_2009) Books predominantly in the English language that were published in the United States.
French 2009 (fre_2009 ) Books predominantly in the French language.
German 2009 (ger_2009) Books predominantly in the German language.
AI popularity is shown in terms of (1) field of study- here AI is termed as a study discipline (2) search term.
Note: Rank 1 means lowest rank. Data was collected from GTs in May 2016 and the results are shown by Excel charts. Gimp 2.0 used for screen capture.
June 14th, 2016, google.com is world’s most popular website. According to alexa.com its global rank is #1 and for similarweb.com its global rank is #2 and category rank in terms of “internet and telecom search engine” is also ranked #1.
[Van Noorden R. Online collaboration: Scientists and the social network. Nature. 2014 Aug;512(7513):126-9.] Van Noorden et al. conducted a survey in May 2014, where in terms of scholarly-reach GS was ranked 1st, followed by ResearchGate 2nd, LinkedIn 3rd, Facebook 4th, Google+ 5th, Twitter 6th, ReseacherID 7th, Mendley 8th, ORCID 9th, Academia.edu 10th, BiomedExperts 11th, Microsoft Academic Search 12th
Look at the popularity of Google. June 14th, 2016, google.com is world’s most popular website. According to alexa.com its global rank is #1 and for similarweb.com its global rank is #2 and category rank in terms of “internet and telecom search engine” is also ranked #1.
Google is at the Top.
[Van Noorden R. Online collaboration: Scientists and the social network. Nature. 2014 Aug;512(7513):126-9.] Van Noorden et al. conducted a survey in May 2014, where in terms of scholarly-reach GS was ranked 1st, followed by ResearchGate 2nd, LinkedIn 3rd, Facebook 4th, Google+ 5th, Twitter 6th, ReseacherID 7th, Mendley 8th, ORCID 9th, Academia.edu 10th, BiomedExperts 11th, Microsoft Academic Search 12th
Look at the popularity of Google. June 14th, 2016, google.com is world’s most popular website. According to alexa.com its global rank is #1 and for similarweb.com its global rank is #2 and category rank in terms of “internet and telecom search engine” is also ranked #1.
Google is at the Top.
[Van Noorden R. Online collaboration: Scientists and the social network. Nature. 2014 Aug;512(7513):126-9.] Van Noorden et al. conducted a survey in May 2014, where in terms of scholarly-reach GS was ranked 1st, followed by ResearchGate 2nd, LinkedIn 3rd, Facebook 4th, Google+ 5th, Twitter 6th, ReseacherID 7th, Mendley 8th, ORCID 9th, Academia.edu 10th, BiomedExperts 11th, Microsoft Academic Search 12th
Look at the popularity of Google. June 14th, 2016, google.com is world’s most popular website. According to alexa.com its global rank is #1 and for similarweb.com its global rank is #2 and category rank in terms of “internet and telecom search engine” is also ranked #1.
Google is at the Top.
[Van Noorden R. Online collaboration: Scientists and the social network. Nature. 2014 Aug;512(7513):126-9.] Van Noorden et al. conducted a survey in May 2014, where in terms of scholarly-reach GS was ranked 1st, followed by ResearchGate 2nd, LinkedIn 3rd, Facebook 4th, Google+ 5th, Twitter 6th, ReseacherID 7th, Mendley 8th, ORCID 9th, Academia.edu 10th, BiomedExperts 11th, Microsoft Academic Search 12th