Copyright © 2013, KISTIMSRA Meeting (2013.1)
InSciTe Project
Hanmin Jung
Head of the Dept. of Computer Intelligence Research
Copyright © 2013, KISTIMSRA Meeting (2013.1)
KISTI
Institute of Advanced Information
S/W Research Center
Dept. of Computer Intelligence Research
Copyright © 2013, KISTIMSRA Meeting (2013.1) 3
Human vs. Machine Intelligence
Copyright © 2013, KISTIMSRA Meeting (2013.1) 4
Machine Intelligence
http://powet.tv/powetblog/wp-content/uploads/2011/02/watson_the_computer_beats_ken_jennings_and_brad_rutter_at_jeopardy_full.jpg
IBM Watson
Copyright © 2013, KISTIMSRA Meeting (2013.1)
Machine Intelligence
http://cdn3.digitaltrends.com/wp-content/uploads/2011/10/1200-siri.jpg
Standford’s Robotic Car
Copyright © 2013, KISTIMSRA Meeting (2013.1)
Machine Intelligence
http://cdn3.digitaltrends.com/wp-content/uploads/2011/10/1200-siri.jpg
Apple Siri
Copyright © 2013, KISTIMSRA Meeting (2013.1) 7
Web Evolution
Copyright © 2013, KISTIMSRA Meeting (2013.1) 8
Size of Data in the World
http://www.ektron.com/billcavablog/Big-Data-Big-Content-Big-Challenges/
Q: How about human?
A: Our brain has the capacity
to store information
in the hundreds of terabytes
to petabyte range.
Copyright © 2013, KISTIMSRA Meeting (2013.1) 9
Effect of Big Data
Search Evaluation
http://static.googleusercontent.com/external_content/untrusted_dlcp/research.google.com/en//pubs/archive/40491.pdf
Copyright © 2013, KISTIMSRA Meeting (2013.1) 10
Search
Clustering
Extracting
Decision
Support
Forecasting
Scenario
Planning
Advising
Modified from D. Bousfield & P. Fooladi, “STM Information: 2009 Final Market Size and Share Report”, 2010.
Value Pyramid
InSciTe Advanced (2011)
InSciTe Adaptive (2012)
Copyright © 2013, KISTIMSRA Meeting (2013.1)
Needs of Experts
Relationship between technologiesRelationship between technologies
Leading companiesLeading companies
Technology gapTechnology gap
New entriesNew entries
Social informationSocial information
Technology hierarchyTechnology hierarchy
Standard patentsStandard patents
Product informationProduct information
Trend reportsTrend reports
Search historySearch history
Partner candidates recommendationPartner candidates recommendation
Significance of papers/patentsSignificance of papers/patents
Market sharesMarket shares
Citation informationCitation information
Key players in groupKey players in group
Core technologiesCore technologies
Market sizeMarket size
Information verificationInformation verification
11
Copyright © 2013, KISTIMSRA Meeting (2013.1)
Technology Intelligence
R. Rohrbeck, H. Arnold, and J. Heuer, “Strategic Foresight in Multimedia Enterprises”, 2007.
Copyright © 2013, KISTIMSRA Meeting (2013.1) 13
Quantitative Analytics
Copyright © 2013, KISTIMSRA Meeting (2013.1) 14
Quantitative Analytics
http://www.google.com/insights/search/
Insights for Search
Copyright © 2013, KISTIMSRA Meeting (2013.1) 15
TI Projects
FUSE
Funded by IARPA (early 2011 ~ early 2016)
Kick off meeting in summer, 2011
Foresight and Understanding from Scientific Exposition Program
Seeks to develop automated methods that aid in the systematic,
continuous, and comprehensive assessment of technical emergence using
information found in the published scientific, technical, and patent
literature
Partners
BAE Systems, Brandeis Univ., New York Univ., 1790 Analytics, …
Copyright © 2013, KISTIMSRA Meeting (2013.1)
TI Projects
CUBIST
Funded by the European Commission (late 2010 ~ late 2013)
1st CUBIST workshop in July, 2011
Combining and Uniting Business Intelligence with Semantic Technologies
Program
Aims to develop new ways to interrogate not only the massive volume data
on the Internet, but also analyze the different formats it exist in – such as
blogs, wikis, and video
Partners
SAP, Ontotext, Sheffield Hallam Univ., …
Copyright © 2013, KISTIMSRA Meeting (2013.1)
TI Projects
Common Technologies
Semantic technologies
Ontology, reasoning, URI scheme
Analytics model
BYOM (e.g. technology opportunity discovery model, technology
evolution model, formal concept analysis model)
Information extraction (InSciTe, FUSE)
Named entities and events/relations in textual documents
Copyright © 2013, KISTIMSRA Meeting (2013.1)
InSciTe Advanced (2011)
Copyright © 2013, KISTIMSRA Meeting (2013.1)
InSciTe Advanced (2011)
Data Fact Sheet
Articles: 15.4 millions (6.7 millions for papers, 8.7 millions for patents)
IEEE proceedings/journals (2001~2011)
Papers for all technical areas (2009~2011)
US/EU/Japan patents (2001~2011)
Technical terms: 68 thousands
Institutions: 340 thousands
Copyright © 2013, KISTIMSRA Meeting (2013.1) 20
InSciTe Adaptive (2012)
Copyright © 2013, KISTIMSRA Meeting (2013.1)
InSciTe Adaptive (2012)
Crawling Web Data by RSS & Google API
Copyright © 2013, KISTIMSRA Meeting (2013.1)
InSciTe Adaptive (2012)
Data Fact Sheet
Articles: 22.6 millions (9.8 millions for papers, 7.6 millions for patents, 5.3
millions for Web data)
All technical areas (2001~2011)
Named entities: 1.9 millions
Authority dictionary: 1.5 millions entries
Linked Data: 290 GB (will be connected)
Copyright © 2013, KISTIMSRA Meeting (2013.1) 23
InSciTe Adaptive (2012)
Big Data Test Bed
Copyright © 2013, KISTIMSRA Meeting (2013.1)
Case Studies
Ministry of Justice (2007~)
Copyright © 2013, KISTIMSRA Meeting (2013.1)
Case Studies
Korea Customs Service (2010~2011)
Copyright © 2013, KISTIMSRA Meeting (2013.1) 26
Case Studies
Defense Agency for Technology and Quality (2011~2012)
Copyright © 2013, KISTIMSRA Meeting (2013.1) 27
ISTIC, China
For national digital library based on analytics
Case Studies
Copyright © 2013, KISTIMSRA Meeting (2013.1)
InSciTe Architecture
Analytics Models
ETD Model
Emerging Technology Discovery Model
TLCD Model
Technology Life Cycle Discovery Model
TLC Model
Technology Life Cycle Model
OntoRelFinder®
Relationship Path Finder
OntoReasoner®
Reasoning Engine
OntoURI®
Semantic Knowledge Manager
OntoPipeliner®
Semantic Service Composer
SS&AE
Semantic Search & Analytics Engine
OntoURIResolver®
Identity Resolver
SINDI-CORE/LINK
Entity & Relationship Extractor
TUC Model
Terminology Use Cycle Model
Ontology
Linked Data
OntoFrame
OntoVerifier®
Reasoning Verifier
Web Data Crawler
RSS/Google API
Web Data
Literatures
Copyright © 2013, KISTIMSRA Meeting (2013.1)
InSciTe Project
Goal & Tasks (2013)
Development of S&T Literature Big Data Analytics/Application Platform
Big Data mining technology
Semantic analytics technology
Big Data relationship analytics/application technology
Technologies
Text mining
Multimedia mining
Semantic integration
Reasoning and graph analysis
Modeling and assess for relationship analytics and application
Copyright © 2013, KISTIMSRA Meeting (2013.1)
InSciTe Project
Partners (2013)
OVUM, UK
Building analytics model
Understanding business needs
Planning InSciTe service
MSRA, China
TBD
GESIS & Hildesheim Univ., Germany
Analyzing patent trends
Assessing InSciTe service platform
…
Copyright © 2013, KISTIMSRA Meeting (2013.1) 31
Homepage
http://semantics.kisti.re.kr
Copyright © 2013, KISTIMSRA Meeting (2013.1) 3232
Thank you
jhm@kisti.re.kr
“A lot of times, people don’t know what they want until you show it to them.”
by Steve Jobs
“Many people won’t be convinced until they’ve seen it for themselves.”
by Jakob Nielsen

InSciTe Project

  • 1.
    Copyright © 2013,KISTIMSRA Meeting (2013.1) InSciTe Project Hanmin Jung Head of the Dept. of Computer Intelligence Research
  • 2.
    Copyright © 2013,KISTIMSRA Meeting (2013.1) KISTI Institute of Advanced Information S/W Research Center Dept. of Computer Intelligence Research
  • 3.
    Copyright © 2013,KISTIMSRA Meeting (2013.1) 3 Human vs. Machine Intelligence
  • 4.
    Copyright © 2013,KISTIMSRA Meeting (2013.1) 4 Machine Intelligence http://powet.tv/powetblog/wp-content/uploads/2011/02/watson_the_computer_beats_ken_jennings_and_brad_rutter_at_jeopardy_full.jpg IBM Watson
  • 5.
    Copyright © 2013,KISTIMSRA Meeting (2013.1) Machine Intelligence http://cdn3.digitaltrends.com/wp-content/uploads/2011/10/1200-siri.jpg Standford’s Robotic Car
  • 6.
    Copyright © 2013,KISTIMSRA Meeting (2013.1) Machine Intelligence http://cdn3.digitaltrends.com/wp-content/uploads/2011/10/1200-siri.jpg Apple Siri
  • 7.
    Copyright © 2013,KISTIMSRA Meeting (2013.1) 7 Web Evolution
  • 8.
    Copyright © 2013,KISTIMSRA Meeting (2013.1) 8 Size of Data in the World http://www.ektron.com/billcavablog/Big-Data-Big-Content-Big-Challenges/ Q: How about human? A: Our brain has the capacity to store information in the hundreds of terabytes to petabyte range.
  • 9.
    Copyright © 2013,KISTIMSRA Meeting (2013.1) 9 Effect of Big Data Search Evaluation http://static.googleusercontent.com/external_content/untrusted_dlcp/research.google.com/en//pubs/archive/40491.pdf
  • 10.
    Copyright © 2013,KISTIMSRA Meeting (2013.1) 10 Search Clustering Extracting Decision Support Forecasting Scenario Planning Advising Modified from D. Bousfield & P. Fooladi, “STM Information: 2009 Final Market Size and Share Report”, 2010. Value Pyramid InSciTe Advanced (2011) InSciTe Adaptive (2012)
  • 11.
    Copyright © 2013,KISTIMSRA Meeting (2013.1) Needs of Experts Relationship between technologiesRelationship between technologies Leading companiesLeading companies Technology gapTechnology gap New entriesNew entries Social informationSocial information Technology hierarchyTechnology hierarchy Standard patentsStandard patents Product informationProduct information Trend reportsTrend reports Search historySearch history Partner candidates recommendationPartner candidates recommendation Significance of papers/patentsSignificance of papers/patents Market sharesMarket shares Citation informationCitation information Key players in groupKey players in group Core technologiesCore technologies Market sizeMarket size Information verificationInformation verification 11
  • 12.
    Copyright © 2013,KISTIMSRA Meeting (2013.1) Technology Intelligence R. Rohrbeck, H. Arnold, and J. Heuer, “Strategic Foresight in Multimedia Enterprises”, 2007.
  • 13.
    Copyright © 2013,KISTIMSRA Meeting (2013.1) 13 Quantitative Analytics
  • 14.
    Copyright © 2013,KISTIMSRA Meeting (2013.1) 14 Quantitative Analytics http://www.google.com/insights/search/ Insights for Search
  • 15.
    Copyright © 2013,KISTIMSRA Meeting (2013.1) 15 TI Projects FUSE Funded by IARPA (early 2011 ~ early 2016) Kick off meeting in summer, 2011 Foresight and Understanding from Scientific Exposition Program Seeks to develop automated methods that aid in the systematic, continuous, and comprehensive assessment of technical emergence using information found in the published scientific, technical, and patent literature Partners BAE Systems, Brandeis Univ., New York Univ., 1790 Analytics, …
  • 16.
    Copyright © 2013,KISTIMSRA Meeting (2013.1) TI Projects CUBIST Funded by the European Commission (late 2010 ~ late 2013) 1st CUBIST workshop in July, 2011 Combining and Uniting Business Intelligence with Semantic Technologies Program Aims to develop new ways to interrogate not only the massive volume data on the Internet, but also analyze the different formats it exist in – such as blogs, wikis, and video Partners SAP, Ontotext, Sheffield Hallam Univ., …
  • 17.
    Copyright © 2013,KISTIMSRA Meeting (2013.1) TI Projects Common Technologies Semantic technologies Ontology, reasoning, URI scheme Analytics model BYOM (e.g. technology opportunity discovery model, technology evolution model, formal concept analysis model) Information extraction (InSciTe, FUSE) Named entities and events/relations in textual documents
  • 18.
    Copyright © 2013,KISTIMSRA Meeting (2013.1) InSciTe Advanced (2011)
  • 19.
    Copyright © 2013,KISTIMSRA Meeting (2013.1) InSciTe Advanced (2011) Data Fact Sheet Articles: 15.4 millions (6.7 millions for papers, 8.7 millions for patents) IEEE proceedings/journals (2001~2011) Papers for all technical areas (2009~2011) US/EU/Japan patents (2001~2011) Technical terms: 68 thousands Institutions: 340 thousands
  • 20.
    Copyright © 2013,KISTIMSRA Meeting (2013.1) 20 InSciTe Adaptive (2012)
  • 21.
    Copyright © 2013,KISTIMSRA Meeting (2013.1) InSciTe Adaptive (2012) Crawling Web Data by RSS & Google API
  • 22.
    Copyright © 2013,KISTIMSRA Meeting (2013.1) InSciTe Adaptive (2012) Data Fact Sheet Articles: 22.6 millions (9.8 millions for papers, 7.6 millions for patents, 5.3 millions for Web data) All technical areas (2001~2011) Named entities: 1.9 millions Authority dictionary: 1.5 millions entries Linked Data: 290 GB (will be connected)
  • 23.
    Copyright © 2013,KISTIMSRA Meeting (2013.1) 23 InSciTe Adaptive (2012) Big Data Test Bed
  • 24.
    Copyright © 2013,KISTIMSRA Meeting (2013.1) Case Studies Ministry of Justice (2007~)
  • 25.
    Copyright © 2013,KISTIMSRA Meeting (2013.1) Case Studies Korea Customs Service (2010~2011)
  • 26.
    Copyright © 2013,KISTIMSRA Meeting (2013.1) 26 Case Studies Defense Agency for Technology and Quality (2011~2012)
  • 27.
    Copyright © 2013,KISTIMSRA Meeting (2013.1) 27 ISTIC, China For national digital library based on analytics Case Studies
  • 28.
    Copyright © 2013,KISTIMSRA Meeting (2013.1) InSciTe Architecture Analytics Models ETD Model Emerging Technology Discovery Model TLCD Model Technology Life Cycle Discovery Model TLC Model Technology Life Cycle Model OntoRelFinder® Relationship Path Finder OntoReasoner® Reasoning Engine OntoURI® Semantic Knowledge Manager OntoPipeliner® Semantic Service Composer SS&AE Semantic Search & Analytics Engine OntoURIResolver® Identity Resolver SINDI-CORE/LINK Entity & Relationship Extractor TUC Model Terminology Use Cycle Model Ontology Linked Data OntoFrame OntoVerifier® Reasoning Verifier Web Data Crawler RSS/Google API Web Data Literatures
  • 29.
    Copyright © 2013,KISTIMSRA Meeting (2013.1) InSciTe Project Goal & Tasks (2013) Development of S&T Literature Big Data Analytics/Application Platform Big Data mining technology Semantic analytics technology Big Data relationship analytics/application technology Technologies Text mining Multimedia mining Semantic integration Reasoning and graph analysis Modeling and assess for relationship analytics and application
  • 30.
    Copyright © 2013,KISTIMSRA Meeting (2013.1) InSciTe Project Partners (2013) OVUM, UK Building analytics model Understanding business needs Planning InSciTe service MSRA, China TBD GESIS & Hildesheim Univ., Germany Analyzing patent trends Assessing InSciTe service platform …
  • 31.
    Copyright © 2013,KISTIMSRA Meeting (2013.1) 31 Homepage http://semantics.kisti.re.kr
  • 32.
    Copyright © 2013,KISTIMSRA Meeting (2013.1) 3232 Thank you jhm@kisti.re.kr “A lot of times, people don’t know what they want until you show it to them.” by Steve Jobs “Many people won’t be convinced until they’ve seen it for themselves.” by Jakob Nielsen