The Future is All Mine
Text and Data Mining
Projects in Europe
@openminted_eu @futuretdm
Funded by:
Projects funded by
@openminted_eu
@futuretdm
Text and data mining is
the future
“Text and data mining (TDM) is the
process of deriving information from
machine-read material. It works by
copying large quantities of material,
extracting the data, and recombining it
to identify patterns.”
JISC
Text and data mining
helps us understand the
past
Mining historical
books:
the evolution of
language
Source: http://www.sciencemag.org/content/331/6014/176 (Baylor College of Medicine, Houston)
Text and data mining
predicts the future
Mining newspapers:
Predict revolutions
Source: http://journals.uic.edu/ojs/index.php/fm/article/view/3663/3040 (University of Illinois)
Text and data mining
saves the future
Mining scientific
publications about
diseases:
Save lives
Source: http://dl.acm.org/citation.cfm?id=2623667 (Baylor College of Medicine, Houston)
Text mining – it seems so easy:
STAGE 1: Information Retrieval
STAGE 2: Information Extraction (Linguistic Analysis: Entity Recognition)
STAGE 3: Data Mining
STAGE 4: Knowledge Discovery
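The four stages named on this slide can be sketched as a toy pipeline. This is only an illustration under stated assumptions: the corpus, the entity dictionary, and all function names are hypothetical, and a simple dictionary lookup stands in for real linguistic analysis and entity recognition.

```python
import re
from collections import Counter
from itertools import combinations

# Toy corpus (hypothetical); a real project would mine thousands of documents.
DOCS = {
    "doc1": "Aspirin reduces fever. Aspirin may irritate the stomach.",
    "doc2": "Ibuprofen reduces fever and inflammation.",
    "doc3": "The committee met on Tuesday to discuss the budget.",
    "doc4": "Aspirin and Ibuprofen both treat inflammation.",
}

# Hypothetical entity dictionary; stands in for a trained entity recogniser.
ENTITIES = {"aspirin", "ibuprofen", "fever", "inflammation", "stomach"}


def retrieve(docs, query_terms):
    """Stage 1, information retrieval: keep documents mentioning any query term."""
    return {
        doc_id: text
        for doc_id, text in docs.items()
        if any(term in text.lower() for term in query_terms)
    }


def extract_entities(text):
    """Stage 2, information extraction: tokenise and look up known entities."""
    tokens = re.findall(r"[a-z]+", text.lower())
    return sorted({tok for tok in tokens if tok in ENTITIES})


def mine_cooccurrences(entity_lists):
    """Stage 3, data mining: count entity pairs found in the same document."""
    counts = Counter()
    for entities in entity_lists:
        counts.update(combinations(entities, 2))
    return counts


def discover(counts, min_support=2):
    """Stage 4, knowledge discovery: keep pairs seen in at least min_support documents."""
    return sorted(pair for pair, n in counts.items() if n >= min_support)


relevant = retrieve(DOCS, {"aspirin", "ibuprofen"})
entity_lists = [extract_entities(text) for text in relevant.values()]
patterns = discover(mine_cooccurrences(entity_lists))
print(patterns)  # [('ibuprofen', 'inflammation')]
```

Each stage only consumes the output of the previous one, so any of them could be swapped for a more capable component without touching the rest.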
But it actually poses many challenges…
Where do I find data?
How do I make my texts readable by machines?
Which mining method to use?
Current Barriers in Europe
Awareness across Institutions & Stakeholders
• Lack of awareness among research communities
• Lack of guidance to uncover TDM potential
Skills and Tools
• Availability and accessibility across disciplines
• Gap in skills across various sectors
Licensing & Open Access
• License proliferation and interoperability issues
• License barriers to transparent open access
Copyright and Data Protection
• TDM activities infringing current copyright laws
• Legal and policy limitations and barriers for TDM
EU PROJECTS on TDM
FutureTDM: Identify TDM barriers and policy solutions
OpenMinTeD: Build a TDM eInfrastructure
Main Objectives of FutureTDM
• ELABORATE a legal and policy framework for future TDM and specify a research agenda to foster the spread of TDM
• BUILD a website: a Collaborative Knowledge Base and an Open Information Hub combined
• ANALYSE current application areas and best practices in TDM
• ASSESS existing studies, legal regulations and policies on TDM
• INVOLVE all key stakeholders to identify practices, requirements, and specific challenges
• INCREASE awareness of TDM to attract new target groups and science domains
This project has received funding from the European Union’s Horizon 2020
Research and Innovation Programme under Grant Agreement No 665940.
FutureTDM
Bottom-up approach: stakeholder workshops and knowledge cafes throughout Europe
Objective of OpenMinTeD
[Figure: three-layer interoperability architecture]
Layer 1: Interoperability of text mining services (platforms or components)
• Mining platforms; proprietary architectures
• Platform services: Registry, Workflow Management, Auth2 & Policy management, Annotator, Accounting
Layer 2: Interoperability of language resources & corpora
• Language resources and corpora registry service
• Language resources
• Text corpora: Publisher, OpenAIRE/CORE, PMC, and other types of text corpora
Layer 3: Interoperability to shared storage and computing resources
• Data centres; data centre in public cloud
OpenMinTeD brings together:
• ACCESSIBLE CONTENT: Via standardised programmatic interfaces and access rules
• DISCOVERABLE SERVICES: Easily discoverable text mining services and workflows which process, analyse and annotate text
• EFFICIENT PROCESSING: Operate on public e-Infrastructures via standardised APIs
• TDM COMMUNITIES: Different scientific communities have different challenges
• VALUE ADDED APPS: Community-driven applications to illustrate the value of the infrastructure. Engage with industry.
OPENMINTED = The Open Mining Infrastructure for Text and Data
Become involved
Follow us on Twitter for the latest updates and blogs
@openminted_eu
@futuretdm
Follow our websites
www.openminted.eu
www.futuretdm.eu
THANK YOU
PARTNERS OPENMINTED
• Athena RIC
• Univ. of Manchester (NaCTeM)
• Univ. of Darmstadt
• INRA
• EMBL-EBI
• Agro-Know
• LIBER
• Univ. of Amsterdam
• Open University UK
• EPFL
• CNIO
• Univ. of Sheffield (GATE)
• GESIS
• GRNET
• Frontiers
• Univ. of Stirling
PARTNERS FUTURETDM
• SYNYO GmbH (SYNYO)
• LIBER Europe
• Open Knowledge Foundation LBG (OK/CM)
• Radboud Univ. Nijmegen
• The British Library Board
• Univ. of Amsterdam
• Athena RIC
• Ubiquity Press
• Fundacja Projekt: Polska (FPP)

Influence of Marketing Strategy and Market Competition on Business Plan
 
Intelligence supported media monitoring in veterinary medicine
Intelligence supported media monitoring in veterinary medicineIntelligence supported media monitoring in veterinary medicine
Intelligence supported media monitoring in veterinary medicine
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
 
University of New South Wales degree offer diploma Transcript
University of New South Wales degree offer diploma TranscriptUniversity of New South Wales degree offer diploma Transcript
University of New South Wales degree offer diploma Transcript
 
My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.
 
Learn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queriesLearn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queries
 
Palo Alto Cortex XDR presentation .......
Palo Alto Cortex XDR presentation .......Palo Alto Cortex XDR presentation .......
Palo Alto Cortex XDR presentation .......
 
一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理
一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理
一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理
 
Global Situational Awareness of A.I. and where its headed
Global Situational Awareness of A.I. and where its headedGlobal Situational Awareness of A.I. and where its headed
Global Situational Awareness of A.I. and where its headed
 
The Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series DatabaseThe Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series Database
 
Udemy_2024_Global_Learning_Skills_Trends_Report (1).pdf
Udemy_2024_Global_Learning_Skills_Trends_Report (1).pdfUdemy_2024_Global_Learning_Skills_Trends_Report (1).pdf
Udemy_2024_Global_Learning_Skills_Trends_Report (1).pdf
 
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
 
Experts live - Improving user adoption with AI
Experts live - Improving user adoption with AIExperts live - Improving user adoption with AI
Experts live - Improving user adoption with AI
 
End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024
 
The Ipsos - AI - Monitor 2024 Report.pdf
The  Ipsos - AI - Monitor 2024 Report.pdfThe  Ipsos - AI - Monitor 2024 Report.pdf
The Ipsos - AI - Monitor 2024 Report.pdf
 
一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理
一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理
一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理
 
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
 
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
 
一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理
一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理
一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理
 
Population Growth in Bataan: The effects of population growth around rural pl...
Population Growth in Bataan: The effects of population growth around rural pl...Population Growth in Bataan: The effects of population growth around rural pl...
Population Growth in Bataan: The effects of population growth around rural pl...
 

The Future is All Mine

  • 1. The Future is All Mine: Text and Data Mining Projects in Europe. @openminted_eu @futuretdm
  • 3. Text and data mining is the future. “Text and data mining (TDM) is the process of deriving information from machine-read material. It works by copying large quantities of material, extracting the data, and recombining it to identify patterns.” (JISC)
  • 4. Text and data mining helps us understand the past. Mining historical books: the evolution of language. Source: http://www.sciencemag.org/content/331/6014/176 (Baylor College of Medicine, Houston)
  • 5. Text and data mining predicts the future. Mining newspapers: predicting revolutions. Source: http://journals.uic.edu/ojs/index.php/fm/article/view/3663/3040 (University of Illinois)
  • 6. Text and data mining saves the future. Mining scientific publications about diseases: saving lives. Source: http://dl.acm.org/citation.cfm?id=2623667 (Baylor College of Medicine, Houston)
  • 7. Text mining – it seems so easy: a four-stage pipeline (STAGE 1 to STAGE 4) labelled Information Retrieval, Linguistic Analysis, Entity Recognition, Information Extraction, Data Mining, and Knowledge Discovery.
  • 8. But it actually poses many challenges… (STAGE 1 to STAGE 4) Where do I find data? How do I make my texts readable by machines? Which mining method to use?
  • 9. Current Barriers in Europe. Awareness across Institutions & Stakeholders: lack of awareness among research communities; lack of guidance to uncover TDM potential. Skills and Tools: availability and accessibility across disciplines; gap in skills across various sectors. Licensing & Open Access: license proliferation and interoperability issues; license barriers to transparent open access. Copyright and Data Protection: TDM activities infringing current copyright laws; legal and policy limitations and barriers for TDM.
  • 10. EU PROJECTS on TDM. FutureTDM: identify TDM barriers and policy solutions. OpenMinTeD: build a TDM eInfrastructure.
  • 11. Main Objectives of FutureTDM. ELABORATE a legal and policy framework for future TDM and specify a research agenda to foster the spread of TDM. BUILD a website: a Collaborative Knowledge Base and an Open Information Hub combined. ANALYSE current application areas and best practices in TDM. ASSESS existing studies, legal regulations and policies on TDM. INVOLVE all key stakeholders to identify practices, requirements, and specific challenges. INCREASE awareness of TDM to attract new target groups and science domains. This project has received funding from the European Union’s Horizon 2020 Research and Innovation Programme under Grant Agreement No 665940.
  • 12. FutureTDM bottom-up approach: stakeholder workshops and knowledge cafes throughout Europe.
  • 13. Objective of OpenMinTeD (architecture diagram). Layer 1: Interoperability of text mining services (platforms or components). Layer 2: Interoperability of language resources & corpora. Layer 3: Interoperability to shared storage and computing resources. Diagram elements: data centres (including in public cloud); text corpora (publisher, OpenAIRE/CORE, PMC, and other types of text corpora); language resources; language resources and corpora registry service; mining platforms; proprietary architectures; platform services (Registry, Workflow Management, Auth2 & Policy management, Annotator, Accounting).
  • 14. OpenMinTeD brings together: ACCESSIBLE CONTENT: via standardised programmatic interfaces and access rules. DISCOVERABLE SERVICES: easily discoverable text mining services and workflows which process, analyse and annotate text. EFFICIENT PROCESSING: operate on public e-Infrastructures via standardised APIs. TDM COMMUNITIES: different scientific communities have different challenges. VALUE ADDED APPS: community-driven applications to illustrate the value of the infrastructure; engage with industry. OPENMINTED = The Open Mining Infrastructure for Text and Data
  • 15. Become involved. Follow us on Twitter for the latest updates and blogs: @openminted_eu, @futuretdm. Follow our websites: www.openminted.eu, www.futuretdm.eu
  • 16. THANK YOU. PARTNERS OPENMINTED: Athena RIC; Univ. of Manchester (NaCTeM); Univ. of Darmstadt; INRA; EMBL-EBI; Agro-Know; LIBER; Univ. of Amsterdam; Open University UK; EPFL; CNIO; Univ. of Sheffield (GATE); GESIS; GRNET; Frontiers; Univ. of Stirling. PARTNERS FUTURETDM: SYNYO GmbH (SYNYO); LIBER Europe; Open Knowledge Foundation LBG (OK/CM); Radboud Univ. Nijmegen; The British Library Board; Univ. of Amsterdam; Athena RIC; Ubiquity Press; Fundacja Projekt: Polska (FPP).