FutureTDM Symposium_DEMOS

FutureTDM
FutureTDMFutureTDM
OpenDataMonitor
Horizon 2020
Coordination and Support Action
GARRI-3-2014 Scientific Information in the Digital Age: Text and Data Mining (TDM)
Project number: 665940
Flash presentations/ Demos
FutureTDM
Reducing Barriers and Increasing Uptake of Text and Data Mining for Research Environments
using a Collaborative Knowledge and Open Information Approach
FutureTDM Symposium, Salzburg
June 13th, 2017
Flash Presentations / Demos
Presenters
▪ Stefan Kasberger (ContentMine)
▪ Donat Agosti (PLAZI)
▪ Petr Knoth (CORE)
▪ Ralf Klinkenberg (RapidMiner)
▪ Maria Gavriilidou (clarin:el)
▪ Alessio Palmero Aprosio (ALCIDE)
2FutureTDM
Flash Presentations / Demos
Stefan Kasberger
(ContentMine)
3FutureTDM
TDM Use-Case Tutorial
Demoing the three text data mining tutorials from ContentMine
▪ TDM for Pandemics with Zika
▪ Systematic Literature Review
▪ P-Cracking: finding statistical measures
4FutureTDM
Flash Presentations / Demos
Donat Agosti
(PLAZI)
5FutureTDM
Plazi
Liberating and disseminating biodiversity data from scientific publications
▪ Issue:
▪ continually growing corpus of 500 Million pages of scientific literature covering the description of the world’s
living diversity. E.g. > 17,000 new species description published every year
▪ Only incomplete data on publications, even less included facts
▪ Challenge: Provide real time, and promote access to ongoing as well as legacy publications
▪ Solutions:
▪ 1. Provide and maintain a TDM workflow to find articles, extract and disseminate facts (Plazi workflow)
▪ 2. Promote journal production workflows to create semantically enhanced publications upfront (e.g.
TaxPub/JATS based Pensoft workflow)
6FutureTDM
Treatment
Bank
Data mine, text
extraction
& markup
store &
access
Biodiversity
Literature
Repository
Persistent, resolvable identifers minted for:
• Articles: DOI (if no DOI exists)
• Treatments: httpURI
• Illustrations: DOI
7
6 M RDF triples
110,000 illustrations
850,000 bibliographic references
196,311 taxonomic treatments
20,202 articles
74 journals (2016)
PLAZI TDM RESULTS AND DISSEMINATION
Source (accessed June 11, 2017):
http://plazi.org
http://biolitrepo.org
Flash Presentations / Demos
Petr Knoth
(CORE)
8FutureTDM
OpenDataMonitor
Horizon 2020
Coordination and Support Action
GARRI-3-2014 Scientific Information in the Digital Age: Text and Data Mining (TDM)
Project number: 665940
Flash presentations/ Demos
CORE - Bringing science to all …
FutureTDM Symposium, Salzburg
June 13th, 2017
CORE – Millions of research papers ready to text mine
10FutureTDM
CORE – Millions of research papers ready to text mine
11FutureTDM
Flash Presentations / Demos
Ralf Klinkenberg
(RapidMiner)
12FutureTDM
RapidMiner – Unified Open Source Data Science Platform
13FutureTDM
DATA MASHUP
ENGINE
MODERN, AGILE ENTERPRISE PLATFORM
Ingestion
Blending
Cleansing
Best Practice
Recommendations
Unified Workflows Intelligent Utilization
In Hadoop In-Memory on Desktop or Server In Database Web Services Process Scheduler Web Apps
PRESCRIPTIVE
DECISION ENGINE
Diagnostic
Relationships
Predictive Insights
Prescriptive Actions
Business
Processes &
Applications
OPERATIONA-
LIZATION ENGINE
High-Velocity
Scoring
Honest Validation
Process Integration
Automation Services
WISDOM OF CROWDS ADVISOR
EFFORTLESS WORKFLOW DESIGNER
FEDERATED ANALYTICS DRIVER
Marketplace Innovations & Extensions
Any
Data Source
Data at Rest and
Data in Motion
RapidMiner – Open Source Data Science Platform
14FutureTDM
Lightning Fast: Visual interface
for rapidly building complete
analytic workflows
Powerful: Rich library of
algorithms and functions to
build the strongest possible
model for any use case
Open & Extensible: Open source
innovation keeps pace with
changing business needs
Unified Platform: Seamlessly integrates structured and
unstructured data from all types of sources as well as machine
learning algorithms by RapidMiner, R, Python, H2O, Hadoop,
Spark, PySpark, SparkR, SparkRM, etc. in a single visual
platform and allows easy deployment on-Server, in-Hadoop,
in- Cloud, as web services, in web apps, via Java API, etc.
Flash Presentations / Demos
Maria Gavriilidou
(clarin:el)
15FutureTDM
CLARIN and CLARIN:EL
CLARIN integrates
▪ Language Datasets: digital content of any medium (text, sound, image,
video), raw and annotated, lexica, ontologies, grammars etc.
▪ Language Technology tools: lemmatizers, taggers, term extractors,
sentiment annotators, summarizers, etc.
in a federation of trusted repositories
• available to researchers
▪ through national networks of organizations in each country
(today: 21 member-countries, 42 certified centers)
CLARIN:EL (www.clarin.gr)
• the Greek Language Resources, Tools/ Services Infrastructure
• for documenting, sharing and processing language data
16FutureTDM
more information
www.clarin.gr
http://inventory.clarin.gr
info@clarin.gr
clarin.gr
@CLARIN_el
https://www.linkedin.com/grp/home?gid=8
309819
17
FutureTDM
Flash Presentations / Demos
Alessio Palmero Aprosio
(ALCIDE)
18FutureTDM
ALCIDE
Online platform to perform temporal, geographical, and linguistic analysis of
historical documents.
▪ Extract information
▪ State of the art Human Language Technologies
▪ Tint (for Italian)
▪ Stanford CoreNLP (for English)
▪ Visualise data
▪ Intuitive and understandable data representation
19FutureTDM
Flash Presentations / Demos
Find out more!
Join us in the Demo Session
Thank you!
20FutureTDM
1 of 20

Recommended

FutureTDM Roadmap by
FutureTDM RoadmapFutureTDM Roadmap
FutureTDM RoadmapFutureTDM
371 views12 slides
Technologies and infrastructures supporting text and data analytics: Challeng... by
Technologies and infrastructures supporting text and data analytics: Challeng...Technologies and infrastructures supporting text and data analytics: Challeng...
Technologies and infrastructures supporting text and data analytics: Challeng...FutureTDM
325 views8 slides
The legal factors by
The legal factorsThe legal factors
The legal factorsFutureTDM
436 views4 slides
What have we learned from talking with the TDM community? by
What have we learned from talking with the TDM community?What have we learned from talking with the TDM community?
What have we learned from talking with the TDM community?FutureTDM
432 views12 slides
Text Mining: the next data frontier. Beyond Open Access by
Text Mining: the next data frontier. Beyond Open AccessText Mining: the next data frontier. Beyond Open Access
Text Mining: the next data frontier. Beyond Open Accessopenminted_eu
417 views27 slides
The Breakdown: What is OpenMinTeD? by
The Breakdown: What is OpenMinTeD?The Breakdown: What is OpenMinTeD?
The Breakdown: What is OpenMinTeD?openminted_eu
467 views22 slides

More Related Content

What's hot

Open Science policy: EC, ERC, Belspo, FWO by
Open Science policy: EC, ERC, Belspo, FWOOpen Science policy: EC, ERC, Belspo, FWO
Open Science policy: EC, ERC, Belspo, FWOOpenAccessBelgium
661 views21 slides
OSFair2017 Workshop | Towards a Policy Framework for the European Open Scienc... by
OSFair2017 Workshop | Towards a Policy Framework for the European Open Scienc...OSFair2017 Workshop | Towards a Policy Framework for the European Open Scienc...
OSFair2017 Workshop | Towards a Policy Framework for the European Open Scienc...Open Science Fair
187 views21 slides
Big Data Europe SC6 WS 3: Where we are and are going for Big Data in OpenScie... by
Big Data Europe SC6 WS 3: Where we are and are going for Big Data in OpenScie...Big Data Europe SC6 WS 3: Where we are and are going for Big Data in OpenScie...
Big Data Europe SC6 WS 3: Where we are and are going for Big Data in OpenScie...BigData_Europe
519 views14 slides
Rajendra Akerkar - LeMO Project by
Rajendra Akerkar - LeMO ProjectRajendra Akerkar - LeMO Project
Rajendra Akerkar - LeMO ProjectBigData_Europe
1.1K views12 slides
OSFair2017 Workshop | Service provisioning for excellent sciences by
OSFair2017 Workshop | Service provisioning for excellent sciencesOSFair2017 Workshop | Service provisioning for excellent sciences
OSFair2017 Workshop | Service provisioning for excellent sciencesOpen Science Fair
208 views16 slides
Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A... by
Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...
Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...BigData_Europe
423 views34 slides

What's hot(20)

Open Science policy: EC, ERC, Belspo, FWO by OpenAccessBelgium
Open Science policy: EC, ERC, Belspo, FWOOpen Science policy: EC, ERC, Belspo, FWO
Open Science policy: EC, ERC, Belspo, FWO
OpenAccessBelgium661 views
OSFair2017 Workshop | Towards a Policy Framework for the European Open Scienc... by Open Science Fair
OSFair2017 Workshop | Towards a Policy Framework for the European Open Scienc...OSFair2017 Workshop | Towards a Policy Framework for the European Open Scienc...
OSFair2017 Workshop | Towards a Policy Framework for the European Open Scienc...
Open Science Fair187 views
Big Data Europe SC6 WS 3: Where we are and are going for Big Data in OpenScie... by BigData_Europe
Big Data Europe SC6 WS 3: Where we are and are going for Big Data in OpenScie...Big Data Europe SC6 WS 3: Where we are and are going for Big Data in OpenScie...
Big Data Europe SC6 WS 3: Where we are and are going for Big Data in OpenScie...
BigData_Europe519 views
Rajendra Akerkar - LeMO Project by BigData_Europe
Rajendra Akerkar - LeMO ProjectRajendra Akerkar - LeMO Project
Rajendra Akerkar - LeMO Project
BigData_Europe1.1K views
OSFair2017 Workshop | Service provisioning for excellent sciences by Open Science Fair
OSFair2017 Workshop | Service provisioning for excellent sciencesOSFair2017 Workshop | Service provisioning for excellent sciences
OSFair2017 Workshop | Service provisioning for excellent sciences
Open Science Fair208 views
Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A... by BigData_Europe
Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...
Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...
BigData_Europe423 views
Library Science Talk: Tensions between copyright and knowledge discovery by LIBER Europe
Library Science Talk: Tensions between copyright and knowledge discoveryLibrary Science Talk: Tensions between copyright and knowledge discovery
Library Science Talk: Tensions between copyright and knowledge discovery
LIBER Europe909 views
Vassilios Peristeras: From Open to Linked Government Data: (European Commissi... by FIA2010
Vassilios Peristeras: From Open to Linked Government Data: (European Commissi...Vassilios Peristeras: From Open to Linked Government Data: (European Commissi...
Vassilios Peristeras: From Open to Linked Government Data: (European Commissi...
FIA2010780 views
OpenAIREplus NOADs by OpenAIRE
OpenAIREplus NOADsOpenAIREplus NOADs
OpenAIREplus NOADs
OpenAIRE789 views
The Open Science Agenda in Europe: Policy convergence & diversity of approaches by LIBER Europe
The Open Science Agenda in Europe: Policy convergence & diversity of approachesThe Open Science Agenda in Europe: Policy convergence & diversity of approaches
The Open Science Agenda in Europe: Policy convergence & diversity of approaches
LIBER Europe835 views
European open science cloud by Jisc
European open science cloudEuropean open science cloud
European open science cloud
Jisc309 views
OpenAIRE implementing open science by Jisc
OpenAIRE implementing open scienceOpenAIRE implementing open science
OpenAIRE implementing open science
Jisc289 views
The META-NET Strategic Research Agenda and Linked Open Data by Georg Rehm
The META-NET Strategic Research Agenda and Linked Open DataThe META-NET Strategic Research Agenda and Linked Open Data
The META-NET Strategic Research Agenda and Linked Open Data
Georg Rehm616 views
Open Science in Europe - policies and infrastructures: a user journey in Open... by Pedro Príncipe
Open Science in Europe - policies and infrastructures: a user journey in Open...Open Science in Europe - policies and infrastructures: a user journey in Open...
Open Science in Europe - policies and infrastructures: a user journey in Open...
Pedro Príncipe97 views
Legal Interoperability of Research Data: Principles and Implementation Guidel... by OpenAIRE
Legal Interoperability of Research Data: Principles and Implementation Guidel...Legal Interoperability of Research Data: Principles and Implementation Guidel...
Legal Interoperability of Research Data: Principles and Implementation Guidel...
OpenAIRE945 views
Information system support in construction industry with semantic web techno... by Pieter Pauwels
Information system support in construction industry with semantic web techno...Information system support in construction industry with semantic web techno...
Information system support in construction industry with semantic web techno...
Pieter Pauwels733 views
OpenAIRE – The path from OpenAIRE to EOSC in Belgium by OpenAccessBelgium
OpenAIRE – The path from OpenAIRE to EOSC in BelgiumOpenAIRE – The path from OpenAIRE to EOSC in Belgium
OpenAIRE – The path from OpenAIRE to EOSC in Belgium
OpenAccessBelgium107 views

Similar to FutureTDM Symposium_DEMOS

Tds — big science dec 2021 by
Tds — big science dec 2021Tds — big science dec 2021
Tds — big science dec 2021Gérard Dupont
38 views19 slides
Libraries in the Big Data Era: Strategies and Challenges in Archiving and Sha... by
Libraries in the Big Data Era: Strategies and Challenges in Archiving and Sha...Libraries in the Big Data Era: Strategies and Challenges in Archiving and Sha...
Libraries in the Big Data Era: Strategies and Challenges in Archiving and Sha...Peter Löwe
664 views42 slides
New trends in ontological engineering, practices and tools by
New trends in ontological engineering, practices and toolsNew trends in ontological engineering, practices and tools
New trends in ontological engineering, practices and toolsMaría Poveda Villalón
454 views39 slides
Sem tech in CH, Linked Data Meetup, 2014-08-21, Malmo, Sweden by
Sem tech in CH, Linked Data Meetup, 2014-08-21, Malmo, SwedenSem tech in CH, Linked Data Meetup, 2014-08-21, Malmo, Sweden
Sem tech in CH, Linked Data Meetup, 2014-08-21, Malmo, SwedenVladimir Alexiev, PhD, PMP
1.3K views68 slides
SESAM4 - A guide to semantics in the Linked Open Data cloud, Robert HP Engels... by
SESAM4 - A guide to semantics in the Linked Open Data cloud, Robert HP Engels...SESAM4 - A guide to semantics in the Linked Open Data cloud, Robert HP Engels...
SESAM4 - A guide to semantics in the Linked Open Data cloud, Robert HP Engels...The Research Council of Norway, IKTPLUSS
454 views30 slides
Your Content hides a treasure (and you might have not found it) - ForgetIT Pr... by
Your Content hides a treasure (and you might have not found it) - ForgetIT Pr...Your Content hides a treasure (and you might have not found it) - ForgetIT Pr...
Your Content hides a treasure (and you might have not found it) - ForgetIT Pr...Olivier Dobberkau
1.1K views74 slides

Similar to FutureTDM Symposium_DEMOS(20)

Libraries in the Big Data Era: Strategies and Challenges in Archiving and Sha... by Peter Löwe
Libraries in the Big Data Era: Strategies and Challenges in Archiving and Sha...Libraries in the Big Data Era: Strategies and Challenges in Archiving and Sha...
Libraries in the Big Data Era: Strategies and Challenges in Archiving and Sha...
Peter Löwe664 views
Your Content hides a treasure (and you might have not found it) - ForgetIT Pr... by Olivier Dobberkau
Your Content hides a treasure (and you might have not found it) - ForgetIT Pr...Your Content hides a treasure (and you might have not found it) - ForgetIT Pr...
Your Content hides a treasure (and you might have not found it) - ForgetIT Pr...
Olivier Dobberkau1.1K views
Virtual Research Environments as-a-serive by Blue BRIDGE
Virtual Research Environments as-a-seriveVirtual Research Environments as-a-serive
Virtual Research Environments as-a-serive
Blue BRIDGE203 views
Language Resources for Multilingual Europe by Georg Rehm
Language Resources for Multilingual EuropeLanguage Resources for Multilingual Europe
Language Resources for Multilingual Europe
Georg Rehm1.6K views
Linked Data for Biopharma by Tom Plasterer
Linked Data for BiopharmaLinked Data for Biopharma
Linked Data for Biopharma
Tom Plasterer3.4K views
OpenMinTeD, LIBER conference 2017 by openminted_eu
OpenMinTeD, LIBER conference 2017OpenMinTeD, LIBER conference 2017
OpenMinTeD, LIBER conference 2017
openminted_eu405 views
NordForsk Open Access Reykjavik 14-15/8-2014:Rda by NordForsk
NordForsk Open Access Reykjavik 14-15/8-2014:RdaNordForsk Open Access Reykjavik 14-15/8-2014:Rda
NordForsk Open Access Reykjavik 14-15/8-2014:Rda
NordForsk281 views
Lider Reference Model ld4lt session March, 3rd, 2015 by Sebastian Hellmann
Lider Reference Model ld4lt session  March, 3rd, 2015Lider Reference Model ld4lt session  March, 3rd, 2015
Lider Reference Model ld4lt session March, 3rd, 2015
Sebastian Hellmann1.2K views
An introduction to ViBRANT: Virtual Biodiversity Research and Access Network ... by Vince Smith
An introduction to ViBRANT: Virtual Biodiversity Research and Access Network ...An introduction to ViBRANT: Virtual Biodiversity Research and Access Network ...
An introduction to ViBRANT: Virtual Biodiversity Research and Access Network ...
Vince Smith414 views
IC-SDV 2018 The International Conference on Search, Data and Text Mining and ... by Dr. Haxel Consult
IC-SDV 2018 The International Conference on Search, Data and Text Mining and ...IC-SDV 2018 The International Conference on Search, Data and Text Mining and ...
IC-SDV 2018 The International Conference on Search, Data and Text Mining and ...
Dr. Haxel Consult300 views
How to Create the Google for Earth Data (XLDB 2015, Stanford) by Rainer Sternfeld
How to Create the Google for Earth Data (XLDB 2015, Stanford)How to Create the Google for Earth Data (XLDB 2015, Stanford)
How to Create the Google for Earth Data (XLDB 2015, Stanford)
Rainer Sternfeld1.5K views
Toward FAIR Semantic Resources by EUDAT
Toward FAIR Semantic ResourcesToward FAIR Semantic Resources
Toward FAIR Semantic Resources
EUDAT51 views
The International Information Conference on Search, Data Mining and Visualiza... by Dr. Haxel Consult
The International Information Conference on Search, Data Mining and Visualiza...The International Information Conference on Search, Data Mining and Visualiza...
The International Information Conference on Search, Data Mining and Visualiza...
Dr. Haxel Consult410 views

More from FutureTDM

FutureTDM Symposium: Skills & Education by
FutureTDM Symposium: Skills & EducationFutureTDM Symposium: Skills & Education
FutureTDM Symposium: Skills & EducationFutureTDM
288 views21 slides
Data Analytics and the Legal Landscape: Intellectual Property and Data Protec... by
Data Analytics and the Legal Landscape: Intellectual Property and Data Protec...Data Analytics and the Legal Landscape: Intellectual Property and Data Protec...
Data Analytics and the Legal Landscape: Intellectual Property and Data Protec...FutureTDM
381 views14 slides
The economic potential of data analytics by
The economic potential of data analyticsThe economic potential of data analytics
The economic potential of data analyticsFutureTDM
430 views16 slides
Introduction to the FutureTDM project by
Introduction to the FutureTDM projectIntroduction to the FutureTDM project
Introduction to the FutureTDM projectFutureTDM
964 views11 slides
FutureTDM Workshop II 29 March by
FutureTDM Workshop II 29 MarchFutureTDM Workshop II 29 March
FutureTDM Workshop II 29 MarchFutureTDM
536 views46 slides
Text and data mining - the opportunities and the EU conundrum - why aren’t we... by
Text and data mining - the opportunities and the EU conundrum - why aren’t we...Text and data mining - the opportunities and the EU conundrum - why aren’t we...
Text and data mining - the opportunities and the EU conundrum - why aren’t we...FutureTDM
347 views16 slides

More from FutureTDM(7)

FutureTDM Symposium: Skills & Education by FutureTDM
FutureTDM Symposium: Skills & EducationFutureTDM Symposium: Skills & Education
FutureTDM Symposium: Skills & Education
FutureTDM288 views
Data Analytics and the Legal Landscape: Intellectual Property and Data Protec... by FutureTDM
Data Analytics and the Legal Landscape: Intellectual Property and Data Protec...Data Analytics and the Legal Landscape: Intellectual Property and Data Protec...
Data Analytics and the Legal Landscape: Intellectual Property and Data Protec...
FutureTDM381 views
The economic potential of data analytics by FutureTDM
The economic potential of data analyticsThe economic potential of data analytics
The economic potential of data analytics
FutureTDM430 views
Introduction to the FutureTDM project by FutureTDM
Introduction to the FutureTDM projectIntroduction to the FutureTDM project
Introduction to the FutureTDM project
FutureTDM964 views
FutureTDM Workshop II 29 March by FutureTDM
FutureTDM Workshop II 29 MarchFutureTDM Workshop II 29 March
FutureTDM Workshop II 29 March
FutureTDM536 views
Text and data mining - the opportunities and the EU conundrum - why aren’t we... by FutureTDM
Text and data mining - the opportunities and the EU conundrum - why aren’t we...Text and data mining - the opportunities and the EU conundrum - why aren’t we...
Text and data mining - the opportunities and the EU conundrum - why aren’t we...
FutureTDM347 views
OpenMinteD Project - building a TDM infrastructure by FutureTDM
OpenMinteD Project - building a TDM infrastructureOpenMinteD Project - building a TDM infrastructure
OpenMinteD Project - building a TDM infrastructure
FutureTDM242 views

Recently uploaded

CRM stick or twist workshop by
CRM stick or twist workshopCRM stick or twist workshop
CRM stick or twist workshopinfo828217
8 views16 slides
CRIJ4385_Death Penalty_F23.pptx by
CRIJ4385_Death Penalty_F23.pptxCRIJ4385_Death Penalty_F23.pptx
CRIJ4385_Death Penalty_F23.pptxyvettemm100
6 views24 slides
TGP 2.docx by
TGP 2.docxTGP 2.docx
TGP 2.docxsandi636490
10 views8 slides
VoxelNet by
VoxelNetVoxelNet
VoxelNettaeseon ryu
6 views21 slides
SUPER STORE SQL PROJECT.pptx by
SUPER STORE SQL PROJECT.pptxSUPER STORE SQL PROJECT.pptx
SUPER STORE SQL PROJECT.pptxkhan888620
12 views16 slides
[DSC Europe 23] Zsolt Feleki - Machine Translation should we trust it.pptx by
[DSC Europe 23] Zsolt Feleki - Machine Translation should we trust it.pptx[DSC Europe 23] Zsolt Feleki - Machine Translation should we trust it.pptx
[DSC Europe 23] Zsolt Feleki - Machine Translation should we trust it.pptxDataScienceConferenc1
5 views12 slides

Recently uploaded(20)

CRM stick or twist workshop by info828217
CRM stick or twist workshopCRM stick or twist workshop
CRM stick or twist workshop
info8282178 views
CRIJ4385_Death Penalty_F23.pptx by yvettemm100
CRIJ4385_Death Penalty_F23.pptxCRIJ4385_Death Penalty_F23.pptx
CRIJ4385_Death Penalty_F23.pptx
yvettemm1006 views
SUPER STORE SQL PROJECT.pptx by khan888620
SUPER STORE SQL PROJECT.pptxSUPER STORE SQL PROJECT.pptx
SUPER STORE SQL PROJECT.pptx
khan88862012 views
[DSC Europe 23] Zsolt Feleki - Machine Translation should we trust it.pptx by DataScienceConferenc1
[DSC Europe 23] Zsolt Feleki - Machine Translation should we trust it.pptx[DSC Europe 23] Zsolt Feleki - Machine Translation should we trust it.pptx
[DSC Europe 23] Zsolt Feleki - Machine Translation should we trust it.pptx
[DSC Europe 23] Stefan Mrsic_Goran Savic - Evolving Technology Excellence.pptx by DataScienceConferenc1
[DSC Europe 23] Stefan Mrsic_Goran Savic - Evolving Technology Excellence.pptx[DSC Europe 23] Stefan Mrsic_Goran Savic - Evolving Technology Excellence.pptx
[DSC Europe 23] Stefan Mrsic_Goran Savic - Evolving Technology Excellence.pptx
Cross-network in Google Analytics 4.pdf by GA4 Tutorials
Cross-network in Google Analytics 4.pdfCross-network in Google Analytics 4.pdf
Cross-network in Google Analytics 4.pdf
GA4 Tutorials6 views
CRM stick or twist.pptx by info828217
CRM stick or twist.pptxCRM stick or twist.pptx
CRM stick or twist.pptx
info82821710 views
Chapter 3b- Process Communication (1) (1)(1) (1).pptx by ayeshabaig2004
Chapter 3b- Process Communication (1) (1)(1) (1).pptxChapter 3b- Process Communication (1) (1)(1) (1).pptx
Chapter 3b- Process Communication (1) (1)(1) (1).pptx
ayeshabaig20045 views
UNEP FI CRS Climate Risk Results.pptx by pekka28
UNEP FI CRS Climate Risk Results.pptxUNEP FI CRS Climate Risk Results.pptx
UNEP FI CRS Climate Risk Results.pptx
pekka2811 views
[DSC Europe 23] Spela Poklukar & Tea Brasanac - Retrieval Augmented Generation by DataScienceConferenc1
[DSC Europe 23] Spela Poklukar & Tea Brasanac - Retrieval Augmented Generation[DSC Europe 23] Spela Poklukar & Tea Brasanac - Retrieval Augmented Generation
[DSC Europe 23] Spela Poklukar & Tea Brasanac - Retrieval Augmented Generation
Ukraine Infographic_22NOV2023_v2.pdf by AnastosiyaGurin
Ukraine Infographic_22NOV2023_v2.pdfUkraine Infographic_22NOV2023_v2.pdf
Ukraine Infographic_22NOV2023_v2.pdf
AnastosiyaGurin1.4K views
3196 The Case of The East River by ErickANDRADE90
3196 The Case of The East River3196 The Case of The East River
3196 The Case of The East River
ErickANDRADE9012 views
[DSC Europe 23] Milos Grubjesic Empowering Business with Pepsico s Advanced M... by DataScienceConferenc1
[DSC Europe 23] Milos Grubjesic Empowering Business with Pepsico s Advanced M...[DSC Europe 23] Milos Grubjesic Empowering Business with Pepsico s Advanced M...
[DSC Europe 23] Milos Grubjesic Empowering Business with Pepsico s Advanced M...

FutureTDM Symposium_DEMOS

  • 1. OpenDataMonitor Horizon 2020 Coordination and Support Action GARRI-3-2014 Scientific Information in the Digital Age: Text and Data Mining (TDM) Project number: 665940 Flash presentations/ Demos FutureTDM Reducing Barriers and Increasing Uptake of Text and Data Mining for Research Environments using a Collaborative Knowledge and Open Information Approach FutureTDM Symposium, Salzburg June 13th, 2017
  • 2. Flash Presentations / Demos Presenters ▪ Stefan Kasberger (ContentMine) ▪ Donat Agosti (PLAZI) ▪ Petr Knoth (CORE) ▪ Ralf Klinkenberg (RapidMiner) ▪ Maria Gavriilidou (clarin:el) ▪ Alessio Palmero Aprosio (ALCIDE) 2FutureTDM
  • 3. Flash Presentations / Demos Stefan Kasberger (ContentMine) 3FutureTDM
  • 4. TDM Use-Case Tutorial Demoing the three text data mining tutorials from ContentMine ▪ TDM for Pandemics with Zika ▪ Systematic Literature Review ▪ P-Cracking: finding statistical measures 4FutureTDM
  • 5. Flash Presentations / Demos Donat Agosti (PLAZI) 5FutureTDM
  • 6. Plazi Liberating and disseminating biodiversity data from scientific publications ▪ Issue: ▪ continually growing corpus of 500 Million pages of scientific literature covering the description of the world’s living diversity. E.g. > 17,000 new species description published every year ▪ Only incomplete data on publications, even less included facts ▪ Challenge: Provide real time, and promote access to ongoing as well as legacy publications ▪ Solutions: ▪ 1. Provide and maintain a TDM workflow to find articles, extract and disseminate facts (Plazi workflow) ▪ 2. Promote journal production workflows to create semantically enhanced publications upfront (e.g. TaxPub/JATS based Pensoft workflow) 6FutureTDM Treatment Bank Data mine, text extraction & markup store & access Biodiversity Literature Repository Persistent, resolvable identifers minted for: • Articles: DOI (if no DOI exists) • Treatments: httpURI • Illustrations: DOI
  • 7. 7 6 M RDF triples 110,000 illustrations 850,000 bibliographic references 196,311 taxonomic treatments 20,202 articles 74 journals (2016) PLAZI TDM RESULTS AND DISSEMINATION Source (accessed June 11, 2017): http://plazi.org http://biolitrepo.org
  • 8. Flash Presentations / Demos Petr Knoth (CORE) 8FutureTDM
  • 9. OpenDataMonitor Horizon 2020 Coordination and Support Action GARRI-3-2014 Scientific Information in the Digital Age: Text and Data Mining (TDM) Project number: 665940 Flash presentations/ Demos CORE - Bringing science to all … FutureTDM Symposium, Salzburg June 13th, 2017
  • 10. CORE – Millions of research papers ready to text mine 10FutureTDM
  • 11. CORE – Millions of research papers ready to text mine 11FutureTDM
  • 12. Flash Presentations / Demos Ralf Klinkenberg (RapidMiner) 12FutureTDM
  • 13. RapidMiner – Unified Open Source Data Science Platform 13FutureTDM DATA MASHUP ENGINE MODERN, AGILE ENTERPRISE PLATFORM Ingestion Blending Cleansing Best Practice Recommendations Unified Workflows Intelligent Utilization In Hadoop In-Memory on Desktop or Server In Database Web Services Process Scheduler Web Apps PRESCRIPTIVE DECISION ENGINE Diagnostic Relationships Predictive Insights Prescriptive Actions Business Processes & Applications OPERATIONA- LIZATION ENGINE High-Velocity Scoring Honest Validation Process Integration Automation Services WISDOM OF CROWDS ADVISOR EFFORTLESS WORKFLOW DESIGNER FEDERATED ANALYTICS DRIVER Marketplace Innovations & Extensions Any Data Source Data at Rest and Data in Motion
  • 14. RapidMiner – Open Source Data Science Platform 14FutureTDM Lightning Fast: Visual interface for rapidly building complete analytic workflows Powerful: Rich library of algorithms and functions to build the strongest possible model for any use case Open & Extensible: Open source innovation keeps pace with changing business needs Unified Platform: Seamlessly integrates structured and unstructured data from all types of sources as well as machine learning algorithms by RapidMiner, R, Python, H2O, Hadoop, Spark, PySpark, SparkR, SparkRM, etc. in a single visual platform and allows easy deployment on-Server, in-Hadoop, in- Cloud, as web services, in web apps, via Java API, etc.
  • 15. Flash Presentations / Demos Maria Gavriilidou (clarin:el) 15FutureTDM
  • 16. CLARIN and CLARIN:EL CLARIN integrates ▪ Language Datasets: digital content of any medium (text, sound, image, video), raw and annotated, lexica, ontologies, grammars etc. ▪ Language Technology tools: lemmatizers, taggers, term extractors, sentiment annotators, summarizers, etc. in a federation of trusted repositories • available to researchers ▪ through national networks of organizations in each country (today: 21 member-countries, 42 certified centers) CLARIN:EL (www.clarin.gr) • the Greek Language Resources, Tools/ Services Infrastructure • for documenting, sharing and processing language data 16FutureTDM
  • 18. Flash Presentations / Demos Alessio Palmero Aprosio (ALCIDE) 18FutureTDM
  • 19. ALCIDE Online platform to perform temporal, geographical, and linguistic analysis of historical documents. ▪ Extract information ▪ State of the art Human Language Technologies ▪ Tint (for Italian) ▪ Stanford CoreNLP (for English) ▪ Visualise data ▪ Intuitive and understandable data representation 19FutureTDM
  • 20. Flash Presentations / Demos Find out more! Join us in the Demo Session Thank you! 20FutureTDM