Successfully reported this slideshow.
We use your LinkedIn profile and activity data to personalize ads and to show you more relevant ads. You can change your ad preferences anytime.

So where are we now? The TDM landscape

173 views

Published on

by Maria Eskevich, Centre for Language and Speech Technologies, Radboud University

Published in: Data & Analytics
  • Be the first to comment

  • Be the first to like this

So where are we now? The TDM landscape

  1. 1. OpenDataMonitor Horizon 2020 Coordination and Support Action GARRI-3-2014 Scientific Information in the Digital Age: Text and Data Mining (TDM) Project number: 665940 So where are we now? The TDM Landscape FutureTDM Reducing Barriers and Increasing Uptake of Text and Data Mining for Research Environments using a Collaborative Knowledge and Open Information Approach FTDM workshop/Brussels 27 September 2016
  2. 2. Text and Data Mining : Definition 2 • “Discovery by computer of new, previously unknown information, by automatically extracting and relating information from different (…) resources, to reveal otherwise hidden meanings” (Hearst, 1999), or • “Exploratory data analysis that leads to the discovery of heretofore unknown information, or to answers for questions for which the answer is not currently known” (Hearst, 1999). Raw data (sensor data, text, images, multimedia) Processed content (diagrams, charts, tables, references, maps, formulas, chemical structures) Text and Data Mining - Business or Competitive Intelligence; - Research Analytics; - Learning analytics or Educational Data Mining; - Predictive Analytics; Hearst, M.A., 1999. Untangling Text Data Mining. CollegePark, Maryland, Proceedings of the37thAnnual Meeting of theAssociationfor Computational Linguistics on Computational Linguistics (ACL’99), pp. 3–10.http://www.aclweb.org/anthology/P99-1001
  3. 3. TDM: Technology perspective 3 Information Retrieval and Search Artificial Intelligence Classification Clustering Regression Association Rules Statistics Information Extraction Concept Extraction Opinion Mining Summarisation Negation and Modality Detection Textual Entailment Natural Language Processing Machine Learning Machine Translation Predictive Analytics Multimedia Processing Source: M. Eskevich, A. van den Bosch, M. Caspers, L. Guibault, A. Bertone, S. Reilly, C. Munteanu, P. Leitner, S. Piperidis. Research Report on TDM Landscape in Europe. ProjectH2020GARRI-3-2014--665940 FutureTDM Deliverable D3.1, 2016. http://project.futuretdm.eu/wp-content/uploads/2016/05/D3.1-Research-Report-on-TDM-Landscape-in-Europe-FutureTDM.pdf
  4. 4. General economic structure : TDM across all economic sectors 4 Source: M. Eskevich, A. van den Bosch, M. Caspers, L. Guibault, A. Bertone, S. Reilly, C. Munteanu, P. Leitner, S. Piperidis. Research Report on TDM Landscape in Europe. ProjectH2020GARRI-3-2014--665940 FutureTDM Deliverable D3.1, 2016. http://project.futuretdm.eu/wp-content/uploads/2016/05/D3.1-Research-Report-on-TDM-Landscape-in-Europe-FutureTDM.pdf
  5. 5. TDM : business perspective 5 Enabling players :13% ICT Enablers : 19% Market places :3% Analytics: 27% Cross Infrastructures : 2% Vertical Applications: 27% Data users: 9% Source: S. Piperidis, K. Pouli, M. Gavriilidou, D. Galanis, J. Bakagianni, M. Eskevich, A. Bertone. European Landscape& of TDM Applications Report. Project H2020 GARRI-3-2014--665940 FutureTDM Deliverable D4.1, 2016. http://project.futuretdm.eu/wp-content/uploads/2016/06/FutureTDM_D4.1-European-Landscape-of-TDM-Applications-Report.pdf
  6. 6. TDM : scientific perspective 6 97.10% 0.38% 2.53% 2.90% all projects TDM specific TDM associated 51,2 B€ 75,3 M€633,8 M€ 852,4 M€ 319,9 M€ 149 M€ All FP7 & Horizon 2020 projects Primary sector Secondary sector Tertiary sector Quaternary sector Quinary sector Source: S. Piperidis, K. Pouli, M. Gavriilidou, D. Galanis, J. Bakagianni, M. Eskevich, A. Bertone. European Landscape& of TDM Applications Report. Project H2020 GARRI-3-2014--665940 FutureTDM Deliverable D4.1, 2016. http://project.futuretdm.eu/wp-content/uploads/2016/06/FutureTDM_D4.1-European-Landscape-of-TDM-Applications-Report.pdf
  7. 7. TDM in Europe: Language perspective 7 Source: http://www.meta-net.eu/whitepapers/key-results-and-cross-language-comparison
  8. 8. To conclude and to start the discussion • TDM technologies are present across all sectors of EU economy: • There is none and can‘t be one fit all solution • Shared infrastructures can help cross-polination of TDM solutions between sectors • EU case is specific due to the additional language dimension: • Bias towards English language in terms of availability of tools/data/overall focus • Multilinguality of the population requires additional costs on TDM solutions localisation 8

×