
FutureTDM Workshop II 29 March




Our FutureTDM workshop at the European Parliament focuses on the development of TDM policy. With EU copyright reform now in progress, we bring together policy makers and stakeholder groups to share FutureTDM’s findings and our first expert-driven policy recommendations, which can help increase TDM uptake in the EU.


  1. 1. OpenDataMonitor Horizon 2020 Coordination and Support Action GARRI-3-2014 Scientific Information in the Digital Age: Text and Data Mining (TDM) Project number: 665940 Increasing Uptake of Text and Data Mining in the EU FutureTDM Reducing Barriers and Increasing Uptake of Text and Data Mining for Research Environments using a Collaborative Knowledge and Open Information Approach Melanie Imming, LIBER, FutureTDM Workshop, European Parliament, Brussels 29 March 2017
  2. 2. What we are talking about today: an overview of the Workshop A reminder of what we mean by TDM An overview of the project and its aims Presentations on our most recent Reports − Economics of TDM − Policy Recommendations Later today: Session 2: What stakeholders can do to improve TDM in their sector – developing guidelines
  3. 3. What is TDM? “the discovery by computer of new, previously unknown information, by automatically extracting and relating information from different (…)resources, to reveal otherwise hidden meanings” (Hearst, 1999) ICT Availability: Over 80% EU citizens have internet access (Eurostat 2014) Computing Power: Doubles every 2 years (Moore's Law, 1965) Data: 16 trillion gigabytes of data by 2020 (236% growth)
  4. 4. Potential of TDM 4 • Addressing grand challenges such as climate change and global epidemics • Improving population health, wealth and development • Creating new jobs and employment • Exponentially increasing the speed and progress of science through new insights and greater efficiency of research • Increasing transparency of governments and their actions • Fostering innovation and collaboration and boosting the impact of open science • Creating tools for education and research • Providing new and richer cultural insights • Speeding economic and social development in all parts of the globe (The Hague Declaration on Knowledge Discovery)
  5. 5. 5 TDM is not a homogeneous, self-contained, scientific domain, but rather a diverse and complex set of methods and technologies deployed in the framework of diverse disciplines and business activities The challenge
  6. 6. FutureTDM - the opportunity The FutureTDM project seeks to improve uptake of text and data mining (TDM) in the EU by actively engaging with stakeholders such as researchers, developers, publishers and SMEs. The use of content mining is significantly lower in Europe than in some American and Asian countries. The partners in the FutureTDM consortium share the ambition behind the EC’s call to develop policy and legal frameworks to reduce the barriers of TDM uptake and with it, promote the awareness of TDM opportunities across Europe. 6
  7. 7. FutureTDM 7 ELABORATE a legal and policy framework for future TDM, define policy priorities, specify a research agenda to foster the spread of TDM in various research fields within the EU BUILD a Collaborative Knowledge Base and an Open Information Hub combined on a web-based platform including intuitive tools ANALYSE current application areas and trends in TDM including statistics and key figures, collect relevant research and industrial projects and best practices ASSESS existing studies, legal regulations and policies on TDM within the European Union Main Objectives of FutureTDM INVOLVE all key stakeholders to identify practices, requirements, and specific challenges in the field of TDM INCREASE awareness of TDM to attract new target groups and science domains
  8. 8. FutureTDM: the importance of the community. Improving uptake of text and data mining in the EU. Making the project accessible to all stakeholders. Collaboration with other projects like OpenMinTeD, which focus on technical aspects.
  9. 9. Reaching the TDM community – what we’ve been doing We met with people at events, ran our knowledge cafes and held interviews across Europe 10
  10. 10. Reaching out to the TDM community - media Perhaps you saw our videos? We encouraged people to: Blog on our website Tweet us at @futureTDM #FutureTDM 11
  11. 11. What we have produced - Expert Reports and Awareness Sheets Awareness sheets highlighting TDM projects, tools and services, organisations and interviews FutureTDM Knowledge Library • Research report on TDM landscape in Europe • Baseline report of policies, and barriers of TDM in Europe • European landscape of TDM applications report • Compendium of best practices and methodologies • FutureTDM policy framework Coming soon – practitioner guidelines and roadmap for TDM uptake 12
  12. 12. Key barriers discussed at our first workshop 13 AWARENESS TALKING POINTS REWARD & INCENTIVE LEGAL CLARITY REDUCE SKILLS GAPS ACCESS TO DATA Knowledge Discovery Standardisation & Harmonisation Risk Reduction Licensing? Privacy Protection & Ethics Useful data formats Interoperability - Tools & Resources No Hidden Barriers Discoverability User Friendly Inclusion Support Better Data Management Collaboration Across Fields Promote Innovation Commercial & Non-Commercial Interaction Who licenses the internet? Main users - Industry Skills lacking
  13. 13. Workshop 1 some feedback “Anyone with lawful access to data should be allowed to conduct TDM”. (Advocacy stakeholder) “The perception seems to be that legal barriers are very significant. This is true but other issues such as skills and infrastructure are equally if not more significant and these will take a long time to address” (Consultant) “Definitions of TDM may actually be the largest barrier in the copyright/legal discussions on making progress” (Legal Practice) “Licensing remains for us the most certain legal solution, achieving a good balance between different stakeholders” (Publisher) “We need a copyright exception for TDM for commercial and non-commercial entities” [Science and Innovation Researcher]
  14. 14. Now we want to help you The purpose of the workshop today is to move on from the barriers and talk solutions. We want positivity, best practice and recommendations to help the EU maximise the TDM opportunity That’s why we will present: ▪ Economics and impact of TDM ▪ EU Policy recommendations and later today... ▪ what this might look like in practice for stakeholders In particular we welcome from you the solutions you see for TDM policy from your sector’s perspective - how this might look at EU level and in your own environment. A workshop blog and report will be published on the website after the event
  15. 15. What’s next? Please continue to make your thoughts on this visible to the project so we can continue to shape best practice: ● tweet today using hashtag #FutureTDM ● write a guest blog on our website ● fill in our feedback cards here today in session 2 ● fill in our quick poll online at Save the date 13 June 2017 for our final Symposium in Salzburg where our stakeholder guidelines and roadmap for TDM uptake will be made available
  16. 16. Our engagement so far... Join the Community! FutureTDM Collaborative Knowledge Base and Open Information Hub Platform: Expert Blog Best Practice Catalogue Project Database Tool Selector Baseline Reports Comparative Analysis FutureTDM Guidelines Leading Projects Area (promoting/knowledge exchange - e.g. OpenMinTeD) Landscape Overview
  17. 17. OpenDataMonitor Horizon 2020 Coordination and Support Action GARRI-3-2014 Scientific Information in the Digital Age: Text and Data Mining (TDM) Project number: 665940 TDM – Economic Perspective FutureTDM Reducing Barriers and Increasing Uptake of Text and Data Mining for Research Environments using a Collaborative Knowledge and Open Information Approach Jan Strycharz, FutureTDM Workshop 2, European Parliament, Brussels 29th March 2017
  18. 18. TDM & Big Data 19FutureTDM
  19. 19. Value is KEY 20
  20. 20. The Meaning of Value 21
  21. 21. 22
  22. 22. TDM Process: Translating data… Understanding, Selecting, Preprocessing, Transformation, Choosing the method, Exploring, Discovering patterns, Interpreting …into action
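
The process stages named on this slide can be illustrated with a small sketch. This is only a toy under stated assumptions, not the project's method: the corpus, the stopword list and every function name are hypothetical, and "discovering patterns" is reduced here to finding terms shared across documents.

```python
import re
from collections import Counter

# Hypothetical toy corpus; a real TDM project would load lawfully accessed documents.
CORPUS = [
    "Text mining reveals patterns in research articles.",
    "Data mining reveals patterns in large datasets.",
    "Holiday photos from the beach.",
]

# Hypothetical, deliberately tiny stopword list.
STOPWORDS = {"in", "the", "from", "a", "of"}


def select(docs, keyword):
    """Selecting: keep only documents that mention the keyword."""
    return [d for d in docs if keyword in d.lower()]


def preprocess(doc):
    """Preprocessing: lowercase, tokenise, drop stopwords."""
    tokens = re.findall(r"[a-z]+", doc.lower())
    return [t for t in tokens if t not in STOPWORDS]


def transform(token_lists):
    """Transformation: turn each token list into a bag-of-words Counter."""
    return [Counter(tokens) for tokens in token_lists]


def discover_patterns(bags, min_docs=2):
    """Discovering patterns: terms that occur in at least min_docs documents."""
    doc_freq = Counter()
    for bag in bags:
        doc_freq.update(bag.keys())  # count each term once per document
    return sorted(t for t, n in doc_freq.items() if n >= min_docs)


def run_pipeline(docs, keyword):
    """Chain the stages: select, preprocess, transform, discover."""
    selected = select(docs, keyword)
    bags = transform([preprocess(d) for d in selected])
    return discover_patterns(bags)


shared_terms = run_pipeline(CORPUS, "mining")
print(shared_terms)
```

The "understanding" and "interpreting" stages stay with the human analyst: the code only produces the shared terms, and deciding what they mean is the step that turns data into action.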
  23. 23. Types of Data Analyzed #1 24
  24. 24. Types of Data #2 25
  25. 25. How Much - Market. Europe (2016 to 2021): Big Data 9.4 B USD to 29.4 B USD; TDM 2.5 B USD to 10.3 B USD. World (2016 to 2019): Big Data 26 B USD to 48 B USD; TDM 7 B USD to 17 B USD.
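
The slide gives start and end market sizes but no growth rates. As a rough aid, the sketch below derives the implied compound annual growth rate (CAGR) from those figures. CAGR is a standard measure brought in here for illustration, not something the presentation states, and the names `cagr` and `MARKET` are hypothetical.

```python
def cagr(start_value, end_value, years):
    """Compound annual growth rate implied by a start value, an end value and a span in years."""
    return (end_value / start_value) ** (1 / years) - 1


# Market sizes in billions of USD, copied from the slide: (start, end, years between them).
MARKET = {
    ("Europe", "Big Data"): (9.4, 29.4, 2021 - 2016),
    ("Europe", "TDM"): (2.5, 10.3, 2021 - 2016),
    ("World", "Big Data"): (26.0, 48.0, 2019 - 2016),
    ("World", "TDM"): (7.0, 17.0, 2019 - 2016),
}

growth = {key: cagr(*values) for key, values in MARKET.items()}

for (region, segment), rate in growth.items():
    print(f"{region} {segment}: {rate:.1%} per year")
```

On these figures the implied annual growth of the TDM segment is higher than that of Big Data as a whole in both regions. The European and world rows cover different periods (five and three years), so their rates are not directly comparable.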
  26. 26. How Much – Economic Impact 27
  27. 27. Value 28 1:5 1:11
  28. 28. How much – Economic Impact 29
  29. 29. Obstacles 30
  30. 30. Organizational Culture – Data Savvy 31
  31. 31. Organizational Culture – Data Savvy REFERENCES TO ECONOMIC PERSPECTIVES: Barel, Yannick. Big Data. Big Opportunity, IBM: Power Systems. Bradshaw, David, Gabriella Cattaneo, Rosanna Lifonti, John Simcox. 2014. Uptake of Cloud in Europe. Follow-up of IDC Study on Quantitative estimates of the demand for Cloud Computing in Europe and the likely barriers to take-up, Framingham: IDC Analyze the Future. Brown, Brad, Jacques Bughin, Angela Hung Byers, Michael Chui, Richard Dobbs, James Manyika, Charles Roxburgh. 2011. Big data: The next frontier for innovation, competition and productivity, McKinsey Global Institute. Buchholtz, Sonia, Maciej Bukowski, Aleksander Śniegocki. 2014. Big and open data in Europe. A growth engine or a missed opportunity?, Warszawa: demosEUROPA, Warsaw Institute for Economic Studies. Buytendijk, Frank. 2014. Hype Cycle for Big Data, Stamford: Gartner. Danson, Forrest et al. (eds.). 2016. Analytics Trends 2016. The Next Evolution, New York: Deloitte. European Commission. 2015. Digital Market Factsheet. European Parliament. 2016. Towards a thriving data-driven economy, Strasbourg. Forbes Insights. 2015. Betting on Big Data. How the right culture, strategy and investments can help you leapfrog the competition, Washington. Fujitsu. 2014. Fujitsu Market Insights. Big Data, Fujitsu Technology Solutions GmbH. Girard, Greg. From Big Data to Real-Time Business Value, Framingham: IDC Analyze the Future. Heudecker, Nick, Lisa Kart. 2014. Survey Analysis: Big Data Investment Grows but Deployments Remain Scarce in 2014, Stamford: Gartner. Heudecker, Nick, Merv Adrian. 2015. Survey Analysis: Hadoop Adoption Drivers and Challenges, Stamford: Gartner. Heudecker, Nick, Lisa Kart. 2015. Survey Analysis: Practical Challenges Mount as Big Data Moves to Mainstream, Stamford: Gartner. Kart, Lisa. 2015. Big Data Industry Insights, Stamford: Gartner.
  32. 32. OpenDataMonitor Horizon 2020 Coordination and Support Action GARRI-3-2014 Scientific Information in the Digital Age: Text and Data Mining (TDM) Project number: 665940 Overcoming legal barriers FutureTDM Reducing Barriers and Increasing Uptake of Text and Data Mining for Research Environments using a Collaborative Knowledge and Open Information Approach FutureTDM Workshop II | Brussels | March 29, 2017 Marco Caspers (Institute for Information Law, University of Amsterdam)
  33. 33. Introduction To discuss: ▪Our work in FutureTDM ▪Policy framework ▪Principles ▪Designing a TDM exception 34FutureTDM
  34. 34. Our work • Desk research •regulation •policies •economy • Stakeholder engagement •interviews •workshops •conferences • Results: •Barriers to TDM •Legal & policies •Skills and education •Economy •Technical & infrastructure •Recommendations!
  35. 35. FutureTDM Policy Framework • Deliverable D5.1 recommendations cover: •Legal & Policies •Skills & Education •Economy & Incentives •Technical & Infrastructure • This presentation: •Only legal recos in context of TDM exception •Underlying research •Legal research into copyright and database law •Questionnaire on national exceptions •Response: 16 Member States •Stakeholder engagement •Views on legal rules
  36. 36. Fundamental principles 37
  37. 37. Scope and design of TDM exception • From our policy framework: • “Ensure legal rules reflect a fair balance between the interests of TDM practitioners and rightsholders (of copyright and database rights), and reaffirm that ideas and facts as such are not protected” • “Create harmonised and mandatory rules” • “Make clear rules” → minimise borderline cases Acts? Purpose? Beneficiary? Overridable? Subject matter?
  38. 38. Acts • Reproductions & Extractions • UK (only repro) | EU • Digital copies • FR • FutureTDM: • All reproductions and extractions • No visibility for public • only aggregated information • No trade on expression → extraction of facts and ideas • No limits to retention • Reproducibility & verifiability • Or else: other exception? Acts? Purpose? Beneficiary? Overridable? Subject-matter?
  39. 39. Purpose • Scientific research • UK | DSM proposal • Non-commercial • UK | France • FutureTDM: • No restriction • prevent border cases 40 Acts? Purpose? Beneficiary? Overridable? Subject-matter?
  40. 40. Beneficiary • Lawful access • UK | FR | DSM proposal • Research organisation • DSM proposal • trade-off with ‘non-commercial’ • FutureTDM • Lawful access • If allowed to read, allowed to mine • trade on ideas, not expression Acts? Purpose? Beneficiary? Overridable? Subject-matter?
  41. 41. Overridable • Overridable by contract • Not: UK | FR | DSM proposal • technical overridability? • FutureTDM • Not overridable • Fragmented licensing landscape • Effectiveness 42 Acts? Purpose? Beneficiary? Overridable? Subject-matter?
  42. 42. Subject-matter • No restriction • UK | EU • Scientific literature • FR • FutureTDM • No restriction • Copyright principles apply horizontally • Uncertainty → borderline cases Acts? Purpose? Beneficiary? Overridable? Subject-matter?
  43. 43. Compensation • UK | France | EU • No • Considerations for compensation • purpose: missed out on exploitation of the work (expression) • no where meant to cover investments (in services)
  44. 44. Measures • In essence no copyright issue • Overridability by (technical) security measures? •if yes: may not change status quo •if no: must achieve fair balance • How to balance? •voluntary ‘agreed’ best practices? •service-level obligations? •transparency?
  45. 45. What’s next? Want to know more? • See our Deliverable D5.1 “FutureTDM Policy Framework” • and upcoming deliverables: •Guidelines for stakeholders (end April) • You can provide your view in our next session now
  46. 46. FutureTDM Workshop II, 29 March – Any Questions? Q & A All slides will be on FutureTDM slideshare after the event #FutureTDM