Published on Mar 19, 2015 by PMR
Copyright is one of the greatest barrier to Open Data. This presentation for insidegovernment UK shows the struggle between those who want to reform copyright and those opposed to reform
1. Open government data is becoming more widespread, with governments, local authorities, and cities increasingly releasing their data to the public.
2. The author helped create data.gov.uk and establish principles for making public data open, accessible, and reusable.
3. Early examples show how open data can power applications from the public to track issues like anti-social behavior orders or find NHS dentists.
Emerging Institutional Paradigms for the Digital CommonsBob Chao
The document discusses emerging models for digital commons, which are digital data and information from publicly-funded sources made freely available online. It describes existing models like open-source software, open data repositories, and open access journals. Digital commons provide benefits like facilitating knowledge sharing globally and avoiding duplication. However, barriers include developing policies and incentives to support different commons models while balancing other values like intellectual property.
Social, Political and Legal Aspects of Text and Data Mining (TDM)Richard Smith-Unna
As part of the ContentMine project (http://contentmine.org), this talk gives an up-to-date (in 2014) overview of text and data mining.
Written by Michelle Brook, Charles Oppenheim and Peter Murray-Rust.
This presentation was given by Charles Oppenheim at WOSP 2014.
The HathiTrust Research Center: Enabling New Knowledge Through Shared Infrastructure
Robert McDonald - HathiTrust Research Center Executive committee member; Associate Dean for Library Technologies, Indiana University
The Progress of an S&T Information GatewayBob Chao
The document summarizes the progress towards establishing an open access science and technology (S&T) information gateway in Taiwan. It describes Taiwan's movement towards open source and open access models, including initiatives like Creative Commons Taiwan. It also outlines Taiwan's policies and organizations that aim to facilitate open sharing of S&T information through initiatives like the Government Research Bulletin database and CONsortium on Core Electronic Resources in Taiwan. The ultimate goal is an "integral system" as mandated by Taiwan's S&T Basic Law to create a comprehensive domestic and international S&T information network.
OU Library Research Support webinar: Data sharing: legal and ethical issuesdancrane_open
This document summarizes a workshop on data sharing and the legal and ethical issues involved. It covers copyright and database rights as they relate to data, obtaining proper consent from participants, and ensuring ethics approval. Open licensing options and considerations for reusing data from repositories are also discussed. Resources from the Open University Library and other organizations are provided to help with data management planning, consent forms, preparing data for sharing, and addressing any questions.
1. Open government data is becoming more widespread, with governments, local authorities, and cities increasingly releasing their data to the public.
2. The author helped create data.gov.uk and establish principles for making public data open, accessible, and reusable.
3. Early examples show how open data can power applications from the public to track issues like anti-social behavior orders or find NHS dentists.
Emerging Institutional Paradigms for the Digital CommonsBob Chao
The document discusses emerging models for digital commons, which are digital data and information from publicly-funded sources made freely available online. It describes existing models like open-source software, open data repositories, and open access journals. Digital commons provide benefits like facilitating knowledge sharing globally and avoiding duplication. However, barriers include developing policies and incentives to support different commons models while balancing other values like intellectual property.
Social, Political and Legal Aspects of Text and Data Mining (TDM)Richard Smith-Unna
As part of the ContentMine project (http://contentmine.org), this talk gives an up-to-date (in 2014) overview of text and data mining.
Written by Michelle Brook, Charles Oppenheim and Peter Murray-Rust.
This presentation was given by Charles Oppenheim at WOSP 2014.
The HathiTrust Research Center: Enabling New Knowledge Through Shared Infrastructure
Robert McDonald - HathiTrust Research Center Executive committee member; Associate Dean for Library Technologies, Indiana University
The Progress of an S&T Information GatewayBob Chao
The document summarizes the progress towards establishing an open access science and technology (S&T) information gateway in Taiwan. It describes Taiwan's movement towards open source and open access models, including initiatives like Creative Commons Taiwan. It also outlines Taiwan's policies and organizations that aim to facilitate open sharing of S&T information through initiatives like the Government Research Bulletin database and CONsortium on Core Electronic Resources in Taiwan. The ultimate goal is an "integral system" as mandated by Taiwan's S&T Basic Law to create a comprehensive domestic and international S&T information network.
OU Library Research Support webinar: Data sharing: legal and ethical issuesdancrane_open
This document summarizes a workshop on data sharing and the legal and ethical issues involved. It covers copyright and database rights as they relate to data, obtaining proper consent from participants, and ensuring ethics approval. Open licensing options and considerations for reusing data from repositories are also discussed. Resources from the Open University Library and other organizations are provided to help with data management planning, consent forms, preparing data for sharing, and addressing any questions.
In scientific communication, we observe a complex interaction of several stakeholder groups, each of which have distinct interests, strategies and approaches for Open Access and Open Data. The German government initiated a “Commission for the Future of the Information Infrastructure” (KII) in Germany. In this commission, most of the stakeholders are working together in order to design a future scenario for the supply of scientific information. The KII’s evaluation and recommendations for Open Access as well as research data will be particularly highly recognized and will significantly influence Open Access and Open Data developments in Germany.
I will outline the current situation in Germany – players and their interactions in terms of Open Access and Open Data – and present two initiatives and their work in detail. One of them, the KII process, will show the official site, the other one will show the grassroots site of the story.
Overview of the Open Access Landscape - ALA ALCTS Midwinter SymposiumRichard Huffine
The document provides an overview of open access to federally funded research. It defines open access as digital content that is available online for free without restrictions. It describes different types of open access like gold OA (content made freely available by journals) and green OA (content made freely available through repositories). It discusses US federal mandates for open access, including legislation that has been proposed but not passed as well as the current NIH public access policy. It outlines the current state of open access implementation across federal agencies and roles that different organizations can play in providing access to research outputs and data.
Copyright protects original works of authorship and applies to both published and unpublished works. Libraries, museums, and archives are carrying out large-scale digitization projects to make collections publicly accessible online with support from foundations. While mass digitization has highlighted the need to balance copyright protection and access, legislative changes are difficult so workarounds are used instead. Fair use best practices aim to provide clarity on assessing fair use based on whether the use is transformative and only takes the amount needed to achieve its purpose.
Workshop at Oxford on publishing for early career researchers - April 2011Jisc
This document discusses open publishing and what it offers researchers. It defines open as referring to permissions, cost, time, and access to papers, monographs, theses, and data. Researchers are encouraged to make their work openly accessible by putting papers in institutional repositories, publishing in open access journals, and negotiating rights with publishers. Open publishing can provide benefits like increased citations, savings and economic benefits, but does face challenges in funding models and getting researchers to participate. The document provides guidance on making theses, papers, monographs, and data open through various options and platforms.
This document discusses resources for empirical patent research including data sources, challenges, and tools. It provides an overview of available raw patent data sources such as national and regional patent office databases. It also outlines proprietary patent platforms and existing statistical datasets. The document notes that literature in the field is interdisciplinary and dispersed, and comprehensive searching is challenging. It recommends statistical software and working with colleagues in related fields like economics.
II Konferencja Naukowa : Nauka o informacji (informacja naukowa) w okresie zmian, Warszawa, 15-16.04.2013 r. Instytut Informacji Naukowej i Studiów Bibliologicznych, Uniwersytet Warszawski
The 2nd Scientific Conference : Information Science in an Age of Change, April 15-16, 2013. Institute of Information and Book Studies, University of Warsaw
The document discusses intellectual property, copyright, and related legal issues. It provides an overview of what is covered by copyright, including economic rights like reproduction and distribution rights, as well as moral rights of attribution and integrity. It outlines recent changes and developments in copyright law through international agreements and legislation in Australia. Current legal cases involving internet piracy and fair use of headlines are also summarized. Alternative approaches to copyright like copyleft and Creative Commons are mentioned.
The document discusses the evolving view of public access to publicly funded research in the US. It provides an overview of key events and policies that have shaped the discussion, including the Scholarly Publishing Roundtable in 2009, the America COMPETES Act of 2010, and the OSTP Directive of 2013. It also describes some partnerships between funding agencies and publishers to increase access to research outputs while recognizing the role of publishers.
School of rocking copyright 2017 in Lisbon centrumcyfrowe
The document discusses copyright reform in the European Union, outlining the current problems with copyright law, the proposals from the European Commission, and recommendations from education advocates. It analyzes proposals around exceptions for education and text and data mining, as well as a new neighboring right for publishers. Overall, it argues the Commission's proposals do not go far enough to modernize copyright for uses like education across the EU.
Opening up government information and data: developments in policy and practiceAnne Fitzgerald
"Opening up government information and data: developments in policy and practice", Presentation by Neale Hooper and Dr Anne Fitzgerald to the "Information Futures: Open Access, Open Data and Creative Commons" Professional Practice Seminar, Monash University, Melbourne, Australia on 23 October 2014
Librarian RDM Training: Ethics and copyright for research dataRobin Rice
This document provides an overview of ethics and copyright as they relate to research data management. It discusses ethical requirements for collecting human subject data, including obtaining consent and protecting privacy and confidentiality. Certain types of research may be exempt from ethics review. Intellectual property rights can apply to research data depending on the level of creativity in the data's collection and organization. Data licensing is an alternative to asserting copyright that allows explicitly defining how data can be used.
Frequently-asked questions on Freedom of Information and Environmental Inform...Chris Rusbridge
This document discusses Freedom of Information (FOI) and Environmental Information Regulations (EIR) as they relate to requests for research data. It finds that recorded requests for research data directly from universities are rare, with about 40 such requests across 21 universities over a 3 year period. Exemptions under FOI and exceptions under EIR that may be relevant to protecting research data include information intended for future publication, information related to ongoing research, and personal or confidential information. The document encourages consultation with FOI practitioners in responding to any requests and considers guidance from legal sources on complying with regulations.
Copyright and consent briefing for open educational resourcesmeganqb
This is a powerpoint file prepared for the "OER in the disciplines" conference 26 October 2010. It covers UK copyright law, the need to consider consent from people to be involved, and risks of non-compliance with best practice when using and contributing to open educational resources in teaching and learning.
1. INSIGHT is a €80m research center funded from 2013-2019 that brings together over 60 PIs and 300 researchers across 5 research centers and 6 institutions in Ireland.
2. The research focuses on developing a "Network of Knowledge" through linking open data using semantic web technologies like RDF and ontologies to create a universal, interconnected knowledge graph.
3. One goal is to accelerate innovation by applying linked data approaches to domains like digital humanities, life sciences, and connected health.
Thinking about resource issues: copyright and open accessAllison Fullard
The presentation was given to an international group of public health academics from African and Asian countries. They are preparing learning content for courses to be delivered in blended learning environments. Thinking about how copyright needs to be re-calibrated for our circumstances in 21st Century. Two publicly shared video clips are embedded into the file.
This document summarizes key points from a lecture and reading about new media institutions and business models. It discusses commercial vs volunteer production of information goods, challenges to strict copyright law, and different strategies for dealing with intellectual property like exclusive rights vs non-exclusive strategies. Key non-exclusive strategies discussed include cost minimization to obtain information inputs at low costs, and reliance on relationships and social networks rather than asserting exclusive rights. The document provides several examples to illustrate different strategies like scholarly lawyers, software publishing, and open source communities.
1) The document discusses university invention ownership models in Europe compared to the US Bayh-Dole Act.
2) In Europe, universities emphasize spinning off companies from inventions rather than licensing patents like in the US.
3) While the UK and Germany have policies promoting university patent ownership, overall university-owned patents make up a lower share, around 55%, compared to the nearly 100% share seen in the US under Bayh-Dole.
Presentation given by Rebecca Grant, Digital Archivist at the Digital Repository of Ireland, at the Heritage Week Event Organiser Training Day, a special day of talks covering every aspect of planning a Heritage Week event. The presentation outlines techniques and methods for sourcing and using primary and secondary sources in heritage event planning.
A Digitalization Partnership: Sharing human and material resources by Larissa...dduin
This document summarizes key points about intellectual property rights (IPR) considerations for a digitalization partnership between three Belgian institutions - NBGB, RBINS, and RMCA - called Be-TAF. It discusses IPR best practices for openly licensing content, managing copyright risks for different date ranges of publications, and models for rights agreements. It also examines some controversial edge cases and considers scientists' views that publicly funded research results should be widely disseminated with proper citation. The conclusion is that while efforts will be made to avoid infringing content, some risk cannot be eliminated and a notice-and-takedown policy should be implemented to manage any issues.
This document summarizes a presentation on using open-source tools to provide access to scientific literature on climate change and migration. It describes how ContentMine has built tools called "Open Climate Knowledge" to mine scientific articles on climate change from publishers' websites and other open sources. However, most of this literature (50-90%) is currently behind paywalls. The tools allow querying across open-access sources to provide summaries of available literature on topics like the relationship between climate change and human migration. Examples of results from initial queries on this topic are also provided.
Going Digital seminar, Hobart, Tasmania, 27 June 2014 - Dr Anne Fitzgerald: "...Anne Fitzgerald
Presentation "Challenges and Opportunities in the Digital Economy" by Dr Anne Fitzgerald at the "Going Digital - Law for the Digital Economy" seminar, in Hobart, Tasmania, Australia on 27 June 2014, covering legal challenges and opportunities in the digital economy.
This document summarizes Peter Murray-Rust's presentation on content mining and copyright. The presentation discusses how current copyright law hinders content mining of scholarly articles, which can help address grand challenges. It describes the benefits of content mining for research and innovation. However, major publishers oppose content mining and have stopped researchers from downloading articles needed for their work. Changes to copyright law are needed but not enough; governments and institutions must actively support content mining through funding, tools and protecting researchers.
In scientific communication, we observe a complex interaction of several stakeholder groups, each of which have distinct interests, strategies and approaches for Open Access and Open Data. The German government initiated a “Commission for the Future of the Information Infrastructure” (KII) in Germany. In this commission, most of the stakeholders are working together in order to design a future scenario for the supply of scientific information. The KII’s evaluation and recommendations for Open Access as well as research data will be particularly highly recognized and will significantly influence Open Access and Open Data developments in Germany.
I will outline the current situation in Germany – players and their interactions in terms of Open Access and Open Data – and present two initiatives and their work in detail. One of them, the KII process, will show the official site, the other one will show the grassroots site of the story.
Overview of the Open Access Landscape - ALA ALCTS Midwinter SymposiumRichard Huffine
The document provides an overview of open access to federally funded research. It defines open access as digital content that is available online for free without restrictions. It describes different types of open access like gold OA (content made freely available by journals) and green OA (content made freely available through repositories). It discusses US federal mandates for open access, including legislation that has been proposed but not passed as well as the current NIH public access policy. It outlines the current state of open access implementation across federal agencies and roles that different organizations can play in providing access to research outputs and data.
Copyright protects original works of authorship and applies to both published and unpublished works. Libraries, museums, and archives are carrying out large-scale digitization projects to make collections publicly accessible online with support from foundations. While mass digitization has highlighted the need to balance copyright protection and access, legislative changes are difficult so workarounds are used instead. Fair use best practices aim to provide clarity on assessing fair use based on whether the use is transformative and only takes the amount needed to achieve its purpose.
Workshop at Oxford on publishing for early career researchers - April 2011Jisc
This document discusses open publishing and what it offers researchers. It defines open as referring to permissions, cost, time, and access to papers, monographs, theses, and data. Researchers are encouraged to make their work openly accessible by putting papers in institutional repositories, publishing in open access journals, and negotiating rights with publishers. Open publishing can provide benefits like increased citations, savings and economic benefits, but does face challenges in funding models and getting researchers to participate. The document provides guidance on making theses, papers, monographs, and data open through various options and platforms.
This document discusses resources for empirical patent research including data sources, challenges, and tools. It provides an overview of available raw patent data sources such as national and regional patent office databases. It also outlines proprietary patent platforms and existing statistical datasets. The document notes that literature in the field is interdisciplinary and dispersed, and comprehensive searching is challenging. It recommends statistical software and working with colleagues in related fields like economics.
II Konferencja Naukowa : Nauka o informacji (informacja naukowa) w okresie zmian, Warszawa, 15-16.04.2013 r. Instytut Informacji Naukowej i Studiów Bibliologicznych, Uniwersytet Warszawski
The 2nd Scientific Conference : Information Science in an Age of Change, April 15-16, 2013. Institute of Information and Book Studies, University of Warsaw
The document discusses intellectual property, copyright, and related legal issues. It provides an overview of what is covered by copyright, including economic rights like reproduction and distribution rights, as well as moral rights of attribution and integrity. It outlines recent changes and developments in copyright law through international agreements and legislation in Australia. Current legal cases involving internet piracy and fair use of headlines are also summarized. Alternative approaches to copyright like copyleft and Creative Commons are mentioned.
The document discusses the evolving view of public access to publicly funded research in the US. It provides an overview of key events and policies that have shaped the discussion, including the Scholarly Publishing Roundtable in 2009, the America COMPETES Act of 2010, and the OSTP Directive of 2013. It also describes some partnerships between funding agencies and publishers to increase access to research outputs while recognizing the role of publishers.
School of rocking copyright 2017 in Lisbon centrumcyfrowe
The document discusses copyright reform in the European Union, outlining the current problems with copyright law, the proposals from the European Commission, and recommendations from education advocates. It analyzes proposals around exceptions for education and text and data mining, as well as a new neighboring right for publishers. Overall, it argues the Commission's proposals do not go far enough to modernize copyright for uses like education across the EU.
Opening up government information and data: developments in policy and practiceAnne Fitzgerald
"Opening up government information and data: developments in policy and practice", Presentation by Neale Hooper and Dr Anne Fitzgerald to the "Information Futures: Open Access, Open Data and Creative Commons" Professional Practice Seminar, Monash University, Melbourne, Australia on 23 October 2014
Librarian RDM Training: Ethics and copyright for research dataRobin Rice
This document provides an overview of ethics and copyright as they relate to research data management. It discusses ethical requirements for collecting human subject data, including obtaining consent and protecting privacy and confidentiality. Certain types of research may be exempt from ethics review. Intellectual property rights can apply to research data depending on the level of creativity in the data's collection and organization. Data licensing is an alternative to asserting copyright that allows explicitly defining how data can be used.
Frequently-asked questions on Freedom of Information and Environmental Inform...Chris Rusbridge
This document discusses Freedom of Information (FOI) and Environmental Information Regulations (EIR) as they relate to requests for research data. It finds that recorded requests for research data directly from universities are rare, with about 40 such requests across 21 universities over a 3 year period. Exemptions under FOI and exceptions under EIR that may be relevant to protecting research data include information intended for future publication, information related to ongoing research, and personal or confidential information. The document encourages consultation with FOI practitioners in responding to any requests and considers guidance from legal sources on complying with regulations.
Copyright and consent briefing for open educational resourcesmeganqb
This is a powerpoint file prepared for the "OER in the disciplines" conference 26 October 2010. It covers UK copyright law, the need to consider consent from people to be involved, and risks of non-compliance with best practice when using and contributing to open educational resources in teaching and learning.
1. INSIGHT is a €80m research center funded from 2013-2019 that brings together over 60 PIs and 300 researchers across 5 research centers and 6 institutions in Ireland.
2. The research focuses on developing a "Network of Knowledge" through linking open data using semantic web technologies like RDF and ontologies to create a universal, interconnected knowledge graph.
3. One goal is to accelerate innovation by applying linked data approaches to domains like digital humanities, life sciences, and connected health.
Thinking about resource issues: copyright and open accessAllison Fullard
The presentation was given to an international group of public health academics from African and Asian countries. They are preparing learning content for courses to be delivered in blended learning environments. Thinking about how copyright needs to be re-calibrated for our circumstances in 21st Century. Two publicly shared video clips are embedded into the file.
This document summarizes key points from a lecture and reading about new media institutions and business models. It discusses commercial vs volunteer production of information goods, challenges to strict copyright law, and different strategies for dealing with intellectual property like exclusive rights vs non-exclusive strategies. Key non-exclusive strategies discussed include cost minimization to obtain information inputs at low costs, and reliance on relationships and social networks rather than asserting exclusive rights. The document provides several examples to illustrate different strategies like scholarly lawyers, software publishing, and open source communities.
1) The document discusses university invention ownership models in Europe compared to the US Bayh-Dole Act.
2) In Europe, universities emphasize spinning off companies from inventions rather than licensing patents like in the US.
3) While the UK and Germany have policies promoting university patent ownership, overall university-owned patents make up a lower share, around 55%, compared to the nearly 100% share seen in the US under Bayh-Dole.
Presentation given by Rebecca Grant, Digital Archivist at the Digital Repository of Ireland, at the Heritage Week Event Organiser Training Day, a special day of talks covering every aspect of planning a Heritage Week event. The presentation outlines techniques and methods for sourcing and using primary and secondary sources in heritage event planning.
A Digitalization Partnership: Sharing human and material resources by Larissa...dduin
This document summarizes key points about intellectual property rights (IPR) considerations for a digitalization partnership between three Belgian institutions - NBGB, RBINS, and RMCA - called Be-TAF. It discusses IPR best practices for openly licensing content, managing copyright risks for different date ranges of publications, and models for rights agreements. It also examines some controversial edge cases and considers scientists' views that publicly funded research results should be widely disseminated with proper citation. The conclusion is that while efforts will be made to avoid infringing content, some risk cannot be eliminated and a notice-and-takedown policy should be implemented to manage any issues.
This document summarizes a presentation on using open-source tools to provide access to scientific literature on climate change and migration. It describes how ContentMine has built tools called "Open Climate Knowledge" to mine scientific articles on climate change from publishers' websites and other open sources. However, most of this literature (50-90%) is currently behind paywalls. The tools allow querying across open-access sources to provide summaries of available literature on topics like the relationship between climate change and human migration. Examples of results from initial queries on this topic are also provided.
Going Digital seminar, Hobart, Tasmania, 27 June 2014 - Dr Anne Fitzgerald: "...Anne Fitzgerald
Presentation "Challenges and Opportunities in the Digital Economy" by Dr Anne Fitzgerald at the "Going Digital - Law for the Digital Economy" seminar, in Hobart, Tasmania, Australia on 27 June 2014, covering legal challenges and opportunities in the digital economy.
This document summarizes Peter Murray-Rust's presentation on content mining and copyright. The presentation discusses how current copyright law hinders content mining of scholarly articles, which can help address grand challenges. It describes the benefits of content mining for research and innovation. However, major publishers oppose content mining and have stopped researchers from downloading articles needed for their work. Changes to copyright law are needed but not enough; governments and institutions must actively support content mining through funding, tools and protecting researchers.
Libraries at the centre of the debate on copyright and text and data mining: ...LIBER Europe
This document discusses text and data mining (TDM) and the debate around copyright. It makes three key points:
1) TDM has the potential to save lives by discovering new medical treatments, but copyright law is unclear if TDM infringes on copyright by copying content. Licensing models are not scalable solutions.
2) A workshop with stakeholders found that copyright reform is needed, including a specific exception in EU law for TDM and addressing legal uncertainty. Harmonization of copyright law across borders is also important.
3) Going forward, LIBER will advocate for copyright reform, support international harmonization efforts, engage stakeholders, and promote open access and interoperable licensing to help TDM grow in Europe
Clare Lanigan - Copyright and digital preservationdri_ireland
Presented at DRI Members Forum, 6th March 2019 by Clare Lanigan, Education & Outreach Manager at DRI. An overview of copyright requirements when archiving and publishing digital collections.
Automatic Extraction of Science and Medicine from the scholarly literatureTheContentMine
Published on Jun 04, 2015 by PMR
Many scientists have to extract many facts out the scholarly literature - to evaluate other work or to extract useful collections of facts. This shows the approach, especially for systematic reviews of animal or clinical trials
Automatic Extraction of Science and Medicine from the scholarly literaturepetermurrayrust
Many scientists have to extract many facts out the scholarly literature - to evaluate other work or to extract useful collections of facts. This shows the approach, especially for systematic reviews of animal or clinical trials
This document discusses open science business models and licensing. It begins with a word of caution about openness and business realities. It then covers open source licensing models including copyleft licenses. Open science principles are discussed along with licensing models for scientific outputs such as copyright, patents, and data. The document also discusses the growth of Creative Commons licensing and open access developments and business models. It analyzes licensing approaches for patents, databases, and open data, concluding with a discussion of open science research projects and the semantic web.
Co-presented for the course INLS 720: Metadata Architectures and Applications at UNC SILS. Subsequently, we also presented at the February 2013 meeting of the UNC Scholarly Communications Working Group. This presentation covered copyright in the context of metadata re-use, plus two case studies (one examining Duke University Press and the other examining open bibliographic data).
Presentation made by Dr. Tabrez Ahmad in the International Conference on "Open Sesame: Unlocking IP To Unleash The Commons".
A Joint Conference Under The Consilience Banner by: The Law & Technology Society (NLS) and Spicy IP, Supported by: MHRD Chair in IP, NLS. 28th & 29th May, 2016.
Public Sphere: Gov 2.0 - Brian FitzgeraldPia Waugh
The document discusses improving access to and reuse of public sector information through removing barriers to information flow. It advocates for open access, open standards, open licensing and open business models to share knowledge. Several case studies are provided that highlight challenges around access to geospatial and transport data in Australia. The document argues for adopting open licensing frameworks like Creative Commons to enable greater reuse of public sector information.
Copyright, Open Science, and Challenges for Research LibrariesAndres Guadamuz
This document discusses open science and challenges for research libraries. It summarizes views from the European Commission and White House supporting open access to publicly funded research to benefit the economy and innovation. It also outlines recent legal developments regarding orphan works, open licensing like CC 4.0, and cases establishing that open licensing and self-archiving can preempt copyright disputes. Data mining and ensuring open repository policies and APIs were discussed as ongoing issues.
"Let’s tackle it together: recent changes in copyright and intellectual prope...TDBaldwin
The document summarizes recent changes to UK copyright law and what they mean for academic libraries. Key changes include reforms to the Copyright, Designs and Patents Act 1988, the introduction of Extended Collective Licensing, and extensions to copyright terms. For higher education, the changes enable greater use of works for teaching, research, and library services. Academic libraries need to work with rights holders and other groups to understand and respond to ongoing legal reforms. Staying up to date on copyright issues requires following various organizations and resources.
Developing a research Library position statement on Text and Data Mining in t...Danny Kingsley
These are slides from a workshop held during the RLUK2017 Conference http://rlukconference.com/ presented by Dr Danny Kingsley, Dr Deborah Hansen and Anna Vernon.
The Abstract:
"The library community has been almost silent on the issue of text and data mining (T&DM) partly due to concerns about the risk of having institutions ‘cut off’ from subscriptions due to large downloads of research articles for the purpose of mining. This workshop is an intention to identify where the information rests about T&DM - including looking at the details as they appear in Jisc negotiated licenses - consider some case studies and develop together a set of principles that identify the position of research libraries in the on the issue of T&DM. "
The Regulation of Text and Data MiningLIBER Europe
The Regulation of Text and Data Mining (Melanie Dulong de Rosnay, CNRS, France). This presentation was one of the 10 most highly ranked at LIBER's Annual Conference 2014 in Riga, Latvia. Learn more: www.libereurope.eu
Data availability policies and licensingBrian Hole
This document discusses data availability policies and licensing. It covers the social contract of science including validation, dissemination and further development. It also discusses scientific malpractice by publishers, researchers and libraries. The document advocates for fair use copyright exceptions to allow mining of text and data for research and teaching. It summarizes benefits of text mining but notes issues with the "Licenses for Europe" initiative, including an unbalanced setup and lack of representation for the technology sector. Civil society has walked out of working group discussions on text and data mining licenses.
The European Parliament committee approved a proposed EU copyright directive that would require online platforms to screen uploaded content for copyrighted material using automated filters. This "Article 13" has raised concerns that it could block legitimate content like memes, snippets in news articles, and open source software. Another part of the proposal would require services like Facebook and Google to pay news publishers for displaying snippets of their articles. The full European Parliament will now vote on the proposal, which if passed would then need to be implemented by individual EU countries, with some variation possible. Critics argue it could negatively impact internet sharing and open innovation. Supporters say it supports publishers and democracy. The impacts on companies like Google and ability to share content online remain unclear.
The document outlines 10 principles for public rights in Australian copyright law. The principles are intended to provoke discussion and articulate the interests of those who support public access to and use of copyrighted works. The principles call for balancing private rights with public interests, limiting the scope and term of copyright, preserving Australian publications, allowing fair use through flexible exceptions, ensuring fair statutory licenses, supporting voluntary licensing, protecting public rights from technology and contracts, proportional enforcement, and free access to publicly funded works.
High throughput mining of the scholarly literature TheContentMine
Published on Jun 7, 2016 by PMR
Talk given to statisticians in Tilburg, with emphasis on scholarly comms for detecting unusual features. Includes demo of Amanuens.is and image mining
Amanuens.is HUmans and machines annotating scholarly literature TheContentMine
Published on May 19, 2016 by PMR
about 10,000 scholarly articles ("papers") are published each day. Amanuens.is is a symbiont of ContentMine and Hypothes.is (both Shuttleworth projects/Fellows) which annotates theses using an array of controlled vocabularies ("dictionaries"). The results, in semantic form are used to annotate the original material. The talk had live demos and used plant chemistry as the examples
This document summarizes Peter Murray-Rust's presentation on open content and programs at MIOSS 2016. It discusses how open approaches can lead to faster, better, and more inclusive research. It provides examples of open source tools for tasks like chemical entity recognition, structure identification, and chemical language parsing. It also describes efforts to openly mine scientific literature to extract facts and analyze large amounts of data.
Automatic Extraction of Knowledge from the LiteratureTheContentMine
Published on May 11, 2016 by PMR
ContentMine tools (and the Harvest alliance) can be used to search the literature for knowledge, especially in biomedicine. All tools are Open and shortly we shall be indexing the complete daily scholarly literature
Automatic Extraction of Knowledge from Biomedical literature TheContentMine
Published on Mar 16, 2016 by PMR
A plenary lecture to Cochrane Collaboration in Birmingham, on the value of automatically extracting knowledge. Covers the Why? How? What? Who? and problems and invites collaboration
Liberating facts from the scientific literature - Jisc Digifest 2016 TheContentMine
Published on Mar 4, 2016 by PMR
Text and data mining (TDM) techniques can be applied to a wide range of materials, from published research papers, books and theses, to cultural heritage materials, digitised collections, administrative and management reports and documentation, etc. Use cases include academic research, resource discovery and business intelligence.
This workshop will show the value and benefits of TDM techniques and demonstrate how ContentMine aims to liberate 100,000,000 facts from the scientific literature, and ContentMine will provide a hands on demo on a topical and accessible scientific/medical subject.
Published on Feb 29, 2016 by PMR
An overview of Text and Data Mining (ContentMining) including live demonstrations. The fundamentals: discover, scrape, normalize , facet/index, analyze, publish are exemplified using the recent Zika outbreak. Mining covers textual and non-textual content and examples of chemistry and phylogenetic tress are given.
Published on Feb 07, 2016 by PMR
Use of ContentMine tools on the Open Access subset of EuropePubMedCentral to discover new knowledge about the Zika virus. Includes clips of the software in action
Published on Jan 29, 2016 by PMR
Keynote talk to LEARN (LERU/H2020 project) for research data management. Emphasizes that problems are cultural not technical. Promotes modern approaches such as Git / continuous Integration, announces DAT. Asserts that the Right to Read in the Right to Mine. Calls for widespread development of content mining (TDM)
This document summarizes Peter Murray-Rust's work on mining scientific images and extracting structured information from them. It discusses his software tools for image analysis, including recognizing figures, tables, chemical structures, phylogenetic trees and extracting data. It provides examples of extracting information from images and transforming unstructured data in PDFs into semantic, computable formats. The document also discusses opportunities for collaboration and making the software openly available.
Digital Scholarship: Enlightenment or Devastated Landscape? TheContentMine
Published on Dec 17, 2015 by PMR
Every year 500 Billion USD of public funding is spent on research, but much of this lies hidden in papers that are never read. I describe how machines can help us to read the literature. However there is massive opposition from publishers who are trying to prevent open scholarship and who build walled gardens that they control
Open Knowledge and University of Cambridge European Bioinformatics InstituteTheContentMine
This document discusses open data and open science. It highlights Jean-Claude Bradley as a pioneer of open notebook science and open data who believed closed data means people die. It describes tools like ContentMine that can automatically extract data like chemical reactions, phylogenetic trees and clinical trial results from papers. Visitors can extract specific types of data while repositories can solve problems communally with continuous publication and validation.
Can Computers understand the scientific literature (includes compscie material)TheContentMine
Published on Jan 24, 2014 by PMR
With the semantic web machines can autonomously carry out many knowledge-based tasks as well as humans. The main problems are not technical but the prevention of access to information. I advocate automatic downloading and indexing of all scientific information
Jean-Claude Bradley's vision of open notebook science aims to change how science is done by making all research processes and outputs openly accessible in real-time. This addresses significant waste in the current system where most research funds are wasted due to flawed design, non-publication and poor reporting of results. Open science is inspired by open source software which engineers repositories for transparent and collaborative development, validation and improvement by global communities. The combination of openness, truth and community can optimize scientific processes and outputs.
Published on Jul 24, 2014 by PMR
PhD Theses are normally locked away digitally. They cost 20 billion dollars to create and we waste much of this value. By making them open we can use software to read, index, reuse, compute and add massive value
Published on Aug 22, 2014 by PMR
Open Data and Open Science presented in Rio for Open Science 2014-08-22. I argue that Open Notebook Science is the way forward and will lead to great benefits
How To Cultivate Community Affinity Throughout The Generosity JourneyAggregage
This session will dive into how to create rich generosity experiences that foster long-lasting relationships. You’ll walk away with actionable insights to redefine how you engage with your supporters — emphasizing trust, engagement, and community!
RFP for Reno's Community Assistance CenterThis Is Reno
Property appraisals completed in May for downtown Reno’s Community Assistance and Triage Centers (CAC) reveal that repairing the buildings to bring them back into service would cost an estimated $10.1 million—nearly four times the amount previously reported by city staff.
Indira awas yojana housing scheme renamed as PMAYnarinav14
Indira Awas Yojana (IAY) played a significant role in addressing rural housing needs in India. It emerged as a comprehensive program for affordable housing solutions in rural areas, predating the government’s broader focus on mass housing initiatives.
Jennifer Schaus and Associates hosts a complimentary webinar series on The FAR in 2024. Join the webinars on Wednesdays and Fridays at noon, eastern.
Recordings are on YouTube and the company website.
https://www.youtube.com/@jenniferschaus/videos
AHMR is an interdisciplinary peer-reviewed online journal created to encourage and facilitate the study of all aspects (socio-economic, political, legislative and developmental) of Human Mobility in Africa. Through the publication of original research, policy discussions and evidence research papers AHMR provides a comprehensive forum devoted exclusively to the analysis of contemporaneous trends, migration patterns and some of the most important migration-related issues.
Food safety, prepare for the unexpected - So what can be done in order to be ready to address food safety, food Consumers, food producers and manufacturers, food transporters, food businesses, food retailers can ...
Bharat Mata - History of Indian culture.pdfBharat Mata
Bharat Mata Channel is an initiative towards keeping the culture of this country alive. Our effort is to spread the knowledge of Indian history, culture, religion and Vedas to the masses.
1. Copyright Reform and Open Data
Peter Murray-Rust
contentmine.org
Improving Public Service Delivery through Open Data, InsideGovernment,
London, 2015-03-19
• Copyright restricts innovation
• Governments want to reform it to
generate wealth
• Content/text mining can generate wealth
• However: massive concerted opposition
2. The Right to Read is the Right to Mine
http://contentmine.org
3. Scientific and Medical publication (STM)[+]
• World Citizens pay $400,000,000,000…
• … for research in 1,500,000 articles …
• … cost $300,000 each to create …
• … $7000 each to “publish” [*]…
• … $10,000,000,000 from academic libraries …
• … to “publishers” who forbid access to 99.9% of citizens of
the world …
• 85% of medical research is wasted (not published, badly
conceived, duplicated, …)
[+] Figures probably +- 50 %
[*] arXiV preprint server costs $7 USD per paper
5. Open Content Mining of FACTs
Machines can interpret chemical reactions
We have done 500,000 patents. There are >
3,000,000 reactions/year. Added value > 1B Eur.
6. Typical Clinical Trial
Effect of drug over time. It takes ½ day to
extract data by hand. contentmine.org can
do it in 1 second, BUT will technically
break copyright.
7. Prof. Ian Hargreaves (2011): "David Cameron's
exam question”: "Could it be true that laws
designed more than three centuries ago with the
express purpose of creating economic incentives
for innovation by protecting creators' rights are
today obstructing innovation and economic
growth?”
“yes. We have found that the UK's intellectual
property framework, especially with regard to
copyright, is falling behind what is needed.” "Digital
Opportunity" by Prof Ian Hargreaves - http://www.ipo.gov.uk/ipreview.htm. Licensed under CC BY 3.0 via Wikipedia -
https://en.wikipedia.org/wiki/File:Digital_Opportunity.jpg#/media/File:Digital_Opportunity.jpg
8. http://www.jisc.ac.uk/reports/value-and-benefits-of-text-mining
In order to be 'mined', text must be accessed, copied,
analysed, annotated and related to existing
information and understanding. Even if the user has
access rights to the material, making annotated
copies can be illegal under current copyright law
without the permission of the copyright holder.
there is a new conundrum: the market intervention of
copyright – originally intended to protect creative
producers – may be inhibiting new knowledge discovery
and innovation.
It is often unclear whether text
mining is a permissible use
9. • “creative use of these large data sets in the US health care sector
could generate more than $300bn in value per annum” [MGI,
McKinsey]
• Gartner Inc. has identified 'Big Data' and 'Next-Generation
Analytics' as two of the 'Top 10 Strategic Technologies' for 2012.
• Given the volume of text generated by business, academic and
social activities – in for example competitor reports, research
publications or customer opinions on social networking sites – text
mining is, however, highly important. [JISC]
• there are some tasks that simply could not be achieved without
using text mining. For example, a major pharmaceutical company
used text mining tools to evaluate 50,000 patents in 18 months.
This would have taken 50 person years to achieve manually,
meaning that it would not even have been contemplated. [JISC]
“Big Data – and Analytics (ContentMining)
10. Hargreaves’ Recommendations 2011[*]
• Government should deliver copyright exceptions at
national level to realise all the opportunities within the
EU framework, including format shifting, parody, non-
commercial research, and library archiving.
• The UK should also promote at EU level an exception
to support text and data analytics. The UK should give
a lead at EU level to develop a further copyright
exception designed to build into the EU framework
adaptability to new technologies.
[*] Now UK law (2014-06 and 2014-10)
12. PUBLISHER TDM LICENCE INITIATIVES
GENERALLY DO NOT HELP
• Publishers have started offering their own TDM licences and policies
• Their licences often impose unfair (and in the case of the UK, unenforceable)
constraints on researchers’ freedom to exploit TDM, e.g., requiring users to
employ publisher’s API, putting unnecessary restrictions on how much can be
copied, or how fast it can be copied.
• Why “unenforceable”? Because, as noted earlier, UK law specifically states
that any contract or licence term that prevents anyone from doing TDM in the
manner prescribed in the new exception shall be deemed null and void.
• Really need a test case on these attempted restrictions.
• Springer and Royal Society offer generous TDM provisions.
• So why are so many publishers offering restrictive licences in the UK? Maybe
they hope licensees are ignorant of the strength of the new law, or the
publishers in fact don’t know about it. So they are either deliberately
misleading, or ignorant
Prof Charles Oppenheim and contentmine.org
13. As revealed by the 'public domain calculator’ established by
Europeana, there is a staggering complexity in the determination of
the different copyright term lengths in member states, some of
them requiring knowledge about the circumstances of the author's
death or about the situation of the author's heirs at the time of her
death –
Public domain material is frequently “re-copyrighted” (Copyfraud)
Which English storybook will never be public domain[1]?
[1]Peter Pan
The Public domain
14. The Right to Read is The Right To Mine
PMR in 2012: http://blog.okfn.org/2012/06/01/the-right-to-read-is-the-right-to-mine/
15. The Hague Declaration[*]
• Intellectual property was not designed to regulate the free flow of
facts, data and ideas, nor should it.
• Freedom to analyse and pursue intellectual curiosity without fear of
monitoring or repercussions must not be eroded in the digital
environment.
• Ethics around the use of data and content mining will need to continue
to evolve in response to changing technology
• Open access as “a comprehensive source of human knowledge and
cultural heritage” must be pursued in order to increase the uptake of
and equality of access to content mining technologies.
• Innovation and commercial research based on the use of facts, data,
and ideas should not be restricted by intellectual property law.
• [*] Drafted 2014-12 , convened by LIBER (Association of European
Research Libraries)
16. The Hague Declaration on Knowledge
Discovery in the Digital Age 2015
• The potential benefits of Text and Data Mining are vast and include:
•
• Addressing grand challenges such as climate change and global epidemics
• Improving population health, wealth and development
• Creating new jobs and employment
• Exponentially increasing the speed and progress of science through new
insights and greater efficiency of research
• Increasing transparency of Governments and their actions
• Fostering innovation and collaboration and boosting the impact of open
science
• Creating tools for education and research
• Providing new and richer cultural insights
• Speeding economic and social development in all parts of the globe
• LIBER – Association of European Research Libraries
19. Some of Reda’s 25 recommendations
• introduction of a single European Copyright Title based on Article
118 TFEU that would apply directly and uniformly across the Union,
• the EU legislator should further lower the barriers for re-use of
public sector information by exempting works produced by the
public sector - within the political, legal and administrative process -
from copyright protection;
• the ability to freely link from one resource to another is one of the
fundamental building blocks of the Internet;
• Emphasises that the exception for caricature, parody and pastiche
should apply regardless of the purpose of the parodic use;
• Stresses the need to enable automated analytical techniques for
text and data (e.g. 'text and data mining') for all purposes,
provided that the permission to read the work has been acquired;
20. Some Contacts/references
• JISC report on benefits of TDM - D. McDonald and U. Kelly, The
value and benefits of text mining (2012),
http://www.jisc.ac.uk/reports/value-and-benefits-of-text-mining
• Official guidance on the new UK copyright exception for TDM -
https://www.gov.uk/government/uploads/system/uploads/attachm
ent_data/file/315014/copyright-guidance-research.pdf
• Excellent general overview of the change to UK law and its
implications - http://copyrightuser.org/topics/text-and-data-
mining/ - provides link to the precise wording in the law
• http://contentmine.org
21. THE NEW UK LAW
• Came into force in June 2014
• Specific exception to copyright for TDM
• From the official guidance to the new exception issued by HMG: “Text and data mining usually requires
copying of the work to be analysed. An exception to copyright exists which allows researchers to make
copies of any copyright material for the purpose of computational analysis if they already have the right to
read the work (that is, they have ‘lawful access’ to the work). This exception only permits the making of
copies for the purpose of text and data mining for non-commercial research. Publishers and content
providers will be able to apply reasonable measures to maintain their network security or stability but
these measures should not prevent or unreasonably restrict researcher’s ability to text and data mine.
Contract terms that stop researchers making copies to carry out text and data mining will be
unenforceable.”
• Does not apply to database right. Interesting problem if a particular database enjoys both copyright and
database right!
• UK researchers do not have to ask for permission, pay fees, etc., to do such TDM
• What is, or is not “non-commercial”? Not always clear! The question must be asked at the time the TDM
was undertaken, so unexpected commercial benefits at the end of the project as OK, so long as at the time
the intent was non-commercial
• “Lawful access” usually means licensed content, whether OA or a subscription to the materials, but also
includes lawful access to printed works held in, say, a library
• Prof Charles Oppenheim and contentmine.org