La comparsa, pochi decenni fa, di Internet e della connettività globale ha dato origine ad un fenomeno assolutamente nuovo: un accumulo di enormi quantità di dati conservati in banche digitali, la cui quantità raddoppia ogni pochi giorni e in prospettiva ogni poche ore. E’ la realtà dei Big Data, di cui molto si parla e discute, sovente con toni entusiastici. Ma Big Data vuol dire anche problemi di utilizzo, di interpretazione e rischi di distorsioni. Se questo è rilevante per i dati che hanno un valore economico, l’accumulo di informazione e il come viene trattata ha risvolti altrettanto rilevanti sulla formazione di conoscenza.
Per affrontare queste sfide, cruciali sono il rapporto fra etica e scienza, l’analisi critica su come i dati vengono prodotti e proposti, e il coinvolgimento di tutti i soggetti sociali chiamati in causa.
12 settembre 2019 | Torino, Polo del '900
This discussion, covened by the Dubai Future Foundation, focusses on identifying the significance of the concept of well-being for social-science and policy; and the opportunities to measure it at scale.
Big data is prevalent in our daily life. Not surprisingly, big data becomes a hot topic discussedby commercial worlds, media, magazines, general publics and elsewhere. From academic point of view, isit a research area of potential worth being explored? Or it is just another hype? Are there only computer orIS related scholars suitable for big data research due to its nature? Or scholars from other research areas are alsosuitable for this subject? This study aims to answer these questions through the use of informetricsapproach and data source form the SSCI Journal database, leveraging informetric‟s robust natures ofquantitative power of analyze information in any form onto the data source of representativeness. This research shows that big data research is at its growth phase with an exponential growth patternsince 2012 and with great potential for years to come. And perhaps surprisingly, computer or IS relateddisciplinesare not on the top 5 research areas fromthis research results. In fact, the top five research disciplinesare more diversified then expected: business economics (#1), Government Law (#2), InformationScience/ Library Science (#3), Social Science (#4) and Computer Science (#5). Scholars from the USuniversities are the most productive in this subject while Asian countries, including Taiwan, are alsovisible. Besides, this study also identifies that big data publications from SSCI journal database during2005-2015 do fit Lotka‟s law. This study contributes tounderstand the current big data research trends and also show the ways toresearchers who are interested to conduct future research in big data regardless of their research backgrounds.
This presentation was provided by Glenn Hampson of Open Scholarship Initiative, during the NISO hot topic virtual conference "Open Research." The event was held on November 17, 2021.
Matching Uses and Protections for Government Data Releases: Presentation at t...Micah Altman
In the work included below, and presented at the Simons Institute, we describe work-in progress that aims to align emerging methods of data protections with research uses.
Privacy Gaps in Mediated Library Services: Presentation at NERCOMP2019Micah Altman
Libraries enable patrons to access a wide range of information, but much of the access to this information is now directly managedy publishers. This has lead to a significant gap across library values, patrons perception of privacy, and effective privacy protection for access to digital resources.
In the work included below, and presented at NERCOMP 2019, we review privacy principles based on ALA, IFLA, and NISO policies. We then organizing and comparing high level privacy protections required by ALA checklist, NISO, and GDPR. This framework of principles and controls is then used to score the privacy policies and practices of major vendors of research library content. We evaluate each element of the vendors privacy policy, and use instrumented browsers to identify the types of tracking mechanisms used by different vendors. We use this set of privacy scores to support analyses of change over time, and of potential gaps between patron expectations and privacy policies and practices.
Reproducibility from an infomatics perspectiveMicah Altman
Scientific reproducibility is most viewed through a methodological or statistical lens, and increasingly, through a computational lens. Over the last several years, I've taken part in collaborations to that approach reproducibility from the perspective of informatics: as a flow of information across a lifecycle that spans collection, analysis, publication, and reuse.
These slides sketch of this approach, and were presented at a recent workshop on reproducibility at the National Academy of Sciences, and at one our Program on Information Science brown bag talks. See: informatics.mit.edu
This presentation was provided by Gabriela Mejias of ORCID, during the NISO hot topic virtual conference "Open Research." The event was held on November 17, 2021.
Managing Confidential Information – Trends and ApproachesMicah Altman
Personal information is ubiquitous and it is becoming increasingly easy to link information to individuals. Laws, regulations and policies governing information privacy are complex, but most intervene through either access or anonymization at the time of data publication.
Trends in information collection and management -- cloud storage, "big" data, and debates about the right to limit access to published but personal information complicate data management, and make traditional approaches to managing confidential data decreasingly effective.
This session presented as part of the the Program on Information Science seminar series, examines trends information privacy. And the session will also discuss emerging approaches and research around managing confidential research information throughout its lifecycle.
This discussion, covened by the Dubai Future Foundation, focusses on identifying the significance of the concept of well-being for social-science and policy; and the opportunities to measure it at scale.
Big data is prevalent in our daily life. Not surprisingly, big data becomes a hot topic discussedby commercial worlds, media, magazines, general publics and elsewhere. From academic point of view, isit a research area of potential worth being explored? Or it is just another hype? Are there only computer orIS related scholars suitable for big data research due to its nature? Or scholars from other research areas are alsosuitable for this subject? This study aims to answer these questions through the use of informetricsapproach and data source form the SSCI Journal database, leveraging informetric‟s robust natures ofquantitative power of analyze information in any form onto the data source of representativeness. This research shows that big data research is at its growth phase with an exponential growth patternsince 2012 and with great potential for years to come. And perhaps surprisingly, computer or IS relateddisciplinesare not on the top 5 research areas fromthis research results. In fact, the top five research disciplinesare more diversified then expected: business economics (#1), Government Law (#2), InformationScience/ Library Science (#3), Social Science (#4) and Computer Science (#5). Scholars from the USuniversities are the most productive in this subject while Asian countries, including Taiwan, are alsovisible. Besides, this study also identifies that big data publications from SSCI journal database during2005-2015 do fit Lotka‟s law. This study contributes tounderstand the current big data research trends and also show the ways toresearchers who are interested to conduct future research in big data regardless of their research backgrounds.
This presentation was provided by Glenn Hampson of Open Scholarship Initiative, during the NISO hot topic virtual conference "Open Research." The event was held on November 17, 2021.
Matching Uses and Protections for Government Data Releases: Presentation at t...Micah Altman
In the work included below, and presented at the Simons Institute, we describe work-in progress that aims to align emerging methods of data protections with research uses.
Privacy Gaps in Mediated Library Services: Presentation at NERCOMP2019Micah Altman
Libraries enable patrons to access a wide range of information, but much of the access to this information is now directly managedy publishers. This has lead to a significant gap across library values, patrons perception of privacy, and effective privacy protection for access to digital resources.
In the work included below, and presented at NERCOMP 2019, we review privacy principles based on ALA, IFLA, and NISO policies. We then organizing and comparing high level privacy protections required by ALA checklist, NISO, and GDPR. This framework of principles and controls is then used to score the privacy policies and practices of major vendors of research library content. We evaluate each element of the vendors privacy policy, and use instrumented browsers to identify the types of tracking mechanisms used by different vendors. We use this set of privacy scores to support analyses of change over time, and of potential gaps between patron expectations and privacy policies and practices.
Reproducibility from an infomatics perspectiveMicah Altman
Scientific reproducibility is most viewed through a methodological or statistical lens, and increasingly, through a computational lens. Over the last several years, I've taken part in collaborations to that approach reproducibility from the perspective of informatics: as a flow of information across a lifecycle that spans collection, analysis, publication, and reuse.
These slides sketch of this approach, and were presented at a recent workshop on reproducibility at the National Academy of Sciences, and at one our Program on Information Science brown bag talks. See: informatics.mit.edu
This presentation was provided by Gabriela Mejias of ORCID, during the NISO hot topic virtual conference "Open Research." The event was held on November 17, 2021.
Managing Confidential Information – Trends and ApproachesMicah Altman
Personal information is ubiquitous and it is becoming increasingly easy to link information to individuals. Laws, regulations and policies governing information privacy are complex, but most intervene through either access or anonymization at the time of data publication.
Trends in information collection and management -- cloud storage, "big" data, and debates about the right to limit access to published but personal information complicate data management, and make traditional approaches to managing confidential data decreasingly effective.
This session presented as part of the the Program on Information Science seminar series, examines trends information privacy. And the session will also discuss emerging approaches and research around managing confidential research information throughout its lifecycle.
"Reproducibility from the Informatics Perspective"Micah Altman
Dr. Altman will provide expert comment on the need for informatics modeling as part of the National Academies workshop: Statistical Challenges in Assessing and Fostering the Reproducibility of Scientific Results
This workshop focuses on the topic of addressing statistical challenges in assessing and fostering the reproducibility of scientific results by examining three issues from a statistical perspective: the extent of reproducibility, the causes of reproducibility failures, and potential remedies.
This document discusses methods for measuring the impact of citizen science projects online. It describes the development of a framework called MICS (Measuring Impact of Citizen Science) for assessing citizen science impact. MICS includes indicators for different domains like society, science, economy, environment and governance. The framework provides characteristics for each indicator such as its name, description, data type, and how data should be collected and analyzed. Case studies are being used to help implement and refine the MICS framework.
Over the past 10 years, research systems have evolved from systems that focused on how to structure and record information on research, to systems capable of allowing significant insights to be derived based upon years of high quality information. In 2015, the maturity of the information now collected within many Current Research Information Systems, and the insights that this can provide is of equal or greater value than the insights that could be gleaned from established externally provided research metrics platforms alone. The ability to intersect these external and internal worlds provides new levels of strategic insight not previously available. With the addition of platforms that track altmetrics, and their ability to connect university publications data with a constant flow of real time attention level metrics, an image of a dynamic network of systems emerges, connected together by ever turning ‘cogs’ pushing and translating information. Add to this, the success of ORCID as pervasive researcher identifier infrastructure, and CASRAI as the emerging social contract for information exchange, and it becomes possible to extend this network back from the systems that track and record research information, through to the platforms through which research knowledge is created. The ‘Mechanics’ of this network of systems is more than just getting the ‘plumbing’ right. As research information moves through the network, its audience and purpose changes, the requirements for contextual metadata can also change. This presentation will explore the lived experience of Research Data Mechanics at Digital Science though illustrating how connections between Figshare, Altmetric, Symplectic Elements, and Dimensions can both enhance research system capability and reduce the burden on researchers, and research administration.
Infrastructure and practices for data citation have made substantial progress over the last decade. This increases the potential rewards for data publication and reproducible science, however overall incentives remain relatively weak.
authorsNote: This summarizes a presentation given at the *National Academies of Sciences* as part of [Data Citation Workshop: Developing Policy And Practice*](http://sites.nationalacademies.org/pga/brdi/index.htm) .
Meeting Federal Research Requirements for Data Management Plans, Public Acces...ICPSR
These slides cover evolving federal research requirements for sharing scientific data. Provided are updates on federal agency responses to the 2013 OSTP memo, guidance on data management plans, resources for data management and curation training for staff/researchers, and tips for evaluating public data-sharing services. ICPSR's public data-sharing service, openICPSR, is also presented. Recording of this presentation is here: https://www.youtube.com/watch?v=2_erMkASSv4&feature=youtu.be
Linking Data to Publications through Citation and Virtual ArchivesMicah Altman
This document discusses linking data to publications through citation and virtual archives. It argues that data citation and sharing infrastructure are necessary for scientific reproducibility and open data. It outlines elements of data management plans and requirements for data sharing infrastructure, including persistence, provenance, access control and incentives. The document advocates for data citations as first-class objects and emerging practices like assigning DOIs to datasets. It presents several use cases for the Dataverse network, a virtual archive designed for research data sharing through federated and organizational models.
This document summarizes a presentation about meeting federal data sharing requirements. It discusses the history of these requirements and defines good practices for data sharing and stewardship. It also reviews some public data sharing services and provides tips for evaluating them. Key aspects of good data sharing include maximizing access, protecting privacy, ensuring proper attribution, and having long-term preservation and sustainability plans. The presenter emphasizes that restricted-use or sensitive data can be effectively shared through secure virtual environments.
Big data, new epistemologies and paradigm shiftsrobkitchin
Big data and new data analytics are transforming research across disciplines by enabling new methods of data generation, collection, and analysis. This allows researchers to ask and answer questions in new ways. While opportunities exist to develop more sophisticated models and insights, there are also concerns about reductionism and losing nuance. In the social sciences and humanities, both opportunities and challenges exist regarding quantitative and qualitative approaches. Overall, while new paradigms may emerge, pluralism in methods and approaches is likely to continue across disciplines.
Assessing Digital Output in New Ways
Mike Taylor, Research Specialist, Elsevier Labs
Presented during NISO/BISG 8th Annual Changing Standards Landscape on June 27, 2014
Brown Bag: New Models of Scholarly Communication for Digital Scholarship, by ...Micah Altman
In his talk for the MIT Libraries Program on Information Science, Steve Griffin discusses how how research libraries can play a key and expanded role in enabling digital scholarship and creating the supporting activities that sustain it.
Research Data Sharing and Re-Use: Practical Implications for Data Citation Pr...SC CTSI at USC and CHLA
Date: Apr 4, 2018
Speaker: Hyoungjoo Park, PhD candidate, School of Information Studies, University of Wisconsin-Milwaukee, and Dietmar Wolfram, PhD
Overview: It is increasingly common for researchers to make their data freely available. This is often a requirement of funding agencies but also consistent with the principles of open science, according to which all research data should be shared and made available for reuse. Once data is reused, the researchers who have provided access to it should be acknowledged for their contributions, much as authors are recognised for their publications through citation. Hyoungjoo Park and Dietmar Wolfram have studied characteristics of data sharing, reuse, and citation and found that current data citation practices do not yet benefit data sharers, with little or no consistency in their format. More formalised citation practices might encourage more authors to make their data available for reuse.
Instructional Data Sets from Q-step Launch Event (Univ of Exeter) 3-20-2014ICPSR
Presentation about using social science data in the classroom and creating (and finding) resources with which to do it. Addresses both substantive courses and research methods/statistics courses.
Digital Scholar Webinar: Recruiting Research Participants Online Using RedditSC CTSI at USC and CHLA
This 50-minute presentation introduces r/SampleSize, a community on the website Reddit that allows for online participant recruitment without compulsory or immediate payment. It will provide an overview of best practices for recruiting participants on r/SampleSize. It will also compare r/SampleSize to Amazon Mechanical Turk (MTurk), a widely used crowdsourcing platform for recruiting research participants.
This presentation was provided by Melissa Levine of the University of Michigan during a NISO Virtual Conference on the topic of data curation, held on Wednesday, August 31, 2016
MIT Program on Information Science Talk -- Ophir Frieder on Searching in Hars...Micah Altman
Ophir Frieder, who holds the Robert L. McDevitt, K.S.G., K.C.H.S. and Catherine H. McDevitt L.C.H.S. Chair in Computer Science and Information Processing at Georgetown University, gave this talk on Searching in Harsh Environments as part of the Program on Information Science Brown Bag Series.
In the talk, illustrated by the slides below, Ophir rebuts the myth that "google has solved search", and discusses the challenges of searching for complex object, through hidden collections, and in harsh environments For more see: http://informatics.mit.edu/blg
FAIR for the future: embracing all things dataARDC
FAIR for the future: embracing all things data - Natasha Simons, Keith Russell and Liz Stokes, presented at Taylor & Francis Scholarly Summits in Sydney 11 Feb 2019 and Melbourne 14 Feb 2019.
This presentation was provided by Clara Llebot of Oregon State University, during the NISO hot topic virtual conference "Effective Data Management," which was held on September 29, 2021.
Agencies such as the NSF and NIH require data management plans as part of research proposals and the Office of Science and Technology Policy (OSTP) is requiring federal agencies to develop plans to increase public access to results of federally funded scientific research. These slides explore sustainable data sharing models, including models for sharing restricted-use data. Demos of these models and tips for accessing public data access services are provided as well as resources for creating data management plans for grant applications.
Managing Ireland's Research Data - 3 Research MethodsRebecca Grant
Slides providing an overview of the research methods used in the author's thesis, "Managing Ireland's Research Data: Recognising Roles for Recordkeepers". The methods discussed are online surveys, comparative case studies, and autoethnography.
Licensed as CC-BY.
Presentation during the 14th Association of African Universities (AAU) Conference and African Open Science Platform (AOSP)/Research Data Alliance (RDA) Workshop in Accra, Ghana, 7-8 June 2017.
"Reproducibility from the Informatics Perspective"Micah Altman
Dr. Altman will provide expert comment on the need for informatics modeling as part of the National Academies workshop: Statistical Challenges in Assessing and Fostering the Reproducibility of Scientific Results
This workshop focuses on the topic of addressing statistical challenges in assessing and fostering the reproducibility of scientific results by examining three issues from a statistical perspective: the extent of reproducibility, the causes of reproducibility failures, and potential remedies.
This document discusses methods for measuring the impact of citizen science projects online. It describes the development of a framework called MICS (Measuring Impact of Citizen Science) for assessing citizen science impact. MICS includes indicators for different domains like society, science, economy, environment and governance. The framework provides characteristics for each indicator such as its name, description, data type, and how data should be collected and analyzed. Case studies are being used to help implement and refine the MICS framework.
Over the past 10 years, research systems have evolved from systems that focused on how to structure and record information on research, to systems capable of allowing significant insights to be derived based upon years of high quality information. In 2015, the maturity of the information now collected within many Current Research Information Systems, and the insights that this can provide is of equal or greater value than the insights that could be gleaned from established externally provided research metrics platforms alone. The ability to intersect these external and internal worlds provides new levels of strategic insight not previously available. With the addition of platforms that track altmetrics, and their ability to connect university publications data with a constant flow of real time attention level metrics, an image of a dynamic network of systems emerges, connected together by ever turning ‘cogs’ pushing and translating information. Add to this, the success of ORCID as pervasive researcher identifier infrastructure, and CASRAI as the emerging social contract for information exchange, and it becomes possible to extend this network back from the systems that track and record research information, through to the platforms through which research knowledge is created. The ‘Mechanics’ of this network of systems is more than just getting the ‘plumbing’ right. As research information moves through the network, its audience and purpose changes, the requirements for contextual metadata can also change. This presentation will explore the lived experience of Research Data Mechanics at Digital Science though illustrating how connections between Figshare, Altmetric, Symplectic Elements, and Dimensions can both enhance research system capability and reduce the burden on researchers, and research administration.
Infrastructure and practices for data citation have made substantial progress over the last decade. This increases the potential rewards for data publication and reproducible science, however overall incentives remain relatively weak.
authorsNote: This summarizes a presentation given at the *National Academies of Sciences* as part of [Data Citation Workshop: Developing Policy And Practice*](http://sites.nationalacademies.org/pga/brdi/index.htm) .
Meeting Federal Research Requirements for Data Management Plans, Public Acces...ICPSR
These slides cover evolving federal research requirements for sharing scientific data. Provided are updates on federal agency responses to the 2013 OSTP memo, guidance on data management plans, resources for data management and curation training for staff/researchers, and tips for evaluating public data-sharing services. ICPSR's public data-sharing service, openICPSR, is also presented. Recording of this presentation is here: https://www.youtube.com/watch?v=2_erMkASSv4&feature=youtu.be
Linking Data to Publications through Citation and Virtual ArchivesMicah Altman
This document discusses linking data to publications through citation and virtual archives. It argues that data citation and sharing infrastructure are necessary for scientific reproducibility and open data. It outlines elements of data management plans and requirements for data sharing infrastructure, including persistence, provenance, access control and incentives. The document advocates for data citations as first-class objects and emerging practices like assigning DOIs to datasets. It presents several use cases for the Dataverse network, a virtual archive designed for research data sharing through federated and organizational models.
This document summarizes a presentation about meeting federal data sharing requirements. It discusses the history of these requirements and defines good practices for data sharing and stewardship. It also reviews some public data sharing services and provides tips for evaluating them. Key aspects of good data sharing include maximizing access, protecting privacy, ensuring proper attribution, and having long-term preservation and sustainability plans. The presenter emphasizes that restricted-use or sensitive data can be effectively shared through secure virtual environments.
Big data, new epistemologies and paradigm shiftsrobkitchin
Big data and new data analytics are transforming research across disciplines by enabling new methods of data generation, collection, and analysis. This allows researchers to ask and answer questions in new ways. While opportunities exist to develop more sophisticated models and insights, there are also concerns about reductionism and losing nuance. In the social sciences and humanities, both opportunities and challenges exist regarding quantitative and qualitative approaches. Overall, while new paradigms may emerge, pluralism in methods and approaches is likely to continue across disciplines.
Assessing Digital Output in New Ways
Mike Taylor, Research Specialist, Elsevier Labs
Presented during NISO/BISG 8th Annual Changing Standards Landscape on June 27, 2014
Brown Bag: New Models of Scholarly Communication for Digital Scholarship, by ...Micah Altman
In his talk for the MIT Libraries Program on Information Science, Steve Griffin discusses how how research libraries can play a key and expanded role in enabling digital scholarship and creating the supporting activities that sustain it.
Research Data Sharing and Re-Use: Practical Implications for Data Citation Pr...SC CTSI at USC and CHLA
Date: Apr 4, 2018
Speaker: Hyoungjoo Park, PhD candidate, School of Information Studies, University of Wisconsin-Milwaukee, and Dietmar Wolfram, PhD
Overview: It is increasingly common for researchers to make their data freely available. This is often a requirement of funding agencies but also consistent with the principles of open science, according to which all research data should be shared and made available for reuse. Once data is reused, the researchers who have provided access to it should be acknowledged for their contributions, much as authors are recognised for their publications through citation. Hyoungjoo Park and Dietmar Wolfram have studied characteristics of data sharing, reuse, and citation and found that current data citation practices do not yet benefit data sharers, with little or no consistency in their format. More formalised citation practices might encourage more authors to make their data available for reuse.
Instructional Data Sets from Q-step Launch Event (Univ of Exeter) 3-20-2014ICPSR
Presentation about using social science data in the classroom and creating (and finding) resources with which to do it. Addresses both substantive courses and research methods/statistics courses.
Digital Scholar Webinar: Recruiting Research Participants Online Using RedditSC CTSI at USC and CHLA
This 50-minute presentation introduces r/SampleSize, a community on the website Reddit that allows for online participant recruitment without compulsory or immediate payment. It will provide an overview of best practices for recruiting participants on r/SampleSize. It will also compare r/SampleSize to Amazon Mechanical Turk (MTurk), a widely used crowdsourcing platform for recruiting research participants.
This presentation was provided by Melissa Levine of the University of Michigan during a NISO Virtual Conference on the topic of data curation, held on Wednesday, August 31, 2016
MIT Program on Information Science Talk -- Ophir Frieder on Searching in Hars...Micah Altman
Ophir Frieder, who holds the Robert L. McDevitt, K.S.G., K.C.H.S. and Catherine H. McDevitt L.C.H.S. Chair in Computer Science and Information Processing at Georgetown University, gave this talk on Searching in Harsh Environments as part of the Program on Information Science Brown Bag Series.
In the talk, illustrated by the slides below, Ophir rebuts the myth that "google has solved search", and discusses the challenges of searching for complex object, through hidden collections, and in harsh environments For more see: http://informatics.mit.edu/blg
FAIR for the future: embracing all things dataARDC
FAIR for the future: embracing all things data - Natasha Simons, Keith Russell and Liz Stokes, presented at Taylor & Francis Scholarly Summits in Sydney 11 Feb 2019 and Melbourne 14 Feb 2019.
This presentation was provided by Clara Llebot of Oregon State University, during the NISO hot topic virtual conference "Effective Data Management," which was held on September 29, 2021.
Agencies such as the NSF and NIH require data management plans as part of research proposals and the Office of Science and Technology Policy (OSTP) is requiring federal agencies to develop plans to increase public access to results of federally funded scientific research. These slides explore sustainable data sharing models, including models for sharing restricted-use data. Demos of these models and tips for accessing public data access services are provided as well as resources for creating data management plans for grant applications.
Managing Ireland's Research Data - 3 Research MethodsRebecca Grant
Slides providing an overview of the research methods used in the author's thesis, "Managing Ireland's Research Data: Recognising Roles for Recordkeepers". The methods discussed are online surveys, comparative case studies, and autoethnography.
Licensed as CC-BY.
Presentation during the 14th Association of African Universities (AAU) Conference and African Open Science Platform (AOSP)/Research Data Alliance (RDA) Workshop in Accra, Ghana, 7-8 June 2017.
This document discusses the ethical issues that arise in social research using big data. It begins by explaining how big data differs from traditional research data in that it was not typically collected for research purposes. The main ethical issues discussed are privacy, informed consent, de-identification, and inequality due to the digital divide. The document provides examples and discusses how these issues pose new challenges for big data research. It emphasizes that while the risks are real, opportunities also exist if research is conducted ethically with guidance and consideration of issues like privacy, consent, and data provenance.
The Challenges of Making Data Travel, by Sabina LeonelliLEARN Project
1st LEARN Workshop. Embedding Research Data as part of the research cycle. 29 Jan 2016. Presentation by Sabina Leonelli, Exeter Centre for the Study of Life Sciences (Egenis) & Department of Sociology, Philosophy and Anthropology, University of Exeter
Alain Frey Research Data for universities and information producersIncisive_Events
Research data is growing exponentially but is disparate and challenging to understand fully. Universities face challenges in managing research data to meet funding and standards requirements. Thomson Reuters launched the Data Citation Index to make research data discoverable, accessible, and citable by bringing important data from diverse repositories into one searchable index. This addresses the need for a single access point for quality research data across disciplines and locations.
Lecture series: Using trace data or subjective data, that is the question dur...Bart Rienties
In this lecture series Bart Rienties (Professor of Learning Analytics, head of Academic Professional Development) will discuss how from the safety of your home you could use existing trace data to explore interactions between people (e.g., Twitter data, engagement data in a virtual learning environment, public data sets), and what the affordances and limitations of these trace data might be. Furthermore, he will discuss how other ways of collecting subjective data (e.g., surveys, interviews) might strengthen our understandings of complex interactions between people.
There are no prior requirements to join, and everyone is welcome. For those with a technical background you may enjoy this recent paper in PLOS ONE https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0233977. For those with a non-technical background, you may enjoy this paper https://journals.sfu.ca/flr/index.php/journal/article/view/348
Elsevier CWTS Open Data Report Presentation at RDA meeting in Barcelona Elsevier
The Open Data report is a result of a year-long, co-conducted study between Elsevier and the Centre for Science and Technology Studies (CWTS), part of Leiden University, the Netherlands. The study is based on a complementary methods approach consisting of a quantitative analysis of bibliometric and publication data, a global survey of 1,200 researchers and three case studies including in-depth interviews with key individuals involved in data collection, analysis and deposition in the fields of soil science, human genetics and digital humanities.
Privacy in Research Data Managemnt - Use CasesMicah Altman
From Integrating Approaches to Privacy across the Research Lifecycle http://privacytools.seas.harvard.edu/fall-2013-workshop
This workshop will consider how emerging tools and perspectives from a variety of disciplines, such as computer science, social science, law, and the health sciences, should be integrated in the management of confidential research data. Multidisciplinary discussion groups will grapple with these issues in the context of exemplar research use cases.
Cyber Summit 2016: Privacy Issues in Big Data Sharing and ReuseCybera Inc.
This document summarizes a presentation on big data and data reuse given by Bart Custers. It discusses:
1) The Eudeco project which examines big data and data reuse from legal, societal, economic, and technological perspectives across multiple European countries.
2) Issues with data sharing and reuse, including potential privacy violations, discrimination, lack of transparency, and unintended consequences from new uses of data or placing it in new contexts.
3) Potential solutions discussed, including privacy impact assessments, privacy by design, and new approaches focusing more on transparency and responsibility than restricting data access and use.
This presentation summarizes Jennifer Tucker's dissertation study, entitled “Motivating Subjects: Data Sharing in Cancer Research.” The research focused on the motivational factors that influence a researcher’s decision to share data.
Open Science policies can help achieve the UN Sustainable Development Goals through open data practices. Key elements of an effective open science policy include open access, open research, and open data policies. It also requires addressing issues of data justice, developing fair and interoperable data standards, and implementing policies that maximize the reuse and public impact of research data. Effective policies also engage stakeholders, advocate for open research, and link funding policies to open science goals. Surveys show more work is needed as most institutions still lack clear open data and open research data guidelines.
Data Management and Broader Impacts: a holistic approachMegan O'Donnell
This document summarizes a presentation on taking a holistic approach to data management and broader impacts. It discusses the National Science Foundation's broader impacts criterion, which requires research to benefit society. It argues that examining data through a broader impacts lens highlights the benefits of good data management, data management plans, and the value of data information literacy skills. Taking this holistic approach can help researchers understand why data management plans are important, justify spending more time on data practices, and encourage embracing data sharing.
Simon Hodson discusses key aspects of open science including open access to research outputs, FAIR data principles, and engaging society. Open science requires addressing technical, funding, skills, and mindset challenges. While data created with public funds should be open by default, legitimate exceptions exist for commercial interests, privacy, and security. Criteria for data appraisal, selection and preservation need input from disciplines. Barriers to data sharing include concerns over misuse and lack of credit, while benefits include advancing research and building institutional reputation. Open science governance is needed to balance openness with other priorities like intellectual property, and define roles and responsibilities among stakeholders.
This document discusses open science and FAIR data principles. It begins by outlining the benefits of open data, including enabling reproducibility, avoiding replication gaps, and allowing data reuse and reinterpretation. Open data practices have transformed areas like genomics and astronomy. FAIR data principles help enable large-scale data use and machine analysis. The document then defines open science, including open access, open data, FAIR data principles, and engagement with society. It discusses frameworks for developing open data strategies at the national and institutional levels. These include developing policies, incentives, skills training, and data infrastructure. While open data brings benefits, it also requires investment and cultural changes to fully realize. Stakeholders like government and research institutions can benefit
This document summarizes the key findings from a study examining open data practices among researchers globally. The study used a complementary methods approach, including a bibliometric analysis, global survey of 1,200 researchers, and 3 case studies.
The main findings were:
1) Data sharing practices vary significantly by field, with some fields having data sharing integrated into research and others not.
2) While most researchers recognize benefits of data sharing, it is not yet widespread in practice, with less than 15% sharing data in repositories.
3) Barriers to data sharing include a lack of incentives, training, and perception of data as personally owned.
4) To increase data sharing, policies need to incentiv
Big Data & Privacy -- Response to White House OSTPMicah Altman
Big data has huge implications for privacy, as summarized in our commentary below:
Both the government and third parties have the potential to collect extensive (sometimes exhaustive), fine grained, continuous, and identifiable records of a person’s location, movement history, associations and interactions with others, behavior, speech, communications, physical and medical conditions, commercial transactions, etc. Such “big data” has the ability to be used in a wide variety of ways, both positive and negative. Examples of potential applications include improving government and organizational transparency and accountability, advancing research and scientific knowledge, enabling businesses to better serve their customers, allowing systematic commercial and non-commercial manipulation, fostering pervasive discrimination, and surveilling public and private spheres.
On January 23, 2014, President Obama asked John Podesta to develop in 90 days, a 'comprehensive review' on big data and privacy.
This lead to a series of workshop on big data and technology at MIT, and on social cultural & ethical dimensions at NYU, with a third planned to discuss legal issues at Berkeley. A number of colleagues from our Privacy Tools for Research project and from the BigData@CSAIL projects have contributed to these workshops and raised many thoughtful issues (and the workshop sessions are online and well worth watching).
My colleagues at the Berkman Center, David O'Brien, Alexandra Woods, Salil Vadhan and I have submitted responses to these questions that outline a broad, comprehensive, and systematic framework for analyzing these types of questions and taxonomize a variety of modern technological, statistical, and cryptographic approaches to simultaneously providing privacy and utility. This comment is made on behalf of the Privacy Tools for Research Project, of which we are a part, and has benefitted from extensive commentary by the other project collaborators.
BioPharma and FAIR Data, a Collaborative AdvantageTom Plasterer
The concept of FAIR (Findable, Accessible, Interoperable and Reusable) data is becoming a reality as stakeholders from industry, academia, funding agencies and publishers are embracing this approach. For BioPharma being able to effectively share and reuse data is a tremendous competitive advantage, within a company, with peer organizations, key opinion leaders and regulatory agencies. A few key drivers, success stories and preliminary results of an industry data stewardship survey are presented.
This document contains the outline for a session on data management and sharing from the Force 11 Scholarly Communications Institute Summer School. The 30 minute session will discuss what constitutes research data, getting started with data management, and the value and benefits of open data sharing versus closed or restricted models. It provides examples of the economic value of open research data in Australia and the United States. It also explores what types of data may never be openly shared and discusses mediated access to certain types of sensitive data.
Data Sharing with ICPSR: Fueling the Cycle of Science through Discovery, Acce...ICPSR
Data Sharing with ICPSR was presented at IASSIST 2015 in Minneapolis, MN.
The learning objectives and content cover:
- Federal data sharing requirements and
other good reasons to share data
• Options for sharing data
• Protection of confidentiality when
sharing data
• Data discovery tools
• Online data exploration tools from ICPSR
Similar to La ricerca scientifica nell'era dei Big Data - Sabina Leonelli (20)
Intervento alla conferenza Torino e la battaglia mondiale dell'auto elettrica, tenutasi il 21 novembre 2019 al Politecnico di Torino.
Incontro a cura di Coordinamento Ingegneri e Tecnici, Centro di Documentazione Antonio Labriola, ISMEL
Oltre il '68
Conversazioni sul libro
Giovedì 24 ottobre ore 17
Polo del '900, Sala '900
Due educatori in viaggio nella provvisoria reale utopia, dalla segregazione all'integrazione sociale.
Gli autori Gianni Garena e Luciano Tosco, accompagnati in un viaggio dalla fine degli anni 60 ad oggi da Eleonora Artesio della Società di Mutuo Soccorso Solidea, propongono una Conversazione con attori e protagonisti di quegli anni e di oggi.
Interverranno in sequenza
Giovanni Ferrero Presidente Ismel
Nicoletta Fratta Presidente Cooperativa sociale Il Margine
Danila Mezzano Presidente Consorzio Naos, Cooperativa sociale Progetto Muret
Fiorenzo Alfieri già assessore Comune di Torino
Giancarlo Gonella Presidente Legacoop Piemonte
Gianfranco Marocchi Direttore Rivista Welfare Oggi e Direttore Biennale della Prossimità
Incursioni a cura degli attori dell’Associazione Arcobaleno con letture del libro Oltre il '68
Oltre il '68
Conversazioni sul libro
Giovedì 24 ottobre ore 17
Polo del '900, Sala '900
Due educatori in viaggio nella provvisoria reale utopia, dalla segregazione all'integrazione sociale.
Gli autori Gianni Garena e Luciano Tosco, accompagnati in un viaggio dalla fine degli anni 60 ad oggi da Eleonora Artesio della Società di Mutuo Soccorso Solidea, propongono una Conversazione con attori e protagonisti di quegli anni e di oggi.
Interverranno in sequenza
Giovanni Ferrero Presidente Ismel
Nicoletta Fratta Presidente Cooperativa sociale Il Margine
Danila Mezzano Presidente Consorzio Naos, Cooperativa sociale Progetto Muret
Fiorenzo Alfieri già assessore Comune di Torino
Giancarlo Gonella Presidente Legacoop Piemonte
Gianfranco Marocchi Direttore Rivista Welfare Oggi e Direttore Biennale della Prossimità
Incursioni a cura degli attori dell’Associazione Arcobaleno con letture del libro Oltre il '68
Oltre il '68
Conversazioni sul libro
Giovedì 24 ottobre ore 17
Polo del '900, Sala '900
Due educatori in viaggio nella provvisoria reale utopia, dalla segregazione all'integrazione sociale.
Gli autori Gianni Garena e Luciano Tosco, accompagnati in un viaggio dalla fine degli anni 60 ad oggi da Eleonora Artesio della Società di Mutuo Soccorso Solidea, propongono una Conversazione con attori e protagonisti di quegli anni e di oggi.
Interverranno in sequenza
Giovanni Ferrero Presidente Ismel
Nicoletta Fratta Presidente Cooperativa sociale Il Margine
Danila Mezzano Presidente Consorzio Naos, Cooperativa sociale Progetto Muret
Fiorenzo Alfieri già assessore Comune di Torino
Giancarlo Gonella Presidente Legacoop Piemonte
Gianfranco Marocchi Direttore Rivista Welfare Oggi e Direttore Biennale della Prossimità
Incursioni a cura degli attori dell’Associazione Arcobaleno con letture del libro Oltre il '68
Oltre il '68
Conversazioni sul libro
Giovedì 24 ottobre ore 17
Polo del '900, Sala '900
Due educatori in viaggio nella provvisoria reale utopia, dalla segregazione all'integrazione sociale.
Gli autori Gianni Garena e Luciano Tosco, accompagnati in un viaggio dalla fine degli anni 60 ad oggi da Eleonora Artesio della Società di Mutuo Soccorso Solidea, propongono una Conversazione con attori e protagonisti di quegli anni e di oggi.
Interverranno in sequenza
Giovanni Ferrero Presidente Ismel
Nicoletta Fratta Presidente Cooperativa sociale Il Margine
Danila Mezzano Presidente Consorzio Naos, Cooperativa sociale Progetto Muret
Fiorenzo Alfieri già assessore Comune di Torino
Giancarlo Gonella Presidente Legacoop Piemonte
Gianfranco Marocchi Direttore Rivista Welfare Oggi e Direttore Biennale della Prossimità
Incursioni a cura degli attori dell’Associazione Arcobaleno con letture del libro Oltre il '68
Oltre il '68
Conversazioni sul libro
Giovedì 24 ottobre ore 17
Polo del '900, Sala '900
Due educatori in viaggio nella provvisoria reale utopia, dalla segregazione all'integrazione sociale.
Gli autori Gianni Garena e Luciano Tosco, accompagnati in un viaggio dalla fine degli anni 60 ad oggi da Eleonora Artesio della Società di Mutuo Soccorso Solidea, propongono una Conversazione con attori e protagonisti di quegli anni e di oggi.
Interverranno in sequenza
Giovanni Ferrero Presidente Ismel
Nicoletta Fratta Presidente Cooperativa sociale Il Margine
Danila Mezzano Presidente Consorzio Naos, Cooperativa sociale Progetto Muret
Fiorenzo Alfieri già assessore Comune di Torino
Giancarlo Gonella Presidente Legacoop Piemonte
Gianfranco Marocchi Direttore Rivista Welfare Oggi e Direttore Biennale della Prossimità
Incursioni a cura degli attori dell’Associazione Arcobaleno con letture del libro Oltre il '68
Oltre il '68
Conversazioni sul libro
Giovedì 24 ottobre ore 17
Polo del '900, Sala '900
Due educatori in viaggio nella provvisoria reale utopia, dalla segregazione all'integrazione sociale.
Gli autori Gianni Garena e Luciano Tosco, accompagnati in un viaggio dalla fine degli anni 60 ad oggi da Eleonora Artesio della Società di Mutuo Soccorso Solidea, propongono una Conversazione con attori e protagonisti di quegli anni e di oggi.
Interverranno in sequenza
Giovanni Ferrero Presidente Ismel
Nicoletta Fratta Presidente Cooperativa sociale Il Margine
Danila Mezzano Presidente Consorzio Naos, Cooperativa sociale Progetto Muret
Fiorenzo Alfieri già assessore Comune di Torino
Giancarlo Gonella Presidente Legacoop Piemonte
Gianfranco Marocchi Direttore Rivista Welfare Oggi e Direttore Biennale della Prossimità
Incursioni a cura degli attori dell’Associazione Arcobaleno con letture del libro Oltre il '68
Torino Automotive Heritage
Seminario e dibattito pubblico
Venerdì 18 ottobre ore 15-18
Polo del '900, Sala Conferenze
Presentazione pubblica del progetto di network per la valorizzazione di Torino “città dell’automobile”, dal passato al futuro.
La fine di una stagione della fabbrica, innovazione e complessa trasformazione dell’automotive, pongono le condizioni per una storicizzazione e un nuovo brand di turismo industriale accessibile. Il seminario coinvolge esperti, aziende, amministratori e cittadini, per un progetto di conoscenza, di ricerca, di comunicazione con la costruzione di percorsi di visita e multimediali per l’integrazione delle reti virtuali del patrimonio legato all’automobilismo fino alla costruzione di un sistema della legacy automobilistica (architetture, veicoli storici, archivi, musei, automotive e design companies) integrato con i centri museali e l’offerta del territorio. A confronto i protagonisti: proprietà del patrimonio immobile, strutture museali, associazioni di settore, operatori privati del turismo culturale, aziende produttive e di R&D interessate al miglioramento dell’immagine del distretto automotive, centri di ricerca e formazione.
Intervengono:
Giovanni Ferrero – ISMEL
Un’opportunità della cultura industriale per la città
Sergio Pace – Politecnico di Torino
Le potenzialità degli archivi e della cultura storica
Rossella Maspoli – Politecnico di Torino
Il progetto Automotive Heritage Network
Manuel Ramello – AIPAI
Un'iniziativa in collaborazione Politecnico di Torino – DAD, Associazione italiana per il patrimonio archeologico industriale, ISMEL
Presentazione del Festival del Mutualismo con Guido Bonfante e Giorgio Viarengo.
Incontro con Derrik De Kerckhove, Giovanni Ferrero e Diego Robotti, in collaborazione con ISMEL:
Lo stato dell'arte dell’assistenza digitale sta prendendo forma.
Una donna anziana colpita dell’Alzheimer si fida di Alexa per ricordare le informazioni che ritiene indispensabili alla sua sopravvivenza e limitare i danni e gli impegni doverosi che graveranno sulla sua famiglia. E' una storia vera e la protagonista, Wendy Mitchell, la racconta in un libro che è già diventato un best seller.
Nel prossimo futuro si profila la possibilità per ognuno di disporre di un "gemello digitale”, nostro essere e nostra vita raddoppiati nel nostro smartphone, alter ego già messo a disposizione negli ospedali di punta per sapere tutta la storia del paziente.
Derrick de Kerchkove intende presentare il passato e futuro di questo nuovo terreno di collaborazione tra l'uomo e la macchina e prospettare per il mutuo soccorso nuove possibilità di assistenza solidale.
Contributo al seminario "Lavoro e partecipazione: orientamenti e dibattito attuale", tenutosi venerdì 27 settembre 2019 presso il Campus Luigi Einaudi di Torino.
Introduce e modera
Fiorella Lunardon, Università di Torino
Intervengono
Maurizio Bussi, OIL
Il lavoro nel XXI secolo secondo l’elaborazione dell’OIL
Salvo Leonardi, Fondazione Di Vittorio
Cambiamenti del lavoro, modelli organizzativi e partecipazione dei lavoratori nella contrattazione collettiva
Raffaele De Luca Tamajo | Franco Focareta, AGI
L’impatto della innovazione tecnologica sui tempi del lavoro e sulle relazioni sindacali
Tiziana Bocchi, Segretaria Nazionale Confederale UIL
Dall’informazione alla partecipazione: le linee strategiche del lavoro e dell’impresa per uno sviluppo sostenibile ed inclusivo
Iniziativa realizzata nell'ambito del progetto integrato del Polo del '900 Lavoro e partecipazione coordinato da ISMEL. In collaborazione con AGI - Avvocati Giuslavoristi Italiani e OIL - Organizzazione Internazionale del Lavoro
Il lavoro nel secondo dopoguerra e le Costituzioni in Europa
Seminario
Venerdì 20 settembre 2019
Campus Luigi Einaudi, Torino
Introduce e modera
Aldo Enrietti, ISMEL
Intervengono
Marco Scavino, Università di Torino
Il valore costituente del lavoro nel "secolo breve"
Paolo Tosi | Piergiovanni Alleva, AGI
Lavoro, partecipazione e impresa nella Costituzione italiana in comparazione con alcune altre Costituzioni europee
Gianni Rosas, OIL
Il lavoro e la sua regolazione nella Dichiarazione di Filadelfia del 1943 e il riconoscimento dei diritti associativi dei lavoratori e dell’impresa
Giuseppe Iuliano, CISL Nazionale
Il dialogo sociale nel contesto delle Istituzioni europee: riconoscimento degli attori sociali e ruolo consultivo del Comitato Economico e Sociale europeo
Iniziativa realizzata nell'ambito del progetto integrato del Polo del '900 Lavoro e partecipazione coordinato da ISMEL. In collaborazione con AGI - Avvocati Giuslavoristi Italiani e OIL - Organizzazione Internazionale del Lavoro
Il lavoro nel secondo dopoguerra e le Costituzioni in Europa
Seminario
Venerdì 20 settembre ore 17
Campus Luigi Einaudi, Aula A1
Introduce e modera
Aldo Enrietti, ISMEL
Intervengono
Marco Scavino, Università di Torino
Il valore costituente del lavoro nel "secolo breve"
Paolo Tosi | Piergiovanni Alleva, AGI
Lavoro, partecipazione e impresa nella Costituzione italiana in comparazione con alcune altre Costituzioni europee
Gianni Rosas, OIL
Il lavoro e la sua regolazione nella Dichiarazione di Filadelfia del 1943 e il riconoscimento dei diritti associativi dei lavoratori e dell’impresa
Giuseppe Iuliano, CISL Nazionale
Il dialogo sociale nel contesto delle Istituzioni europee: riconoscimento degli attori sociali e ruolo consultivo del Comitato Economico e Sociale europeo
Iniziativa realizzata nell'ambito del progetto integrato del Polo del '900 Lavoro e partecipazione coordinato da ISMEL. In collaborazione con AGI - Avvocati Giuslavoristi Italiani e OIL - Organizzazione Internazionale del Lavoro
Gli ultimi dati sull’andamento del mercato del lavoro in Piemonte segnalano un brusco peggioramento della condizione lavorativa delle donne, a fronte di un miglioramento della condizione degli uomini.
Davanti a questi dati è parso utile sviluppare un’analisi con la quale provare a fare chiarezza sulla condizione delle donne nel mercato del lavoro della nostra Regione, alla luce dei cambiamenti intervenuti negli ultimi anni con un occhio di riguardo alla condizione delle nuove generazioni e all’evoluzione delle “differenze di genere” tradizionalmente a svantaggio delle donne.
Per far emergere la loro condizione lavorativa. è stata fatta una precisa scelta: far parlare i numeri, mettere a confronto le diverse realtà per far emergere affinità ed elementi distintivi.
Presentazione della ricerca di Federica Volpi durante l'incontro "Lavoro e Stato sociale ai tempi dell'economia digitale" organizzato da ISMEL in collaborazione con Unione Culturale Franco Antonicelli e L'Indice dei Libri del Mese
Intervento di Luca Settineri al quinto e ultimo incontro del corso di formazione per dirigenti sindacali "Le parole dell'innovazione e il lavoro", nato da una progettazione congiunta tra ISMEL e le segreterie CGIL, CISL e UIL di Torino e tenutosi tra marzo e maggio 2019.
Intervento di Luciano Pero al quinto e ultimo incontro del corso di formazione per dirigenti sindacali "Le parole dell'innovazione e il lavoro", nato da una progettazione congiunta tra ISMEL e le segreterie CGIL, CISL e UIL di Torino e tenutosi tra marzo e maggio 2019.
Intervento di Francesco Garibaldo al quinto e ultimo incontro del corso di formazione per dirigenti sindacali "Le parole dell'innovazione e il lavoro", nato da una progettazione congiunta tra ISMEL e le segreterie CGIL, CISL e UIL di Torino e tenutosi tra marzo e maggio 2019.
Intervento di Stefano Di Dio al quarto incontro del corso di formazione per dirigenti sindacali "Le parole dell'innovazione e il lavoro", nato da una progettazione congiunta tra ISMEL e le segreterie CGIL, CISL e UIL di Torino e tenutosi tra marzo e maggio 2019.
Intervento di Stefano Di Dio al quarto incontro del corso di formazione per dirigenti sindacali "Le parole dell'innovazione e il lavoro", nato da una progettazione congiunta tra ISMEL e le segreterie CGIL, CISL e UIL di Torino e tenutosi tra marzo e maggio 2019.
Intervento di Ivana Pais al terzo incontro del corso di formazione per dirigenti sindacali "Le parole dell'innovazione e il lavoro", nato da una progettazione congiunta tra ISMEL e le segreterie CGIL, CISL e UIL di Torino e tenutosi tra marzo e maggio 2019.
More from Ismel - Istituto per la Memoria e la Cultura del Lavoro, dell'Impresa e dei Diritti Sociali (20)
A review of the growth of the Israel Genealogy Research Association Database Collection for the last 12 months. Our collection is now passed the 3 million mark and still growing. See which archives have contributed the most. See the different types of records we have, and which years have had records added. You can also see what we have for the future.
Executive Directors Chat Leveraging AI for Diversity, Equity, and InclusionTechSoup
Let’s explore the intersection of technology and equity in the final session of our DEI series. Discover how AI tools, like ChatGPT, can be used to support and enhance your nonprofit's DEI initiatives. Participants will gain insights into practical AI applications and get tips for leveraging technology to advance their DEI goals.
How to Manage Your Lost Opportunities in Odoo 17 CRMCeline George
Odoo 17 CRM allows us to track why we lose sales opportunities with "Lost Reasons." This helps analyze our sales process and identify areas for improvement. Here's how to configure lost reasons in Odoo 17 CRM
ISO/IEC 27001, ISO/IEC 42001, and GDPR: Best Practices for Implementation and...PECB
Denis is a dynamic and results-driven Chief Information Officer (CIO) with a distinguished career spanning information systems analysis and technical project management. With a proven track record of spearheading the design and delivery of cutting-edge Information Management solutions, he has consistently elevated business operations, streamlined reporting functions, and maximized process efficiency.
Certified as an ISO/IEC 27001: Information Security Management Systems (ISMS) Lead Implementer, Data Protection Officer, and Cyber Risks Analyst, Denis brings a heightened focus on data security, privacy, and cyber resilience to every endeavor.
His expertise extends across a diverse spectrum of reporting, database, and web development applications, underpinned by an exceptional grasp of data storage and virtualization technologies. His proficiency in application testing, database administration, and data cleansing ensures seamless execution of complex projects.
What sets Denis apart is his comprehensive understanding of Business and Systems Analysis technologies, honed through involvement in all phases of the Software Development Lifecycle (SDLC). From meticulous requirements gathering to precise analysis, innovative design, rigorous development, thorough testing, and successful implementation, he has consistently delivered exceptional results.
Throughout his career, he has taken on multifaceted roles, from leading technical project management teams to owning solutions that drive operational excellence. His conscientious and proactive approach is unwavering, whether he is working independently or collaboratively within a team. His ability to connect with colleagues on a personal level underscores his commitment to fostering a harmonious and productive workplace environment.
Date: May 29, 2024
Tags: Information Security, ISO/IEC 27001, ISO/IEC 42001, Artificial Intelligence, GDPR
-------------------------------------------------------------------------------
Find out more about ISO training and certification services
Training: ISO/IEC 27001 Information Security Management System - EN | PECB
ISO/IEC 42001 Artificial Intelligence Management System - EN | PECB
General Data Protection Regulation (GDPR) - Training Courses - EN | PECB
Webinars: https://pecb.com/webinars
Article: https://pecb.com/article
-------------------------------------------------------------------------------
For more information about PECB:
Website: https://pecb.com/
LinkedIn: https://www.linkedin.com/company/pecb/
Facebook: https://www.facebook.com/PECBInternational/
Slideshare: http://www.slideshare.net/PECBCERTIFICATION
How to Make a Field Mandatory in Odoo 17Celine George
In Odoo, making a field required can be done through both Python code and XML views. When you set the required attribute to True in Python code, it makes the field required across all views where it's used. Conversely, when you set the required attribute in XML views, it makes the field required only in the context of that particular view.
A workshop hosted by the South African Journal of Science aimed at postgraduate students and early career researchers with little or no experience in writing and publishing journal articles.
How to Fix the Import Error in the Odoo 17Celine George
An import error occurs when a program fails to import a module or library, disrupting its execution. In languages like Python, this issue arises when the specified module cannot be found or accessed, hindering the program's functionality. Resolving import errors is crucial for maintaining smooth software operation and uninterrupted development processes.
Main Java[All of the Base Concepts}.docxadhitya5119
This is part 1 of my Java Learning Journey. This Contains Custom methods, classes, constructors, packages, multithreading , try- catch block, finally block and more.
How to Setup Warehouse & Location in Odoo 17 InventoryCeline George
In this slide, we'll explore how to set up warehouses and locations in Odoo 17 Inventory. This will help us manage our stock effectively, track inventory levels, and streamline warehouse operations.
How to Add Chatter in the odoo 17 ERP ModuleCeline George
In Odoo, the chatter is like a chat tool that helps you work together on records. You can leave notes and track things, making it easier to talk with your team and partners. Inside chatter, all communication history, activity, and changes will be displayed.
La ricerca scientifica nell'era dei Big Data - Sabina Leonelli
1. Sabina Leonelli
Exeter Centre for the Study of Life Sciences (Egenis)
& Department of Sociology, Philosophy and Anthropology
University of Exeter
@sabinaleonelli
2. New technologies for
producing and storing lots of
data, fast, about anything and
everything
New institutions and
communication platforms for
disseminating data
New forms of analysis,
computing and automation
= Gateway to new social
behaviors, services, self-
understanding
= Novel status of data in research
3. Potential to improve
Pathways to and quality of discoveries: data
mining helps spot gaps and opportunities
Collaboration across sites, disciplines and
countries
Uptake of new technologies
Research evaluation, debate and
transparency
Significance of research components
beyond papers and patents
Fight against fraud, low quality and
duplication of efforts
Legitimacy of science, public trust and
engagement
4. Potential to improve
Pathways to and quality of discoveries: data
mining helps spot gaps and opportunities
Collaboration across sites, disciplines and
countries
Uptake of new technologies
Research evaluation, debate and
transparency
Significance of research components
beyond papers and patents
Fight against fraud, low quality and
duplication of efforts
Legitimacy of science, public trust and
engagement
5. Tracking data journeys
To understand how data move from sites of production to sites of
dissemination and interpretation/use, and with which consequences
• Focus:
1. Databases as windows on material/conceptual/institutional
labor required to make data widely accessible and useable
• labels & software to classify, model, visualize, retrieve data
• management of infrastructure and communications
2. Data re-use cases to investigate
• conditions under which data can be interpreted
• implications for discovery & what counts as good research
• role of Open Science movement in knowledge generation
6. Empirical sources:
Archives, scientific literature
Interviews and participant observation to document research
practices and attitudes to data openness, curation, re-use
Collaboration & direct involvement in data journeys
Comparative analysis across research areas and countries:
• Varieties of data types, research goals, methods and instruments
• Area-specific requirements, political economy and ethos
• Regulatory frameworks, research environments & infrastructure
• Data sharing and re-use across high- and low-income countries
9. Health and
environmental
data
Sensitive
biomedical data
Cross-species
(from yeast to
human)
Plant data across
lab and field,
North and South
10. 1. Favoring conservatism over
innovation
2. Making bias invisible
3. Building on unreliable data
4. Prioritising commercial interests
5. Encouraging research that is
irrelevant for or damaging to society
11. 1. Favoring conservatism over innovation
2. Making bias invisible
3. Building on unreliable data
4. Prioritising commercial interests
5. Encouraging research that is irrelevant for or
damaging to society
12. Nested, inter-dependent databases collecting different data
types and approaches
Not easily interoperable! Critical role of
language used to order and retrieve data (e.g. ontologies)
meta-data and experimental protocols (e.g. Minimal Information
and ISA-Tab standard)
standards for what counts as reliable data and evidential
significance
Real challenges in developing and updating database
content (formats, software, knowledge base)
Common standards help.. when complemented by trained
human judgement on how to apply them
13.
14. No sustainability for databases and related curation
Lack of long-term funds and willingness to invest
Vision of ‘data curation’ as technical service rather than research
Researchers rarely receiving expert support on data management
In the absence of intelligent human curation, many
databases disappear or (worse!) stagnate major
hurdles in re-using badly kept old data
General focus on re-using instead of creating: what
does it mean for creativity and innovation?
Esp. where objects of inquiry keep changing and evolving
15. 1. Favoring conservatism over innovation
2. Building on unreliable data
3. Making bias invisible
4. Prioritising commercial interests
5. Encouraging research that is irrelevant for or
damaging to society
16. Difficulties in locating error and evaluating data
provenance and quality, esp. when data travel beyond
specific communities of practice
Re-contextualising information (meta-data) is crucial
to data interpretation, but often insufficient or badly
selected/annotated
Data quality assessment
varies depending on specific use (e.g. microarrays)
often depends on access to original materials or
instruments, yet
▪ sample collections are unsystematic, underfunded, and not
interlinked (which makes samples hard to locate and relate to data)
▪ old instruments are not kept, unless for historical purposes
17. Data sharing and re-use needs data curation that is
Intelligent (Royal Society, 2012)
Informed by familiarity with research objects and target
systems
Trustworthy (able to weed out error and unreliable data)
Consistent across time and space (e.g. longitudinal data
collection in oceanography, epidemiology, environmental
science)
Databases are rarely regarded as trustworthy
beyond relatively small epistemic communities with
strong social bounds and user participation
18. 1. Favoring conservatism over innovation
2. Building on unreliable data
3. Making bias invisible
4. Prioritising commercial interests
5. Encouraging research that is irrelevant for or
damaging to society
19. Big data collections tend to be extremely selective:
Databases display the outputs of rich, English-speaking labs within visible and
popular research traditions, which deal with ‘tractable’ data formats
Involvement of poor/unfashionable labs, developing countries & non-scientists
is low and at the ‘receiving’ end
Increasing digital divide in research too: Inequalities of visibility, power and
location can be reinforced, rather than mitigated, by big data dissemination
Huge disparity in data sources and types of data that can be
curated/disseminated/reused
Inequality in representation - e.g. sampling across populations for
health research, diverse applications of data protection laws
Sampling based on convenience and institutional/financial
factors: not a novelty per se, yet often the resulting bias remains
unaccounted for in research methods and analysis
20. 1. Favoring conservatism over innovation
2. Building on unreliable data
3. Making bias invisible
4. Prioritising commercial interests
5. Encouraging research that is irrelevant for or
damaging to society
21. Triumph of commercial and opportunistic concerns
over scientific reasoning and investigative decisions
Data choice, processing and dissemination mechanisms are
governed by non-epistemic factors
More ‘tractable’ data in digital formats are more easily
shared, accumulated and exchanged as commodities..
.. while complex and expensive datasets often become or
remain private
In US, lack of appropriate regulation over data
dissemination and commercialization
Lack of clarity over legal regimes, esp. for research
data
22. 1. Favoring conservatism over innovation
2. Building on unreliable data
3. Making bias invisible
4. Prioritising commercial interests
5. Encouraging research that is irrelevant for or
damaging to society
23. BOD presented as
opportunity to shake up
the research system and
make it participatory and
socially responsive
E.g. increased citizen
engagement in data
collection, sharing and
analysis:
Citizen Science movement
the ‘right to science’ in
medicine
DIY Biology and Fab Labs
24.
25. Data linkage across data sources involves serious
risks to individuals and communities (e.g. privacy,
medical assessment, representation in government and
social services)
Research outputs may damage society instead of
leading to human flourishing (e.g. Royal Society Data
Governance Report 2017, Luciano Floridi’s work)
Importance of preserving human rights, not corrupting
them (e.g. ‘right to science’ movement, EffyVayena’s work)
Data sharing and re-use needs to be ethically
sound
26. AI-enabled mining of Big Data: alternative to
extensive human interventions & decision-
making currently required for data analysis
Big Data + AI = automation of inquiry
Promise of faster, better, more reliable, more
sustainable knowledge production
27. AI-enabled mining of Big Data: alternative to
extensive human interventions & decision-
making currently required for data analysis
Big Data + AI = automation of inquiry
Promise of faster, better, more reliable, more
sustainable knowledge production
28. Big data in and of themselves do not provide a
reliable evidence base for computational
analysis:
All data are FAIR (vs: Lovell, Leonelli et al under review)
Big Data are comprehensive, and thus
▪ Big data counters bias in data collection and interpretation
(vs: Leonelli 2018)
▪ Big data makes debate over sampling redundant, as we
have data about everything (vs: Leonelli 2014 in Big Data & Society)
29. Big data in and of themselves do not provide a
reliable evidence base for computational
analysis:
All data are FAIR (vs: Lovell, Leonelli et al under review)
Big Data are comprehensive, and thus
▪ Big data counters bias in data collection and interpretation
(vs: Leonelli 2018)
▪ Big data makes debate over sampling redundant, as we
have data about everything (vs: Leonelli 2014 in Big Data & Society)
30. Big data in and of themselves do not provide a
reliable evidence base for computational
analysis:
All data are FAIR (vs: Lovell, Leonelli et al under review)
Big Data are comprehensive, and thus
▪ Big data counters bias in data collection and interpretation
(vs: Leonelli 2018)
▪ Big data makes debate over sampling redundant, as we
have data about everything (vs: Leonelli 2014 in Big Data & Society)
31. Big data in and of themselves do not provide a
reliable evidence base for computational
analysis:
All data are FAIR (vs: Lovell, Leonelli et al under review)
Big Data are comprehensive, and thus
▪ Big data counters bias in data collection and interpretation
(vs: Leonelli 2018)
▪ Big data makes debate over sampling redundant, as we
have data about everything (vs: Leonelli 2014 in Big Data & Society)
32. Forging tools for unregulated mass surveillance of human
behavior at individual as well as community levels
Producing unreliable knowledge that does not help to tackle
urgent social challenges
Expanding existing divides and silencing scientific traditions
from low-resourced environments and ‘unfashionable’ topics
Eroding scientific expertise and centuries-old methodological
wisdom: ‘anything online goes’
Eroding trust and credibility of science: exponential growth of
opportunities for marketing “alternative facts”
33. Recognise local and situated nature of data selection, sampling
methods and data quality assessment informing decisions
about the scope of inferential claims (what they apply to)
Promote effective, context-specific, sustainable data curation,
and explicit criteria for data and meta-data inclusion and
formatting
Build accountability for choice and sources of data into data
infrastructures and analytic tools
Track data histories and journeys through meta-data
Avoid irreversible data linkage
Strengthen links between digital data and research materials
Build safeguards for social/ethical concerns to improve research
methods: ethical data handling enables data linkage, supports
information security, facilitates data mining
34. Special thanks to the hundreds of
scientists who participated in this
research, Michel Durinx for the figures,
the Data Studies group in Egenis and
the many colleagues in philosophy,
history and social studies of science
who provided crucial and generous
support and insights.
This research was supported by funding
from:
• European Research Council (grant
335925)
• Leverhulme Trust
• Economic and Social Research
Council
• Max Planck Institute for the History
of Science
• British Academy
• Australian Research Council
• University of Ghent
35. 1. Context-specific curation is key to data re-
use
2. Long-term maintenance is key to
trustworthiness
3. Which data and why?
4. Data and materials
5. Role of ethics, humanities and social
sciences in data management
36. Data curation is essential to BOD re-use and
interpretation, yet badly underestimated and
not rewarded
Value of long-standing research traditions and
reviewing methods
Crucial for data infrastructures to be user-friendly and
receptive to user feedback
Need case-by-case judgments on data quality and
fruitful modes of data sharing
37. Pluralism in methods and standards contributes
robustness to data analysis, and reduces risk of losing
system-specific knowledge
Standards and formats are key
Yet reliance on overly rigid standards creates exclusions and
obliterates system-specific knowledge
Data linkage methods are best when it is possible to
disaggregate
Interoperability is preferable to integration
38. Regular updates across nested infrastructures
Business plan for long-term sustainability
For BOD, this means:
Clear relation between international field-specific
databases, international clouds, national clouds,
institutional repositories
Make sure each node is resilient and system is not
crippled by individual node failure (now all
independently funded, typically in the short term)
39. Particularly important since hard to guarantee
data quality
Re-use often linked to participation in developing
data infrastructures
rarely the case for busy practitioners, considering also
gap in skills
Role of confidence assessments on data quality
and reliability (again: expert curation is key)
40.
41. Indiscriminate calls for open data can lead to
serendipity in what data are circulated and when
Need explicit rationale around priority given to specific data
types and sources (e.g. ‘omics’ in biology)
Substantive disagreements over data management:
Methods, terminologies, standards involved in data
production and interpretation
To be useable, data are handled by several individuals with
diverse expertise: the evidential value and representational
power attributed to a given dataset can vary
42. Criteria for what counts as good data – even as data
altogether – vary dramatically even within the same
field
Data producers, curators, users make choices about
what constitutes data at each stage of their journeys
Data as relational: what counts as data varies in
relation to research situations [Leonelli 2015, 2016, 2019]
Any object can be considered as a datum as long as (1) it is
treated as potential evidence for one or more claims about
phenomena, and (2) it is possible to circulate it among
individuals/groups
43. Inference as a process of situating data in
relation to elements of relevance to interpretation
(materials, instruments, interests, norms)
developing a context for inquiry that aligns
one’s purpose with existing theoretical
commitments and selected properties of data and
target system
E.g. Databases as enabling comparison among ways
to organise data
“All inductive inference is local” (Norton 2003)
44. What makes for ‘good’ inference?Triangulation
of different data sources often cited by BOD
advocates
My view: triangulation is necessary, but not
sufficient
Difficulties in accounting for partiality in data sources
(Leonelli 2014, 2016, 2018)
Efforts to maintain continuity and commensurability
when re-assessing same dataset with different
methods across time (Wylie 2017, Leonelli 2018)
45. Building on Alison Wylie’s conditions for robust evidential
reasoning..
(1) security, (2) causal anchoring and causal independence, (3)
conceptual independence, (4) grounds for calibration and (5)
addressing divergence [Chapman andWylie 2016]
.. And adding:
(6) diversity of sources and subsequent handling methods (make
data journeys trackable)
(7) explicit valuing criteria for data production, dissemination and
reuse (debate which data get to travel; ethics and social concerns
are key to data re-usability and security)
(8) material anchoring where possible (link digital databases and
sample collections)
(9) critical use of standards (balance standardization with local
solutions to preserve system-specific methods)
46. Choice of meta-data and link to research materials
Reliable stock centers and collections: rarely available &
coordinated with databases
E.g. model organism stock centres, biobanks
47.
48. Ethical, social and security concerns increase
quality and re-usability of data/infrastructures
Data re-use requires well-informed, sustainable,
inclusive, participative development and use of data
infrastructures
Related skills are as central to big data use as
computational skills
Data management training requires input from all
fields, esp. social science and humanities
49. Recognise local and situated nature of data selection, sampling
methods and data quality assessment informing decisions
about the scope of inferential claims (what they apply to)
Promote effective, context-specific, sustainable data curation,
and explicit criteria for data and meta-data inclusion and
formatting
Build accountability for choice and sources of data into data
infrastructures and analytic tools
Track data histories and journeys through meta-data
Avoid irreversible data linkage
Strengthen links between digital data and research materials
Build safeguards for social/ethical concerns to improve research
methods: ethical data handling enables data linkage, supports
information security, facilitates data mining
50. Special thanks to the hundreds of
scientists who participated in this
research, Michel Durinx for the figures,
the Data Studies group in Egenis and
the many colleagues in philosophy,
history and social studies of science
who provided crucial and generous
support and insights.
This research was supported by funding
from:
• European Research Council (grant
335925)
• Leverhulme Trust
• Economic and Social Research
Council
• Max Planck Institute for the History
of Science
• British Academy
• Australian Research Council
• University of Ghent
Responses to Mael:
Differences between discipines (e.g. physics where there is a well-established theoretical framework vs biology which is epistemologically vulnerable..?) BUT data cleaning… point to MIT volume
Big data techniques do not sidestep contextuality of data
Bernard Stigler: techniques are pharmakon (remedy)
..
move away from universal and a priori models of inductive inference