Talk given at the Sciencedigital@UNGA75 on 29th September as part of a series of side events to mark the 75th anniversary of the United Nations General Assembly.
Is the current measure of excellence perverting Science? A Data deluge is com...Lourdes Verdes-Montenegro
Talk prepared for motivating the Session proposed by AMIGA team to SKA Office and organized by William Garnier (SKAO) for ESOF (European Science Forum) held in Toulouse in July 2018
Data for Science: How Elsevier is using data science to empower researchersPaul Groth
Each month 12 million people use Elsevier’s ScienceDirect platform. The Mendeley social network has 4.6 million registered users. 3500 institutions make use of ClinicalKey to bring the latest in medical research to doctors and nurses. How can we help these users be more effective? In this talk, I give an overview of how Elsevier is employing data science to improve its services from recommendation systems, to natural language processing and analytics. While data science is changing how Elsevier serves researchers, it’s also changing research practice itself. In that context, I discuss the impact that large amounts of open research data are having and the challenges researchers face in making use of it, in particular, in terms of data integration and reuse. We are at just beginning to see of how technology and data is changing science correspondingly this impacts how best to empower those who practice it.
This is an updated version of an invited talk I presented at the European Research Council-Brussels (Scientific Seminar): "Love for Science or 'academic prostitution'".
It has been updated to be presented at the Document Freedom Day 2014, during the activities organized by the Oficina de Software Libre de la Universidad de Granada (26th March).
I have included some new slides and revised others.
I present a personal revision (sometimes my own vision) of some issues that I consider key for doing Science. It was focused on the expected audience, mainly Scientific Officers with background in different fields of science and scholarship, but also Agency staff.
Abstract: In a recent Special issue of Nature concerning Science Metrics it was claimed that " Research reverts to a kind of 'academic prostitution' in which work is done to please editors and referees rather than to further knowledge."If this is true, funding agencies should try to avoid falling into the trap of their own system. By perpetuating this 'prostitution' they risk not funding the best research but funding the best sold research.
Given the current epoch of economical crisis, where in a quest for funds researchers are forced into competitive game of pandering to panelists, its seems a good time for deep reflection about the entire scientific system.
With this talk I aim to provoke extra critical thinking among the committees who select evaluators, and among the evaluators, who in turn require critical thinking to the candidates when selecting excellent science.
I will present some initiatives (e.g. new tracers of impact for the Web era- 'altmetrics'), and on-going projects (e.g. how to move from publishing advertising to publishing knowledge), that might enable us to favor Science over marketing.
Despite the availability of many platforms for scientists to connect and share with their peers in the scientific community the majority do not make use of these tools, despite their promise and potential impact and influence on our careers. We are already being indexed and exposed on the internet via our publications, presentations and data and new “AltMetric scores” are being assigned to scientific publications as measures of popularity and, supposedly, of impact. We now have even more ways to contribute to science, to annotate and curate data, to “publish” in new ways, and many of these activities are as part of a growing crowdsourcing network. This presentation provides an overview of the various types of networking and collaborative sites available to scientists and ways to expose your scientific activities online. It will discuss the new world of AltMetrics that is in an explosive growth curve and will help you understand how to influence and leverage some of these new measures. Participating online, whether it be simply for career advancement or for wider exposure of your research, there are now a series of web applications that can provide a great opportunity to develop a scientific profile within the community.
How altmetrics can help researchers broaden the reach of their work
Slides from workshop to pepnet (Public Engagement network) at the University of Leeds on 28th November 2018
The reach of a publication can no longer be judged exclusively by the number of times it is cited. Because, we are now in the digital and sharing information age, academic conversations are as likely to be found on various academic social networks. So, we need new tools to measure the research impact. Altmetrics are new metrics proposed as alternatives to Impact Factor for journals and personal citation indexes like h-index. Altmetrics attempts to use the online activity to measure impact, buzz, word of mouth for scientific information and it includes new ways to measure usage at the citation level. In this workshop, I will explain about the application of "alternative metrics" tools such as: Altmetric.com, Impactstory.org, Plumanalytics.com, and PLoS metrics.
Is the current measure of excellence perverting Science? A Data deluge is com...Lourdes Verdes-Montenegro
Talk prepared for motivating the Session proposed by AMIGA team to SKA Office and organized by William Garnier (SKAO) for ESOF (European Science Forum) held in Toulouse in July 2018
Data for Science: How Elsevier is using data science to empower researchersPaul Groth
Each month 12 million people use Elsevier’s ScienceDirect platform. The Mendeley social network has 4.6 million registered users. 3500 institutions make use of ClinicalKey to bring the latest in medical research to doctors and nurses. How can we help these users be more effective? In this talk, I give an overview of how Elsevier is employing data science to improve its services from recommendation systems, to natural language processing and analytics. While data science is changing how Elsevier serves researchers, it’s also changing research practice itself. In that context, I discuss the impact that large amounts of open research data are having and the challenges researchers face in making use of it, in particular, in terms of data integration and reuse. We are at just beginning to see of how technology and data is changing science correspondingly this impacts how best to empower those who practice it.
This is an updated version of an invited talk I presented at the European Research Council-Brussels (Scientific Seminar): "Love for Science or 'academic prostitution'".
It has been updated to be presented at the Document Freedom Day 2014, during the activities organized by the Oficina de Software Libre de la Universidad de Granada (26th March).
I have included some new slides and revised others.
I present a personal revision (sometimes my own vision) of some issues that I consider key for doing Science. It was focused on the expected audience, mainly Scientific Officers with background in different fields of science and scholarship, but also Agency staff.
Abstract: In a recent Special issue of Nature concerning Science Metrics it was claimed that " Research reverts to a kind of 'academic prostitution' in which work is done to please editors and referees rather than to further knowledge."If this is true, funding agencies should try to avoid falling into the trap of their own system. By perpetuating this 'prostitution' they risk not funding the best research but funding the best sold research.
Given the current epoch of economical crisis, where in a quest for funds researchers are forced into competitive game of pandering to panelists, its seems a good time for deep reflection about the entire scientific system.
With this talk I aim to provoke extra critical thinking among the committees who select evaluators, and among the evaluators, who in turn require critical thinking to the candidates when selecting excellent science.
I will present some initiatives (e.g. new tracers of impact for the Web era- 'altmetrics'), and on-going projects (e.g. how to move from publishing advertising to publishing knowledge), that might enable us to favor Science over marketing.
Despite the availability of many platforms for scientists to connect and share with their peers in the scientific community the majority do not make use of these tools, despite their promise and potential impact and influence on our careers. We are already being indexed and exposed on the internet via our publications, presentations and data and new “AltMetric scores” are being assigned to scientific publications as measures of popularity and, supposedly, of impact. We now have even more ways to contribute to science, to annotate and curate data, to “publish” in new ways, and many of these activities are as part of a growing crowdsourcing network. This presentation provides an overview of the various types of networking and collaborative sites available to scientists and ways to expose your scientific activities online. It will discuss the new world of AltMetrics that is in an explosive growth curve and will help you understand how to influence and leverage some of these new measures. Participating online, whether it be simply for career advancement or for wider exposure of your research, there are now a series of web applications that can provide a great opportunity to develop a scientific profile within the community.
How altmetrics can help researchers broaden the reach of their work
Slides from workshop to pepnet (Public Engagement network) at the University of Leeds on 28th November 2018
The reach of a publication can no longer be judged exclusively by the number of times it is cited. Because, we are now in the digital and sharing information age, academic conversations are as likely to be found on various academic social networks. So, we need new tools to measure the research impact. Altmetrics are new metrics proposed as alternatives to Impact Factor for journals and personal citation indexes like h-index. Altmetrics attempts to use the online activity to measure impact, buzz, word of mouth for scientific information and it includes new ways to measure usage at the citation level. In this workshop, I will explain about the application of "alternative metrics" tools such as: Altmetric.com, Impactstory.org, Plumanalytics.com, and PLoS metrics.
Measuring Impact: Towards a data citation metricEdward Baker
How the ViBRANT and eMonocot projects are building tools, including a modified implementation of Bourne and Fink's 'Scholar Factor', the Biodiversity Data Journal, and Scratchpad's user metrics and statistics modules.
Astronomy is a collaborative science, but it has also become highly specialized, as many other disciplines. Improvement of sharing, discovery and access to resources will enable astronomers to greatly benefit from each other’s highly specialized knowhow. Some initiatives led by scientists and publishers, complement traditional paper publishing with assets published in more interactive digital formats. Among the main goals of these efforts are improving the reproducibility and clarity of the scientific outcome, going beyond the static PDF file, and fostering re-use, which turns into a more efficient exploitation of available digital resources.
With the flourishing environment of platforms for sharing data, establishing an online profile and engaging in scientific discourse through alternative modes of publishing and participation, there are numerous potential benefits. However, while many scientists invest significant amounts of time in sharing their activities and opinions with friends and family the majority do not make use of the new opportunities to participate in the developing social web of science, despite the potential impact and influence on future careers. We now have many new ways to contribute to science outside of the classical publishing model. These include the ability to annotate and curate data, to “publish” in new ways on blogs and micropublishing sites, and many of these activities can be as part of a growing crowdsourcing network. Our efforts in this area are already being indexed and exposed on the internet via our publications, presentations and data and increasingly we are being quantified. This presentation will provide an overview of the various types of networking and collaborative sites available to scientists and ways to expose their scientific activities online. Many of these can ultimately contribute to the developing metrics of a scientist as identified in the new world of alternative metrics. Participation offers a great opportunity to develop a scientific profile within the community and may ultimately be very beneficial, especially to scientists early in their career.
This presentation was given at the UKICRS meeting (http://www.ukicrs.org/2015-symposium.html) on April 16th 2015. This presentation was in a workshop and focused on trying to inform attendees in the postgraduate phases of their careers how to use online tools to start building a reputation and profile in their field
UCT eResearch is about IT-enabled research discovery at the University of Cape Town. This slide show was presented to IT faculty managers and technical staff to demonstrate how UCT eResearch can assist faculties at the university to raise eResearch capacity.
Ontologies For the Modern Age - McGuinness' Keynote at ISWC 2017Deborah McGuinness
Ontologies are seeing a resurgence of interest and usage as big data proliferates, machine learning advances, and integration of data becomes more paramount. The previous models of sometimes labor-intensive, centralized ontology construction and maintenance do not mesh well in today’s interdisciplinary world that is in the midst of a big data, information extraction, and machine learning explosion. In this talk, we will provide some historical perspective on ontologies and their usage, and discuss a model of building and maintaining large collaborative, interdisciplinary ontologies along with the data repositories and data services that they empower. We will give a few examples of heterogeneous semantic data resources made more interconnected and more powerful by ontology-supported infrastructures, discuss a vision for ontology-enabled future research and provide some examples in a large health empowerment joint effort between RPI and IBM Watson Health.
The purpose, practicalities, pitfalls and policies of managing and sharing da...Danny Kingsley
Talk to the Royal Society of Chemistry, Chemical Information and Computer Applications Group conference - Measurement, Information and Innovation: Digital Disruption in the Chemical Sciences. Tuesday 20th October 2015, RSC, Burlington House, Piccadilly, London
Many of us nowadays invest significant amounts of time in sharing our activities and opinions with friends and family via social networking tools such as Facebook, Twitter or other related websites. However, despite the availability of many platforms for scientists to connect and share with their peers in the scientific community the majority do not make use of these tools, despite their promise and potential impact and influence on our careers. We are already being indexed and exposed on the internet via our publications, presentations and data and new “AltMetric scores” are being assigned to scientific publications as measures of popularity and, supposedly, of impact. We now have even more ways to contribute to science, to annotate and curate data, to “publish” in new ways, and many of these activities are as part of a growing crowdsourcing network. This presentation provides an overview of the various types of networking and collaborative sites available to scientists and ways to expose your scientific activities online. It will discuss the new world of AltMetrics that is in an explosive growth curve and will help you understand how to influence and leverage some of these new measures. Participating online, whether it be simply for career advancement or for wider exposure of your research, there are now a series of web applications that can provide a great opportunity to develop a scientific profile within the community.
Public engagement while you sleep? How altmetrics can help researchers broade...UoLResearchSupport
Slides from a seminar delivered for pepnet at the University of Leeds 28 Nov 2018. Thanks to Charlotte Perry-Houts for extra content:
From peer reviewed journal articles, to assorted reports and grey literature, to datasets comprising numerical, textual or multimedia files; we generate thousands of research outputs.
In this session, Kirsten Thompson (OD&PL) and Nick Sheppard (Library) will discuss strategies for increasing quality online engagement with that research. We will explore how you can use ‘alternative metrics’, more commonly known as ‘altmetrics’, to monitor such engagement. Altmetrics can help to showcase the reach of your work, supplement grant and tenure applications, identify new audiences, and connect with other researchers in your discipline.
In the age of “fake news”, academics have a responsibility to share their expertise beyond the Ivory Tower. We’ll show you how to ensure all these disparate outputs are properly curated in university repositories with a Digital Object Identifier (DOI). There will also be an opportunity to learn about and contribute to the Library led Data Management Engagement Award, a first-ever competition launched to elicit new and imaginative ideas for engaging researchers in the practices of good Research Data Management (RDM).
How altmetrics can help researchers broaden the reach of their work. Workshop facilitated by Kirsten Thompson and Nick Sheppard at the University of Leeds for the #PepnetLeeds network November 28th 2018.
Big Data in Pediatric Critical Care by Mohit MehraData Con LA
Abstract:- There is an urgent need in the pediatric ICUs to collect, store and transform healthcare data to make accurate and timely predictions in the areas of patient outcomes and treatment recommendations. We are currently heavily invested in using open source big data stacks in order to achieve this goal and help our young ones. In this talk I can highlight how we go about managing structured and unstructured high frequency data generated from a disparate set of devices and systems and ultimately how we have created data pipelines to process the data and make it available for data scientists and app developers.
The Royal Society of Chemistry has provided access to data associated with millions of chemical compounds via our ChemSpider database for over 5 years. During this period the richness and complexity of the data has continued to expand dramatically and the original vision for providing an integrated hub for structure-centric data has been delivered across the world to hundreds of thousands of users. With an intention of expanding the reach to cover more diverse aspects of chemistry-related data including compounds, reactions and analytical data, to name just a few data-types, we are in the process of implementing a new architecture to build a Chemistry Data Repository. The data repository will manage the challenges of associated metadata, the various levels of required security (private, shared and public) and exposing the data as appropriate using semantic web technologies. Ultimately this platform will become the host for all chemicals, reactions and analytical data contained within RSC publications and specifically supplementary information. This presentation will report on how our efforts to manage chemistry related data has impacted chemists and projects across the world and will review specifically our contributions to projects involving natural products for collaborators in Brazil and China, for the Open Source Drug Discovery project in India, and our collaborations with scientists in Russia.
Capacity Building: Data Science in the University At Rensselaer Polytechnic ...James Hendler
In this short talk, presented at the ITU's Capacity Building Symposium, I review some of the pedagogical innovation in data science happening at Rensselaer (RPI) and some aspects of teaching data science that are crucial to larger success.
Measuring Impact: Towards a data citation metricEdward Baker
How the ViBRANT and eMonocot projects are building tools, including a modified implementation of Bourne and Fink's 'Scholar Factor', the Biodiversity Data Journal, and Scratchpad's user metrics and statistics modules.
Astronomy is a collaborative science, but it has also become highly specialized, as many other disciplines. Improvement of sharing, discovery and access to resources will enable astronomers to greatly benefit from each other’s highly specialized knowhow. Some initiatives led by scientists and publishers, complement traditional paper publishing with assets published in more interactive digital formats. Among the main goals of these efforts are improving the reproducibility and clarity of the scientific outcome, going beyond the static PDF file, and fostering re-use, which turns into a more efficient exploitation of available digital resources.
With the flourishing environment of platforms for sharing data, establishing an online profile and engaging in scientific discourse through alternative modes of publishing and participation, there are numerous potential benefits. However, while many scientists invest significant amounts of time in sharing their activities and opinions with friends and family the majority do not make use of the new opportunities to participate in the developing social web of science, despite the potential impact and influence on future careers. We now have many new ways to contribute to science outside of the classical publishing model. These include the ability to annotate and curate data, to “publish” in new ways on blogs and micropublishing sites, and many of these activities can be as part of a growing crowdsourcing network. Our efforts in this area are already being indexed and exposed on the internet via our publications, presentations and data and increasingly we are being quantified. This presentation will provide an overview of the various types of networking and collaborative sites available to scientists and ways to expose their scientific activities online. Many of these can ultimately contribute to the developing metrics of a scientist as identified in the new world of alternative metrics. Participation offers a great opportunity to develop a scientific profile within the community and may ultimately be very beneficial, especially to scientists early in their career.
This presentation was given at the UKICRS meeting (http://www.ukicrs.org/2015-symposium.html) on April 16th 2015. This presentation was in a workshop and focused on trying to inform attendees in the postgraduate phases of their careers how to use online tools to start building a reputation and profile in their field
UCT eResearch is about IT-enabled research discovery at the University of Cape Town. This slide show was presented to IT faculty managers and technical staff to demonstrate how UCT eResearch can assist faculties at the university to raise eResearch capacity.
Ontologies For the Modern Age - McGuinness' Keynote at ISWC 2017Deborah McGuinness
Ontologies are seeing a resurgence of interest and usage as big data proliferates, machine learning advances, and integration of data becomes more paramount. The previous models of sometimes labor-intensive, centralized ontology construction and maintenance do not mesh well in today’s interdisciplinary world that is in the midst of a big data, information extraction, and machine learning explosion. In this talk, we will provide some historical perspective on ontologies and their usage, and discuss a model of building and maintaining large collaborative, interdisciplinary ontologies along with the data repositories and data services that they empower. We will give a few examples of heterogeneous semantic data resources made more interconnected and more powerful by ontology-supported infrastructures, discuss a vision for ontology-enabled future research and provide some examples in a large health empowerment joint effort between RPI and IBM Watson Health.
The purpose, practicalities, pitfalls and policies of managing and sharing da...Danny Kingsley
Talk to the Royal Society of Chemistry, Chemical Information and Computer Applications Group conference - Measurement, Information and Innovation: Digital Disruption in the Chemical Sciences. Tuesday 20th October 2015, RSC, Burlington House, Piccadilly, London
Many of us nowadays invest significant amounts of time in sharing our activities and opinions with friends and family via social networking tools such as Facebook, Twitter or other related websites. However, despite the availability of many platforms for scientists to connect and share with their peers in the scientific community the majority do not make use of these tools, despite their promise and potential impact and influence on our careers. We are already being indexed and exposed on the internet via our publications, presentations and data and new “AltMetric scores” are being assigned to scientific publications as measures of popularity and, supposedly, of impact. We now have even more ways to contribute to science, to annotate and curate data, to “publish” in new ways, and many of these activities are as part of a growing crowdsourcing network. This presentation provides an overview of the various types of networking and collaborative sites available to scientists and ways to expose your scientific activities online. It will discuss the new world of AltMetrics that is in an explosive growth curve and will help you understand how to influence and leverage some of these new measures. Participating online, whether it be simply for career advancement or for wider exposure of your research, there are now a series of web applications that can provide a great opportunity to develop a scientific profile within the community.
Public engagement while you sleep? How altmetrics can help researchers broade...UoLResearchSupport
Slides from a seminar delivered for pepnet at the University of Leeds 28 Nov 2018. Thanks to Charlotte Perry-Houts for extra content:
From peer reviewed journal articles, to assorted reports and grey literature, to datasets comprising numerical, textual or multimedia files; we generate thousands of research outputs.
In this session, Kirsten Thompson (OD&PL) and Nick Sheppard (Library) will discuss strategies for increasing quality online engagement with that research. We will explore how you can use ‘alternative metrics’, more commonly known as ‘altmetrics’, to monitor such engagement. Altmetrics can help to showcase the reach of your work, supplement grant and tenure applications, identify new audiences, and connect with other researchers in your discipline.
In the age of “fake news”, academics have a responsibility to share their expertise beyond the Ivory Tower. We’ll show you how to ensure all these disparate outputs are properly curated in university repositories with a Digital Object Identifier (DOI). There will also be an opportunity to learn about and contribute to the Library led Data Management Engagement Award, a first-ever competition launched to elicit new and imaginative ideas for engaging researchers in the practices of good Research Data Management (RDM).
How altmetrics can help researchers broaden the reach of their work. Workshop facilitated by Kirsten Thompson and Nick Sheppard at the University of Leeds for the #PepnetLeeds network November 28th 2018.
Big Data in Pediatric Critical Care by Mohit MehraData Con LA
Abstract:- There is an urgent need in the pediatric ICUs to collect, store and transform healthcare data to make accurate and timely predictions in the areas of patient outcomes and treatment recommendations. We are currently heavily invested in using open source big data stacks in order to achieve this goal and help our young ones. In this talk I can highlight how we go about managing structured and unstructured high frequency data generated from a disparate set of devices and systems and ultimately how we have created data pipelines to process the data and make it available for data scientists and app developers.
The Royal Society of Chemistry has provided access to data associated with millions of chemical compounds via our ChemSpider database for over 5 years. During this period the richness and complexity of the data has continued to expand dramatically and the original vision for providing an integrated hub for structure-centric data has been delivered across the world to hundreds of thousands of users. With an intention of expanding the reach to cover more diverse aspects of chemistry-related data including compounds, reactions and analytical data, to name just a few data-types, we are in the process of implementing a new architecture to build a Chemistry Data Repository. The data repository will manage the challenges of associated metadata, the various levels of required security (private, shared and public) and exposing the data as appropriate using semantic web technologies. Ultimately this platform will become the host for all chemicals, reactions and analytical data contained within RSC publications and specifically supplementary information. This presentation will report on how our efforts to manage chemistry related data has impacted chemists and projects across the world and will review specifically our contributions to projects involving natural products for collaborators in Brazil and China, for the Open Source Drug Discovery project in India, and our collaborations with scientists in Russia.
Capacity Building: Data Science in the University At Rensselaer Polytechnic ...James Hendler
In this short talk, presented at the ITU's Capacity Building Symposium, I review some of the pedagogical innovation in data science happening at Rensselaer (RPI) and some aspects of teaching data science that are crucial to larger success.
This presentation was provided by Rick Johnson of Notre Dame University during the NISO virtual conference, That Cutting Edge: Technology's Impact on Scholarly Research Processes in the Library, held on October 24, 2018.
Talk given at the SKAO Speaker Series in June 2021, an update of the one given at Sciencedigital@UNGA75 on 29th September 2020 as part of a series of side events to mark the 75th anniversary of the United Nations General Assembly.
SGCI-URSSI-Sustainability in Research ComputingSandra Gesing
Sustainability in research computing has many facets such as funding and career paths for facilitators and research software engineers. The concern about sustainability is addressed in projects like the Science Gateways Community Institute (SGCI) and the conceptualization of the US Research Software Sustainability Institute (URSSI). Many further initiatives and projects are concerned with sustainability and the discussion at the ACI-REF VR Intermediate Workshop led to some consolidation ideas.
A talk at the Urban Science workshop at the Puget Sound Regional Council July 20 2014 organized by the Northwest Institute for Advanced Computing, a joint effort between Pacific Northwest National Labs and the University of Washington.
Rebecca Grant DAH Research Presentationdri_ireland
Presentation given by Rebecca Grant of the Digital Repository of Ireland at the Research in the Digital Age symposium at the Trinity Long Room Hub, 14 July, 2015. The presentation gives an overview of some of the key concepts and drivers in research data management for the arts and humanities, and introduces the Digital Repository of Ireland as potential place of deposit for such data.
SciDataCon 2014 Data Papers and their applications workshop - NPG Scientific ...Susanna-Assunta Sansone
Part of the SciDataCon14 workshop on "Data Papers and their applications" run by myself and Brian Hole to help attendees understand current data-publishing journals and trends and help them understand the editorial processes on NPG's Scientific Data and Ubiquity's Open Health Data.
Slides from Monday 30 July - Data in the Scholarly Communications Life Cycle Course which is part of the FORCE11 Scholarly Communications Institute.
Presenter - Natasha Simons
The Liber 2009 presentation repeated for a Dutch audience IN Dutch but with the english slides (just the first one is in Dutch :-)
Samenwerking Hogeschool bibliotheken SHB, 5 november 2009
Similar to Open Science for sustainability and inclusiveness: the SKA role model (20)
As Europe's leading economic powerhouse and the fourth-largest hashtag#economy globally, Germany stands at the forefront of innovation and industrial might. Renowned for its precision engineering and high-tech sectors, Germany's economic structure is heavily supported by a robust service industry, accounting for approximately 68% of its GDP. This economic clout and strategic geopolitical stance position Germany as a focal point in the global cyber threat landscape.
In the face of escalating global tensions, particularly those emanating from geopolitical disputes with nations like hashtag#Russia and hashtag#China, hashtag#Germany has witnessed a significant uptick in targeted cyber operations. Our analysis indicates a marked increase in hashtag#cyberattack sophistication aimed at critical infrastructure and key industrial sectors. These attacks range from ransomware campaigns to hashtag#AdvancedPersistentThreats (hashtag#APTs), threatening national security and business integrity.
🔑 Key findings include:
🔍 Increased frequency and complexity of cyber threats.
🔍 Escalation of state-sponsored and criminally motivated cyber operations.
🔍 Active dark web exchanges of malicious tools and tactics.
Our comprehensive report delves into these challenges, using a blend of open-source and proprietary data collection techniques. By monitoring activity on critical networks and analyzing attack patterns, our team provides a detailed overview of the threats facing German entities.
This report aims to equip stakeholders across public and private sectors with the knowledge to enhance their defensive strategies, reduce exposure to cyber risks, and reinforce Germany's resilience against cyber threats.
Show drafts
volume_up
Empowering the Data Analytics Ecosystem: A Laser Focus on Value
The data analytics ecosystem thrives when every component functions at its peak, unlocking the true potential of data. Here's a laser focus on key areas for an empowered ecosystem:
1. Democratize Access, Not Data:
Granular Access Controls: Provide users with self-service tools tailored to their specific needs, preventing data overload and misuse.
Data Catalogs: Implement robust data catalogs for easy discovery and understanding of available data sources.
2. Foster Collaboration with Clear Roles:
Data Mesh Architecture: Break down data silos by creating a distributed data ownership model with clear ownership and responsibilities.
Collaborative Workspaces: Utilize interactive platforms where data scientists, analysts, and domain experts can work seamlessly together.
3. Leverage Advanced Analytics Strategically:
AI-powered Automation: Automate repetitive tasks like data cleaning and feature engineering, freeing up data talent for higher-level analysis.
Right-Tool Selection: Strategically choose the most effective advanced analytics techniques (e.g., AI, ML) based on specific business problems.
4. Prioritize Data Quality with Automation:
Automated Data Validation: Implement automated data quality checks to identify and rectify errors at the source, minimizing downstream issues.
Data Lineage Tracking: Track the flow of data throughout the ecosystem, ensuring transparency and facilitating root cause analysis for errors.
5. Cultivate a Data-Driven Mindset:
Metrics-Driven Performance Management: Align KPIs and performance metrics with data-driven insights to ensure actionable decision making.
Data Storytelling Workshops: Equip stakeholders with the skills to translate complex data findings into compelling narratives that drive action.
Benefits of a Precise Ecosystem:
Sharpened Focus: Precise access and clear roles ensure everyone works with the most relevant data, maximizing efficiency.
Actionable Insights: Strategic analytics and automated quality checks lead to more reliable and actionable data insights.
Continuous Improvement: Data-driven performance management fosters a culture of learning and continuous improvement.
Sustainable Growth: Empowered by data, organizations can make informed decisions to drive sustainable growth and innovation.
By focusing on these precise actions, organizations can create an empowered data analytics ecosystem that delivers real value by driving data-driven decisions and maximizing the return on their data investment.
Open Science for sustainability and inclusiveness: the SKA role model
1. Open Science for sustainability and inclusiveness:
the SKA role model
Lourdes Verdes-Montenegro, Susana Sánchez
IAA Severo Ochoa Centre of Excellence (CSIC)
2. Open Science: a new concept?
• Too many adjectives for science:
excellent, high quality, trustable, … Open
3. • Too many adjectives for science:
excellent, high quality, trustable, … Open
Scientific Reproducibility is a fundamental
principle of the Scientific Method, a
process established in the 17th century that
marked the beginning of modern science
and laid the foundations for the Philosophy
of Science
• Let’s go back 383 years in time…
• Science = Scientific Method = Reproducible = Open!
Open Science: a new concept?
4. Open Science: but then we already follow it, right?
• We are scientists! We (want to) follow the Scientific Method!
2016
5. Open Science: but then we already follow it, right?
• We are scientists! We (want to) follow the Scientific Method!
Questionnaire on reproducibility (1500
scientists)
• 70% of researchers have tried and failed to
reproduce another scientist's experiments
• > 50% have failed to reproduce their own ones!
• Chemistry: 90% (60%)
• Biology: 80% (60%)
• Physics and engineering: 70% (50%)
• Medicine: 70% (60%)
• Earth and environmental science: 60% (40%)
2016
6. Open Science: but then we already follow it, right?
• We are scientists! We (want to) follow the Scientific Method!
Questionnaire on reproducibility (1500
scientists)
• 70% of researchers have tried and failed to
reproduce another scientist's experiments
• > 50% have failed to reproduce their own ones!
• Chemistry: 90% (60%)
• Biology: 80% (60%)
• Physics and engineering: 70% (50%)
• Medicine: 70% (60%)
• Earth and environmental science: 60% (40%)
2016
Aha!
So you don’t
empathise?
7. Open Science: but then we already follow it, right?
• We are scientists! We (want to) follow the Scientific Method!
Questionnaire on reproducibility (1500
scientists)
• 70% of researchers have tried and failed to
reproduce another scientist's experiments
• > 50% have failed to reproduce their own ones!
• Chemistry: 90% (60%)
• Biology: 80% (60%)
• Physics and engineering: 70% (50%)
• Medicine: 70% (60%)
• Earth and environmental science: 60% (40%)
2016
Aha!
So you don’t
empathise?
Maybe with this?
8. Open Science: then what happened since 1637?
• Moving beyond the PDF
40% Knowledge Burying in paper publication =
Rest In Paper
(S. Bechhofer 2011, Research Objects: Towards Exchange and Reuse of
Digital Knowledge)
http://www.clipartkid.com/rip-cliparts/
9. Open Science: then what happened since 1637?
• Moving beyond the PDF
http://www.clipartkid.com/rip-cliparts/
Moving from narratives (last 300 yrs) to the
actual output of research is not so easy
40% Knowledge Burying in paper publication =
Rest In Paper
(S. Bechhofer 2011, Research Objects: Towards Exchange and Reuse of
Digital Knowledge)
In practice
10. Open Science represents an approach to research that is collaborative,
transparent and accessible
Open Science definition, European Commission, 2017, doi: 10.2777/75255
Open Science for sustainability and inclusiveness
Remember: Open Science started bottom-up
(DORA, Metric Tide, Leiden Manifesto, Altmetrics, etc)
Open Science embodies the need to transform, open and democratize
the entire knowledge generation to ensure that every scientific challenge
is faced and really drives and allows the achievement of the United
Nations Sustainable Development Goals
UNESCO and Open Science [1]
[1]https://en.unesco.org/news/unesco-launches-global-consultation-develop-standard-setting-instrument-open-science
11. Open Science for sustainability and inclusiveness
Science hidden behind paywall barriers
• Free access to research sources to the whole scientific
community = limitations to science progress
• OS = Data and results more accessible and reliable
• OS = Promotion of scholarly exchange of ideas
• OS = Avoid duplication
Acceleration of knowledge transfer to Society, pandemics,
sanitary crisis
• Speed up building of skills
• Teaching, e.g. how to access public archives, fostering collaborative
practices
• Citizen science
12. Open Science for sustainability and inclusiveness
Promote equity, diversity and inclusion
• All previous items +
• A tool enabling an objective evaluation of work
• Barriers are even more emphasized to scientist women in
places where their contribution tend to be ignored or
anonymized.
14. Open Science in the Big Data era
We are in a race to exploit ever larger datasets
in our quest for “efficiency” we risk forgetting about reproducibility
The era of Big Data is beginning across sciences
Today is the time to ask
What kind of research want mega-science
infrastructures to do in tomorrow’s future?
15. The Square Kilometre Array
SKA1 Observatory
SKA Fact sheets. August 2018. skatelescope.org
The SKA Regional Centres, the core of the SKA Science
Credits: AENEAS
project
Credits:skatelescope.irg
16. Access to data products, tools and processing power to generate and analize
Advanced Data Products (ADPs)
The SKA Regional Centre Network
The core
of SKA
science
17. Access to data products, tools and processing power to generate and analize
Advanced Data Products (ADPs)
The SKA Regional Centre Network
The core
of SKA
science
Global shift in research practices
18. SKA duty: a role model
Reproducibility as a metric of success
Adoption of Open Science values
19. 2017-2019
http://www.aeneas2020.eu
Applying Open Science to build a
federated network of European SRCs
SKA is contributing to build the
European Open Science Cloud (EOSC)
Addressing the Open Science challenges
shared by ESFRI facilities and other pan-
European research infrastructures in
Astronomy and Particle Physics
2019-2022
https://projectescape.eu/
20. The challenge from different perspectives
Implementation of Open, reproducible science
is challenging, even more in this new
framework:
Individual users
Large teams
Service providers
Publishers
Evaluators/Funding
agencies
new roles new perspectives
21. Data to the desktop: “individual scientist”
• I have the best code, which I know how to use and can do special things
• I do not trust any pipeline that you made
• partly because I know better how to do it
• partly because I read the news and there is a reproducibility crisis
• well, and I can hardly reproduce the results of my own papers some years later...
• In general I want full control of the software and of the computational
environment
About trust
22. Computation to data, providers perspective: Data Centres
About technology
• We need to install your software in our platform. Can we trust it? Can
we run it? Environment, dependencies, etc
• Hey, we are offering services to the community, computation + tools.
We would be grateful if you allow us to share it with other users (with
proper credit)
• Mmmm, sharing is great, but, putting the software in the platform is
not enough: you need to provide the context for people to be able to
rerun the software on the same or other data
23. Large alliances of scientists
About metrics of research careers
• We have tools to generate Advanced Data Products, and we will put
them there where the storage and computation is (Data Centres)
• But... we put effort on it, what would we gain if we make the
*additional effort* to make it reusable? If we make it, then we will
pave the way to competitors
• Well, maybe we will share in 4 yrs time (PhD typical time)
24. Publishers
Publishing models
• Will we need different profiles of referees to evaluate the scientific
discussion together with the data quality and the methods (aka.
Reproducibility)?
• If the data and the methods (tools) will be in Data Centres, will our
referees need to become a “user” of the Data Centres to be able to
validate a paper?
• Will we be able to engage so many referees as may be needed?
• Will we need to validate the data, the tools, and the scientific analysis
separetely?
25. Policy makers / funding agencies
Evaluation
• How to measure reproducibility?
• How to weight it and/or aggregate with other indicators?
26. The Challenge: extraction of Scientific Knowledge
Huge and complex data volumes
Large teams distributed globally
A shared challenge for data-intensive research
Computing / storage / network / human resources will be needed:
• Efficient exploitation of Distributed Computing Infrastructures
• Large international alliances of scientists
• Tools to enhance scientific collaboration
• Platforms to share data, methods and knowledge
OpenScience
&e-Science
Open Science is the Aim and also the Mean
27. Supporting Open Science in the SKA Regional Centres
A network of interoperable SRCs should provide:
– access to project data / storage capacity for archiving SKA data
– place for software analysis, modelling, visualisation, algorithm
development / computational capacity, validation
– All this through a platform that
– is transparent and a location agnostic interface for users
– enables collaborative science
– User support (to enable scientific exploitation of the SKA data), training
the new generation of radioastronomers, citizen science
Antonio Chrysostomou’s talk
28. Key ingredients of the SRCs to support Open Science
Science
Gateway
New
Metrics
Workflows and
provenance
Reproducible
Notebooks
Standards for data
interoperability
(IVOA)
access to project data
place for software
analysis, visualisation
Location agnostic platform
Collaborative Science
User support and training
Open
to
society
Citizen Science
29. SKA: a mega-science infrastructure engaged with trustable,
reproducible science, going beyond numbers of papers/citations:
pursuing the Scientific Method
Difficult? yes!
But this word doesn’t
seem to intimidate
those aiming to build
the SKA
30. SKA: a mega-science infrastructure engaged with trustable,
reproducible science, going beyond numbers of papers/citations:
pursuing the Scientific Method
Difficult? yes!
But this word doesn’t
seem to intimidate
those aiming to build
the SKA
THANK YOU FOR YOUR
ATTENTION