Presented at ASIS&T 2009 in the student awards section. The presentation contains an overview of my dissertation proposal, as 2009 winner of the Thomson Reuters Information Science Doctoral Dissertation Proposal Scholarship, administered by the ASIS&T Information Science Education Committee
Thesis Proposal, as presented for dissertation proposal defenseHeather Piwowar
The slides I presented for my PhD proposal defense for my project, "Foundational studies for measuring the impact, prevalence, and patterns of publicly sharing biomedical research data." Dept of Biomedical Informatics, University of Pittsburgh.
Scott Edmunds slides from class 7 from the HKU Data Curation (module MLIM7350 from the Faculty of Education) course covering open data policy and practice, and the Hong Kong context.
searching tips and tools, recommendations, getting the most from databases, finding RCTs, EBP, evidence based practice, hospital library, DeepWeb, Grey Literature, Altmetrics,
Open Access and Property Rights on a Collision Course with ScholarsKimberly Yang
This seminar talk focused on open access as a philosophy directly affecting scholars, academics, and consumers and its tension with property rights (intellectual property, copyright, proprietary databases). These impact our ability (or inability) to search, identify, retrieve, access, and use full-text publications or data relevant to research topics and investigations. It impacts our ability to use others' works (and our own work) in our research, writing, publishing and teaching.
Thesis Proposal, as presented for dissertation proposal defenseHeather Piwowar
The slides I presented for my PhD proposal defense for my project, "Foundational studies for measuring the impact, prevalence, and patterns of publicly sharing biomedical research data." Dept of Biomedical Informatics, University of Pittsburgh.
Scott Edmunds slides from class 7 from the HKU Data Curation (module MLIM7350 from the Faculty of Education) course covering open data policy and practice, and the Hong Kong context.
searching tips and tools, recommendations, getting the most from databases, finding RCTs, EBP, evidence based practice, hospital library, DeepWeb, Grey Literature, Altmetrics,
Open Access and Property Rights on a Collision Course with ScholarsKimberly Yang
This seminar talk focused on open access as a philosophy directly affecting scholars, academics, and consumers and its tension with property rights (intellectual property, copyright, proprietary databases). These impact our ability (or inability) to search, identify, retrieve, access, and use full-text publications or data relevant to research topics and investigations. It impacts our ability to use others' works (and our own work) in our research, writing, publishing and teaching.
GenomeTrakr: Perspectives on linking internationally - Canada and IRIDA.cafionabrinkman
Talk at GenomeTrakr network meeting Sept 23 2015 in Washington DC. On Canada's open source Integrated Rapid Infectious Disease Analysis (IRIDA) bioinformatics platform - aiding genomic epidemiology analysis for public health agencies with planned open data release and linkage to GenomeTrakr. Discussed perspectives, challenges, solutions for getting more GenomeTrakr participation internationally.
Web-scale Discovery Tools and the Backgrounding of Government InformationChristopher Brown
Brown, Christopher. C. “Web-scale Discovery Tools and the Backgrounding of Government Information.” Presentation given at the Electronic Resources & Libraries Conference, 23 February 2015, Austin, TX.
Web-scale discovery tools have opened up new realms of discovery for libraries and their electronic resources. However, government information is being short-changed and backgrounded in these tools. Find out why this is happening, the diagnostic tests to show that this is so, and what we can do about it.
Google is not a doctor and so we don't complete (just a clarification regarding the title given to me by the conference organizers). Presentation at the annual convention of the Philippine Society of Nephrology, 27 April 2019, EDSA Shangrila Hotel.
Univ of Miami CTSI: Citizen science seminar; Oct 2014Richard Bookman
The University of Miami's Clinical & Translational Science Institute runs a seminar course for MS students.
This talk surveys 8 citizen science projects, reviews NIH's current activities, and identifies issues for attention, particularly with ethical, legal and social implications.
Disseminating Research and Managing Your Online Reputation Katja Reuter, PhD
This slide deck was presented at the 2017 ACR/ARHP Annual Meeting. It provided a general overview of the topic and addresses the following learning objectives include: (1) Understand the potential and limitations of digital dissemination of research; (2)
Understand relevant health content regulations, guidelines and ethics, (3) Understand the concept of and tools for measuring the results of one’s digital efforts, and (4) Understand the concept of and tools for online reputation management.
Paradise Lost and The Right to Read is the Right to Minepetermurrayrust
Presented to UIUC CIRSS seminars to a mixed group of Library, CS, domain scientists with a great contingent of Early Career Researchers. Starts by honouring the creation of the wonderful NCSA Mosaic at UIUC in 1993 and the paradise of knowledge and community it opened. Then shows the gradual and tragic decline of the web into a megacorporate neocolonialist empire, where knowledge is sacrificed for money and power.
You have seen many of the slides before but the words are different and have been recorded.
Laurie Goodman on "Overcoming Hurdles to Data Publication" for the Alan Turing Institute Symposium on Reproducibility for Data-Intensive Research, Oxford, 7th April 2016.
CINECA webinar slides: Open science through fair health data networks dream o...CINECAProject
Since the FAIR data principles were published in 2016, many organizations including science funders and governments have adopted these principles to promote and foster true open science collaborations. However, to define a vision and create a video of a Personal Health Train that leverages worldwide FAIR health data in a federated manner is one step. To actually make this happen at scale and be able to show new scientific and medical insights for it is quite another!
In this webinar, we will dive into the basics of FAIR health data, but also take stock of the current situation in health data networks: after a year of frantic research and collaborations and many open datasets and hackathons on COVID-19, has the situation actually improved? Are we sharing health data on a global scale to improve medical practice, or is quality medical data still only accessible to researchers with the right credentials and deep pockets?
This webinar is part of the “How FAIR are you” webinar series and hackathon, which aim at increasing and facilitating the uptake of FAIR approaches into software, training materials and cohort data, to facilitate responsible and ethical data and resource sharing and implementation of federated applications for data analysis.
The CINECA webinar series aims to discuss ways to address common challenges and share best practices in the field of cohort data analysis, as well as distribute CINECA project results. All CINECA webinars include an audience Q&A session during which attendees can ask questions and make suggestions. Please note that all webinars are recorded and available for posterior viewing. CINECA webinars include an audience Q&A session during which attendees can ask questions and make suggestions.
This webinar took place on 21st January 2021 and is part of the CINECA webinar series.
For previous and upcoming CINECA webinars see:
https://www.cineca-project.eu/webinars
In the late Fall and Winter of 2018, the Pistoia Alliance in cooperation with Elsevier and charitable organizations Cures within Reach and Mission: Cure ran a datathon aiming to find drugs suitable for treatment of childhood chronic pancreatitis, a rare disease that causes extreme suffering. The datathon resulted in identification of four candidate compounds in a short time frame of just under three months. In this webinar our speakers discuss the technologies that made this leap possible
Dr. Dennis Wang discusses possible ways to enable ML methods to be more powerful for discovery and to reduce ambiguity within translational medicine, allowing data-informed decision-making to deliver the next generation of diagnostics and therapeutics to patients quicker, at lowered costs, and at scale.
The talk by Dr. Dennis Wang was followed by a panel discussion with Mr. Albert Wang, M. Eng., Head, IT Business Partner, Translational Research & Technologies, Bristol-Myers Squibb.
In recent years there has been a dramatic increase in the number of freely accessible online databases serving the chemistry community. The internet provides chemistry data that can be used for data-mining, for computer models, and integration into systems to aid drug discovery. There is however a responsibility to ensure that the data are high quality to ensure that time is not wasted in erroneous searches, that models are underpinned by accurate data and that improved discoverability of online resources is not marred by incorrect data. In this article we provide an overview of some of the experiences of the authors using online chemical compound databases, critique the approaches taken to assemble data and we suggest approaches to deliver definitive reference data sources.
Researchers and public health practitioners increasingly use Internet big data as data source. What are some of the ethical problems, and how should they be tackled? The author advocates the creation of a self-regulatory body of researchers, a code of conduct, and a notice/opt-out infrastructure, to avoid a public backlash against social media tracking/monitoring for public health, similar to the Facebook fiasko in 2014 (Cornell study).
2016 Data Commons and Data Science Workshop June 7th and June 8th 2016. Genomic Data Commons, FAIR, NCI and making data more findable, publicly accessible, interoperable (machine readable), reusable and support recognition and attribution
Presentation at AMIA 2013 Washington DC, Nov 19th, Panel S50 Social Media and Me. I am focussing on the use of social media for research, in particular as tool for filtering the literature, twimpact factor, altmetrics...
Tweet Your Pubs: How Altmetrics are Changing the Way We Measure Research ImpactRobin Featherstone
Presentation given to the Northern Alberta Health Libraries Association (NAHLA) Trends Mini Conference in Edmonton at the University of Alberta on May 2, 2014
Invitational talk from the NSF/NCI workshop "Cyberinfrastructure in Behavioral Medicine" in San Diego on March 31st 2008, talking about what I call infodemiology / infoveillance work
DOE-NCI Pilots presentation at the Frederick National Laboratory Advisory Com...Warren Kibbe
May 2016 FNLAC presentation of the DOE-NCI partnership around three pilots focused on existing projects in NCI and existing NSCI directives and activities in DOE.
GenomeTrakr: Perspectives on linking internationally - Canada and IRIDA.cafionabrinkman
Talk at GenomeTrakr network meeting Sept 23 2015 in Washington DC. On Canada's open source Integrated Rapid Infectious Disease Analysis (IRIDA) bioinformatics platform - aiding genomic epidemiology analysis for public health agencies with planned open data release and linkage to GenomeTrakr. Discussed perspectives, challenges, solutions for getting more GenomeTrakr participation internationally.
Web-scale Discovery Tools and the Backgrounding of Government InformationChristopher Brown
Brown, Christopher. C. “Web-scale Discovery Tools and the Backgrounding of Government Information.” Presentation given at the Electronic Resources & Libraries Conference, 23 February 2015, Austin, TX.
Web-scale discovery tools have opened up new realms of discovery for libraries and their electronic resources. However, government information is being short-changed and backgrounded in these tools. Find out why this is happening, the diagnostic tests to show that this is so, and what we can do about it.
Google is not a doctor and so we don't complete (just a clarification regarding the title given to me by the conference organizers). Presentation at the annual convention of the Philippine Society of Nephrology, 27 April 2019, EDSA Shangrila Hotel.
Univ of Miami CTSI: Citizen science seminar; Oct 2014Richard Bookman
The University of Miami's Clinical & Translational Science Institute runs a seminar course for MS students.
This talk surveys 8 citizen science projects, reviews NIH's current activities, and identifies issues for attention, particularly with ethical, legal and social implications.
Disseminating Research and Managing Your Online Reputation Katja Reuter, PhD
This slide deck was presented at the 2017 ACR/ARHP Annual Meeting. It provided a general overview of the topic and addresses the following learning objectives include: (1) Understand the potential and limitations of digital dissemination of research; (2)
Understand relevant health content regulations, guidelines and ethics, (3) Understand the concept of and tools for measuring the results of one’s digital efforts, and (4) Understand the concept of and tools for online reputation management.
Paradise Lost and The Right to Read is the Right to Minepetermurrayrust
Presented to UIUC CIRSS seminars to a mixed group of Library, CS, domain scientists with a great contingent of Early Career Researchers. Starts by honouring the creation of the wonderful NCSA Mosaic at UIUC in 1993 and the paradise of knowledge and community it opened. Then shows the gradual and tragic decline of the web into a megacorporate neocolonialist empire, where knowledge is sacrificed for money and power.
You have seen many of the slides before but the words are different and have been recorded.
Laurie Goodman on "Overcoming Hurdles to Data Publication" for the Alan Turing Institute Symposium on Reproducibility for Data-Intensive Research, Oxford, 7th April 2016.
CINECA webinar slides: Open science through fair health data networks dream o...CINECAProject
Since the FAIR data principles were published in 2016, many organizations including science funders and governments have adopted these principles to promote and foster true open science collaborations. However, to define a vision and create a video of a Personal Health Train that leverages worldwide FAIR health data in a federated manner is one step. To actually make this happen at scale and be able to show new scientific and medical insights for it is quite another!
In this webinar, we will dive into the basics of FAIR health data, but also take stock of the current situation in health data networks: after a year of frantic research and collaborations and many open datasets and hackathons on COVID-19, has the situation actually improved? Are we sharing health data on a global scale to improve medical practice, or is quality medical data still only accessible to researchers with the right credentials and deep pockets?
This webinar is part of the “How FAIR are you” webinar series and hackathon, which aim at increasing and facilitating the uptake of FAIR approaches into software, training materials and cohort data, to facilitate responsible and ethical data and resource sharing and implementation of federated applications for data analysis.
The CINECA webinar series aims to discuss ways to address common challenges and share best practices in the field of cohort data analysis, as well as distribute CINECA project results. All CINECA webinars include an audience Q&A session during which attendees can ask questions and make suggestions. Please note that all webinars are recorded and available for posterior viewing. CINECA webinars include an audience Q&A session during which attendees can ask questions and make suggestions.
This webinar took place on 21st January 2021 and is part of the CINECA webinar series.
For previous and upcoming CINECA webinars see:
https://www.cineca-project.eu/webinars
In the late Fall and Winter of 2018, the Pistoia Alliance in cooperation with Elsevier and charitable organizations Cures within Reach and Mission: Cure ran a datathon aiming to find drugs suitable for treatment of childhood chronic pancreatitis, a rare disease that causes extreme suffering. The datathon resulted in identification of four candidate compounds in a short time frame of just under three months. In this webinar our speakers discuss the technologies that made this leap possible
Dr. Dennis Wang discusses possible ways to enable ML methods to be more powerful for discovery and to reduce ambiguity within translational medicine, allowing data-informed decision-making to deliver the next generation of diagnostics and therapeutics to patients quicker, at lowered costs, and at scale.
The talk by Dr. Dennis Wang was followed by a panel discussion with Mr. Albert Wang, M. Eng., Head, IT Business Partner, Translational Research & Technologies, Bristol-Myers Squibb.
In recent years there has been a dramatic increase in the number of freely accessible online databases serving the chemistry community. The internet provides chemistry data that can be used for data-mining, for computer models, and integration into systems to aid drug discovery. There is however a responsibility to ensure that the data are high quality to ensure that time is not wasted in erroneous searches, that models are underpinned by accurate data and that improved discoverability of online resources is not marred by incorrect data. In this article we provide an overview of some of the experiences of the authors using online chemical compound databases, critique the approaches taken to assemble data and we suggest approaches to deliver definitive reference data sources.
Researchers and public health practitioners increasingly use Internet big data as data source. What are some of the ethical problems, and how should they be tackled? The author advocates the creation of a self-regulatory body of researchers, a code of conduct, and a notice/opt-out infrastructure, to avoid a public backlash against social media tracking/monitoring for public health, similar to the Facebook fiasko in 2014 (Cornell study).
2016 Data Commons and Data Science Workshop June 7th and June 8th 2016. Genomic Data Commons, FAIR, NCI and making data more findable, publicly accessible, interoperable (machine readable), reusable and support recognition and attribution
Presentation at AMIA 2013 Washington DC, Nov 19th, Panel S50 Social Media and Me. I am focussing on the use of social media for research, in particular as tool for filtering the literature, twimpact factor, altmetrics...
Tweet Your Pubs: How Altmetrics are Changing the Way We Measure Research ImpactRobin Featherstone
Presentation given to the Northern Alberta Health Libraries Association (NAHLA) Trends Mini Conference in Edmonton at the University of Alberta on May 2, 2014
Invitational talk from the NSF/NCI workshop "Cyberinfrastructure in Behavioral Medicine" in San Diego on March 31st 2008, talking about what I call infodemiology / infoveillance work
DOE-NCI Pilots presentation at the Frederick National Laboratory Advisory Com...Warren Kibbe
May 2016 FNLAC presentation of the DOE-NCI partnership around three pilots focused on existing projects in NCI and existing NSCI directives and activities in DOE.
Perkenalan Program Studi Pendidikan Bahasa Inggris FKIP UNS Tahun 2016Hasan Zainnuri
Materi Perkenalan Program Studi Pendidikan Bahasa Inggris FKIP UNS Tahun 2016 ini disampaikan oleh Hasan Zainnuri, M.Pd. pada acara Pengenalan Kehidupan Kampus bagi Mahasiswa Baru (PKKMB) Universitas Sebelas Maret pada tanggal 20 Agustus 2016 di Aula Gedung A FKIP UNS.
"Leaders and Laggards in the preservation of raw biomedical research data" presented at NEDCC 2010, The Tectonics of Digital Curation
A Symposium on the Shifting Preservation and Access Landscape
Thesis defense, Heather Piwowar, Sharing biomedical research dataHeather Piwowar
Presentation by Heather Piwowar as PhD dissertation defense on March 24, 2010 at the Dept of Biomedical Informatics, U of Pittsburgh. "Foundational studies for
measuring the impact, prevalence, and patterns of publicly sharing biomedical research data." I passed :)
Why study Data Sharing? (+ why share your data)Heather Piwowar
A presentation to the DBMI department at the University of Pittsburgh about data sharing and reuse: what this means, why it is important, some of what we’ve learned, and what we still don’t know.
Abstract: http://j.mp/1MhWWei
Healthcare applications now have the ability to exploit big data in all its complexity. A crucial challenge is to achieve interoperability or integration so that a variety of content from diverse physical (IoT)- cyber (web-based)- and social sources, with diverse formats and modality (text, image, video), can be used in analysis, insight, and decision-making. At Kno.e.sis, an Ohio Center of Excellence in BioHealth Innovation, we have a variety of large, collaborative healthcare/clinical/biomedical projects, all involving domain experts and end-users, and access to real world data that include: clinical/EMR data (of individual patients and that related to public health), data from a variety of sensors (IoT) on and around patients measuring real-time physiological and environmental observations), social data (Twitter, Web forums, PatientsLikeMe), Web search logs, etc. Key projects include: Prescription drug abuse online-surveillance and epidemiology (PREDOSE), Social media analysis to monitor cannabis and synthetic cannabinoid use (eDrugTrends), Modeling Social Behavior for Healthcare Utilization in Depression, Medical Information Decision Assistant and Support (MIDAS) with application to musculoskeletal issues, kHealth: A Semantic Approach to Proactive, Personalized Asthma Management Using Multimodal Sensing (also for Dementia), and Cardiology Semantic Analysis System (with applications to Computer Assisted Coding and Computerized Document Improvement).
This talk will review how ontologies or knowledge graphs play a central role in supporting semantic filtering, interoperability and integration (including the issues such as disambiguation), reasoning and decision-making in all our health-centric research and applications. Additional relevant information is at the speaker’s HCLS page. http://knoesis.org/amit/hcls
Laurie Goodman at #aibsdata: Beyond Data Release Mandates - Helping Authors M...GigaScience, BGI Hong Kong
Laurie Goodman at the AIBS Changing Practices in Data Pub workshop: Beyond Data Release Mandates - Helping Authors Make Data Available. 3rd December 2014
May 2016 NCI Cancer Center Directors meeting. Data Sharing and the Cancer Genomic Data Commons (GDC). Focus is on cancer genomic and clinical phenotype data.
Scott Edmunds slides for class 8 from the HKU Data Curation (module MLIM7350 from the Faculty of Education) course covering science data, medical data and ethics, and the FAIR data principles.
Internal NIH Seminar to the BISTI Team on some early thoughts from the Associate Director for Data Science (ADDS). These ideas are for discussion only and in no way reflect what might happen subsequently. Presented April 1, 2014 (the date is purely a coincidence).
NIH Data Policy or: How I Learned to Stop Worrying and Love the Data Manageme...Kristin Briney
This talk provides background information on the NIH policy, what it is and how it came to be. It then goes through how to create a data sharing plan on your own and using the DMPTool. The talk wraps up with my top 5 recommendations for data management for those who have never done data management before.
The Path to Open Science with Illustrations from Computational Biology - A presentation made at the Microsoft 2011 Latin America Faculty Summit Cartagena, Columbia, May 18, 2011.
Similar to Thesis Proposal Piwowar Presentation 20091109 (20)
Calculating how much your University spends on Open Access--and what to do ab...Heather Piwowar
#NASIG2020 presentation
Librarians are working hard to understand how much money their university is spending on open access article processing fees (APCs), and how much of what they subscribe to is available as OA. This information is useful when making subscription decisions, considering Read and Publish agreements, rethinking library open access budgets, and designing Institution-wide OA policies.
This session will talk concretely about how to calculate the impact of Open Access on *your* university. It will provide an overview on how to estimate the amount of money spent across a university on Open Access fees: we will discuss underlying concepts behind calculating OA article-processing fee (APC) spend and give an overview of useful data sources, including Unsub.
Follow at @unsub_org
How to Calculate OA APC Spend for Your UniversityHeather Piwowar
Universities are hungry to know how much they spend on Open Access fees. This data is important to planning transformative and read and publish agreements, forming library strategy, and understanding scholarly communication on your campus. Unfortunately, it hasn’t been easy to calculate how much your university is spending on Open Access.
Learn how recent developments in data sources and tools have made this easier during this webinar. We will discuss the underlying concepts behind calculating OA article-processing fee (APC) spend, and provide you with paths to calculate the Open Access fees paid by your institution. ALCTS webinar.
Intro to Managing Serials with Net Cost per Paid UseHeather Piwowar
This webinar will introduce a new metric for evaluating the cost effectiveness of Serials: Net Cost Per Paid Use (NCPPU). NCPPU goes beyond the standard Cost Per Use calculation to exclude free content (OA and back catalog), incorporate ILL costs, and value citation and authorship. ALCTS webinar.
submission summary for #WSSSPE Policy session on Credit, Citation, and ImpactHeather Piwowar
submission summary for #WSSSPE Policy session on Credit, Citation, and Impact
presentation by Heather Piwowar
November 2013
agenda: http://wssspe.researchcomputing.org.uk/
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Tobias Schneck
As AI technology is pushing into IT I was wondering myself, as an “infrastructure container kubernetes guy”, how get this fancy AI technology get managed from an infrastructure operational view? Is it possible to apply our lovely cloud native principals as well? What benefit’s both technologies could bring to each other?
Let me take this questions and provide you a short journey through existing deployment models and use cases for AI software. On practical examples, we discuss what cloud/on-premise strategy we may need for applying it to our own infrastructure to get it to work from an enterprise perspective. I want to give an overview about infrastructure requirements and technologies, what could be beneficial or limiting your AI use cases in an enterprise environment. An interactive Demo will give you some insides, what approaches I got already working for real.
Epistemic Interaction - tuning interfaces to provide information for AI supportAlan Dix
Paper presented at SYNERGY workshop at AVI 2024, Genoa, Italy. 3rd June 2024
https://alandix.com/academic/papers/synergy2024-epistemic/
As machine learning integrates deeper into human-computer interactions, the concept of epistemic interaction emerges, aiming to refine these interactions to enhance system adaptability. This approach encourages minor, intentional adjustments in user behaviour to enrich the data available for system learning. This paper introduces epistemic interaction within the context of human-system communication, illustrating how deliberate interaction design can improve system understanding and adaptation. Through concrete examples, we demonstrate the potential of epistemic interaction to significantly advance human-computer interaction by leveraging intuitive human communication strategies to inform system design and functionality, offering a novel pathway for enriching user-system engagements.
GraphRAG is All You need? LLM & Knowledge GraphGuy Korland
Guy Korland, CEO and Co-founder of FalkorDB, will review two articles on the integration of language models with knowledge graphs.
1. Unifying Large Language Models and Knowledge Graphs: A Roadmap.
https://arxiv.org/abs/2306.08302
2. Microsoft Research's GraphRAG paper and a review paper on various uses of knowledge graphs:
https://www.microsoft.com/en-us/research/blog/graphrag-unlocking-llm-discovery-on-narrative-private-data/
Connector Corner: Automate dynamic content and events by pushing a buttonDianaGray10
Here is something new! In our next Connector Corner webinar, we will demonstrate how you can use a single workflow to:
Create a campaign using Mailchimp with merge tags/fields
Send an interactive Slack channel message (using buttons)
Have the message received by managers and peers along with a test email for review
But there’s more:
In a second workflow supporting the same use case, you’ll see:
Your campaign sent to target colleagues for approval
If the “Approve” button is clicked, a Jira/Zendesk ticket is created for the marketing design team
But—if the “Reject” button is pushed, colleagues will be alerted via Slack message
Join us to learn more about this new, human-in-the-loop capability, brought to you by Integration Service connectors.
And...
Speakers:
Akshay Agnihotri, Product Manager
Charlie Greenberg, Host
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...James Anderson
Effective Application Security in Software Delivery lifecycle using Deployment Firewall and DBOM
The modern software delivery process (or the CI/CD process) includes many tools, distributed teams, open-source code, and cloud platforms. Constant focus on speed to release software to market, along with the traditional slow and manual security checks has caused gaps in continuous security as an important piece in the software supply chain. Today organizations feel more susceptible to external and internal cyber threats due to the vast attack surface in their applications supply chain and the lack of end-to-end governance and risk management.
The software team must secure its software delivery process to avoid vulnerability and security breaches. This needs to be achieved with existing tool chains and without extensive rework of the delivery processes. This talk will present strategies and techniques for providing visibility into the true risk of the existing vulnerabilities, preventing the introduction of security issues in the software, resolving vulnerabilities in production environments quickly, and capturing the deployment bill of materials (DBOM).
Speakers:
Bob Boule
Robert Boule is a technology enthusiast with PASSION for technology and making things work along with a knack for helping others understand how things work. He comes with around 20 years of solution engineering experience in application security, software continuous delivery, and SaaS platforms. He is known for his dynamic presentations in CI/CD and application security integrated in software delivery lifecycle.
Gopinath Rebala
Gopinath Rebala is the CTO of OpsMx, where he has overall responsibility for the machine learning and data processing architectures for Secure Software Delivery. Gopi also has a strong connection with our customers, leading design and architecture for strategic implementations. Gopi is a frequent speaker and well-known leader in continuous delivery and integrating security into software delivery.
State of ICS and IoT Cyber Threat Landscape Report 2024 previewPrayukth K V
The IoT and OT threat landscape report has been prepared by the Threat Research Team at Sectrio using data from Sectrio, cyber threat intelligence farming facilities spread across over 85 cities around the world. In addition, Sectrio also runs AI-based advanced threat and payload engagement facilities that serve as sinks to attract and engage sophisticated threat actors, and newer malware including new variants and latent threats that are at an earlier stage of development.
The latest edition of the OT/ICS and IoT security Threat Landscape Report 2024 also covers:
State of global ICS asset and network exposure
Sectoral targets and attacks as well as the cost of ransom
Global APT activity, AI usage, actor and tactic profiles, and implications
Rise in volumes of AI-powered cyberattacks
Major cyber events in 2024
Malware and malicious payload trends
Cyberattack types and targets
Vulnerability exploit attempts on CVEs
Attacks on counties – USA
Expansion of bot farms – how, where, and why
In-depth analysis of the cyber threat landscape across North America, South America, Europe, APAC, and the Middle East
Why are attacks on smart factories rising?
Cyber risk predictions
Axis of attacks – Europe
Systemic attacks in the Middle East
Download the full report from here:
https://sectrio.com/resources/ot-threat-landscape-reports/sectrio-releases-ot-ics-and-iot-security-threat-landscape-report-2024/
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...Ramesh Iyer
In today's fast-changing business world, Companies that adapt and embrace new ideas often need help to keep up with the competition. However, fostering a culture of innovation takes much work. It takes vision, leadership and willingness to take risks in the right proportion. Sachin Dev Duggal, co-founder of Builder.ai, has perfected the art of this balance, creating a company culture where creativity and growth are nurtured at each stage.
Elevating Tactical DDD Patterns Through Object CalisthenicsDorra BARTAGUIZ
After immersing yourself in the blue book and its red counterpart, attending DDD-focused conferences, and applying tactical patterns, you're left with a crucial question: How do I ensure my design is effective? Tactical patterns within Domain-Driven Design (DDD) serve as guiding principles for creating clear and manageable domain models. However, achieving success with these patterns requires additional guidance. Interestingly, we've observed that a set of constraints initially designed for training purposes remarkably aligns with effective pattern implementation, offering a more ‘mechanical’ approach. Let's explore together how Object Calisthenics can elevate the design of your tactical DDD patterns, offering concrete help for those venturing into DDD for the first time!
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf91mobiles
91mobiles recently conducted a Smart TV Buyer Insights Survey in which we asked over 3,000 respondents about the TV they own, aspects they look at on a new TV, and their TV buying preferences.
UiPath Test Automation using UiPath Test Suite series, part 3DianaGray10
Welcome to UiPath Test Automation using UiPath Test Suite series part 3. In this session, we will cover desktop automation along with UI automation.
Topics covered:
UI automation Introduction,
UI automation Sample
Desktop automation flow
Pradeep Chinnala, Senior Consultant Automation Developer @WonderBotz and UiPath MVP
Deepak Rai, Automation Practice Lead, Boundaryless Group and UiPath MVP
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
Thesis Proposal Piwowar Presentation 20091109
1. Foundational studies for measuring the
impact, prevalence, and patterns
of publicly sharing
biomedical research data
Heather Piwowar
Department of Biomedical Informatics
University of Pittsburgh
4. Sharing research data
PAST MEDICAL HISTORY:
Past medical history showed she had
superficial phlebitis times two in the past, had
non-insulin dependent diabetes mellitus for
four years.
She had been hypothyroid for three years.
HISTORY OF PRESENT ILLNESS:
The patient is a 58-year-old female, …
http://upload.wikimedia.org/wikipedia/commons/7/76/PeptideMSMS.jpg; http://en.wikipedia.org/wiki/Image:Helices.png; http://en.wikipedia.org/wiki/
Image:Heatmap.png; http://en.wikipedia.org/wiki/Image:Microarray2.gif;
http://zellig.cpmc.columbia.edu/medlee/demo/; htp://www.plosone.org/article/fetchArticle.action?articleURI=info:doi/10.1371/journal.pone.0000441
5. Sharing research data
PAST MEDICAL HISTORY:
Past medical history showed she had
superficial phlebitis times two in the past, had
non-insulin dependent diabetes mellitus for
four years.
She had been hypothyroid for three years.
HISTORY OF PRESENT ILLNESS:
The patient is a 58-year-old female, …
http://upload.wikimedia.org/wikipedia/commons/7/76/PeptideMSMS.jpg; http://en.wikipedia.org/wiki/Image:Helices.png; http://en.wikipedia.org/wiki/
Image:Heatmap.png; http://en.wikipedia.org/wiki/Image:Microarray2.gif;
http://zellig.cpmc.columbia.edu/medlee/demo/; htp://www.plosone.org/article/fetchArticle.action?articleURI=info:doi/10.1371/journal.pone.0000441
6. Sharing research data
PAST MEDICAL HISTORY:
Past medical history showed she had
superficial phlebitis times two in the past, had
non-insulin dependent diabetes mellitus for
four years.
She had been hypothyroid for three years.
HISTORY OF PRESENT ILLNESS:
The patient is a 58-year-old female, …
http://upload.wikimedia.org/wikipedia/commons/7/76/PeptideMSMS.jpg; http://en.wikipedia.org/wiki/Image:Helices.png; http://en.wikipedia.org/wiki/
Image:Heatmap.png; http://en.wikipedia.org/wiki/Image:Microarray2.gif;
http://zellig.cpmc.columbia.edu/medlee/demo/; htp://www.plosone.org/article/fetchArticle.action?articleURI=info:doi/10.1371/journal.pone.0000441
7. Sharing research data
PAST MEDICAL HISTORY:
Past medical history showed she had
superficial phlebitis times two in the past, had
non-insulin dependent diabetes mellitus for
four years.
She had been hypothyroid for three years.
HISTORY OF PRESENT ILLNESS:
The patient is a 58-year-old female, …
http://upload.wikimedia.org/wikipedia/commons/7/76/PeptideMSMS.jpg; http://en.wikipedia.org/wiki/Image:Helices.png; http://en.wikipedia.org/wiki/
Image:Heatmap.png; http://en.wikipedia.org/wiki/Image:Microarray2.gif;
http://zellig.cpmc.columbia.edu/medlee/demo/; htp://www.plosone.org/article/fetchArticle.action?articleURI=info:doi/10.1371/journal.pone.0000441
10. But... costly for authors
Find
Organize
Document
Deidentify
Format
Decide
Ask
Submit
Answer questions
Worry about mistakes being found
Worry about data being misinterpreted
Worry about being scooped
Forgo money and IP and prestige???
12. ... on initiatives, requests,
requirements, and tools
Funder data sharing requirements
Journal requirements and requests
Databases
Data sharing collaboration grids
Standards
Editorials, letters to the editor, discussion....
26. Limitations of the related research
• manual audits: small sample sizes
• surveys: few variables + self-reporting bias
• not much focus on measuring demonstrated behavior
• not much focus on rewards
• not much focus on policy
• not much focus on biomedical data other than
DNA sequences
27. Needed:
a study of data sharing behaviour and impact
that includes
• a measurement of demonstrated behavior
• policy variables
• estimate of rewards
• a broad and deep selection of data creation instances
28. Aim 1: Does sharing have benefit for
those who share?
Aim 2: Can sharing and withholding be
systematically measured?
Aim 3: How often is data shared?
What predicts sharing?
How can we model sharing behavior?
29. Scope of proposed study
studies
Published studies with English full text available in
a centralized portal
variables for examination
extracted from Medline and other sources
30. Microarray data
http://en.wikipedia.org/wiki/DNA_microarray
http://en.wikipedia.org/wiki/Image:Heatmap.png
http://commons.wikimedia.org/wiki/
File:DNA_double_helix_vertikal.PNG
35. Aim 1: Does sharing have benefit
for those who share?
dataset
85 cancer microarray trials published in 1999-2003,
as identified by Ntzani and Ioannidis (2003)
citations
ISI Web of Science Citation index, citations from
2004-2005
data sharing locations
Publisher and lab websites, microarray databases,
WayBack Internet Archive, Oncomine
statistics
Multivariate linear regression
38. Aim 1: Does sharing have benefit
for those who share?
In multivariate regression, we found studies that had
made their data publicly available received 69% more
citations than similar studies that did not share their
data (95% confidence interval: 18% to 143%)
Piwowar, Day and Fridsma (2007) Sharing Detailed
Research Data Is Associated with Increased Citation
Rate. PLoS ONE 2(3): e308
42. Next: What factors predict sharing?
Can I use the same methods of Aim 1
to choose studies and determine data sharing status?
No, those methods donʼt scale to identify or classify
enough datapoints.
http://www.flickr.com/photos/ryanr/142455033/
44. Need automated methods to:
Identify studies that generate datasets that
could potentially be shared (Aim 2a)
Determine which of these have in fact been
shared (Aim 2b)
47. Aim 2a: Identify studies that create
gene expression microarray data
Instead, look for wetlab methods in full text:
http://www.pubmedcentral.nih.gov/articlerender.fcgi?artid=1522022&tool=pmcentrez
http://www.pubmedcentral.nih.gov/articlerender.fcgi?artid=1590031&tool=pmcentrez
http://www.pubmedcentral.nih.gov/articlerender.fcgi?artid=1482311&tool=pmcentrez#id331936
http://www.pubmedcentral.nih.gov/articlerender.fcgi?artid=2082469&tool=pmcentrez
http://www.pubmedcentral.nih.gov/articlerender.fcgi?artid=126870&tool=pmcentrez#id442745
49. Aim 2a: Identify studies that create
gene expression microarray data
query development
Use supervised natural language processing
techniques on a corpus of Open Access articles
query evaluation
400 studies that created gene expression microarray
data, as identified by Ochsner et al (2008)
goal
>90% precision, and sufficient recall to retrieve >1250
articles
55. Aim 2b: Identify studies that share
their expression microarray data
reference standard
200 the 400 studies that created gene expression
microarray data have shared their microarray data, as
identified by Ochsner et al (2008)
goal
Establish that filter has >70% recall with an unbiased
representation of MeSH terms, dataset size, and
dataset species
63. Aim 3a: Prevalence of data sharing
PubMed Created Shared
Portal
ID data? data?
234 PMC Yes Yes
345 HighPr Yes Yes
456 Scirus Yes Yes
567 PMC Yes NO
678 PMC Yes NO
Prevalence = Number with Shared data
Number with Created data
65. Aim 3b: Correlates with data sharing
Covariates
PubMed Created Shared
Portal
ID data? data?
234 PMC Yes Yes
345 HighPr Yes Yes
456 Scirus Yes Yes
567 PMC Yes NO
678 PMC Yes NO
66. Aim 3b: Correlates with data sharing
Features to include:
• Does the journal have a data sharing policy?
• Is the study funded by the NIH?
• Is it subject tot the NIH data sharing plan
requirement?
• Number of authors
• Journal impact factor
• Are the experimental samples from humans?
• Disease of study
• Year of publication
• …
67. Aim 3b: Correlates with data sharing
Covariates
PubMed Created Shared Journal NIH #
Portal ...
ID data? data? policy funds? authors
234 PMC Yes Yes strong yes 2
345 HighPr Yes Yes weak yes 5
456 Scirus Yes Yes weak no 6
567 PMC Yes NO strong yes 5
678 PMC Yes NO strong no 2
76. Limitations
• Association does not imply causation
• Important influences will be missed due to focus on
measurable variables
• Some derived variables involve many estimates and
assumptions
• Only considering public sharing in primary
centralized databases
• Only one datatype
• Only research studies made available in full-text
portals
77. Risks and contingency plans
NLP performance may be inadequate
supplement with manual annotating via Mechanical Turk
Author ambiguity may introduce extreme outliers
use Author-ity (Smalheiser and Torvik, 2005) for name
disambiguation
Unable to derive a robust exploratory factor model
try other clustering techniques
Several variables may be unexpectedly difficult to
extract and cross-references
if not essential, defer analysis of that variable
78. Current status
Aim 1: Does sharing have benefit for
those who share?
Aim 2: Can sharing and withholding be
systematically measured?
ete d.
Aim 3: How often is data shared? ction
om pl
lot c
What predicts sharing? set c olle
pi ata behavior?
ll d
How can we model sharing
fu
No w:
79. Anticipated contributions
• Published assessment of the observed and
measured rewards, prevalence, and patterns of
gene expression microarray dataset sharing
• Publicly available dataset associating microarray
study publications with data sharing status
• Generalizable approach for developing practical,
real-world information retrieval using
centralized full-text query portals
• Preliminary model of data sharing behaviour
based on this large dataset
80. Future work
• Identify and model data reuse
• Citation analysis of the large cohort
• Supplement with survey responses
http://www.flickr.com/photos/cogdog/123072/
81. Data sharing plan
I post my data, code, and statistical scripts at
http://www.dbmi.pitt.edu/piwowar
Share yours too!
http://www.flickr.com/photos/myklroventine/892446624/
82. Thanks to:
➡ the NLM for funding training grant 5 T15 LM007059-22
➡ the Dept of Biomedical Informatics at the U of Pittsburgh
➡ my committee
Dr Wendy Chapman Biomed Informatics
Dr Ellen Detlefsen iSchool
Dr Madhavi Ganapathiraju Bioinformatics
Dr Brian Butler Katz School of Business
Dr Gunther Eysenbach U of Toronto, Health Policy
Mgmt and Evaluation
83.
84. Funder Journal Investigator Institution Study
Is research data shared
after publication?
aim
85. Prevalence of data withholding
via surveys
self-reported denying a request in last 3 years
trainees self-reported denying a request
been denied access to data, materials, code
authors “not able to retrieve raw data”
not willing to release data
0% 10% 20% 30% 40%
Campbell et al. JAMA. 2002.
Kyzas et al. J Natl Cancer Inst. 2005.
Vogeli et al. Acad Med. 2006.
Reidpath et al. Bioethics 2001.
86. Self‐reported reasons for data
withholding
sharing is too much effort
want student or jr faculty to publish more
they themselves want to publish more
cost
industrial sponsor
confidentiality
commercial value of results
0% 20% 40% 60% 80%
Campbell et al. JAMA 2002.
87. Correlates with self‐reported data
withholding
industry involvement
perceived competitiveness of field
male
sharing discouraged in training
human participants
academic productivity
0 1 2 3
Blumenthal et al. Acad Med. 2006