Presentation by Pistoia Alliance reps Ian Harrow (Pfizer) and Nick Lynch (AstraZeneca) at the International Conference on Trends for Scientific Information Professionals, October 2010.
The Pistoia Alliance Information Ecosystem WorkshopPistoia Alliance
Michael Braxenthaler, president of the Pistoia Alliance, introduces the concept of the information ecosystem in life science research and discusses the role the Pistoia Alliance can play within this ecosystem. The workshop occurred in October 2011.
Learn about the Open Data Center Alliance Workgroups, Usage Models and Roadmap Structure from the perspective of the Alliance Technical Coordination Committee. This presentation was used in the Nov. 18, 2010 Alliance Webcast delivered by Howard Grodin, VP of Strategic Programs, Terrermark; Alliance Technical Coordination Committee Member, and Ravi Subranamiam, Intel Corporation; Alliance Technical Advisor.
For more information about the Open Data Center Alliance, visit www.opendatacenteralliance.org. You will also find the Webcast recording that accompanies this presentation there.
The Pistoia Alliance: Update on Strategy and ProgressPistoia Alliance
Ramesh Durvasula, Pistoia Alliance board member, discusses the Pistoia Alliance mission and recaps activities in 2011-12, with particular emphasis on the successful completion of the Sequence Squeeze Competition and Sequence Services Phase 2. The presentation was delivered at BioITWorld in Boston in April 2012.
The Pistoia Alliance Information Ecosystem WorkshopPistoia Alliance
Michael Braxenthaler, president of the Pistoia Alliance, introduces the concept of the information ecosystem in life science research and discusses the role the Pistoia Alliance can play within this ecosystem. The workshop occurred in October 2011.
Learn about the Open Data Center Alliance Workgroups, Usage Models and Roadmap Structure from the perspective of the Alliance Technical Coordination Committee. This presentation was used in the Nov. 18, 2010 Alliance Webcast delivered by Howard Grodin, VP of Strategic Programs, Terrermark; Alliance Technical Coordination Committee Member, and Ravi Subranamiam, Intel Corporation; Alliance Technical Advisor.
For more information about the Open Data Center Alliance, visit www.opendatacenteralliance.org. You will also find the Webcast recording that accompanies this presentation there.
The Pistoia Alliance: Update on Strategy and ProgressPistoia Alliance
Ramesh Durvasula, Pistoia Alliance board member, discusses the Pistoia Alliance mission and recaps activities in 2011-12, with particular emphasis on the successful completion of the Sequence Squeeze Competition and Sequence Services Phase 2. The presentation was delivered at BioITWorld in Boston in April 2012.
A breakout discussion led by David Klatte at the Pistoia Alliance Information Ecosystem Workshop proposed a number of potential projects. The workshop was held in October 2011.
Pistoia Alliance SESL pilot Bio IT World Hanover 12 Oct 2011Ian Harrow
Towards a brokering framework for knowledge-based services: learning from the Pistoia Alliance SESL pilot
Ian Harrow PhD for the Pistoia Alliance
This presentation describes a pilot project to determine the feasibility of biomedical knowledge brokering. It shows query across multiple disparate data sources through a brokering demonstrator built from RDF triple store technology. The learning from this pilot is contributing to larger scale projects such as the Innovative Medicines Initiative, OpenPFACTs.
Towards a brokering framework for knowledge-based services: Learning from the...Pistoia Alliance
Ian Harrow, co-leader of the Pistoia Alliance SESL pilot, describes the vision for the SESL pilot, the outcomes, and the project's future. The presentation at the 2011 BioITWorld Conference and Expo included a link to the SESL public demonstrator.
RDAP13 Mark Parsons: The Research Data Alliance: Making Data WorkASIS&T
Mark Parsons, Rensselaer Polytechnic Institute
Mark A. Parsons and Francine Berman: "The Research Data Alliance: Making Data Work"
Panel: Global scientific data infrastructure
Research Data Access & Preservation Summit 2013
Baltimore, MD April 4, 2013 #rdap13
Long way from ideas and needs to software measurement standards - Failures, s...Luigi Buglione
This presentation:
1. presents the history of last 20 years about standards on measurement, especially in Software Engineering;
2. Proposes the coverage level by measurable entities and its level (de facto; de jure);
3. Puts in evidence which could be the next ‘de facto’ standards in Software Measurement to move towards the ‘de jure’ status in the short-medium term
OAPEN started its activities on September 1, 2008 and has now completed its project phase co-funded by the European Commission. The final stage of the project focused on the launch of the OAPEN Library, usability, and especially sustainability after the project period. The results were presented during the final conference in Berlin in February 2011.
In the future OAPEN will continue as an independent foundation governed by representatives of the participating institutions. The objectives for the foundation are to stimulate further OA publishing of academic books, to further develop OAPEN as a platform for OA books and to develop a sustainable business model. In the meantime, OAPEN is conducting a number of experiments in Open Access book publishing, in the form of pilot projects. The first pilot is conducted in the Netherlands with support from the Netherlands Organization for Scientific Research (NWO) and the Ministry of Education. For the UK a similar pilot project is being prepared by JISC Collections.
Open Data Center Alliance
Intel Developer Forum 2011 lecture session with:
Anna Claiborne
ODCA WG Chair, ODCA & Product Manager Security Services, Terremark
Ravi Subramaniam
Lead Technical Facilitator, ODCA & Principal Engineer, Intel
Open Data Center Alliance (ODCA) Overview
Overview:
Why Should You Care? (How can you participate?)
1st Release Introduction
Usage Topics Discussion
Ecosystem Opportunities and Engagement
The Pistoia Alliance: Strategy, Progress, MomentumPistoia Alliance
Pistoia Alliance Board Member Ramesh Durvasula of BMS provides an overview of the Pistoia Alliance and project status at the BioITWorld Expo in Boston on April 13, 2011.
Fairification experience clarifying the semantics of data matricesPistoia Alliance
This webinar presents the Statistics Ontology, STATO which is a semantic framework to support the creation of standardized analysis reports to help with review of results in the form of data matrices. STATO includes a hierarchy of classes and a vocabulary for annotating statistical methods used in life, natural and biomedical sciences investigations, text mining and statistical analyses.
Innovation applications of microphysiological systems (MPS) have been growing over the past decade, especially with respect to the use of complex human tissues for assessing safety of drug candidates – but broad industry adoption of MPS methods has not yet become a reality.
This webinar addresses some recent advances in MPS development and begins to explore the barriers to increased incorporation of MPS to improve drug safety assessment and to provide safer, more effective drugs into the clinical pipeline.
More Related Content
Similar to Emerging Life Sciences Collaboration on Common Service Specification
A breakout discussion led by David Klatte at the Pistoia Alliance Information Ecosystem Workshop proposed a number of potential projects. The workshop was held in October 2011.
Pistoia Alliance SESL pilot Bio IT World Hanover 12 Oct 2011Ian Harrow
Towards a brokering framework for knowledge-based services: learning from the Pistoia Alliance SESL pilot
Ian Harrow PhD for the Pistoia Alliance
This presentation describes a pilot project to determine the feasibility of biomedical knowledge brokering. It shows query across multiple disparate data sources through a brokering demonstrator built from RDF triple store technology. The learning from this pilot is contributing to larger scale projects such as the Innovative Medicines Initiative, OpenPFACTs.
Towards a brokering framework for knowledge-based services: Learning from the...Pistoia Alliance
Ian Harrow, co-leader of the Pistoia Alliance SESL pilot, describes the vision for the SESL pilot, the outcomes, and the project's future. The presentation at the 2011 BioITWorld Conference and Expo included a link to the SESL public demonstrator.
RDAP13 Mark Parsons: The Research Data Alliance: Making Data WorkASIS&T
Mark Parsons, Rensselaer Polytechnic Institute
Mark A. Parsons and Francine Berman: "The Research Data Alliance: Making Data Work"
Panel: Global scientific data infrastructure
Research Data Access & Preservation Summit 2013
Baltimore, MD April 4, 2013 #rdap13
Long way from ideas and needs to software measurement standards - Failures, s...Luigi Buglione
This presentation:
1. presents the history of last 20 years about standards on measurement, especially in Software Engineering;
2. Proposes the coverage level by measurable entities and its level (de facto; de jure);
3. Puts in evidence which could be the next ‘de facto’ standards in Software Measurement to move towards the ‘de jure’ status in the short-medium term
OAPEN started its activities on September 1, 2008 and has now completed its project phase co-funded by the European Commission. The final stage of the project focused on the launch of the OAPEN Library, usability, and especially sustainability after the project period. The results were presented during the final conference in Berlin in February 2011.
In the future OAPEN will continue as an independent foundation governed by representatives of the participating institutions. The objectives for the foundation are to stimulate further OA publishing of academic books, to further develop OAPEN as a platform for OA books and to develop a sustainable business model. In the meantime, OAPEN is conducting a number of experiments in Open Access book publishing, in the form of pilot projects. The first pilot is conducted in the Netherlands with support from the Netherlands Organization for Scientific Research (NWO) and the Ministry of Education. For the UK a similar pilot project is being prepared by JISC Collections.
Open Data Center Alliance
Intel Developer Forum 2011 lecture session with:
Anna Claiborne
ODCA WG Chair, ODCA & Product Manager Security Services, Terremark
Ravi Subramaniam
Lead Technical Facilitator, ODCA & Principal Engineer, Intel
Open Data Center Alliance (ODCA) Overview
Overview:
Why Should You Care? (How can you participate?)
1st Release Introduction
Usage Topics Discussion
Ecosystem Opportunities and Engagement
The Pistoia Alliance: Strategy, Progress, MomentumPistoia Alliance
Pistoia Alliance Board Member Ramesh Durvasula of BMS provides an overview of the Pistoia Alliance and project status at the BioITWorld Expo in Boston on April 13, 2011.
Similar to Emerging Life Sciences Collaboration on Common Service Specification (20)
Fairification experience clarifying the semantics of data matricesPistoia Alliance
This webinar presents the Statistics Ontology, STATO which is a semantic framework to support the creation of standardized analysis reports to help with review of results in the form of data matrices. STATO includes a hierarchy of classes and a vocabulary for annotating statistical methods used in life, natural and biomedical sciences investigations, text mining and statistical analyses.
Innovation applications of microphysiological systems (MPS) have been growing over the past decade, especially with respect to the use of complex human tissues for assessing safety of drug candidates – but broad industry adoption of MPS methods has not yet become a reality.
This webinar addresses some recent advances in MPS development and begins to explore the barriers to increased incorporation of MPS to improve drug safety assessment and to provide safer, more effective drugs into the clinical pipeline.
Federated Learning (FL) is a learning paradigm that enables collaborative learning without centralizing datasets. In this webinar, NVIDIA present the concept of FL and discuss how it can help overcome some of the barriers seen in the development of AI-based solutions for pharma, genomics and healthcare. Following the presentation, the panel debate on other elements that could drive the adoption of digital approaches more widely and help answer currently intractable science and business questions.
It seems that AI is also becoming a buzzword, like design thinking. Everyone is talking about AI or wants to have AI, and sees all the ideas and benefits – that’s fine, but how do you get started? But what’s different now? Three innovations have finally put AI on the fast track: Big Data, with the internet and sensors everywhere; massive computing power, especially through the Cloud; and the development of breakthrough algorithms, so computers can be trained to accomplish more sophisticated tasks on their own with deep learning. If you use new technology, you need to explore and know what’s possible. With design thinking, it aids to outline the steps and define the ways in which you’re going to create the solution. Starting with mapping the customer journey, defining who will be using that service enhanced with intelligent technology, or who will benefit and gain value from it. We discuss how these two worlds are coming together, and how you get started to transform your venture with Artificial Intelligence using Design Thinking.
Speaker: Claudio Mirti, Principal Solution Specialist – Data & AI, Microsoft
Themes and objectives:
To position FAIR as a key enabler to automate and accelerate R&D process workflows
FAIR Implementation within the context of a use case
Grounded in precise outcomes (e.g. faster and bigger science / more reuse of data to enhance value / increased ability to share data for collaboration and partnership)
To make data actionable through FAIR interoperability
Speakers:
Mathew Woodwark,Head of Data Infrastructure and Tools, Data Science & AI, AstraZeneca
Erik Schultes, International Science Coordinator, GO-FAIR
Georges Heiter, Founder & CEO, Databiology
Knowledge graphs ilaria maresi the hyve 23apr2020Pistoia Alliance
Data for drug discovery and healthcare is often trapped in silos which hampers effective interpretation and reuse. To remedy this, such data needs to be linked both internally and to external sources to make a FAIR data landscape which can power semantic models and knowledge graphs.
2020.04.07 automated molecular design and the bradshaw platform webinarPistoia Alliance
This presentation described how data-driven chemoinformatics methods may automate much of what has historically been done by a medicinal chemist. It explored what is reasonable to expect “AI” approaches might achieve, and what is best left with a human expert. The implications of automation for the human-machine interface were explored and illustrated with examples from Bradshaw, GSK’s experimental automated design environment.
This presentation reviewed the challenges in identifying, acquiring and utilizing research data in relation to an evolving data market. Strategic solutions were examined in which the FAIR principles play a key role in the future of data management.
Dr. Dennis Wang discusses possible ways to enable ML methods to be more powerful for discovery and to reduce ambiguity within translational medicine, allowing data-informed decision-making to deliver the next generation of diagnostics and therapeutics to patients quicker, at lowered costs, and at scale.
The talk by Dr. Dennis Wang was followed by a panel discussion with Mr. Albert Wang, M. Eng., Head, IT Business Partner, Translational Research & Technologies, Bristol-Myers Squibb.
With the explosion of interest in both enhanced knowledge management and open science, the past few years have seen considerable discussion about making scientific data “FAIR” — findable, accessible, interoperable, and reusable. The problem is that most scientific datasets are not FAIR. When left to their own devices, scientists do an absolutely terrible job creating the metadata that describe the experimental datasets that make their way in online repositories. The lack of standardization makes it extremely difficult for other investigators to locate relevant datasets, to re-analyse them, and to integrate those datasets with other data. The Center for Expanded Data Annotation and Retrieval (CEDAR) has the goal of enhancing the authoring of experimental metadata to make online datasets more useful to the scientific community. The CEDAR work bench for metadata management will be presented in this webinar. CEDAR illustrates the importance of semantic technology to driving open science. It also demonstrates a means for simplifying access to scientific data sets and enhancing the reuse of the data to drive new discoveries.
Open interoperability standards, tools and services at EMBL-EBIPistoia Alliance
In this webinar Dr Henriette Harmse from EMBL-EBI presents how they are using their ontology services at EMBL-EBI to scale up the annotation of data and deliver added value through ontologies and semantics to their users.
Fair webinar, Ted slater: progress towards commercial fair data products and ...Pistoia Alliance
Elsevier is a global information analytics business that helps institutions and professional’s
advance healthcare and open science to improve performance for the benefit of humanity.
In this webinar, we discuss how Elsevier is increasingly leveraging the FAIR Guiding Principles to improve its products and services to better serve the scientific community.
Application of recently developed FAIR metrics to the ELIXIR Core Data ResourcesPistoia Alliance
The FAIR (Findable, Accessible, Interoperable and Reusable) principles aim to maximize the discovery and reuse of digital resources. Using recently developed software and metrics to assess FAIRness and supported through an ELIXIR Implementation Study, Michel worked with a subset of ELIXIR Core Data Resources to apply these technologies. In this webinar, he will discuss their approach, findings, and lessons learned towards the understanding and promotion of the FAIR principles.
Implementing Blockchain applications in healthcarePistoia Alliance
Blockchain technology can revolutionise the way information is exchanged between parties by bringing an unprecedented level of security and trust to these transactions. The technology is finding its way into multiple use cases but we are yet to see full adoption and real-world business implementation in the Healthcare industry.
In this webinar we will explore the main challenges and considerations for the implementation of Blockchain technology in Healthcare use cases. This is the third webinar in our Blockchain Education series.
Building trust and accountability - the role User Experience design can play ...Pistoia Alliance
In this webinar our panel of UX specialists give a brief introduction to User Experience before presenting the design opportunities UX can bring to AI. We all know that AI has great potential but has some significant hurdles to overcome not least so the human aspect of trust and ethical considerations when designing in the life sciences.
In the late Fall and Winter of 2018, the Pistoia Alliance in cooperation with Elsevier and charitable organizations Cures within Reach and Mission: Cure ran a datathon aiming to find drugs suitable for treatment of childhood chronic pancreatitis, a rare disease that causes extreme suffering. The datathon resulted in identification of four candidate compounds in a short time frame of just under three months. In this webinar our speakers discuss the technologies that made this leap possible
PA webinar on benefits & costs of FAIR implementation in life sciences Pistoia Alliance
The slides from the Pistoia Alliance Debates Webinar where a panel of experts from technology support providers and the biopharma industry, who have been invited to share their views on the "Benefits and costs of FAIR Implementation for life science industry".
Creating novel drugs is an extraordinarily hard and complex problem.
One of the many challenges in drug design is the sheer size of the search space for novel chemical compounds. Scientists need to find molecules that are active toward a biological target or pathway and at the same time have acceptable ADMET properties.
There is now considerable research going on using various AI and ML approaches to tackle these challenges.
Our distinguished speakers, Drs. Alex Tropsha and Ola Engkvist, will discuss their recent work in Drug Design involving Deep Reinforcement Learning and Neural Networks, and will answer questions from the audience on the current state of the research in the field.
Speakers:
Prof Alex Tropsha, Professor at University of North Carolina at Chapel Hill, USA
Dr. Ola Engkvist, Associate Director at AstraZeneca R&D, Gothenburg, Sweden
Removing Uninteresting Bytes in Software FuzzingAftab Hussain
Imagine a world where software fuzzing, the process of mutating bytes in test seeds to uncover hidden and erroneous program behaviors, becomes faster and more effective. A lot depends on the initial seeds, which can significantly dictate the trajectory of a fuzzing campaign, particularly in terms of how long it takes to uncover interesting behaviour in your code. We introduce DIAR, a technique designed to speedup fuzzing campaigns by pinpointing and eliminating those uninteresting bytes in the seeds. Picture this: instead of wasting valuable resources on meaningless mutations in large, bloated seeds, DIAR removes the unnecessary bytes, streamlining the entire process.
In this work, we equipped AFL, a popular fuzzer, with DIAR and examined two critical Linux libraries -- Libxml's xmllint, a tool for parsing xml documents, and Binutil's readelf, an essential debugging and security analysis command-line tool used to display detailed information about ELF (Executable and Linkable Format). Our preliminary results show that AFL+DIAR does not only discover new paths more quickly but also achieves higher coverage overall. This work thus showcases how starting with lean and optimized seeds can lead to faster, more comprehensive fuzzing campaigns -- and DIAR helps you find such seeds.
- These are slides of the talk given at IEEE International Conference on Software Testing Verification and Validation Workshop, ICSTW 2022.
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfPaige Cruz
Monitoring and observability aren’t traditionally found in software curriculums and many of us cobble this knowledge together from whatever vendor or ecosystem we were first introduced to and whatever is a part of your current company’s observability stack.
While the dev and ops silo continues to crumble….many organizations still relegate monitoring & observability as the purview of ops, infra and SRE teams. This is a mistake - achieving a highly observable system requires collaboration up and down the stack.
I, a former op, would like to extend an invitation to all application developers to join the observability party will share these foundational concepts to build on:
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...SOFTTECHHUB
The choice of an operating system plays a pivotal role in shaping our computing experience. For decades, Microsoft's Windows has dominated the market, offering a familiar and widely adopted platform for personal and professional use. However, as technological advancements continue to push the boundaries of innovation, alternative operating systems have emerged, challenging the status quo and offering users a fresh perspective on computing.
One such alternative that has garnered significant attention and acclaim is Nitrux Linux 3.5.0, a sleek, powerful, and user-friendly Linux distribution that promises to redefine the way we interact with our devices. With its focus on performance, security, and customization, Nitrux Linux presents a compelling case for those seeking to break free from the constraints of proprietary software and embrace the freedom and flexibility of open-source computing.
Transcript: Selling digital books in 2024: Insights from industry leaders - T...BookNet Canada
The publishing industry has been selling digital audiobooks and ebooks for over a decade and has found its groove. What’s changed? What has stayed the same? Where do we go from here? Join a group of leading sales peers from across the industry for a conversation about the lessons learned since the popularization of digital books, best practices, digital book supply chain management, and more.
Link to video recording: https://bnctechforum.ca/sessions/selling-digital-books-in-2024-insights-from-industry-leaders/
Presented by BookNet Canada on May 28, 2024, with support from the Department of Canadian Heritage.
Elevating Tactical DDD Patterns Through Object CalisthenicsDorra BARTAGUIZ
After immersing yourself in the blue book and its red counterpart, attending DDD-focused conferences, and applying tactical patterns, you're left with a crucial question: How do I ensure my design is effective? Tactical patterns within Domain-Driven Design (DDD) serve as guiding principles for creating clear and manageable domain models. However, achieving success with these patterns requires additional guidance. Interestingly, we've observed that a set of constraints initially designed for training purposes remarkably aligns with effective pattern implementation, offering a more ‘mechanical’ approach. Let's explore together how Object Calisthenics can elevate the design of your tactical DDD patterns, offering concrete help for those venturing into DDD for the first time!
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024Neo4j
Neha Bajwa, Vice President of Product Marketing, Neo4j
Join us as we explore breakthrough innovations enabled by interconnected data and AI. Discover firsthand how organizations use relationships in data to uncover contextual insights and solve our most pressing challenges – from optimizing supply chains, detecting fraud, and improving customer experiences to accelerating drug discoveries.
DevOps and Testing slides at DASA ConnectKari Kakkonen
My and Rik Marselis slides at 30.5.2024 DASA Connect conference. We discuss about what is testing, then what is agile testing and finally what is Testing in DevOps. Finally we had lovely workshop with the participants trying to find out different ways to think about quality and testing in different parts of the DevOps infinity loop.
The Art of the Pitch: WordPress Relationships and SalesLaura Byrne
Clients don’t know what they don’t know. What web solutions are right for them? How does WordPress come into the picture? How do you make sure you understand scope and timeline? What do you do if sometime changes?
All these questions and more will be explored as we talk about matching clients’ needs with what your agency offers without pulling teeth or pulling your hair out. Practical tips, and strategies for successful relationship building that leads to closing the deal.
Pushing the limits of ePRTC: 100ns holdover for 100 daysAdtran
At WSTS 2024, Alon Stern explored the topic of parametric holdover and explained how recent research findings can be implemented in real-world PNT networks to achieve 100 nanoseconds of accuracy for up to 100 days.
Sudheer Mechineni, Head of Application Frameworks, Standard Chartered Bank
Discover how Standard Chartered Bank harnessed the power of Neo4j to transform complex data access challenges into a dynamic, scalable graph database solution. This keynote will cover their journey from initial adoption to deploying a fully automated, enterprise-grade causal cluster, highlighting key strategies for modelling organisational changes and ensuring robust disaster recovery. Learn how these innovations have not only enhanced Standard Chartered Bank’s data infrastructure but also positioned them as pioneers in the banking sector’s adoption of graph technology.
PHP Frameworks: I want to break free (IPC Berlin 2024)Ralf Eggert
In this presentation, we examine the challenges and limitations of relying too heavily on PHP frameworks in web development. We discuss the history of PHP and its frameworks to understand how this dependence has evolved. The focus will be on providing concrete tips and strategies to reduce reliance on these frameworks, based on real-world examples and practical considerations. The goal is to equip developers with the skills and knowledge to create more flexible and future-proof web applications. We'll explore the importance of maintaining autonomy in a rapidly changing tech landscape and how to make informed decisions in PHP development.
This talk is aimed at encouraging a more independent approach to using PHP frameworks, moving towards a more flexible and future-proof approach to PHP development.
Epistemic Interaction - tuning interfaces to provide information for AI supportAlan Dix
Paper presented at SYNERGY workshop at AVI 2024, Genoa, Italy. 3rd June 2024
https://alandix.com/academic/papers/synergy2024-epistemic/
As machine learning integrates deeper into human-computer interactions, the concept of epistemic interaction emerges, aiming to refine these interactions to enhance system adaptability. This approach encourages minor, intentional adjustments in user behaviour to enrich the data available for system learning. This paper introduces epistemic interaction within the context of human-system communication, illustrating how deliberate interaction design can improve system understanding and adaptation. Through concrete examples, we demonstrate the potential of epistemic interaction to significantly advance human-computer interaction by leveraging intuitive human communication strategies to inform system design and functionality, offering a novel pathway for enriching user-system engagements.
UiPath Test Automation using UiPath Test Suite series, part 5DianaGray10
Welcome to UiPath Test Automation using UiPath Test Suite series part 5. In this session, we will cover CI/CD with devops.
Topics covered:
CI/CD with in UiPath
End-to-end overview of CI/CD pipeline with Azure devops
Speaker:
Lyndsey Byblow, Test Suite Sales Engineer @ UiPath, Inc.
GraphRAG is All You need? LLM & Knowledge GraphGuy Korland
Guy Korland, CEO and Co-founder of FalkorDB, will review two articles on the integration of language models with knowledge graphs.
1. Unifying Large Language Models and Knowledge Graphs: A Roadmap.
https://arxiv.org/abs/2306.08302
2. Microsoft Research's GraphRAG paper and a review paper on various uses of knowledge graphs:
https://www.microsoft.com/en-us/research/blog/graphrag-unlocking-llm-discovery-on-narrative-private-data/
20240605 QFM017 Machine Intelligence Reading List May 2024
Emerging Life Sciences Collaboration on Common Service Specification
1. Pistoia Alliance
An Emerging Vehicle forSciences
Emerging Life Collaboration:
Collaboration on
The Pistoia Alliance
Common Service Specification
Ian Harrow (Pfizer) and Nick Lynch (Astra Zeneca) for the Pistoia Alliance
ICIC 2010 http://pistoiaalliance.org
26th Oct 2010
2. Presentation Outline
• Pistoia Organisation
• Four projects:-
– Biomedical Knowledge Brokering SESL pilot
• More depth on this
– Vocabulary Standards Initiative
• An emerging project
– Sequence services
– Electronic Lab Notebook
• Summary
• Acknowledgements
3. Pistoia Background and History
2007 2008 2009 2010 Now
Informal Met in Create Pistoia as Not Official 7 / 10 top Pharma as members
meeting Pistoia for profit company Launch 38 members
Stanhope Gate Domains Established
Pistoia
Lhasa Curzon
Informal Collaborations Collaboration/project meeting
Pistoia Description History
The primary purpose of the Pistoia Alliance is to Initial Meeting with GSK, AZ,
streamline non-competitive elements of the life Pfizer and Novartis – outlined
similar challenges and
science workflow by the specification of common frustrations in the Informatics
standards, business terms, relationships and sector of Discovery
processes
The advent of Web Services and Web 2.0 allow for
Pistoia Goals decoupling of proprietary data from technology
• to allow this framework to encompass/support
Publicly available structural and biological DBs allow
most pre-competitive work between the for a non-IP related analysis and as a scientific test
organisations suite.
• to support life science workflow prior to
Sponsorship from R&D IS heads within Life Science
submission industry
• to work with other Standards organisations
4. Pistoia Domains
Pistoia Domains group areas of interest, scope and deliver projects
Pistoia Domain – high level collection
Pistoia Groups – as of Working Groups with common themes
External
Groups
defined in byelaws Domain Allows governance across outside of
Steering a domain using Working
Pistoia
Board of Groups Group chairs and
Technical Committee reps
Directors Could:
• Join Pistoia
Working The main project delivery • Influence Pistoia
Working mechanism in Pistoia. All
Officers Groups members
Groups standards will be • Influence through
(Operational delivered by WGs other standards
Team) groups and activities
Provide expertise for WGs • Collaborate on
and running Pistoia standards’ feasibility
Technical Pistoia Define: studies
•Requirements • Collaborate through
Committee Members •Technical Standards non-Pistoia
•Service Standards Standards initiatives
6. Pistoia Domains
Pistoia Domains focus on business workflows /supply chains
Enabling Knowledge and Information Services
VSI SESL
Vocabulary
Visualisation
Application Integration
Workflow
Others Biology Chemistry Translational
Data Data Data
Services Services Services
Sequencing ELN
7. The Pistoia
SESL Project
An Emerging Vehicle for SESL Pilot
Pistoia Alliance Collaboration:
TheBiomedical Brokering Service
for a Pistoia Alliance
Ian Harrow, Wendy Filsell, Dietrich Rebholz Schuhmann
http://pistoiaalliance.org
8. SESL: Biomedical Knowledge Brokering
• Challenge:
– No single system for retrieving gene to disease relationships contained in
both published & biological database content
– Need a „push model‟ for biomedical knowledge access: the current model
requires the consumer to search 1000‟s of content sources
• Opportunity: Pilot Project with key stakeholders
– Pilot a „push model‟ for biomedical knowledge brokering
– Engage multiple consumers, content providers and a single, public group to
develop the necessary infrastructure to explore the standards required for
the model to work in production
• History:
– May 2008: Common Disease Knowledge Environment (CDKE) IMI call drafted
– Sep 2008: postponed call publication
– Jan 2009: x-pharma meeting in London on how to progress CDKE
– Apr 2009: CDKE presented at SESL workshop
– Oct 2009: SESL Pilot meeting (funders)
– Jan 2010: Pilot launch
9. The Knowledge Service Framework
Multiple
Consumers
‘Consumer’
Disease Dossier Knowledge
Firewall Applications
Service Layer Std Public Common
Open Assertion & Meta Data Mgmt Vocabularies Service
Stds Transform / Translate Business Broker
Integrator Rules
Supplier
Firewall Content
Suppliers
Db 2
Effort required
Db 4 to fit DBs to
Corpus 1 service layer
Db 3 Corpus 5
9
10. A Production Service vision...
Consumer
Side Exemplar
Disease Dossier Application
License
Service Layer Std Public Service Layer Std Public Service Layer Std Public Service Layer Std Public
Vocabularies Vocabularies Vocabularies Vocabularies
Assertion & Meta Data Mgmt Assertion & Meta Data Mgmt Assertion & Meta Data Mgmt Assertion & Meta Data Mgmt
Transform / Translate Business Transform / Translate Business Transform / Translate Business Transform / Translate Business
Rules Rules Rules Rules
Integrator Integrator Integrator Integrator
Broker Org #1 Broker Org #2 Broker Org #3 Internal Broker
License
Corpus 1 Db 3 Corpus 5 Db 7 Corpus 9 Db 11 Corpus 13 Db 15
Corpus 4 Corpus 8 Corpus 12 Corpus 16
Db 2 Db 6 Db 10 Db 14
Supplier
Side
11. The Pilot
• Deliverables:
– Publication of standards & recommendations for service implementation
– Pilot implementation of service for a single disease (assertions from pre-defined
document sets & databases)
– Establish ways of working pre-competitively across industry/vendor/academia
– Dialogue and assessment of cost / value, with key content suppliers in moving to
such a push model for content (viability of moving to production)
• Status:
– AZ, Pfizer, GSK, Roche, Unilever, EBI, NPG, OUP, Elsevier & RSC
– 12 month project, £200K direct funding (+ PM & Architecture support)
– Contract between Pistoia & EBI signed 20th January 2010 for 1 year
• Scope:
– Development of an assertion database in combination with a user interface and
associated web services for one disease/indication/phenotype of broad interest:
Type II Diabetes
– Assertional content derived from 3 structured data sources and limited Journal
content (co-occurrence & statistical derivation from full text)
– Assertional evidence for filtering and drill down to primary data.
– Limited vocabulary development for area of focus: Type II Diabetes
12. Minimal configuration to test a
Brokering Service
Interface
User Interface Layer
at consumer org‟n
Condition:
Service Layer
Assertion & Meta Data Mgmt
Std Public
Vocabularies
Service Layer
Assertion & Meta Data Mgmt
Std Public
Vocabularies Brokering service
Identical structure.
Different content
Transform / Translate Query Transform / Translate Query Layer
templates templates
which can overlap. Triple store 1 Triple store 2
at EBI
Broker #1 Broker #2
Primary source
Elsevier RSC Layer
corpus corpus NPG
corpus
OUP
corpus
at provider org‟n
EBI Swissprot NCBI OMIM EBI Array EBI Swissprot
database Express database
database
13. SESL user interface mock-up
Gene R‟ship Disease Species Evidence
Gene: abc
1 abc1 Co-occurs Diabetes Mus Paper UID:1234
2 Relationship:
Up-Reg Any Diabetes Homo ArrayExpress: XXX
abc1
3 abc2 Disease: Diabetes
Co-occurs Diabetes Homo Paper UID:1344
4 abc13 Co-occurs Diabetes
Constraint: Species: Any Mus Paper UID:1314
5 abc7 MutationTissue: Any
Diabetes Rattus OMIMI: XXX
6 abc1 Co-occurs Diabetes Mus Paper UID:45643
7 abc1 Co-occurs Diabetes Homo Paper UID:2143
8 abc1 Co-occurs Diabetes Mus Paper UID:1204
14. Timelines: Development Phase
Task/Deliverable Phase Type Jan-10 Feb-10 Mar-10 Apr-10 May-10 Jun-10 Jul-10 Aug-10 Sep-10 Oct-10 Nov-10 Dec-10 Jan-11 Feb-11
Month 0 Month 1 Month 2 Month 3 Month 4 Month 5 Month 6 Month 7 Month 8 Month 9 Month 10 Month 11 Month 12
Finalised Technical Specification Deliverable
^
document (Month 4) 1
Build vocabularies within scope Development Task 2
RDF data export from UniProt Development Task 3
and Ensembl
RDF data export of Array Express Development Task 4
Extract literature assertions for Development Task 6
T2DB from publishers’ content
Develop RDF triple store schema Development Task 7
and demonstrator
Develop query definitions Development Task 8
Establish API services for remote Development Task 9
access
Develop simple user interface for Development Task 10
demonstrator (based on mock-
up)
Write documentation that Development Task 11
defines the standard framework
Access to early prototype Deliverable
demonstrator and report
(Month 7 & 8)
2&3 ^^
Final prototype demonstrator, Deliverable
recommendations post-pilot, 4&5
^ ^^
report (Month 11 & 12) and
public launch
15. Timelines:
Testing and Communication Phase
Task/Deliverable Phase Type Jan-10 Feb-10 Mar-10 Apr-10 May-10 Jun-10 Jul-10 Aug-10 Sep-10 Oct-10 Nov-10 Dec-10 Jan-11 Feb-11
Month 0 Month 1 Month 2 Month 3 Month 4 Month 5 Month 6 Month 7 Month 8 Month 9 Month 10 Month 11 Month 12
Tests of the demonstrator (full Testing and Task 12
private and limited public
instance)
communication
Deploy publc demonstrator Testing and Task 13
communication
Write publication for standard Testing and Task 14
definition communication
Develop recommendations for Testing and Task 15
post-pilot project communication
Final prototype demonstrator, Deliverable
recommendations post-pilot 4&5 ^ ^
and report (Month 11 & 12)
Public release of limited Deliverable
demonstrator (Month 13) 6 ^
16. Summary for SESL pilot
• Significant progress to towards realising
the technical goal of knowledge brokering
– Can a push model work? A hyperstandard?
• A unique consortium from three cultures:
industry, publishers and academia
– Working together – sharing costs and risks
• Business opportunities and concerns
– For data providers and consumers?
• Phase 2 planning is underway for 2011
17. The Pistoia VSI
Project
An Emerging Vehicle for Collaboration:
Vocabulary Standards Initiative
The Pistoia Alliance
Project Leads: Lee Harland and Christopher Larminie
http://pistoiaalliance.org
18. Standardizing Drug Target Types
• Representation of a molecular drug target in structured databases is ad-hoc
– Single protein-targets are “OK” (being linked via Entrez gene, but this is not an agreed
standard)
– Multi-protein targets, complexes, biologicals and many more are poorly described, often
simply raw text
• This project will focus on industry & suppliers to describe a specification for
reporting drug targets within structured content
– Minimal cost, just FTE time required
– This could feed into the IMI Open Pharmacology (OPS) call as an industry-publisher
requirement
– Output would be a specific set of “rules” regarding the representation of complex
molecular targets
– Aim would not be to define a list of all known targets, this would be out of scope. As will
any text-mining efforts.
– Recommendation to suppliers and industry to adopt specification along with industry-
generated mappings for pre-existing targets
– Deliverable – specification & publication
• Could be a start to a future, wider pharmacological data standard project
– All databases providing pharmacological activity content delivered in a standard way
– Could gain a quick-start building on MIABE standard
19. The Pistoia
Sequence
Services Project
An Emerging Vehicle for Collaboration:
The Pistoia Alliance
Project Lead : Simon Thornber
http://pistoiaalliance.org
20. Sequence services Project
Description
As a drive to cuts costs, encourage standards, and provide
simplification it is proposed that Pistoia commission a set of secure
internet hosted sequence services.
Benefits
These services will ultimately provide access to public, private &
commercial data & tools, that will enable scientists to search, store &
analyse all their sequence based data in a single web interface.
21. Current Status for sequence services
• Defined the Project Vision
• Split Vision into achievable phases of delivery
• Defined Phase 1 use cases
• Focus on Non-Functional use cases e.g. security
• Scoring criteria in final stages of drafting
• 5 Vendor presentations during May / June 2010
– Cognizant +Eagle Genomics, ThomsonReuters,
Genome Quest, & Constellation Technologies +
Microsoft + AWF and the STFC.
23. The Pistoia
ELN Project
An Emerging Vehicle for Collaboration:
The Pistoia Alliance
Project Lead : Richard Bolton
http://pistoiaalliance.org
24. ELN Project Description and Benefits
Description
To deliver a query service standard applicable for use with data types
commonly found in electronic lab notebooks (ELN‟s). The initial
scope will be against chemistry related ELN‟s but the solution should
aim to be general enough that it can be applied to other scientific
notebook applications.
Benefits
Searching of data stored in ELN‟s from different vendors. Lowering
the costs of using ELN data with partners and CRO‟s.
25. Current Status for ELN
• Active Participation at biweekly meetings from
GSK/AZ/Pfizer/BMS/Symyx/Edge/Accelrys
• Agreed 3 delivery phases
• Phase 1 Definition of problem space and creation of users stories.
– Complete. User Story Document „published‟
• Phase 2 Creation of ELN Query services definition.
– End to end process run through by team to create a full model
for two of the user stories.
– GGA chosen to complete work. Funding agreed and approved
by operations team. Work started but contract not yet in
place.
• Phase 3 Creation of POC in partnership with Vendor.
– Not yet started. Will likely require vendor partnership, budget
and technology decision.
27. Summary for Pistoia projects
• SESL Biomedical Knowledge Brokering
– Phase 1 pilot to complete by end 2010
– Phase 2 is planned
• Vocabulary Standards Initiative
– An emerging project on Drug Targets
• Sequence services
– Phase 1 nearing completion and Phase 2 planned
• Electronic Lab Notebook
– Phase 1 is complete and Phase 2 is underway
28. Acknowledgements
SESL ELN Sequencing
Dietrich Rebholz Schumann, EBI Richard Bolton, GSK Simon Thornber, GSK
Silvestras Kavaliauskas, EBI David Drake, AZ Cary O‟Donnell, AZ
Christoph Grabmuellerm EBI Steve Trudel, Pfizer Quan Yang, Novartis
Dominic Clark, EBI John Duncan, Pfizer Monica Arenz, Novartis
Mike Westaway, AZ Uwe Geissler, Novartis
Ian Dix, AZ
Carol McNab, BMS Steering Group:-
Wendy Filsell, Unilever
Ashley George, GSK
Ian Stott, Unilever
Peter Woollard, GSK Vendor reps from:- Tom Flores, GSK
Nigel Wilkinson, Pfizer Symyx Martyn Wilkins
Catherine Marshall, Pfizer Edge Patrick Warren
Michael Braxenthaler, Roche Accelrys
Jabe Wilson, Elsevier VSI
Richard O‟Bierne, Oxford UP
Richard Kidd, RSC Lee Harland, Christopher Larminie,
Alf Eaton, Nature PG Ian Dix, Wendy Filsell, OBO PRO