This document discusses how ontologies can help manage knowledge in drug discovery. It notes that while companies invest heavily in data generation, they struggle to effectively harness this data. Ontologies provide a way to semantically represent knowledge from diverse sources and distribute it for applications. This allows knowledge to be more effectively applied to important tasks like decision making, data analysis, and text mining of literature. The document compares different ontology standards and notes their suitability for knowledge management versus computational reasoning. It concludes that ontologies can support applications in drug discovery by providing integrated access to knowledge from multiple sources.
This document discusses how the increased generation of data in pharmaceutical R&D has failed to improve productivity due to an inability to properly organize and apply knowledge. It describes how existing integration strategies store data in isolated "silos" that are difficult to access and compare. The author argues that new semantic technologies based on ontologies can better integrate knowledge across discovery, development, and other business areas to improve transparency, safety monitoring, and decision making.
Lean Startup Machine (Washington, DC): Salivation armyJenny Wong
Presentation from the winning team, at the Lean Startup Machine event in Washington, D.C. 18-20 November 2011
The team: Andrew Evans, Kevin May, Jenny Wong, ChanMi Park
Personalising Medicine - The Hard Way or the Easy Way (linkedin)sgardne0
This document discusses the promise and challenges of personalized medicine. It begins by outlining the promise of using an individual's genomic and other health data to more accurately diagnose, treat, and manage disease. However, it notes that analyzing all the relevant factors for personalized medicine at scale currently poses immense computational challenges. The document then describes several case studies that demonstrate more effective approaches to personalized medicine using multi-trait analysis of many genomic and clinical factors. It argues that empowering individuals to better manage their own health using personalized digital tools could help shift healthcare outside of traditional settings and lower costs while improving outcomes.
Challenges for drug development jsr slides aug 2013CincyTechUSA
This document discusses the challenges facing the pharmaceutical industry in drug development in the 21st century. It notes that R&D productivity has remained flat despite increased spending. Factors like the patent cliff, rising healthcare costs, and increased regulatory demands mean the industry can no longer rely on the blockbuster drug model. Innovation is now focused on targeted therapies for niche markets. Pharmacologists must guide drug development to demonstrate a new drug's safety, efficacy, and economic value in order to gain approval and reimbursement.
Re-Engineering Early Phase Cancer Drug Development: Decreasing the Time from ...mconghuyen
The document summarizes efforts to decrease the time required to develop novel cancer therapeutics from target identification to clinical use. It describes how most oncology drugs fail in late stages of development, particularly phases 2 and 3, due to lack of efficacy. To address this, the National Cancer Institute has created programs like the Experimental Therapeutics Program and Chemical Biology Consortium to streamline the discovery and development process. This includes providing resources from target validation through early clinical trials to support academic and biotech projects focusing on areas of unmet medical need. The goal is to rapidly translate discoveries into treatments to benefit public health.
The document discusses breaking down information silos by promoting information sharing and collaboration. It describes how information silos are caused by rigid technologies, legacy processes, and lack of incentives for collaboration. This limits productivity, innovation, and collaboration. The document proposes providing technologies and processes that support information sharing, integrating data through standards, and building a collaborative culture with mutual trust. It provides examples of public-private partnerships between organizations like NIH, FDA, and companies to analyze clinical trial data and stimulate further research.
1. Innovation can originate from individuals, firms, universities, government laboratories, and other sources. Gavriel Iddan, an engineer, came up with the idea for a pill-sized camera after conversations with his gastroenterologist friend about the limitations of existing technologies for viewing the small intestine.
2. Iddan co-founded Given Imaging to develop the PillCam, a miniature camera enclosed in a capsule that can be ingested to take video of the small intestine as it passes through naturally. The PillCam provided a less invasive alternative to existing methods like endoscopy and surgery.
3. Given Imaging's PillCam technology experienced significant growth and commercial success. The company was acquired by Covid
1. Innovation can arise from individuals, firms, universities, government labs, or collaborative networks. Collaborative networks that leverage knowledge from multiple sources are a powerful source of innovation.
2. The ability to generate new ideas, known as creativity, underlies the innovation process. Creativity produces work that is both novel and useful.
3. Interfirm networks are an important source of innovation, as they provide access to a wider range of information and resources than individual firms possess alone. The structure of the network, such as its density, influences how information flows between firms.
This document discusses how the increased generation of data in pharmaceutical R&D has failed to improve productivity due to an inability to properly organize and apply knowledge. It describes how existing integration strategies store data in isolated "silos" that are difficult to access and compare. The author argues that new semantic technologies based on ontologies can better integrate knowledge across discovery, development, and other business areas to improve transparency, safety monitoring, and decision making.
Lean Startup Machine (Washington, DC): Salivation armyJenny Wong
Presentation from the winning team, at the Lean Startup Machine event in Washington, D.C. 18-20 November 2011
The team: Andrew Evans, Kevin May, Jenny Wong, ChanMi Park
Personalising Medicine - The Hard Way or the Easy Way (linkedin)sgardne0
This document discusses the promise and challenges of personalized medicine. It begins by outlining the promise of using an individual's genomic and other health data to more accurately diagnose, treat, and manage disease. However, it notes that analyzing all the relevant factors for personalized medicine at scale currently poses immense computational challenges. The document then describes several case studies that demonstrate more effective approaches to personalized medicine using multi-trait analysis of many genomic and clinical factors. It argues that empowering individuals to better manage their own health using personalized digital tools could help shift healthcare outside of traditional settings and lower costs while improving outcomes.
Challenges for drug development jsr slides aug 2013CincyTechUSA
This document discusses the challenges facing the pharmaceutical industry in drug development in the 21st century. It notes that R&D productivity has remained flat despite increased spending. Factors like the patent cliff, rising healthcare costs, and increased regulatory demands mean the industry can no longer rely on the blockbuster drug model. Innovation is now focused on targeted therapies for niche markets. Pharmacologists must guide drug development to demonstrate a new drug's safety, efficacy, and economic value in order to gain approval and reimbursement.
Re-Engineering Early Phase Cancer Drug Development: Decreasing the Time from ...mconghuyen
The document summarizes efforts to decrease the time required to develop novel cancer therapeutics from target identification to clinical use. It describes how most oncology drugs fail in late stages of development, particularly phases 2 and 3, due to lack of efficacy. To address this, the National Cancer Institute has created programs like the Experimental Therapeutics Program and Chemical Biology Consortium to streamline the discovery and development process. This includes providing resources from target validation through early clinical trials to support academic and biotech projects focusing on areas of unmet medical need. The goal is to rapidly translate discoveries into treatments to benefit public health.
The document discusses breaking down information silos by promoting information sharing and collaboration. It describes how information silos are caused by rigid technologies, legacy processes, and lack of incentives for collaboration. This limits productivity, innovation, and collaboration. The document proposes providing technologies and processes that support information sharing, integrating data through standards, and building a collaborative culture with mutual trust. It provides examples of public-private partnerships between organizations like NIH, FDA, and companies to analyze clinical trial data and stimulate further research.
1. Innovation can originate from individuals, firms, universities, government laboratories, and other sources. Gavriel Iddan, an engineer, came up with the idea for a pill-sized camera after conversations with his gastroenterologist friend about the limitations of existing technologies for viewing the small intestine.
2. Iddan co-founded Given Imaging to develop the PillCam, a miniature camera enclosed in a capsule that can be ingested to take video of the small intestine as it passes through naturally. The PillCam provided a less invasive alternative to existing methods like endoscopy and surgery.
3. Given Imaging's PillCam technology experienced significant growth and commercial success. The company was acquired by Covid
1. Innovation can arise from individuals, firms, universities, government labs, or collaborative networks. Collaborative networks that leverage knowledge from multiple sources are a powerful source of innovation.
2. The ability to generate new ideas, known as creativity, underlies the innovation process. Creativity produces work that is both novel and useful.
3. Interfirm networks are an important source of innovation, as they provide access to a wider range of information and resources than individual firms possess alone. The structure of the network, such as its density, influences how information flows between firms.
1. Creativity is the underlying process that enables innovation through generating new ideas. Innovation can originate from individuals, firms, universities, government research, and nonprofit organizations.
2. A key source of innovation is collaborative networks that leverage resources across multiple organizations and individuals. These networks are especially important in high-technology sectors due to the wide range of expertise required.
3. While individual organizations contribute to innovation, the most significant source may be the linkages between different components of the innovation system through collaborative relationships and networks.
Schilling (2017pp.15 42) sources of innovation (chapter 2)NandiNoprita
1. Creativity is the underlying process that enables innovation through the generation of new ideas. Innovation can originate from individuals, firms, universities, government research, and collaborative networks between these organizations.
2. Firms are a primary driver of innovation through their research and development activities. However, firms often collaborate externally with customers, suppliers, universities and other organizations to access a wider range of knowledge and resources.
3. The most significant source of innovation comes from collaborative networks that leverage the combined capabilities and resources across multiple organizations or individuals. These networks are especially important in high-technology sectors.
1. Creativity is the underlying process that enables innovation through the generation of new ideas. Innovation can originate from individuals, firms, universities, government research, and collaborative networks between these entities.
2. Firms are a primary driver of innovation through their research and development activities. However, firms often collaborate externally with customers, suppliers, universities, and other organizations to access a wider range of knowledge and resources.
3. The most significant source of innovation comes from collaborative networks that leverage the combined capabilities and resources across multiple individuals and organizations. These networks are especially important in high-technology fields.
The article discusses how data and new technologies are enabling innovation like never before. It outlines several key points:
1) The amount of digital data from various sources is massively increasing, including data from IT systems, research databases, patent filings, and new sources like social media and the Internet of Things.
2) Collaboration platforms and crowd-sourcing tools can help organizations assess, prioritize, fund, and realize innovation opportunities by pulling insights from various participants.
3) A wide range of data sources must be harnessed for insights, including repositories, exchanges, unstructured data, social media, sensor data, and more. Both looking back longitudinally and forward visioning with rapid prototyping
Getting an Inside Look: Given Imaging’s Camera Pilla
Gavriel Iddan was an electro-optical engineer at Israel’s Rafael Armament Development Authority, the Israeli authority for development of weapons and military technology. One of Iddan’s projects was to develop the “eye” of a guided missile, which leads the missile to its target. In 1981, Iddan traveled to Boston on sabbatical to work for a company that produced X-ray tubes and ultrasonic probes. While there, he befriended a gastroenterologist (a physician who focuses on digestive diseases) named Eitan Scapa. During long conversations in which each would discuss his respective field, Scapa taught Iddan about the technologies used to
view the interior lining of the digestive system. Scapa pointed out that the existing technologies had a number of significant limitations, particularly with respect to viewing the small intestine.b The small intestine is the locale of a number of serious disorders. In the United States alone, approximately 19 million people suffer from disorders in the small intestine (including bleeding, Crohn’s disease, celiac disease, chronic diarrhea, irritable bowel syndrome, and small bowel cancer)
1. Creativity is the underlying process that enables innovation by generating new ideas. Innovation can originate from individuals, firms conducting research and development, universities, government laboratories, and collaborative networks between multiple organizations that leverage each other's resources and capabilities.
2. The most significant source of innovation comes from collaborative networks rather than any single entity, as no single organization possesses all the resources needed for major innovations in many high-technology fields. Collaborative networks allow information and resources to flow more freely between organizations.
This paper explores biopharma industry challenges in product development and innovation, defines PLM for the industry and provides guidance on how to get started on the PLM journey.
Getting an Inside Look: Given Imaging’s Camera Pilla
Gavriel Iddan was an electro-optical engineer at Israel’s Rafael Armament Development Authority, the Israeli authority for development of weapons and military technology. One of Iddan’s projects was to develop the “eye” of a guided missile, which leads the missile to its target. In 1981, Iddan traveled to Boston on sabbatical to work for a company that produced X-ray tubes and ultrasonic probes. While there, he befriended a gastroenterologist (a physician who focuses on digestive diseases) named Eitan Scapa. During long conversations in which each would discuss his respective field, Scapa taught Iddan about the technologies used to
view the interior lining of the digestive system. Scapa pointed out that the existing technologies had a number of significant limitations, particularly with respect to viewing the small intestine.b The small intestine is the locale of a number of serious disorders. In the United States alone, approximately 19 million people suffer from disorders in the small intestine (including bleeding, Crohn’s disease, celiac disease, chronic diarrhea, irritable bowel syndrome, and small bowel cancer)
Pharma’s future has never looked more promising – or more ominous.
Major scientific, technological and socioeconomic changes will revive
the industry’s fortunes in another decade, but capitalising on these
trends will entail making crucial decisions first
This document discusses data science and the role of data scientists. It defines data science as using expertise in managing, transforming, and analyzing large, diverse datasets to help experts and decision-makers address challenges posed by big data. Specifically, data scientists help with infrastructure, reduce data into usable knowledge for domains, and help institutions manage data throughout its lifecycle. The proliferation of big data and new technologies has created a need for knowledge managers and data scientists to bridge gaps between technical and domain experts.
Lec#2 Big Data Analytics and AI in the battle against COVID-19 Aboul Ella Hassanien
Big data analytics and artificial intelligence are being used in the battle against COVID-19 in several ways:
- Tracking and predicting the spread of the virus using large epidemiological datasets and machine learning models.
- Developing data dashboards to monitor the pandemic situation.
- Using medical imaging and data to help develop treatments and potential cures.
- Leveraging big data can help plan for and prevent future pandemics by establishing precedents for data use in similar situations.
This document discusses how crowdsourcing and big data are changing healthcare. It provides Wikipedia as an example of how crowdsourcing has been used to build the largest online encyclopedia through collaborative contributions. The document also discusses how places like TED provide freely accessible knowledge on the internet and how this is impacting fields like healthcare. It introduces concepts like the "infinite monkey theorem" to illustrate how crowdsourcing approaches can achieve results that traditionally required formal organizations and significant resources.
Data Science and its Relationship to Big Data and Data-Driven Decision MakingDr. Volkan OBAN
Data Science and its Relationship to Big Data and Data-Driven Decision Making
To cite this article:
Foster Provost and Tom Fawcett. Big Data. February 2013, 1(1): 51-59. doi:10.1089/big.2013.1508.
Foster Provost and Tom Fawcett
Published in Volume: 1 Issue 1: February 13, 2013
ref:http://online.liebertpub.com/doi/full/10.1089/big.2013.1508
https://www.researchgate.net/publication/256439081_Data_Science_and_Its_Relationship_to_Big_Data_and_Data-Driven_Decision_Making
Can Open Source R&D Reinvigorate Drug Researchbhmunos
The open-source model pioneered by the software industry has potential applications to drug research and development. Open-source initiatives allow scientists to collaborate freely across organizations to solve shared problems. While there are significant differences between software and drug development, public-private partnerships have adapted open-source principles by crowdsourcing research through an outsourced network. These partnerships operate on small budgets by vetting proposals and outsourcing research, pursuing projects from early discovery through clinical trials. Over 20 such partnerships have been established and have attracted over $1.5 billion in funding to develop treatments for neglected diseases.
Using technology intelligence tools, companies can cut the time spent on research and development from weeks or months to seconds or minutes. Technology intelligence refers to identifying technological opportunities and threats that could impact a company's future growth. These tools provide contextual access to relevant information and insights by combining web content, scientific journals, and patents with search technology and analysis. For example, a company could search for ways to reduce energy consumption and the tool would return a summary of solutions from various categories, such as approaches from the EPA and Department of Energy, in under a minute. This represents a shift from traditional research methods to quickly gaining actionable intelligence through intuitive searches.
Big data has the potential to radically change healthcare delivery and research by enabling the analysis of large datasets to predict behaviors, encourage desirable outcomes, and reduce costs. However, healthcare organizations face significant challenges in realizing these benefits due to the fragmentation, security, and lack of standards around healthcare data. Adopting big data solutions that can integrate both internal and external real-time data sources will be key to overcoming these challenges and allowing healthcare constituents to make more informed decisions.
Global Pulse is an innovation initiative of the UN Secretary-General, harnessing today's new world of digital data and real-time analytics to gain a better understanding of changes in human well-being. Global Pulse hopes to contribute a future in which access to better information sooner makes it possible to keep international development on track, protect the world's most vulnerable populations, and strengthen resilience to global shocks.
Global Pulse functions as an innovation laboratory, bringing together expertise from UN agencies, governments, academia, and the private sector to research, develop, test and share tools and approaches for harnessing real-time data for more effective and efficient policy action.
The Pharma 2020 series
The Pharmaceutical industry's long successful strategy of placing big bets on a few molecules, promoting them heavily and turning them into blockbusters worked well for many years, but its R&D productivity has now plummeted and the environment’s changing. PwC believes that seven major trends are reshaping the marketplace:
Source of info:
http://www.pwc.com/gx/en/pharma-life-sciences/pharma2020/index.jhtml#
Presented at the World Conference on Educational Sciences http://www.wces2009.org/
February 04-07, Nicosia, Cyprus
Abstract: http://spuzic.synthasite.com/knowledge_-basics.php
Live presentation (Youtube): http://au.youtube.com/watch?v=ZYwYGXuVhqo
1. Creativity is the underlying process that enables innovation through generating new ideas. Innovation can originate from individuals, firms, universities, government research, and nonprofit organizations.
2. A key source of innovation is collaborative networks that leverage resources across multiple organizations and individuals. These networks are especially important in high-technology sectors due to the wide range of expertise required.
3. While individual organizations contribute to innovation, the most significant source may be the linkages between different components of the innovation system through collaborative relationships and networks.
Schilling (2017pp.15 42) sources of innovation (chapter 2)NandiNoprita
1. Creativity is the underlying process that enables innovation through the generation of new ideas. Innovation can originate from individuals, firms, universities, government research, and collaborative networks between these organizations.
2. Firms are a primary driver of innovation through their research and development activities. However, firms often collaborate externally with customers, suppliers, universities and other organizations to access a wider range of knowledge and resources.
3. The most significant source of innovation comes from collaborative networks that leverage the combined capabilities and resources across multiple organizations or individuals. These networks are especially important in high-technology sectors.
1. Creativity is the underlying process that enables innovation through the generation of new ideas. Innovation can originate from individuals, firms, universities, government research, and collaborative networks between these entities.
2. Firms are a primary driver of innovation through their research and development activities. However, firms often collaborate externally with customers, suppliers, universities, and other organizations to access a wider range of knowledge and resources.
3. The most significant source of innovation comes from collaborative networks that leverage the combined capabilities and resources across multiple individuals and organizations. These networks are especially important in high-technology fields.
The article discusses how data and new technologies are enabling innovation like never before. It outlines several key points:
1) The amount of digital data from various sources is massively increasing, including data from IT systems, research databases, patent filings, and new sources like social media and the Internet of Things.
2) Collaboration platforms and crowd-sourcing tools can help organizations assess, prioritize, fund, and realize innovation opportunities by pulling insights from various participants.
3) A wide range of data sources must be harnessed for insights, including repositories, exchanges, unstructured data, social media, sensor data, and more. Both looking back longitudinally and forward visioning with rapid prototyping
Getting an Inside Look: Given Imaging’s Camera Pilla
Gavriel Iddan was an electro-optical engineer at Israel’s Rafael Armament Development Authority, the Israeli authority for development of weapons and military technology. One of Iddan’s projects was to develop the “eye” of a guided missile, which leads the missile to its target. In 1981, Iddan traveled to Boston on sabbatical to work for a company that produced X-ray tubes and ultrasonic probes. While there, he befriended a gastroenterologist (a physician who focuses on digestive diseases) named Eitan Scapa. During long conversations in which each would discuss his respective field, Scapa taught Iddan about the technologies used to
view the interior lining of the digestive system. Scapa pointed out that the existing technologies had a number of significant limitations, particularly with respect to viewing the small intestine.b The small intestine is the locale of a number of serious disorders. In the United States alone, approximately 19 million people suffer from disorders in the small intestine (including bleeding, Crohn’s disease, celiac disease, chronic diarrhea, irritable bowel syndrome, and small bowel cancer)
1. Creativity is the underlying process that enables innovation by generating new ideas. Innovation can originate from individuals, firms conducting research and development, universities, government laboratories, and collaborative networks between multiple organizations that leverage each other's resources and capabilities.
2. The most significant source of innovation comes from collaborative networks rather than any single entity, as no single organization possesses all the resources needed for major innovations in many high-technology fields. Collaborative networks allow information and resources to flow more freely between organizations.
This paper explores biopharma industry challenges in product development and innovation, defines PLM for the industry and provides guidance on how to get started on the PLM journey.
Getting an Inside Look: Given Imaging’s Camera Pilla
Gavriel Iddan was an electro-optical engineer at Israel’s Rafael Armament Development Authority, the Israeli authority for development of weapons and military technology. One of Iddan’s projects was to develop the “eye” of a guided missile, which leads the missile to its target. In 1981, Iddan traveled to Boston on sabbatical to work for a company that produced X-ray tubes and ultrasonic probes. While there, he befriended a gastroenterologist (a physician who focuses on digestive diseases) named Eitan Scapa. During long conversations in which each would discuss his respective field, Scapa taught Iddan about the technologies used to
view the interior lining of the digestive system. Scapa pointed out that the existing technologies had a number of significant limitations, particularly with respect to viewing the small intestine.b The small intestine is the locale of a number of serious disorders. In the United States alone, approximately 19 million people suffer from disorders in the small intestine (including bleeding, Crohn’s disease, celiac disease, chronic diarrhea, irritable bowel syndrome, and small bowel cancer)
Pharma’s future has never looked more promising – or more ominous.
Major scientific, technological and socioeconomic changes will revive
the industry’s fortunes in another decade, but capitalising on these
trends will entail making crucial decisions first
This document discusses data science and the role of data scientists. It defines data science as using expertise in managing, transforming, and analyzing large, diverse datasets to help experts and decision-makers address challenges posed by big data. Specifically, data scientists help with infrastructure, reduce data into usable knowledge for domains, and help institutions manage data throughout its lifecycle. The proliferation of big data and new technologies has created a need for knowledge managers and data scientists to bridge gaps between technical and domain experts.
Lec#2 Big Data Analytics and AI in the battle against COVID-19 Aboul Ella Hassanien
Big data analytics and artificial intelligence are being used in the battle against COVID-19 in several ways:
- Tracking and predicting the spread of the virus using large epidemiological datasets and machine learning models.
- Developing data dashboards to monitor the pandemic situation.
- Using medical imaging and data to help develop treatments and potential cures.
- Leveraging big data can help plan for and prevent future pandemics by establishing precedents for data use in similar situations.
This document discusses how crowdsourcing and big data are changing healthcare. It provides Wikipedia as an example of how crowdsourcing has been used to build the largest online encyclopedia through collaborative contributions. The document also discusses how places like TED provide freely accessible knowledge on the internet and how this is impacting fields like healthcare. It introduces concepts like the "infinite monkey theorem" to illustrate how crowdsourcing approaches can achieve results that traditionally required formal organizations and significant resources.
Data Science and its Relationship to Big Data and Data-Driven Decision MakingDr. Volkan OBAN
Data Science and its Relationship to Big Data and Data-Driven Decision Making
To cite this article:
Foster Provost and Tom Fawcett. Big Data. February 2013, 1(1): 51-59. doi:10.1089/big.2013.1508.
Foster Provost and Tom Fawcett
Published in Volume: 1 Issue 1: February 13, 2013
ref:http://online.liebertpub.com/doi/full/10.1089/big.2013.1508
https://www.researchgate.net/publication/256439081_Data_Science_and_Its_Relationship_to_Big_Data_and_Data-Driven_Decision_Making
Can Open Source R&D Reinvigorate Drug Researchbhmunos
The open-source model pioneered by the software industry has potential applications to drug research and development. Open-source initiatives allow scientists to collaborate freely across organizations to solve shared problems. While there are significant differences between software and drug development, public-private partnerships have adapted open-source principles by crowdsourcing research through an outsourced network. These partnerships operate on small budgets by vetting proposals and outsourcing research, pursuing projects from early discovery through clinical trials. Over 20 such partnerships have been established and have attracted over $1.5 billion in funding to develop treatments for neglected diseases.
Using technology intelligence tools, companies can cut the time spent on research and development from weeks or months to seconds or minutes. Technology intelligence refers to identifying technological opportunities and threats that could impact a company's future growth. These tools provide contextual access to relevant information and insights by combining web content, scientific journals, and patents with search technology and analysis. For example, a company could search for ways to reduce energy consumption and the tool would return a summary of solutions from various categories, such as approaches from the EPA and Department of Energy, in under a minute. This represents a shift from traditional research methods to quickly gaining actionable intelligence through intuitive searches.
Big data has the potential to radically change healthcare delivery and research by enabling the analysis of large datasets to predict behaviors, encourage desirable outcomes, and reduce costs. However, healthcare organizations face significant challenges in realizing these benefits due to the fragmentation, security, and lack of standards around healthcare data. Adopting big data solutions that can integrate both internal and external real-time data sources will be key to overcoming these challenges and allowing healthcare constituents to make more informed decisions.
Global Pulse is an innovation initiative of the UN Secretary-General, harnessing today's new world of digital data and real-time analytics to gain a better understanding of changes in human well-being. Global Pulse hopes to contribute a future in which access to better information sooner makes it possible to keep international development on track, protect the world's most vulnerable populations, and strengthen resilience to global shocks.
Global Pulse functions as an innovation laboratory, bringing together expertise from UN agencies, governments, academia, and the private sector to research, develop, test and share tools and approaches for harnessing real-time data for more effective and efficient policy action.
The Pharma 2020 series
The Pharmaceutical industry's long successful strategy of placing big bets on a few molecules, promoting them heavily and turning them into blockbusters worked well for many years, but its R&D productivity has now plummeted and the environment’s changing. PwC believes that seven major trends are reshaping the marketplace:
Source of info:
http://www.pwc.com/gx/en/pharma-life-sciences/pharma2020/index.jhtml#
Presented at the World Conference on Educational Sciences http://www.wces2009.org/
February 04-07, Nicosia, Cyprus
Abstract: http://spuzic.synthasite.com/knowledge_-basics.php
Live presentation (Youtube): http://au.youtube.com/watch?v=ZYwYGXuVhqo
1. Vol. 2, No. 3 2005
Drug Discovery Today: Technologies
Editors-in-Chief
Kelvin Lam – Pfizer, Inc., USA
Henk Timmerman – Vrije Universiteit, The Netherlands
DRUG DISCOVERY
TODAY
TECHNOLOGIES Knowledge management
Ontologies in drug discovery
Stephen P. Gardner
CTO BioWisdom Ltd, Harston Mill, Harston, Cambridge, UK CB2 5GG
Knowledge is a fundamental driver of innovation in
Section Editor:
drug discovery. Although investment in the automated Manuel Peitsch – Novartis, Basel, Switzerland
generation of data has led to swelling databases, these
have not been harnessed effectively to meet the chal- In the past decade, the pharma industry had its own
lenges of real-world business. It has been too difficult to explosion of data, investing billions of dollars in new auto-
make the necessary connections between pieces of mated technologies. Whereas this has generated a smokesc-
reen of rapid action and technological development, R&D
knowledge in a range of systems towards the bigger
productivity has continued to decline as the investments in
picture and to formulate and execute a response to
systems to harness and focus that data on real business
this. The new generation of ontology-based semantic problems have lagged way behind. Far from being knowl-
technologies is changing the way that information is edge-led, drug discovery scientists often feel powerless in the
represented and used, building a new power system for face of large sets of increasingly complex and disconnected
data, with no means of pulling together the big picture that
R&D, based on the generation, distribution and appli-
would generate the insight to answer their questions. Backed
cation of knowledge across the organisation. into this corner, discovery science risks becoming something
of a data-driven crystal ball, generating large sets of data and
looking within them for enlightenment in the swirling latent
patterns that might be revealed.
Introduction: knowledge-led drug discovery: vision or
Without transparent, unfettered access to diverse knowl-
reality?
edge from inside and outside the organisation and the tools
To paraphrase Sir Francis Bacon [1], in the context of drug
to pull it all together, complex decisions simply cannot be
discovery, knowledge is power. It is a vital service that should
made in a fully informed fashion. Yet the pace of business
be flowing out of the walls of pharmaceutical companies, as
insists that crucial investment decisions are made rapidly and
fundamental to powering innovation as electricity is for
continuously. Harnessed properly, the knowledge of a com-
powering the instruments and lab equipment. However, just
pany’s assets could help pump its pipeline full of profitable
like electricity, knowledge can be unproductive and even
new compounds but if managed badly, they could lead to the
dangerous if mishandled. Unfocused, raw energy is expensive
wrong decisions being taken, which could cost billions if
and destructive – 60 years ago, the Hiroshima bomb delivered
those products fail in trials or, worse still, after they reach
an explosive yield of approximately 15 kt in a blast that deva-
the market.
stated an area of 5 square miles. This same energy harnessed
properly could power the whole of New York City for about 3 h
Gathering and applying knowledge in drug discovery
(18GW-hours) [2], long enough for the city to generate US$ 200
There are several elements required to be able to exploit
million of gross metropolitan product (http://www.usmayors.
information effectively inside a large corporation, especially
org/metroeconomies/1004/metroeconomies_1004.pdf).
one dealing with such diverse and complex information as a
E-mail address: S.P. Gardner (steve.gardner@biowisdom.com) pharmaceutical company. First, information has to be gen-
1740-6749/$ ß 2005 Elsevier Ltd. All rights reserved. DOI: 10.1016/j.ddtec.2005.08.004 www.drugdiscoverytoday.com 235
2. Drug Discovery Today: Technologies | Knowledge management Vol. 2, No. 3 2005
erated, by transforming it from its diverse primary sources (http://www.scienceandsociety.co.uk/results.asp?image=
into a medium that can be harnessed. Second, the infrastruc- 10211220& wwwflag=2&imagepos=7) [11]. This is not so
ture has to be in place to distribute the knowledge on demand much of a problem with structured information sources,
to scientific and business users throughout the company (and which can be parsed relatively easily in most cases but is a
possibly its collaborators). Third, the applications of the user serious problem with information from free textual sources.
have to be capable of taking advantage of the diverse set of Mining information from free text sources such as Medline,
information, so that it can be put to work for real business patent documents, regulatory submissions and preclinical
benefit. reports is crucial to the population of any comprehensive
Different informatics technologies have addressed these R&D knowledge resource. These resources contain not just
different phases but few bridge all three. One of the emerging simple data but also crucial interpretations, inferences and
technologies that have the potential to provide a coordinated opinions derived from experimentation. The difficulty comes
solution across all three is the new breed of semantic tech- in extracting the useful information from the text and trans-
nologies driven not by the structure or format of data but by forming it into a semantically consistent form. Keyword
the meaning (semantics) of those data. One of the primary searches are effectively useless at this and tools such as
development drivers of standards for these technologies is the Natural Language Processing (NLP) systems, although power-
Semantic Web [3], a project to help develop the next gen- ful, struggle with accurately identifying the complex nomen-
eration of Worldwide Web technologies, managed by W3C clature in the life sciences domain [12–14]. Solutions involving
(http://www.w3.org/Consortium). the use of an ontology as a domain guide can provide an
The semantics are embodied in descriptions of the con- effective way of optimising the population of specific relations.
cepts in the application domain, the relationships between For example, if a corpus was to be searched for references to a
them and their properties. This description of the domain is pattern such as ‘neuroleptic compounds interacting with GPCRs,’ it
called an ontology. In truth, the definition of an ontology would obviously be advantageous to have not only all the
can be hard to pin down as the term is often loosely names of all the neuroleptics and GPCRs and all of their
applied to dictionaries, thesauri and taxonomies as well synonyms a priori but also all of the forms that the interaction
as to much more complex networks of related information between compounds and proteins might take, for example,
[4]. binds, blocks, inhibits, among others. A comprehensive ontol-
Ontologies were originally used by Greek philosophers ogy will provide this level of domain knowledge and can
such as Aristotle [5] to name the things they saw in the significantly improve the accuracy and productivity of NLP-
universe and the relationships between them, so that they based text mining in biomedical literature [15].
could reason about the nature of the universe. Computer Over and above their contribution as an input to optimise
scientists have extended the principle and attempted to use text mining, ontologies also provide a repository for new
ontology-based models to reason computationally about the information generated during structured data parsing or text
domain, for example, to predict the optimal strategy for mining. By bringing together information from a wide variety
deploying air and ground resources in airline traffic manage- of resources into one ontology, they can be connected,
ment or space planning [6]. Biologists have likewise created managed and distributed as a coherent dataset. This ensures
nomenclatures and vocabularies based on ontologies to man- that information that would traditionally get lost or other-
age genomic and pathway integration [7–9]. wise become invisible over time (for example, an emailed
Ontologies can, however, be much more complex than copy of a PowerPoint presentation or Excel spreadsheet) will
simple hierarchies used to taxonomically organise a domain instead remain visible. It also allows knowledge from multi-
or provide a dictionary of certain types of concepts. They can ple sources inside and outside an organisation to be seman-
significantly enhance all three aspects of effective informa- tically normalised so that it can be compared and distributed
tion handling: generation, distribution and application. as a single consistent set of information [16].
Generating knowledge Distributing knowledge
One of the major problems in creating dictionaries, taxo- Several standards have been developed for the delivery of
nomies or ontologies is the scope of the domain, the rate of diverse information in ontologies to different applications.
knowledge acquisition and the limited time and resources The predominant standards in ontology development are
available. It is simply very difficult in a world where knowl- those defined by W3C as part of their Semantic Web efforts.
edge is changing rapidly to rely on human editors to maintain In particular, the Resource Data Framework (RDF) (http://
an up-to-date and accurate reflection of the state of the www.w3.org/RDF/), RDF Schema (http://www.w3.org/TR/
domain [10]. As far back as 1694, Leibniz was building cal- rdf-schema/) and Web Ontology Language (OWL) (http://
culating machines to automate the task of managing cate- www.w3.org/2004/OWL/) formats are important. These stan-
gories of concepts describing an ever more complex world dards provide a framework in which information from several
236 www.drugdiscoverytoday.com
3. Vol. 2, No. 3 2005 Drug Discovery Today: Technologies | Knowledge management
disparate sources can be abstracted, represented and distrib- reasoning engine. This effectively means reducing the
uted to applications. The motivation of the standards is, amount of information contained in the ontology to those
however, to enable better machine understanding of web- pieces that fit the limitations of the expressiveness of the
based information resources, not necessarily to facilitate language. These reductions will be governed largely by the
scientific discovery. This point causes some tensions as will needs of the reasoning engine and not by the end-user.
be discussed later. It is, therefore, important to consider which standards are
The fundamental knowledge representation framework is the most appropriate to use. RDF, RDF Schema and OWL Full
provided by RDF, a language (based on XML) that uses an are obvious choices but the reasoning-based standards have
abstract graph-based model to represent information in tri- limitations based on the applications that they are designed
plets of the form <subject> <predicate> <object>. RDF allows to support. Whereas computational reasoning based on
the expression of simple statements (sometimes called asser- ontologies is an interesting computer science problem and
tions) about concepts using named properties and values, for has been successful in several problem areas, for example
example, a triplet might be <‘Ontologies in Drug Discover- [17,18], it is not as immediately applicable to the wider life
y’><author><‘Steve Gardner’>. Graphs are made up of a science domain. The ideal characteristics of a system to which
collection of triplets, which might overlap, so that all papers computational reasoning can successfully be applied are that
with Steve Gardner as author could easily be extracted. The it should be comprehensively described, well bounded, rela-
concepts in RDF (such as the paper and author names) are tively small (<100 K concepts) and above all deterministic in
identified uniquely using Uniform Resource Identifier refer- nature (that is actions have defined and reproducible con-
ences (URIs) (http://www.isi.edu/in-notes/rfc2396.txt), so sequences) [19]. Life science is obviously almost the exact
that there is no ambiguity if different authors, for example, opposite; our scientific understanding of metabolism, dis-
share the same name. ease, pharmacology and the network of mutually compensat-
RDF Schema extends the capabilities provided by RDF and ing effects that any one stimulus might have in the body is far
allows an ontology creator to define a specific vocabulary for from complete, our nomenclatures overlap and intersect
their domain ontology. This allows them to describe specific awkwardly, biology is in few cases truly deterministic and
types of resources (such as all proteins, GPCRs or authors) and we are continually observing and describing new concepts
their properties and use them as types, classes and sets when and relationships.
interacting with the ontology. Whereas an RDF representa- At the same time, there are hundreds of real-world business
tion is very abstract, the definitions of vocabularies allowed applications inside pharma that are desperate for the delivery
by RDF Schema provide much of the domain-specific seman- of integrated sets of all available knowledge so that human
tics that will go into the ontology and make the ontology reasoning (i.e. expert scientists and business people) can
much more useful to domain specialists. make decisions in a more informed manner. When develop-
OWL is designed to provide computational reasoning (also ing ontologies for applications in life sciences, it is therefore
called inferencing) capabilities across ontologies. These could important to choose the level at which the standards repre-
in theory be used to infer, for example, that if a marketed sent sufficient information to power the real-world applica-
compound had a specific side effect associated with a parti- tions the knowledge will be used to support, rather than
cular receptor and that receptor was targeted by a therapy for necessarily following a series of standards developed for
a different disease, that the original marketed compound computer science applications that are better suited to other
might potentially have an alternate use for the second dis- domains.
ease. There are different levels of the OWL language, termed
Full, DL and Lite, which have different capabilities and Applying knowledge
different levels of support for computational reasoning. All Ontologies function as an atlas, describing the information
OWL dialects are built as a vocabulary extension of RDF, in content of the primary data sources that underpin them.
such a way that any RDF graph will form an OWL Full They provide a convenient schema to organise and deliver
ontology. OWL Full has minimal support for reasoning but information from many disparate sources to a range of dif-
maximum support for fully expressing any knowledge in any ferent business applications, in the same way that the map in
form. Other more restrictive OWL sublanguages include a GPS system might be used for finding a route between cities,
OWL DL (Description Logic), which provides a language locating a nearby restaurant or avoiding speed cameras on the
subset that supports Description Logic-based computational roads. The single consistent representation of knowledge can
reasoning, and OWL Lite, which is a further restricted subset be delivered to several different applications because of the
of OWL DL. The discriminating feature of OWL DL and OWL abstract way that knowledge is held in the ontology.
Lite is that the relationships particularly should be con- There are several drug discovery applications for which
strained to a logically discriminable subset that can be guar- ontologies are providing a knowledge substrate. This is
anteed to be computationally tractable (decidable) using a timely, as the job of drug discovery has become more chal-
www.drugdiscoverytoday.com 237
4. Drug Discovery Today: Technologies | Knowledge management Vol. 2, No. 3 2005
lenging recently owing to the increasing risk aversion of the times as long as the actual analysis. Because it is so difficult,
regulatory authorities, prescribing bodies, insurers and this data preparation phase is often curtailed and simpler, less
patients. It has become crucial to establish the comparative systematic analyses are performed instead. This has the
safety profile of a new compound relative to its class compe- obvious downside that fewer data are considered and impor-
titors much earlier in the development process, and the tant trends or correlations are less likely to emerge.
discovery of early-stage biomarkers for adverse events is much Ontologies can shortcut this information gathering phase
more important than it was even 2 years ago. This sensitivity and provide a semantically consistent and comprehensive
is particularly prevalent in first-in-class medicines, where an substrate for data mining and analysis. Because the ontology
acceptable baseline of adverse events has not yet been estab- already contains all of the important concepts and properties
lished, and the regulatory bodies have begun to demand full from all of the different primary data sources and because this
mechanistic explanations of any unusual events seen during has already been semantically normalised, information can
trials. This might, in turn, bring into play peripheral factors be directly compared in the analysis system. An example of
such as accepted assays that do not actually directly measure multidimensional information exported from an ontology
the level of a biomarker, or whose relationship to a toxicity is into the Spotfire DecisionSiteTM software is shown in Fig. 1.
at best poorly understood (e.g. photoxicity or liver injury). In Integration of chemical information: The seamless integration of
this environment, it is crucial to be armed with all of the chemical information with biological and business data is
available information and tools to help find the relevant difficult to achieve except in very specific, often single ven-
connections between nuggets of knowledge. dor, systems. This has limited the development of several
Many of the tools that are already in place inside a phar- applications where chemical data need to be correlated freely
maceutical company’s R&D environment can benefit from with a range of other information from different sources.
being connected to the network of knowledge embodied Example applications that could take advantage of this would
inside an ontology. Some of these underlying technologies include extended SAR analysis including biological data,
that have been enhanced by ontologies are listed below: chemical clustering based on affected target and pathway
information, freedom to operate searches in patent literature
Data mining and analysis: Although very good data mining or the selection of in-licensing candidates based on correla-
and analysis packages such as SpotfireTM, SASTM or R are in tions to known toxicities. Ontologies again provide a mechan-
widespread use, the Achilles heel is getting sufficient high- ism to represent chemical structure and property data
quality information to work from in a timely fashion. The alongside all of the other contextual data, whether it is biolo-
gathering and transformation of systematic multidimen- gical or business oriented in nature. Fig. 2 shows a chemical
sional data into a form that can be analysed can take several substructure search using CambridgeSoft’s ChemOfficeTM
Figure 1. Systematic analysis of anticancer therapeutic pipelines. Image shows all known anticancer therapeutics in development or marketed correlated
with manufacturers and activites. The colours of the squares represent the development phase information of the compounds.
238 www.drugdiscoverytoday.com
5. Vol. 2, No. 3 2005 Drug Discovery Today: Technologies | Knowledge management
Figure 2. Chemical substructure search driving contextual information retrieval. Image shows a substructure search for a fluorobenzene moiety, that has
returned Atorvastatin as a hit. This, in turn, has retrieved information regarding the targets, processes, disease, among others, associated with Atorvastatin
from a multirelational ontology derived from 45 structured and unstructured data sources.
Figure 3. Large-scale document navigation. Image shows a set of 14 Summary Basis for Approval (SBA) documents from the FDA regarding Rosuvastatin.
The ontology has been used to tag references to Rosuvastatin in specific contexts. In the case highlighted, this is showing all references to the different
Rosuvastatin-induced events mentioned within the 1200 page corpus.
www.drugdiscoverytoday.com 239
6. Drug Discovery Today: Technologies | Knowledge management Vol. 2, No. 3 2005
tool driving the retrieval of contextual information from Our ability to continually innovate is predicated on the
BioWisdom’s SofiaTM ontology. ability to see new information and patterns and reinterpret
Document analysis and submission: Navigating a complex docu- past experience. Ontology is one tool that has the potential to
ment structure and finding the correct context of all references provide the objective, comprehensive knowledge substrate
to a particular concept can be daunting in the life science that allows continuous researching and reappraisal of
domain where primary documents such as regulatory submis- hypotheses against new information as it becomes available
sions, Summary Basis for Approval (SBA) documents or patents to challenge old ideology and identify a range of new oppor-
are typically unstructured, poor quality, image only PDF docu- tunities.
ments. Ontologies coupled with text tagging tools such as
Corpora’s Jump!TM as shown in Fig. 3 can provide a navigation
framework around a document that makes finding informa- References
1 Bacon, F. (1587) Meditationes Sacrae, De Haeresibus
tion in large documents fast and intuitive.
2 Huber, P. and Mills, M.P. (2005) The Bottomless Well. Basic Books
3 Berners-Lee, T. et al. (2001) The semantic web. Sci. Am. 284, 34–43
Many business applications such as biomarker discovery, 4 Sowa, J., ed. (2000) Knowledge Representation Logical, Philosophical and
market differentiation, in-licensing and safety assessment Computational Foundations, Brooks/Cole
5 Aristotle (2004) The Categories, BookSurge Classics
can take advantage of these and other ontology-powered 6 Rajpathak, D. and Motta, E. (2004) An ontological formalization of the
technologies to bring more knowledge into the decision- planning task. Proceedings of the International Conference on Formal
making process. Ontologies in Information Systems (FOIS’04), Torino, Italy, Nov. 4–6, 2004
(http://kmi.open.ac.uk/people/dnyanesh/Publications/FOIS-2004/
Fois2004_camera-ready-final.pdf)
Concluding remarks 7 The Gene Ontology Consortium (2001) Creating the gene ontology
Although the technological development around ontologies resource: design and implementation. Genome Res. 11, 1425–1433
8 Williams, J. and Andersen, W. (2003) Bringing ontology to the genome
and their application to pharmaceutical business problems is
ontology. Comp. Funct. Genomics 4, 90–93
very recent, they are beginning to live up to their potential 9 Luciano, J.S. (2005) PAX of mind for pathway researchers. Drug Discov.
and are already demonstrating that they can deliver knowl- Today 10, 937–942
edge inside an enterprise pharmaceutical environment 10 Gardner, S.P. (Summer 2005) Ontologies: semantic networks of
pharmaceutical knowledge. Drug Discov. World 78–83
[17,20]. 11 Leibniz, G.W. (1965) Monadology and Other Philosophical Essays (Schrecker,
Some of the benefits that they offer might be disruptive to P. and Schrecker, A.M., eds and Trans.), Bobbs-Merrill Co.
established patterns of behaviour inside the discovery func- 12 Thomas, J. et al. (2000) Automatic extraction of protein interactions from
scientific abstracts, Pac. Symp. Biocomput. 541–552 (http://helix-
tion. For example, the notion that experts can automatically
web.stanford.edu/psb00/thomas.pdf)
maintain their currency in a domain simply by working in it 13 Melton, G.B. and Hripcsak, G. (2005) Automated detection of adverse
for years is sometimes misplaced, especially with the pace of events using natural language processing of discharge summaries. J. Am.
Med. Inform. Assoc. 4, 448–457
change in modern research. Experience suggests that without
14 Uramoto, N. et al. (2004) A text mining system for knowledge discovery
a prolonged effort, expertise gradually gives way to bias and from biomedical systems. IBM Syst. J. 43, 516–533
finally, in some cases, to dogma. This is difficult to recognise 15 Milward, D. et al. (2004) Ontology based interactive information
but can be a major obstacle to new hypotheses and ideas that extraction from scientific abstracts. Comp. Funct. Genomics 6, 67–71
16 Gardner, S.P. (2005) Ontologies and semantic data integration. Drug
are essential to innovation. Dogma exists in many forms, not Discov. Today 10, 1001–1007
simply individual but also ingrained in years of laboratory or 17 Stojanovic, L. et al. (2004) The role of ontologies in autonomic computing
clinical practice. Often molecules are typecast, with the site in systems. IBM Syst. J. 43, 598–616
18 Rector, A. (2003) Medical informatics. In Chapter 13 of the Description Logic
which they were discovered being imbued with a mythical
Handbook (Baader, F. et al. eds), pp. 406–426, Cambridge University Press
unquestionable status as the definition of their function. 19 Rector, A. (2003) Modularisation of domain ontologies implemented in
When they turn up years later being expressed in different description logics and related formalisms including OWL. Proceedings of K-
CAP’03, 23–25 October, 2003, Sanibel Island, FL, USA, pp. 121–128
tissues and possibly playing a different role, it comes as a
20 Torr-Brown, S. (2005) Advances in knowledge management for
surprise that assays that are used routinely might have dif- pharmaceutical research and development. Curr. Opin. Drug Discov. Dev. 8,
ferent interpretations. 316–322
240 www.drugdiscoverytoday.com