SlideShare a Scribd company logo
1 of 19
Download to read offline
Standards and Ontologies for
                             Functional Genomics
                                   Conference
                              October 23-26, 2004
                        University of Pennsylvania School
                                    of Medicine




EXPERIENCES IN BUILDING AN ONTOLOGY-
DRIVEN IMAGE DATABASE FOR BIOLOGISTS


Chris Catton
Image BioInformatics Laboratory
Department of Zoology
University of Oxford, UK

e-mail: chris.catton@zoo.ox.ac.uk
Outline

•   Why are images important?
•   What is the BioImage database?
•   Why use a semantic web architecture?
•   Lessons and research questions
Why are biological images important in the
                post-genomic age?
•   Images are semantic instruments for capturing aspects of the real
    world, and form a vital part of the scientific record, for which words are
    no substitute
•   In the post-genomic world, attention is now focused on the organization
    and integration of information within cells, for functional analyses of
    gene products
•   In a month a single active cell biology lab may generate between 10
    and 100 Gbytes of multidimensional image data
Images are complex …

•   An image database must be
    able to store original images in
    any digital format currently
    available or yet to be invented,
    including multi-channel 3D
    images, multi-channel videos,
    etc.
The need for image databases
•   The value of digital image information depends upon how easily it
    can be located, searched for relevance, and retrieved
•   Detailed descriptive metadata about the images are essential
•   Without them, digital image repositories become little more than
    meaningless and costly data graveyards
•   Despite the growth of on-line journals that permit the inclusion of
    media objects, few of these resources are freely available, and those
    that are are difficult to locate and are not cross-searchable
•   There is thus a need for a free publicly available image database with
    rich well-structured searchable metadata
•   The BioImage Database seeks to fulfil that need
This view has a growing acceptance …
What metadata?

•   Image acquisition (who took the original micrograph, where,
    when, under what conditions, for what purpose, etc.)
•   The media object itself (source and derivation, image type,
    dynamic range, resolution, format, codec, etc.),
•   The denotation of the referent (e.g. the name, age and condition
    of the subject),
•   Connotation of the referent (the image’s interpretation, meaning,
    purpose or significance, its relevance to its creator and others,
    and its semantic relationship to other images).
•   Field aspects of the real world that cannot conveniently be
    attached to any particular object (e.g. variations of illumination
    intensity or chemo-attractant concentration across the field of
    view of a light microscope image).
•   Sequences of change where there is a need to preserve the
    concept of object identity in the face of radical spatio-temporal
    changes in appearance.
Why use a semantic web architecture?

•   Traditional relational databases don’t meet our needs
     • Image data is complex, layered, and difficult to model
     • Images are searched primarily through their metadata
     • Metadata is time consuming and difficult to obtain
•   Ontologies offer the promise of better retrieval accuracy through
    linking to instances in an ontology, rather than attempting to
    process free text.
•   Ontologies offer the promise of easy inter-operability with other
    systems
The BioImage Ontology
Lessons learned:
                 Performance, scalability …

•   Database retrieval is slower than a traditional database would
    be
•   Scalability remains to be tested (true for all semantic web
    software)
•   Query languages (RDQL) are immature when compared to SQL
•   Parsing RDF is hard and slow (RDF-ABBREV output of the
    Jena parser is unreliable and the unstriped format requires
    multiple passes to create XML that can easily be transformed to
    HTML)
A problem with ontologies?

•   The volume of data generated in the Life Sciences is now
    estimated to be doubling every month

•   Already people look less and less at the raw scientific data
    (unless they are their own results)

•   As this volume of data accumulates, few if any of us will have
    the time or the mental capacity to assimilate new data, structure
    them in a meaningful way and extract information, without first
    processing the data through an ontology or some other similar
    machine-based organisational aid

•   THE ONTOLOGY WILL BE WRONG! (or we should all pack up
    and go home)
Paradigm shifts

•   Our human understanding of an area of science is never static,
    but is constantly being revised by new research
•   Such revisions in understanding are either evolutionary
    (incremental), following the progressive discovery of more and
    more detail, interpreted according to the prevailing paradigm, or
    revolutionary, when the prevailing paradigm is overthrown by
    another
•   How do paradigm revolutions succeed?
       "A new scientific truth does not triumph by convincing its
       opponents and making them see the light, but rather
       because its opponents eventually die, and a new generation
       grows up that is familiar with it"
                                          (Max Planck, 1949)
Factors preventing evolution

•   Ontology builders are ‘monks’ (and nuns) - led by an ‘abbot’, a
    relatively senior domain expert likely to be committed to encapsulating
    the dominant paradigm
•   Substantial problems confront any newcomers wishing to contribute,
    since ontology building is time-consuming and expensive
•   Since an ontology expresses the community consensus, there will be
    massive social pressures against change
•   If large volumes of data have already been encoded using an existing
    ontology, this will make it difficult to introduce change
•   The first ontology in a domain may assume a monopolistic position that
    becomes unassailable, even if it has universally acknowledged
    weaknesses
•   Ontologies are unlikely to evolve in response to the same market
    forces that drive the development of applications software
Encapsulating the dominant paradigm

•   Imagine a section of an ontology describing the development of adult
    mammalian bone marrow and brain, constructed according to the pre-1980
    dominant paradigm that bone marrow develops from mesoderm, while
    brain develops from ectoderm
An example of paradigm evolution

•   Subsequently, adult mouse brain was found to contain haemopoietic stem cells
•   Bartlett (1982) hypothesised that these cells developed from foetal haemopoietic cells that
    entered the brain tissue before the barrier was established




•   This challenge to the dominant paradigm that brain tissues are derived exclusively from
    ectoderm can be accommodated by extending the graph
An example of paradigm revolution

•   More recently, Brazelton et al. (2000) claimed that haemopoietic stem cells from adult
    bone marrow can develop into neural cells in adult mouse brain
•   If true, this result overthrows the paradigm that neuronal cells can only develop from
    embryonic ectoderm, requiring a new ontology incompatible with the old




•   This new ontology is no longer an extension of the previous one, since neural cells no
    longer develop only from foetal neuroepithelium
A way forward – using Named Graphs in
               RDF (and OWL?)
•   In response to considerable frustration and confusion within the RDF
    community about the best method of reifying RDF statements, Jeremy
    Carroll et al. proposed an extension to RDF
Thanks and acknowledgements

•   David Shotton and Simon Sparks
    for BioImage developments
    (http://www.bioimage.org)
•   John Pybus, our computer systems
    manager, for keeping us running in
    spite of the problems
•   Liz Mellings for unbounded
    patience inputting data and testing
•   The European Commission for
    funding the BioImage Project (EC
    IST 5th Framework Contract 2001-
    32688: ORIEL – Online Research
    Information Environment for the Life
    Sciences; http://www.oriel.org)
End

More Related Content

Viewers also liked

JavaOne 2009 - Full-Text Search: Human Heaven and Database Savior in the Cloud
JavaOne 2009 - Full-Text Search: Human Heaven and Database Savior in the CloudJavaOne 2009 - Full-Text Search: Human Heaven and Database Savior in the Cloud
JavaOne 2009 - Full-Text Search: Human Heaven and Database Savior in the CloudAaron Walker
 
OSDC 2010 - You've Got Cucumber in my Java and it Tastes Great
OSDC 2010 - You've Got Cucumber in my Java and it Tastes GreatOSDC 2010 - You've Got Cucumber in my Java and it Tastes Great
OSDC 2010 - You've Got Cucumber in my Java and it Tastes GreatAaron Walker
 
OSDC-2010 Database Full-text Search.... making it not suck
OSDC-2010 Database Full-text Search.... making it not suckOSDC-2010 Database Full-text Search.... making it not suck
OSDC-2010 Database Full-text Search.... making it not suckAaron Walker
 
Diona Slim Line. "Westfalia GMBH"
Diona Slim Line. "Westfalia GMBH"Diona Slim Line. "Westfalia GMBH"
Diona Slim Line. "Westfalia GMBH"mamontov
 
VRA 2012, Emerging New Roles, Or How My Job Has Evolved
VRA 2012, Emerging New Roles, Or How My Job Has EvolvedVRA 2012, Emerging New Roles, Or How My Job Has Evolved
VRA 2012, Emerging New Roles, Or How My Job Has EvolvedVisual Resources Association
 

Viewers also liked (7)

JavaOne 2009 - Full-Text Search: Human Heaven and Database Savior in the Cloud
JavaOne 2009 - Full-Text Search: Human Heaven and Database Savior in the CloudJavaOne 2009 - Full-Text Search: Human Heaven and Database Savior in the Cloud
JavaOne 2009 - Full-Text Search: Human Heaven and Database Savior in the Cloud
 
OSDC 2010 - You've Got Cucumber in my Java and it Tastes Great
OSDC 2010 - You've Got Cucumber in my Java and it Tastes GreatOSDC 2010 - You've Got Cucumber in my Java and it Tastes Great
OSDC 2010 - You've Got Cucumber in my Java and it Tastes Great
 
Biii
BiiiBiii
Biii
 
OSDC-2010 Database Full-text Search.... making it not suck
OSDC-2010 Database Full-text Search.... making it not suckOSDC-2010 Database Full-text Search.... making it not suck
OSDC-2010 Database Full-text Search.... making it not suck
 
Diona Slim Line. "Westfalia GMBH"
Diona Slim Line. "Westfalia GMBH"Diona Slim Line. "Westfalia GMBH"
Diona Slim Line. "Westfalia GMBH"
 
VRA 2012, Emerging New Roles, Or How My Job Has Evolved
VRA 2012, Emerging New Roles, Or How My Job Has EvolvedVRA 2012, Emerging New Roles, Or How My Job Has Evolved
VRA 2012, Emerging New Roles, Or How My Job Has Evolved
 
χριστοσ 2
χριστοσ  2χριστοσ  2
χριστοσ 2
 

Similar to Experiences in building an ontology driven image database for ...

Tragedy of the (Data) Commons
Tragedy of the (Data) CommonsTragedy of the (Data) Commons
Tragedy of the (Data) CommonsJames Hendler
 
How do we know what we don’t know: Using the Neuroscience Information Framew...
How do we know what we don’t know:  Using the Neuroscience Information Framew...How do we know what we don’t know:  Using the Neuroscience Information Framew...
How do we know what we don’t know: Using the Neuroscience Information Framew...Maryann Martone
 
Databases and Ontologies: Where do we go from here?
Databases and Ontologies:  Where do we go from here?Databases and Ontologies:  Where do we go from here?
Databases and Ontologies: Where do we go from here?Maryann Martone
 
Tragedy of the Data Commons (ODSC-East, 2021)
Tragedy of the Data Commons (ODSC-East, 2021)Tragedy of the Data Commons (ODSC-East, 2021)
Tragedy of the Data Commons (ODSC-East, 2021)James Hendler
 
Ontologies: What Librarians Need to Know
Ontologies: What Librarians Need to KnowOntologies: What Librarians Need to Know
Ontologies: What Librarians Need to KnowBarry Smith
 
The real world of ontologies and phenotype representation: perspectives from...
The real world of ontologies and phenotype representation:  perspectives from...The real world of ontologies and phenotype representation:  perspectives from...
The real world of ontologies and phenotype representation: perspectives from...Maryann Martone
 
Semantic Web & Web 3.0 empowering real world outcomes in biomedical research ...
Semantic Web & Web 3.0 empowering real world outcomes in biomedical research ...Semantic Web & Web 3.0 empowering real world outcomes in biomedical research ...
Semantic Web & Web 3.0 empowering real world outcomes in biomedical research ...Amit Sheth
 
Looking for Commonsense in the Semantic Web
Looking for Commonsense in the Semantic WebLooking for Commonsense in the Semantic Web
Looking for Commonsense in the Semantic WebValentina Presutti
 
Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...
Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...
Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...William Gunn
 
Ontological realism as a strategy for integrating ontologies
Ontological realism as a strategy for integrating ontologiesOntological realism as a strategy for integrating ontologies
Ontological realism as a strategy for integrating ontologiesBarry Smith
 
A Semantic Web based Framework for Linking Healthcare Information with Comput...
A Semantic Web based Framework for Linking Healthcare Information with Comput...A Semantic Web based Framework for Linking Healthcare Information with Comput...
A Semantic Web based Framework for Linking Healthcare Information with Comput...Koray Atalag
 
Connecting life sciences data at the European Bioinformatics Institute
Connecting life sciences data at the European Bioinformatics InstituteConnecting life sciences data at the European Bioinformatics Institute
Connecting life sciences data at the European Bioinformatics InstituteConnected Data World
 
The biodiversity informatics landscape: a systematics perspective
The biodiversity informatics landscape: a systematics perspectiveThe biodiversity informatics landscape: a systematics perspective
The biodiversity informatics landscape: a systematics perspectiveVince Smith
 
OAI7 Research Objects
OAI7 Research ObjectsOAI7 Research Objects
OAI7 Research Objectsseanb
 
Software Ecosystem Evolution. It's complex!
Software Ecosystem Evolution. It's complex!Software Ecosystem Evolution. It's complex!
Software Ecosystem Evolution. It's complex!Tom Mens
 

Similar to Experiences in building an ontology driven image database for ... (20)

Tragedy of the (Data) Commons
Tragedy of the (Data) CommonsTragedy of the (Data) Commons
Tragedy of the (Data) Commons
 
How do we know what we don’t know: Using the Neuroscience Information Framew...
How do we know what we don’t know:  Using the Neuroscience Information Framew...How do we know what we don’t know:  Using the Neuroscience Information Framew...
How do we know what we don’t know: Using the Neuroscience Information Framew...
 
Databases and Ontologies: Where do we go from here?
Databases and Ontologies:  Where do we go from here?Databases and Ontologies:  Where do we go from here?
Databases and Ontologies: Where do we go from here?
 
Ngsp
NgspNgsp
Ngsp
 
Data Mining Dissertations and Adventures and Experiences in the World of Chem...
Data Mining Dissertations and Adventures and Experiences in the World of Chem...Data Mining Dissertations and Adventures and Experiences in the World of Chem...
Data Mining Dissertations and Adventures and Experiences in the World of Chem...
 
Tragedy of the Data Commons (ODSC-East, 2021)
Tragedy of the Data Commons (ODSC-East, 2021)Tragedy of the Data Commons (ODSC-East, 2021)
Tragedy of the Data Commons (ODSC-East, 2021)
 
Ontologies: What Librarians Need to Know
Ontologies: What Librarians Need to KnowOntologies: What Librarians Need to Know
Ontologies: What Librarians Need to Know
 
Collins seattle-2014-final
Collins seattle-2014-finalCollins seattle-2014-final
Collins seattle-2014-final
 
The real world of ontologies and phenotype representation: perspectives from...
The real world of ontologies and phenotype representation:  perspectives from...The real world of ontologies and phenotype representation:  perspectives from...
The real world of ontologies and phenotype representation: perspectives from...
 
Semantic Web & Web 3.0 empowering real world outcomes in biomedical research ...
Semantic Web & Web 3.0 empowering real world outcomes in biomedical research ...Semantic Web & Web 3.0 empowering real world outcomes in biomedical research ...
Semantic Web & Web 3.0 empowering real world outcomes in biomedical research ...
 
Looking for Commonsense in the Semantic Web
Looking for Commonsense in the Semantic WebLooking for Commonsense in the Semantic Web
Looking for Commonsense in the Semantic Web
 
Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...
Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...
Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...
 
Ontological realism as a strategy for integrating ontologies
Ontological realism as a strategy for integrating ontologiesOntological realism as a strategy for integrating ontologies
Ontological realism as a strategy for integrating ontologies
 
Neuroscience as networked science
Neuroscience as networked scienceNeuroscience as networked science
Neuroscience as networked science
 
A Semantic Web based Framework for Linking Healthcare Information with Comput...
A Semantic Web based Framework for Linking Healthcare Information with Comput...A Semantic Web based Framework for Linking Healthcare Information with Comput...
A Semantic Web based Framework for Linking Healthcare Information with Comput...
 
Connecting life sciences data at the European Bioinformatics Institute
Connecting life sciences data at the European Bioinformatics InstituteConnecting life sciences data at the European Bioinformatics Institute
Connecting life sciences data at the European Bioinformatics Institute
 
The biodiversity informatics landscape: a systematics perspective
The biodiversity informatics landscape: a systematics perspectiveThe biodiversity informatics landscape: a systematics perspective
The biodiversity informatics landscape: a systematics perspective
 
Ontology
OntologyOntology
Ontology
 
OAI7 Research Objects
OAI7 Research ObjectsOAI7 Research Objects
OAI7 Research Objects
 
Software Ecosystem Evolution. It's complex!
Software Ecosystem Evolution. It's complex!Software Ecosystem Evolution. It's complex!
Software Ecosystem Evolution. It's complex!
 

Recently uploaded

Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxNirmalaLoungPoorunde1
 
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...Marc Dusseiller Dusjagr
 
Meghan Sutherland In Media Res Media Component
Meghan Sutherland In Media Res Media ComponentMeghan Sutherland In Media Res Media Component
Meghan Sutherland In Media Res Media ComponentInMediaRes1
 
Earth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatEarth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatYousafMalik24
 
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfLike-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfMr Bounab Samir
 
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxiammrhaywood
 
How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxmanuelaromero2013
 
MICROBIOLOGY biochemical test detailed.pptx
MICROBIOLOGY biochemical test detailed.pptxMICROBIOLOGY biochemical test detailed.pptx
MICROBIOLOGY biochemical test detailed.pptxabhijeetpadhi001
 
Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Celine George
 
Final demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptxFinal demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptxAvyJaneVismanos
 
Roles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in PharmacovigilanceRoles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in PharmacovigilanceSamikshaHamane
 
Full Stack Web Development Course for Beginners
Full Stack Web Development Course  for BeginnersFull Stack Web Development Course  for Beginners
Full Stack Web Development Course for BeginnersSabitha Banu
 
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...JhezDiaz1
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxSayali Powar
 
Blooming Together_ Growing a Community Garden Worksheet.docx
Blooming Together_ Growing a Community Garden Worksheet.docxBlooming Together_ Growing a Community Garden Worksheet.docx
Blooming Together_ Growing a Community Garden Worksheet.docxUnboundStockton
 

Recently uploaded (20)

TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdfTataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptx
 
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
 
Meghan Sutherland In Media Res Media Component
Meghan Sutherland In Media Res Media ComponentMeghan Sutherland In Media Res Media Component
Meghan Sutherland In Media Res Media Component
 
Earth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatEarth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice great
 
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
 
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
 
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfLike-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
 
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
 
How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptx
 
MICROBIOLOGY biochemical test detailed.pptx
MICROBIOLOGY biochemical test detailed.pptxMICROBIOLOGY biochemical test detailed.pptx
MICROBIOLOGY biochemical test detailed.pptx
 
9953330565 Low Rate Call Girls In Rohini Delhi NCR
9953330565 Low Rate Call Girls In Rohini  Delhi NCR9953330565 Low Rate Call Girls In Rohini  Delhi NCR
9953330565 Low Rate Call Girls In Rohini Delhi NCR
 
Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17
 
Final demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptxFinal demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptx
 
Roles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in PharmacovigilanceRoles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in Pharmacovigilance
 
Full Stack Web Development Course for Beginners
Full Stack Web Development Course  for BeginnersFull Stack Web Development Course  for Beginners
Full Stack Web Development Course for Beginners
 
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
 
ESSENTIAL of (CS/IT/IS) class 06 (database)
ESSENTIAL of (CS/IT/IS) class 06 (database)ESSENTIAL of (CS/IT/IS) class 06 (database)
ESSENTIAL of (CS/IT/IS) class 06 (database)
 
Blooming Together_ Growing a Community Garden Worksheet.docx
Blooming Together_ Growing a Community Garden Worksheet.docxBlooming Together_ Growing a Community Garden Worksheet.docx
Blooming Together_ Growing a Community Garden Worksheet.docx
 

Experiences in building an ontology driven image database for ...

  • 1. Standards and Ontologies for Functional Genomics Conference October 23-26, 2004 University of Pennsylvania School of Medicine EXPERIENCES IN BUILDING AN ONTOLOGY- DRIVEN IMAGE DATABASE FOR BIOLOGISTS Chris Catton Image BioInformatics Laboratory Department of Zoology University of Oxford, UK e-mail: chris.catton@zoo.ox.ac.uk
  • 2. Outline • Why are images important? • What is the BioImage database? • Why use a semantic web architecture? • Lessons and research questions
  • 3. Why are biological images important in the post-genomic age? • Images are semantic instruments for capturing aspects of the real world, and form a vital part of the scientific record, for which words are no substitute • In the post-genomic world, attention is now focused on the organization and integration of information within cells, for functional analyses of gene products • In a month a single active cell biology lab may generate between 10 and 100 Gbytes of multidimensional image data
  • 4. Images are complex … • An image database must be able to store original images in any digital format currently available or yet to be invented, including multi-channel 3D images, multi-channel videos, etc.
  • 5. The need for image databases • The value of digital image information depends upon how easily it can be located, searched for relevance, and retrieved • Detailed descriptive metadata about the images are essential • Without them, digital image repositories become little more than meaningless and costly data graveyards • Despite the growth of on-line journals that permit the inclusion of media objects, few of these resources are freely available, and those that are are difficult to locate and are not cross-searchable • There is thus a need for a free publicly available image database with rich well-structured searchable metadata • The BioImage Database seeks to fulfil that need
  • 6. This view has a growing acceptance …
  • 7. What metadata? • Image acquisition (who took the original micrograph, where, when, under what conditions, for what purpose, etc.) • The media object itself (source and derivation, image type, dynamic range, resolution, format, codec, etc.), • The denotation of the referent (e.g. the name, age and condition of the subject), • Connotation of the referent (the image’s interpretation, meaning, purpose or significance, its relevance to its creator and others, and its semantic relationship to other images). • Field aspects of the real world that cannot conveniently be attached to any particular object (e.g. variations of illumination intensity or chemo-attractant concentration across the field of view of a light microscope image). • Sequences of change where there is a need to preserve the concept of object identity in the face of radical spatio-temporal changes in appearance.
  • 8. Why use a semantic web architecture? • Traditional relational databases don’t meet our needs • Image data is complex, layered, and difficult to model • Images are searched primarily through their metadata • Metadata is time consuming and difficult to obtain • Ontologies offer the promise of better retrieval accuracy through linking to instances in an ontology, rather than attempting to process free text. • Ontologies offer the promise of easy inter-operability with other systems
  • 10. Lessons learned: Performance, scalability … • Database retrieval is slower than a traditional database would be • Scalability remains to be tested (true for all semantic web software) • Query languages (RDQL) are immature when compared to SQL • Parsing RDF is hard and slow (RDF-ABBREV output of the Jena parser is unreliable and the unstriped format requires multiple passes to create XML that can easily be transformed to HTML)
  • 11. A problem with ontologies? • The volume of data generated in the Life Sciences is now estimated to be doubling every month • Already people look less and less at the raw scientific data (unless they are their own results) • As this volume of data accumulates, few if any of us will have the time or the mental capacity to assimilate new data, structure them in a meaningful way and extract information, without first processing the data through an ontology or some other similar machine-based organisational aid • THE ONTOLOGY WILL BE WRONG! (or we should all pack up and go home)
  • 12. Paradigm shifts • Our human understanding of an area of science is never static, but is constantly being revised by new research • Such revisions in understanding are either evolutionary (incremental), following the progressive discovery of more and more detail, interpreted according to the prevailing paradigm, or revolutionary, when the prevailing paradigm is overthrown by another • How do paradigm revolutions succeed? "A new scientific truth does not triumph by convincing its opponents and making them see the light, but rather because its opponents eventually die, and a new generation grows up that is familiar with it" (Max Planck, 1949)
  • 13. Factors preventing evolution • Ontology builders are ‘monks’ (and nuns) - led by an ‘abbot’, a relatively senior domain expert likely to be committed to encapsulating the dominant paradigm • Substantial problems confront any newcomers wishing to contribute, since ontology building is time-consuming and expensive • Since an ontology expresses the community consensus, there will be massive social pressures against change • If large volumes of data have already been encoded using an existing ontology, this will make it difficult to introduce change • The first ontology in a domain may assume a monopolistic position that becomes unassailable, even if it has universally acknowledged weaknesses • Ontologies are unlikely to evolve in response to the same market forces that drive the development of applications software
  • 14. Encapsulating the dominant paradigm • Imagine a section of an ontology describing the development of adult mammalian bone marrow and brain, constructed according to the pre-1980 dominant paradigm that bone marrow develops from mesoderm, while brain develops from ectoderm
  • 15. An example of paradigm evolution • Subsequently, adult mouse brain was found to contain haemopoietic stem cells • Bartlett (1982) hypothesised that these cells developed from foetal haemopoietic cells that entered the brain tissue before the barrier was established • This challenge to the dominant paradigm that brain tissues are derived exclusively from ectoderm can be accommodated by extending the graph
  • 16. An example of paradigm revolution • More recently, Brazelton et al. (2000) claimed that haemopoietic stem cells from adult bone marrow can develop into neural cells in adult mouse brain • If true, this result overthrows the paradigm that neuronal cells can only develop from embryonic ectoderm, requiring a new ontology incompatible with the old • This new ontology is no longer an extension of the previous one, since neural cells no longer develop only from foetal neuroepithelium
  • 17. A way forward – using Named Graphs in RDF (and OWL?) • In response to considerable frustration and confusion within the RDF community about the best method of reifying RDF statements, Jeremy Carroll et al. proposed an extension to RDF
  • 18. Thanks and acknowledgements • David Shotton and Simon Sparks for BioImage developments (http://www.bioimage.org) • John Pybus, our computer systems manager, for keeping us running in spite of the problems • Liz Mellings for unbounded patience inputting data and testing • The European Commission for funding the BioImage Project (EC IST 5th Framework Contract 2001- 32688: ORIEL – Online Research Information Environment for the Life Sciences; http://www.oriel.org)
  • 19. End