Building a Semantic Web Image Repository for Biological Research Images Jun Zhao , Graham Klyne and David Shotton [email_address] Image Bioinformatics Research Group Department of Zoology University of Oxford, UK
FlyTED
The Drosophila Testis Gene Expression Image Database
Publish research data
Use existing tools to build a biological image repository
Browse-able and searchable by research biologists
Accessible to wider communities, e.g. Semantic Web, Linked Data
Loosely coupled software architecture maximises the opportunity of replacing or updating components used
Images
“… so that people can come and see the images, and they will notice something special about the genes”
--- Dr Helen White-Cooper ( Drosophila testis expert)
Where we started from
In situ gene expression images of the ~1500 genes of the testis of Drosophila melanogaster
>=1 image for each wild type gene
Possibly, >=1 image for any of the 6 different mutant strains having defective sperm maturation
Images stored in the file system
Metadata in spreadsheets, but not expressed using controlled keywords
No way to search for images
Search through the hard disk whenever they need an image
The goal
Publish Drosophila gene expression images to the Web
Make them accessible and searchable to our biological researchers as well as third parties
Quick, easy and cost-effective approach
EPrints 3.0 (http://eprints.org/software/)
A digital repository software system
Quick and easy to deploy
Built-in user interface
Programmatically data access
Repository-specific protocol: OAI-PMH
Support for domain-specific image metadata, e.g. Serpent Project http://archive.serpentproject.com/
A Piglet Squid from Serpent Project
Our gene expression images Gene name Strain name >1 Expression location Slide name Creation date ………… .
Adaptation of EPrints
Basic structure cannot hold the domain metadata
Customize underlying database:
Add additional metadata fields in the database schema
Keep both images and their metadata files as blobs
0 comments
Post a comment