A presentation given by Dave Roberts and coauthored by David King, Simon Rycroft, David Morse, Lyubomir Penev, Donat Agosti & Vince Smith. This was given at the Fourth Metadata and Semantics Research Conference (MTSR 2010) at Acala de Henares, Madrid, in the premises of the Faculty of Law.
Presentation about the recommendations coming from our review of the challenges and priorities facing biodiversity informatics over the coming decade. Linked to this paper published in BMC Ecology: http://dx.doi.org/10.1186/1472-6785-13-16
Presentation about the recommendations coming from our review of the challenges and priorities facing biodiversity informatics over the coming decade. Linked to this paper published in BMC Ecology: http://dx.doi.org/10.1186/1472-6785-13-16
Can science be social? Collective and Citizen Experimentation in Computationa...Josep Perelló
Brief presentation of our OpenSystems UB Research Group activity mostly focused on Computational Social Science and in relation with Citizen Science Practices. Presentation in the COMSOTEC meeting held in Santander (Spetember 10, 2015). Facultad de Ciencias, Universidad de Cantabria, Santander.
Digital Cinema and New Media Arts at Calit2Larry Smarr
09.05.01
Keynote
Retreat of the Interdisciplinary Film and Digital Media Program
University of New Mexico
Title: Digital Cinema and New Media Arts at Calit2
Albuquerque, NM
This table lists some details of publicly funded projects at each of the KAPTUR partner institutions. The information is already publicly available, but was brought together at a project team meeting in July to inform ongoing discussion about visual arts research data.
myExperiment and the Rise of Social MachinesDavid De Roure
Talk at hubbub 2012, Indianapolis, 25 September 2012. The talk introduces myExperiment and Wf4Ever, discusses the future of research communication including FORCE11, and introduces the SOCIAM project (Theory and Practice of Social Machines) which launches in October 2012.
Making your data work for you: Scratchpads, publishing & the biodiversity dat...Vince Smith
This is a derivative of a talk I gave at the Linnean society on 20th Sept. 2012. This version was given at the i4Life Environmental Genomics workshop on 25th Sept. and refocused to look at the dark taxa problem and developing published descriptions of molecular sequence clusters.
Apresentação feita por Lourdes Casanova, co-leader do projeto "InnovaLatino", da INSEAD, no seminário do Open Innovation Seminar 2011, que ocorreu dia 23/11.
This talk was presented at the Web Development SIG of the Greater Cleveland PC Users Group on Saturday, September 19, 2009.
There is a follow-up meeting with a more in-depth look on Google Analytics - this presentation only walks through how to set it up and get started.
Can science be social? Collective and Citizen Experimentation in Computationa...Josep Perelló
Brief presentation of our OpenSystems UB Research Group activity mostly focused on Computational Social Science and in relation with Citizen Science Practices. Presentation in the COMSOTEC meeting held in Santander (Spetember 10, 2015). Facultad de Ciencias, Universidad de Cantabria, Santander.
Digital Cinema and New Media Arts at Calit2Larry Smarr
09.05.01
Keynote
Retreat of the Interdisciplinary Film and Digital Media Program
University of New Mexico
Title: Digital Cinema and New Media Arts at Calit2
Albuquerque, NM
This table lists some details of publicly funded projects at each of the KAPTUR partner institutions. The information is already publicly available, but was brought together at a project team meeting in July to inform ongoing discussion about visual arts research data.
myExperiment and the Rise of Social MachinesDavid De Roure
Talk at hubbub 2012, Indianapolis, 25 September 2012. The talk introduces myExperiment and Wf4Ever, discusses the future of research communication including FORCE11, and introduces the SOCIAM project (Theory and Practice of Social Machines) which launches in October 2012.
Making your data work for you: Scratchpads, publishing & the biodiversity dat...Vince Smith
This is a derivative of a talk I gave at the Linnean society on 20th Sept. 2012. This version was given at the i4Life Environmental Genomics workshop on 25th Sept. and refocused to look at the dark taxa problem and developing published descriptions of molecular sequence clusters.
Apresentação feita por Lourdes Casanova, co-leader do projeto "InnovaLatino", da INSEAD, no seminário do Open Innovation Seminar 2011, que ocorreu dia 23/11.
This talk was presented at the Web Development SIG of the Greater Cleveland PC Users Group on Saturday, September 19, 2009.
There is a follow-up meeting with a more in-depth look on Google Analytics - this presentation only walks through how to set it up and get started.
Wish you were here before!' Who Gains from Collaboration between Computer Sci...dduin
Daphne Duin, David King and Peter van den Besselaar
Symposium of the Oxford Internet Institute: Social Science and Digital Research: Interdisciplinary Insights12 March 2012
Who gains from collaboration between computer science and social research? Our presentation at 'Interdisciplinary Insights', Oxford, Monday 12 March 2012, organised by the Oxford Internet Institute describing our collaboration in the ViBRANT project.
Perspectives on Collaborative Research Environments offered by D4ScienceFAO
Slides presented at the "4th Session of the IODE Group of Experts on Biological and Chemical Data Management and Exchange Practices (GE-BICH-IV)" which took place on 27-30 January 2009 in Oostende, Belgium
More information at: http://d4science.eu/node/173
Community web sites: small pieces loosely joinedVince Smith
Presented by Dave Roberts and coauthored by Simon Rycroft and Vince Smith at the GBIF European Nodes workshop at the National Muséum of Natural History (MNHN) in Paris, France. 4-6 April, 2011.
BioVeL (Biodiversity Virtual e-Laboratory) is an e-laboratory that supports research on biodiversity using large amounts of data from cross-disciplinary sources.
Personal views on what Research Infrastructures really need for data - a more comprehensive version of the 5 minute presentation I have at XLDB-Europe, 8-10th June 2011 in Edinburgh
Scratchpads: Building web communities supporting biodiversity scienceVince Smith
Presented by Dave Roberts at a meeting titled "Information Technology in Biodiversity Conservation and in Agriculture" organized by the Club of Rome and the EU ICT-ENSURE project, at UNESCO, Paris. January 15th, 2009.
GBIF registry (GBRDS), at European Nodes meeting in Alicante, Spain (10 March...Dag Endresen
Regional NODES meeting of Europe 2010. Presentation of the Global Biodiversity Resources Discovery System (GBRDS, under development) for the NODES. How do we the NODES want the GBRDS to look like. What do we the NODES wish/need the GBRDS to be.
http://www.gbif.org/
http://gbrds.gbif.org/
http://code.google.com/p/gbif-registry/
Keynote presentation by Professor Carole Goble at BOSC (Bioinformatics Open Source Conference) Long Beach, California, USA, July 14 2012. Co-located with ISMB, Intelligent Systems in Molecular Biology
This hit the press this week and demonstrates how Shire Pharmaceuticals have leveraged the Quantix Cloud. They have been able to use our technology to convert a client/server app (IDBS E-Workbook) into a flexible SaaS model.
ViBRANT: linking communities and servicesVince Smith
Roberts, D and Smith, VS. 2011. ViBRANT: linking communities and services. TDWG 2011 Annual Conference, at the Astor Crown Plaza Hotel, New Orleans, Louisiana, USA. 16 - 21st October 2011.
Similar to Community web sites: small pieces loosely joined (20)
FP7 Funded RI Project experiences: some overly honest tips from a project coo...Vince Smith
Smith, V.S. 2014. FP7 Funded RI Project experiences: some overly honest tips from a project coordinator, EC Horizon 2020 Research Infrastructures Information Day in at the Natural History Museum London, U.K. 18 June 2014.
No specimen left behind: Collections digitisation at the NHM, London*Vince Smith
Presentation on the Natural History Museum, London Digitisation Programme, given at the "Collections for the 21st Century" meeting in Gainesville, Florida, 5-6 May 2014
Assisted restructure of web content for paper-based presentation: a look at w...Vince Smith
Heaton, A., Rycroft, S., Baker, E., Bouton, K., Scott, B., Koureas, D., Livermore, L., Roberts, D., Smith, V. 2013 Assisted restructure of web content for paper-based presentation: a look at workflows and data representations. TDWG, Biodiversity Information Standards. Grand Hotel Mediterraneo Florence, Italy, 27 Oct - 1 Nov., 2013.
Bibliography of Life: Comprehensive services for biodiversity bibliographic r...Vince Smith
King, D., Sautter, G., Morse, D., Penev, L., Biserkov, J., Georgiev, T., Roberts, D., Smith, V. Bibliography of Life: Comprehensive services for biodiversity bibliographic references (POSTER). TDWG, Biodiversity Information Standards. Grand Hotel Mediterraneo Florence, Italy, 27 Oct - 1 Nov., 2013.
Scratchpads: the Virtual Research Environment for biodiversity dataVince Smith
Rycroft, S., Roberts, D., Smith, V., Heaton, A., Bouton, K., Livermore, L., Koureas, D., Baker, E. 2013. Scratchpads: the Virtual Research Environment for biodiversity data. TDWG, Biodiversity Information Standards. Grand Hotel Mediterraneo Florence, Italy, 27 Oct - 1 Nov., 2013.
Next generation sequencing requires next generation publishing: the Biodivers...Vince Smith
Penev, L., Stoev, P., Komericki, A., Akkari, N., Li, S., Zhou, X., Edmunds, S., Hunter, C., Weigand, A., Porco, D., Zapparoli, M., Georgiev, T., Mietchen, D., Roberts, D., Smith, V. 2013. Next generation sequencing requires next generation publishing: the Biodiversity Data Journal published the first eukaryotic new species with a fully sequenced transcriptome, DNA barcode and microcomputed tomography. TDWG, Biodiversity Information Standards. Grand Hotel Mediterraneo Florence, Italy, 27 Oct - 1 Nov.
Use it or lose it: crowdsourcing support and outreach activities in a hybrid ...Vince Smith
Koureas, D., Livermore, L., Roberts, D., Smith, V. 2013. Use it or lose it: crowdsourcing support and outreach activities in a hybrid sustainability model for e-infrastructures – the ViBRANT project case studies. TDWG, Biodiversity Information Standards. Grand Hotel Mediterraneo Florence, Italy, 27 Oct - 1 Nov., 2013.
Vince smith-delivering biodiversity knowledge in the information age-notextVince Smith
Smith, V.S. 2013. Delivering biodiversity knowledge in the information age. Hellenic Botanical Society, Thessaloniki, Greece, 3-6 Oct. 2013. [Delivered via video link through Google Hangouts]
Don't make me think: biodiversity data publishing made easyVince Smith
Presented by V. Smith at the 2013 iEvoBio Conference. Part of Evolution 2013, the joint annual meeting of the Society for the Study of Evolution (SSE), the Society of Systematic Biologists (SSB), and the American Society of Naturalists (ASN). June 21-26, 2013, Snowbird Alpine Village, Utah, USA.
Essentials of Automations: The Art of Triggers and Actions in FMESafe Software
In this second installment of our Essentials of Automations webinar series, we’ll explore the landscape of triggers and actions, guiding you through the nuances of authoring and adapting workspaces for seamless automations. Gain an understanding of the full spectrum of triggers and actions available in FME, empowering you to enhance your workspaces for efficient automation.
We’ll kick things off by showcasing the most commonly used event-based triggers, introducing you to various automation workflows like manual triggers, schedules, directory watchers, and more. Plus, see how these elements play out in real scenarios.
Whether you’re tweaking your current setup or building from the ground up, this session will arm you with the tools and insights needed to transform your FME usage into a powerhouse of productivity. Join us to discover effective strategies that simplify complex processes, enhancing your productivity and transforming your data management practices with FME. Let’s turn complexity into clarity and make your workspaces work wonders!
Securing your Kubernetes cluster_ a step-by-step guide to success !KatiaHIMEUR1
Today, after several years of existence, an extremely active community and an ultra-dynamic ecosystem, Kubernetes has established itself as the de facto standard in container orchestration. Thanks to a wide range of managed services, it has never been so easy to set up a ready-to-use Kubernetes cluster.
However, this ease of use means that the subject of security in Kubernetes is often left for later, or even neglected. This exposes companies to significant risks.
In this talk, I'll show you step-by-step how to secure your Kubernetes cluster for greater peace of mind and reliability.
Elevating Tactical DDD Patterns Through Object CalisthenicsDorra BARTAGUIZ
After immersing yourself in the blue book and its red counterpart, attending DDD-focused conferences, and applying tactical patterns, you're left with a crucial question: How do I ensure my design is effective? Tactical patterns within Domain-Driven Design (DDD) serve as guiding principles for creating clear and manageable domain models. However, achieving success with these patterns requires additional guidance. Interestingly, we've observed that a set of constraints initially designed for training purposes remarkably aligns with effective pattern implementation, offering a more ‘mechanical’ approach. Let's explore together how Object Calisthenics can elevate the design of your tactical DDD patterns, offering concrete help for those venturing into DDD for the first time!
GridMate - End to end testing is a critical piece to ensure quality and avoid...ThomasParaiso2
End to end testing is a critical piece to ensure quality and avoid regressions. In this session, we share our journey building an E2E testing pipeline for GridMate components (LWC and Aura) using Cypress, JSForce, FakerJS…
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...Neo4j
Leonard Jayamohan, Partner & Generative AI Lead, Deloitte
This keynote will reveal how Deloitte leverages Neo4j’s graph power for groundbreaking digital twin solutions, achieving a staggering 100x performance boost. Discover the essential role knowledge graphs play in successful generative AI implementations. Plus, get an exclusive look at an innovative Neo4j + Generative AI solution Deloitte is developing in-house.
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfPaige Cruz
Monitoring and observability aren’t traditionally found in software curriculums and many of us cobble this knowledge together from whatever vendor or ecosystem we were first introduced to and whatever is a part of your current company’s observability stack.
While the dev and ops silo continues to crumble….many organizations still relegate monitoring & observability as the purview of ops, infra and SRE teams. This is a mistake - achieving a highly observable system requires collaboration up and down the stack.
I, a former op, would like to extend an invitation to all application developers to join the observability party will share these foundational concepts to build on:
Unlocking Productivity: Leveraging the Potential of Copilot in Microsoft 365, a presentation by Christoforos Vlachos, Senior Solutions Manager – Modern Workplace, Uni Systems
UiPath Test Automation using UiPath Test Suite series, part 5DianaGray10
Welcome to UiPath Test Automation using UiPath Test Suite series part 5. In this session, we will cover CI/CD with devops.
Topics covered:
CI/CD with in UiPath
End-to-end overview of CI/CD pipeline with Azure devops
Speaker:
Lyndsey Byblow, Test Suite Sales Engineer @ UiPath, Inc.
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdfPeter Spielvogel
Building better applications for business users with SAP Fiori.
• What is SAP Fiori and why it matters to you
• How a better user experience drives measurable business benefits
• How to get started with SAP Fiori today
• How SAP Fiori elements accelerates application development
• How SAP Build Code includes SAP Fiori tools and other generative artificial intelligence capabilities
• How SAP Fiori paves the way for using AI in SAP apps
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...James Anderson
Effective Application Security in Software Delivery lifecycle using Deployment Firewall and DBOM
The modern software delivery process (or the CI/CD process) includes many tools, distributed teams, open-source code, and cloud platforms. Constant focus on speed to release software to market, along with the traditional slow and manual security checks has caused gaps in continuous security as an important piece in the software supply chain. Today organizations feel more susceptible to external and internal cyber threats due to the vast attack surface in their applications supply chain and the lack of end-to-end governance and risk management.
The software team must secure its software delivery process to avoid vulnerability and security breaches. This needs to be achieved with existing tool chains and without extensive rework of the delivery processes. This talk will present strategies and techniques for providing visibility into the true risk of the existing vulnerabilities, preventing the introduction of security issues in the software, resolving vulnerabilities in production environments quickly, and capturing the deployment bill of materials (DBOM).
Speakers:
Bob Boule
Robert Boule is a technology enthusiast with PASSION for technology and making things work along with a knack for helping others understand how things work. He comes with around 20 years of solution engineering experience in application security, software continuous delivery, and SaaS platforms. He is known for his dynamic presentations in CI/CD and application security integrated in software delivery lifecycle.
Gopinath Rebala
Gopinath Rebala is the CTO of OpsMx, where he has overall responsibility for the machine learning and data processing architectures for Secure Software Delivery. Gopi also has a strong connection with our customers, leading design and architecture for strategic implementations. Gopi is a frequent speaker and well-known leader in continuous delivery and integrating security into software delivery.
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024Neo4j
Neha Bajwa, Vice President of Product Marketing, Neo4j
Join us as we explore breakthrough innovations enabled by interconnected data and AI. Discover firsthand how organizations use relationships in data to uncover contextual insights and solve our most pressing challenges – from optimizing supply chains, detecting fraud, and improving customer experiences to accelerating drug discoveries.
State of ICS and IoT Cyber Threat Landscape Report 2024 previewPrayukth K V
The IoT and OT threat landscape report has been prepared by the Threat Research Team at Sectrio using data from Sectrio, cyber threat intelligence farming facilities spread across over 85 cities around the world. In addition, Sectrio also runs AI-based advanced threat and payload engagement facilities that serve as sinks to attract and engage sophisticated threat actors, and newer malware including new variants and latent threats that are at an earlier stage of development.
The latest edition of the OT/ICS and IoT security Threat Landscape Report 2024 also covers:
State of global ICS asset and network exposure
Sectoral targets and attacks as well as the cost of ransom
Global APT activity, AI usage, actor and tactic profiles, and implications
Rise in volumes of AI-powered cyberattacks
Major cyber events in 2024
Malware and malicious payload trends
Cyberattack types and targets
Vulnerability exploit attempts on CVEs
Attacks on counties – USA
Expansion of bot farms – how, where, and why
In-depth analysis of the cyber threat landscape across North America, South America, Europe, APAC, and the Middle East
Why are attacks on smart factories rising?
Cyber risk predictions
Axis of attacks – Europe
Systemic attacks in the Middle East
Download the full report from here:
https://sectrio.com/resources/ot-threat-landscape-reports/sectrio-releases-ot-ics-and-iot-security-threat-landscape-report-2024/
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...SOFTTECHHUB
The choice of an operating system plays a pivotal role in shaping our computing experience. For decades, Microsoft's Windows has dominated the market, offering a familiar and widely adopted platform for personal and professional use. However, as technological advancements continue to push the boundaries of innovation, alternative operating systems have emerged, challenging the status quo and offering users a fresh perspective on computing.
One such alternative that has garnered significant attention and acclaim is Nitrux Linux 3.5.0, a sleek, powerful, and user-friendly Linux distribution that promises to redefine the way we interact with our devices. With its focus on performance, security, and customization, Nitrux Linux presents a compelling case for those seeking to break free from the constraints of proprietary software and embrace the freedom and flexibility of open-source computing.
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...James Anderson
Effective Application Security in Software Delivery lifecycle using Deployment Firewall and DBOM
The modern software delivery process (or the CI/CD process) includes many tools, distributed teams, open-source code, and cloud platforms. Constant focus on speed to release software to market, along with the traditional slow and manual security checks has caused gaps in continuous security as an important piece in the software supply chain. Today organizations feel more susceptible to external and internal cyber threats due to the vast attack surface in their applications supply chain and the lack of end-to-end governance and risk management.
The software team must secure its software delivery process to avoid vulnerability and security breaches. This needs to be achieved with existing tool chains and without extensive rework of the delivery processes. This talk will present strategies and techniques for providing visibility into the true risk of the existing vulnerabilities, preventing the introduction of security issues in the software, resolving vulnerabilities in production environments quickly, and capturing the deployment bill of materials (DBOM).
Speakers:
Bob Boule
Robert Boule is a technology enthusiast with PASSION for technology and making things work along with a knack for helping others understand how things work. He comes with around 20 years of solution engineering experience in application security, software continuous delivery, and SaaS platforms. He is known for his dynamic presentations in CI/CD and application security integrated in software delivery lifecycle.
Gopinath Rebala
Gopinath Rebala is the CTO of OpsMx, where he has overall responsibility for the machine learning and data processing architectures for Secure Software Delivery. Gopi also has a strong connection with our customers, leading design and architecture for strategic implementations. Gopi is a frequent speaker and well-known leader in continuous delivery and integrating security into software delivery.
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionAggregage
Join Maher Hanafi, VP of Engineering at Betterworks, in this new session where he'll share a practical framework to transform Gen AI prototypes into impactful products! He'll delve into the complexities of data collection and management, model selection and optimization, and ensuring security, scalability, and responsible use.
Sudheer Mechineni, Head of Application Frameworks, Standard Chartered Bank
Discover how Standard Chartered Bank harnessed the power of Neo4j to transform complex data access challenges into a dynamic, scalable graph database solution. This keynote will cover their journey from initial adoption to deploying a fully automated, enterprise-grade causal cluster, highlighting key strategies for modelling organisational changes and ensuring robust disaster recovery. Learn how these innovations have not only enhanced Standard Chartered Bank’s data infrastructure but also positioned them as pioneers in the banking sector’s adoption of graph technology.
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf91mobiles
91mobiles recently conducted a Smart TV Buyer Insights Survey in which we asked over 3,000 respondents about the TV they own, aspects they look at on a new TV, and their TV buying preferences.
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Community web sites: small pieces loosely joined
1. ViBRANT
Virtual Biodiversity
Community web sites: small pieces loosely joined
Dave Roberts, David King, Simon Rycroft, David Morse,
Lyubomir Penev, Donat Agosti & Vince Smith
SEVENTH FRAMEWORK
PROGRAMME
-infrastructure
3. ViBRANT
Virtual Biodiversity
Small pieces loosely joined
Has many potential meanings:
Joining contributors together to form
communities
Joining the data together that go towards
forming a Scratchpad
Joining Scratchpad content with the landscape of
biodiversity informatics data on the web
SEVENTH FRAMEWORK
PROGRAMME -infrastructure
4. ViBRANT
Virtual Biodiversity
Addressing the challenges of taxonomy
Goal ...
Inventory the Earth’s species
Document their relationships
“Publish” & apply these data
Data set ...
1.8 M described spp. (10M names)
300M pages (over last 250 years)
1.5-3B specimens
People ...
4-6,000 taxonomists
30-40,000 “pro-amateurs”
Many more citizen scientists?
SEVENTH FRAMEWORK
PROGRAMME -infrastructure
5. ViBRANT
Virtual Biodiversity
I The technology must largely embody the cause–effect
relationship connecting problem to solution.
II The effects of the technological fix must be assessable using
relatively unambiguous or uncontroversial criteria.
III Research and development is most likely to contribute
decisively to solving a social problem when it focuses on
improving a standardized technical core that already exists.
Sarewitz and Nelson (2008) Three rules for technological fixes. Nature, 456: 871-872
SEVENTH FRAMEWORK
PROGRAMME -infrastructure
6. ViBRANT
Virtual Biodiversity
Biodiversity - a kind of washing powder?
When 2010 was named as the "year of biodiversity" by
the UN, it began with a plea to save the world's
ecosystems.
UN Secretary-General Ban Ki-moon said: "Biological
diversity underpins ecosystem functioning... its
continued loss, therefore, has major implications for Recently, members of the public
current and future human well-being." were asked what biodiversity is.
The most common answer was
"some kind of washing powder".
http://www.bbc.co.uk/news/science-environment-11546289 15 October 2010
SEVENTH FRAMEWORK
PROGRAMME -infrastructure
7. ViBRANT
Virtual Biodiversity
Addressing the challenges of biodiversity informatics
“…the field [of biodiversity informatics] appears to be growing in
a void of overarching, motivating questions, effectively making it
a set of technologies in search of questions to address.”
Peterson et al, Syst. & Biodiv. 2010
SEVENTH FRAMEWORK
PROGRAMME -infrastructure
8. ViBRANT
Virtual Biodiversity
Scratchpads Hosted websites for taxonomists
http://scratchpads.eu Research & publication platform
Modular (Drupal) & flexible
Supports the taxonomic workflow
Bottom-up design, agile dev.
Ecosystem of communities (185)
2,350+ users (unpaid) from 2007
ViBRANT follow on, €4.75M
SEVENTH FRAMEWORK
PROGRAMME -infrastructure
9. ViBRANT
Virtual Biodiversity
Taxonomy & Literature DNA, Phylogeny & Specimens
2.3k users, 58 countries,
268k pages
185 "Virtual Research
Communities"
EDIT, GBIF, NHM, & EOL
Platform for biodiversity
research & data publication
eBooks eJournals
Changing the nature of
collaboration
Expanding opportunities to
participate in science
Image Galleries Societies & Organizations
SEVENTH FRAMEWORK
PROGRAMME -infrastructure
10. ViBRANT
Virtual Biodiversity
A website for you & your community
Magic
Your data Your web site
SEVENTH FRAMEWORK
PROGRAMME -infrastructure
18. ViBRANT
Virtual Biodiversity
Static web
pages
Web fora
with e-mail
integration
Newsletters with
User blogs e-mail integration
SEVENTH FRAMEWORK
PROGRAMME -infrastructure
19. ViBRANT
Virtual Biodiversity
Import from CSV text file to any
content type
SEVENTH FRAMEWORK
PROGRAMME -infrastructure
20. ViBRANT
Virtual Biodiversity
ViBRANT Products
A Virtual Research Environment (Scratchpads) where users can
safely store, share and manage their research information.
Analytical services for users to build identification keys and
phylogenetic trees.
A publication platform for users to automatically compile taxonomic
manuscripts from their research database.
A portal for users to centrally access publicly accessible biodiversity
research information and literature.
Training, support & sociological study, helping research communities
to use these tools and services.
A standards compliant technical architecture that can be sustained by
biodiversity research community.
SEVENTH FRAMEWORK
PROGRAMME -infrastructure
21. ViBRANT
Virtual Biodiversity
Training Biodiversity
& outreach data
programme standards
Networking
User Controlled
feedback vocabulary
The “chromosome” systems platform
WP3. Training
User
sociology
Data
aggregation
WP4. Standards
study portal
WP8. Mobilisation
Field GBIF
recording integration
support activities
Citizen
Biodiversity
science
visualisation
programme
layers
Scratchpads
Virtual Research
Service Research
Environment Distributed
Phylogenetic
Scratchpad
analysis
hosting
Bioclimatic Software
WP5. Data modelling
& metrics
module
integration
WP2. Architecture
WP6. Publishing WP7. Literature
Identification Sustainability
tools plan
Communal
Matrix data
biodiversity
editor
literature
Biodiversity Biodiversity
data literature
publishing markup
Scholarly
Biodiversity
manuscript
datamining
publishing
SEVENTH FRAMEWORK
PROGRAMME -infrastructure
22. ViBRANT
Virtual Biodiversity
Biodiversity literature
looks like this
Cues
Indented text
UPPER CASE TEXT
Bold text
Italic text
Latin
Keywords
Symbols
SEVENTH FRAMEWORK
PROGRAMME -infrastructure
23. ViBRANT
Virtual Biodiversity
Adobe Reader has this
M
BRITISH MUSEUM
(NATURAL HiSi
26JU
PRESENTED
GENERAL UC.-lARY
Bulletin ofthe
BritishMuseum (Natural History)
The ichneumon-fly genus Banchus
in the OldWorld
(Hymenoptera)
M. G. Fitton
series
Entomology
Vol51 Nol 25 July 1985
SEVENTH FRAMEWORK
PROGRAMME -infrastructure
24. ViBRANT
Virtual Biodiversity
Lura (BHL) has this
M
BRITISH MUSEUM
(NATURAL HiSi
26 JU
PRESENTED
GENERAL UC.-lARY
Bulletin of the
British Museum (Natural History)
The ichneumon-fly genus Banchus
(Hymenoptera) in the Old World
M. G. Fitton
Entomology series
Vol51 Nol 25 July 1985
SEVENTH FRAMEWORK
PROGRAMME -infrastructure
25. ViBRANT
Virtual Biodiversity
But choice of XML schema is important
ABBYY XML is very detailed
This line of text has 202 bytes:
The Bulletin of the British Museum (Natural History), instituted in
1949, is issued in fourscientific series, Botany, Entomology,
Geology (incorporating Mineralogy) and Zoology,and an
Historical series.
To encode in ABBYY XML format this line requires 45,533 bytes.
There are 84,263 lines in the document from which this example
was taken.
SEVENTH FRAMEWORK
PROGRAMME -infrastructure
26. ViBRANT
Virtual Biodiversity
Look for taxon names
Used uBio FindIT web service
Overall excellent
Especially as add Namebank ID
But still some oddities
Genus = ‘The’
The scutellum
The primitive
Species or Author = ‘and’
Exetastes and
B[anchus] falcatorius and
SEVENTH FRAMEWORK
PROGRAMME -infrastructure
27. ViBRANT
Virtual Biodiversity
Look for paragraph types
Simple keyword matching
Surprisingly effective!
Issue – can identify start, but not end…
Follow up work
Punctuation
Concepts
SEVENTH FRAMEWORK
PROGRAMME -infrastructure
28. ViBRANT
Virtual Biodiversity
Look for other proper names
Biologia Centrali-Americana has a gazetteer
Most journals do not
Generic solution = OpenCalais
Good accuracy
Old countries
D.D.R.
West Germany
Continents
America
SEVENTH FRAMEWORK
PROGRAMME -infrastructure
29. ViBRANT
Virtual Biodiversity
Ambiguities and Mis-identifications
New York Other Oddities
City Persons
State Surname only
Washington Two part names
City Van Veen
State van Veen
Lake George Regions and Continents
City East Africa
Lake Victoria Africa
City
SEVENTH FRAMEWORK
PROGRAMME -infrastructure
30. ViBRANT
Virtual Biodiversity
Negative spell checking
Go beyond stop words
Remove everything not in a spell dictionary
Check:
Minor
Vulgar
Bulletin 27 from the Zoology Series reduced
From 139,034
to 5,219 words
SEVENTH FRAMEWORK
PROGRAMME -infrastructure
31. ViBRANT
Virtual Biodiversity
Ligatures
INTRODUCTION.
Volume, one of five required for the enumeration of the Rhynchophora, was
THIS
commenced by Dr. Sharp in 1889 and is now concluded by myself. The study of the
" Otiorhynchinœ Alatse " has unfortunately been delayed for many years, during the
publication of Vol. IV. parts 4, 5, and 7, all of which are devoted to the Family
Curculionidœ. The present Volume, IV. part 3, includes the Subfamilies Attelabinae,
Pterocolinœ, Allocoryninee, Apioninœ, Thecesterninae, and Otiorhynchinre. The
Attelabinae are represented by 104 (88 new), the Pterocolinse by three (all new), the
Allocoryninse (a new subfamily) and Thecesterninse each by one, the Apioninae by
88 (84 new), and the Otiorhynchinae by 419 (340 new) species respectively; the total
number for the six subfamilies being 616 species, with 516 new, and forty new
genera. Amongst the 419 Otiorhynchinae, the apterous and winged forms are almost
equal in number, there being a preponderance of apterous terrestrial species
(Eupagoderes, Epicœrus, Epayriopsis, &c.) in the arid portions of Mexico and the
winged forms ÇExophthalmuS) &c.) becoming relatively more numerous in the forest
regions southward. Taking the Curculionidœ as a whole—the subfamilies
Curculioninae and Calandrinse, in addition to those worked out in the present
Volume,—the number of species enumerated altogether from Central America is as
follows :— Vol. IV. part 3, 616; IV. part 4, 1365; IV. part 5, 908; IV. part 7, 344 : total
3233. The three other families of Rhynchophora—the Brenthidae, Scolytidae, and
SEVENTH FRAMEWORK
PROGRAMME -infrastructure
32. ViBRANT
Virtual Biodiversity
Ligatures
Otiorhynchinæ => Otiorhynchinœ Thecesterninæ => Thecesterninse
Alatæ => Alatse Apioninæ => Apioninae
Curculionidæ => Curculionidœ Otiorhynchinæ => Otiorhynchinae
Attelabinæ => Attelabinae Otiorhynchinæ => Otiorhynchinae
Pterocolinæ => Pterocolinœ Curculionidæ => Curculionidœ
Allocoryninæ => Allocoryninee Curculioninæ => Curculioninae
Apioninæ => Apioninœ Calandrinæ => Calandrinse
Thecesterninæ => Thecesterninae Brenthidæ => Brenthidae
Otiorhynchinæ => Otiorhynchinre Scolytidæ => Scolytidae
Attelabinæ => Attelabinae Anthribidæ => Anthribidae
Pterocolinæ => Pterocolinse Hispidæ => Hispida
Allocoryninæ => Allocoryninse Cassididæ => Cassididae
For the 24 æ there are: 11 ae; 5 œ; 5 se; 1 ee; 1 re; 1 a?;
So not a single correct rendering of the ligature, æ.
By contrast, the only example of œ in the page, Epicœrus, was correctly rendered.
SEVENTH FRAMEWORK
PROGRAMME -infrastructure
34. ViBRANT
Virtual Biodiversity
Similar words?
denticulate => denticulata
Levenshtein distances of 1: 0,0,1
denticulate => reticulate
Levenshtein distances of 2: 3,2,0
denticulate => geniculate
Levenshtein distances of 2: 2,2,0
SEVENTH FRAMEWORK
PROGRAMME -infrastructure
35. ViBRANT
Virtual Biodiversity
What did we achieve?
Marked up 11 volumes, i.e. 4,504 pages
Have robust workflow, can mark up a Bulletin in about 10-15
minutes. Choke point is call to OpenCalais web service
No manual intervention or review required: workflow is
scalable
Recognising taxon names:
Well uBio gives us a goods start, and we have techniques to
cluster ALL mis-spellings and variants with a valid taxon; but
not perfect, eg BanchusFabricius ends up in more than one
cluster
SEVENTH FRAMEWORK
PROGRAMME -infrastructure
36. ViBRANT
Virtual Biodiversity
“making the Scratchpads better”
More reliable (e.g., distribute the servers)
More functional (e.g., phylogenetic & publication services)
Easier to use (better workflows)
Prettier (better graphical design - more intuitive)
More integrated (for data stored inside & outside the Scratchpad framework)
More sustainable (simple administration, distribute developers, development sandbox)
“making natural history better”
Easier to compile, manage and reuse your data
Easier to find and reuse other peoples data
Promoting your data inside & outside the taxonomic community
Getting people to work for you (crowdsourcing)
SEVENTH FRAMEWORK
PROGRAMME -infrastructure
37. ViBRANT
Virtual Biodiversity
Author
Manuscript Public Enhanced
preparation on HTML
a Scratchpad
PDF
Submit as XML
Enhanced XML Printed paper
Produce PDF
Send to Register with
reviewers ZooBank,
Publisher GBIF, EoL etc.
SEVENTH FRAMEWORK
PROGRAMME -infrastructure
38. ViBRANT
Virtual Biodiversity
Thank you for your
attention.
Any questions
SEVENTH FRAMEWORK
PROGRAMME -infrastructure