Catalog magic:
Behind the Scenes of Creating
a World Catalog of the
Therevidae
Gail E. Kampmeier
Illinois Natural History Survey, Prairie Research Institute
University of Illinois at Urbana-Champaign
gkamp@illinois.edu
Irina Brake
National Museum of Natural History, London
Kristin Algmin
University of Illinois at Urbana-Champaign
Why is it so Difficult to get from
Here… to… Here?
Therevidae
What Would Taxonomists Rather
Be Doing?
What do Taxonomists Wish
Would Happen?
1995 Freshmen of NSF PEET*
• Towards a World
Monograph of
the Therevidae
(Insecta: Diptera)
– 1995 – 2006
• Therevidae is a
medium-sized
family with (now)
– 4 subfamilies
– ~130 genera
– ~1150 species
*National Science Foundation's Partnerships for Enhancing Expertise in Taxonomy
Products
• Trained
– 9 dipterists, 7 through Ph.D.
– Scientific illustrator
– Dozens of students in
databasing
• Publications
– 71 publications during grant
– 20 more since & counting
• Digitization
– Mandala database 1995-
– Website
– Collaborations with
DiscoverLife.org & GBIF
…and the world is unlikely to run
out of flies to study!
Process: Specimens
• Collect, sort, curate,
label, sex, determine, &
database specimen
information
– Assign unique identifiers
where none exist
• Visit & borrow material
from museums
• Examine types
Is All that Work Worthwhile?
• "A taxonomic paper often
plants the very seeds of its own
obsolescence." (Johnson 2011)
• There is no getting around the
work required to produce a
catalog or any taxonomic
treatment.
• What we can do is make sure
that the information is
accessible and reusable.
• Is it time to ditch traditional
catalogs?
Henicomyia by J. Marie Metz
What Choices Do You Have?
• Last year's symposium on Arthropod Collections
databases explored some of your options, but
not all are suitable.
– Online collections database platforms (not suitable
for creating taxonomic catalogues that span
collections not included in the database)
• Arctos
• Specify 6
– Online taxonomic database platforms – optimize
creation of species pages
• Species File – taxonomic authority files
• Scratchpads – community-oriented contributions
• 3I – online revisions of taxa
• Encyclopedia of Life – Expert LifeDesks
What Choices Do You Have?
• Last year's symposium on Arthropod Collections
databases explored some of your options, but not all
are suitable.
– Online platforms designed to parse or take parsed data &
repurpose it (incl. online taxonomic database platforms
above)
• GBIF's Integrated Publishing Toolkit (IPT) – not thought of as a
workbench-level tool
• LUCID – especially good for keys & descriptive data
• Biodiversity Data Journal – will take in parsed data from
Scratchpads and IPT & eventually databases (mechanism
unclear)
– Desktop or server-based platforms – usually in Filemaker
or 4D or MSAccess
• Mandala – http://www.inhs.illinois.edu/research/mandala/
• Biota - http://viceroy.eeb.uconn.edu/biota/
• Mantis - http://insects.oeb.harvard.edu/etypes/Downloads.htm
The Process: Decide on a Format
• We decided to publish as a traditional Myia catalog
• Expectations about what is in a "traditional catalog" or taxonomic
treatment & how it should be formatted
– Print styles (italics, bold, centered, hanging indents)
– Accented characters (for literature references, authority names, and localities)
– Special characters (for ♂ and ♀ signs)
– Notes kept with the taxon entry or as an appendix?
• Use Mandala to achieve retrievability & formatting of output
General Workflow:
TherevidMandala Database
• Input raw data: The Bulk of the Work is HERE!
• Link data in related tables
• Create fields for catalog output for
– Taxa & their history
– Literature (including disambiguation of
similar citations)
– List of countries (& selected
states/provinces) by biogeographic region for
valid taxa
• Create & number notes for listing in
appendix
• Create a script that finds data to be exported
• Create scripts to format data including styles
(bold, italics, codes for paragraph formatting)
• Export TaxonID & catalog output field only to
Filemaker Pro to isolate output & preserve
formatting including accented characters
[Workflow diagram: Mandala production db → Catalog Output to new FMP db → Acrobat → MSWord → Catalog]
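The formatting step above can be pictured as building one catalog output string per taxon from parsed fields, with plain-text codes standing in for print styles. The following is a minimal Python sketch, not the FileMaker scripts actually used in Mandala; the field names, the `<i>` style code, the `[[P:taxon]]` paragraph code, and the sample records are all assumptions made up for illustration.

```python
from dataclasses import dataclass


@dataclass
class TaxonRecord:
    taxon_id: int   # reference key only; never printed in the catalog
    genus: str
    species: str
    author: str
    year: int


def catalog_output(rec: TaxonRecord) -> str:
    """Build the catalog text for one taxon with inline style codes."""
    # Hypothetical codes: [[P:taxon]] marks the paragraph style, <i> marks italics.
    return f"[[P:taxon]]<i>{rec.genus} {rec.species}</i> {rec.author}, {rec.year}"


def export_rows(records):
    """Return (TaxonID, catalog output) pairs, the only two fields exported."""
    return [(r.taxon_id, catalog_output(r)) for r in records]


# Made-up records; the accented author name checks that such characters survive.
records = [
    TaxonRecord(1, "Exemplum", "primum", "Author", 1900),
    TaxonRecord(2, "Exemplum", "alterum", "Écrivain", 1901),
]
rows = export_rows(records)
for taxon_id, text in rows:
    print(taxon_id, text)
```

Keeping the codes as plain text means the styles can be applied later by search & replace in whatever program does the final layout.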
Things Can Get Messy
• Some operations require expert
eyes to determine fitness-for-use
• A database can find, sort, &
summarize, but ultimately does
not "see" anomalies unless
specifically programmed to do so
• Automation (scripting, creation
of calculated fields) requires
time, refinement, & expertise
• Parsed data are key to flexibility
Create Taxonomic Hierarchy
• Use to automate searches & sort catalog output by classification hierarchy, rank, & alphabetically
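One way to sort by hierarchy, rank, and then alphabetically is a depth-first walk over parent-child links. The sketch below is in Python and only illustrates the idea; the `Taxon` fields, the `RANK_ORDER` table, and every name except Therevidae are hypothetical, and Mandala itself does this with FileMaker fields and scripts.

```python
from dataclasses import dataclass
from typing import Optional

# Assumed rank order, highest rank first.
RANK_ORDER = {"family": 0, "subfamily": 1, "genus": 2, "species": 3}


@dataclass
class Taxon:
    taxon_id: int
    name: str
    rank: str
    parent_id: Optional[int]  # None for the root of the hierarchy


def hierarchy_sort(taxa):
    """Return taxa in catalog order: depth-first through the parent-child
    tree, with siblings ordered by rank and then alphabetically."""
    children = {}
    for t in taxa:
        children.setdefault(t.parent_id, []).append(t)

    ordered = []

    def visit(parent_id):
        siblings = sorted(children.get(parent_id, []),
                          key=lambda t: (RANK_ORDER[t.rank], t.name))
        for t in siblings:
            ordered.append(t)
            visit(t.taxon_id)  # list each taxon's children right after it

    visit(None)
    return ordered


# Made-up subfamilies and genera under the real family name.
taxa = [
    Taxon(1, "Therevidae", "family", None),
    Taxon(2, "Betinae", "subfamily", 1),
    Taxon(3, "Alphinae", "subfamily", 1),
    Taxon(4, "Zetus", "genus", 3),
    Taxon(5, "Alphus", "genus", 3),
    Taxon(6, "Betus", "genus", 2),
]
names = [t.name for t in hierarchy_sort(taxa)]
print(names)
```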
Use Reason for Status to Dictate
Formatting
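Letting a status field dictate formatting amounts to a lookup from reason to style. A small Python sketch follows, assuming invented reason values and style codes (the real ones live in the database); an unknown reason raises an error so that anomalies surface instead of being printed silently.

```python
# Hypothetical reasons for status and the style codes each one triggers.
STYLE_BY_REASON = {
    "valid": ("<b>", "</b>"),
    "junior synonym": ("<i>", "</i>"),
    "nomen nudum": ("", ""),
}


def style_name(name: str, reason_for_status: str) -> str:
    """Wrap a taxon name in the style codes chosen by its reason for status."""
    if reason_for_status not in STYLE_BY_REASON:
        # Flag records that need expert eyes rather than guessing a style.
        raise ValueError(f"no formatting rule for status {reason_for_status!r}")
    open_code, close_code = STYLE_BY_REASON[reason_for_status]
    return f"{open_code}{name}{close_code}"


print(style_name("Exemplum primum", "valid"))
print(style_name("Exemplum vetus", "junior synonym"))
```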
We Used the Specimen* Table to
Define our Distribution
*based on 105,889 specimens with valid names & parsed localities
Script to Find & Sort Specimens
• Once sorted, export a summary for each
taxon
• Summary can then be
formatted in MSWord
• Bring back into Filemaker
for final formatting
• Spot possible outliers
• Match TaxonID to import
formatted information
into production db
TaxonID x Biogeographic Region x
Country x State/Province
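The find, sort, and summarize steps above can be sketched as grouping parsed specimen records by TaxonID, biogeographic region, country, and state/province. This Python version is illustrative only: the record keys, the output punctuation, and the sample specimens are assumptions, not the Mandala script.

```python
from collections import defaultdict


def summarize_distribution(specimens):
    """Group parsed localities as {taxon_id: {region: {country: set of states}}}."""
    nested = defaultdict(lambda: defaultdict(lambda: defaultdict(set)))
    for s in specimens:
        # Looking up the country records it even when no state is given.
        states = nested[s["taxon_id"]][s["region"]][s["country"]]
        if s.get("state"):
            states.add(s["state"])
    return nested


def format_summary(regions):
    """Turn one taxon's nested summary into a single sorted text line."""
    parts = []
    for region in sorted(regions):
        countries = []
        for country in sorted(regions[region]):
            states = sorted(regions[region][country])
            countries.append(f"{country} ({', '.join(states)})" if states else country)
        parts.append(f"{region}: {', '.join(countries)}")
    return ". ".join(parts) + "."


# Made-up specimen records for a hypothetical TaxonID 101.
specimens = [
    {"taxon_id": 101, "region": "Nearctic", "country": "USA", "state": "Illinois"},
    {"taxon_id": 101, "region": "Nearctic", "country": "USA", "state": "Arizona"},
    {"taxon_id": 101, "region": "Nearctic", "country": "Canada", "state": None},
    {"taxon_id": 101, "region": "Australasian", "country": "Australia", "state": "Queensland"},
]
summary = format_summary(summarize_distribution(specimens)[101])
print(summary)
```

A summary line like this also makes outliers easy to spot: a single record from an unexpected region stands out when the whole distribution is read at a glance.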
Filling in the Cracks
• All taxa, literature, and specimens to be included in the
catalog were marked by an expert with a code for easier
retrieval
• Communication about scripts & field calculations was
done in Google Docs
• Literature with the same authors and years had to be
disambiguated with letters following the year.
– Used in both the literature cited and text of the catalog
• After first including the notes in the text flow, the authors
decided to number them and move them into an appendix.
– Finding & sorting of these could be automated
– Replace with series allowed numbering of notes
– Awkward (but necessary) to renumber notes when new ones
were found to be needed.
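Two of the steps above lend themselves to a short sketch: lettering citations that share authors and year, and numbering notes in order of first appearance. The Python below is a hypothetical illustration, not the FileMaker approach used for the catalog; the `{{note:KEY}}` placeholder syntax and the sample citations are invented, and the lettering assumes no more than 26 citations per author-year pair.

```python
import re
import string
from collections import defaultdict


def disambiguate(citations):
    """citations: (authors, year, title) tuples, already in lettering order.
    Returns labels such as 'Author 1990a' where authors and year collide."""
    groups = defaultdict(list)
    for idx, (authors, year, _title) in enumerate(citations):
        groups[(authors, year)].append(idx)
    labels = [None] * len(citations)
    for (authors, year), idxs in groups.items():
        for pos, idx in enumerate(idxs):
            suffix = string.ascii_lowercase[pos] if len(idxs) > 1 else ""
            labels[idx] = f"{authors} {year}{suffix}"
    return labels


def number_notes(text):
    """Replace each {{note:KEY}} placeholder with a number assigned by
    first appearance; returns the new text and the key-to-number map."""
    numbers = {}

    def repl(match):
        key = match.group(1)
        numbers.setdefault(key, len(numbers) + 1)
        return f"[Note {numbers[key]}]"

    return re.sub(r"\{\{note:([^}]+)\}\}", repl, text), numbers


labels = disambiguate([
    ("Author & Coauthor", 1990, "First paper"),
    ("Author & Coauthor", 1990, "Second paper"),
    ("Author", 1991, "Another paper"),
])
numbered, note_map = number_notes("A {{note:x}} B {{note:y}} C {{note:x}}")
print(labels)
print(numbered)
```

With keyed placeholders, adding a new note only means rerunning the numbering, which avoids renumbering by hand.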
General Workflow
• TaxonID is for reference only
• Resize catalog output field
(in layout mode) so all contents
will always be seen (page size)
& make sure to size the field to
fit the contents
• Open in Preview to check
• Save as PDF
[Workflow diagram: Mandala production db → Catalog Output to new FMP db]
General Workflow
• This step mainly
preserves catalog text styles &
accented characters out of FMP
• Save As MS Word document
after verifying expected results.
• Saving as Word will collapse
the formatting into giant
paragraphs
[Workflow diagram: Mandala production db → Catalog Output to new FMP db → Acrobat]
General Workflow
[Workflow diagram: Mandala production db → Catalog Output to new FMP db → Acrobat → MSWord → Catalog]
• Create styles in MSWord for
formatting text & paragraphs
• Search & replace special
characters (%%, $$, zzz, ||, //);
♂ and ♀ signs
• Clean up extra spaces,
paragraphs, & punctuation
• Using Google Docs is not (yet)
an option for a traditionally
published catalog as the
formatting tools aren't adequate
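The search & replace and clean-up steps above were done in MSWord; the same idea can be sketched in Python. The slides list the placeholder codes but not what each one stood for, so the meanings assigned in `REPLACEMENTS` below are assumptions for illustration only, as is the sample text.

```python
import re

# Assumed meanings for three of the listed codes (not confirmed by the slides).
REPLACEMENTS = {
    "%%": "\u2642",  # male sign
    "$$": "\u2640",  # female sign
    "||": "\n",      # paragraph break
}


def clean_text(text, replacements=REPLACEMENTS):
    """Swap placeholder codes for real characters, then tidy spacing."""
    for code, value in replacements.items():
        text = text.replace(code, value)
    text = re.sub(r"[ \t]{2,}", " ", text)     # collapse runs of spaces
    text = re.sub(r" +([,.;:])", r"\1", text)  # no space before punctuation
    text = re.sub(r"\n{2,}", "\n", text)       # drop empty paragraphs
    return text.strip()


raw = "Holotype %%  , paratype $$ .||||Distribution:  Nearctic ."
cleaned = clean_text(raw)
print(cleaned)
```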
Send Out to Experts
Consensus!
• When the experts are happy,
we're done, right?
• Still have to update the
database & web output online
– complements printed
catalog as it is dynamic
• Push corrections to public
portals of data (own website,
DiscoverLife, GBIF, etc.)
• So "magic" is a relative, kind
of wishful term—the future is
more likely in platforms such
as those being coordinated by
Pensoft.
References, Resources
• Miller, J. et al. 2012. From taxonomic literature to cybertaxonomic
content. BMC Biology 10:87. http://www.biomedcentral.com/content/pdf/1741-7007-10-87.pdf
• Johnson, N.F. 2011. A collaborative, integrated and electronic future for
taxonomy. Invertebrate Systematics 25: 471–475.
http://www.publish.csiro.au/?act=view_file&file_id=IS11052.pdf
• Biodiversity Data Journal (publication debut Dec. 2012)
http://www.pensoft.net/journals/bdj
• Symposium: Arthropod Collections Databases. 2011 ECN
meeting, Reno, NV http://www.ecnweb.org/past/2011
• Darwin Core Standard http://rs.tdwg.org/dwc/
• Kampmeier, G. E. and M. E. Irwin. 2009. Meeting the interrelated
challenges of tracking specimen, nomenclature, and literature data in
Mandala. Chapter 15 in T. Pape, D. Bickel, and R. Meier (eds.) Diptera
Diversity: Status, Challenges and Tools. Leiden: Brill Academic
Publishers, pp. 407-437.
http://www.inhs.illinois.edu/research/mandala/Ch15_Mandala_DiptDiv2009.pdf
More Refs & Resources
• Kennedy, J., R. Hyam, R. Kukla, T. Paterson. 2006.
Standard data model representation for taxonomic
information. OMICS: A Journal of Integrative Biology 10(2):
220-230. http://www.hyam.net/publications/omi.2006.10.220.pdf
• Penev, L., T. Georgiev, P. Stoev, D. Roberts, V. Smith.
2012. Making small data big! The Biodiversity Data
Journal (BDJ). TDWG 2012, Beijing, 22-26 October.
http://www.tdwg.org/fileadmin/2012conference/slides/Biodiversity_Data_Journal.pdf
• Catalogue of Life
http://www.catalogueoflife.org/colwebsite/sites/default/files/2012_CoL-Standard_Dataset_v6_3.pdf
Acknowledgements
• Michael E. Irwin
• F. Chris Thompson
• Neal Evenhuis
• Christine Lambkin
• Shaun Winterton
• Don Webb
• Mark Metz
• Martin Hauser
• Kevin Holston
• Steve Gaimari
• J. Marie Metz
• David Yeates
• Amanda Buck
• Brian Wiegmann
• Evert Schlinger
• John Pickering
• FMWebschool
• National Science
Foundation
• Schlinger Foundation
• Illinois Natural History
Survey
• University of Illinois
• Discover Life
• Biodiversity Information
Standards (TDWG)
NSF Projects:
Therevid PEET: DEB-95-21925; 99-77958
Fiji Arthropod Survey: DEB-0425790
FLYTREE: EF-0334948
Tabanid PEET: DEB 07-31528
©2012 University of Illinois Board of Trustees.
All rights reserved. For permission information,
contact the Illinois Natural History Survey.
References to commercial products are for informational purposes
only and do not imply endorsement.
Appendix
Additional slides, jettisoned for
time, for the curious
Why Use A Database?
• Flexibility
– Finely parsed data may be
pieced together for
publication, labels
– Scripting of often used
functions
• Reuse/repurposing of data
– Sharing with GBIF,
DiscoverLife.org, museums
• Centralization of work
environment
– Workers can be anywhere,
any time zone
– Backup can be automated
• Individual work environment
– Choice with platforms not
required to be online
(although trade-off)
Vision
• "Taxonomy should fully embrace
electronic media and informatics tools.
Particularly, this step requires the
development and widespread
implementation of community data
standards. The barriers to progress in
these areas are not technological, but are
primarily social. The community needs to
see clear evidence of the value added
through these changes in procedures and
insist upon their use as standard practice."
Johnson, N.F. 2011. A collaborative, integrated and electronic future for taxonomy.
Invertebrate Systematics 25: 471.
Any Database Can Record the
Basics, but…
• How the information is related is also key
– defining taxonomic ranks as parent-child relationship
– valid taxonomic entities related to their synonyms
– types and specimens determined for a taxon
– literature associated with a taxonomic name
– collecting localities and collecting events
• Readability – if a published work rather than raw database output
• Format
– Based on existing print models?
– Print styles (italics, bold, centered, hanging indents)
– Accented characters (for literature references, authority names, and
localities)
– Special characters (for ♂ and ♀ signs)
– Notes kept with the taxon entry or as an appendix?
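The relationships listed above (ranks as parent-child, valid names linked to synonyms, specimens and types determined for a taxon) can be sketched as a small relational schema. The example uses Python's built-in `sqlite3` module purely for illustration; the table and column names and the sample rows are assumptions and do not reproduce the Mandala data model.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE taxon (
    taxon_id   INTEGER PRIMARY KEY,
    name       TEXT NOT NULL,
    taxon_rank TEXT NOT NULL,
    parent_id  INTEGER REFERENCES taxon(taxon_id),  -- rank hierarchy
    valid_id   INTEGER REFERENCES taxon(taxon_id)   -- NULL when the name is itself valid
);
CREATE TABLE specimen (
    specimen_id INTEGER PRIMARY KEY,
    taxon_id    INTEGER NOT NULL REFERENCES taxon(taxon_id),
    is_type     INTEGER NOT NULL DEFAULT 0
);
""")

# Made-up names: one genus, one valid species, one synonym of that species.
conn.executemany("INSERT INTO taxon VALUES (?, ?, ?, ?, ?)", [
    (1, "Exemplum", "genus", None, None),
    (2, "Exemplum primum", "species", 1, None),
    (3, "Exemplum vetus", "species", 1, 2),
])
conn.executemany("INSERT INTO specimen VALUES (?, ?, ?)", [
    (10, 2, 1), (11, 2, 0), (12, 3, 1),
])


def synonyms_of(valid_taxon_id):
    """Names whose valid_id points at the given valid taxon."""
    rows = conn.execute(
        "SELECT name FROM taxon WHERE valid_id = ? ORDER BY name",
        (valid_taxon_id,))
    return [r[0] for r in rows]


def specimen_count(valid_taxon_id):
    """Count specimens filed under a valid name or any of its synonyms."""
    (n,) = conn.execute(
        """SELECT COUNT(*) FROM specimen s
           JOIN taxon t ON t.taxon_id = s.taxon_id
           WHERE t.taxon_id = ? OR t.valid_id = ?""",
        (valid_taxon_id, valid_taxon_id)).fetchone()
    return n


print(synonyms_of(2), specimen_count(2))
```

Because synonyms point at their valid name, a catalog entry can gather its synonymy and specimen records with one query each instead of by hand.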
Mandala Data Model
• Not all of this is
required for a
traditional
catalog, but
these tables
contain a
wealth of vital,
interrelated
data.
• Tables with
rounded edges
are authority
files
Use the Classification
Hierarchy to
Automate Searches
Reason for Status
Used for
Formatting

FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance
 
Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*
Frank van Harmelen
 
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
Paul Groth
 
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
Product School
 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
Laura Byrne
 
Leading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdfLeading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdf
OnBoard
 
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Tobias Schneck
 
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
Product School
 

Recently uploaded (20)

Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !
 
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...
 
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualitySoftware Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
 
Knowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and backKnowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and back
 
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 previewState of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
 
UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4
 
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdfFIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
 
PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
 
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
 
Generating a custom Ruby SDK for your web service or Rails API using Smithy
Generating a custom Ruby SDK for your web service or Rails API using SmithyGenerating a custom Ruby SDK for your web service or Rails API using Smithy
Generating a custom Ruby SDK for your web service or Rails API using Smithy
 
Key Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdfKey Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdf
 
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdfFIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
 
Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*
 
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
 
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
 
Leading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdfLeading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdf
 
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
 
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
 

Kampmeier ecn 2012

  • 1. Catalog magic: Behind the Scenes of Creating a World Catalog of the Therevidae Gail E. Kampmeier Illinois Natural History Survey, Prairie Research Institute University of Illinois at Urbana-Champaign gkamp@illinois.edu Irina Brake National Museum of Natural History, London Kristin Algmin University of Illinois at Urbana-Champaign
  • 2. Why is it so Difficult to get from Here… to… Here? Therevidae
  • 3. What Would Taxonomists Rather Be Doing?
  • 4. What do Taxonomists Wish Would Happen?
  • 5. 1995 Freshmen of NSF PEET* • Towards a World Monograph of the Therevidae (Insecta: Diptera) – 1995 – 2006 • Therevidae is medium-sized family with (now) – 4 subfamilies – ~130 genera – ~1150 species *National Science Foundation's Partnerships for Enhancing Expertise in Taxonomy
  • 6. Products • Trained – 9 dipterists, 7 through Ph.D. – Scientific illustrator – Dozens of students in databasing • Publications – 71 publications during grant – 20 more since & counting • Digitization – Mandala database 1995- – Website – Collaborations with DiscoverLife.org & GBIF …and the world is unlikely to run out of flies to study!
  • 7. Process: Specimens • Collect, sort, curate, label, sex, determine, & database specimen information – Assign unique identifiers where none exist • Visit & borrow material from museums • Examine types
  • 8. Is All that Work Worthwhile? • "A taxonomic paper often plants the very seeds of its own obsolescence." (Johnson 2011) • There is no getting around the work required to produce a catalog or any taxonomic treatment. • What we can do, is make sure that the information is accessible and reusable. • Is it time to ditch traditional catalogs? Henicomyia by J. Marie Metz
  • 9. What Choices Do You Have? • Last year's symposium on Arthropod Collections databases explored some of your options, but not all are suitable. – Online collections database platforms (not suitable for creating taxonomic catalogues that cross collections not included) • Arctos • Specify 6 – Online taxonomic database platforms – optimize creation of species pages • Species File – taxonomic authority files • Scratchpads – community-oriented contributions • 3I – online revisions of taxa • Encyclopedia of Life – Expert LifeDesks
  • 10. What Choices Do You Have? • Last year's symposium on Arthropod Collections databases explored some of your options, but not all are suitable. – Online platforms designed to parse or take parsed data & repurpose it (incl. online taxonomic database platforms above) • GBIF's Integrated Publishing Toolkit (IPT) – not thought of as a workbench-level tool • LUCID – especially good for keys & descriptive data • Biodiversity Informatics Journal - Will take in parsed data from Scratchpads and IPT & eventually databases (mechanism unclear) – Desktop or server-based platforms – usually in Filemaker or 4D or MSAccess • Mandala – http://www.inhs.illinois.edu/research/mandala/ • Biota - http://viceroy.eeb.uconn.edu/biota/ • Mantis - http://insects.oeb.harvard.edu/etypes/Downloads.htm
  • 11. The Process: Decide on a Format • Was decided to publish as traditional Myia catalog • Expectations about what is in a "traditional catalog" or taxonomic treatment & how it should be formatted – Print styles (italics, bold, centered, hanging indents) – Accented characters (for literature references, authority names, and localities) – Special characters (for ♂ and ♀ signs) – Notes kept with the taxon entry or as an appendix? • Use Mandala to achieve retrievability & formatting of output
  • 12. General Workflow: TherevidMandala Database • Input raw data: The Bulk of the Work is HERE! • Link data in related tables • Create fields for catalog output for – Taxa & their history – Literature (including disambiguation of similar citations) – List of countries (& selected states/provinces) by biogeographic region for valid taxa – Create & number notes for listing in appendix • Create a script that finds data to be exported • Create scripts to format data including styles (bold, italics, codes for paragraph formatting) • Export TaxonID & catalog output field only to Filemaker Pro to isolate output & preserve formatting including accented characters [Diagram: Mandala production db → Catalog Output to new FMP db → Acrobat → MSWord → Catalog]
  • 13. Things Can Get Messy • Some operations require expert eyes to determine fitness-for-use • A database can find, sort, & summarize, but ultimately does not "see" anomalies unless specifically programmed to do so • Automation (scripting, creation of calculated fields) requires time, refinement, & expertise • Parsed data are key to flexibility
  • 14. Create Taxonomic Hierarchy Use to automate searches & sort catalog output by classification hierarchy, rank, & alphabetically
  • 15. Use Reason for Status to Dictate Formatting
  • 16. We Used the Specimen* Table to Define our Distribution *based on 105,889 specimens with valid names & parsed localities
  • 17. Script to Find & Sort Specimens • Once sorted, export a summary for each taxon
  • 18. • Summary can then be formatted in MSWord • Bring back into Filemaker for final formatting • Spot possible outliers • Match TaxonID to import formatted information into production db TaxonID x Biogeographic Region x Country x State/Province
  • 19. Filling in the Cracks • All taxa, literature, and specimens to be included in the catalog were marked by an expert with a code for easier retrieval • Communication about scripts & field calculations was done in Google Docs • Literature with the same authors and years had to be disambiguated with letters following the year. – Used in both the literature cited and text of the catalog • After including the notes in the text flow, it was decided by the authors to number and put them into an appendix. – Finding & sorting of these could be automated – Replace with series allowed numbering of notes – Awkward (but necessary) to renumber notes when new ones were found to be needed.
  • 20. General Workflow • TaxonID is for reference only • Resize catalog output field (in layout mode) so all contents will always be seen (page size) & make sure to size the field to fit the contents • Open in Preview to check • Save as PDF [Diagram: Mandala production db → Catalog Output to new FMP db]
  • 21. General Workflow • This step mainly preserves catalog text styles & accented characters out of FMP • Save As MS Word document after verifying expected results. • Saving as Word will collapse the formatting into giant paragraphs [Diagram: Mandala production db → Catalog Output to new FMP db → Acrobat]
  • 22. General Workflow [Diagram: Mandala production db → Catalog Output to new FMP db → Acrobat → MSWord → Catalog] • Create styles in MSWord for formatting text & paragraphs • Search & replace special characters (%%, $$, zzz, ||, //); ♂ and ♀ signs • Clean up extra spaces, paragraphs, & punctuation • Using Google Docs is not (yet) an option for a traditionally published catalog as the formatting tools aren't adequate
  • 23. Send Out to Experts
  • 24. Consensus! • When the experts are happy, we're done, right? • Still have to update the database & web output online – complements printed catalog as it is dynamic • Push corrections to public portals of data (own website, DiscoverLife, GBIF, etc.) • So "magic" is a relative, kind of wishful term—the future is more likely in platforms such as those being coordinated by Pensoft.
  • 25. References, Resources • Miller, J. et al. 2012. From taxonomic literature to cybertaxonomic content. BMC Biology 10:87 http://www.biomedcentral.com/content/pdf/1741-7007-10-87.pdf • Johnson, N.F. 2011. A collaborative, integrated and electronic future for taxonomy. Invertebrate Systematics 25: 471–475. http://www.publish.csiro.au/?act=view_file&file_id=IS11052.pdf • Biodiversity Data Journal (publication debut Dec. 2012) http://www.pensoft.net/journals/bdj • Symposium: Arthropod Collections Databases. 2011 ECN meeting, Reno, NV http://www.ecnweb.org/past/2011 • Darwin Core Standard http://rs.tdwg.org/dwc/ • Kampmeier, G. E. and M. E. Irwin. 2009. Meeting the interrelated challenges of tracking specimen, nomenclature, and literature data in Mandala. Chapter 15 in T. Pape, D. Bickel, and R. Meier (eds.) Diptera Diversity: Status, Challenges and Tools. Leiden: Brill Academic Publishers, pp. 407-437. http://www.inhs.illinois.edu/research/mandala/Ch15_Mandala_DiptDiv2009.pdf
  • 26. More Refs & Resources • Kennedy, J., R. Hyam, R. Kukla, T. Paterson. 2006. Standard data model representation for taxonomic information. OMICS: A Journal of Integrative Biology 10(2): 220-230. http://www.hyam.net/publications/omi.2006.10.220.pdf • Penev, L., T. Georgiev, P. Stoev, D. Roberts, V. Smith. 2012. Making small data big! The Biodiversity Data Journal (BDJ). TDWG 2012, Beijing, 22-26 October. http://www.tdwg.org/fileadmin/2012conference/slides/Biodiversity_Data_Journal.pdf • Catalog of Life http://www.catalogueoflife.org/colwebsite/sites/default/files/2012_CoL-Standard_Dataset_v6_3.pdf
  • 27. Acknowledgements • Michael E. Irwin • F. Chris Thompson • Neal Evenhuis • Christine Lambkin • Shaun Winterton • Don Webb • Mark Metz • Martin Hauser • Kevin Holston • Steve Gaimari • J. Marie Metz • David Yeates • Amanda Buck • Brian Wiegmann • Evert Schlinger • John Pickering • FMWebschool • National Science Foundation • Schlinger Foundation • Illinois Natural History Survey • University of Illinois • Discover Life • Biodiversity Information Standards (TDWG) NSF Projects: Therevid PEET: DEB-95-21925; 99-77958 Fiji Arthropod Survey: DEB-0425790 FLYTREE: EF-0334948 Tabanid PEET: DEB 07-31528
  • 28. ©2012 University of Illinois Board of Trustees. All rights reserved. For permission information, contact the Illinois Natural History Survey. References to commercial products are for informational purposes only and do not imply endorsement.
  • 29. Appendix Additional information for the curious: slides jettisoned for time
  • 30. Why Use A Database? • Flexibility – Finely parsed data may be pieced together for publication, labels – Scripting of often used functions • Reuse/repurposing of data – Sharing with GBIF, DiscoverLife.org, museums • Centralization of work environment – Workers can be anywhere, any time zone – Backup can be automated • Individual work environment – Choice with platforms not required to be online (although trade-off)
  • 31. Vision • "Taxonomy should fully embrace electronic media and informatics tools. Particularly, this step requires the development and widespread implementation of community data standards. The barriers to progress in these areas are not technological, but are primarily social. The community needs to see clear evidence of the value added through these changes in procedures and insist upon their use as standard practice." Johnson, N.F. 2011. A collaborative, integrated and electronic future for taxonomy. Invertebrate Systematics 25: 471.
  • 32. Any Database Can Record the Basics, but… • How the information is related is also key – defining taxonomic ranks as parent-child relationship – valid taxonomic entities related to their synonyms – types and specimens determined for a taxon – literature associated with a taxonomic name – collecting localities and collecting events • Readability – if a published work rather than raw database output • Format – Based on existing print models? – Print styles (italics, bold, centered, hanging indents) – Accented characters (for literature references, authority names, and localities) – Special characters (for ♂ and ♀ signs) – Notes kept with the taxon entry or as an appendix?
  • 33. Mandala Data Model • Not all of this is required for a traditional catalog, but these tables contain a wealth of vital, interrelated data. • Tables with rounded edges are authority files
  • 34. Use the Classification Hierarchy to Automate Searches
  • 35. Reason for Status Used for Formatting
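
Slide 19 notes that literature sharing the same authors and year had to be disambiguated with letters following the year. The original work was done with FileMaker scripts and calculated fields that are not shown in the deck, so the following is only a minimal Python sketch of the idea. The record layout (ID, authors, year, title) and the alphabetical-by-title tie-break are assumptions made for illustration, not the authors' method.

```python
from collections import defaultdict
from string import ascii_lowercase


def disambiguate(citations):
    """Map each reference ID to an 'Authors year[letter]' label.

    `citations` is an iterable of (ref_id, authors, year, title) tuples.
    This tuple layout is hypothetical; it is not the Mandala schema.
    A letter is appended only when two or more references share the
    same authors and year.
    """
    groups = defaultdict(list)
    for ref_id, authors, year, title in citations:
        groups[(authors, year)].append((title, ref_id))

    labels = {}
    for (authors, year), members in groups.items():
        # Assumed tie-break: order colliding references by title.
        members.sort()
        if len(members) > len(ascii_lowercase):
            raise ValueError("more than 26 references share authors and year")
        for index, (_, ref_id) in enumerate(members):
            suffix = ascii_lowercase[index] if len(members) > 1 else ""
            labels[ref_id] = f"{authors} {year}{suffix}"
    return labels


# Placeholder records, not real citations.
example = [
    (1, "Doe & Roe", 2001, "Second paper"),
    (2, "Doe & Roe", 2001, "First paper"),
    (3, "Doe", 1999, "Only paper"),
]
labels = disambiguate(example)
```

Because the same label is looked up by reference ID, it can be reused in both the literature cited and the running text of the catalog, which is the consistency the slide asks for.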

Editor's Notes

  1. First a little background…
  2. We were fortunate to have two rounds of funding for this project on a medium-sized family of flies. We trained dipterists that are contributing their expertise even today, and continuing to work on the family Therevidae as well as other Diptera.
  3. For better or worse, not yet.
  4. The main part of the work, which has consumed many person hours to enter and verify, is in the Mandala database devoted to the Therevidae.
  5. Find all specimens with valid names and a localityID
  6. You cannot create style sheets in Acrobat
  7. Photo of Kevin,
  8. A spreadsheet is not flexible; neither is a field notebook or index cards
  9. But not just the community; the individual also needs to see and embrace this for him- or herself
  10. All this goes on in the background, once you have indicated which taxa you want to delimit.
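
Slides 16 to 18 and editor's note 5 describe finding all specimens with valid names and a locality ID, sorting them, and exporting a per-taxon summary by biogeographic region, country, and state/province. A rough Python sketch of that find-sort-summarize step follows. The dictionary keys (`taxon_id`, `valid_name`, `locality_id`, `region`, `country`, `state`) and the output punctuation are hypothetical stand-ins, not Mandala field names or the catalog's actual format.

```python
from collections import defaultdict


def summarize_distribution(specimens):
    """Build one distribution string per taxon.

    `specimens` is an iterable of dicts with the hypothetical keys
    taxon_id, valid_name, locality_id, region, country, and state.
    Records without a valid name or without a locality ID are skipped,
    mirroring the "find" step described in the notes.
    """
    # taxon_id -> region -> country -> set of states/provinces
    tree = defaultdict(lambda: defaultdict(lambda: defaultdict(set)))
    for record in specimens:
        if not record.get("valid_name") or record.get("locality_id") is None:
            continue
        states = tree[record["taxon_id"]][record["region"]][record["country"]]
        if record.get("state"):
            states.add(record["state"])

    summary = {}
    for taxon_id, regions in tree.items():
        region_parts = []
        for region in sorted(regions):
            country_parts = []
            for country in sorted(regions[region]):
                states = sorted(regions[region][country])
                if states:
                    country_parts.append(f"{country} ({', '.join(states)})")
                else:
                    country_parts.append(country)
            region_parts.append(f"{region}: {', '.join(country_parts)}")
        summary[taxon_id] = "; ".join(region_parts)
    return summary


# Illustrative records only; they are not drawn from the Therevidae data.
example = [
    {"taxon_id": 1, "valid_name": True, "locality_id": 10,
     "region": "Nearctic", "country": "USA", "state": "Illinois"},
    {"taxon_id": 1, "valid_name": True, "locality_id": 11,
     "region": "Nearctic", "country": "USA", "state": "Arizona"},
    {"taxon_id": 1, "valid_name": True, "locality_id": 12,
     "region": "Nearctic", "country": "Canada", "state": None},
    {"taxon_id": 1, "valid_name": True, "locality_id": None,
     "region": "Neotropical", "country": "Chile", "state": None},
    {"taxon_id": 2, "valid_name": False, "locality_id": 13,
     "region": "Nearctic", "country": "USA", "state": "Texas"},
]
summary = summarize_distribution(example)
```

A summary like this only reflects what the parsed specimen records contain, so, as slide 18 says, an expert still needs to scan the result for possible outliers before it is matched back to the TaxonID.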