Assisted restructure of web content for paper-based presentation: a look at workflows and data representations

Assisted restructure of web content for
paper-based presentation: a look at work ﬂows
and data representations
Alice Heaton, Simon Rycroft, Ed Baker, Katherine Bouton, Ben Scott,
Koureas Dimitris, Laurence Livermore, Dave Roberts, Vince Smith

This presentation is about:
Creating paper based publications from web content;
Restructuring data;
Looking at what happens to data at implementation
level;
Understanding the implications for data storage and
processing.

Web interface

Import
(Excel ﬁle,
GBIF, etc.)
Content
author

Hyperlinks

Faceted
Search

Visitor

Scratchpads Visitor

Page

Hyperlink

Page

Hyperlink

Page

Location

Specimen

Taxon

Term

Taxonomic
Name

Content
Type

Location
Reference

Content Author

Content
Type

Paper publication

? ? ? ? ? ?

Location

Specimen

Taxon

Term

Taxonomic
Name

Content
Type

Node
Reference

Scratchpads Maintainer

Content
Type

Enter
Publication
Speciﬁc
Data
Abstract,
Acknowledgement,
etc.

Select
existing
data
Taxon description,
Holotype,
etc.

Automatic import
of related
content
Bibliographic
references,
images,
etc.

Publication

Select
Taxon
Description
Fields

Add
new
Treatment

Select
Holotype
Select
Taxon
Select
Paratype
Override
Taxon
name

Scratchpads
Maintainer

Automated
Process
(Scratchpads)

User
Interface

XSD, XML,
Jason, etc.

Scratchpads
Data
(Web/Default
representation)

Scratchpads
Data
(Publication
representation)

Online transmission

Scratchpads
Data
(External
service
representation)

REST, SOAP,
etc.

External
service

Entity

Entities

Field

Relational
Data

Field

Relational
Data

Relational
Data

Field

Relational
Data

Entities

Restructure through
User Interface

Entities

Mapping
Schema
Array

XSD Schema
Definition

Schema Array:
Internal structure to define how
to map entities and entity fields
to XML tags

Schema
Array

Final
Schema
Array

Final
schema
array

Restructed
entities

Process

Publication
XML

Horizontal ﬂows
Schema
array

Select
existing
data
Automatic import
of related
content
Enter
Publication
Speciﬁc
Data

Restructured
entities

Process

Publication
XML

Vertical ﬂows
Entity
Process

Entity

Final
schema
array

PHP
arrays

Schema
array

Entity

Fields

Relational
data

XML

XSD
Schema

Schema
array
mapping

Data constantly changes form;
This process happens both horizontally and
vertically;
Data ﬂows are more important than data
structures;
In that context model abstraction and modularity
is more important than data storage;
Restructure and export code is sensitive to
change: It need a robust architecture.

Thank you ! Any questions ?

ViBRANT

Virtual Biodiversity

-infrastructure

SEVENTH FRAMEWORK
PROGRAMME

http://scratchpads.eu

Funded by Vibrant RI-261532.

Assisted restructure of web content for paper-based presentation: a look at workflows and data representations

More Related Content

What's hot

Viewers also liked

More from Vince Smith

Recently uploaded

Assisted restructure of web content for paper-based presentation: a look at workflows and data representations