Presentation given by Steffen Hankiewicz of intranda on the developments in Goobi over the last two years, followed by an overview of intranda's current and planned future development work on Goobi.
40. 2. How is Goobi used across Europe?
Using the same software - but doing different stuff
30.05.2014, Steffen Hankiewicz, intranda GmbH
41. 2. How is Goobi used across Europe?
Common workflows
‣ Typical steps
‣ Legacy import
Very special cases
‣ Material
‣ Workflow
‣ Sources
‣ Targets
51. 2.10. Wellcome Library: FTP and IA
Content Analyser daemon: analyses whether the upload is finished, valid and free of viruses. If all conditions are fulfilled, the clean and valid content is moved into a secure area on internal storage.
Goobi: FTP upload
Version: 1.1
Author: Steffen Hankiewicz
Copyright: intranda GmbH, 2014
The FTP upload consists of two automatic steps:
Step 1: Content Analyser
The Content Analyser analyses the uploaded content and makes sure that its upload has already finished. The following scenarios may occur:
1a) The upload of a folder never finishes: After a given time (e.g. one month) the Content Analyser should create an error message and move the folder to a separate storage area 'FTP suspicious upload', where problematic folders can be checked by a person who clarifies the status with the customer.
1b) The content is uploaded but a virus is found: The content is moved into the 'FTP suspicious upload' folder. A person gets involved to clarify with the customer what to do next.
1c) The content folder is successfully uploaded and clean: The content is moved into the secure internal 'FTP Migration Area'.
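The three scenarios of Step 1 can be read as a small decision function. The following is only a minimal sketch under stated assumptions, not intranda's implementation: the names `UploadFolder`, `Outcome` and `analyse` are hypothetical, the virus scan is assumed to be done by an external tool whose result is passed in, and "upload finished" is approximated by a quiet period without file changes (the specification does not say how completion is detected).

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from enum import Enum


class Outcome(Enum):
    WAIT = "wait"                                # upload may still be running
    SUSPICIOUS_UPLOAD = "FTP suspicious upload"  # scenarios 1a and 1b
    MIGRATION_AREA = "FTP Migration Area"        # scenario 1c


@dataclass
class UploadFolder:
    name: str
    first_seen: datetime     # when the folder appeared on the FTP server
    last_modified: datetime  # newest modification time of any file in it
    virus_found: bool        # result of an external virus scan (not shown)


def analyse(folder, now,
            quiet_period=timedelta(hours=1),  # assumption: no changes for 1 h means finished
            timeout=timedelta(days=30)):      # "e.g. one month" in the specification
    """Return (Outcome, message) for one uploaded folder."""
    finished = now - folder.last_modified >= quiet_period
    if not finished:
        if now - folder.first_seen >= timeout:
            # 1a) the upload never finishes
            return Outcome.SUSPICIOUS_UPLOAD, "upload did not finish in time"
        return Outcome.WAIT, "upload still in progress"
    if folder.virus_found:
        # 1b) uploaded, but a virus was found
        return Outcome.SUSPICIOUS_UPLOAD, "virus found"
    # 1c) successfully uploaded and clean
    return Outcome.MIGRATION_AREA, "upload finished and clean"
```

A daemon would call `analyse` periodically for every folder in the upload area and move the folder according to the returned outcome; the message can be stored as the error description for the person who follows up with the customer.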
Step 2: Content Migrator
The Content Migrator checks whether new folders that are to be imported into Goobi are available in the 'FTP Migration Area'. During the analysis of this content the following scenarios may occur:
2a) If no Goobi process exists to which this content can be assigned, the content stays in the 'FTP Migration Area' for a given time (e.g. one month).
2b) In case of invalid content, errors, or if no matching Goobi process can be found within a given time range (e.g. one month after uploading), the content is moved into a separate 'FTP suspicious content' folder to be checked manually by the Wellcome staff; the content folder gets a readable error description to help the user understand what kind of error has occurred.
2c) If the content is valid and an existing Goobi process for it can be found, the content is moved into the matching Goobi process folder. Afterwards the workflow step 'Receive content from FTP server' is closed by the Content Migrator to trigger the next Goobi task for this process (e.g. SDB ingest).
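The Step 2 scenarios can be sketched in the same way. This is an illustration under assumptions, not the actual Content Migrator: the function `migrate` and its action names are hypothetical, content validation is assumed to happen elsewhere, and a folder is assumed to match a Goobi process when its name equals the process title (the specification does not state the matching rule). No Goobi API is called; the function only decides what should happen.

```python
from datetime import datetime, timedelta


def migrate(folder_name, uploaded_at, is_valid, process_titles, now,
            max_wait=timedelta(days=30)):  # "e.g. one month" in the specification
    """Return (action, message) for one folder in the 'FTP Migration Area'."""
    if not is_valid:
        # 2b) invalid content or errors
        return "move_to_suspicious_content", "content is invalid"
    if folder_name in process_titles:
        # 2c) valid content and a matching Goobi process exists:
        # move it, then close 'Receive content from FTP server'
        return "move_to_process_and_close_step", "matched process " + folder_name
    if now - uploaded_at >= max_wait:
        # 2b) no matching process found within the time range
        return "move_to_suspicious_content", "no matching Goobi process found in time"
    # 2a) no matching process yet: keep waiting
    return "keep_in_migration_area", "waiting for a matching Goobi process"


if __name__ == "__main__":
    now = datetime(2014, 5, 30)
    print(migrate("b12345", now - timedelta(days=3), True, {"b12345"}, now))
```

Returning a human-readable message alongside the action mirrors the requirement that a folder moved to 'FTP suspicious content' carries a readable error description.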
[Diagram: FTP upload service]
‣ Customer 1, Customer 2, Customer 3 etc.
‣ FTP Server: the publicly available FTP server writes data into an insecure storage area
‣ DMZ storage: content from customers 1, 2, 3 etc.
‣ FTP suspicious upload
‣ Internal storage for all Wellcome services
‣ FTP Migration Area: content from customers 1, 2, 3 etc.
‣ Storage areas for Wellcome Service 1, Wellcome Service 2, Wellcome Service 3 and Goobi
‣ FTP suspicious content
‣ Content Migrator daemon: analyses uploaded and clean content and triggers the data import into Goobi
52. 3. What is coming next in Goobi?
Currently, still this year and potentially soon …
53. 3. What is coming next in Goobi?
2.0 3.0
‣ Finish new UI
‣ Add some minor features for usability
‣ Metadata & formats
‣ Scalability
‣ Functionality enrichment with plugins
‣ Efficiency / Usability
‣ More automatic
‣ Data enrichment
70. 3.3. Perspectives
‣ Newspaper material
‣ Alternative Editors
‣ TEI
‣ (additional) content for web presentation
‣ Thumbnails in METS-Editor
‣ Indexing of structure data, page assignment and metadata in one single step
‣ OLR-Support for indexing
‣ OCR-supported pagination
71. 3.3. Perspectives
‣ Quality assurance
‣ Quality control in the GUI
‣ More automatic validation
‣ Cropping / Deskew
‣ Controlling
‣ Better restriction model
‣ Statistics for Non-Admins
‣ Optimisation
‣ SQL-Optimisation
‣ Search for slowest workflow steps (Snagging)
‣ Clustering
72. 3.3. Perspectives
‣ Workflows
‣ Start Workflows before the digitisation begins
‣ Status 'skipped'
‣ Optimisation for the cooperation between multiple Goobis (Data import, Sync2Goobi etc.)
‣ Better handling for mass manipulations
‣ Extension of search functionality (incl. information about the file system)
73. 3.3. Perspectives
‣ Other
‣ IPTC-Header
‣ Archiving / Repository
‣ Correction loops
‣ Corrections triggered from external persons
‣ GoobiScript as plugins
‣ More plugins (Sidebar etc.)