SlideShare a Scribd company logo
http://www.forgetit-project.eu
Some store to remember, some store to forget
Olivier Dobberkau
CEO of dkd Internet Service GmbH
Frankfurt, Germany
About me
ForgetIT Project TYPO3Camp Milano 2014
What this is all about
The problem
Storage capacity is ever increasing
Prices for storage are falling
How large is large?
Size references
A simple text: an average Wikipedia article ≈ 3.78 kB (no markup)
Lots of text: complete Wikipedia ≈ 13.5 GB (text only, no markup)
An average image (12MP) ≈ 1.3 MB (JPG 90% quality; 24bit/pixel)
An average movie stored on Blu-ray Disc ≈ 25.48 GB
1955 – The IBM 355
Capacity: 12 MB
Cost: 6,233.33 USD/MB
3,250 90
✘
0
✘
0.16 kB
1970 – The IBM 3330
Capacity: 100 MB
Cost: 259.70 USD/MB
3.94 kB27,089 76 0
✘
0
✘
1988 – Seagate ST-238
Capacity: 30 MB
Cost: 9.97 USD/MB
102.71 kB8,126 23 0
✘
0
✘
2000 – Western Digital WD600AB
Capacity: 60 GB
Cost: 0.00275 USD/MB
16,644,063 4 47,261 2 363.64 MB
2010 – Seagate ST32000542AS
Capacity: 2 TB
Cost: 0.0000450 USD/MB
≈ 5 cent/GB
541,798,941 148 1,538,461 76 21.7 GB
2013 – NSA
Capacity: ∞
Cost: free
∞ ∞ ∞ ∞ it’s free :)
✘
Let’s store everything, then!
Cool!
Or, maybe not...
There’s a lot more costs
Retrieval
Maintenance
Indexing
Updates
We need to keep our information
Accessible
Usable
Useful
The concept of Memory Buoyancy
Let’s start to forget!
Memory Buoyancy
time
memory
Memory Buoyancy
Memory Buoyancy
A short overview
The ForgetIT Project
ForgetIT project overview
Consortium of 11 partners
Project start was in February 2013
3 years of research & development
http://www.forgetit-project.eu
The ForgetIT project is funded by the EC within the 7th Framework
Programme under the objective "Digital Preservation"
(GA 600826).
Project Partners 1/2
Centre for Research and Technology Hellas
dkd Internet Service GmbH
Deutsches Forschungszentrum für Künstliche Intelligenz GmbH
Eurix Srl
Gottfried Wilhelm Leibniz Universität Hannover
Project Partners 2/2
IBM Israel - Science and Technology Ltd
Luleå Tekniska Universitet
The Chancellor, Masters and Scholars of the University of Oxford
The University of Edinburgh
The University of Sheffield
Turk Telekomunikasyon AS
Inspiring people to share!
TYPO3 is the CMS used for the organisational use cases
TYPO3 was chosen because it’s Open Source
We want to raise awareness on the matter of preservation
We will publish our modules under open source licenses
ForgetIT core concepts
Managed Forgetting
Synergetic Preservation
Contextualised Remembering
“meta-data is a love note to the future” (Jason Scott)
Do you preserve?
What is preservation?
“Preservation — The protection of cultural
property through activities that minimize
chemical and physical deterioration and
damage and that prevent loss of informational
content. The primary goal of preservation is to
prolong the existence of cultural property.”
Preservation 101
Problems are caused by
storage medium (disks, tapes, DVD, etc.)
format of the data
availability of the software or operating system
possible encryption
“The digital dark age is a possible future
situation where it will be difficult or impossible
to read historical electronic documents and
multimedia, because they have been stored in
an obsolete and obscure file format.” Wikipedia
Digital Dark Age
Preserving a website is not trivial
What do want you preserve?
Content only?
Content and Design?
How often? Stock prices vs. Company History page
How do you deal with browser differences?
How do you preserve functionality? E.g. insurance fee calculator
Preservation Value
~ 5,000 €~ 200,000 €
Private
Organisational
The ForgetIT Use Cases
A personal use case:
How to organise an ever growing picture collection
Personal Preservation
Typical use cases in the daily work with TYPO3-driven company
websites.
Organisational Preservation
Organisational Use Cases
Digital Asset Management
Versioning
Archiving a complete Website
Individual genres and their specific requirements
Example: Press Release
An organisational use case
Press Release Example
Elements of a Press Release
text
image
links
documents
Meta information
Presseinformationen Spielwarenmesse
Global Toy Conference Now on Saturday at the
Spielwarenmesse
* Customised programme for retailers: “How to get
your customer into the shop”
* Conference will take place for the 5th time in
Nuremberg on 1 February 2014
All around the world, retailers are wondering how
they can still get their customers in their shops
in the age of the Internet – because competition
for the sale of consumer goods online is growing
dramatically. With the topic “How to Get
Customers into Your Shop – Successful Pricing,
Presentation and Selling” the Global Toy
Conference of the Spielwarenmesse demonstrates
what parameters business owners can adjust for
the future. The conference will take place for
the first time in the St Petersburg hall in the
NCC East on Saturday. The new earlier date means
that more international retailers can take
advantage of the knowledge on offer at the toy
industry's leading trade fair – from 9 a.m. to 4
p.m. on 1 February 2014.
...
Translations
German English
…
Levels of significance
archive value
Action: keep forever
legal value
Action: keep for legal time
Archive
present value
Action: Keep for x days
trigger value
Action: Check significance
KeepDelete
media
meta info
media
meta info
Content Management Systemmedia
meta info
copy
move
refer
media
meta info
media
asset
meta info
media
asset
meta info
etc.
meta info
editable
content
meta info
structure
(code, users,
plugins,
extensions,
etc.)
meta info
external
Digital Asset (DAM)
internal
Archive 1
Info Level 2
Info Level 3
media
meta info
media
asset
meta info
media
asset
meta info
etc.
meta info
editable
content
meta info
structure
(code, users,
plugins,
extensions,
etc.
meta info
Info Level 1
(semi)automatic
static
dynamic
Info Level 4, etc.
Output
Archive 2
Delete
Archive 1
Info Level 2
Info Level 3
media
meta info
media
asset
meta info
media
asset
meta info
etc.
meta info
editable
content
meta info
structure
(code, users,
plugins,
extensions,
etc.
meta info
Info Level 1
(semi)automatic
static
dynamic
Info Level 4, etc.
Output
Archive 2
Delete
Archive 1 Archive 2
Delete
L2
L1
L3
L4
L2
L1
L3
L4
T-CM (Todays Content Management) F-CM (Future Content Management)
Retrieve Service
Information Lifecycle
Collect Create Process Publish Analyse Archive
Collect
Create
Process
Publish
Analyse
Archive
Information Lifecycle
Collect Create Process Publish Analyse ArchiveProcess
Annotations
Example Press Release
Annotation (text) Annotation (image)
global toy
conference,
conference,
podium, speaker,
lights
A game about forgetting.
Do you remember?
or how you can participate
Next steps
We’d love to see you participate!
Reflect your thoughts with us
Take our short survey: http://tinyurl.com/forgetit-webarchiving
Tell us your use cases
Join the development of TYPO3 features
Thank you for your attention!
Sources, Books, Images
References
References (Sources) 1/2
Size of Wikipedia (as of 2013-10-04): https://en.wikipedia.org/wiki/
Wikipedia:Size_comparisons
Average JPG size: http://web.forret.com/tools/megapixel.asp?
title=12+Megapixel+camera&width=4000&height=3000
Average movie size: http://answers.yahoo.com/question/index?
qid=20110807095141AABGQm8
Storage Prices: http://www.jcmit.com/diskprice.htm
References (Sources) 2/2
Forget IT Website: http://www.forgetit-project.eu
Preservation: http://unfacilitated.preservation101.org/session1/
expl_whatis-definitions.asp
Digital Dark Age: https://en.wikipedia.org/wiki/Digital_dark_age
References (Books)
Delete: The Virtue of Forgetting in the Digital Age, Viktor Mayer-
Schönberger
References (Images) 1/8
“About me”: all images by Søren Schaffstein
“ForgetIT Team” by Søren Schaffstein
“The Problem/Knot”: http://www.istockphoto.com/stock-
photo-8933647-rope-with-knot.php
“1 Dollar”: http://www.istockphoto.com/stock-photo-17830696-fan-
dollars-isolated-on-white.php
Starbucks Cups: http://5feetonagoodday.files.wordpress.com/
2012/01/starbucks-coffee-cups-sizes-tall-grande-venti-trenta.jpg
References (Images) 2/8
IBM 355: http://www-03.ibm.com/ibm/history/exhibits/storage/
storage_355.html
IBM 3330: http://www-03.ibm.com/ibm/history/exhibits/storage/
storage_3330.html
Seagate ST-238: http://www.redlop.de/bilder/produkte/gross/
Seagate-WREN-5-ST4702N-702-MB-.png
Western Digital WD600AB: http://www.junek.de/thomas/bilder/
WD600AB.jpg
References (Images) 3/8
Seagate ST32000542AS: http://bilder.afterbuy.de/images/ZZNLZ/
seagatesata.jpg
Finger “Forget”: http://www.istockphoto.com/stock-photo-7252836-
string-finger-reminder-on-white.php
Memory Buoyancy: http://www.istockphoto.com/stock-
photo-16244755-fishing-hook-underwater.php?st=0320b45
Fish: http://www.istockphoto.com/stock-photo-14623368-gold-fish-
and-piranha.php
References (Images) 4/8
Game pieces by Søren Schaffstein
Managed Forgetting: http://www.istockphoto.com/stock-
photo-3533508-colorful-memos.php?st=0320b45
Synergetic Preservation: http://www.istockphoto.com/stock-
photo-13301920-goldfish-jump.php
Contextualised Remembering: http://www.istockphoto.com/stock-
photo-14370511-shoebox-of-old-photos-too.php
References (Images) 5/8
Cans: http://www.istockphoto.com/stock-photo-16948268-three-
metallic-goods-can-with-key.php
5 1/4” Disk: https://secure.flickr.com/photos/twicepix/4330813840/
sizes/z/in/photostream/
5 1/4” Disk Drawing: https://secure.flickr.com/photos/
flattop341/2094771560/sizes/z/in/photostream/
Ami Pro: http://www.os2museum.com/wp/?attachment_id=99
Digital Dark Age by Søren Schaffstein
References (Images) 6/8
Gauges: http://www.istockphoto.com/stock-photo-9059088-old-
gauges.php
Golf Car: http://www.netzeitung.de/default/337276.html#
Golf Car Papers: http://www.motor-talk.de/news/das-heilige-blech-
wieder-unterm-hammer-t4421282.html
Create: http://hdwallsize.com/wp-content/uploads/2013/04/
Abstract-Art-Wallpaper-Dekstop.jpg
References (Images) 7/8
Process by Søren Schaffstein
Publish: http://www.istockphoto.com/stock-photo-25712828-british-
dog-reading.php?st=e5bf164
Analyse: http://www.istockphoto.com/stock-photo-28297160-
laboratory-experimental-testing.php?st=239c76e
Archive: http://www.istockphoto.com/stock-photo-18865341-old-
wooden-card-catalogue-with-one-opened-drawer.php
References (Images) 8/8
Shoes: http://www.istockphoto.com/stock-photo-2457744-what-s-
your-walking-style.php?st=e12d3d2
Questions: http://www.istockphoto.com/stock-photo-17686236-
decision-making.php

More Related Content

Similar to ForgetIT Project TYPO3Camp Milano 2014

ForgetIT – Some store to remember, some store to forget
ForgetIT – Some store to remember, some store to forgetForgetIT – Some store to remember, some store to forget
ForgetIT – Some store to remember, some store to forget
Søren Schaffstein
 
Reading information technologies and communication basic concepts
Reading information technologies and communication basic conceptsReading information technologies and communication basic concepts
Reading information technologies and communication basic concepts
Miguel Angel Romero Ochoa
 
Integrating Technology Into The Classroom
Integrating Technology Into The ClassroomIntegrating Technology Into The Classroom
Integrating Technology Into The Classroom
abutcher3306
 
Preserve or preserve not
Preserve or preserve notPreserve or preserve not
Preserve or preserve not
National Library of Australia
 
Electroniquev2
Electroniquev2Electroniquev2
Electroniquev2
Mehdi zizi
 
Trm Introduction
Trm IntroductionTrm Introduction
Trm Introduction
DigitalPreservationEurope
 
Digital Preservation Best Practices: Lessons Learned From Across the Pond
Digital Preservation Best Practices: Lessons Learned From Across the PondDigital Preservation Best Practices: Lessons Learned From Across the Pond
Digital Preservation Best Practices: Lessons Learned From Across the Pond
Benoit Pauwels
 
Digital Presentation Best Practices: Lessons Learned From Across the Pond
Digital Presentation Best Practices: Lessons Learned From Across the PondDigital Presentation Best Practices: Lessons Learned From Across the Pond
Digital Presentation Best Practices: Lessons Learned From Across the Pond
ULB - Bibliothèques
 
Integrating technology into the classroom
Integrating technology into the classroomIntegrating technology into the classroom
Integrating technology into the classroom
TammiRice
 
5 module data storage 1
5 module data storage 15 module data storage 1
5 module data storage 1
ConnectYourCommunity
 
5 module data storage
5 module data storage5 module data storage
5 module data storage
ConnectYourCommunity
 
5 module data storage 1
5 module data storage 15 module data storage 1
5 module data storage 1
Rozell Sneede
 
IBM Tape the future of tape
IBM Tape the future of tapeIBM Tape the future of tape
IBM Tape the future of tape
Josef Weingand
 
Hospitality Information Systems
Hospitality Information SystemsHospitality Information Systems
Hospitality Information Systems
Anil Bilgihan
 
Puglia marac-file formats-20111020
Puglia marac-file formats-20111020Puglia marac-file formats-20111020
Puglia marac-file formats-20111020
MARAC Bethlehem PC
 
Flash Memory Ted 111
Flash Memory Ted 111Flash Memory Ted 111
Flash Memory Ted 111
mil1375
 
MyLifeBits van Microsoft
MyLifeBits van MicrosoftMyLifeBits van Microsoft
MyLifeBits van Microsoft
Edwin Mijnsbergen
 
Personal Digital Archiving Initiatives at the Library of Congress
Personal Digital Archiving Initiatives at the Library of CongressPersonal Digital Archiving Initiatives at the Library of Congress
Personal Digital Archiving Initiatives at the Library of Congress
lljohnston
 
Planning and Managing Digital Library & Archive Projects
Planning and Managing Digital Library & Archive ProjectsPlanning and Managing Digital Library & Archive Projects
Planning and Managing Digital Library & Archive Projects
ac2182
 
DAMbusters: IWM’s Mission to Design and Implement a Bespoke DAMS
DAMbusters: IWM’s Mission to Design and Implement a Bespoke DAMSDAMbusters: IWM’s Mission to Design and Implement a Bespoke DAMS
DAMbusters: IWM’s Mission to Design and Implement a Bespoke DAMS
Axiell ALM
 

Similar to ForgetIT Project TYPO3Camp Milano 2014 (20)

ForgetIT – Some store to remember, some store to forget
ForgetIT – Some store to remember, some store to forgetForgetIT – Some store to remember, some store to forget
ForgetIT – Some store to remember, some store to forget
 
Reading information technologies and communication basic concepts
Reading information technologies and communication basic conceptsReading information technologies and communication basic concepts
Reading information technologies and communication basic concepts
 
Integrating Technology Into The Classroom
Integrating Technology Into The ClassroomIntegrating Technology Into The Classroom
Integrating Technology Into The Classroom
 
Preserve or preserve not
Preserve or preserve notPreserve or preserve not
Preserve or preserve not
 
Electroniquev2
Electroniquev2Electroniquev2
Electroniquev2
 
Trm Introduction
Trm IntroductionTrm Introduction
Trm Introduction
 
Digital Preservation Best Practices: Lessons Learned From Across the Pond
Digital Preservation Best Practices: Lessons Learned From Across the PondDigital Preservation Best Practices: Lessons Learned From Across the Pond
Digital Preservation Best Practices: Lessons Learned From Across the Pond
 
Digital Presentation Best Practices: Lessons Learned From Across the Pond
Digital Presentation Best Practices: Lessons Learned From Across the PondDigital Presentation Best Practices: Lessons Learned From Across the Pond
Digital Presentation Best Practices: Lessons Learned From Across the Pond
 
Integrating technology into the classroom
Integrating technology into the classroomIntegrating technology into the classroom
Integrating technology into the classroom
 
5 module data storage 1
5 module data storage 15 module data storage 1
5 module data storage 1
 
5 module data storage
5 module data storage5 module data storage
5 module data storage
 
5 module data storage 1
5 module data storage 15 module data storage 1
5 module data storage 1
 
IBM Tape the future of tape
IBM Tape the future of tapeIBM Tape the future of tape
IBM Tape the future of tape
 
Hospitality Information Systems
Hospitality Information SystemsHospitality Information Systems
Hospitality Information Systems
 
Puglia marac-file formats-20111020
Puglia marac-file formats-20111020Puglia marac-file formats-20111020
Puglia marac-file formats-20111020
 
Flash Memory Ted 111
Flash Memory Ted 111Flash Memory Ted 111
Flash Memory Ted 111
 
MyLifeBits van Microsoft
MyLifeBits van MicrosoftMyLifeBits van Microsoft
MyLifeBits van Microsoft
 
Personal Digital Archiving Initiatives at the Library of Congress
Personal Digital Archiving Initiatives at the Library of CongressPersonal Digital Archiving Initiatives at the Library of Congress
Personal Digital Archiving Initiatives at the Library of Congress
 
Planning and Managing Digital Library & Archive Projects
Planning and Managing Digital Library & Archive ProjectsPlanning and Managing Digital Library & Archive Projects
Planning and Managing Digital Library & Archive Projects
 
DAMbusters: IWM’s Mission to Design and Implement a Bespoke DAMS
DAMbusters: IWM’s Mission to Design and Implement a Bespoke DAMSDAMbusters: IWM’s Mission to Design and Implement a Bespoke DAMS
DAMbusters: IWM’s Mission to Design and Implement a Bespoke DAMS
 

More from Olivier Dobberkau

Meet TYPO3 Vienna - Solr die Suchmachine für TYPO3
Meet TYPO3 Vienna - Solr die Suchmachine für TYPO3Meet TYPO3 Vienna - Solr die Suchmachine für TYPO3
Meet TYPO3 Vienna - Solr die Suchmachine für TYPO3
Olivier Dobberkau
 
Apache Solr for TYPO3: More than a search engine
Apache Solr for TYPO3: More than a search engineApache Solr for TYPO3: More than a search engine
Apache Solr for TYPO3: More than a search engine
Olivier Dobberkau
 
TYPO3 v8 LTS in the cloud
TYPO3 v8 LTS in the cloudTYPO3 v8 LTS in the cloud
TYPO3 v8 LTS in the cloud
Olivier Dobberkau
 
With a little help from my friends (english)
With a little help  from my friends (english)With a little help  from my friends (english)
With a little help from my friends (english)
Olivier Dobberkau
 
With a little help from my friends
With a little help from my friendsWith a little help from my friends
With a little help from my friends
Olivier Dobberkau
 
TYPO3 & You
TYPO3 & YouTYPO3 & You
TYPO3 & You
Olivier Dobberkau
 
Sonnenschein für ihre Website
Sonnenschein für ihre WebsiteSonnenschein für ihre Website
Sonnenschein für ihre Website
Olivier Dobberkau
 
TYPO3 Camp Poznan - Solr Usecases with Hosted Solr
TYPO3 Camp Poznan - Solr Usecases with Hosted SolrTYPO3 Camp Poznan - Solr Usecases with Hosted Solr
TYPO3 Camp Poznan - Solr Usecases with Hosted Solr
Olivier Dobberkau
 
TYPO3 and CMIS
TYPO3 and CMISTYPO3 and CMIS
TYPO3 and CMIS
Olivier Dobberkau
 
ForgetIT: Beyond the page: Giving content a meaning and value
ForgetIT: Beyond the page: Giving content a meaning and valueForgetIT: Beyond the page: Giving content a meaning and value
ForgetIT: Beyond the page: Giving content a meaning and value
Olivier Dobberkau
 
Apache Solr for TYPO3 CMS 101
Apache Solr for TYPO3 CMS 101Apache Solr for TYPO3 CMS 101
Apache Solr for TYPO3 CMS 101
Olivier Dobberkau
 
Outside the Box - Panel on CMS at TYPO3 Camp Mallorca
Outside the Box - Panel on CMS at TYPO3 Camp MallorcaOutside the Box - Panel on CMS at TYPO3 Camp Mallorca
Outside the Box - Panel on CMS at TYPO3 Camp Mallorca
Olivier Dobberkau
 
Status & Outlook on EXT:solr for TYPO3 CMS
Status & Outlook on EXT:solr for TYPO3 CMSStatus & Outlook on EXT:solr for TYPO3 CMS
Status & Outlook on EXT:solr for TYPO3 CMS
Olivier Dobberkau
 
Digital dark age - Are we doing enough to preserve our website heritage?
Digital dark age - Are we doing enough to preserve our website heritage?Digital dark age - Are we doing enough to preserve our website heritage?
Digital dark age - Are we doing enough to preserve our website heritage?
Olivier Dobberkau
 
Everything you always wanted to know about search in typo3
Everything you always wanted to know about search in typo3Everything you always wanted to know about search in typo3
Everything you always wanted to know about search in typo3
Olivier Dobberkau
 
Alles was-sie-ueber-suche-wissen-wollten
Alles was-sie-ueber-suche-wissen-wolltenAlles was-sie-ueber-suche-wissen-wollten
Alles was-sie-ueber-suche-wissen-wollten
Olivier Dobberkau
 
Searching does not mean finding Stuff - Apache Solr for TYPO3
Searching does not mean finding Stuff - Apache Solr for TYPO3Searching does not mean finding Stuff - Apache Solr for TYPO3
Searching does not mean finding Stuff - Apache Solr for TYPO3
Olivier Dobberkau
 
Das Solr System - Suche nicht nur auf Planet TYPO3
Das Solr System - Suche nicht nur auf Planet TYPO3Das Solr System - Suche nicht nur auf Planet TYPO3
Das Solr System - Suche nicht nur auf Planet TYPO3
Olivier Dobberkau
 

More from Olivier Dobberkau (18)

Meet TYPO3 Vienna - Solr die Suchmachine für TYPO3
Meet TYPO3 Vienna - Solr die Suchmachine für TYPO3Meet TYPO3 Vienna - Solr die Suchmachine für TYPO3
Meet TYPO3 Vienna - Solr die Suchmachine für TYPO3
 
Apache Solr for TYPO3: More than a search engine
Apache Solr for TYPO3: More than a search engineApache Solr for TYPO3: More than a search engine
Apache Solr for TYPO3: More than a search engine
 
TYPO3 v8 LTS in the cloud
TYPO3 v8 LTS in the cloudTYPO3 v8 LTS in the cloud
TYPO3 v8 LTS in the cloud
 
With a little help from my friends (english)
With a little help  from my friends (english)With a little help  from my friends (english)
With a little help from my friends (english)
 
With a little help from my friends
With a little help from my friendsWith a little help from my friends
With a little help from my friends
 
TYPO3 & You
TYPO3 & YouTYPO3 & You
TYPO3 & You
 
Sonnenschein für ihre Website
Sonnenschein für ihre WebsiteSonnenschein für ihre Website
Sonnenschein für ihre Website
 
TYPO3 Camp Poznan - Solr Usecases with Hosted Solr
TYPO3 Camp Poznan - Solr Usecases with Hosted SolrTYPO3 Camp Poznan - Solr Usecases with Hosted Solr
TYPO3 Camp Poznan - Solr Usecases with Hosted Solr
 
TYPO3 and CMIS
TYPO3 and CMISTYPO3 and CMIS
TYPO3 and CMIS
 
ForgetIT: Beyond the page: Giving content a meaning and value
ForgetIT: Beyond the page: Giving content a meaning and valueForgetIT: Beyond the page: Giving content a meaning and value
ForgetIT: Beyond the page: Giving content a meaning and value
 
Apache Solr for TYPO3 CMS 101
Apache Solr for TYPO3 CMS 101Apache Solr for TYPO3 CMS 101
Apache Solr for TYPO3 CMS 101
 
Outside the Box - Panel on CMS at TYPO3 Camp Mallorca
Outside the Box - Panel on CMS at TYPO3 Camp MallorcaOutside the Box - Panel on CMS at TYPO3 Camp Mallorca
Outside the Box - Panel on CMS at TYPO3 Camp Mallorca
 
Status & Outlook on EXT:solr for TYPO3 CMS
Status & Outlook on EXT:solr for TYPO3 CMSStatus & Outlook on EXT:solr for TYPO3 CMS
Status & Outlook on EXT:solr for TYPO3 CMS
 
Digital dark age - Are we doing enough to preserve our website heritage?
Digital dark age - Are we doing enough to preserve our website heritage?Digital dark age - Are we doing enough to preserve our website heritage?
Digital dark age - Are we doing enough to preserve our website heritage?
 
Everything you always wanted to know about search in typo3
Everything you always wanted to know about search in typo3Everything you always wanted to know about search in typo3
Everything you always wanted to know about search in typo3
 
Alles was-sie-ueber-suche-wissen-wollten
Alles was-sie-ueber-suche-wissen-wolltenAlles was-sie-ueber-suche-wissen-wollten
Alles was-sie-ueber-suche-wissen-wollten
 
Searching does not mean finding Stuff - Apache Solr for TYPO3
Searching does not mean finding Stuff - Apache Solr for TYPO3Searching does not mean finding Stuff - Apache Solr for TYPO3
Searching does not mean finding Stuff - Apache Solr for TYPO3
 
Das Solr System - Suche nicht nur auf Planet TYPO3
Das Solr System - Suche nicht nur auf Planet TYPO3Das Solr System - Suche nicht nur auf Planet TYPO3
Das Solr System - Suche nicht nur auf Planet TYPO3
 

Recently uploaded

optimized green synthesis characterization and evaluation
optimized green synthesis characterization and evaluationoptimized green synthesis characterization and evaluation
optimized green synthesis characterization and evaluation
ManojKumarr75
 
Girls Call Shimla 000XX00000 Provide Best And Top Girl Service And No1 in City
Girls Call Shimla 000XX00000 Provide Best And Top Girl Service And No1 in CityGirls Call Shimla 000XX00000 Provide Best And Top Girl Service And No1 in City
Girls Call Shimla 000XX00000 Provide Best And Top Girl Service And No1 in City
dilbaagsingh0898
 
Portugal Dreamin 24 - How to easily use an API with Flows
Portugal Dreamin 24  - How to easily use an API with FlowsPortugal Dreamin 24  - How to easily use an API with Flows
Portugal Dreamin 24 - How to easily use an API with Flows
Thierry TROUIN ☁
 
DASH, presented by Elly Tawhai at PacNOG 33
DASH, presented by Elly Tawhai at PacNOG 33DASH, presented by Elly Tawhai at PacNOG 33
DASH, presented by Elly Tawhai at PacNOG 33
APNIC
 
Trading Strategy for London silver bullet
Trading Strategy for London silver bulletTrading Strategy for London silver bullet
Trading Strategy for London silver bullet
OkgatoSemadi1
 
6 Reasons to Use a VPN | 3S VPN Server App
6 Reasons to Use a VPN | 3S VPN Server App6 Reasons to Use a VPN | 3S VPN Server App
6 Reasons to Use a VPN | 3S VPN Server App
VPN Server
 
Information Systems Auditing, Controls and Assurance , tanapat limsaiprom
Information Systems Auditing, Controls and Assurance , tanapat limsaipromInformation Systems Auditing, Controls and Assurance , tanapat limsaiprom
Information Systems Auditing, Controls and Assurance , tanapat limsaiprom
TanapatLimsaiprom1
 
High Profile Girls Call ServiCe Chennai XX00XXX00X Tanisha Best High Class Ch...
High Profile Girls Call ServiCe Chennai XX00XXX00X Tanisha Best High Class Ch...High Profile Girls Call ServiCe Chennai XX00XXX00X Tanisha Best High Class Ch...
High Profile Girls Call ServiCe Chennai XX00XXX00X Tanisha Best High Class Ch...
shamrisumri
 
Bitcoin vs Ethereum Which Crypto Performed Better in Q2, 2024.docx
Bitcoin vs Ethereum Which Crypto Performed Better in Q2, 2024.docxBitcoin vs Ethereum Which Crypto Performed Better in Q2, 2024.docx
Bitcoin vs Ethereum Which Crypto Performed Better in Q2, 2024.docx
SFC Today
 
AWS Networking Basic , tanapat limsaiprom
AWS Networking Basic , tanapat limsaipromAWS Networking Basic , tanapat limsaiprom
AWS Networking Basic , tanapat limsaiprom
ธนาพัฒน์ ลิ้มสายพรหม
 
Build a Professional Resume using Canva , Tanapat Limsaiprom
Build a Professional Resume using Canva , Tanapat LimsaipromBuild a Professional Resume using Canva , Tanapat Limsaiprom
Build a Professional Resume using Canva , Tanapat Limsaiprom
TanapatLimsaiprom1
 
Iot-Internet-of-Things_Industrial revolution 4.0-ppt.pptx
Iot-Internet-of-Things_Industrial revolution 4.0-ppt.pptxIot-Internet-of-Things_Industrial revolution 4.0-ppt.pptx
Iot-Internet-of-Things_Industrial revolution 4.0-ppt.pptx
DeepakKumar862274
 
Kolkata @Girls @Call WhatsApp Numbers 🫦0000XX0000🫦 List For Friendship Girls ...
Kolkata @Girls @Call WhatsApp Numbers 🫦0000XX0000🫦 List For Friendship Girls ...Kolkata @Girls @Call WhatsApp Numbers 🫦0000XX0000🫦 List For Friendship Girls ...
Kolkata @Girls @Call WhatsApp Numbers 🫦0000XX0000🫦 List For Friendship Girls ...
paridubey2024#G05
 
Career Development Advice for Network Engineers across the Pacific, presented...
Career Development Advice for Network Engineers across the Pacific, presented...Career Development Advice for Network Engineers across the Pacific, presented...
Career Development Advice for Network Engineers across the Pacific, presented...
APNIC
 
Female Service Girls Call Delhi 9873940964 Provide Best And Top Girl Service ...
Female Service Girls Call Delhi 9873940964 Provide Best And Top Girl Service ...Female Service Girls Call Delhi 9873940964 Provide Best And Top Girl Service ...
Female Service Girls Call Delhi 9873940964 Provide Best And Top Girl Service ...
elbertablack
 
UMN degree offer diploma Transcript
UMN degree offer diploma TranscriptUMN degree offer diploma Transcript
UMN degree offer diploma Transcript
cenocb
 
Tarun Gaur On Data Breaches and Privacy Fears
Tarun Gaur On Data Breaches and Privacy FearsTarun Gaur On Data Breaches and Privacy Fears
Tarun Gaur On Data Breaches and Privacy Fears
Tarun Gaur
 
How-to-Diagnose-Hard-Drives-by-DFL-DDP-2024.pdf
How-to-Diagnose-Hard-Drives-by-DFL-DDP-2024.pdfHow-to-Diagnose-Hard-Drives-by-DFL-DDP-2024.pdf
How-to-Diagnose-Hard-Drives-by-DFL-DDP-2024.pdf
Dolphin Data Lab
 
Effective Tips for Creating the Best Rich Media Ads .pptx
Effective Tips for Creating the Best Rich Media Ads .pptxEffective Tips for Creating the Best Rich Media Ads .pptx
Effective Tips for Creating the Best Rich Media Ads .pptx
AirtoryInc
 
202254.com香蕉影视,在线观看《我才不要和你做朋友呢》在线观看最新电影,香蕉影视在线观看《我才不要和你做朋友呢》在线观看高清电影
202254.com香蕉影视,在线观看《我才不要和你做朋友呢》在线观看最新电影,香蕉影视在线观看《我才不要和你做朋友呢》在线观看高清电影202254.com香蕉影视,在线观看《我才不要和你做朋友呢》在线观看最新电影,香蕉影视在线观看《我才不要和你做朋友呢》在线观看高清电影
202254.com香蕉影视,在线观看《我才不要和你做朋友呢》在线观看最新电影,香蕉影视在线观看《我才不要和你做朋友呢》在线观看高清电影
ffg01100
 

Recently uploaded (20)

optimized green synthesis characterization and evaluation
optimized green synthesis characterization and evaluationoptimized green synthesis characterization and evaluation
optimized green synthesis characterization and evaluation
 
Girls Call Shimla 000XX00000 Provide Best And Top Girl Service And No1 in City
Girls Call Shimla 000XX00000 Provide Best And Top Girl Service And No1 in CityGirls Call Shimla 000XX00000 Provide Best And Top Girl Service And No1 in City
Girls Call Shimla 000XX00000 Provide Best And Top Girl Service And No1 in City
 
Portugal Dreamin 24 - How to easily use an API with Flows
Portugal Dreamin 24  - How to easily use an API with FlowsPortugal Dreamin 24  - How to easily use an API with Flows
Portugal Dreamin 24 - How to easily use an API with Flows
 
DASH, presented by Elly Tawhai at PacNOG 33
DASH, presented by Elly Tawhai at PacNOG 33DASH, presented by Elly Tawhai at PacNOG 33
DASH, presented by Elly Tawhai at PacNOG 33
 
Trading Strategy for London silver bullet
Trading Strategy for London silver bulletTrading Strategy for London silver bullet
Trading Strategy for London silver bullet
 
6 Reasons to Use a VPN | 3S VPN Server App
6 Reasons to Use a VPN | 3S VPN Server App6 Reasons to Use a VPN | 3S VPN Server App
6 Reasons to Use a VPN | 3S VPN Server App
 
Information Systems Auditing, Controls and Assurance , tanapat limsaiprom
Information Systems Auditing, Controls and Assurance , tanapat limsaipromInformation Systems Auditing, Controls and Assurance , tanapat limsaiprom
Information Systems Auditing, Controls and Assurance , tanapat limsaiprom
 
High Profile Girls Call ServiCe Chennai XX00XXX00X Tanisha Best High Class Ch...
High Profile Girls Call ServiCe Chennai XX00XXX00X Tanisha Best High Class Ch...High Profile Girls Call ServiCe Chennai XX00XXX00X Tanisha Best High Class Ch...
High Profile Girls Call ServiCe Chennai XX00XXX00X Tanisha Best High Class Ch...
 
Bitcoin vs Ethereum Which Crypto Performed Better in Q2, 2024.docx
Bitcoin vs Ethereum Which Crypto Performed Better in Q2, 2024.docxBitcoin vs Ethereum Which Crypto Performed Better in Q2, 2024.docx
Bitcoin vs Ethereum Which Crypto Performed Better in Q2, 2024.docx
 
AWS Networking Basic , tanapat limsaiprom
AWS Networking Basic , tanapat limsaipromAWS Networking Basic , tanapat limsaiprom
AWS Networking Basic , tanapat limsaiprom
 
Build a Professional Resume using Canva , Tanapat Limsaiprom
Build a Professional Resume using Canva , Tanapat LimsaipromBuild a Professional Resume using Canva , Tanapat Limsaiprom
Build a Professional Resume using Canva , Tanapat Limsaiprom
 
Iot-Internet-of-Things_Industrial revolution 4.0-ppt.pptx
Iot-Internet-of-Things_Industrial revolution 4.0-ppt.pptxIot-Internet-of-Things_Industrial revolution 4.0-ppt.pptx
Iot-Internet-of-Things_Industrial revolution 4.0-ppt.pptx
 
Kolkata @Girls @Call WhatsApp Numbers 🫦0000XX0000🫦 List For Friendship Girls ...
Kolkata @Girls @Call WhatsApp Numbers 🫦0000XX0000🫦 List For Friendship Girls ...Kolkata @Girls @Call WhatsApp Numbers 🫦0000XX0000🫦 List For Friendship Girls ...
Kolkata @Girls @Call WhatsApp Numbers 🫦0000XX0000🫦 List For Friendship Girls ...
 
Career Development Advice for Network Engineers across the Pacific, presented...
Career Development Advice for Network Engineers across the Pacific, presented...Career Development Advice for Network Engineers across the Pacific, presented...
Career Development Advice for Network Engineers across the Pacific, presented...
 
Female Service Girls Call Delhi 9873940964 Provide Best And Top Girl Service ...
Female Service Girls Call Delhi 9873940964 Provide Best And Top Girl Service ...Female Service Girls Call Delhi 9873940964 Provide Best And Top Girl Service ...
Female Service Girls Call Delhi 9873940964 Provide Best And Top Girl Service ...
 
UMN degree offer diploma Transcript
UMN degree offer diploma TranscriptUMN degree offer diploma Transcript
UMN degree offer diploma Transcript
 
Tarun Gaur On Data Breaches and Privacy Fears
Tarun Gaur On Data Breaches and Privacy FearsTarun Gaur On Data Breaches and Privacy Fears
Tarun Gaur On Data Breaches and Privacy Fears
 
How-to-Diagnose-Hard-Drives-by-DFL-DDP-2024.pdf
How-to-Diagnose-Hard-Drives-by-DFL-DDP-2024.pdfHow-to-Diagnose-Hard-Drives-by-DFL-DDP-2024.pdf
How-to-Diagnose-Hard-Drives-by-DFL-DDP-2024.pdf
 
Effective Tips for Creating the Best Rich Media Ads .pptx
Effective Tips for Creating the Best Rich Media Ads .pptxEffective Tips for Creating the Best Rich Media Ads .pptx
Effective Tips for Creating the Best Rich Media Ads .pptx
 
202254.com香蕉影视,在线观看《我才不要和你做朋友呢》在线观看最新电影,香蕉影视在线观看《我才不要和你做朋友呢》在线观看高清电影
202254.com香蕉影视,在线观看《我才不要和你做朋友呢》在线观看最新电影,香蕉影视在线观看《我才不要和你做朋友呢》在线观看高清电影202254.com香蕉影视,在线观看《我才不要和你做朋友呢》在线观看最新电影,香蕉影视在线观看《我才不要和你做朋友呢》在线观看高清电影
202254.com香蕉影视,在线观看《我才不要和你做朋友呢》在线观看最新电影,香蕉影视在线观看《我才不要和你做朋友呢》在线观看高清电影
 

ForgetIT Project TYPO3Camp Milano 2014

  • 2. Some store to remember, some store to forget
  • 3. Olivier Dobberkau CEO of dkd Internet Service GmbH Frankfurt, Germany About me
  • 5. What this is all about The problem
  • 6. Storage capacity is ever increasing Prices for storage are falling
  • 7. How large is large?
  • 8. Size references A simple text: an average Wikipedia article ≈ 3.78 kB (no markup) Lots of text: complete Wikipedia ≈ 13.5 GB (text only, no markup) An average image (12MP) ≈ 1.3 MB (JPG 90% quality; 24bit/pixel) An average movie stored on Blu-ray Disc ≈ 25.48 GB
  • 9. 1955 – The IBM 355 Capacity: 12 MB Cost: 6,233.33 USD/MB 3,250 90 ✘ 0 ✘ 0.16 kB
  • 10. 1970 – The IBM 3330 Capacity: 100 MB Cost: 259.70 USD/MB 3.94 kB27,089 76 0 ✘ 0 ✘
  • 11. 1988 – Seagate ST-238 Capacity: 30 MB Cost: 9.97 USD/MB 102.71 kB8,126 23 0 ✘ 0 ✘
  • 12. 2000 – Western Digital WD600AB Capacity: 60 GB Cost: 0.00275 USD/MB 16,644,063 4 47,261 2 363.64 MB
  • 13. 2010 – Seagate ST32000542AS Capacity: 2 TB Cost: 0.0000450 USD/MB ≈ 5 cent/GB 541,798,941 148 1,538,461 76 21.7 GB
  • 14. 2013 – NSA Capacity: ∞ Cost: free ∞ ∞ ∞ ∞ it’s free :) ✘
  • 16. Or, maybe not... There’s a lot more costs Retrieval Maintenance Indexing Updates
  • 17. We need to keep our information Accessible Usable Useful
  • 18. The concept of Memory Buoyancy Let’s start to forget!
  • 22. A short overview The ForgetIT Project
  • 23. ForgetIT project overview Consortium of 11 partners Project start was in February 2013 3 years of research & development http://www.forgetit-project.eu The ForgetIT project is funded by the EC within the 7th Framework Programme under the objective "Digital Preservation" (GA 600826).
  • 24. Project Partners 1/2 Centre for Research and Technology Hellas dkd Internet Service GmbH Deutsches Forschungszentrum für Künstliche Intelligenz GmbH Eurix Srl Gottfried Wilhelm Leibniz Universität Hannover
  • 25. Project Partners 2/2 IBM Israel - Science and Technology Ltd Luleå Tekniska Universitet The Chancellor, Masters and Scholars of the University of Oxford The University of Edinburgh The University of Sheffield Turk Telekomunikasyon AS
  • 26. Inspiring people to share! TYPO3 is the CMS used for the organisational use cases TYPO3 was chosen because it’s Open Source We want to raise awareness on the matter of preservation We will publish our modules under open source licenses
  • 31. “meta-data is a love note to the future” (Jason Scott) Do you preserve?
  • 32. What is preservation? “Preservation — The protection of cultural property through activities that minimize chemical and physical deterioration and damage and that prevent loss of informational content. The primary goal of preservation is to prolong the existence of cultural property.” Preservation 101
  • 33. Problems are caused by storage medium (disks, tapes, DVD, etc.) format of the data availability of the software or operating system possible encryption
  • 34. “The digital dark age is a possible future situation where it will be difficult or impossible to read historical electronic documents and multimedia, because they have been stored in an obsolete and obscure file format.” Wikipedia Digital Dark Age
  • 35. Preserving a website is not trivial What do want you preserve? Content only? Content and Design? How often? Stock prices vs. Company History page How do you deal with browser differences? How do you preserve functionality? E.g. insurance fee calculator
  • 36. Preservation Value ~ 5,000 €~ 200,000 €
  • 38. A personal use case: How to organise an ever growing picture collection Personal Preservation
  • 39. Typical use cases in the daily work with TYPO3-driven company websites. Organisational Preservation
  • 40. Organisational Use Cases Digital Asset Management Versioning Archiving a complete Website Individual genres and their specific requirements Example: Press Release
  • 41. An organisational use case Press Release Example
  • 42. Elements of a Press Release text image links documents
  • 43. Meta information Presseinformationen Spielwarenmesse Global Toy Conference Now on Saturday at the Spielwarenmesse * Customised programme for retailers: “How to get your customer into the shop” * Conference will take place for the 5th time in Nuremberg on 1 February 2014 All around the world, retailers are wondering how they can still get their customers in their shops in the age of the Internet – because competition for the sale of consumer goods online is growing dramatically. With the topic “How to Get Customers into Your Shop – Successful Pricing, Presentation and Selling” the Global Toy Conference of the Spielwarenmesse demonstrates what parameters business owners can adjust for the future. The conference will take place for the first time in the St Petersburg hall in the NCC East on Saturday. The new earlier date means that more international retailers can take advantage of the knowledge on offer at the toy industry's leading trade fair – from 9 a.m. to 4 p.m. on 1 February 2014. ...
  • 45.
  • 46. Levels of significance archive value Action: keep forever legal value Action: keep for legal time Archive present value Action: Keep for x days trigger value Action: Check significance KeepDelete
  • 49. Content Management Systemmedia meta info copy move refer
  • 50. media meta info media asset meta info media asset meta info etc. meta info editable content meta info structure (code, users, plugins, extensions, etc.) meta info external Digital Asset (DAM) internal
  • 51. Archive 1 Info Level 2 Info Level 3 media meta info media asset meta info media asset meta info etc. meta info editable content meta info structure (code, users, plugins, extensions, etc. meta info Info Level 1 (semi)automatic static dynamic Info Level 4, etc. Output Archive 2 Delete
  • 52. Archive 1 Info Level 2 Info Level 3 media meta info media asset meta info media asset meta info etc. meta info editable content meta info structure (code, users, plugins, extensions, etc. meta info Info Level 1 (semi)automatic static dynamic Info Level 4, etc. Output Archive 2 Delete
  • 53. Archive 1 Archive 2 Delete L2 L1 L3 L4 L2 L1 L3 L4 T-CM (Todays Content Management) F-CM (Future Content Management) Retrieve Service
  • 54. Information Lifecycle Collect Create Process Publish Analyse Archive
  • 61. Information Lifecycle Collect Create Process Publish Analyse ArchiveProcess Annotations
  • 62. Example Press Release Annotation (text) Annotation (image) global toy conference, conference, podium, speaker, lights
  • 63. A game about forgetting. Do you remember?
  • 64. or how you can participate Next steps
  • 65. We’d love to see you participate! Reflect your thoughts with us Take our short survey: http://tinyurl.com/forgetit-webarchiving Tell us your use cases Join the development of TYPO3 features
  • 66. Thank you for your attention!
  • 68. References (Sources) 1/2 Size of Wikipedia (as of 2013-10-04): https://en.wikipedia.org/wiki/ Wikipedia:Size_comparisons Average JPG size: http://web.forret.com/tools/megapixel.asp? title=12+Megapixel+camera&width=4000&height=3000 Average movie size: http://answers.yahoo.com/question/index? qid=20110807095141AABGQm8 Storage Prices: http://www.jcmit.com/diskprice.htm
  • 69. References (Sources) 2/2 Forget IT Website: http://www.forgetit-project.eu Preservation: http://unfacilitated.preservation101.org/session1/ expl_whatis-definitions.asp Digital Dark Age: https://en.wikipedia.org/wiki/Digital_dark_age
  • 70. References (Books) Delete: The Virtue of Forgetting in the Digital Age, Viktor Mayer- Schönberger
  • 71. References (Images) 1/8 “About me”: all images by Søren Schaffstein “ForgetIT Team” by Søren Schaffstein “The Problem/Knot”: http://www.istockphoto.com/stock- photo-8933647-rope-with-knot.php “1 Dollar”: http://www.istockphoto.com/stock-photo-17830696-fan- dollars-isolated-on-white.php Starbucks Cups: http://5feetonagoodday.files.wordpress.com/ 2012/01/starbucks-coffee-cups-sizes-tall-grande-venti-trenta.jpg
  • 72. References (Images) 2/8 IBM 355: http://www-03.ibm.com/ibm/history/exhibits/storage/ storage_355.html IBM 3330: http://www-03.ibm.com/ibm/history/exhibits/storage/ storage_3330.html Seagate ST-238: http://www.redlop.de/bilder/produkte/gross/ Seagate-WREN-5-ST4702N-702-MB-.png Western Digital WD600AB: http://www.junek.de/thomas/bilder/ WD600AB.jpg
  • 73. References (Images) 3/8 Seagate ST32000542AS: http://bilder.afterbuy.de/images/ZZNLZ/ seagatesata.jpg Finger “Forget”: http://www.istockphoto.com/stock-photo-7252836- string-finger-reminder-on-white.php Memory Buoyancy: http://www.istockphoto.com/stock- photo-16244755-fishing-hook-underwater.php?st=0320b45 Fish: http://www.istockphoto.com/stock-photo-14623368-gold-fish- and-piranha.php
  • 74. References (Images) 4/8 Game pieces by Søren Schaffstein Managed Forgetting: http://www.istockphoto.com/stock- photo-3533508-colorful-memos.php?st=0320b45 Synergetic Preservation: http://www.istockphoto.com/stock- photo-13301920-goldfish-jump.php Contextualised Remembering: http://www.istockphoto.com/stock- photo-14370511-shoebox-of-old-photos-too.php
  • 75. References (Images) 5/8 Cans: http://www.istockphoto.com/stock-photo-16948268-three- metallic-goods-can-with-key.php 5 1/4” Disk: https://secure.flickr.com/photos/twicepix/4330813840/ sizes/z/in/photostream/ 5 1/4” Disk Drawing: https://secure.flickr.com/photos/ flattop341/2094771560/sizes/z/in/photostream/ Ami Pro: http://www.os2museum.com/wp/?attachment_id=99 Digital Dark Age by Søren Schaffstein
  • 76. References (Images) 6/8 Gauges: http://www.istockphoto.com/stock-photo-9059088-old- gauges.php Golf Car: http://www.netzeitung.de/default/337276.html# Golf Car Papers: http://www.motor-talk.de/news/das-heilige-blech- wieder-unterm-hammer-t4421282.html Create: http://hdwallsize.com/wp-content/uploads/2013/04/ Abstract-Art-Wallpaper-Dekstop.jpg
  • 77. References (Images) 7/8 Process by Søren Schaffstein Publish: http://www.istockphoto.com/stock-photo-25712828-british- dog-reading.php?st=e5bf164 Analyse: http://www.istockphoto.com/stock-photo-28297160- laboratory-experimental-testing.php?st=239c76e Archive: http://www.istockphoto.com/stock-photo-18865341-old- wooden-card-catalogue-with-one-opened-drawer.php
  • 78. References (Images) 8/8 Shoes: http://www.istockphoto.com/stock-photo-2457744-what-s- your-walking-style.php?st=e12d3d2 Questions: http://www.istockphoto.com/stock-photo-17686236- decision-making.php