ForgetIT – Some store to remember, some store to forget
Upcoming SlideShare
Loading in...5
×
 

ForgetIT – Some store to remember, some store to forget

on

  • 4,387 views

With growing storage capacities and sinking storage prices, the paradigm of keeping everything is prevailing. However, keeping information accessible, useable and useful goes far beyond purely keeping ...

With growing storage capacities and sinking storage prices, the paradigm of keeping everything is prevailing. However, keeping information accessible, useable and useful goes far beyond purely keeping things, especially in the long run, and entails expenses much larger than just the storage costs. This issue especially applies to content in Content Management Systems where we increasingly face the situation of creating, managing and storing (preserving) multimedia content, which we might never access again due to the pure volume of content.

To overcome these issues, we envision the concept of flexible managed forgetting for information that progressively ceases in importance and finally becomes obsolete as well as for redundant information. We will extend TYPO3 with preservation and forgetting. The forgetting will also reduce the user’s cognitive burden for past activities and information in TYPO3 but still allows access if needed. The same as our brain will retrieve details of our past when remembering and getting associations, the approach will provide such means.

Within the Seventh Framework Programme for Research (FP7) of the European Union the "ForgetIT" project strives to build a solution for the mentioned problems. The project has a scope of 3 years and TYPO3 has been selected as CMS to build upon as it is Open Source Software and has an open and active community.

This talk will give an introduction into digital preservation and why companies can greatly profit from it. The current status of the research project will be demonstrated.

An overview of the project can be found on the projects website (of course made with TYPO3): http://www.forgetit-project.eu/

Statistics

Views

Total Views
4,387
Views on SlideShare
4,380
Embed Views
7

Actions

Likes
1
Downloads
1
Comments
0

1 Embed 7

https://twitter.com 7

Accessibility

Categories

Upload Details

Uploaded via as Adobe PDF

Usage Rights

© All Rights Reserved

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Processing…
Post Comment
Edit your comment

ForgetIT – Some store to remember, some store to forget ForgetIT – Some store to remember, some store to forget Presentation Transcript

  • Some store to remember, some store to forget
  • About me Søren Schaffstein CEO of dkd Internet Service GmbH Frankfurt, Germany
  • The problem What this is all about
  • Storage capacity is ever increasing Prices for storage are falling
  • How large is large?
  • Size references A simple text: an average Wikipedia article ≈ 3.78 kB (no markup) Lots of text: complete Wikipedia ≈ 13.5 GB (text only, no markup) An average image (12MP) ≈ 1.3 MB (JPG 90% quality; 24bit/pixel) An average movie stored on Blu-ray Disc ≈ 25.48 GB
  • 1955 – The IBM 355 Capacity: 12 MB Cost: 6,233.33 USD/MB ✘ 3,250 0 ✘ 9 0 0.16 kB
  • 1970 – The IBM 3330 Capacity: 100 MB Cost: 259.70 USD/MB ✘ 27,089 0 ✘ 76 0 3.94 kB
  • 1988 – Seagate ST-238 Capacity: 30 MB Cost: 9.97 USD/MB ✘ 8,126 0 ✘ 23 0 102.71 kB
  • 2000 – Western Digital WD600AB Capacity: 60 GB Cost: 0.00275 USD/MB 16,644,063 4 47,261 2 363.64 MB
  • 2010 – Seagate ST32000542AS Capacity: 2 TB Cost: 0.0000450 USD/MB ≈ 5 cent/GB 541,798,941 148 1,538,461 76 21.7 GB
  • 2013 – NSA Capacity: ∞ Cost: free ✘ ∞ ∞ ∞ ∞ it’s free :)
  • Cool! Let’s store everything, then!
  • Or, maybe not... There’s a lot more costs Retrieval Maintenance Indexing Updates
  • We need to keep our information Accessible Usable Useful
  • Let’s start to forget! The concept of Memory Buoyancy
  • Memory Buoyancy time memory
  • Memory Buoyancy
  • Memory Buoyancy
  • The ForgetIT Project A short overview
  • ForgetIT project overview Consortium of 11 partners Project start was in February 2013 3 years of research & development http://www.forgetit-project.eu The ForgetIT project is funded by the EC within the 7th Framework Programme under the objective "Digital Preservation" (GA 600826).
  • Project Partners 1/2 Centre for Research and Technology Hellas dkd Internet Service GmbH Deutsches Forschungszentrum für Künstliche Intelligenz GmbH Eurix Srl Gottfried Wilhelm Leibniz Universität Hannover
  • Project Partners 2/2 IBM Israel - Science and Technology Ltd Luleå Tekniska Universitet The Chancellor, Masters and Scholars of the University of Oxford The University of Edinburgh The University of Sheffield Turk Telekomunikasyon AS
  • Inspiring people to share! TYPO3 is the CMS used for the organisational use cases TYPO3 was chosen because it’s Open Source We want to raise awareness on the matter of preservation We will publish our modules under open source licenses
  • ForgetIT core concepts
  • Managed Forgetting
  • Synergetic Preservation
  • Contextualised Remembering
  • Do you preserve?
  • What is preservation? “Preservation — The protection of cultural property through activities that minimize chemical and physical deterioration and damage and that prevent loss of informational content. The primary goal of preservation is to prolong the existence of cultural property.” Preservation 101
  • Problems are caused by storage medium (disks, tapes, DVD, etc.)
  • Problems are caused by storage medium (disks, tapes, DVD, etc.) format of the data
  • Problems are caused by storage medium (disks, tapes, DVD, etc.) format of the data availability of the software or operating system possible encryption
  • Digital Dark Age
  • “The digital dark age is a possible future situation where it will be difficult or impossible to read historical electronic documents and multimedia, because they have been stored in an obsolete and obscure file format.” Wikipedia
  • Preserving a website is not trivial What do want you preserve? Content only? Content and Design? How often? Stock prices vs. Company History page How do you deal with browser differences? How do you preserve functionality? E.g. insurance fee calculator
  • Preservation Value
  • Preservation Value ~ 5,000 €
  • Preservation Value ~ 200,000 €
  • The ForgetIT Use Cases Private Organisational
  • Personal Preservation A personal use case: How to organise an ever growing picture collection
  • Organisational Preservation Typical use cases in the daily work with TYPO3-driven company websites.
  • Organisational Use Cases Digital Asset Management Versioning Archiving a complete Website Individual genres and their specific requirements Example: Press Release
  • Press Release Example An organisational use case
  • Elements of a Press Release text image links documents
  • Meta information Presseinformationen Spielwarenmesse Global Toy Conference Now on Saturday at the Spielwarenmesse * Customised programme for retailers: “How to get your customer into the shop” * Conference will take place for the 5th time in Nuremberg on 1 February 2014 All around the world, retailers are wondering how they can still get their customers in their shops in the age of the Internet – because competition for the sale of consumer goods online is growing dramatically. With the topic “How to Get Customers into Your Shop – Successful Pricing, Presentation and Selling” the Global Toy Conference of the Spielwarenmesse demonstrates what parameters business owners can adjust for the future. The conference will take place for the first time in the St Petersburg hall in the NCC East on Saturday. The new earlier date means that more international retailers can take advantage of the knowledge on offer at the toy industry's leading trade fair – from 9 a.m. to 4 p.m. on 1 February 2014. ...
  • Translations German English
  • Delete Keep Archive Levels of significance legal value Action: keep for legal time present value Action: Keep for x days archive value Action: keep forever trigger value Action: Check significance
  • meta info media
  • meta info media
  • meta info move media copy refer Content Management System
  • meta info media meta info media asset meta info meta info external Digital Asset (DAM) etc. editable content meta info media asset structure (code, users, plugins, extensions, etc.) meta info internal
  • Info Level 4, etc. dynamic Info Level 3 (semi)automatic Info Level 2 static Info Level 1 meta info media meta info meta info Output meta info Archive 1 media asset etc. Archive 2 editable content meta info media asset structure (code, users, plugins, extensions, etc. meta info Delete
  • Info Level 4, etc. dynamic Info Level 3 (semi)automatic Info Level 2 static Info Level 1 meta info media meta info meta info Output meta info Archive 1 media asset etc. Archive 2 editable content meta info media asset structure (code, users, plugins, extensions, etc. meta info Delete
  • Info Level 4, etc. dynamic Info Level 3 (semi)automatic Info Level 2 static Info Level 1 meta info media meta info meta info Output meta info Archive 1 media asset etc. Archive 2 editable content meta info media asset structure (code, users, plugins, extensions, etc. meta info Delete
  • Info Level 4, etc. dynamic Info Level 3 (semi)automatic Info Level 2 static Info Level 1 meta info media meta info meta info Output meta info Archive 1 media asset etc. Archive 2 editable content meta info media asset structure (code, users, plugins, extensions, etc. meta info Delete
  • Info Level 4, etc. dynamic Info Level 3 (semi)automatic Info Level 2 static Info Level 1 meta info media meta info meta info Output meta info Archive 1 media asset etc. Archive 2 editable content meta info media asset structure (code, users, plugins, extensions, etc. meta info Delete
  • T-CM (Todays Content Management) F-CM (Future Content Management) L4 L4 L3 L3 L2 L2 L1 L1 Retrieve Service Archive 1 Archive 2 Delete
  • The Information Lifecycle of a press release
  • Information Lifecycle Collect Create Process Publish Analyse Archive
  • Collect
  • Create
  • Process
  • Publish
  • Analyse
  • Archive
  • Information Lifecycle Annotations Collect Create Process Publish Analyse Archive
  • Example Press Release Annotation (text) Annotation (image) global toy conference, conference, podium, speaker, lights
  • Do you remember? A game about forgetting.
  • Do you remember the details? Which ocean was the ForgetIT Team examining?
  • Do you remember the details? Which ocean was the ForgetIT Team examining? Mediterranean Sea
  • Do you remember the details? Which ocean was the ForgetIT Team examining? Mediterranean Sea How many people of the ForgetIT Team were carrying a bag?
  • Do you remember the details? Which ocean was the ForgetIT Team examining? Mediterranean Sea How many people of the ForgetIT Team were carrying a bag?
  • Do you remember the details? Which ocean was the ForgetIT Team examining? Mediterranean Sea How many people of the ForgetIT Team were carrying a bag? How many barcodes are on the Western Digital WD600AB?
  • Do you remember the details? Which ocean was the ForgetIT Team examining? Mediterranean Sea How many people of the ForgetIT Team were carrying a bag? How many barcodes are on the Western Digital WD600AB?
  • Do you remember the details? Which ocean was the ForgetIT Team examining? Mediterranean Sea How many people of the ForgetIT Team were carrying a bag? How many barcodes are on the Western Digital WD600AB? How many pictures in the shoebox image are mostly blue?
  • Do you remember the details? Which ocean was the ForgetIT Team examining? Mediterranean Sea How many people of the ForgetIT Team were carrying a bag? How many barcodes are on the Western Digital WD600AB? How many pictures in the shoebox image are mostly blue?
  • Next steps or how you can participate
  • We’d love to see you participate! Reflect your thoughts with us Take our short survey: http://tinyurl.com/forgetit-webarchiving Tell us your use cases Join the development of TYPO3 features
  • Don’t forget them! We’d love to discuss them with you ... and a beer or two...
  • Thank you for your attention!
  • References Sources, Books, Images
  • References (Sources) 1/2 Size of Wikipedia (as of 2013-10-04): https://en.wikipedia.org/wiki/ Wikipedia:Size_comparisons Average JPG size: http://web.forret.com/tools/megapixel.asp? title=12+Megapixel+camera&width=4000&height=3000 Average movie size: http://answers.yahoo.com/question/index? qid=20110807095141AABGQm8 Storage Prices: http://www.jcmit.com/diskprice.htm
  • References (Sources) 2/2 Forget IT Website: http://www.forgetit-project.eu Preservation: http://unfacilitated.preservation101.org/session1/ expl_whatis-definitions.asp Digital Dark Age: https://en.wikipedia.org/wiki/Digital_dark_age
  • References (Books) Delete: The Virtue of Forgetting in the Digital Age, Viktor MayerSchönberger
  • References (Images) 1/8 “About me”: all images by Søren Schaffstein “ForgetIT Team” by Søren Schaffstein “The Problem/Knot”: http://www.istockphoto.com/stockphoto-8933647-rope-with-knot.php “1 Dollar”: http://www.istockphoto.com/stock-photo-17830696-fandollars-isolated-on-white.php Starbucks Cups: http://5feetonagoodday.files.wordpress.com/ 2012/01/starbucks-coffee-cups-sizes-tall-grande-venti-trenta.jpg
  • References (Images) 2/8 IBM 355: http://www-03.ibm.com/ibm/history/exhibits/storage/ storage_355.html IBM 3330: http://www-03.ibm.com/ibm/history/exhibits/storage/ storage_3330.html Seagate ST-238: http://www.redlop.de/bilder/produkte/gross/ Seagate-WREN-5-ST4702N-702-MB-.png Western Digital WD600AB: http://www.junek.de/thomas/bilder/ WD600AB.jpg
  • References (Images) 3/8 Seagate ST32000542AS: http://bilder.afterbuy.de/images/ZZNLZ/ seagatesata.jpg Finger “Forget”: http://www.istockphoto.com/stock-photo-7252836string-finger-reminder-on-white.php Memory Buoyancy: http://www.istockphoto.com/stockphoto-16244755-fishing-hook-underwater.php?st=0320b45 Fish: http://www.istockphoto.com/stock-photo-14623368-gold-fishand-piranha.php
  • References (Images) 4/8 Game pieces by Søren Schaffstein Managed Forgetting: http://www.istockphoto.com/stockphoto-3533508-colorful-memos.php?st=0320b45 Synergetic Preservation: http://www.istockphoto.com/stockphoto-13301920-goldfish-jump.php Contextualised Remembering: http://www.istockphoto.com/stockphoto-14370511-shoebox-of-old-photos-too.php
  • References (Images) 5/8 Cans: http://www.istockphoto.com/stock-photo-16948268-threemetallic-goods-can-with-key.php 5 1/4” Disk: https://secure.flickr.com/photos/twicepix/4330813840/ sizes/z/in/photostream/ 5 1/4” Disk Drawing: https://secure.flickr.com/photos/ flattop341/2094771560/sizes/z/in/photostream/ Ami Pro: http://www.os2museum.com/wp/?attachment_id=99 Digital Dark Age by Søren Schaffstein
  • References (Images) 6/8 Gauges: http://www.istockphoto.com/stock-photo-9059088-oldgauges.php Golf Car: http://www.netzeitung.de/default/337276.html# Golf Car Papers: http://www.motor-talk.de/news/das-heilige-blechwieder-unterm-hammer-t4421282.html Create: http://hdwallsize.com/wp-content/uploads/2013/04/ Abstract-Art-Wallpaper-Dekstop.jpg
  • References (Images) 7/8 Process by Søren Schaffstein Publish: http://www.istockphoto.com/stock-photo-25712828-britishdog-reading.php?st=e5bf164 Analyse: http://www.istockphoto.com/stock-photo-28297160laboratory-experimental-testing.php?st=239c76e Archive: http://www.istockphoto.com/stock-photo-18865341-oldwooden-card-catalogue-with-one-opened-drawer.php
  • References (Images) 8/8 Shoes: http://www.istockphoto.com/stock-photo-2457744-what-syour-walking-style.php?st=e12d3d2 Questions: http://www.istockphoto.com/stock-photo-17686236decision-making.php