ForgetIT – Some store to remember, some store to forget

6,756 views

Published on

With growing storage capacities and sinking storage prices, the paradigm of keeping everything is prevailing. However, keeping information accessible, useable and useful goes far beyond purely keeping things, especially in the long run, and entails expenses much larger than just the storage costs. This issue especially applies to content in Content Management Systems where we increasingly face the situation of creating, managing and storing (preserving) multimedia content, which we might never access again due to the pure volume of content.

To overcome these issues, we envision the concept of flexible managed forgetting for information that progressively ceases in importance and finally becomes obsolete as well as for redundant information. We will extend TYPO3 with preservation and forgetting. The forgetting will also reduce the user’s cognitive burden for past activities and information in TYPO3 but still allows access if needed. The same as our brain will retrieve details of our past when remembering and getting associations, the approach will provide such means.

Within the Seventh Framework Programme for Research (FP7) of the European Union the "ForgetIT" project strives to build a solution for the mentioned problems. The project has a scope of 3 years and TYPO3 has been selected as CMS to build upon as it is Open Source Software and has an open and active community.

This talk will give an introduction into digital preservation and why companies can greatly profit from it. The current status of the research project will be demonstrated.

An overview of the project can be found on the projects website (of course made with TYPO3): http://www.forgetit-project.eu/

Published in: Technology
0 Comments
1 Like
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total views
6,756
On SlideShare
0
From Embeds
0
Number of Embeds
474
Actions
Shares
0
Downloads
7
Comments
0
Likes
1
Embeds 0
No embeds

No notes for slide

ForgetIT – Some store to remember, some store to forget

  1. 1. Some store to remember, some store to forget
  2. 2. About me Søren Schaffstein CEO of dkd Internet Service GmbH Frankfurt, Germany
  3. 3. The problem What this is all about
  4. 4. Storage capacity is ever increasing Prices for storage are falling
  5. 5. How large is large?
  6. 6. Size references A simple text: an average Wikipedia article ≈ 3.78 kB (no markup) Lots of text: complete Wikipedia ≈ 13.5 GB (text only, no markup) An average image (12MP) ≈ 1.3 MB (JPG 90% quality; 24bit/pixel) An average movie stored on Blu-ray Disc ≈ 25.48 GB
  7. 7. 1955 – The IBM 355 Capacity: 12 MB Cost: 6,233.33 USD/MB ✘ 3,250 0 ✘ 9 0 0.16 kB
  8. 8. 1970 – The IBM 3330 Capacity: 100 MB Cost: 259.70 USD/MB ✘ 27,089 0 ✘ 76 0 3.94 kB
  9. 9. 1988 – Seagate ST-238 Capacity: 30 MB Cost: 9.97 USD/MB ✘ 8,126 0 ✘ 23 0 102.71 kB
  10. 10. 2000 – Western Digital WD600AB Capacity: 60 GB Cost: 0.00275 USD/MB 16,644,063 4 47,261 2 363.64 MB
  11. 11. 2010 – Seagate ST32000542AS Capacity: 2 TB Cost: 0.0000450 USD/MB ≈ 5 cent/GB 541,798,941 148 1,538,461 76 21.7 GB
  12. 12. 2013 – NSA Capacity: ∞ Cost: free ✘ ∞ ∞ ∞ ∞ it’s free :)
  13. 13. Cool! Let’s store everything, then!
  14. 14. Or, maybe not... There’s a lot more costs Retrieval Maintenance Indexing Updates
  15. 15. We need to keep our information Accessible Usable Useful
  16. 16. Let’s start to forget! The concept of Memory Buoyancy
  17. 17. Memory Buoyancy time memory
  18. 18. Memory Buoyancy
  19. 19. Memory Buoyancy
  20. 20. The ForgetIT Project A short overview
  21. 21. ForgetIT project overview Consortium of 11 partners Project start was in February 2013 3 years of research & development http://www.forgetit-project.eu The ForgetIT project is funded by the EC within the 7th Framework Programme under the objective "Digital Preservation" (GA 600826).
  22. 22. Project Partners 1/2 Centre for Research and Technology Hellas dkd Internet Service GmbH Deutsches Forschungszentrum für Künstliche Intelligenz GmbH Eurix Srl Gottfried Wilhelm Leibniz Universität Hannover
  23. 23. Project Partners 2/2 IBM Israel - Science and Technology Ltd Luleå Tekniska Universitet The Chancellor, Masters and Scholars of the University of Oxford The University of Edinburgh The University of Sheffield Turk Telekomunikasyon AS
  24. 24. Inspiring people to share! TYPO3 is the CMS used for the organisational use cases TYPO3 was chosen because it’s Open Source We want to raise awareness on the matter of preservation We will publish our modules under open source licenses
  25. 25. ForgetIT core concepts
  26. 26. Managed Forgetting
  27. 27. Synergetic Preservation
  28. 28. Contextualised Remembering
  29. 29. Do you preserve?
  30. 30. What is preservation? “Preservation — The protection of cultural property through activities that minimize chemical and physical deterioration and damage and that prevent loss of informational content. The primary goal of preservation is to prolong the existence of cultural property.” Preservation 101
  31. 31. Problems are caused by storage medium (disks, tapes, DVD, etc.)
  32. 32. Problems are caused by storage medium (disks, tapes, DVD, etc.) format of the data
  33. 33. Problems are caused by storage medium (disks, tapes, DVD, etc.) format of the data availability of the software or operating system possible encryption
  34. 34. Digital Dark Age
  35. 35. “The digital dark age is a possible future situation where it will be difficult or impossible to read historical electronic documents and multimedia, because they have been stored in an obsolete and obscure file format.” Wikipedia
  36. 36. Preserving a website is not trivial What do want you preserve? Content only? Content and Design? How often? Stock prices vs. Company History page How do you deal with browser differences? How do you preserve functionality? E.g. insurance fee calculator
  37. 37. Preservation Value
  38. 38. Preservation Value ~ 5,000 €
  39. 39. Preservation Value ~ 200,000 €
  40. 40. The ForgetIT Use Cases Private Organisational
  41. 41. Personal Preservation A personal use case: How to organise an ever growing picture collection
  42. 42. Organisational Preservation Typical use cases in the daily work with TYPO3-driven company websites.
  43. 43. Organisational Use Cases Digital Asset Management Versioning Archiving a complete Website Individual genres and their specific requirements Example: Press Release
  44. 44. Press Release Example An organisational use case
  45. 45. Elements of a Press Release text image links documents
  46. 46. Meta information Presseinformationen Spielwarenmesse Global Toy Conference Now on Saturday at the Spielwarenmesse * Customised programme for retailers: “How to get your customer into the shop” * Conference will take place for the 5th time in Nuremberg on 1 February 2014 All around the world, retailers are wondering how they can still get their customers in their shops in the age of the Internet – because competition for the sale of consumer goods online is growing dramatically. With the topic “How to Get Customers into Your Shop – Successful Pricing, Presentation and Selling” the Global Toy Conference of the Spielwarenmesse demonstrates what parameters business owners can adjust for the future. The conference will take place for the first time in the St Petersburg hall in the NCC East on Saturday. The new earlier date means that more international retailers can take advantage of the knowledge on offer at the toy industry's leading trade fair – from 9 a.m. to 4 p.m. on 1 February 2014. ...
  47. 47. Translations German English
  48. 48.
  49. 49. Delete Keep Archive Levels of significance legal value Action: keep for legal time present value Action: Keep for x days archive value Action: keep forever trigger value Action: Check significance
  50. 50. meta info media
  51. 51. meta info media
  52. 52. meta info move media copy refer Content Management System
  53. 53. meta info media meta info media asset meta info meta info external Digital Asset (DAM) etc. editable content meta info media asset structure (code, users, plugins, extensions, etc.) meta info internal
  54. 54. Info Level 4, etc. dynamic Info Level 3 (semi)automatic Info Level 2 static Info Level 1 meta info media meta info meta info Output meta info Archive 1 media asset etc. Archive 2 editable content meta info media asset structure (code, users, plugins, extensions, etc. meta info Delete
  55. 55. Info Level 4, etc. dynamic Info Level 3 (semi)automatic Info Level 2 static Info Level 1 meta info media meta info meta info Output meta info Archive 1 media asset etc. Archive 2 editable content meta info media asset structure (code, users, plugins, extensions, etc. meta info Delete
  56. 56. Info Level 4, etc. dynamic Info Level 3 (semi)automatic Info Level 2 static Info Level 1 meta info media meta info meta info Output meta info Archive 1 media asset etc. Archive 2 editable content meta info media asset structure (code, users, plugins, extensions, etc. meta info Delete
  57. 57. Info Level 4, etc. dynamic Info Level 3 (semi)automatic Info Level 2 static Info Level 1 meta info media meta info meta info Output meta info Archive 1 media asset etc. Archive 2 editable content meta info media asset structure (code, users, plugins, extensions, etc. meta info Delete
  58. 58. Info Level 4, etc. dynamic Info Level 3 (semi)automatic Info Level 2 static Info Level 1 meta info media meta info meta info Output meta info Archive 1 media asset etc. Archive 2 editable content meta info media asset structure (code, users, plugins, extensions, etc. meta info Delete
  59. 59. T-CM (Todays Content Management) F-CM (Future Content Management) L4 L4 L3 L3 L2 L2 L1 L1 Retrieve Service Archive 1 Archive 2 Delete
  60. 60. The Information Lifecycle of a press release
  61. 61. Information Lifecycle Collect Create Process Publish Analyse Archive
  62. 62. Collect
  63. 63. Create
  64. 64. Process
  65. 65. Publish
  66. 66. Analyse
  67. 67. Archive
  68. 68. Information Lifecycle Annotations Collect Create Process Publish Analyse Archive
  69. 69. Example Press Release Annotation (text) Annotation (image) global toy conference, conference, podium, speaker, lights
  70. 70. Do you remember? A game about forgetting.
  71. 71. Do you remember the details? Which ocean was the ForgetIT Team examining?
  72. 72. Do you remember the details? Which ocean was the ForgetIT Team examining? Mediterranean Sea
  73. 73. Do you remember the details? Which ocean was the ForgetIT Team examining? Mediterranean Sea How many people of the ForgetIT Team were carrying a bag?
  74. 74. Do you remember the details? Which ocean was the ForgetIT Team examining? Mediterranean Sea How many people of the ForgetIT Team were carrying a bag?
  75. 75. Do you remember the details? Which ocean was the ForgetIT Team examining? Mediterranean Sea How many people of the ForgetIT Team were carrying a bag? How many barcodes are on the Western Digital WD600AB?
  76. 76. Do you remember the details? Which ocean was the ForgetIT Team examining? Mediterranean Sea How many people of the ForgetIT Team were carrying a bag? How many barcodes are on the Western Digital WD600AB?
  77. 77. Do you remember the details? Which ocean was the ForgetIT Team examining? Mediterranean Sea How many people of the ForgetIT Team were carrying a bag? How many barcodes are on the Western Digital WD600AB? How many pictures in the shoebox image are mostly blue?
  78. 78. Do you remember the details? Which ocean was the ForgetIT Team examining? Mediterranean Sea How many people of the ForgetIT Team were carrying a bag? How many barcodes are on the Western Digital WD600AB? How many pictures in the shoebox image are mostly blue?
  79. 79. Next steps or how you can participate
  80. 80. We’d love to see you participate! Reflect your thoughts with us Take our short survey: http://tinyurl.com/forgetit-webarchiving Tell us your use cases Join the development of TYPO3 features
  81. 81. Don’t forget them! We’d love to discuss them with you ... and a beer or two...
  82. 82. Thank you for your attention!
  83. 83. References Sources, Books, Images
  84. 84. References (Sources) 1/2 Size of Wikipedia (as of 2013-10-04): https://en.wikipedia.org/wiki/ Wikipedia:Size_comparisons Average JPG size: http://web.forret.com/tools/megapixel.asp? title=12+Megapixel+camera&width=4000&height=3000 Average movie size: http://answers.yahoo.com/question/index? qid=20110807095141AABGQm8 Storage Prices: http://www.jcmit.com/diskprice.htm
  85. 85. References (Sources) 2/2 Forget IT Website: http://www.forgetit-project.eu Preservation: http://unfacilitated.preservation101.org/session1/ expl_whatis-definitions.asp Digital Dark Age: https://en.wikipedia.org/wiki/Digital_dark_age
  86. 86. References (Books) Delete: The Virtue of Forgetting in the Digital Age, Viktor MayerSchönberger
  87. 87. References (Images) 1/8 “About me”: all images by Søren Schaffstein “ForgetIT Team” by Søren Schaffstein “The Problem/Knot”: http://www.istockphoto.com/stockphoto-8933647-rope-with-knot.php “1 Dollar”: http://www.istockphoto.com/stock-photo-17830696-fandollars-isolated-on-white.php Starbucks Cups: http://5feetonagoodday.files.wordpress.com/ 2012/01/starbucks-coffee-cups-sizes-tall-grande-venti-trenta.jpg
  88. 88. References (Images) 2/8 IBM 355: http://www-03.ibm.com/ibm/history/exhibits/storage/ storage_355.html IBM 3330: http://www-03.ibm.com/ibm/history/exhibits/storage/ storage_3330.html Seagate ST-238: http://www.redlop.de/bilder/produkte/gross/ Seagate-WREN-5-ST4702N-702-MB-.png Western Digital WD600AB: http://www.junek.de/thomas/bilder/ WD600AB.jpg
  89. 89. References (Images) 3/8 Seagate ST32000542AS: http://bilder.afterbuy.de/images/ZZNLZ/ seagatesata.jpg Finger “Forget”: http://www.istockphoto.com/stock-photo-7252836string-finger-reminder-on-white.php Memory Buoyancy: http://www.istockphoto.com/stockphoto-16244755-fishing-hook-underwater.php?st=0320b45 Fish: http://www.istockphoto.com/stock-photo-14623368-gold-fishand-piranha.php
  90. 90. References (Images) 4/8 Game pieces by Søren Schaffstein Managed Forgetting: http://www.istockphoto.com/stockphoto-3533508-colorful-memos.php?st=0320b45 Synergetic Preservation: http://www.istockphoto.com/stockphoto-13301920-goldfish-jump.php Contextualised Remembering: http://www.istockphoto.com/stockphoto-14370511-shoebox-of-old-photos-too.php
  91. 91. References (Images) 5/8 Cans: http://www.istockphoto.com/stock-photo-16948268-threemetallic-goods-can-with-key.php 5 1/4” Disk: https://secure.flickr.com/photos/twicepix/4330813840/ sizes/z/in/photostream/ 5 1/4” Disk Drawing: https://secure.flickr.com/photos/ flattop341/2094771560/sizes/z/in/photostream/ Ami Pro: http://www.os2museum.com/wp/?attachment_id=99 Digital Dark Age by Søren Schaffstein
  92. 92. References (Images) 6/8 Gauges: http://www.istockphoto.com/stock-photo-9059088-oldgauges.php Golf Car: http://www.netzeitung.de/default/337276.html# Golf Car Papers: http://www.motor-talk.de/news/das-heilige-blechwieder-unterm-hammer-t4421282.html Create: http://hdwallsize.com/wp-content/uploads/2013/04/ Abstract-Art-Wallpaper-Dekstop.jpg
  93. 93. References (Images) 7/8 Process by Søren Schaffstein Publish: http://www.istockphoto.com/stock-photo-25712828-britishdog-reading.php?st=e5bf164 Analyse: http://www.istockphoto.com/stock-photo-28297160laboratory-experimental-testing.php?st=239c76e Archive: http://www.istockphoto.com/stock-photo-18865341-oldwooden-card-catalogue-with-one-opened-drawer.php
  94. 94. References (Images) 8/8 Shoes: http://www.istockphoto.com/stock-photo-2457744-what-syour-walking-style.php?st=e12d3d2 Questions: http://www.istockphoto.com/stock-photo-17686236decision-making.php

×