SlideShare a Scribd company logo
hoard.it : Stealing your data
Or... “Where is your online value?”
Or... “Originality sucks”
Dan Zambonini
www.boxuk.com

Museums and the Web 2009, Indianapolis, April 16
WARNING
WARNING
1. I am playing Devil’s Advocate

2. These are‘thoughts in progress’
Introduction
1. The hoard.it project

2. Museums and the Web:
   where’s the value?
Introduction
1. The hoard.it project

2. Museums and the Web:
   where’s the value?
2.5 - 15%
2.5 - 15%
Cross-Collections Projects

  “Search through the cultural collections of Europe”



            “explore and comment on collections”


     “find and explore digital collections from museums”


                   “Discover cultural objects, collections”
Why is this a Problem?
1. Some duplication of effort
  • £25,000 - £100,000 to put collections online
  • £1,500 - £6,500 per cross-collection project
2. Potential end-user confusion
3. Usually only include larger institutions
4. Is there really a need?
Our Approach
• Use data that already exists
   • No cost/duplication of effort
• No input or changes from museums
   • Lightweight, open to all
• Re-expose the data programmatically
   • Enable easy re-use
How it works
Screen-Scraper + Spider
How it works
Screen-Scraper + Spider
How it works
Screen-Scraper + Spider
Difficulties and Limitations
•   Must have collections online
•   Must have a consistent template
•   Slow; not real-time
•   Technical variations (encoding, standards)
•   Rudimentary: Flash/Forms a barrier
Difficulties: Normalization
•   Dates
    •   circa 19th century, 1960s, 2008-01, 1Jan ’52, 2000 BC, 30s, April 4 1934,
        04-76, 1783-25-04, 10-11-64, about 200 AD, Victorian, 1100-1150, ...

    •   http://feeds.boxuk.com/convert/date/


•   Location
    •   Points of interest, cities, towns, countries, administrative regions, political
        regions, ancient names, continents, postal codes, co-ordinates, ...

    •   http://developer.yahoo.com/geo/
The Data
   Virtual Museum of Canada!

     Carnegie Museum of Art!

          Smithsonian NASM!

 National Museum of Australia!

      National Portrait Gallery!

        Imperial War Museum!

National Museums of Scotland!

                     Ingenious!

  Museum of London: E20CL!

               British Museum!

  Victoria and Albert Museum!

   National Maritime Museum!

                  Powerhouse!

             Science Museum!

             24 Hour Museum!

            Freebase: Events!

    Wikipedia: List of Painters!

                                   0!   2000!   4000!   6000!   8000!   10000!   12000!   14000!   16000!
The Data
   Virtual Museum of Canada!

     Carnegie Museum of Art!

          Smithsonian NASM!

 National Museum of Australia!

      National Portrait Gallery!

        Imperial War Museum!

National Museums of Scotland!

                     Ingenious!

  Museum of London: E20CL!

               British Museum!

  Victoria and Albert Museum!

   National Maritime Museum!

                  Powerhouse!

             Science Museum!

             24 Hour Museum!

            Freebase: Events!

    Wikipedia: List of Painters!

                                   0!   2000!   4000!   6000!   8000!   10000!   12000!   14000!   16000!


                                                                            70,000 objects
The Data
 • URL            100%
 • Identifier     95%
 • Title          100%
 • Description    70%
 • Image          85%
 • Creator        50%
 • Created Date   75%
 • Copyright      50%
 • Dimensions     45%
 • Subject        65%
 • Location       45%
 • Materials      65%
Data Mining - Location
                                       65%   Europe
                                       15%   Asia
                                       14%   North America
                                       4%    Oceania




Percentage of objects from the same continent as museum:

• North America: 85%
• Europe:        75%
• Oceania:       65%
% of objects by continent of origin!




             0!
                  10!
                        20!
                                  30!
                                          40!
                                                  50!
                                                          60!
                                                                     70!
                                                                           80!
                                                                                 90!
        -1000!
         -900!
         -800!
         -700!
         -600!
         -500!
         -400!
         -300!
         -200!
         -100!
            0!
          100!
          200!
          300!
          400!
          500!




Year!
          600!
          700!
          800!
          900!
         1000!
         1100!
         1200!
         1300!
         1400!
         1500!
         1600!
         1700!
         1800!
         1900!
         2000!
                                 Asia!
                                 Africa!
                                 Europe!
                                 Oceania!
                                 North America!
                                 South America!
                                                                                       Data Mining - Date/Location
% of objects by material!




                      0!
                           5!
                                10!
                                                15!
                                                                  20!
                                                                            25!
                                                                                  30!
                                                                                        35!
                                                                                              40!
              0!
         10
              0!
         20
              0!
         30
              0!
         40
              0!
         50
              0!
         60
              0!
         70
              0!
         80
              0!
         90
              0!
        10
          00
               !




Year!
        11
           0  0!
        12
             00
                  !
        13
             00
                  !
        14
             00
                  !
        15
          00
               !
        16
             00
                  !
        17
             00
                  !
        18
             00
                  !
        19
             00
                  !
        20
             00
                  !
                                                          Clay!

                                                  Gold!

                                      Silver!
                                                                   Stone!
                                                                                                    Data Mining - Date/Material
How it has been used
•   Experiments: http://hoard.it/labs/




•   UK Museums on the
    Web 2008 Hack Day


•   Who knows...?
                                         Photo courtesy of Brian Kelly
How it has been used
Next steps...
Next steps...


 ABSOLUTELY
  NOTHING
Do you offer anything?
dbPedia, Freebase
What can you offer?
•   Expertise
•   Media
•   The Physical Space
•   Reputation and Trust
•   Audience
•   Voice, Exposure and Influence
What’s changed?
“...not all information should flow everywhere; only the
meaningful should be transmitted.

But in the network economy only signals in real time (or
close to it) are truly meaningful.

Examine the speed of knowledge in your system. How
can it be brought closer to real time? If this requires the
cooperation of subcontractors, distant partners, and far-
flung customers, so much the better.”

Kevin Kelly
http://www.kk.org/newrules/blog/2009/04/if-you-are-not-in-real-time-yo.php
What’s changed?


                  !quot;#$%#$&
!quot;#$%&




                  '($(&
                  )%*+,-%.&




          '()%&
What’s changed?
What’s changed?

 EXECUTION
    not
   IDEAS
What’s changed?

              !quot;#$%&'()
              *+#,)




                      !quot;#$%&'(
                      )*#+%$%&'(
                      ,--.**%+%$&'(
                      /0.(1%20&(3.#"4.*(
                      5.*%26(
UK Newspaper Example
                                ,-./012345quot;
                                 #!quot;
                                  +quot;
                                  *quot;
                F44:G2.:=quot;                        6278925:quot;
                                  )quot;
                                  (quot;
                                  'quot;
                                                                               H2-1Iquot;JKL.8==quot;
                                  "
                                                                               H2-1Iquot;A2-1quot;
                                  %quot;
                                                                               H2-1Iquot;A-..4.quot;
                                  $quot;
                                                                               H2-1Iquot;CM2.quot;
                                  #quot;
                                                                               H2-1Iquot;>8187.2LBquot;
                                  !quot;
D5-E08quot;D=8.=quot;                                                 ;2/8<44:quot;;25=quot;
                                                                               ;-525/-21quot;>-G8=quot;
                                                                               >B8quot;N02.O-25quot;
                                                                               >B8quot;P5O8L85O85Mquot;
                                                                               >B8quot;C05quot;
                                                                               >B8quot;>-G8=quot;



         9CCquot;C0<=/.-<8.=quot;                         >?-@8.quot;;4114?8.=quot;




                             A85345=quot;-5quot;$&quot;B.=quot;
For example
•   Let your patrons collaborate
•   Let your patrons run your space
•   Give local communities a voice
•   Provide advice and guidance
•   Collect & distribute niche knowledge
•   ...


•   You know better than I do.
What has to change?
•   A focus on proven user needs
•   Re-usable services, not more data
•   Smaller projects
•   Iterative approaches
•   A real commitment to the web platform
•   (At least some) In-house development
How do we get there?
•   Should web projects generate revenue?
•   Don’t be afraid of re-inventing the wheel
•   Demand all projects use/expose APIs that
    are easy (REST not SOAP/OAI) and publicized
•   Show early, show often
•   Annoy funding bodies to support more,
    smaller, longer (i.e. iterative) ‘boring’ projects,
    and less ‘big, audacious’ projects.
Summary
•   We stole your data...
•   But then so are lots of other people...
•   So produce value elsewhere.


•   Ideas are harmful: do what’s proven...
•   But do it brilliantly.
•   And to do that, we need change.
Thank you
      www.boxuk.com


      dan@boxuk.com


    twitter.com/zambonini
Thank you
      www.boxuk.com


      dan@boxuk.com


    twitter.com/zambonini

More Related Content

Viewers also liked

Matéria "Viagem ao Interior"
Matéria "Viagem ao Interior"Matéria "Viagem ao Interior"
Matéria "Viagem ao Interior"
Asyst News
 
Saudi waste (recycling, energy, composte, water) solution
Saudi waste (recycling, energy, composte, water) solutionSaudi waste (recycling, energy, composte, water) solution
Saudi waste (recycling, energy, composte, water) solution
Brandon Dooley
 
Practica colas (if, else)
Practica colas (if, else)Practica colas (if, else)
Practica colas (if, else)
Eli Diaz
 

Viewers also liked (11)

Intranet y sus beneficios
Intranet y sus beneficiosIntranet y sus beneficios
Intranet y sus beneficios
 
12 san francisco museum of modern art
12 san francisco museum of modern art12 san francisco museum of modern art
12 san francisco museum of modern art
 
Matéria "Viagem ao Interior"
Matéria "Viagem ao Interior"Matéria "Viagem ao Interior"
Matéria "Viagem ao Interior"
 
Rob Stein, Charles Moad, Ed Bachta, Museums and Cloud Computing ...
Rob Stein, Charles Moad, Ed Bachta, Museums and Cloud Computing  ...Rob Stein, Charles Moad, Ed Bachta, Museums and Cloud Computing  ...
Rob Stein, Charles Moad, Ed Bachta, Museums and Cloud Computing ...
 
Saudi waste (recycling, energy, composte, water) solution
Saudi waste (recycling, energy, composte, water) solutionSaudi waste (recycling, energy, composte, water) solution
Saudi waste (recycling, energy, composte, water) solution
 
Digital Storytelling - Palestra Santa Maria (19/11/16)
Digital Storytelling - Palestra Santa Maria (19/11/16)Digital Storytelling - Palestra Santa Maria (19/11/16)
Digital Storytelling - Palestra Santa Maria (19/11/16)
 
PicNic no Monet
PicNic no MonetPicNic no Monet
PicNic no Monet
 
Practica colas (if, else)
Practica colas (if, else)Practica colas (if, else)
Practica colas (if, else)
 
Kittim silva-manual-practico-de-homilitica(1)
Kittim silva-manual-practico-de-homilitica(1)Kittim silva-manual-practico-de-homilitica(1)
Kittim silva-manual-practico-de-homilitica(1)
 
Proyecto Final
Proyecto FinalProyecto Final
Proyecto Final
 
Inspiring Shopper Behaviours
Inspiring Shopper BehavioursInspiring Shopper Behaviours
Inspiring Shopper Behaviours
 

Similar to Dan Zambonini and Mike Ellis, hoard.it: Aggregating, displaying and mining object-data without consent

TBF 2011- Ezequiel Singer: "Google Workshop"
TBF 2011- Ezequiel Singer: "Google Workshop"TBF 2011- Ezequiel Singer: "Google Workshop"
TBF 2011- Ezequiel Singer: "Google Workshop"
Karla Witte
 
Transforming Fuels and Vehicle Technology - Drew Kodjak - ICCT - Transforming...
Transforming Fuels and Vehicle Technology - Drew Kodjak - ICCT - Transforming...Transforming Fuels and Vehicle Technology - Drew Kodjak - ICCT - Transforming...
Transforming Fuels and Vehicle Technology - Drew Kodjak - ICCT - Transforming...
WRI Ross Center for Sustainable Cities
 
B2 B Channel Newsletter Q4 2009
B2 B Channel Newsletter Q4 2009B2 B Channel Newsletter Q4 2009
B2 B Channel Newsletter Q4 2009
adminfbgroup
 
B2 B Channel Newsletter Q4 2009
B2 B Channel Newsletter Q4 2009B2 B Channel Newsletter Q4 2009
B2 B Channel Newsletter Q4 2009
guest3117009
 

Similar to Dan Zambonini and Mike Ellis, hoard.it: Aggregating, displaying and mining object-data without consent (9)

RTFM (Read The Factual Mails) --Augmenting Program Comprehension with REmail
RTFM (Read The Factual Mails) --Augmenting Program Comprehension with REmailRTFM (Read The Factual Mails) --Augmenting Program Comprehension with REmail
RTFM (Read The Factual Mails) --Augmenting Program Comprehension with REmail
 
The Future of The Web: Transmission TX2 Talk
The Future of The Web: Transmission TX2 TalkThe Future of The Web: Transmission TX2 Talk
The Future of The Web: Transmission TX2 Talk
 
Cloud computing
Cloud computingCloud computing
Cloud computing
 
TBF 2011- Ezequiel Singer: "Google Workshop"
TBF 2011- Ezequiel Singer: "Google Workshop"TBF 2011- Ezequiel Singer: "Google Workshop"
TBF 2011- Ezequiel Singer: "Google Workshop"
 
Transforming Fuels and Vehicle Technology - Drew Kodjak - ICCT - Transforming...
Transforming Fuels and Vehicle Technology - Drew Kodjak - ICCT - Transforming...Transforming Fuels and Vehicle Technology - Drew Kodjak - ICCT - Transforming...
Transforming Fuels and Vehicle Technology - Drew Kodjak - ICCT - Transforming...
 
ORF2011「学びの対話ワークショップ:クリエイティブ・ラーニングと人材育成」
ORF2011「学びの対話ワークショップ:クリエイティブ・ラーニングと人材育成」ORF2011「学びの対話ワークショップ:クリエイティブ・ラーニングと人材育成」
ORF2011「学びの対話ワークショップ:クリエイティブ・ラーニングと人材育成」
 
Jane Jacobs and the Voice of the Monstrous Hybrid
Jane Jacobs and the Voice of the Monstrous Hybrid Jane Jacobs and the Voice of the Monstrous Hybrid
Jane Jacobs and the Voice of the Monstrous Hybrid
 
B2 B Channel Newsletter Q4 2009
B2 B Channel Newsletter Q4 2009B2 B Channel Newsletter Q4 2009
B2 B Channel Newsletter Q4 2009
 
B2 B Channel Newsletter Q4 2009
B2 B Channel Newsletter Q4 2009B2 B Channel Newsletter Q4 2009
B2 B Channel Newsletter Q4 2009
 

More from museums and the web

MW2011: N. Di Blas +, A “Smart” Authoring and Delivery Tool for Multichannel ...
MW2011: N. Di Blas +, A “Smart” Authoring and Delivery Tool for Multichannel ...MW2011: N. Di Blas +, A “Smart” Authoring and Delivery Tool for Multichannel ...
MW2011: N. Di Blas +, A “Smart” Authoring and Delivery Tool for Multichannel ...
museums and the web
 
MW2011: D. Birchall + M. Henson, Gaming the museum
MW2011: D. Birchall + M. Henson, Gaming the museumMW2011: D. Birchall + M. Henson, Gaming the museum
MW2011: D. Birchall + M. Henson, Gaming the museum
museums and the web
 
MW2011: G. Chae +, Can Social Tagging Be a Tool to Reduce the Semantic Gap be...
MW2011: G. Chae +, Can Social Tagging Be a Tool to Reduce the Semantic Gap be...MW2011: G. Chae +, Can Social Tagging Be a Tool to Reduce the Semantic Gap be...
MW2011: G. Chae +, Can Social Tagging Be a Tool to Reduce the Semantic Gap be...
museums and the web
 
MW2011: Klavans, J. +, Computational Linguistics in Museums: Applications fo...
MW2011: Klavans, J.  +, Computational Linguistics in Museums: Applications fo...MW2011: Klavans, J.  +, Computational Linguistics in Museums: Applications fo...
MW2011: Klavans, J. +, Computational Linguistics in Museums: Applications fo...
museums and the web
 
MW2011: L. Tallon + I. Froes, Going Mobile? Insights into the museum communit...
MW2011: L. Tallon + I. Froes, Going Mobile? Insights into the museum communit...MW2011: L. Tallon + I. Froes, Going Mobile? Insights into the museum communit...
MW2011: L. Tallon + I. Froes, Going Mobile? Insights into the museum communit...
museums and the web
 
MW2011: D. Laursen, Guided expectations: a case study of a sound collage audi...
MW2011: D. Laursen, Guided expectations: a case study of a sound collage audi...MW2011: D. Laursen, Guided expectations: a case study of a sound collage audi...
MW2011: D. Laursen, Guided expectations: a case study of a sound collage audi...
museums and the web
 
MW2011: J. Flemming +, Launching the MFA Multimedia Guide
MW2011: J. Flemming +, Launching the MFA Multimedia GuideMW2011: J. Flemming +, Launching the MFA Multimedia Guide
MW2011: J. Flemming +, Launching the MFA Multimedia Guide
museums and the web
 
MW2011: S. Fantoni, Mobile devices for orientation and way finding: the case ...
MW2011: S. Fantoni, Mobile devices for orientation and way finding: the case ...MW2011: S. Fantoni, Mobile devices for orientation and way finding: the case ...
MW2011: S. Fantoni, Mobile devices for orientation and way finding: the case ...
museums and the web
 
MW2011: Quigley, S., Integration of Print and Digital Publishing Workflows at...
MW2011: Quigley, S., Integration of Print and Digital Publishing Workflows at...MW2011: Quigley, S., Integration of Print and Digital Publishing Workflows at...
MW2011: Quigley, S., Integration of Print and Digital Publishing Workflows at...
museums and the web
 
MW2011: Cope, A., Authority Records, Future Computers and Other Unfinished Hi...
MW2011: Cope, A., Authority Records, Future Computers and Other Unfinished Hi...MW2011: Cope, A., Authority Records, Future Computers and Other Unfinished Hi...
MW2011: Cope, A., Authority Records, Future Computers and Other Unfinished Hi...
museums and the web
 
MW2010: N. Proctor, The Museum Is Mobile: Cross-platform content design for a...
MW2010: N. Proctor, The Museum Is Mobile: Cross-platform content design for a...MW2010: N. Proctor, The Museum Is Mobile: Cross-platform content design for a...
MW2010: N. Proctor, The Museum Is Mobile: Cross-platform content design for a...
museums and the web
 
MW2010: M. Petrie + L. Tallon, The iPhone effect?: Comparing visitors’ and mu...
MW2010: M. Petrie + L. Tallon, The iPhone effect?: Comparing visitors’ and mu...MW2010: M. Petrie + L. Tallon, The iPhone effect?: Comparing visitors’ and mu...
MW2010: M. Petrie + L. Tallon, The iPhone effect?: Comparing visitors’ and mu...
museums and the web
 
MW2010: Building an online research community: The Reciprocal Research Network
MW2010: Building an online research community: The Reciprocal Research Network MW2010: Building an online research community: The Reciprocal Research Network
MW2010: Building an online research community: The Reciprocal Research Network
museums and the web
 
MW2010: S. Hazan et al., ATHENA: A Mechanism for Harvesting Europe's Museum H...
MW2010: S. Hazan et al., ATHENA: A Mechanism for Harvesting Europe's Museum H...MW2010: S. Hazan et al., ATHENA: A Mechanism for Harvesting Europe's Museum H...
MW2010: S. Hazan et al., ATHENA: A Mechanism for Harvesting Europe's Museum H...
museums and the web
 

More from museums and the web (20)

How to Give an Accessible Presentation - Yue-Ting Siu
How to Give an Accessible Presentation - Yue-Ting SiuHow to Give an Accessible Presentation - Yue-Ting Siu
How to Give an Accessible Presentation - Yue-Ting Siu
 
MW2011: N. Di Blas +, A “Smart” Authoring and Delivery Tool for Multichannel ...
MW2011: N. Di Blas +, A “Smart” Authoring and Delivery Tool for Multichannel ...MW2011: N. Di Blas +, A “Smart” Authoring and Delivery Tool for Multichannel ...
MW2011: N. Di Blas +, A “Smart” Authoring and Delivery Tool for Multichannel ...
 
MW2011: D. Birchall + M. Henson, Gaming the museum
MW2011: D. Birchall + M. Henson, Gaming the museumMW2011: D. Birchall + M. Henson, Gaming the museum
MW2011: D. Birchall + M. Henson, Gaming the museum
 
MW2011: G. Chae +, Can Social Tagging Be a Tool to Reduce the Semantic Gap be...
MW2011: G. Chae +, Can Social Tagging Be a Tool to Reduce the Semantic Gap be...MW2011: G. Chae +, Can Social Tagging Be a Tool to Reduce the Semantic Gap be...
MW2011: G. Chae +, Can Social Tagging Be a Tool to Reduce the Semantic Gap be...
 
MW2011: Klavans, J. +, Computational Linguistics in Museums: Applications fo...
MW2011: Klavans, J.  +, Computational Linguistics in Museums: Applications fo...MW2011: Klavans, J.  +, Computational Linguistics in Museums: Applications fo...
MW2011: Klavans, J. +, Computational Linguistics in Museums: Applications fo...
 
MW2011: L. Tallon + I. Froes, Going Mobile? Insights into the museum communit...
MW2011: L. Tallon + I. Froes, Going Mobile? Insights into the museum communit...MW2011: L. Tallon + I. Froes, Going Mobile? Insights into the museum communit...
MW2011: L. Tallon + I. Froes, Going Mobile? Insights into the museum communit...
 
MW2011: D. Laursen, Guided expectations: a case study of a sound collage audi...
MW2011: D. Laursen, Guided expectations: a case study of a sound collage audi...MW2011: D. Laursen, Guided expectations: a case study of a sound collage audi...
MW2011: D. Laursen, Guided expectations: a case study of a sound collage audi...
 
MW2011: J. Flemming +, Launching the MFA Multimedia Guide
MW2011: J. Flemming +, Launching the MFA Multimedia GuideMW2011: J. Flemming +, Launching the MFA Multimedia Guide
MW2011: J. Flemming +, Launching the MFA Multimedia Guide
 
MW2011: S. Fantoni, Mobile devices for orientation and way finding: the case ...
MW2011: S. Fantoni, Mobile devices for orientation and way finding: the case ...MW2011: S. Fantoni, Mobile devices for orientation and way finding: the case ...
MW2011: S. Fantoni, Mobile devices for orientation and way finding: the case ...
 
MW2011: J. Bickersteth + C. Ainsley, Mobile Phones and Visitor Tracking
MW2011: J. Bickersteth + C. Ainsley, Mobile Phones and Visitor TrackingMW2011: J. Bickersteth + C. Ainsley, Mobile Phones and Visitor Tracking
MW2011: J. Bickersteth + C. Ainsley, Mobile Phones and Visitor Tracking
 
MW2011 Best of the Web Awards
MW2011 Best of the Web AwardsMW2011 Best of the Web Awards
MW2011 Best of the Web Awards
 
MW2011: Quigley, S., Integration of Print and Digital Publishing Workflows at...
MW2011: Quigley, S., Integration of Print and Digital Publishing Workflows at...MW2011: Quigley, S., Integration of Print and Digital Publishing Workflows at...
MW2011: Quigley, S., Integration of Print and Digital Publishing Workflows at...
 
MW2011: Cope, A., Authority Records, Future Computers and Other Unfinished Hi...
MW2011: Cope, A., Authority Records, Future Computers and Other Unfinished Hi...MW2011: Cope, A., Authority Records, Future Computers and Other Unfinished Hi...
MW2011: Cope, A., Authority Records, Future Computers and Other Unfinished Hi...
 
MW2011: S. Kenderdine, Cultural Data Sculpting
MW2011: S. Kenderdine, Cultural Data SculptingMW2011: S. Kenderdine, Cultural Data Sculpting
MW2011: S. Kenderdine, Cultural Data Sculpting
 
MW2010: N. Proctor, The Museum Is Mobile: Cross-platform content design for a...
MW2010: N. Proctor, The Museum Is Mobile: Cross-platform content design for a...MW2010: N. Proctor, The Museum Is Mobile: Cross-platform content design for a...
MW2010: N. Proctor, The Museum Is Mobile: Cross-platform content design for a...
 
MW2010: J. Doyle + M. Doyle, Mixing Social Glue with Brick and Mortar: Experi...
MW2010: J. Doyle + M. Doyle, Mixing Social Glue with Brick and Mortar: Experi...MW2010: J. Doyle + M. Doyle, Mixing Social Glue with Brick and Mortar: Experi...
MW2010: J. Doyle + M. Doyle, Mixing Social Glue with Brick and Mortar: Experi...
 
MW2010: M. Petrie + L. Tallon, The iPhone effect?: Comparing visitors’ and mu...
MW2010: M. Petrie + L. Tallon, The iPhone effect?: Comparing visitors’ and mu...MW2010: M. Petrie + L. Tallon, The iPhone effect?: Comparing visitors’ and mu...
MW2010: M. Petrie + L. Tallon, The iPhone effect?: Comparing visitors’ and mu...
 
MW2010: Building an online research community: The Reciprocal Research Network
MW2010: Building an online research community: The Reciprocal Research Network MW2010: Building an online research community: The Reciprocal Research Network
MW2010: Building an online research community: The Reciprocal Research Network
 
MW2010: S. Hazan et al., ATHENA: A Mechanism for Harvesting Europe's Museum H...
MW2010: S. Hazan et al., ATHENA: A Mechanism for Harvesting Europe's Museum H...MW2010: S. Hazan et al., ATHENA: A Mechanism for Harvesting Europe's Museum H...
MW2010: S. Hazan et al., ATHENA: A Mechanism for Harvesting Europe's Museum H...
 
MW2010: D. Peacock, Putting Mallala on the map: Creating a wiki community wit...
MW2010: D. Peacock, Putting Mallala on the map: Creating a wiki community wit...MW2010: D. Peacock, Putting Mallala on the map: Creating a wiki community wit...
MW2010: D. Peacock, Putting Mallala on the map: Creating a wiki community wit...
 

Recently uploaded

Essentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with ParametersEssentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with Parameters
Safe Software
 
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo DiehlFuture Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Peter Udo Diehl
 

Recently uploaded (20)

Measures in SQL (a talk at SF Distributed Systems meetup, 2024-05-22)
Measures in SQL (a talk at SF Distributed Systems meetup, 2024-05-22)Measures in SQL (a talk at SF Distributed Systems meetup, 2024-05-22)
Measures in SQL (a talk at SF Distributed Systems meetup, 2024-05-22)
 
Free and Effective: Making Flows Publicly Accessible, Yumi Ibrahimzade
Free and Effective: Making Flows Publicly Accessible, Yumi IbrahimzadeFree and Effective: Making Flows Publicly Accessible, Yumi Ibrahimzade
Free and Effective: Making Flows Publicly Accessible, Yumi Ibrahimzade
 
Essentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with ParametersEssentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with Parameters
 
Assuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyesAssuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyes
 
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo DiehlFuture Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
 
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
 
When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...
 
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...Integrating Telephony Systems with Salesforce: Insights and Considerations, B...
Integrating Telephony Systems with Salesforce: Insights and Considerations, B...
 
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
 
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
 
UiPath Test Automation using UiPath Test Suite series, part 1
UiPath Test Automation using UiPath Test Suite series, part 1UiPath Test Automation using UiPath Test Suite series, part 1
UiPath Test Automation using UiPath Test Suite series, part 1
 
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
 
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualitySoftware Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
 
Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........
 
Exploring UiPath Orchestrator API: updates and limits in 2024 🚀
Exploring UiPath Orchestrator API: updates and limits in 2024 🚀Exploring UiPath Orchestrator API: updates and limits in 2024 🚀
Exploring UiPath Orchestrator API: updates and limits in 2024 🚀
 
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
 
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
 
10 Differences between Sales Cloud and CPQ, Blanka Doktorová
10 Differences between Sales Cloud and CPQ, Blanka Doktorová10 Differences between Sales Cloud and CPQ, Blanka Doktorová
10 Differences between Sales Cloud and CPQ, Blanka Doktorová
 
IoT Analytics Company Presentation May 2024
IoT Analytics Company Presentation May 2024IoT Analytics Company Presentation May 2024
IoT Analytics Company Presentation May 2024
 
In-Depth Performance Testing Guide for IT Professionals
In-Depth Performance Testing Guide for IT ProfessionalsIn-Depth Performance Testing Guide for IT Professionals
In-Depth Performance Testing Guide for IT Professionals
 

Dan Zambonini and Mike Ellis, hoard.it: Aggregating, displaying and mining object-data without consent

  • 1. hoard.it : Stealing your data Or... “Where is your online value?” Or... “Originality sucks” Dan Zambonini www.boxuk.com Museums and the Web 2009, Indianapolis, April 16
  • 3. WARNING 1. I am playing Devil’s Advocate 2. These are‘thoughts in progress’
  • 4. Introduction 1. The hoard.it project 2. Museums and the Web: where’s the value?
  • 5. Introduction 1. The hoard.it project 2. Museums and the Web: where’s the value?
  • 8. Cross-Collections Projects “Search through the cultural collections of Europe” “explore and comment on collections” “find and explore digital collections from museums” “Discover cultural objects, collections”
  • 9. Why is this a Problem? 1. Some duplication of effort • £25,000 - £100,000 to put collections online • £1,500 - £6,500 per cross-collection project 2. Potential end-user confusion 3. Usually only include larger institutions 4. Is there really a need?
  • 10. Our Approach • Use data that already exists • No cost/duplication of effort • No input or changes from museums • Lightweight, open to all • Re-expose the data programmatically • Enable easy re-use
  • 14. Difficulties and Limitations • Must have collections online • Must have a consistent template • Slow; not real-time • Technical variations (encoding, standards) • Rudimentary: Flash/Forms a barrier
  • 15. Difficulties: Normalization • Dates • circa 19th century, 1960s, 2008-01, 1Jan ’52, 2000 BC, 30s, April 4 1934, 04-76, 1783-25-04, 10-11-64, about 200 AD, Victorian, 1100-1150, ... • http://feeds.boxuk.com/convert/date/ • Location • Points of interest, cities, towns, countries, administrative regions, political regions, ancient names, continents, postal codes, co-ordinates, ... • http://developer.yahoo.com/geo/
  • 16. The Data Virtual Museum of Canada! Carnegie Museum of Art! Smithsonian NASM! National Museum of Australia! National Portrait Gallery! Imperial War Museum! National Museums of Scotland! Ingenious! Museum of London: E20CL! British Museum! Victoria and Albert Museum! National Maritime Museum! Powerhouse! Science Museum! 24 Hour Museum! Freebase: Events! Wikipedia: List of Painters! 0! 2000! 4000! 6000! 8000! 10000! 12000! 14000! 16000!
  • 17. The Data Virtual Museum of Canada! Carnegie Museum of Art! Smithsonian NASM! National Museum of Australia! National Portrait Gallery! Imperial War Museum! National Museums of Scotland! Ingenious! Museum of London: E20CL! British Museum! Victoria and Albert Museum! National Maritime Museum! Powerhouse! Science Museum! 24 Hour Museum! Freebase: Events! Wikipedia: List of Painters! 0! 2000! 4000! 6000! 8000! 10000! 12000! 14000! 16000! 70,000 objects
  • 18. The Data • URL 100% • Identifier 95% • Title 100% • Description 70% • Image 85% • Creator 50% • Created Date 75% • Copyright 50% • Dimensions 45% • Subject 65% • Location 45% • Materials 65%
  • 19. Data Mining - Location 65% Europe 15% Asia 14% North America 4% Oceania Percentage of objects from the same continent as museum: • North America: 85% • Europe: 75% • Oceania: 65%
  • 20. % of objects by continent of origin! 0! 10! 20! 30! 40! 50! 60! 70! 80! 90! -1000! -900! -800! -700! -600! -500! -400! -300! -200! -100! 0! 100! 200! 300! 400! 500! Year! 600! 700! 800! 900! 1000! 1100! 1200! 1300! 1400! 1500! 1600! 1700! 1800! 1900! 2000! Asia! Africa! Europe! Oceania! North America! South America! Data Mining - Date/Location
  • 21. % of objects by material! 0! 5! 10! 15! 20! 25! 30! 35! 40! 0! 10 0! 20 0! 30 0! 40 0! 50 0! 60 0! 70 0! 80 0! 90 0! 10 00 ! Year! 11 0 0! 12 00 ! 13 00 ! 14 00 ! 15 00 ! 16 00 ! 17 00 ! 18 00 ! 19 00 ! 20 00 ! Clay! Gold! Silver! Stone! Data Mining - Date/Material
  • 22. How it has been used • Experiments: http://hoard.it/labs/ • UK Museums on the Web 2008 Hack Day • Who knows...? Photo courtesy of Brian Kelly
  • 23. How it has been used
  • 26. Do you offer anything? dbPedia, Freebase
  • 27. What can you offer? • Expertise • Media • The Physical Space • Reputation and Trust • Audience • Voice, Exposure and Influence
  • 28. What’s changed? “...not all information should flow everywhere; only the meaningful should be transmitted. But in the network economy only signals in real time (or close to it) are truly meaningful. Examine the speed of knowledge in your system. How can it be brought closer to real time? If this requires the cooperation of subcontractors, distant partners, and far- flung customers, so much the better.” Kevin Kelly http://www.kk.org/newrules/blog/2009/04/if-you-are-not-in-real-time-yo.php
  • 29. What’s changed? !quot;#$%#$& !quot;#$%& '($(& )%*+,-%.& '()%&
  • 32. What’s changed? !quot;#$%&'() *+#,) !quot;#$%&'( )*#+%$%&'( ,--.**%+%$&'( /0.(1%20&(3.#&quot;4.*( 5.*%26(
  • 33. UK Newspaper Example ,-./012345quot; #!quot; +quot; *quot; F44:G2.:=quot; 6278925:quot; )quot; (quot; 'quot; H2-1Iquot;JKL.8==quot; &quot; H2-1Iquot;A2-1quot; %quot; H2-1Iquot;A-..4.quot; $quot; H2-1Iquot;CM2.quot; #quot; H2-1Iquot;>8187.2LBquot; !quot; D5-E08quot;D=8.=quot; ;2/8<44:quot;;25=quot; ;-525/-21quot;>-G8=quot; >B8quot;N02.O-25quot; >B8quot;P5O8L85O85Mquot; >B8quot;C05quot; >B8quot;>-G8=quot; 9CCquot;C0<=/.-<8.=quot; >?-@8.quot;;4114?8.=quot; A85345=quot;-5quot;$&quot;B.=quot;
  • 34. For example • Let your patrons collaborate • Let your patrons run your space • Give local communities a voice • Provide advice and guidance • Collect & distribute niche knowledge • ... • You know better than I do.
  • 35. What has to change? • A focus on proven user needs • Re-usable services, not more data • Smaller projects • Iterative approaches • A real commitment to the web platform • (At least some) In-house development
  • 36. How do we get there? • Should web projects generate revenue? • Don’t be afraid of re-inventing the wheel • Demand all projects use/expose APIs that are easy (REST not SOAP/OAI) and publicized • Show early, show often • Annoy funding bodies to support more, smaller, longer (i.e. iterative) ‘boring’ projects, and less ‘big, audacious’ projects.
  • 37. Summary • We stole your data... • But then so are lots of other people... • So produce value elsewhere. • Ideas are harmful: do what’s proven... • But do it brilliantly. • And to do that, we need change.
  • 38. Thank you www.boxuk.com dan@boxuk.com twitter.com/zambonini
  • 39. Thank you www.boxuk.com dan@boxuk.com twitter.com/zambonini

Editor's Notes