SlideShare a Scribd company logo
1 of 22
ACCESSING
VITO
                   BUILDING ARCHIVES
        @ THE NEW YORK PUBLIC LIBRARY
   DONALD MENNERICH, CURATEGEAR 2013
VITO RUSSO
THE COLLECTION
• 18 5.25 inch floppy disks


• Hard copies printed on a borrowed Kaypro IV, sometime in the
  1990s


• 2 of the 18 disks were marked as being ‘unreadable’
CP/M
WORKFLOW
            Imaging


       disk images




            Analysis   Restoration


                               restored files
        metadata
                        Migration


                       migrated files



            Indexing



      Solr documents


    Index                               Repository
DISK IMAGING
DFXML
                                        JPG


Imaging


          Metadata
          Extraction


                          Photography


PREMIS                                        Backup




           METS              Files
                            Document
RESTORATION /
CPMTOOLS
dm$ cpmls -f kpiv M2654-0001.001
0:
cher
drugs
elders
filtr
filtr.bak
fran
hambrick
hamlin
jed
judy
kaminsky
AUTOMATION
/ CPMRESTORE
private void getFiles() throws IOException, InterruptedException{
           imageFile = new File(image);
           if(!imageFile.exists()){
        System.err.println("IMAGE FILE DOES NOT EXIST");
        System.exit(1);
    }
    Process p = Runtime.getRuntime().exec("/usr/local/bin/cpmls…
    p.waitFor();
    BufferedReader reader = new BufferedReader(new
InputStreamReader(p.getInputStream()));
    String line=reader.readLine();
    int count = 0;
ANALYSIS / FTK
MIGRATION
Donalds-MacBook-Pro:M2654-010 dm$ xxd albany | less


0000000: 0a0a 7669 746f 2072 7573 736f 202d 2073 ..vito russo - s
0000010: 7065 6563 6820 6769 7665 6e20 696e 2057 peech given in W
0000020: 6173 6869 6e67 746f 6e20 442e 432e 206f ashington D.C. o
0000030: 6e20 4f63 746f 6265 7220 3130 2c20 3139 n October 10, 19
0000040: 3838 0a0a 0a20 2020 2020 c120 4652 4945 88...    . FRIE
0000050: 4ec4 204f c620 4d49 4ec5 2048 41d3 20c1 N. O. MIN. HA. .
0000060: 2048 414c c620 4641 52c5 2054 5241 4e53 HAL. FAR. TRANS
0000070: 49d4 2043 4152 c420 5748 4943 c820 48c5 I. CAR. WHIC. H.
0000080: 2055 5345 d320 8d0a 8d0a 8d0a 4fce 2042 USE. ......O. B
MIGRATION
April 12, 1988
Ted Schachter
MGM Telecommunications Inc.
10000 Washington
Boulevard
Culver City, California 90232

Dear Mr. Schachter,

I am writing to request the use of a brief clip from the MGM/UA film LA
CAGE AUX FOLLES in connectioon with my lecture presentation based on
my book THE CELLULOID CLOSET. Published by Harper & Row, THE
CELLULOID CLOSET is a critically acclaimed and highly respected
scholarly treatment of the various ways in which gay people have been
portrayed onscreen from silent movies to the present.
INDEXING
INDEXING

         Word                                        .CSV
       Documents




                                                      Legacy
                                                     Metadata

      Tika Gateway               Full Text


                           Language
        Buffered             Code
        Reader
                                                      File
                     Names                           Model


                       Orgs
       OpenNLP
                     Locations                 Solr Document



                                             Index
SEARCH




         a live demo was here
ACCESS?
donaldmennerich@nypl.org

More Related Content

Similar to Accessing Vito

DefCon 2012 - Anti-Forensics and Anti-Anti-Forensics
DefCon 2012 - Anti-Forensics and Anti-Anti-ForensicsDefCon 2012 - Anti-Forensics and Anti-Anti-Forensics
DefCon 2012 - Anti-Forensics and Anti-Anti-ForensicsMichael Smith
 
Rapid Response: Debugging and Profiling to the Rescue
Rapid Response: Debugging and Profiling to the RescueRapid Response: Debugging and Profiling to the Rescue
Rapid Response: Debugging and Profiling to the RescueEric Kavanagh
 
Scaling Big Data Mining Infrastructure Twitter Experience
Scaling Big Data Mining Infrastructure Twitter ExperienceScaling Big Data Mining Infrastructure Twitter Experience
Scaling Big Data Mining Infrastructure Twitter ExperienceDataWorks Summit
 
Data Hiding Techniques
Data Hiding TechniquesData Hiding Techniques
Data Hiding Techniquesprashant3535
 
the productive programer: mechanics
the productive programer: mechanicsthe productive programer: mechanics
the productive programer: mechanicselliando dias
 
Linux Kernel Exploitation
Linux Kernel ExploitationLinux Kernel Exploitation
Linux Kernel ExploitationScio Security
 
GOTO 2011 preso: 3x Hadoop
GOTO 2011 preso: 3x HadoopGOTO 2011 preso: 3x Hadoop
GOTO 2011 preso: 3x Hadoopfvanvollenhoven
 
Docker for Developers: Dev, Test, Deploy @ BucksCo Devops at MeetMe HQ
Docker for Developers: Dev, Test, Deploy @ BucksCo Devops at MeetMe HQDocker for Developers: Dev, Test, Deploy @ BucksCo Devops at MeetMe HQ
Docker for Developers: Dev, Test, Deploy @ BucksCo Devops at MeetMe HQErica Windisch
 
Digital Forensics
Digital ForensicsDigital Forensics
Digital ForensicsOldsun
 
深層学習ライブラリの環境問題Chainer Meetup2016 07-02
深層学習ライブラリの環境問題Chainer Meetup2016 07-02深層学習ライブラリの環境問題Chainer Meetup2016 07-02
深層学習ライブラリの環境問題Chainer Meetup2016 07-02Yuta Kashino
 
Java Hates Linux. Deal With It.
Java Hates Linux.  Deal With It.Java Hates Linux.  Deal With It.
Java Hates Linux. Deal With It.Greg Banks
 
UNIX v6で見るdmrの貢献
UNIX v6で見るdmrの貢献UNIX v6で見るdmrの貢献
UNIX v6で見るdmrの貢献Ryousei Takano
 
DLL Hijacking on OS X
DLL Hijacking on OS XDLL Hijacking on OS X
DLL Hijacking on OS XSynack
 
WTF my container just spawned a shell!
WTF my container just spawned a shell!WTF my container just spawned a shell!
WTF my container just spawned a shell!Sysdig
 
CGI trainees workshop Distributed Deep Learning, 24/5 2019, Kim Hammar
CGI trainees workshop Distributed Deep Learning, 24/5 2019, Kim HammarCGI trainees workshop Distributed Deep Learning, 24/5 2019, Kim Hammar
CGI trainees workshop Distributed Deep Learning, 24/5 2019, Kim HammarKim Hammar
 
Anomaly Detection at Scale
Anomaly Detection at ScaleAnomaly Detection at Scale
Anomaly Detection at ScaleJeff Henrikson
 

Similar to Accessing Vito (20)

DefCon 2012 - Anti-Forensics and Anti-Anti-Forensics
DefCon 2012 - Anti-Forensics and Anti-Anti-ForensicsDefCon 2012 - Anti-Forensics and Anti-Anti-Forensics
DefCon 2012 - Anti-Forensics and Anti-Anti-Forensics
 
Rapid Response: Debugging and Profiling to the Rescue
Rapid Response: Debugging and Profiling to the RescueRapid Response: Debugging and Profiling to the Rescue
Rapid Response: Debugging and Profiling to the Rescue
 
You suck at Memory Analysis
You suck at Memory AnalysisYou suck at Memory Analysis
You suck at Memory Analysis
 
Scaling Big Data Mining Infrastructure Twitter Experience
Scaling Big Data Mining Infrastructure Twitter ExperienceScaling Big Data Mining Infrastructure Twitter Experience
Scaling Big Data Mining Infrastructure Twitter Experience
 
Data Hiding Techniques
Data Hiding TechniquesData Hiding Techniques
Data Hiding Techniques
 
淺談探索 Linux 系統設計之道
淺談探索 Linux 系統設計之道 淺談探索 Linux 系統設計之道
淺談探索 Linux 系統設計之道
 
the productive programer: mechanics
the productive programer: mechanicsthe productive programer: mechanics
the productive programer: mechanics
 
Linux Kernel Exploitation
Linux Kernel ExploitationLinux Kernel Exploitation
Linux Kernel Exploitation
 
Bids talk 9.18
Bids talk 9.18Bids talk 9.18
Bids talk 9.18
 
GOTO 2011 preso: 3x Hadoop
GOTO 2011 preso: 3x HadoopGOTO 2011 preso: 3x Hadoop
GOTO 2011 preso: 3x Hadoop
 
Docker for Developers: Dev, Test, Deploy @ BucksCo Devops at MeetMe HQ
Docker for Developers: Dev, Test, Deploy @ BucksCo Devops at MeetMe HQDocker for Developers: Dev, Test, Deploy @ BucksCo Devops at MeetMe HQ
Docker for Developers: Dev, Test, Deploy @ BucksCo Devops at MeetMe HQ
 
Digital Forensics
Digital ForensicsDigital Forensics
Digital Forensics
 
深層学習ライブラリの環境問題Chainer Meetup2016 07-02
深層学習ライブラリの環境問題Chainer Meetup2016 07-02深層学習ライブラリの環境問題Chainer Meetup2016 07-02
深層学習ライブラリの環境問題Chainer Meetup2016 07-02
 
Java Hates Linux. Deal With It.
Java Hates Linux.  Deal With It.Java Hates Linux.  Deal With It.
Java Hates Linux. Deal With It.
 
West coastrollout
West coastrolloutWest coastrollout
West coastrollout
 
UNIX v6で見るdmrの貢献
UNIX v6で見るdmrの貢献UNIX v6で見るdmrの貢献
UNIX v6で見るdmrの貢献
 
DLL Hijacking on OS X
DLL Hijacking on OS XDLL Hijacking on OS X
DLL Hijacking on OS X
 
WTF my container just spawned a shell!
WTF my container just spawned a shell!WTF my container just spawned a shell!
WTF my container just spawned a shell!
 
CGI trainees workshop Distributed Deep Learning, 24/5 2019, Kim Hammar
CGI trainees workshop Distributed Deep Learning, 24/5 2019, Kim HammarCGI trainees workshop Distributed Deep Learning, 24/5 2019, Kim Hammar
CGI trainees workshop Distributed Deep Learning, 24/5 2019, Kim Hammar
 
Anomaly Detection at Scale
Anomaly Detection at ScaleAnomaly Detection at Scale
Anomaly Detection at Scale
 

Accessing Vito

  • 1. ACCESSING VITO BUILDING ARCHIVES @ THE NEW YORK PUBLIC LIBRARY DONALD MENNERICH, CURATEGEAR 2013
  • 3. THE COLLECTION • 18 5.25 inch floppy disks • Hard copies printed on a borrowed Kaypro IV, sometime in the 1990s • 2 of the 18 disks were marked as being ‘unreadable’
  • 5. WORKFLOW Imaging disk images Analysis Restoration restored files metadata Migration migrated files Indexing Solr documents Index Repository
  • 7. DFXML JPG Imaging Metadata Extraction Photography PREMIS Backup METS Files Document
  • 8.
  • 9. RESTORATION / CPMTOOLS dm$ cpmls -f kpiv M2654-0001.001 0: cher drugs elders filtr filtr.bak fran hambrick hamlin jed judy kaminsky
  • 10. AUTOMATION / CPMRESTORE private void getFiles() throws IOException, InterruptedException{ imageFile = new File(image); if(!imageFile.exists()){ System.err.println("IMAGE FILE DOES NOT EXIST"); System.exit(1); } Process p = Runtime.getRuntime().exec("/usr/local/bin/cpmls… p.waitFor(); BufferedReader reader = new BufferedReader(new InputStreamReader(p.getInputStream())); String line=reader.readLine(); int count = 0;
  • 12.
  • 13.
  • 14. MIGRATION Donalds-MacBook-Pro:M2654-010 dm$ xxd albany | less 0000000: 0a0a 7669 746f 2072 7573 736f 202d 2073 ..vito russo - s 0000010: 7065 6563 6820 6769 7665 6e20 696e 2057 peech given in W 0000020: 6173 6869 6e67 746f 6e20 442e 432e 206f ashington D.C. o 0000030: 6e20 4f63 746f 6265 7220 3130 2c20 3139 n October 10, 19 0000040: 3838 0a0a 0a20 2020 2020 c120 4652 4945 88... . FRIE 0000050: 4ec4 204f c620 4d49 4ec5 2048 41d3 20c1 N. O. MIN. HA. . 0000060: 2048 414c c620 4641 52c5 2054 5241 4e53 HAL. FAR. TRANS 0000070: 49d4 2043 4152 c420 5748 4943 c820 48c5 I. CAR. WHIC. H. 0000080: 2055 5345 d320 8d0a 8d0a 8d0a 4fce 2042 USE. ......O. B
  • 15.
  • 16.
  • 17. MIGRATION April 12, 1988 Ted Schachter MGM Telecommunications Inc. 10000 Washington Boulevard Culver City, California 90232 Dear Mr. Schachter, I am writing to request the use of a brief clip from the MGM/UA film LA CAGE AUX FOLLES in connectioon with my lecture presentation based on my book THE CELLULOID CLOSET. Published by Harper & Row, THE CELLULOID CLOSET is a critically acclaimed and highly respected scholarly treatment of the various ways in which gay people have been portrayed onscreen from silent movies to the present.
  • 19. INDEXING Word .CSV Documents Legacy Metadata Tika Gateway Full Text Language Buffered Code Reader File Names Model Orgs OpenNLP Locations Solr Document Index
  • 20. SEARCH a live demo was here

Editor's Notes

  1. Film Critic, Gay Civil Rights and AIDS activist – founding member of ACT UP (To Unleash power)Author of the ‘Celluloid closet’Michael Chiavi: “Celluloid Activist”Subject of recent HBO documentary “Vito”Prominently featured in 2012 documentary “How To Survive a Plague”
  2. CP/M “Control Program/Monitor”Control Program for Microcomputers8080 and Z80, other CPUs through expansion cards (Apple ][, C64)
  3. Demo CPMRestore
  4. XXD view of a restored Wordstar File – note the encoding problems
  5. Text read to be indexed after migration to Word docx format
  6. Java application to index docsTika as text parser and language detectionOpenNLP as named entity extractionThis is currently ad-hoc (plan to add Mahout for clustering and classification)
  7. “Larry Kramer” “Rita Mae Brown”