3. THE COLLECTION
• 18 5.25-inch floppy disks
• Hard copies printed on a borrowed Kaypro IV, sometime in the 1990s
• 2 of the 18 disks were marked ‘unreadable’
14. MIGRATION
Donalds-MacBook-Pro:M2654-010 dm$ xxd albany | less
0000000: 0a0a 7669 746f 2072 7573 736f 202d 2073  ..vito russo - s
0000010: 7065 6563 6820 6769 7665 6e20 696e 2057  peech given in W
0000020: 6173 6869 6e67 746f 6e20 442e 432e 206f  ashington D.C. o
0000030: 6e20 4f63 746f 6265 7220 3130 2c20 3139  n October 10, 19
0000040: 3838 0a0a 0a20 2020 2020 c120 4652 4945  88...     . FRIE
0000050: 4ec4 204f c620 4d49 4ec5 2048 41d3 20c1  N. O. MIN. HA. .
0000060: 2048 414c c620 4641 52c5 2054 5241 4e53   HAL. FAR. TRANS
0000070: 49d4 2043 4152 c420 5748 4943 c820 48c5  I. CAR. WHIC. H.
0000080: 2055 5345 d320 8d0a 8d0a 8d0a 4fce 2042   USE. ......O. B
17. MIGRATION
April 12, 1988
Ted Schachter
MGM Telecommunications Inc.
10000 Washington Boulevard
Culver City, California 90232
Dear Mr. Schachter,
I am writing to request the use of a brief clip from the MGM/UA film LA
CAGE AUX FOLLES in connection with my lecture presentation based on
my book THE CELLULOID CLOSET. Published by Harper & Row, THE
CELLULOID CLOSET is a critically acclaimed and highly respected
scholarly treatment of the various ways in which gay people have been
portrayed onscreen from silent movies to the present.
19. INDEXING
[Diagram labels: Word documents; .CSV legacy metadata; Gateway code; Buffered reader; File; Tika (full text, language); OpenNLP model (names, orgs, locations); Solr document; Index]
• Film critic, gay civil rights and AIDS activist; founding member of ACT UP (AIDS Coalition to Unleash Power)
• Author of ‘The Celluloid Closet’
• Subject of Michael Schiavi's biography “Celluloid Activist”
• Subject of the recent HBO documentary “Vito”
• Prominently featured in the 2012 documentary “How to Survive a Plague”
• CP/M: originally “Control Program/Monitor”, later “Control Program for Microcomputers”
• Ran on the Intel 8080 and Zilog Z80; machines with other CPUs (Apple ][, C64) could run it through Z80 expansion cards
Demo CPMRestore
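xxd view of a restored WordStar file. Note the encoding problems: in the body text the last letter of each word has its high bit set (c1, c4, c6 ...) and shows as “.” in the ASCII column, and line ends carry 8d bytes.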
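The encoding problem in the dump can be worked around by clearing bit 7 of each byte, since WordStar document files flag the last character of a word (and soft carriage returns, 0x8d) with the high bit. The following is a minimal Python sketch under that assumption, not the migration tool used in the project; the function name `wordstar_to_text` is hypothetical, and WordStar formatting control codes are not handled.

```python
def wordstar_to_text(data: bytes) -> str:
    """Clear the high bit of every byte and return plain ASCII text.

    Assumes the input is 7-bit text with WordStar high-bit flags;
    genuine 8-bit characters would be mangled by this mask.
    """
    cleaned = bytes(b & 0x7F for b in data)
    # A soft carriage return (0x8d) becomes 0x0d after masking;
    # collapse the resulting CR LF pairs to a single line feed.
    return cleaned.replace(b"\r\n", b"\n").decode("ascii")


# Bytes copied from offsets 0x4a-0x5f and 0x81-0x87 of the dump above.
sample = bytes.fromhex("c120465249454ec4204fc6204d494ec5204841d320c1")
line_end = bytes.fromhex("555345d3208d0a")

print(wordstar_to_text(sample))
print(repr(wordstar_to_text(line_end)))
```

Masking is lossy by design: it discards the flags WordStar used for layout, so it suits text extraction for indexing rather than faithful format conversion.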
Text ready to be indexed after migration to Word .docx format
• Java application to index the documents
• Tika for text parsing and language detection
• OpenNLP for named entity extraction
• Currently ad hoc (plan to add Mahout for clustering and classification)
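To make the shape of one index record concrete, here is a minimal Python sketch of how extracted text, detected language, named entities, and legacy CSV metadata might be merged into a single document for Solr. It is not the Java application described above: the field names, the `build_index_document` helper, the CSV columns, and the sample values are all assumptions, and the entity lists are typed in by hand as stand-ins for Tika and OpenNLP output.

```python
import csv
import io
import json


def build_index_document(file_name, full_text, language, entities, legacy_row):
    """Merge extracted text, language, entities and legacy metadata."""
    doc = {
        "id": file_name,
        "text": full_text,
        "language": language,
        "names": sorted(entities.get("names", [])),
        "orgs": sorted(entities.get("orgs", [])),
        "locations": sorted(entities.get("locations", [])),
    }
    # Carry legacy CSV columns over under a prefix so they cannot
    # collide with the extracted fields.
    for column, value in legacy_row.items():
        if column != "file_name":
            doc["legacy_" + column] = value
    return doc


# Hypothetical legacy metadata; the real CSV columns are not shown here.
LEGACY_CSV = "file_name,disk_label\nletter.docx,example disk\n"
legacy = {row["file_name"]: row for row in csv.DictReader(io.StringIO(LEGACY_CSV))}

doc = build_index_document(
    "letter.docx",
    "Dear Mr. Schachter, I am writing to request the use of a brief clip",
    "en",  # stand-in for a language detector's result
    {  # stand-ins for named entity extraction results
        "names": ["Ted Schachter"],
        "orgs": ["MGM Telecommunications Inc.", "Harper & Row"],
        "locations": ["Culver City"],
    },
    legacy["letter.docx"],
)

# Solr's JSON update handler accepts a JSON array of documents.
payload = json.dumps([doc], indent=2)
print(payload)
```

Keeping the legacy columns under a separate prefix is one possible design choice; it lets the original catalogue values be searched alongside the extracted fields without overwriting them.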