SlideShare a Scribd company logo
1 of 27
Download to read offline
NewsScape: Preserving TV News
Tim Groeling (groeling@ucla.edu)
Who Are We?
• Prof. Francis Steen: Director, Communication Studies
Archive
• Me: Leading the analog digitization effort
• Predecessor & Archive Founder: Prof. Paul Rosenthal
(emeritus)
• UCLA Library: Helps support the collection, store files,
and host main “public” site (tvnews.library.ucla.edu )
• Other supporters: UCLA Chancellor and Dean of
Social Sciences, Arcadia Fund, UCLA Social Sciences
Computing, the NSF, the California Endowment, UCLA
Office for Instructional Development, and UCLA CCLE.
Collections?
• Oldest collection: UCLA Campus
Speakers, from 1950s-1980s (over 500
audio recordings)
• Digitized to coincide with 40th
anniversary of my department.
• Originally planned to exhibit & host on
our website. Target at alumni.
• Moved to YouTube: Now most traffic
(77%) comes from YouTube search/
suggested videos/browsing.
• Issues: commenters & copyright
NewsScape
• Largest collection: TV news and public
affairs programs (local & national)
• Started during Watergate (preserve
ephemera). Shoestring budget (until
recently, only about $10k per year plus
volunteer labor & donated equipment)
• 1979: Started trying to record all the local
and national TV news viewable in LA.
• 2006: Started daily straight-to-digital
recording.
• Since 2006: Added other cities.
Pre-2006 Holdings?
• Recordings spread across three campus
organizations (Comm Studies, Library, and TFT)
and at least four on- and off-campus storage
sites.
• Good records for some portions; very poor
records for others.
• Not sure how many tapes are in the
collection overall.
• Even where we know what should be on a tape,
some problems with tape, VCR, or schedule.
Tapes
• Earliest recordings (1970s) Around 500
U-Matic tapes.
• Middle period (1979-early 1990s):
about 50k hours on Betamax
• Late period (1990s-2006): Around 160k
hours on VHS, plus some redundancy.
Preservation
• VHS are actually most threatened, despite being newest
tapes.
• Coincided with cable TV expansion of news programming:
stretched same budget to cover more news programming.
• 8 hours per consumer VHS tape (Betas and U-matics
were higher quality tapes; less recorded on each tape)
• Poor quality consumer-grade VCRs
• Limited spot-checking for quality (failing VCRs or poor
signal quality not noticed for long stretches).
• Originals still in hand, but dead.
• Improperly stored (even faculty didn’t have A/C)
Cost to Digitize?
• Got bids from another archive and
commercial providers: $1.5 million (just for
first 150k hours of VHS).
• Instead, shoestring again.
• $20k (and some donated surplus
machines) for hardware, software, and
furniture for digitization lab.
• Run by me, part time lab manager, and 10
work-study students. Steen handles files.
• [Shifting to Betamax will be costly, though]
Lab Details
• 22 digitization stations (VCR, encoder, computer).
3 local RAID file servers and 1 Filemaker Server.
• All computers: surplus or eBay Macs (circa 2008)
• Encoders: EyeTV using Hauppauge 950q or
EyeTV Hybrid hardware MPEG-2 encoders (get
CC). Export and sync scripted.
• VCRs: After testing, settled on JVC S-VHS VCRs
(consumer to pro).
• Use pre-printed barcode stickers for inventory.
Custom Filemaker database for tracking
digitization attempts and quality control. Filemaker
Go Mobile (via cell phones) for asset tracking.
• Files are quality-checked, compressed to h.264,
closed captioning extracted via Hoffman Cluster.
Progress
• Fall 2015: Process design &
workstation configuration testing
• Winter 2016: hired students, fixed
network issues, and ramped up
workstations.
• Summer 2016: Filemaker inventory
control; two daily shifts.
• March-Oct 2016: 4.5k tapes
encoded (about 36k hours).
• Delaying splitting files into shows.
Problems: Lots
• Recording/playback VCRs out of spec
(solution: quality control aggregation helps
find love connection)
• Varying program names over time (database
tracking alternate show names; day/time/
channel 30-minute bloc)
• Buzzing audio? Computer RF interference with
VCR audio. (Used dead VCRs as spacers.)
• Lot of other problems, but in most cases, just
means another encoding attempt.
Post 2006

Straight-to-Digital
• 46 networks (US and beyond)
• 2,525 Series
• Total video files: 383,550
• Duration in hours: 297,596
• Closed caption files: 383,739
• Words in caption files: 2,419,185,351
• OCR files: 371,426
• Words in OCR files: 825,662,597
• Total thumbnail images: 107,134,425
• Storage: 106.93 terabytes
• Limited public access link: tvnews.library.ucla.edu
Unlocking the Content
• Preservation is just first step:
Needs to be more than "world’s
best DVR"
• Want to provide tools to make
the collection more useful and
relevant beyond UCLA: Help
people
• Understand TV news, and…
• Share what they find
Example: Obama
• Preservation is just first step: Needs to be more
than world’s "best VCR"
• Want to provide tools to make the collection more
useful and relevant beyond UCLA: Help people
• Understand TV news, and…
• Share what they find
Good to know, but…
Tools to Understand News
• Not just view, but analyze
• Help understand and
visualize patterns of news
coverage, not just
individual stories. Forest,
not just trees.
• Tools are already being
developed, but are
complex
Tools to Understand Text
• Not just view, but analyze
• Help understand and visualize patterns of news
coverage, not just individual stories (copyright, too)
• Tools are already being developed, but are
complex
Ambitious Goal: Visuals
• Text analysis is fairly mature (more
than 2 billion words in NewsScape
index)
• Named entities, parts of speech,
topic detection are all working
now (sentiment is harder)
• Analysis of visuals is challenging.
• Facial detection & analysis tools
are becoming more useful;
scalable
Automated Analysis of
Visuals
• Goal: Be able to understand patterns of visual communication in
election news. Hard to study.
• Mostly hand-coded & focus on still newspaper or web photos
• Trouble scaling to massive volume of images.
• Subjectivity
• Machine learning and big data as solution
• Presented pilot study at this year’s American Political Science
Association conference categorizing presidential candidate
faces (smiling or not).
Face Validity
>1714 ~ 1711 ~ 148 ~ 11
< -13 -13 ~ -10 -10 ~ -7 -7 ~ -4 -4 ~ -1 -1 ~ 2
2 ~ 5 5 ~ 8
>1714 ~ 1711 ~ 148 ~ 11
< -13 -13 ~ -10 -10 ~ -7 -7 ~ -4 -4 ~ -1 -1 ~ 2
2 ~ 5 5 ~ 8
Face Validity
Weekly topic tracking

(filter by outlet) with 

metadata (who, what,

where, how much)
Daily topic trajectory.
News topics are
detected by clustering
every day, and then
linked the detected
topics to generate topic
tracking trajectories (Li,
Joo, Qi, & Zhu, 2015)
Other Goal: Sharing
• Help people share what they
learn (within bounds of copyright)
• Solution #1: Share analysis, rather
than raw material
• Solution #2: use familiar,
copyright-compliant tools to
create and share.
Social Sharing Tools
• Trying to develop two tools:
• Animated GIF generator:
(short clip; small file; on-
screen captioning; easy to
play)
• "Supercut" generator:
Assemble short examples
from archives; share
compilation
Summing Up
• Preservation as goal, but also as starting point.
• Excited to be able to understand long-term
changes in news content & norms.
• Lot of work ahead of us.
• Appreciate any help or advice (or funding) you
can offer.

More Related Content

What's hot

Joining the National Digital Humanities Conversation: Communities, Conference...
Joining the National Digital Humanities Conversation: Communities, Conference...Joining the National Digital Humanities Conversation: Communities, Conference...
Joining the National Digital Humanities Conversation: Communities, Conference...Rebecca Davis
 
Ho'okele: Navigating Copyright to Provide Access and Use
Ho'okele: Navigating Copyright to Provide Access and UseHo'okele: Navigating Copyright to Provide Access and Use
Ho'okele: Navigating Copyright to Provide Access and UseSound and Vision R&D
 
British Library Labs Presentation at Edge Hill University
British Library Labs Presentation at Edge Hill UniversityBritish Library Labs Presentation at Edge Hill University
British Library Labs Presentation at Edge Hill Universitylabsbl
 
British Library Labs Presentation Hertfordshire
British Library Labs Presentation HertfordshireBritish Library Labs Presentation Hertfordshire
British Library Labs Presentation Hertfordshirelabsbl
 
British Library Labs Presentation to City University London
British Library Labs Presentation to City University LondonBritish Library Labs Presentation to City University London
British Library Labs Presentation to City University Londonlabsbl
 
Open educational practices, a tool for empowerment
 Open educational practices, a tool for empowerment Open educational practices, a tool for empowerment
Open educational practices, a tool for empowermentLangOER
 
Making the most of broadcast media in science education
Making the most of broadcast media in science educationMaking the most of broadcast media in science education
Making the most of broadcast media in science educationChris Willmott
 
Europeana Sounds: Project Overview
Europeana Sounds: Project OverviewEuropeana Sounds: Project Overview
Europeana Sounds: Project OverviewEuropeana_Sounds
 
"Breaking the Barriers to Citizen Science"
"Breaking the Barriers to Citizen Science""Breaking the Barriers to Citizen Science"
"Breaking the Barriers to Citizen Science"Alice Sheppard
 

What's hot (9)

Joining the National Digital Humanities Conversation: Communities, Conference...
Joining the National Digital Humanities Conversation: Communities, Conference...Joining the National Digital Humanities Conversation: Communities, Conference...
Joining the National Digital Humanities Conversation: Communities, Conference...
 
Ho'okele: Navigating Copyright to Provide Access and Use
Ho'okele: Navigating Copyright to Provide Access and UseHo'okele: Navigating Copyright to Provide Access and Use
Ho'okele: Navigating Copyright to Provide Access and Use
 
British Library Labs Presentation at Edge Hill University
British Library Labs Presentation at Edge Hill UniversityBritish Library Labs Presentation at Edge Hill University
British Library Labs Presentation at Edge Hill University
 
British Library Labs Presentation Hertfordshire
British Library Labs Presentation HertfordshireBritish Library Labs Presentation Hertfordshire
British Library Labs Presentation Hertfordshire
 
British Library Labs Presentation to City University London
British Library Labs Presentation to City University LondonBritish Library Labs Presentation to City University London
British Library Labs Presentation to City University London
 
Open educational practices, a tool for empowerment
 Open educational practices, a tool for empowerment Open educational practices, a tool for empowerment
Open educational practices, a tool for empowerment
 
Making the most of broadcast media in science education
Making the most of broadcast media in science educationMaking the most of broadcast media in science education
Making the most of broadcast media in science education
 
Europeana Sounds: Project Overview
Europeana Sounds: Project OverviewEuropeana Sounds: Project Overview
Europeana Sounds: Project Overview
 
"Breaking the Barriers to Citizen Science"
"Breaking the Barriers to Citizen Science""Breaking the Barriers to Citizen Science"
"Breaking the Barriers to Citizen Science"
 

Similar to Groeling, Tim: NewsScape: Preserving TV News

Improving Access to Historic Public Broadcasting through Speech-to-Text, Crow...
Improving Access to Historic Public Broadcasting through Speech-to-Text, Crow...Improving Access to Historic Public Broadcasting through Speech-to-Text, Crow...
Improving Access to Historic Public Broadcasting through Speech-to-Text, Crow...WGBH Media Library and Archives
 
AV Digitization Projects: Tools and Strategies for Enhancing Impact and Engag...
AV Digitization Projects: Tools and Strategies for Enhancing Impact and Engag...AV Digitization Projects: Tools and Strategies for Enhancing Impact and Engag...
AV Digitization Projects: Tools and Strategies for Enhancing Impact and Engag...WGBH Media Library and Archives
 
Challenges, Workflows, and Insights in the Collaboration to Preserve America'...
Challenges, Workflows, and Insights in the Collaboration to Preserve America'...Challenges, Workflows, and Insights in the Collaboration to Preserve America'...
Challenges, Workflows, and Insights in the Collaboration to Preserve America'...WGBH Media Library and Archives
 
AMIA: Examining AV Enterprise at a Regional Academic Archive
AMIA: Examining AV Enterprise at a Regional Academic ArchiveAMIA: Examining AV Enterprise at a Regional Academic Archive
AMIA: Examining AV Enterprise at a Regional Academic ArchiveJessica Breiman
 
A la recherche
 A la recherche A la recherche
A la rechercheEd Weiss
 
Planning and Managing Digital Library & Archive Projects
Planning and Managing Digital Library & Archive ProjectsPlanning and Managing Digital Library & Archive Projects
Planning and Managing Digital Library & Archive Projectsac2182
 
Webinar: Getting Started with Digitization An Introduction for Libraries-2016...
Webinar: Getting Started with Digitization An Introduction for Libraries-2016...Webinar: Getting Started with Digitization An Introduction for Libraries-2016...
Webinar: Getting Started with Digitization An Introduction for Libraries-2016...TechSoup
 
Webinar - Lights, Camera, Advocacy to Action: Digital Storytelling for Libraries
Webinar - Lights, Camera, Advocacy to Action: Digital Storytelling for LibrariesWebinar - Lights, Camera, Advocacy to Action: Digital Storytelling for Libraries
Webinar - Lights, Camera, Advocacy to Action: Digital Storytelling for LibrariesSusan Hope Bard
 
Keeping the Broadcast Historic Record: An Archive of Public Media in the Making
Keeping the Broadcast Historic Record: An Archive of Public Media in the MakingKeeping the Broadcast Historic Record: An Archive of Public Media in the Making
Keeping the Broadcast Historic Record: An Archive of Public Media in the MakingWGBH Media Library and Archives
 
VERDOODT Measuring clouds. A large scale acquisition and preservation service...
VERDOODT Measuring clouds. A large scale acquisition and preservation service...VERDOODT Measuring clouds. A large scale acquisition and preservation service...
VERDOODT Measuring clouds. A large scale acquisition and preservation service...FIAT/IFTA
 
Sharing Economy 2.0 & The Internet of People (IoP) Workshop
Sharing Economy 2.0 & The Internet of People (IoP) WorkshopSharing Economy 2.0 & The Internet of People (IoP) Workshop
Sharing Economy 2.0 & The Internet of People (IoP) WorkshopHristian Daskalov
 
Presentation Slides, “Creating Access to Audio & Video Digital Media: The Va...
Presentation Slides, “Creating Access to Audio & Video Digital Media:  The Va...Presentation Slides, “Creating Access to Audio & Video Digital Media:  The Va...
Presentation Slides, “Creating Access to Audio & Video Digital Media: The Va...DuraSpace
 
2-21-12 Preservation Planning Success Stories Slides
2-21-12 Preservation Planning Success Stories Slides2-21-12 Preservation Planning Success Stories Slides
2-21-12 Preservation Planning Success Stories SlidesDuraSpace
 
Webinar: Technology Skills Training Programs for Library Staff - 2016-01-27
Webinar: Technology Skills Training Programs for Library Staff - 2016-01-27Webinar: Technology Skills Training Programs for Library Staff - 2016-01-27
Webinar: Technology Skills Training Programs for Library Staff - 2016-01-27TechSoup
 
Just Digitise It - Daniel Wilksch of the Public Records Office Victoria
Just Digitise It - Daniel Wilksch of the Public Records Office VictoriaJust Digitise It - Daniel Wilksch of the Public Records Office Victoria
Just Digitise It - Daniel Wilksch of the Public Records Office VictoriaNational Library of Australia
 
Webinar - Technology Planning Tips for Small Libraries - 2015-08-19
Webinar - Technology Planning Tips for Small Libraries - 2015-08-19Webinar - Technology Planning Tips for Small Libraries - 2015-08-19
Webinar - Technology Planning Tips for Small Libraries - 2015-08-19TechSoup
 
Navigating the Analog Waves: Digitizing Audio Cassettes for Your Collection
Navigating the Analog Waves: Digitizing Audio Cassettes for Your CollectionNavigating the Analog Waves: Digitizing Audio Cassettes for Your Collection
Navigating the Analog Waves: Digitizing Audio Cassettes for Your CollectionKay Gregg
 

Similar to Groeling, Tim: NewsScape: Preserving TV News (20)

Improving Access to Historic Public Broadcasting through Speech-to-Text, Crow...
Improving Access to Historic Public Broadcasting through Speech-to-Text, Crow...Improving Access to Historic Public Broadcasting through Speech-to-Text, Crow...
Improving Access to Historic Public Broadcasting through Speech-to-Text, Crow...
 
Engage Your Community to Celebrate Your History
Engage Your Community to Celebrate Your HistoryEngage Your Community to Celebrate Your History
Engage Your Community to Celebrate Your History
 
AV Digitization Projects: Tools and Strategies for Enhancing Impact and Engag...
AV Digitization Projects: Tools and Strategies for Enhancing Impact and Engag...AV Digitization Projects: Tools and Strategies for Enhancing Impact and Engag...
AV Digitization Projects: Tools and Strategies for Enhancing Impact and Engag...
 
Challenges, Workflows, and Insights in the Collaboration to Preserve America'...
Challenges, Workflows, and Insights in the Collaboration to Preserve America'...Challenges, Workflows, and Insights in the Collaboration to Preserve America'...
Challenges, Workflows, and Insights in the Collaboration to Preserve America'...
 
AMIA: Examining AV Enterprise at a Regional Academic Archive
AMIA: Examining AV Enterprise at a Regional Academic ArchiveAMIA: Examining AV Enterprise at a Regional Academic Archive
AMIA: Examining AV Enterprise at a Regional Academic Archive
 
A la recherche
 A la recherche A la recherche
A la recherche
 
Planning and Managing Digital Library & Archive Projects
Planning and Managing Digital Library & Archive ProjectsPlanning and Managing Digital Library & Archive Projects
Planning and Managing Digital Library & Archive Projects
 
Webinar: Getting Started with Digitization An Introduction for Libraries-2016...
Webinar: Getting Started with Digitization An Introduction for Libraries-2016...Webinar: Getting Started with Digitization An Introduction for Libraries-2016...
Webinar: Getting Started with Digitization An Introduction for Libraries-2016...
 
Webinar - Lights, Camera, Advocacy to Action: Digital Storytelling for Libraries
Webinar - Lights, Camera, Advocacy to Action: Digital Storytelling for LibrariesWebinar - Lights, Camera, Advocacy to Action: Digital Storytelling for Libraries
Webinar - Lights, Camera, Advocacy to Action: Digital Storytelling for Libraries
 
Keeping the Broadcast Historic Record: An Archive of Public Media in the Making
Keeping the Broadcast Historic Record: An Archive of Public Media in the MakingKeeping the Broadcast Historic Record: An Archive of Public Media in the Making
Keeping the Broadcast Historic Record: An Archive of Public Media in the Making
 
VERDOODT Measuring clouds. A large scale acquisition and preservation service...
VERDOODT Measuring clouds. A large scale acquisition and preservation service...VERDOODT Measuring clouds. A large scale acquisition and preservation service...
VERDOODT Measuring clouds. A large scale acquisition and preservation service...
 
Sharing Economy 2.0 & The Internet of People (IoP) Workshop
Sharing Economy 2.0 & The Internet of People (IoP) WorkshopSharing Economy 2.0 & The Internet of People (IoP) Workshop
Sharing Economy 2.0 & The Internet of People (IoP) Workshop
 
Presentation Slides, “Creating Access to Audio & Video Digital Media: The Va...
Presentation Slides, “Creating Access to Audio & Video Digital Media:  The Va...Presentation Slides, “Creating Access to Audio & Video Digital Media:  The Va...
Presentation Slides, “Creating Access to Audio & Video Digital Media: The Va...
 
2-21-12 Preservation Planning Success Stories Slides
2-21-12 Preservation Planning Success Stories Slides2-21-12 Preservation Planning Success Stories Slides
2-21-12 Preservation Planning Success Stories Slides
 
Webinar: Technology Skills Training Programs for Library Staff - 2016-01-27
Webinar: Technology Skills Training Programs for Library Staff - 2016-01-27Webinar: Technology Skills Training Programs for Library Staff - 2016-01-27
Webinar: Technology Skills Training Programs for Library Staff - 2016-01-27
 
Planning a Successful Digital Project
Planning a Successful Digital ProjectPlanning a Successful Digital Project
Planning a Successful Digital Project
 
Just Digitise It - Daniel Wilksch of the Public Records Office Victoria
Just Digitise It - Daniel Wilksch of the Public Records Office VictoriaJust Digitise It - Daniel Wilksch of the Public Records Office Victoria
Just Digitise It - Daniel Wilksch of the Public Records Office Victoria
 
Television News Search and Analysis with Lucene/Solr
Television News Search and Analysis with Lucene/SolrTelevision News Search and Analysis with Lucene/Solr
Television News Search and Analysis with Lucene/Solr
 
Webinar - Technology Planning Tips for Small Libraries - 2015-08-19
Webinar - Technology Planning Tips for Small Libraries - 2015-08-19Webinar - Technology Planning Tips for Small Libraries - 2015-08-19
Webinar - Technology Planning Tips for Small Libraries - 2015-08-19
 
Navigating the Analog Waves: Digitizing Audio Cassettes for Your Collection
Navigating the Analog Waves: Digitizing Audio Cassettes for Your CollectionNavigating the Analog Waves: Digitizing Audio Cassettes for Your Collection
Navigating the Analog Waves: Digitizing Audio Cassettes for Your Collection
 

More from Reynolds Journalism Institute (RJI)

John Rampton: Advanced content promotion — how the big boys are doing it
John Rampton: Advanced content promotion — how the big boys are doing itJohn Rampton: Advanced content promotion — how the big boys are doing it
John Rampton: Advanced content promotion — how the big boys are doing itReynolds Journalism Institute (RJI)
 
Welsh, Ben: The framework fix: how to build better archives by helping news n...
Welsh, Ben: The framework fix: how to build better archives by helping news n...Welsh, Ben: The framework fix: how to build better archives by helping news n...
Welsh, Ben: The framework fix: how to build better archives by helping news n...Reynolds Journalism Institute (RJI)
 
Zwaard, Kate: Technology and Community: Why we need partners, collaborators a...
Zwaard, Kate: Technology and Community: Why we need partners, collaborators a...Zwaard, Kate: Technology and Community: Why we need partners, collaborators a...
Zwaard, Kate: Technology and Community: Why we need partners, collaborators a...Reynolds Journalism Institute (RJI)
 
Younger, Jennifer: lighntning talk, Digital Preservation: Aggregated, Collabo...
Younger, Jennifer: lighntning talk, Digital Preservation: Aggregated, Collabo...Younger, Jennifer: lighntning talk, Digital Preservation: Aggregated, Collabo...
Younger, Jennifer: lighntning talk, Digital Preservation: Aggregated, Collabo...Reynolds Journalism Institute (RJI)
 

More from Reynolds Journalism Institute (RJI) (20)

Art Holliday, BJ ’76, alumni speaker
Art Holliday, BJ ’76, alumni speakerArt Holliday, BJ ’76, alumni speaker
Art Holliday, BJ ’76, alumni speaker
 
Christopher Guess presents Push
Christopher Guess presents PushChristopher Guess presents Push
Christopher Guess presents Push
 
Hurley Symposium 2017 fake news survey
Hurley Symposium 2017 fake news surveyHurley Symposium 2017 fake news survey
Hurley Symposium 2017 fake news survey
 
Archie Thornton: Content and the Fourth Industrial Revolution
Archie Thornton: Content and the Fourth Industrial RevolutionArchie Thornton: Content and the Fourth Industrial Revolution
Archie Thornton: Content and the Fourth Industrial Revolution
 
Victor Hernandez: 50 things we learned at RJI-Distribution
Victor Hernandez: 50 things we learned at RJI-DistributionVictor Hernandez: 50 things we learned at RJI-Distribution
Victor Hernandez: 50 things we learned at RJI-Distribution
 
Kaizar Campwala: Distribution in service of brand loyalty
Kaizar Campwala: Distribution in service of brand loyaltyKaizar Campwala: Distribution in service of brand loyalty
Kaizar Campwala: Distribution in service of brand loyalty
 
Zahra rasool presentation
Zahra rasool presentationZahra rasool presentation
Zahra rasool presentation
 
Uzo Iweala: Who speaks for Africa
Uzo Iweala: Who speaks for AfricaUzo Iweala: Who speaks for Africa
Uzo Iweala: Who speaks for Africa
 
Adam Falk: Subscribe now or forever hold your audience?
Adam Falk: Subscribe now or forever hold your audience?Adam Falk: Subscribe now or forever hold your audience?
Adam Falk: Subscribe now or forever hold your audience?
 
Ben Norskov and Mohini Duta: Playing news
Ben Norskov and Mohini Duta: Playing newsBen Norskov and Mohini Duta: Playing news
Ben Norskov and Mohini Duta: Playing news
 
Katherine Bell: Beyond the funnel
Katherine Bell: Beyond the funnelKatherine Bell: Beyond the funnel
Katherine Bell: Beyond the funnel
 
John Rampton: Advanced content promotion — how the big boys are doing it
John Rampton: Advanced content promotion — how the big boys are doing itJohn Rampton: Advanced content promotion — how the big boys are doing it
John Rampton: Advanced content promotion — how the big boys are doing it
 
Sarah Hill: The uncanny valley of VR distribution
Sarah Hill: The uncanny valley of VR distributionSarah Hill: The uncanny valley of VR distribution
Sarah Hill: The uncanny valley of VR distribution
 
Alejandro González: The story is your mothership
Alejandro González: The story is your mothershipAlejandro González: The story is your mothership
Alejandro González: The story is your mothership
 
Kari Paul: Brand loyalty as a distribution strategy
Kari Paul: Brand loyalty as a distribution strategyKari Paul: Brand loyalty as a distribution strategy
Kari Paul: Brand loyalty as a distribution strategy
 
Welsh, Ben: The framework fix: how to build better archives by helping news n...
Welsh, Ben: The framework fix: how to build better archives by helping news n...Welsh, Ben: The framework fix: how to build better archives by helping news n...
Welsh, Ben: The framework fix: how to build better archives by helping news n...
 
Skinner, Katherine: Alignment and Reciprocity
Skinner, Katherine: Alignment and ReciprocitySkinner, Katherine: Alignment and Reciprocity
Skinner, Katherine: Alignment and Reciprocity
 
Leetaru, Kalev: The GDELT Project
Leetaru, Kalev: The GDELT ProjectLeetaru, Kalev: The GDELT Project
Leetaru, Kalev: The GDELT Project
 
Zwaard, Kate: Technology and Community: Why we need partners, collaborators a...
Zwaard, Kate: Technology and Community: Why we need partners, collaborators a...Zwaard, Kate: Technology and Community: Why we need partners, collaborators a...
Zwaard, Kate: Technology and Community: Why we need partners, collaborators a...
 
Younger, Jennifer: lighntning talk, Digital Preservation: Aggregated, Collabo...
Younger, Jennifer: lighntning talk, Digital Preservation: Aggregated, Collabo...Younger, Jennifer: lighntning talk, Digital Preservation: Aggregated, Collabo...
Younger, Jennifer: lighntning talk, Digital Preservation: Aggregated, Collabo...
 

Recently uploaded

Israel Palestine Conflict, The issue and historical context!
Israel Palestine Conflict, The issue and historical context!Israel Palestine Conflict, The issue and historical context!
Israel Palestine Conflict, The issue and historical context!Krish109503
 
Kishan Reddy Report To People (2019-24).pdf
Kishan Reddy Report To People (2019-24).pdfKishan Reddy Report To People (2019-24).pdf
Kishan Reddy Report To People (2019-24).pdfKISHAN REDDY OFFICE
 
Enjoy Night⚡Call Girls Rajokri Delhi >༒8448380779 Escort Service
Enjoy Night⚡Call Girls Rajokri Delhi >༒8448380779 Escort ServiceEnjoy Night⚡Call Girls Rajokri Delhi >༒8448380779 Escort Service
Enjoy Night⚡Call Girls Rajokri Delhi >༒8448380779 Escort ServiceDelhi Call girls
 
Enjoy Night⚡Call Girls Iffco Chowk Gurgaon >༒8448380779 Escort Service
Enjoy Night⚡Call Girls Iffco Chowk Gurgaon >༒8448380779 Escort ServiceEnjoy Night⚡Call Girls Iffco Chowk Gurgaon >༒8448380779 Escort Service
Enjoy Night⚡Call Girls Iffco Chowk Gurgaon >༒8448380779 Escort ServiceDelhi Call girls
 
Embed-4.pdf lkdiinlajeklhndklheduhuekjdh
Embed-4.pdf lkdiinlajeklhndklheduhuekjdhEmbed-4.pdf lkdiinlajeklhndklheduhuekjdh
Embed-4.pdf lkdiinlajeklhndklheduhuekjdhbhavenpr
 
Gujarat-SEBCs.pdf pfpkoopapriorjfperjreie
Gujarat-SEBCs.pdf pfpkoopapriorjfperjreieGujarat-SEBCs.pdf pfpkoopapriorjfperjreie
Gujarat-SEBCs.pdf pfpkoopapriorjfperjreiebhavenpr
 
BDSM⚡Call Girls in Sector 143 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 143 Noida Escorts >༒8448380779 Escort ServiceBDSM⚡Call Girls in Sector 143 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 143 Noida Escorts >༒8448380779 Escort ServiceDelhi Call girls
 
30042024_First India Newspaper Jaipur.pdf
30042024_First India Newspaper Jaipur.pdf30042024_First India Newspaper Jaipur.pdf
30042024_First India Newspaper Jaipur.pdfFIRST INDIA
 
28042024_First India Newspaper Jaipur.pdf
28042024_First India Newspaper Jaipur.pdf28042024_First India Newspaper Jaipur.pdf
28042024_First India Newspaper Jaipur.pdfFIRST INDIA
 
Pakistan PMLN Election Manifesto 2024.pdf
Pakistan PMLN Election Manifesto 2024.pdfPakistan PMLN Election Manifesto 2024.pdf
Pakistan PMLN Election Manifesto 2024.pdfFahimUddin61
 
BDSM⚡Call Girls in Greater Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Greater Noida Escorts >༒8448380779 Escort ServiceBDSM⚡Call Girls in Greater Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Greater Noida Escorts >༒8448380779 Escort ServiceDelhi Call girls
 
BDSM⚡Call Girls in Sector 135 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 135 Noida Escorts >༒8448380779 Escort ServiceBDSM⚡Call Girls in Sector 135 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 135 Noida Escorts >༒8448380779 Escort ServiceDelhi Call girls
 
BDSM⚡Call Girls in Indirapuram Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Indirapuram Escorts >༒8448380779 Escort ServiceBDSM⚡Call Girls in Indirapuram Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Indirapuram Escorts >༒8448380779 Escort ServiceDelhi Call girls
 
WhatsApp 📞 8448380779 ✅Call Girls In Chaura Sector 22 ( Noida)
WhatsApp 📞 8448380779 ✅Call Girls In Chaura Sector 22 ( Noida)WhatsApp 📞 8448380779 ✅Call Girls In Chaura Sector 22 ( Noida)
WhatsApp 📞 8448380779 ✅Call Girls In Chaura Sector 22 ( Noida)Delhi Call girls
 
如何办理(BU学位证书)美国贝翰文大学毕业证学位证书
如何办理(BU学位证书)美国贝翰文大学毕业证学位证书如何办理(BU学位证书)美国贝翰文大学毕业证学位证书
如何办理(BU学位证书)美国贝翰文大学毕业证学位证书Fi L
 
2024 03 13 AZ GOP LD4 Gen Meeting Minutes_FINAL.docx
2024 03 13 AZ GOP LD4 Gen Meeting Minutes_FINAL.docx2024 03 13 AZ GOP LD4 Gen Meeting Minutes_FINAL.docx
2024 03 13 AZ GOP LD4 Gen Meeting Minutes_FINAL.docxkfjstone13
 
AI as Research Assistant: Upscaling Content Analysis to Identify Patterns of ...
AI as Research Assistant: Upscaling Content Analysis to Identify Patterns of ...AI as Research Assistant: Upscaling Content Analysis to Identify Patterns of ...
AI as Research Assistant: Upscaling Content Analysis to Identify Patterns of ...Axel Bruns
 
KAHULUGAN AT KAHALAGAHAN NG GAWAING PANSIBIKO.pptx
KAHULUGAN AT KAHALAGAHAN NG GAWAING PANSIBIKO.pptxKAHULUGAN AT KAHALAGAHAN NG GAWAING PANSIBIKO.pptx
KAHULUGAN AT KAHALAGAHAN NG GAWAING PANSIBIKO.pptxjohnandrewcarlos
 
Minto-Morley Reforms 1909 (constitution).pptx
Minto-Morley Reforms 1909 (constitution).pptxMinto-Morley Reforms 1909 (constitution).pptx
Minto-Morley Reforms 1909 (constitution).pptxAwaiskhalid96
 
TDP As the Party of Hope For AP Youth Under N Chandrababu Naidu’s Leadership
TDP As the Party of Hope For AP Youth Under N Chandrababu Naidu’s LeadershipTDP As the Party of Hope For AP Youth Under N Chandrababu Naidu’s Leadership
TDP As the Party of Hope For AP Youth Under N Chandrababu Naidu’s Leadershipanjanibaddipudi1
 

Recently uploaded (20)

Israel Palestine Conflict, The issue and historical context!
Israel Palestine Conflict, The issue and historical context!Israel Palestine Conflict, The issue and historical context!
Israel Palestine Conflict, The issue and historical context!
 
Kishan Reddy Report To People (2019-24).pdf
Kishan Reddy Report To People (2019-24).pdfKishan Reddy Report To People (2019-24).pdf
Kishan Reddy Report To People (2019-24).pdf
 
Enjoy Night⚡Call Girls Rajokri Delhi >༒8448380779 Escort Service
Enjoy Night⚡Call Girls Rajokri Delhi >༒8448380779 Escort ServiceEnjoy Night⚡Call Girls Rajokri Delhi >༒8448380779 Escort Service
Enjoy Night⚡Call Girls Rajokri Delhi >༒8448380779 Escort Service
 
Enjoy Night⚡Call Girls Iffco Chowk Gurgaon >༒8448380779 Escort Service
Enjoy Night⚡Call Girls Iffco Chowk Gurgaon >༒8448380779 Escort ServiceEnjoy Night⚡Call Girls Iffco Chowk Gurgaon >༒8448380779 Escort Service
Enjoy Night⚡Call Girls Iffco Chowk Gurgaon >༒8448380779 Escort Service
 
Embed-4.pdf lkdiinlajeklhndklheduhuekjdh
Embed-4.pdf lkdiinlajeklhndklheduhuekjdhEmbed-4.pdf lkdiinlajeklhndklheduhuekjdh
Embed-4.pdf lkdiinlajeklhndklheduhuekjdh
 
Gujarat-SEBCs.pdf pfpkoopapriorjfperjreie
Gujarat-SEBCs.pdf pfpkoopapriorjfperjreieGujarat-SEBCs.pdf pfpkoopapriorjfperjreie
Gujarat-SEBCs.pdf pfpkoopapriorjfperjreie
 
BDSM⚡Call Girls in Sector 143 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 143 Noida Escorts >༒8448380779 Escort ServiceBDSM⚡Call Girls in Sector 143 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 143 Noida Escorts >༒8448380779 Escort Service
 
30042024_First India Newspaper Jaipur.pdf
30042024_First India Newspaper Jaipur.pdf30042024_First India Newspaper Jaipur.pdf
30042024_First India Newspaper Jaipur.pdf
 
28042024_First India Newspaper Jaipur.pdf
28042024_First India Newspaper Jaipur.pdf28042024_First India Newspaper Jaipur.pdf
28042024_First India Newspaper Jaipur.pdf
 
Pakistan PMLN Election Manifesto 2024.pdf
Pakistan PMLN Election Manifesto 2024.pdfPakistan PMLN Election Manifesto 2024.pdf
Pakistan PMLN Election Manifesto 2024.pdf
 
BDSM⚡Call Girls in Greater Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Greater Noida Escorts >༒8448380779 Escort ServiceBDSM⚡Call Girls in Greater Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Greater Noida Escorts >༒8448380779 Escort Service
 
BDSM⚡Call Girls in Sector 135 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 135 Noida Escorts >༒8448380779 Escort ServiceBDSM⚡Call Girls in Sector 135 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 135 Noida Escorts >༒8448380779 Escort Service
 
BDSM⚡Call Girls in Indirapuram Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Indirapuram Escorts >༒8448380779 Escort ServiceBDSM⚡Call Girls in Indirapuram Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Indirapuram Escorts >༒8448380779 Escort Service
 
WhatsApp 📞 8448380779 ✅Call Girls In Chaura Sector 22 ( Noida)
WhatsApp 📞 8448380779 ✅Call Girls In Chaura Sector 22 ( Noida)WhatsApp 📞 8448380779 ✅Call Girls In Chaura Sector 22 ( Noida)
WhatsApp 📞 8448380779 ✅Call Girls In Chaura Sector 22 ( Noida)
 
如何办理(BU学位证书)美国贝翰文大学毕业证学位证书
如何办理(BU学位证书)美国贝翰文大学毕业证学位证书如何办理(BU学位证书)美国贝翰文大学毕业证学位证书
如何办理(BU学位证书)美国贝翰文大学毕业证学位证书
 
2024 03 13 AZ GOP LD4 Gen Meeting Minutes_FINAL.docx
2024 03 13 AZ GOP LD4 Gen Meeting Minutes_FINAL.docx2024 03 13 AZ GOP LD4 Gen Meeting Minutes_FINAL.docx
2024 03 13 AZ GOP LD4 Gen Meeting Minutes_FINAL.docx
 
AI as Research Assistant: Upscaling Content Analysis to Identify Patterns of ...
AI as Research Assistant: Upscaling Content Analysis to Identify Patterns of ...AI as Research Assistant: Upscaling Content Analysis to Identify Patterns of ...
AI as Research Assistant: Upscaling Content Analysis to Identify Patterns of ...
 
KAHULUGAN AT KAHALAGAHAN NG GAWAING PANSIBIKO.pptx
KAHULUGAN AT KAHALAGAHAN NG GAWAING PANSIBIKO.pptxKAHULUGAN AT KAHALAGAHAN NG GAWAING PANSIBIKO.pptx
KAHULUGAN AT KAHALAGAHAN NG GAWAING PANSIBIKO.pptx
 
Minto-Morley Reforms 1909 (constitution).pptx
Minto-Morley Reforms 1909 (constitution).pptxMinto-Morley Reforms 1909 (constitution).pptx
Minto-Morley Reforms 1909 (constitution).pptx
 
TDP As the Party of Hope For AP Youth Under N Chandrababu Naidu’s Leadership
TDP As the Party of Hope For AP Youth Under N Chandrababu Naidu’s LeadershipTDP As the Party of Hope For AP Youth Under N Chandrababu Naidu’s Leadership
TDP As the Party of Hope For AP Youth Under N Chandrababu Naidu’s Leadership
 

Groeling, Tim: NewsScape: Preserving TV News

  • 1. NewsScape: Preserving TV News Tim Groeling (groeling@ucla.edu)
  • 2. Who Are We? • Prof. Francis Steen: Director, Communication Studies Archive • Me: Leading the analog digitization effort • Predecessor & Archive Founder: Prof. Paul Rosenthal (emeritus) • UCLA Library: Helps support the collection, store files, and host main “public” site (tvnews.library.ucla.edu ) • Other supporters: UCLA Chancellor and Dean of Social Sciences, Arcadia Fund, UCLA Social Sciences Computing, the NSF, the California Endowment, UCLA Office for Instructional Development, and UCLA CCLE.
  • 3. Collections? • Oldest collection: UCLA Campus Speakers, from 1950s-1980s (over 500 audio recordings) • Digitized to coincide with 40th anniversary of my department. • Originally planned to exhibit & host on our website. Target at alumni. • Moved to YouTube: Now most traffic (77%) comes from YouTube search/ suggested videos/browsing. • Issues: commenters & copyright
  • 4. NewsScape • Largest collection: TV news and public affairs programs (local & national) • Started during Watergate (preserve ephemera). Shoestring budget (until recently, only about $10k per year plus volunteer labor & donated equipment) • 1979: Started trying to record all the local and national TV news viewable in LA. • 2006: Started daily straight-to-digital recording. • Since 2006: Added other cities.
  • 5. Pre-2006 Holdings? • Recordings spread across three campus organizations (Comm Studies, Library, and TFT) and at least four on- and off-campus storage sites. • Good records for some portions; very poor records for others. • Not sure how many tapes are in the collection overall. • Even where we know what should be on a tape, some problems with tape, VCR, or schedule.
  • 6. Tapes • Earliest recordings (1970s) Around 500 U-Matic tapes. • Middle period (1979-early 1990s): about 50k hours on Betamax • Late period (1990s-2006): Around 160k hours on VHS, plus some redundancy.
  • 7. Preservation • VHS are actually most threatened, despite being newest tapes. • Coincided with cable TV expansion of news programming: stretched same budget to cover more news programming. • 8 hours per consumer VHS tape (Betas and U-matics were higher quality tapes; less recorded on each tape) • Poor quality consumer-grade VCRs • Limited spot-checking for quality (failing VCRs or poor signal quality not noticed for long stretches). • Originals still in hand, but dead. • Improperly stored (even faculty didn’t have A/C)
  • 8. Cost to Digitize? • Got bids from another archive and commercial providers: $1.5 million (just for first 150k hours of VHS). • Instead, shoestring again. • $20k (and some donated surplus machines) for hardware, software, and furniture for digitization lab. • Run by me, part time lab manager, and 10 work-study students. Steen handles files. • [Shifting to Betamax will be costly, though]
  • 9. Lab Details • 22 digitization stations (VCR, encoder, computer). 3 local RAID file servers and 1 Filemaker Server. • All computers: surplus or eBay Macs (circa 2008) • Encoders: EyeTV using Hauppauge 950q or EyeTV Hybrid hardware MPEG-2 encoders (get CC). Export and sync scripted. • VCRs: After testing, settled on JVC S-VHS VCRs (consumer to pro). • Use pre-printed barcode stickers for inventory. Custom Filemaker database for tracking digitization attempts and quality control. Filemaker Go Mobile (via cell phones) for asset tracking. • Files are quality-checked, compressed to h.264, closed captioning extracted via Hoffman Cluster.
  • 10. Progress • Fall 2015: Process design & workstation configuration testing • Winter 2016: hired students, fixed network issues, and ramped up workstations. • Summer 2016: Filemaker inventory control; two daily shifts. • March-Oct 2016: 4.5k tapes encoded (about 36k hours). • Delaying splitting files into shows.
  • 11. Problems: Lots • Recording/playback VCRs out of spec (solution: quality control aggregation helps find love connection) • Varying program names over time (database tracking alternate show names; day/time/ channel 30-minute bloc) • Buzzing audio? Computer RF interference with VCR audio. (Used dead VCRs as spacers.) • Lot of other problems, but in most cases, just means another encoding attempt.
  • 12. Post 2006
 Straight-to-Digital • 46 networks (US and beyond) • 2,525 Series • Total video files: 383,550 • Duration in hours: 297,596 • Closed caption files: 383,739 • Words in caption files: 2,419,185,351 • OCR files: 371,426 • Words in OCR files: 825,662,597 • Total thumbnail images: 107,134,425 • Storage: 106.93 terabytes • Limited public access link: tvnews.library.ucla.edu
  • 13. Unlocking the Content • Preservation is just first step: Needs to be more than "world’s best DVR" • Want to provide tools to make the collection more useful and relevant beyond UCLA: Help people • Understand TV news, and… • Share what they find
  • 14. Example: Obama • Preservation is just first step: Needs to be more than world’s "best VCR" • Want to provide tools to make the collection more useful and relevant beyond UCLA: Help people • Understand TV news, and… • Share what they find Good to know, but…
  • 15. Tools to Understand News • Not just view, but analyze • Help understand and visualize patterns of news coverage, not just individual stories. Forest, not just trees. • Tools are already being developed, but are complex
  • 16. Tools to Understand Text • Not just view, but analyze • Help understand and visualize patterns of news coverage, not just individual stories (copyright, too) • Tools are already being developed, but are complex
  • 17. Ambitious Goal: Visuals • Text analysis is fairly mature (more than 2 billion words in NewsScape index) • Named entities, parts of speech, topic detection are all working now (sentiment is harder) • Analysis of visuals is challenging. • Facial detection & analysis tools are becoming more useful; scalable
  • 18. Automated Analysis of Visuals • Goal: Be able to understand patterns of visual communication in election news. Hard to study. • Mostly hand-coded & focus on still newspaper or web photos • Trouble scaling to massive volume of images. • Subjectivity • Machine learning and big data as solution • Presented pilot study at this year’s American Political Science Association conference categorizing presidential candidate faces (smiling or not).
  • 19. Face Validity >1714 ~ 1711 ~ 148 ~ 11 < -13 -13 ~ -10 -10 ~ -7 -7 ~ -4 -4 ~ -1 -1 ~ 2 2 ~ 5 5 ~ 8
  • 20. >1714 ~ 1711 ~ 148 ~ 11 < -13 -13 ~ -10 -10 ~ -7 -7 ~ -4 -4 ~ -1 -1 ~ 2 2 ~ 5 5 ~ 8 Face Validity
  • 21.
  • 22. Weekly topic tracking
 (filter by outlet) with 
 metadata (who, what,
 where, how much) Daily topic trajectory. News topics are detected by clustering every day, and then linked the detected topics to generate topic tracking trajectories (Li, Joo, Qi, & Zhu, 2015)
  • 23.
  • 24.
  • 25. Other Goal: Sharing • Help people share what they learn (within bounds of copyright) • Solution #1: Share analysis, rather than raw material • Solution #2: use familiar, copyright-compliant tools to create and share.
  • 26. Social Sharing Tools • Trying to develop two tools: • Animated GIF generator: (short clip; small file; on- screen captioning; easy to play) • "Supercut" generator: Assemble short examples from archives; share compilation
  • 27. Summing Up • Preservation as goal, but also as starting point. • Excited to be able to understand long-term changes in news content & norms. • Lot of work ahead of us. • Appreciate any help or advice (or funding) you can offer.