This document discusses NewsScape, an archive of international television news maintained by UCLA. It contains over 196,000 news program recordings from over 20 sources captured since 2010. Programs are digitized, analyzed, and indexed with closed captions and images to enable search and access. The archive aims to further teaching, research, and publication through an expanding collection of television news content and tools for multi-modal analysis. Key challenges addressed include obtaining and providing access to copyrighted news content while staying within fair use guidelines.
Television news search and analysis with lucene solrlucenerevolution
Presented by Kai Chan | UCLA - See conference video - http://www.lucidimagination.com/devzone/events/conferences/lucene-revolution-2012
UCLA Communication Studies Archive hosts a collection of over 100,000 hours of digital television news, updated daily. Its search engine provides closed captioning search and online streaming of videos. The search engine allows researchers and students in various fields to study television news, images and language usage, in ways that were not possible before. In this presentation, we will show the setup of our Lucene/Solr-powered search engine, as well as how it is being used. We will discuss our work on custom result formats, such as linking search result text to the video at particular timestamps, counting occurrences of words, phrases or patterns, grouping the result by fields such as month or show, and creating interactive charts. We will also discuss our work on extending Lucene’s proximity searches, and creating custom query types, such as segment-enclosed (two or more words, phrases or patterns occurring within a story-based text segment), time-enclosed (two or more words, phrases or patterns occurring within a certain time), and multi-word regular expression queries. Future goals will also be discussed, such as supporting multiple languages, multiple sources (speech-to-text along side closed-captioning text), searching user-contributed and generated metadata (programs that identify story segments, objects in video, etc.), and syntactic tags (such as parts of speech).
Television news search and analysis with lucene solrlucenerevolution
Presented by Kai Chan | UCLA - See conference video - http://www.lucidimagination.com/devzone/events/conferences/lucene-revolution-2012
UCLA Communication Studies Archive hosts a collection of over 100,000 hours of digital television news, updated daily. Its search engine provides closed captioning search and online streaming of videos. The search engine allows researchers and students in various fields to study television news, images and language usage, in ways that were not possible before. In this presentation, we will show the setup of our Lucene/Solr-powered search engine, as well as how it is being used. We will discuss our work on custom result formats, such as linking search result text to the video at particular timestamps, counting occurrences of words, phrases or patterns, grouping the result by fields such as month or show, and creating interactive charts. We will also discuss our work on extending Lucene’s proximity searches, and creating custom query types, such as segment-enclosed (two or more words, phrases or patterns occurring within a story-based text segment), time-enclosed (two or more words, phrases or patterns occurring within a certain time), and multi-word regular expression queries. Future goals will also be discussed, such as supporting multiple languages, multiple sources (speech-to-text along side closed-captioning text), searching user-contributed and generated metadata (programs that identify story segments, objects in video, etc.), and syntactic tags (such as parts of speech).
Television News Search and Analysis with Lucene/Solrlucenerevolution
Presented by Kai Chan | UCLA - See complete conference videos - http://www.lucidimagination.com/devzone/events/conferences/lucene-revolution-2012
UCLA Communication Studies Archive hosts a collection of over 100,000 hours of digital television news, updated daily. Its search engine provides closed captioning search and online streaming of videos. The search engine allows researchers and students in various fields to study television news, images and language usage, in ways that were not possible before. In this presentation, we will show the setup of our Lucene/Solr-powered search engine, as well as how it is being used. We will discuss our work on custom result formats, such as linking search result text to the video at particular timestamps, counting occurrences of words, phrases or patterns, grouping the result by fields such as month or show, and creating interactive charts. We will also discuss our work on extending Lucene’s proximity searches, and creating custom query types, such as segment-enclosed (two or more words, phrases or patterns occurring within a story-based text segment), time-enclosed (two or more words, phrases or patterns occurring within a certain time), and multi-word regular expression queries. Future goals will also be discussed, such as supporting multiple languages, multiple sources (speech-to-text along side closed-captioning text), searching user-contributed and generated metadata (programs that identify story segments, objects in video, etc.), and syntactic tags (such as parts of speech).
Television News Search and Analysis with Lucene/Solrlucenerevolution
Presented by Kai Chan | UCLA - See complete conference videos - http://www.lucidimagination.com/devzone/events/conferences/lucene-revolution-2012
UCLA Communication Studies Archive hosts a collection of over 100,000 hours of digital television news, updated daily. Its search engine provides closed captioning search and online streaming of videos. The search engine allows researchers and students in various fields to study television news, images and language usage, in ways that were not possible before. In this presentation, we will show the setup of our Lucene/Solr-powered search engine, as well as how it is being used. We will discuss our work on custom result formats, such as linking search result text to the video at particular timestamps, counting occurrences of words, phrases or patterns, grouping the result by fields such as month or show, and creating interactive charts. We will also discuss our work on extending Lucene’s proximity searches, and creating custom query types, such as segment-enclosed (two or more words, phrases or patterns occurring within a story-based text segment), time-enclosed (two or more words, phrases or patterns occurring within a certain time), and multi-word regular expression queries. Future goals will also be discussed, such as supporting multiple languages, multiple sources (speech-to-text along side closed-captioning text), searching user-contributed and generated metadata (programs that identify story segments, objects in video, etc.), and syntactic tags (such as parts of speech).
, AV Foundation moves to center stage as the essential media framework on the device, offering support for playing, capturing, and even editing audio and video. Borrowing some of the core ideas from the Mac's QuickTime, while adding many new concepts of its own, AV Foundation offers extraordinary capabilities for application programmers. This talk will offer a high-level overview of what's in AV Foundation, and a taste of what it can do.
DCEU 18: Provisioning and Managing Storage for Docker ContainersDocker, Inc.
Anshul Pundir - Senior Software Engineer, Docker
Anusha Ragunathan - Senior Software Engineer, Docker Inc
In this talk, we will discuss storage concepts related to containers on the Docker platform with the perspective of what is important throughout the lifecycle of an application., We will focus on application provisioning: creating persistent volumes and policies for stateful data and management: replication and failover scenarios, backup/restore, monitoring etc. Through this talk, we will cover the latest storage features and also some of the current and future direction of container storage. Key concepts covered about running stateful applications: - Persistent Volumes - Provisioning (Static vs Topology-aware) - Data Availability (failover with scheduler policies) - Data Protection (using Backup/Restore) - Monitoring (using Prometheus/Grafana dashboards) We will look at each of the characteristics in detail with demos.
A Segmentation based Sequential Pattern Matching for Efficient Video Copy De...SWAMI06
A considerable number of videos are illegal copies or manipulated versions of existing media, making copyright management a complicated process.
Call for Change:-
Today’s widespread video copyright infringement calls for the development of fast and accurate copy-detection algorithms.
As video is the most complex type of digital media, it has so far received the least attention regarding copyright management.
Protect Data:-
Content-based copy detection (CBCD) ,a promising technique for video monitoring and copyright protection.
Extending the Reach of Southern Audiovisual Sourcesekemeyer
The Southern Folklife Collection at the University of North Carolina at Chapel Hill is currently developing a large-scale audiovisual preservation and access program for its archival recordings. This presentation serves as an introduction to the research and development phase carried out this past year, as well as the work to be accomplished over the next three years.
Video and slides synchronized, mp3 and slide download available at URL http://bit.ly/1Rzjtjm.
Josh Evans talks about the Netflix journey of failure, innovation, and ubiquity. He reviews the many facets of globalization then delves deep into the architectural patterns that enable seamless, multi-region traffic management, reliable, fast data propagation, and efficient service infrastructure. The patterns presented are broadly applicable to Internet services with global aspirations. Filmed at qconlondon.com.
Josh Evans is Director of Operations Engineering at Netflix, with experience in e-commerce, playback control services, infrastructure, tools, testing, and operations.
2024.06.01 Introducing a competency framework for languag learning materials ...Sandy Millin
http://sandymillin.wordpress.com/iateflwebinar2024
Published classroom materials form the basis of syllabuses, drive teacher professional development, and have a potentially huge influence on learners, teachers and education systems. All teachers also create their own materials, whether a few sentences on a blackboard, a highly-structured fully-realised online course, or anything in between. Despite this, the knowledge and skills needed to create effective language learning materials are rarely part of teacher training, and are mostly learnt by trial and error.
Knowledge and skills frameworks, generally called competency frameworks, for ELT teachers, trainers and managers have existed for a few years now. However, until I created one for my MA dissertation, there wasn’t one drawing together what we need to know and do to be able to effectively produce language learning materials.
This webinar will introduce you to my framework, highlighting the key competencies I identified from my research. It will also show how anybody involved in language teaching (any language, not just English!), teacher training, managing schools or developing language learning materials can benefit from using the framework.
The French Revolution, which began in 1789, was a period of radical social and political upheaval in France. It marked the decline of absolute monarchies, the rise of secular and democratic republics, and the eventual rise of Napoleon Bonaparte. This revolutionary period is crucial in understanding the transition from feudalism to modernity in Europe.
For more information, visit-www.vavaclasses.com
Synthetic Fiber Construction in lab .pptxPavel ( NSTU)
Synthetic fiber production is a fascinating and complex field that blends chemistry, engineering, and environmental science. By understanding these aspects, students can gain a comprehensive view of synthetic fiber production, its impact on society and the environment, and the potential for future innovations. Synthetic fibers play a crucial role in modern society, impacting various aspects of daily life, industry, and the environment. ynthetic fibers are integral to modern life, offering a range of benefits from cost-effectiveness and versatility to innovative applications and performance characteristics. While they pose environmental challenges, ongoing research and development aim to create more sustainable and eco-friendly alternatives. Understanding the importance of synthetic fibers helps in appreciating their role in the economy, industry, and daily life, while also emphasizing the need for sustainable practices and innovation.
Normal Labour/ Stages of Labour/ Mechanism of LabourWasim Ak
Normal labor is also termed spontaneous labor, defined as the natural physiological process through which the fetus, placenta, and membranes are expelled from the uterus through the birth canal at term (37 to 42 weeks
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...Levi Shapiro
Letter from the Congress of the United States regarding Anti-Semitism sent June 3rd to MIT President Sally Kornbluth, MIT Corp Chair, Mark Gorenberg
Dear Dr. Kornbluth and Mr. Gorenberg,
The US House of Representatives is deeply concerned by ongoing and pervasive acts of antisemitic
harassment and intimidation at the Massachusetts Institute of Technology (MIT). Failing to act decisively to ensure a safe learning environment for all students would be a grave dereliction of your responsibilities as President of MIT and Chair of the MIT Corporation.
This Congress will not stand idly by and allow an environment hostile to Jewish students to persist. The House believes that your institution is in violation of Title VI of the Civil Rights Act, and the inability or
unwillingness to rectify this violation through action requires accountability.
Postsecondary education is a unique opportunity for students to learn and have their ideas and beliefs challenged. However, universities receiving hundreds of millions of federal funds annually have denied
students that opportunity and have been hijacked to become venues for the promotion of terrorism, antisemitic harassment and intimidation, unlawful encampments, and in some cases, assaults and riots.
The House of Representatives will not countenance the use of federal funds to indoctrinate students into hateful, antisemitic, anti-American supporters of terrorism. Investigations into campus antisemitism by the Committee on Education and the Workforce and the Committee on Ways and Means have been expanded into a Congress-wide probe across all relevant jurisdictions to address this national crisis. The undersigned Committees will conduct oversight into the use of federal funds at MIT and its learning environment under authorities granted to each Committee.
• The Committee on Education and the Workforce has been investigating your institution since December 7, 2023. The Committee has broad jurisdiction over postsecondary education, including its compliance with Title VI of the Civil Rights Act, campus safety concerns over disruptions to the learning environment, and the awarding of federal student aid under the Higher Education Act.
• The Committee on Oversight and Accountability is investigating the sources of funding and other support flowing to groups espousing pro-Hamas propaganda and engaged in antisemitic harassment and intimidation of students. The Committee on Oversight and Accountability is the principal oversight committee of the US House of Representatives and has broad authority to investigate “any matter” at “any time” under House Rule X.
• The Committee on Ways and Means has been investigating several universities since November 15, 2023, when the Committee held a hearing entitled From Ivory Towers to Dark Corners: Investigating the Nexus Between Antisemitism, Tax-Exempt Universities, and Terror Financing. The Committee followed the hearing with letters to those institutions on January 10, 202
Biological screening of herbal drugs: Introduction and Need for
Phyto-Pharmacological Screening, New Strategies for evaluating
Natural Products, In vitro evaluation techniques for Antioxidants, Antimicrobial and Anticancer drugs. In vivo evaluation techniques
for Anti-inflammatory, Antiulcer, Anticancer, Wound healing, Antidiabetic, Hepatoprotective, Cardio protective, Diuretics and
Antifertility, Toxicity studies as per OECD guidelines
Acetabularia Information For Class 9 .docxvaibhavrinwa19
Acetabularia acetabulum is a single-celled green alga that in its vegetative state is morphologically differentiated into a basal rhizoid and an axially elongated stalk, which bears whorls of branching hairs. The single diploid nucleus resides in the rhizoid.
Exploiting Artificial Intelligence for Empowering Researchers and Faculty, In...Dr. Vinod Kumar Kanvaria
Exploiting Artificial Intelligence for Empowering Researchers and Faculty,
International FDP on Fundamentals of Research in Social Sciences
at Integral University, Lucknow, 06.06.2024
By Dr. Vinod Kumar Kanvaria
Safalta Digital marketing institute in Noida, provide complete applications that encompass a huge range of virtual advertising and marketing additives, which includes search engine optimization, virtual communication advertising, pay-per-click on marketing, content material advertising, internet analytics, and greater. These university courses are designed for students who possess a comprehensive understanding of virtual marketing strategies and attributes.Safalta Digital Marketing Institute in Noida is a first choice for young individuals or students who are looking to start their careers in the field of digital advertising. The institute gives specialized courses designed and certification.
for beginners, providing thorough training in areas such as SEO, digital communication marketing, and PPC training in Noida. After finishing the program, students receive the certifications recognised by top different universitie, setting a strong foundation for a successful career in digital marketing.
Unit 8 - Information and Communication Technology (Paper I).pdfThiyagu K
This slides describes the basic concepts of ICT, basics of Email, Emerging Technology and Digital Initiatives in Education. This presentations aligns with the UGC Paper I syllabus.
1. UCLA NewsScape: The Archive of
International Television News
A Transformative Approach to Using the News
in Teaching, Research, and Publication
Sharon E. Farb
Todd Grappone
Add slideshare link UCLA
2. What is NewsScape? A UCLA Broadcast
(and more) News Archive
• Expanding archive of over 196,000 distinct recordings
of news programs captured and digitized
• Searching of program-level metadata as well as actual
program content, utilizing the more than 1.1 billion
words of closed-caption texts, as well as on-screen
texts, detected visual shapes, and other attributes of
the audiovisual stream
• Over 11 Billion Images
– 44 million snapshot images, taken once every 10 seconds
to aid in visual navigation of each recording
• Grows at about 1TB per month
3. It all starts with the mission…
UC Policy on Copyright Ownership
Section I.
Preamble
“The creation of copyrighted works is one of the ways the University
fulfills its mission of contributing to the body of knowledge for the
public good. The University encourages the creation of original works of
authorship and the free expression and exchange of ideas.”
http://www.universityofcalifornia.edu/copyright/systemwide/pcoi.html
Intellectual Property in the Digital Age
Series
4. The Mission of Higher Education
• 4 Principles (Pelikan, 1992)
• The advancement of knowledge through research
• The extension of knowledge through teaching
• The preservation of knowledge in libraries, galleries and
museums
• The diffusion of knowledge through scholarly publication
• Jaroslav Pelikan, The Idea of the University: A Reexamination, 1992
CLIR Postdoctoral Fellows 2008
5. UCLA TV News Archive Infrastructure
Web Searching & browsing
Account Access
server control serve Playback requests Users
r
Audio
and
video
Search
index &
DB
Scheduler
Capture Authorized
videos
Encoder Requests
Encoded Low-
video
latency Streaming
storage Videos server
Snapshots
Analysis
scripts
Remote High-
backup Mirroring capacity
storage
6. Sources of TV news videos
Terrestrial
broadcast,
cable, or
satellite TV
signal.
Contains
video, audio
and closed
captioning
streams.
7. Video capture scheduling and monitoring
Audio
and
video
Scheduler
Capture
Scheduler interface to select programs for recording
8. Video capture, encoding, and storage
Audio
and
video
Raw ATSC video (approx. 60-70 GB/hour)
compressed to
Scheduler
Capture H.264 448x336, AAC at 96 Kbps (250 MB/hour)
Encoder
Encoded Low-
video
latency Isilon X200
storage network-attached
Snapshots
Analysis disk storage array
scripts
Images taken at 10-second intervals to enable visual navigation
9. Metadata updates
TOP|20110204130001|2011-02-
04_0500_KCET_BBC_World_News
COL|Communication Studies Archive, UCLA
UID|d9149328-3062-11e0-8555-001517add60e
Audio DUR|0:29:49.48
and
video
Search
The Apache Solr search index
index &
is updated in real-time
DB
Scheduler
Capture
CC1|20110204130013|>> THIS IS "BBC WORLD NEWS."
CC1|20110204130016|FUNDING FOR THIS PRESENTATION
Encoder CC1|20110204130018|IS MADE POSSIBLE BY THE FREEMAN
CC1|20110204130023|FOUNDATION OF NEW YORK, STOWE,
Encoded Low-
videoCC1|20110204130024|VERMONT, AND HONOLULU.
latency
CC1|20110204130027|NEWMAN'S OWN FOUNDATION.
storage
CC1|20110204130028|THE JOHN D. AND CATHERINE T.
CC1|20110204130029|MACARTHUR FOUNDATION.
Snapshots
Analysis
CC1|20110204130030|AND UNION BANK.
scripts CC1|20110204130058|>> AND NOW, "BBC WORLD NEWS."
CC1|20110204130104|>> EGYPT'S PROTESTERS CALL THIS
CC1|20110204130110|THE DAY OF DEPARTURE.
CC1|20110204130111|THEY WANT THE PRESIDENT TO STEP
CC1|20110204130113|DOWN IMMEDIATELY.
10. Video data backup
Audio
and
video
Search
index &
DB
Scheduler
Capture
Encoder
Encoded Low-
video
latency
storage
Snapshots
Analysis
scripts
Remote High-
backup Mirroring capacity
storage
11. User interaction and video selection
Web Searching & browsing
serve Users
r
Audio
and
video
Search
index &
DB
Scheduler
Capture Authorized
videos
Encoder
Encoded Low-
video
latency
storage
Snapshots
Analysis
scripts
Remote High-
backup Mirroring capacity
storage
13. User authentication and video viewing
Web Searching & browsing
Account Access
server control serve Playback requests Users
r
Audio
and
video
Search
index &
DB
Scheduler
Capture Authorized
videos
Encoder Requests
Encoded Low-
video
latency Streaming
storage Videos server
Snapshots
Analysis
scripts
Remote High-
backup Mirroring capacity
storage
15. What is news, n. ?*
• New things; novelties. Obs.
• The report or account of recent (esp. important or
interesting) events or occurrences, brought or coming
to one as new information; new occurrences as a
subject of report or talk; tidings.
• With sing. concord. Now esp. such information as
published or broadcast.
• As predicate: a person, thing, or place regarded as
worthy of discussion or of reporting by the media.
* Oxford English Dictionary Online
16.
17. All material must be used within Title 17 USC 108 (f) (3)
Core Collection:
ABC, CBS, NBC, CNN,
Fox News, special
news such as
Watergate, 9/11, etc
Loan requests receive
DVDs
Fees are charged to
cover costs
18. Section 108 Study Group Report Executive
Summary Television News Exemption
• The television news exemption should be amended
to allow libraries and achieves to transmit view-only
copies of television news programs electronically by
streaming and similar technologies to other section
108 eligible libraries and archives for purposes of
private study, scholarship, or research under certain
conditions, and after a reasonable period has passed
since the original transmission.
• Any amendment should not include an exception
permitting libraries and archived to transmit
downloadable copies.
19. Internet Archive: Search
and DVD Borrowing
350,000 news programs collected over
3 years from national U.S. networks
and stations in San Francisco and
Washington D.C.
The archive is updated with new
broadcasts 24 hours after they are
aired.
21 Networks
20. Fair Use: Tranformativeness
Beyond Vanderbilt
4 Factors 17 USC 107 Transformativeness
• The purpose and character • Did the unlicensed use
of the use “transform” the
• The nature of the copyrighted material by
copyrighted work using it for a different
• The amount and purpose?
substantiality of the • Was the amount and
portion used nature of the material used
• The effect of the use on the appropriate?
market or value of the
copyrighted work
21. UCLA Transformativeness
Post Capture Processing
• Selection of sources useful
in teaching and research
• Video capture
• Metadata updates
• Archive search
• Tool development
• Etc
• etc
22. NewsScape as a research archive
Mixing with other digital collections.
23. Use in Research/Best Practices
• Use Cases news archive in teaching and research
• Multi-modal research and teaching
• Comparative studies
• Using copyrighted material as the object of a
social, political, or cultural critique
• Using copyrighted material for illustration or
example
• Capturing copyrighted material incidentally or
accidentally
• Capturing, Reproducing, to memorlize or
preserve
24.
25. • “Libraries of all kinds during the centuries of their existence have
had a common objective—one so generally accepted that it is
seldom made explicit. It is the conservation and organization of
the world’s resources of recorded thought and fact so as to make
them available for present and future users.”
• Robert D. Leigh, The Public Library in the United States, 1950
27. References
• 17 USC 107
• 17 USC 108
• Bill Graham Archives v. Dorling Kindersley Limited (2nd Cir. 2006)
448 F. 3d. 605
• Code of Best Practices in Fair Use for Online Video—Center for
Social Media
• Code of Best Practices in Fair Use for Academic and Research
Libraries
• Library Copyright Alliance Brief on Streaming of Films for
Educational Purposes
• Jaszi, Peter. Reclaiming Fair Use, University of Chicago Press,
2011
• Netanel, Neil W., Making Sense of Fair Use. Lewis & Clark Law
Review, Vol. 15; UCLA School of Law
Editor's Notes
Project history – Dates back to watergate digital iteratinos for the past 7 years.Presentation will cover – sharon : legal and technical issues todd: tech scholarly output roshoman style storytelling History with the archive; legal issues and tech issues lead to Dynamic dou
Controlled chaaos:About 100 us channels 40 channels from the US, UK, France, and Russia are currently included on the recording schedule; there are also a couple of Internet sources~150 regularly scheduled TV news shows are recorded per dayYoutube – political addsBroadcast news is of interest to all sorts of scholarsCollecting news paradigm shift: collections & social media
MPAA RIAA
Capture 100 us news stations 40 international news archives: chech,russian
Programms plus closed captiones. Non-scheduled programs are captured. Off youtu=be as well.
Pgrogamatically
Story segmentation
Apis
About 60 TB of data gorws at about 1 TB a month.
Results page with embeded player. Clips are currently delivered in moodle as well as hereMontage, text, video
Internet Archive
Another news arvhice and partner. IA Helpoing with digitization of offline content.
Juan in a Hundred anecdote
Strong library mission to create news archivesHoles in our collection. Faculty participation is key. If you havent driven patron driven digital library