SlideShare a Scribd company logo
1 of 12
Download to read offline
VIDEO TO TEXT SUMMARIZER
USING AI
DONE BY
DHINAGARAN P
GOKULNATH G
PERINBAN M
SUDHARSAN M
PROJECT GUIDE
DR . S V MANISEKARAN
ASSISTANT PROFESSOR
DEPARTMENT OF INFORMATION TECHNOLOGY
ANNA UNIVERSITY REGIONAL CAMPUS
COIMBATORE
ABSTRACT
● Video to Text Summarizer is the process of converting a large video to text. And
this has been summarized in the smaller one. It helps for various time consuming
process to save the time.
● In this mini project, the various Techniques and Algorithms have been used for the
summarization. You will get exact summarization of a particular video and this will
help to learn more things in easy way and understandable manner.
● The video-to-text Summarizer provides numerous applications including efficient
content browsing, video indexing, and information retrieval. Users can quickly
obtain an overview of video content, enabling them to make informed decisions
about further engagement or to locate specific information within a large video
dataset.
OVERALL DESIGN
ALGORITHM - TextRank
● Textrank is a graph-based ranking algorithm like Google’s Pagerank
algorithm which has been successfully implemented in citation analysis.
● It is used for keyword extraction , automated text summarization and
phrase ranking.
● The textrank algorithm ( keyword extraction / sentence ranking)
constructs a word network by looking which words follow one another
and setting a link between two words if they follow one another, the link
getting a higher weight if these two words occur frequently next to each
other.
SYSTEM IMPLEMENTATION
Data Acquisition Module: This module handles the acquisition of video data from various
sources, such as YouTube or local video files. It may include functionalities for downloading videos,
extracting audio, and handling different video formats.
Speech Recognition Module: This module is responsible for converting the audio content of
the videos into text. It utilizes speech recognition algorithms or services to transcribe the spoken
words into textual form.
Text Processing Module: This module processes the transcribed text to extract key information. It
may include functionalities such as text cleaning, tokenization, part-of speech tagging, named
entity recognition, and syntactic parsing.
Summarization Module: This module generates concise summaries based on the processed
text. It utilizes summarization algorithms, which could be extractive (selecting important
sentences or phrases) or abstractive (generating new sentences) in nature
TOOLS REQUIRED
Python: The code is written in Python, a widely used programming language known for
its simplicity and versatility.
Speech Recognition library: The speech recognition library in Python provides an
interface to access multiple speech recognition APIs and services.
Hugging Face's Transformers library: The transformers library, specifically the
pipeline module, is part of the Hugging Face's Transformers library. It enables easy usage
of pre-trained models for natural language processing tasks, such as summarization.
EXPERIMENTAL RESULTS
ADVANTAGES
Efficient information extraction: Video-to-text summarizers can quickly analyze and
extract key information from videos, saving users time and effort.
Easy navigation and referencing: Once a video is converted into text, the summarized
version can be easily scanned and searched for specific keywords or sections of interest.
Accessibility and inclusivity: Text-based summaries make video content more
accessible to individuals with hearing impairments or those who prefer reading over
watching videos
DISADVANTAGES
Quality of the video source: The quality of the video source can impact the accuracy
and effectiveness of video-to-text summarization. Videos with poor audio quality, low
resolution, or unclear visuals may result in less accurate transcriptions and subsequently
affect the quality of the text summary.
Internet Connectivity: During the process of summarization there should be a good
internet connection is must otherwise the process take time.
Video Duration: The Video Duration is also a important factor if the duration of the
summarizing video is larger it takes more time to summarize
REFERENCES
1. "Transcribing Video Content to Text" by Haowen Xu, Shih-Fu Chang, and Tat-Seng
Chua. (2019) - This paper introduces a method for transcribing video content to text using
a multimodal approach, combining visual and auditory cues.
2. "Video Summarization Using Deep Semantic Features" by Luming Tang, Xinxiao Wu,
and Wenjing Jia. (2018) - The authors propose a video summarization technique that
utilizes deep semantic features to extract key information from videos and generate
concise summaries.
3. "Unsupervised Video Summarization with Adversarial LSTM Networks" by Mahnaz
Koupaee and William Yang Wang. (2018) - This paper presents an unsupervised video
summarization method that employs adversarial LSTM networks to generate summaries
by selecting representative frames.
REFERENCES
4. "Video Summarization by Learning from Unpaired Data" by Debidatta Dwibedi, Yusuf
Aytar, Jonathan Tompson, Pierre Sermanet, and Andrew Zisserman. (2018) - The authors
propose a video summarization approach that learns from unpaired videos and their
corresponding titles, leveraging a combination of deep reinforcement learning and
unsupervised learning.
5. "Deep Reinforcement Learning for Unsupervised Video Summarization with
Diversity-Representativeness Reward" by Yu Gong, Qiang Zhang, and Ming-Hsuan
Yang. (2017) - This paper introduces a deep reinforcement learning framework for
unsupervised video summarization
THANK YOU

More Related Content

Similar to VIDEO TO TEXT SUMMARIZER USING AI.pdf

IDE Code Compiler for the physically challenged (Deaf, Blind & Mute)
IDE Code Compiler for the physically challenged (Deaf, Blind & Mute)IDE Code Compiler for the physically challenged (Deaf, Blind & Mute)
IDE Code Compiler for the physically challenged (Deaf, Blind & Mute)IRJET Journal
 
Real Time Direct Speech-to-Speech Translation
Real Time Direct Speech-to-Speech TranslationReal Time Direct Speech-to-Speech Translation
Real Time Direct Speech-to-Speech TranslationIRJET Journal
 
SANGEETA_YADAV_AI_VIDEO_SUMMARIZER_WEB_APP.pptx
SANGEETA_YADAV_AI_VIDEO_SUMMARIZER_WEB_APP.pptxSANGEETA_YADAV_AI_VIDEO_SUMMARIZER_WEB_APP.pptx
SANGEETA_YADAV_AI_VIDEO_SUMMARIZER_WEB_APP.pptxSangeetaYadav843179
 
Video content analysis and retrieval system using video storytelling and inde...
Video content analysis and retrieval system using video storytelling and inde...Video content analysis and retrieval system using video storytelling and inde...
Video content analysis and retrieval system using video storytelling and inde...IJECEIAES
 
IRJET- Transcription of Conferences
IRJET- Transcription of ConferencesIRJET- Transcription of Conferences
IRJET- Transcription of ConferencesIRJET Journal
 
Automatic Subtitle Generation for Sound in Videos
Automatic Subtitle Generation for Sound in VideosAutomatic Subtitle Generation for Sound in Videos
Automatic Subtitle Generation for Sound in VideosIRJET Journal
 
Automatic Subtitle Generation For Sound In Videos
Automatic Subtitle Generation For Sound In VideosAutomatic Subtitle Generation For Sound In Videos
Automatic Subtitle Generation For Sound In VideosAsia Smith
 
A Study on FFmpeg Multimedia Framework
A Study on FFmpeg Multimedia FrameworkA Study on FFmpeg Multimedia Framework
A Study on FFmpeg Multimedia Frameworkijtsrd
 
Speech To Speech Translation
Speech To Speech TranslationSpeech To Speech Translation
Speech To Speech TranslationIRJET Journal
 
Key frame extraction methodology for video annotation
Key frame extraction methodology for video annotationKey frame extraction methodology for video annotation
Key frame extraction methodology for video annotationIAEME Publication
 
IRJET- Communication System for Blind, Deaf and Dumb People using Internet of...
IRJET- Communication System for Blind, Deaf and Dumb People using Internet of...IRJET- Communication System for Blind, Deaf and Dumb People using Internet of...
IRJET- Communication System for Blind, Deaf and Dumb People using Internet of...IRJET Journal
 
AI POWERED TRANSCRIBER TO RECORD THE CONFERENCE PROCEEDINGS
AI POWERED TRANSCRIBER TO RECORD THE CONFERENCE PROCEEDINGSAI POWERED TRANSCRIBER TO RECORD THE CONFERENCE PROCEEDINGS
AI POWERED TRANSCRIBER TO RECORD THE CONFERENCE PROCEEDINGSIRJET Journal
 
What are close captioning services.pptx
What are close captioning services.pptxWhat are close captioning services.pptx
What are close captioning services.pptxjaya660272
 
IRJET - Optical Character Recognition and Translation
IRJET -  	  Optical Character Recognition and TranslationIRJET -  	  Optical Character Recognition and Translation
IRJET - Optical Character Recognition and TranslationIRJET Journal
 
IRJET- Voice based Billing System
IRJET-  	  Voice based Billing SystemIRJET-  	  Voice based Billing System
IRJET- Voice based Billing SystemIRJET Journal
 
Rosinski ibm ai overview with several examples of projects in the media and l...
Rosinski ibm ai overview with several examples of projects in the media and l...Rosinski ibm ai overview with several examples of projects in the media and l...
Rosinski ibm ai overview with several examples of projects in the media and l...FIAT/IFTA
 
SUBJECTIVE QUALITY EVALUATION OF H.264 AND H.265 ENCODED VIDEO SEQUENCES STRE...
SUBJECTIVE QUALITY EVALUATION OF H.264 AND H.265 ENCODED VIDEO SEQUENCES STRE...SUBJECTIVE QUALITY EVALUATION OF H.264 AND H.265 ENCODED VIDEO SEQUENCES STRE...
SUBJECTIVE QUALITY EVALUATION OF H.264 AND H.265 ENCODED VIDEO SEQUENCES STRE...ijma
 
Subjective Quality Evaluation of H.264 and H.265 Encoded Video Sequences Stre...
Subjective Quality Evaluation of H.264 and H.265 Encoded Video Sequences Stre...Subjective Quality Evaluation of H.264 and H.265 Encoded Video Sequences Stre...
Subjective Quality Evaluation of H.264 and H.265 Encoded Video Sequences Stre...ijma
 
IRJET- Audio Data Summarization System using Natural Language Processing
IRJET- Audio Data Summarization System using Natural Language ProcessingIRJET- Audio Data Summarization System using Natural Language Processing
IRJET- Audio Data Summarization System using Natural Language ProcessingIRJET Journal
 

Similar to VIDEO TO TEXT SUMMARIZER USING AI.pdf (20)

Sub1577
Sub1577Sub1577
Sub1577
 
IDE Code Compiler for the physically challenged (Deaf, Blind & Mute)
IDE Code Compiler for the physically challenged (Deaf, Blind & Mute)IDE Code Compiler for the physically challenged (Deaf, Blind & Mute)
IDE Code Compiler for the physically challenged (Deaf, Blind & Mute)
 
Real Time Direct Speech-to-Speech Translation
Real Time Direct Speech-to-Speech TranslationReal Time Direct Speech-to-Speech Translation
Real Time Direct Speech-to-Speech Translation
 
SANGEETA_YADAV_AI_VIDEO_SUMMARIZER_WEB_APP.pptx
SANGEETA_YADAV_AI_VIDEO_SUMMARIZER_WEB_APP.pptxSANGEETA_YADAV_AI_VIDEO_SUMMARIZER_WEB_APP.pptx
SANGEETA_YADAV_AI_VIDEO_SUMMARIZER_WEB_APP.pptx
 
Video content analysis and retrieval system using video storytelling and inde...
Video content analysis and retrieval system using video storytelling and inde...Video content analysis and retrieval system using video storytelling and inde...
Video content analysis and retrieval system using video storytelling and inde...
 
IRJET- Transcription of Conferences
IRJET- Transcription of ConferencesIRJET- Transcription of Conferences
IRJET- Transcription of Conferences
 
Automatic Subtitle Generation for Sound in Videos
Automatic Subtitle Generation for Sound in VideosAutomatic Subtitle Generation for Sound in Videos
Automatic Subtitle Generation for Sound in Videos
 
Automatic Subtitle Generation For Sound In Videos
Automatic Subtitle Generation For Sound In VideosAutomatic Subtitle Generation For Sound In Videos
Automatic Subtitle Generation For Sound In Videos
 
A Study on FFmpeg Multimedia Framework
A Study on FFmpeg Multimedia FrameworkA Study on FFmpeg Multimedia Framework
A Study on FFmpeg Multimedia Framework
 
Speech To Speech Translation
Speech To Speech TranslationSpeech To Speech Translation
Speech To Speech Translation
 
Key frame extraction methodology for video annotation
Key frame extraction methodology for video annotationKey frame extraction methodology for video annotation
Key frame extraction methodology for video annotation
 
IRJET- Communication System for Blind, Deaf and Dumb People using Internet of...
IRJET- Communication System for Blind, Deaf and Dumb People using Internet of...IRJET- Communication System for Blind, Deaf and Dumb People using Internet of...
IRJET- Communication System for Blind, Deaf and Dumb People using Internet of...
 
AI POWERED TRANSCRIBER TO RECORD THE CONFERENCE PROCEEDINGS
AI POWERED TRANSCRIBER TO RECORD THE CONFERENCE PROCEEDINGSAI POWERED TRANSCRIBER TO RECORD THE CONFERENCE PROCEEDINGS
AI POWERED TRANSCRIBER TO RECORD THE CONFERENCE PROCEEDINGS
 
What are close captioning services.pptx
What are close captioning services.pptxWhat are close captioning services.pptx
What are close captioning services.pptx
 
IRJET - Optical Character Recognition and Translation
IRJET -  	  Optical Character Recognition and TranslationIRJET -  	  Optical Character Recognition and Translation
IRJET - Optical Character Recognition and Translation
 
IRJET- Voice based Billing System
IRJET-  	  Voice based Billing SystemIRJET-  	  Voice based Billing System
IRJET- Voice based Billing System
 
Rosinski ibm ai overview with several examples of projects in the media and l...
Rosinski ibm ai overview with several examples of projects in the media and l...Rosinski ibm ai overview with several examples of projects in the media and l...
Rosinski ibm ai overview with several examples of projects in the media and l...
 
SUBJECTIVE QUALITY EVALUATION OF H.264 AND H.265 ENCODED VIDEO SEQUENCES STRE...
SUBJECTIVE QUALITY EVALUATION OF H.264 AND H.265 ENCODED VIDEO SEQUENCES STRE...SUBJECTIVE QUALITY EVALUATION OF H.264 AND H.265 ENCODED VIDEO SEQUENCES STRE...
SUBJECTIVE QUALITY EVALUATION OF H.264 AND H.265 ENCODED VIDEO SEQUENCES STRE...
 
Subjective Quality Evaluation of H.264 and H.265 Encoded Video Sequences Stre...
Subjective Quality Evaluation of H.264 and H.265 Encoded Video Sequences Stre...Subjective Quality Evaluation of H.264 and H.265 Encoded Video Sequences Stre...
Subjective Quality Evaluation of H.264 and H.265 Encoded Video Sequences Stre...
 
IRJET- Audio Data Summarization System using Natural Language Processing
IRJET- Audio Data Summarization System using Natural Language ProcessingIRJET- Audio Data Summarization System using Natural Language Processing
IRJET- Audio Data Summarization System using Natural Language Processing
 

Recently uploaded

Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝soniya singh
 
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130Suhani Kapoor
 
Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024hassan khalil
 
GDSC ASEB Gen AI study jams presentation
GDSC ASEB Gen AI study jams presentationGDSC ASEB Gen AI study jams presentation
GDSC ASEB Gen AI study jams presentationGDSCAESB
 
What are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptxWhat are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptxwendy cai
 
Introduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptxIntroduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptxupamatechverse
 
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat
 
Internship report on mechanical engineering
Internship report on mechanical engineeringInternship report on mechanical engineering
Internship report on mechanical engineeringmalavadedarshan25
 
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Dr.Costas Sachpazis
 
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...Soham Mondal
 
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptxDecoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptxJoão Esperancinha
 
Coefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxCoefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxAsutosh Ranjan
 
ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...
ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...
ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...ZTE
 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCall Girls in Nagpur High Profile
 
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube ExchangerStudy on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube ExchangerAnamika Sarkar
 

Recently uploaded (20)

Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
 
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
 
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCRCall Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
 
9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf
9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf
9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf
 
Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024
 
GDSC ASEB Gen AI study jams presentation
GDSC ASEB Gen AI study jams presentationGDSC ASEB Gen AI study jams presentation
GDSC ASEB Gen AI study jams presentation
 
What are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptxWhat are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptx
 
Introduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptxIntroduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptx
 
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
 
Internship report on mechanical engineering
Internship report on mechanical engineeringInternship report on mechanical engineering
Internship report on mechanical engineering
 
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
 
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
 
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
 
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptxDecoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
 
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
 
Coefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxCoefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptx
 
ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...
ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...
ZXCTN 5804 / ZTE PTN / ZTE POTN / ZTE 5804 PTN / ZTE POTN 5804 ( 100/200 GE Z...
 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
 
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube ExchangerStudy on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
 

VIDEO TO TEXT SUMMARIZER USING AI.pdf

  • 1. VIDEO TO TEXT SUMMARIZER USING AI DONE BY DHINAGARAN P GOKULNATH G PERINBAN M SUDHARSAN M PROJECT GUIDE DR . S V MANISEKARAN ASSISTANT PROFESSOR DEPARTMENT OF INFORMATION TECHNOLOGY ANNA UNIVERSITY REGIONAL CAMPUS COIMBATORE
  • 2. ABSTRACT ● Video to Text Summarizer is the process of converting a large video to text. And this has been summarized in the smaller one. It helps for various time consuming process to save the time. ● In this mini project, the various Techniques and Algorithms have been used for the summarization. You will get exact summarization of a particular video and this will help to learn more things in easy way and understandable manner. ● The video-to-text Summarizer provides numerous applications including efficient content browsing, video indexing, and information retrieval. Users can quickly obtain an overview of video content, enabling them to make informed decisions about further engagement or to locate specific information within a large video dataset.
  • 4. ALGORITHM - TextRank ● Textrank is a graph-based ranking algorithm like Google’s Pagerank algorithm which has been successfully implemented in citation analysis. ● It is used for keyword extraction , automated text summarization and phrase ranking. ● The textrank algorithm ( keyword extraction / sentence ranking) constructs a word network by looking which words follow one another and setting a link between two words if they follow one another, the link getting a higher weight if these two words occur frequently next to each other.
  • 5. SYSTEM IMPLEMENTATION Data Acquisition Module: This module handles the acquisition of video data from various sources, such as YouTube or local video files. It may include functionalities for downloading videos, extracting audio, and handling different video formats. Speech Recognition Module: This module is responsible for converting the audio content of the videos into text. It utilizes speech recognition algorithms or services to transcribe the spoken words into textual form. Text Processing Module: This module processes the transcribed text to extract key information. It may include functionalities such as text cleaning, tokenization, part-of speech tagging, named entity recognition, and syntactic parsing. Summarization Module: This module generates concise summaries based on the processed text. It utilizes summarization algorithms, which could be extractive (selecting important sentences or phrases) or abstractive (generating new sentences) in nature
  • 6. TOOLS REQUIRED Python: The code is written in Python, a widely used programming language known for its simplicity and versatility. Speech Recognition library: The speech recognition library in Python provides an interface to access multiple speech recognition APIs and services. Hugging Face's Transformers library: The transformers library, specifically the pipeline module, is part of the Hugging Face's Transformers library. It enables easy usage of pre-trained models for natural language processing tasks, such as summarization.
  • 8. ADVANTAGES Efficient information extraction: Video-to-text summarizers can quickly analyze and extract key information from videos, saving users time and effort. Easy navigation and referencing: Once a video is converted into text, the summarized version can be easily scanned and searched for specific keywords or sections of interest. Accessibility and inclusivity: Text-based summaries make video content more accessible to individuals with hearing impairments or those who prefer reading over watching videos
  • 9. DISADVANTAGES Quality of the video source: The quality of the video source can impact the accuracy and effectiveness of video-to-text summarization. Videos with poor audio quality, low resolution, or unclear visuals may result in less accurate transcriptions and subsequently affect the quality of the text summary. Internet Connectivity: During the process of summarization there should be a good internet connection is must otherwise the process take time. Video Duration: The Video Duration is also a important factor if the duration of the summarizing video is larger it takes more time to summarize
  • 10. REFERENCES 1. "Transcribing Video Content to Text" by Haowen Xu, Shih-Fu Chang, and Tat-Seng Chua. (2019) - This paper introduces a method for transcribing video content to text using a multimodal approach, combining visual and auditory cues. 2. "Video Summarization Using Deep Semantic Features" by Luming Tang, Xinxiao Wu, and Wenjing Jia. (2018) - The authors propose a video summarization technique that utilizes deep semantic features to extract key information from videos and generate concise summaries. 3. "Unsupervised Video Summarization with Adversarial LSTM Networks" by Mahnaz Koupaee and William Yang Wang. (2018) - This paper presents an unsupervised video summarization method that employs adversarial LSTM networks to generate summaries by selecting representative frames.
  • 11. REFERENCES 4. "Video Summarization by Learning from Unpaired Data" by Debidatta Dwibedi, Yusuf Aytar, Jonathan Tompson, Pierre Sermanet, and Andrew Zisserman. (2018) - The authors propose a video summarization approach that learns from unpaired videos and their corresponding titles, leveraging a combination of deep reinforcement learning and unsupervised learning. 5. "Deep Reinforcement Learning for Unsupervised Video Summarization with Diversity-Representativeness Reward" by Yu Gong, Qiang Zhang, and Ming-Hsuan Yang. (2017) - This paper introduces a deep reinforcement learning framework for unsupervised video summarization