SlideShare a Scribd company logo
1 of 11
Real-time Video Copy Detection Based on
Hadoop
Hardik Parmar
Sanket Thakur
Pranav Sangam
Sachin Tripathi
ABSTRACT:
With the development of multimedia technology and Internet, the amount of videos in the Internet is
increasing quickly.
Among the large amount of videos in the Internet, a considerable number of them are copies of original
videos, which are simply revised versions of the original ones.
Introduction
• Introduction to Video Copy Detection
Due to rapid development of multimedia hardware and software technologies, the cost of image
and video data collection, creation, and storage is becoming low.
Among these huge volumes of videos, there exist large numbers of copies.
Introduction to Hadoop Platform
• Hadoop was developed by the Apache Foundation. IT consists of map reduce model.
• MapReduce is the programming model of Hadoop which includes Map function and Reduce
function.
Proposed System
We propose a video copy detection using method based on Brightness sequence and the method
based on TIRI-DCT algorithm.
High accuracy in locating copies.
Very reliable for detecting copied videos.
Advantages Of Proposed System
1. The performance on detecting copies from large data set is satisfactory and Hadoop platform
can significantly improve the efficiency of video copy detection.
2. The proposed system high fault tolerance, high throughput, easy scalability and etc.
3. The measurement of copy detections performance System making result is faster than
existing system.
4. The proposed system video hashing algorithm methods has strong robustness, high
distinction, high compactness and low complexity.
Flow Chart
:
Yes
NO
YES
NO
Start
FFMPEG Transcoding
Close
Upload/ Querying Video Process
Calculate images hash value
Convert video into
pictures in the form of
frames
Create hash value library
Calculate distance between hash values
Match hash value
Enter Username
and Password
FFMPEG Transcoding
Training Videos
Calculate images hash value
Convert video into
pictures in the form of
frames
Stored in HDFS
Create hash value library
Database
Distance value
< Threshold
value
Detection of video copy & Reject video
Upload into database
Hardware and Software requirements
• Hardware:
Processor: Pentium 4
RAM: 4GB or more
Hard disk: 16 GB or more
• Software Specification:
Windows Operating System.
Eclipse
NetBeans
Java
Apache Tomcat Server
MySQL
Hadoop
Applications
• Using the same scenario of system, we can implement following mobile
applications:
1. For Government Services Advertisement Video.
2. For Online Shopping System Advertisement Video.
3. For Video Upload in private own cloud.
CONCLUSION
• In this paper, two video copy detection methods, the method based on brightness sequence and the
method based on TIRI-DCT are implemented and the recalls and precisions of the two methods with
different video numbers and different thresholds are analyzed.
• The algorithms are implemented on Hadoop distributed computing platform and the efficiencies are
compared in different video amounts and different map amounts.
THANK YOU!

More Related Content

Similar to Real time video copy detection based on hadoop

A Study on FFmpeg Multimedia Framework
A Study on FFmpeg Multimedia FrameworkA Study on FFmpeg Multimedia Framework
A Study on FFmpeg Multimedia Frameworkijtsrd
 
Architecting a Video Encoding Strategy Designed For Growth
Architecting a Video Encoding Strategy Designed For GrowthArchitecting a Video Encoding Strategy Designed For Growth
Architecting a Video Encoding Strategy Designed For GrowthZencoder
 
IBC Content Everywhere Hub Presentation: HTML5 And Fastest Encoding
IBC Content Everywhere Hub Presentation: HTML5 And Fastest EncodingIBC Content Everywhere Hub Presentation: HTML5 And Fastest Encoding
IBC Content Everywhere Hub Presentation: HTML5 And Fastest EncodingBitmovin Inc
 
Монетизация сетевой инфраструктуры
Монетизация сетевой инфраструктурыМонетизация сетевой инфраструктуры
Монетизация сетевой инфраструктурыBAKOTECH
 
DISNEY DOES DATA: Data management implications of using animated video as tra...
DISNEY DOES DATA: Data management implications of using animated video as tra...DISNEY DOES DATA: Data management implications of using animated video as tra...
DISNEY DOES DATA: Data management implications of using animated video as tra...Louise Patterton
 
Video performance snowcamp
Video performance snowcampVideo performance snowcamp
Video performance snowcampDoug Sillars
 
NodeJS Edinburgh Video Killed My Data Plan
NodeJS Edinburgh Video Killed My Data PlanNodeJS Edinburgh Video Killed My Data Plan
NodeJS Edinburgh Video Killed My Data PlanDoug Sillars
 
Serverless Media Workflow
Serverless Media WorkflowServerless Media Workflow
Serverless Media WorkflowMooYeol Lee
 
cas_Knowledge_Network
cas_Knowledge_Networkcas_Knowledge_Network
cas_Knowledge_NetworkOliver Eichel
 
Paper id 28201439
Paper id 28201439Paper id 28201439
Paper id 28201439IJRAT
 
The Truth About All-Flash Array Deduplication
The Truth About All-Flash Array DeduplicationThe Truth About All-Flash Array Deduplication
The Truth About All-Flash Array DeduplicationStorage Switzerland
 
Video Killed My Data Plan: Helsinki
Video Killed My Data Plan: HelsinkiVideo Killed My Data Plan: Helsinki
Video Killed My Data Plan: HelsinkiDoug Sillars
 
Adaptive Streaming of Traditional and Omnidirectional Media
Adaptive Streaming of Traditional and Omnidirectional MediaAdaptive Streaming of Traditional and Omnidirectional Media
Adaptive Streaming of Traditional and Omnidirectional MediaAlpen-Adria-Universität
 
IBM VideoCharger and Digital Library MediaBase.doc
IBM VideoCharger and Digital Library MediaBase.docIBM VideoCharger and Digital Library MediaBase.doc
IBM VideoCharger and Digital Library MediaBase.docVideoguy
 
Monitoring whole mpeg transport stream
Monitoring whole mpeg transport streamMonitoring whole mpeg transport stream
Monitoring whole mpeg transport streamVolicon
 
Technology Presentation and Disclosures October 2017
Technology Presentation and Disclosures October 2017Technology Presentation and Disclosures October 2017
Technology Presentation and Disclosures October 2017P. Stephen Lamont
 
FutureComm 2010: Video Quality Analysis and Measurement
FutureComm 2010: Video Quality Analysis and MeasurementFutureComm 2010: Video Quality Analysis and Measurement
FutureComm 2010: Video Quality Analysis and MeasurementRADVISION Ltd.
 
Application Delivery Platform Towards Edge Computing - Bukhary Ikhwan
Application Delivery Platform Towards Edge Computing - Bukhary IkhwanApplication Delivery Platform Towards Edge Computing - Bukhary Ikhwan
Application Delivery Platform Towards Edge Computing - Bukhary IkhwanOpenNebula Project
 

Similar to Real time video copy detection based on hadoop (20)

A Study on FFmpeg Multimedia Framework
A Study on FFmpeg Multimedia FrameworkA Study on FFmpeg Multimedia Framework
A Study on FFmpeg Multimedia Framework
 
Architecting a Video Encoding Strategy Designed For Growth
Architecting a Video Encoding Strategy Designed For GrowthArchitecting a Video Encoding Strategy Designed For Growth
Architecting a Video Encoding Strategy Designed For Growth
 
IBC Content Everywhere Hub Presentation: HTML5 And Fastest Encoding
IBC Content Everywhere Hub Presentation: HTML5 And Fastest EncodingIBC Content Everywhere Hub Presentation: HTML5 And Fastest Encoding
IBC Content Everywhere Hub Presentation: HTML5 And Fastest Encoding
 
Монетизация сетевой инфраструктуры
Монетизация сетевой инфраструктурыМонетизация сетевой инфраструктуры
Монетизация сетевой инфраструктуры
 
DISNEY DOES DATA: Data management implications of using animated video as tra...
DISNEY DOES DATA: Data management implications of using animated video as tra...DISNEY DOES DATA: Data management implications of using animated video as tra...
DISNEY DOES DATA: Data management implications of using animated video as tra...
 
Video performance snowcamp
Video performance snowcampVideo performance snowcamp
Video performance snowcamp
 
060320 mmtf presentation
060320 mmtf presentation060320 mmtf presentation
060320 mmtf presentation
 
NodeJS Edinburgh Video Killed My Data Plan
NodeJS Edinburgh Video Killed My Data PlanNodeJS Edinburgh Video Killed My Data Plan
NodeJS Edinburgh Video Killed My Data Plan
 
Serverless Media Workflow
Serverless Media WorkflowServerless Media Workflow
Serverless Media Workflow
 
cas_Knowledge_Network
cas_Knowledge_Networkcas_Knowledge_Network
cas_Knowledge_Network
 
Paper id 28201439
Paper id 28201439Paper id 28201439
Paper id 28201439
 
The Truth About All-Flash Array Deduplication
The Truth About All-Flash Array DeduplicationThe Truth About All-Flash Array Deduplication
The Truth About All-Flash Array Deduplication
 
Multimedia streaming
Multimedia streamingMultimedia streaming
Multimedia streaming
 
Video Killed My Data Plan: Helsinki
Video Killed My Data Plan: HelsinkiVideo Killed My Data Plan: Helsinki
Video Killed My Data Plan: Helsinki
 
Adaptive Streaming of Traditional and Omnidirectional Media
Adaptive Streaming of Traditional and Omnidirectional MediaAdaptive Streaming of Traditional and Omnidirectional Media
Adaptive Streaming of Traditional and Omnidirectional Media
 
IBM VideoCharger and Digital Library MediaBase.doc
IBM VideoCharger and Digital Library MediaBase.docIBM VideoCharger and Digital Library MediaBase.doc
IBM VideoCharger and Digital Library MediaBase.doc
 
Monitoring whole mpeg transport stream
Monitoring whole mpeg transport streamMonitoring whole mpeg transport stream
Monitoring whole mpeg transport stream
 
Technology Presentation and Disclosures October 2017
Technology Presentation and Disclosures October 2017Technology Presentation and Disclosures October 2017
Technology Presentation and Disclosures October 2017
 
FutureComm 2010: Video Quality Analysis and Measurement
FutureComm 2010: Video Quality Analysis and MeasurementFutureComm 2010: Video Quality Analysis and Measurement
FutureComm 2010: Video Quality Analysis and Measurement
 
Application Delivery Platform Towards Edge Computing - Bukhary Ikhwan
Application Delivery Platform Towards Edge Computing - Bukhary IkhwanApplication Delivery Platform Towards Edge Computing - Bukhary Ikhwan
Application Delivery Platform Towards Edge Computing - Bukhary Ikhwan
 

Recently uploaded

Decarbonising Commercial Real Estate: The Role of Operational Performance
Decarbonising Commercial Real Estate: The Role of Operational PerformanceDecarbonising Commercial Real Estate: The Role of Operational Performance
Decarbonising Commercial Real Estate: The Role of Operational PerformanceIES VE
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxRustici Software
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodJuan lago vázquez
 
Introduction to use of FHIR Documents in ABDM
Introduction to use of FHIR Documents in ABDMIntroduction to use of FHIR Documents in ABDM
Introduction to use of FHIR Documents in ABDMKumar Satyam
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MIND CTI
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistandanishmna97
 
ChatGPT and Beyond - Elevating DevOps Productivity
ChatGPT and Beyond - Elevating DevOps ProductivityChatGPT and Beyond - Elevating DevOps Productivity
ChatGPT and Beyond - Elevating DevOps ProductivityVictorSzoltysek
 
Quantum Leap in Next-Generation Computing
Quantum Leap in Next-Generation ComputingQuantum Leap in Next-Generation Computing
Quantum Leap in Next-Generation ComputingWSO2
 
JohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptxJohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptxJohnPollard37
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityWSO2
 
Less Is More: Utilizing Ballerina to Architect a Cloud Data Platform
Less Is More: Utilizing Ballerina to Architect a Cloud Data PlatformLess Is More: Utilizing Ballerina to Architect a Cloud Data Platform
Less Is More: Utilizing Ballerina to Architect a Cloud Data PlatformWSO2
 
How to Check CNIC Information Online with Pakdata cf
How to Check CNIC Information Online with Pakdata cfHow to Check CNIC Information Online with Pakdata cf
How to Check CNIC Information Online with Pakdata cfdanishmna97
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Jeffrey Haguewood
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Modernizing Legacy Systems Using Ballerina
Modernizing Legacy Systems Using BallerinaModernizing Legacy Systems Using Ballerina
Modernizing Legacy Systems Using BallerinaWSO2
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
AI in Action: Real World Use Cases by Anitaraj
AI in Action: Real World Use Cases by AnitarajAI in Action: Real World Use Cases by Anitaraj
AI in Action: Real World Use Cases by AnitarajAnitaRaj43
 
Navigating Identity and Access Management in the Modern Enterprise
Navigating Identity and Access Management in the Modern EnterpriseNavigating Identity and Access Management in the Modern Enterprise
Navigating Identity and Access Management in the Modern EnterpriseWSO2
 
Choreo: Empowering the Future of Enterprise Software Engineering
Choreo: Empowering the Future of Enterprise Software EngineeringChoreo: Empowering the Future of Enterprise Software Engineering
Choreo: Empowering the Future of Enterprise Software EngineeringWSO2
 

Recently uploaded (20)

Decarbonising Commercial Real Estate: The Role of Operational Performance
Decarbonising Commercial Real Estate: The Role of Operational PerformanceDecarbonising Commercial Real Estate: The Role of Operational Performance
Decarbonising Commercial Real Estate: The Role of Operational Performance
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Introduction to use of FHIR Documents in ABDM
Introduction to use of FHIR Documents in ABDMIntroduction to use of FHIR Documents in ABDM
Introduction to use of FHIR Documents in ABDM
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
ChatGPT and Beyond - Elevating DevOps Productivity
ChatGPT and Beyond - Elevating DevOps ProductivityChatGPT and Beyond - Elevating DevOps Productivity
ChatGPT and Beyond - Elevating DevOps Productivity
 
Quantum Leap in Next-Generation Computing
Quantum Leap in Next-Generation ComputingQuantum Leap in Next-Generation Computing
Quantum Leap in Next-Generation Computing
 
JohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptxJohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptx
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital Adaptability
 
Less Is More: Utilizing Ballerina to Architect a Cloud Data Platform
Less Is More: Utilizing Ballerina to Architect a Cloud Data PlatformLess Is More: Utilizing Ballerina to Architect a Cloud Data Platform
Less Is More: Utilizing Ballerina to Architect a Cloud Data Platform
 
How to Check CNIC Information Online with Pakdata cf
How to Check CNIC Information Online with Pakdata cfHow to Check CNIC Information Online with Pakdata cf
How to Check CNIC Information Online with Pakdata cf
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering Developers
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Modernizing Legacy Systems Using Ballerina
Modernizing Legacy Systems Using BallerinaModernizing Legacy Systems Using Ballerina
Modernizing Legacy Systems Using Ballerina
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
AI in Action: Real World Use Cases by Anitaraj
AI in Action: Real World Use Cases by AnitarajAI in Action: Real World Use Cases by Anitaraj
AI in Action: Real World Use Cases by Anitaraj
 
Navigating Identity and Access Management in the Modern Enterprise
Navigating Identity and Access Management in the Modern EnterpriseNavigating Identity and Access Management in the Modern Enterprise
Navigating Identity and Access Management in the Modern Enterprise
 
Choreo: Empowering the Future of Enterprise Software Engineering
Choreo: Empowering the Future of Enterprise Software EngineeringChoreo: Empowering the Future of Enterprise Software Engineering
Choreo: Empowering the Future of Enterprise Software Engineering
 

Real time video copy detection based on hadoop

  • 1. Real-time Video Copy Detection Based on Hadoop Hardik Parmar Sanket Thakur Pranav Sangam Sachin Tripathi
  • 2. ABSTRACT: With the development of multimedia technology and Internet, the amount of videos in the Internet is increasing quickly. Among the large amount of videos in the Internet, a considerable number of them are copies of original videos, which are simply revised versions of the original ones.
  • 3. Introduction • Introduction to Video Copy Detection Due to rapid development of multimedia hardware and software technologies, the cost of image and video data collection, creation, and storage is becoming low. Among these huge volumes of videos, there exist large numbers of copies.
  • 4. Introduction to Hadoop Platform • Hadoop was developed by the Apache Foundation. IT consists of map reduce model. • MapReduce is the programming model of Hadoop which includes Map function and Reduce function.
  • 5. Proposed System We propose a video copy detection using method based on Brightness sequence and the method based on TIRI-DCT algorithm. High accuracy in locating copies. Very reliable for detecting copied videos.
  • 6. Advantages Of Proposed System 1. The performance on detecting copies from large data set is satisfactory and Hadoop platform can significantly improve the efficiency of video copy detection. 2. The proposed system high fault tolerance, high throughput, easy scalability and etc. 3. The measurement of copy detections performance System making result is faster than existing system. 4. The proposed system video hashing algorithm methods has strong robustness, high distinction, high compactness and low complexity.
  • 7. Flow Chart : Yes NO YES NO Start FFMPEG Transcoding Close Upload/ Querying Video Process Calculate images hash value Convert video into pictures in the form of frames Create hash value library Calculate distance between hash values Match hash value Enter Username and Password FFMPEG Transcoding Training Videos Calculate images hash value Convert video into pictures in the form of frames Stored in HDFS Create hash value library Database Distance value < Threshold value Detection of video copy & Reject video Upload into database
  • 8. Hardware and Software requirements • Hardware: Processor: Pentium 4 RAM: 4GB or more Hard disk: 16 GB or more • Software Specification: Windows Operating System. Eclipse NetBeans Java Apache Tomcat Server MySQL Hadoop
  • 9. Applications • Using the same scenario of system, we can implement following mobile applications: 1. For Government Services Advertisement Video. 2. For Online Shopping System Advertisement Video. 3. For Video Upload in private own cloud.
  • 10. CONCLUSION • In this paper, two video copy detection methods, the method based on brightness sequence and the method based on TIRI-DCT are implemented and the recalls and precisions of the two methods with different video numbers and different thresholds are analyzed. • The algorithms are implemented on Hadoop distributed computing platform and the efficiencies are compared in different video amounts and different map amounts.

Editor's Notes

  1. 1