SlideShare a Scribd company logo
1 of 15
Naive Soul Guardian
Bloody Scenes Detection
with Deep Convolutional Neural Network
B99902080 李冠穎
R03944007 張人尹
Outline
● Motivation
● System Overview
o Convolutional Neural Network
o Fully-Convolutional Net
o Pixelation
● Experiment
● Future Work
● Reference
● Demo 1
Motivation
● Lots of videos contain bloody scenes, we want to
protect kids from these inappropriate scenes
● Our system aims to detect and pixelate bloody
scenes automatically
2
Motivation
● Lots of videos contain bloody scenes, we want to
protect kids from these inappropriate scenes
● Our system aims to detect and pixelate bloody
scenes automatically
3
System Overview
4
Videos
Frames
Pixelated frames
Ignored frames
Pixelated videos
Decode
Encode
0
1
Convolutional Neural Network
● Fine-tune pre-trained CaffeNet(ImageNet)
o Human-labeled frames without bounding box
● Predict decoded frames
o Background(0) ignored frames
o Bloody frame(1) fully-convolutional net
5
Fully-Convolutional Net
● Classification for each 227 × 227 box with stride
32 on 451 x 451 image
● Generate a 8 x 8 classification map
o Interpolate probabilities to obtain heat map
6
Fully-Convolutional
Net
Pixelation
● Resize heat map to frame size
● Base on heat map, blur frames by Gaussian filter
7
Experiment (I)
● Run on cml21
● Decoding/Encoding done by FFmpeg
● Decoded frames as training/validation data
o Pos = Segments from Saw 1, 2, 3, 7, Final Destination 4,
5…… + Crawled images from google images
o Neg = Segments from The Big Bang Theory S8E11…… +
Part of ILSVRC 2013 val/test
o Random sample Pos : Neg = 2500 : 2500
8
Experiment (II)
● Classification Accuracy
o 73.46%
9
Experiment (III)
● Time(sec) of Processing a video clip
10
Decoding Classification Heat map Pixelation Encoding Average
time
Saw6
(139 frames,720x404)
0.34 41.18 22.99 72.43 0.02 0.99
sec/frame
CWL
(109 frames,1280x720)
0.79 36.95 0 0 1.24 0.36
sec/frame
FD5
(121 frames,1024x576)
0.44 36.23 3.87 28.72 0.81 0.58
sec/frame
Future Work
● Train our model with more diverse data to
increase accuracy and reduce false-positive
● Accelerate blurring and smooth boundaries
● Implement on surveillance camera for security
● Combine shot detection and motion vector to
reduce computation
11
Reference
● Caffe | Deep Learning Framework
○ http://caffe.berkeleyvision.org/
○ Classifying ImageNet: the instant Caffe way
○ Net Surgery for a Fully-Convolutional Model
● FFmpeg
○ https://www.ffmpeg.org/
● ImageNet
○ http://www.image-net.org/
● Tutorials by Hsinfu, Shiro, Jocelyn
12
Demo
13
Finally,
I wanna play a…
Q & A game
14

More Related Content

Viewers also liked

Viewers also liked (8)

Sorting nr
Sorting nrSorting nr
Sorting nr
 
Juvenile delinquency
Juvenile delinquencyJuvenile delinquency
Juvenile delinquency
 
Assessment Workshop
Assessment WorkshopAssessment Workshop
Assessment Workshop
 
Mix. 1 a
Mix. 1 aMix. 1 a
Mix. 1 a
 
Falta de seguridad social en las empresas de confecciones.sh
Falta de seguridad social en las empresas de confecciones.shFalta de seguridad social en las empresas de confecciones.sh
Falta de seguridad social en las empresas de confecciones.sh
 
Fairtrade Asia-Pacific, 2016
Fairtrade Asia-Pacific, 2016Fairtrade Asia-Pacific, 2016
Fairtrade Asia-Pacific, 2016
 
Concurso Pitch Sesi Acelera Startups 2016
Concurso Pitch Sesi  Acelera Startups 2016  Concurso Pitch Sesi  Acelera Startups 2016
Concurso Pitch Sesi Acelera Startups 2016
 
Infographic Design Tools & Tips
Infographic Design Tools & TipsInfographic Design Tools & Tips
Infographic Design Tools & Tips
 

Similar to Mmai 2014 final

Relevance-Based Compression of Cataract Surgery Videos Using Convolutional Ne...
Relevance-Based Compression of Cataract Surgery Videos Using Convolutional Ne...Relevance-Based Compression of Cataract Surgery Videos Using Convolutional Ne...
Relevance-Based Compression of Cataract Surgery Videos Using Convolutional Ne...
Alpen-Adria-Universität
 
Simulator-based Explanation and Debugging of Hazard-triggering Events in DNN-...
Simulator-based Explanation and Debugging of Hazard-triggering Events in DNN-...Simulator-based Explanation and Debugging of Hazard-triggering Events in DNN-...
Simulator-based Explanation and Debugging of Hazard-triggering Events in DNN-...
Lionel Briand
 

Similar to Mmai 2014 final (20)

StoryVisualization using StoryGAN implemented by pytorch on Pororro Dataset​....
StoryVisualization using StoryGAN implemented by pytorch on Pororro Dataset​....StoryVisualization using StoryGAN implemented by pytorch on Pororro Dataset​....
StoryVisualization using StoryGAN implemented by pytorch on Pororro Dataset​....
 
Salt Identification Challenge
Salt Identification ChallengeSalt Identification Challenge
Salt Identification Challenge
 
Embryo selection using AI
Embryo selection using AIEmbryo selection using AI
Embryo selection using AI
 
MIT's experience on OpenPOWER/POWER 9 platform
MIT's experience on OpenPOWER/POWER 9 platformMIT's experience on OpenPOWER/POWER 9 platform
MIT's experience on OpenPOWER/POWER 9 platform
 
A real time automatic eye tracking system for ophthalmology
A real time automatic eye tracking system for ophthalmologyA real time automatic eye tracking system for ophthalmology
A real time automatic eye tracking system for ophthalmology
 
Convolutional Neural Networks for Computer vision Applications
Convolutional Neural Networks for Computer vision ApplicationsConvolutional Neural Networks for Computer vision Applications
Convolutional Neural Networks for Computer vision Applications
 
Ultrasound Nerve Segmentation
Ultrasound Nerve Segmentation Ultrasound Nerve Segmentation
Ultrasound Nerve Segmentation
 
Getting your hands dirty with deep learning in java
Getting your hands dirty with deep learning in javaGetting your hands dirty with deep learning in java
Getting your hands dirty with deep learning in java
 
2018AOI論壇_深度學習於表面瑕疪檢測_元智大學蔡篤銘
2018AOI論壇_深度學習於表面瑕疪檢測_元智大學蔡篤銘2018AOI論壇_深度學習於表面瑕疪檢測_元智大學蔡篤銘
2018AOI論壇_深度學習於表面瑕疪檢測_元智大學蔡篤銘
 
Video Classification: Human Action Recognition on HMDB-51 dataset
Video Classification: Human Action Recognition on HMDB-51 datasetVideo Classification: Human Action Recognition on HMDB-51 dataset
Video Classification: Human Action Recognition on HMDB-51 dataset
 
NMSL_2017summer
NMSL_2017summerNMSL_2017summer
NMSL_2017summer
 
Object detection with deep learning
Object detection with deep learningObject detection with deep learning
Object detection with deep learning
 
Scalawox deeplearning
Scalawox deeplearningScalawox deeplearning
Scalawox deeplearning
 
Resolution Independent 2D Cartoon Video Conversion
Resolution Independent 2D Cartoon Video ConversionResolution Independent 2D Cartoon Video Conversion
Resolution Independent 2D Cartoon Video Conversion
 
Deep Learning for Computer Vision - PyconDE 2017
Deep Learning for Computer Vision - PyconDE 2017Deep Learning for Computer Vision - PyconDE 2017
Deep Learning for Computer Vision - PyconDE 2017
 
Team16_Narayana_InstanceSegmentation.pptx
Team16_Narayana_InstanceSegmentation.pptxTeam16_Narayana_InstanceSegmentation.pptx
Team16_Narayana_InstanceSegmentation.pptx
 
Unsupervised Video Anomaly Detection: A brief overview
Unsupervised Video Anomaly Detection: A brief overviewUnsupervised Video Anomaly Detection: A brief overview
Unsupervised Video Anomaly Detection: A brief overview
 
ANN Slides.pdf
ANN Slides.pdfANN Slides.pdf
ANN Slides.pdf
 
Relevance-Based Compression of Cataract Surgery Videos Using Convolutional Ne...
Relevance-Based Compression of Cataract Surgery Videos Using Convolutional Ne...Relevance-Based Compression of Cataract Surgery Videos Using Convolutional Ne...
Relevance-Based Compression of Cataract Surgery Videos Using Convolutional Ne...
 
Simulator-based Explanation and Debugging of Hazard-triggering Events in DNN-...
Simulator-based Explanation and Debugging of Hazard-triggering Events in DNN-...Simulator-based Explanation and Debugging of Hazard-triggering Events in DNN-...
Simulator-based Explanation and Debugging of Hazard-triggering Events in DNN-...
 

Recently uploaded

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
giselly40
 

Recently uploaded (20)

[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 

Mmai 2014 final

  • 1. Naive Soul Guardian Bloody Scenes Detection with Deep Convolutional Neural Network B99902080 李冠穎 R03944007 張人尹
  • 2. Outline ● Motivation ● System Overview o Convolutional Neural Network o Fully-Convolutional Net o Pixelation ● Experiment ● Future Work ● Reference ● Demo 1
  • 3. Motivation ● Lots of videos contain bloody scenes, we want to protect kids from these inappropriate scenes ● Our system aims to detect and pixelate bloody scenes automatically 2
  • 4. Motivation ● Lots of videos contain bloody scenes, we want to protect kids from these inappropriate scenes ● Our system aims to detect and pixelate bloody scenes automatically 3
  • 5. System Overview 4 Videos Frames Pixelated frames Ignored frames Pixelated videos Decode Encode 0 1
  • 6. Convolutional Neural Network ● Fine-tune pre-trained CaffeNet(ImageNet) o Human-labeled frames without bounding box ● Predict decoded frames o Background(0) ignored frames o Bloody frame(1) fully-convolutional net 5
  • 7. Fully-Convolutional Net ● Classification for each 227 × 227 box with stride 32 on 451 x 451 image ● Generate a 8 x 8 classification map o Interpolate probabilities to obtain heat map 6 Fully-Convolutional Net
  • 8. Pixelation ● Resize heat map to frame size ● Base on heat map, blur frames by Gaussian filter 7
  • 9. Experiment (I) ● Run on cml21 ● Decoding/Encoding done by FFmpeg ● Decoded frames as training/validation data o Pos = Segments from Saw 1, 2, 3, 7, Final Destination 4, 5…… + Crawled images from google images o Neg = Segments from The Big Bang Theory S8E11…… + Part of ILSVRC 2013 val/test o Random sample Pos : Neg = 2500 : 2500 8
  • 10. Experiment (II) ● Classification Accuracy o 73.46% 9
  • 11. Experiment (III) ● Time(sec) of Processing a video clip 10 Decoding Classification Heat map Pixelation Encoding Average time Saw6 (139 frames,720x404) 0.34 41.18 22.99 72.43 0.02 0.99 sec/frame CWL (109 frames,1280x720) 0.79 36.95 0 0 1.24 0.36 sec/frame FD5 (121 frames,1024x576) 0.44 36.23 3.87 28.72 0.81 0.58 sec/frame
  • 12. Future Work ● Train our model with more diverse data to increase accuracy and reduce false-positive ● Accelerate blurring and smooth boundaries ● Implement on surveillance camera for security ● Combine shot detection and motion vector to reduce computation 11
  • 13. Reference ● Caffe | Deep Learning Framework ○ http://caffe.berkeleyvision.org/ ○ Classifying ImageNet: the instant Caffe way ○ Net Surgery for a Fully-Convolutional Model ● FFmpeg ○ https://www.ffmpeg.org/ ● ImageNet ○ http://www.image-net.org/ ● Tutorials by Hsinfu, Shiro, Jocelyn 12
  • 15. Finally, I wanna play a… Q & A game 14

Editor's Notes

  1. Schedule guidance
  2. Test time