Deep learning features and similarity of movies based on their video content

Deep learning features and
similarity of movies based on their
video content
Summer Camp - Show Max - Lukáš Lopatovský

Assignment
● Deep learning allows extracting useful features from video
frames. Your task is to apply new deep learning frameworks
to extract features from video frames of selected movies
available in the ShowMax streaming platform.
● Goals:
● Extract deep features from video frames. Explore similar
movies in the space of latent features and adjust the
extraction process in order to create clusters of video assets
(eg. TV episodes).

Residual Networks
● Enable to build deeper (convolutional) neural
network. (State of the art method for the image
recognition.)

Residual Networks
● To enable to build deeper network, the residual
nets use the simple trick. They maintain the
residuum from the previous layer ( so do not
loose the previously known information )

Torch
- Efficient Tensor library (like NumPy) with an
efficient CUDA backend
- Neural Networks package -- build arbitrary
acyclic computation graphs with automatic
differentiation
- fast CUDA and CPU backends
- Good community and industry support - several
hundred community-built and maintained
packages.

● Torch example of ‘nn’ library

What has been done
● The movies were classified using arbitrary
number of picture frames.
● We have used already trained ImageNet FB-
resnet network and own data set trained and
fine-tuned networks to classify movies.
● To detect the object in the image, we have
classify the whole image, as well as we have
made the various crops to get more accurate
predictions. (Cropping showed better results)

Classification output
● By classification of frames in the movie, the
special file is produced (.res). It is in a form to
contain all the important data. It can be later post-
process according to the special needs of the user:
- To create Object detection .srt file.
- To get various cumulative classification results.
- To trace the appearance of the object at the time-
line.

Own datasets
● The network was successfully trained and fine-tuned
from the ResNet network
● However, it showed some problems based from
improper dataset.
- Some categories contain many irrelevant pictures in second half
of the search. (Special case: “The doctor House”)
- The style of the images in the search is often very different to the
style found in the movie. (kitchen, car)
- Movies mostly contain images full of people, so the categories
containing people make false positive prediction. (cinema, theater)

Classification example
⊙ω⊙

Next step
● After the discussion in the company, the
programs were transformed to the easily usable
form.
● The feature vectors of the classification will be
used to find similarities among movies.
Compared to the existing algorithms and if
successful, incorporated into the current
recommendation system.

Deep learning features and similarity of movies based on their video content

Recommended

Recommended

More Related Content

Similar to Deep learning features and similarity of movies based on their video content

Similar to Deep learning features and similarity of movies based on their video content (20)

More from Showmax Engineering

More from Showmax Engineering (6)

Recently uploaded

Recently uploaded (20)

Deep learning features and similarity of movies based on their video content