We present a novel solution to the MediaEval 2014 Event
Synchronization Task: Synchronization of Multi-User Event
Media (SEM). The framework is based on a probabilistic
graphical model. Thanks to the simple topology of the
graph, the estimation of the true temporal displacement
among multiple photo collections can be performed eciently
through exact inference. The underlying tness function is
dened in a exible way, for which it is possible to integrate
easily new information (e.g., text tags or social network
data). The exibility makes the framework suitable and
adaptable to cope with many real situations. The method
is evaluated on two datasets obtaining an overall accuracy
of more than 85% in both cases
5. Stereo Matching: Model
Observed node = pixel intensity
Latent node = possible disparity for a
given node
Potentials associated to each link in
the network
Inference algorithm
Disparity
6. Stereo Matching vs. Photo Galleries
Synchronization
• Pixels Photos
• Pixel Intensitities Image features (Timestamp, GPS, SURF, color
histograms)
• States: possible diparities States: possible offsets
21. Results: Synchronization
Dataset Precision Accuracy
Vancouver 0.35 0.86
London 0.25 0.89
• Good performance in terms of accuracy
22. Results: Synchronization
Dataset Precision Accuracy
Vancouver 0.35 0.86
London 0.25 0.89
• Good performance in terms of accuracy
• Only 1/4 galleries correctly synchronized
23. Results: Clustering
Dataset Run Features Jaccard Index
Vancouver
1 CSD-64+LBP3x3 0.1673
2 CSD-6 0.1382
3 CSD-6+TIME 0.1315
London
1 CSD-64+LBP3x3 0.1287
2 CSD-6 0.0742
3 CSD-6+TIME 0.0885
24. Results: Clustering
Dataset Run Features Jaccard Index
Vancouver
1 CSD-64+LBP3x3 0.1673
2 CSD-6 0.1382
3 CSD-6+TIME 0.1315
London
1 CSD-64+LBP3x3 0.1287
2 CSD-6 0.0742
3 CSD-6+TIME 0.0885
• Performance are decreased by using only color descriptors
25. Results: Clustering
Dataset Run Features Jaccard Index
Vancouver
1 CSD-64+LBP3x3 0.1673
2 CSD-6 0.1382
3 CSD-6+TIME 0.1315
London
1 CSD-64+LBP3x3 0.1287
2 CSD-6 0.0742
3 CSD-6+TIME 0.0885
• Performance are decreased by using only color descriptors
• In the Vancouver dataset time doesn’t help in clustering
26. Results: Clustering
Dataset Run Features Jaccard Index
Vancouver
1 CSD-64+LBP3x3 0.1673
2 CSD-6 0.1382
3 CSD-6+TIME 0.1315
London
1 CSD-64+LBP3x3 0.1287
2 CSD-6 0.0742
3 CSD-6+TIME 0.0885
• Performance are decreased by using only color descriptors
• In the Vancouver dataset time doesn’t help in clustering
• In the London dataset time becomes a more reliable feature
29. Lessons Learned
Synchronization:
Only 1/4 galleries correctly synchronized
• new features (e.g. text tags, social network data)
• “no association” between a pair of images
30. Lessons Learned
Synchronization:
Only 1/4 galleries correctly synchronized
• new features (e.g. text tags, social network data)
• “no association” between a pair of images
Clustering:
Time is important!