Blind Verification of Digital Image Originality: A Statistical Approach

Blind Veriﬁcation of Digital Image Originality:
A Statistical Approach
Babak Mahdian, Radim Nedbal, and Stanislav Saic
Sibelius Seraphini
CSI 445 - Digital Image Forensics
Sibelius Seraphini (CSI 445) Digital Image Forensics 1 / 11

Introduction
Trustworthiness of digital images is essential for many areas
forensic investigation
criminal investigation
journalism

Introduction
journalism
One possible approach to check image integrity
extract image features
compare these features with a reference set

Introduction
journalism
One possible approach to check image integrity
extract image features
compare these features with a reference set
Problem of this approach
reference sets for veriﬁcation of digital image integrity
collected from unknown environments

Problem addressed in this paper
Given a database consisting of “unguaranteed” images;
How to identify which images are original from the camera and which
have been modiﬁed by software ?

Related Work
Active Approaches
data hiding
digital signatures

Related Work
Active Approaches
data hiding
digital signatures
Blind Methods
image splicing
color ﬁlter array interpolation
geometric transformations
cloning
computer graphics generated photos
JPEG compression inconsistencies
ﬁngerprint based

Basic Notation
Digital Images
pixel data
metadata

Basic Notation
Digital Images
pixel data
metadata
Camera ID vector (−→cm)
maker
model

Basic Notation
Digital Images
pixel data
metadata
Camera ID vector (−→cm)
maker
model
Fingerprint vector (
−→
θ )
quantization table
thumbnail

Reference Data Set - S
S = Cm × Θ × U

S = Cm × Θ × U
Cm: contains all the camera ID vectors
Θ: contains all possible ﬁngerprints vectors
U: contains all users ID.

S = Cm × Θ × U
Cm: contains all the camera ID vectors
Θ: contains all possible ﬁngerprints vectors
U: contains all users ID.
−→cm,
−→
θ , u
u has taken the photo
−→cm is the camera used
−→
θ the left ﬁngerprint vector

A Statistical Approach for Noise Removal
“Testing” tuple
t0 = −−→cm0,
−→
θ0

“Testing” tuple
t0 = −−→cm0,
−→
θ0
Null hypothesis
H0 :
−→
θ0 can’t be a ﬁngerprint of −−→cm0

“Testing” tuple
t0 = −−→cm0,
−→
θ0
Null hypothesis
H0 :
−→
Test statistic
T −−→cm0,
−→
θ0 = u| −−→cm0,
−→
θ0, u ∈ S
hypergeometric distribution

“Testing” tuple
t0 = −−→cm0,
−→
θ0
Null hypothesis
H0 :
−→
Test statistic
T −−→cm0,
−→
θ0 = u| −−→cm0,
−→
θ0, u ∈ S
hypergeometric distribution
Rejecting H0
if T is too big and greater than a threshold

Experimental Results
Proposed ﬁngerprints
FMarkers - EXIF Markers
FQTs - luminance and chrominance quantization tables
FThumb - information on the JPEG thumbnail image

Reference Image Data Set
5 million images of Flickr
Ground-truth data - 2400 images (24 cameras, 100 digital images each)

Fingerprints not in the reference image data set have been removed

Fingerprints not in the reference image data set have been removed
Worked well to check the digital images integrity

Conclusions
image ﬁngerprints are useful to identify the image originality

Conclusions
this paper provides a statistical approach to handle information noise
in a “unguaranted” databases of images

Conclusions
this paper provides a statistical approach to handle information noise
in a “unguaranted” databases of images
positive results in identiﬁcation of original images

Discussions
strength

Discussions
strength
can identify with which camera a photo was taken

Discussions
strength
check the integrity of a database of images instead of just one image
per time

Discussions
strength
per time
provide a conﬁdence value of how likely is a image original or modiﬁed

Discussions
strength
per time
weakness

Discussions
strength
per time
weakness
can only be applied to database of users that took the picture

Discussions
strength
per time
weakness
extracting camera ID vector and image ﬁngerprint could be misleading

Discussions
strength
per time
weakness
cannot handle ﬁngerprints that are not in the reference set

Discussions
strength
per time
weakness
a reference data set with ground-truth information is needed to
validate the image originality

Discussions
strength
per time
weakness
improvements

Discussions
strength
per time
weakness
improvements
employ another blind veriﬁcation method to obtain the ground-truth
information

Discussions
strength
per time
weakness
improvements
employ another blind veriﬁcation method to obtain the ground-truth
information
extract the image ﬁngerprint from the pixel data

Questions ?

Blind Verification of Digital Image Originality: A Statistical Approach

More Related Content

Similar to Blind Verification of Digital Image Originality: A Statistical Approach

More from Lar21

Recently uploaded

Blind Verification of Digital Image Originality: A Statistical Approach