PROJECT REPORT

•Download as DOCX, PDF•

0 likes•32 views

pmm330

PROJECT REPORT
-FRAUD DETECTION
Problem Statement: The main objective of this project is to prevent a fraudster from applying
loan twice in a particular bank. When an image comes from the user registration page the
algorithm will check whether the image already exists in the bank’s database or not. If a match is
found a warning message will be displayed to the bank manager who can take appropiate action
based on the details of that particular user.
Dataset Description: The dataset consist of frontal-face of a person’s image. Since the problem
is related to a bank we can safely assume that all the images will be of a good quality(image will
not be of mobile camera quality) and the person will be facing the camera.
Pre-processing: All the images were resized to a common size and then converted to gray-scale
to remove the remove high frequencies and detail.
Model developement: Using OpenCV haarcascade-frontal face classifier we first detect the face
of a person from an image and crop it out and store it in the local filesystem. After this PCA was
used for image re-construction. This step was necessary as it will further remove the noise that
will be present in the image after being converted to gray-scale. After this 2 similarity measures
were used to check for image duplication
a. dHash Algorithm: dHash is a perceptual hashing algorithm. A perceptual
hashing algorithm that takes a fingerprint of a multimedia file by deriving it from
various features from its content so it can take into account transformations on a
given input and yet be flexible enough to distinguish between dissimilar files.
dHash (difference hashing) algorithm computes the difference in brightness
between adjacent pixels, identifying the relative gradient direction. This algorithm
is very fast and we use this algorithm to filter out those images which differ vastly
from the input image.
b. Structural Similarity Index(SSIM): The structural similarity (SSIM) index is a
method for predicting the perceived quality of digital television and cinematic
pictures, as well as other kinds of digital images and videos. SSIM is used for
measuring the similarity between two images. The SSIM index is a full reference
metric; in other words, the measurement or prediction of image quality is based
on an initial uncompressed or distortion-free image as reference. SSIM is
designed to improve on traditional methods such as peak signal-to-noise ratio
(PSNR) and mean squared error (MSE), which have proven to be inconsistent
with human visual perception. Based on this measure we calculate our final
accuracy.

Output: our final output displays the proper message to the bank manager based on whether
fraud has been detected or not. Based on the output the bank manager will take appropiate action.
If fraud has been detected on the output window will display the image of the fraudster and
his/her matching image from the bank’s database and the customer id no.
Accuracy of the model: Model accuracy is around 61 %
Drawbacks:
1. The model does not give accurate results if the image quality is poor as OpenCV fails to
detect the face from an image.(eg . if the photo is taken from a mobile phone)
2. The model gives poor results if the user is not facing the camera or standing far away
from the camera.
Tools Used:
1. Python
2. OpenCV for Image Processing
3. Apache Spark for scaling up the model
4. Bottle Framework for Integrating with Java
5. Java for frontend development
References:
1. Wikipedia
2. Github
3. http://blog.iconfinder.com/detecting-duplicate-images-using-python/
4. Apache Spark Documentation
5. Stackoverflow

What's hot

Disease prediction using machine learningJinishaKG

Anomaly Detection and Spark Implementation - Meetup Presentation.pptxImpetus Technologies

Fraud Detection with Ensemble Learning TechniqueFrancesca Pappalardo

Machine learning - AIWitekio

IRJET - Fake Currency Detection using CNNIRJET Journal

Machine LearningRabab Munawar

Anomaly Detection - Real World Scenarios, Approaches and Live ImplementationImpetus Technologies

Modex Talks - AI Conceptual OverviewModex

Fraud detection MLMaatougSelim

Machine Learning Algorithm & Anomaly detection 2021Chakrit Phain

Deep neural networks and tabular dataJimmyLiang20

Human or Intelligent Machine?ameyakulk

Emerging technologies enabling in fraud detectionUmasree Raghunath

Hand dominant data classification based on smartphone sensitivityAzriidros

Intro to Machine Learning & AIMostafa Elsheikh

Basics of machine learningGopal Verma

Building Intelligent Data ProductsStephen Whitworth

What's hot (17)

Disease prediction using machine learning

Anomaly Detection and Spark Implementation - Meetup Presentation.pptx

Fraud Detection with Ensemble Learning Technique

Machine learning - AI

IRJET - Fake Currency Detection using CNN

Machine Learning

Anomaly Detection - Real World Scenarios, Approaches and Live Implementation

Modex Talks - AI Conceptual Overview

Fraud detection ML

Machine Learning Algorithm & Anomaly detection 2021

Deep neural networks and tabular data

Human or Intelligent Machine?

Emerging technologies enabling in fraud detection

Hand dominant data classification based on smartphone sensitivity

Intro to Machine Learning & AI

Basics of machine learning

Building Intelligent Data Products

Similar to PROJECT REPORT

Driver drowsiness and lane detection screenshotsVenkat Projects

IRJET- Advance Driver Assistance System using Artificial IntelligenceIRJET Journal

IRJET- Face Detection and Recognition using OpenCVIRJET Journal

IRJET - A Review on Face Recognition using Deep Learning AlgorithmIRJET Journal

IRJET- Smart Surveillance Cam using Face Recongition AlogrithmIRJET Journal

Fake Multi Biometric Detection using Image Quality Assessmentijsrd.com

Image recognition technology (Medical Presentation)saravanan guru

Smart Doorbell System Based on Face RecognitionIRJET Journal

IRJET - Visual Enhancement of E-Commerce ProductsIRJET Journal

IRJET- Facial Expression Recognition using GPA AnalysisIRJET Journal

IRJET- Credit Card Authentication using Facial RecognitionIRJET Journal

Pattern recognition 3d face recognitionMazin Alwaaly

Project on fake currency recognition using image processing ppt final (3).pptx426SahithiBaiMiriska

Review A DCNN APPROACH FOR REAL TIME UNCONSTRAINED FACE.pptxAravindHari22

Facial recognition technology by vaibhavVaibhav P

Face Recognition Based on Image Processing in an Advanced Robotic SystemIRJET Journal

Smart Bank Locker Access System Using Iris ,Fingerprints,Face Recognization A...IJERA Editor

foot crime final for awareness site .pptxImmanImman6

IRJET - Emotion Recognising System-Crowd Behavior AnalysisIRJET Journal

Computer Vision - Real Time Face Recognition using Open CV and PythonAkash Satamkar

Similar to PROJECT REPORT (20)

Driver drowsiness and lane detection screenshots

IRJET- Advance Driver Assistance System using Artificial Intelligence

IRJET- Face Detection and Recognition using OpenCV

IRJET - A Review on Face Recognition using Deep Learning Algorithm

IRJET- Smart Surveillance Cam using Face Recongition Alogrithm

Fake Multi Biometric Detection using Image Quality Assessment

Image recognition technology (Medical Presentation)

Smart Doorbell System Based on Face Recognition

IRJET - Visual Enhancement of E-Commerce Products

IRJET- Facial Expression Recognition using GPA Analysis

IRJET- Credit Card Authentication using Facial Recognition

Pattern recognition 3d face recognition

Project on fake currency recognition using image processing ppt final (3).pptx

Review A DCNN APPROACH FOR REAL TIME UNCONSTRAINED FACE.pptx

Facial recognition technology by vaibhav

Face Recognition Based on Image Processing in an Advanced Robotic System

Smart Bank Locker Access System Using Iris ,Fingerprints,Face Recognization A...

foot crime final for awareness site .pptx

IRJET - Emotion Recognising System-Crowd Behavior Analysis

Computer Vision - Real Time Face Recognition using Open CV and Python

PROJECT REPORT

1. PROJECT REPORT -FRAUD DETECTION Problem Statement: The main objective of this project is to prevent a fraudster from applying loan twice in a particular bank. When an image comes from the user registration page the algorithm will check whether the image already exists in the bank’s database or not. If a match is found a warning message will be displayed to the bank manager who can take appropiate action based on the details of that particular user. Dataset Description: The dataset consist of frontal-face of a person’s image. Since the problem is related to a bank we can safely assume that all the images will be of a good quality(image will not be of mobile camera quality) and the person will be facing the camera. Pre-processing: All the images were resized to a common size and then converted to gray-scale to remove the remove high frequencies and detail. Model developement: Using OpenCV haarcascade-frontal face classifier we first detect the face of a person from an image and crop it out and store it in the local filesystem. After this PCA was used for image re-construction. This step was necessary as it will further remove the noise that will be present in the image after being converted to gray-scale. After this 2 similarity measures were used to check for image duplication a. dHash Algorithm: dHash is a perceptual hashing algorithm. A perceptual hashing algorithm that takes a fingerprint of a multimedia file by deriving it from various features from its content so it can take into account transformations on a given input and yet be flexible enough to distinguish between dissimilar files. dHash (difference hashing) algorithm computes the difference in brightness between adjacent pixels, identifying the relative gradient direction. This algorithm is very fast and we use this algorithm to filter out those images which differ vastly from the input image. b. Structural Similarity Index(SSIM): The structural similarity (SSIM) index is a method for predicting the perceived quality of digital television and cinematic pictures, as well as other kinds of digital images and videos. SSIM is used for measuring the similarity between two images. The SSIM index is a full reference metric; in other words, the measurement or prediction of image quality is based on an initial uncompressed or distortion-free image as reference. SSIM is designed to improve on traditional methods such as peak signal-to-noise ratio (PSNR) and mean squared error (MSE), which have proven to be inconsistent with human visual perception. Based on this measure we calculate our final accuracy.

2. Output: our final output displays the proper message to the bank manager based on whether fraud has been detected or not. Based on the output the bank manager will take appropiate action. If fraud has been detected on the output window will display the image of the fraudster and his/her matching image from the bank’s database and the customer id no. Accuracy of the model: Model accuracy is around 61 % Drawbacks: 1. The model does not give accurate results if the image quality is poor as OpenCV fails to detect the face from an image.(eg . if the photo is taken from a mobile phone) 2. The model gives poor results if the user is not facing the camera or standing far away from the camera. Tools Used: 1. Python 2. OpenCV for Image Processing 3. Apache Spark for scaling up the model 4. Bottle Framework for Integrating with Java 5. Java for frontend development References: 1. Wikipedia 2. Github 3. http://blog.iconfinder.com/detecting-duplicate-images-using-python/ 4. Apache Spark Documentation 5. Stackoverflow

PROJECT REPORT

Recommended

Recommended

More Related Content

What's hot

What's hot (17)

Similar to PROJECT REPORT

Similar to PROJECT REPORT (20)

PROJECT REPORT