SlideShare a Scribd company logo
1 of 25
Guided by- Ms. Safa Hamdare
Group Members
.
Quora Duplicate Question Pair
Detection Using Semantic Analysis
Name Roll No.
Jai Mulye 64
Anshul Pawaskar 87
Tannmay Redij 88
Akshata Talankar 89
St. Francis Institute of Technology
Department of Computer Engineering
Quora Duplicate Question Pair Detection using Semantic Analysis
1 28/05/2021
Content
● Introduction
● Literature
● Problem Statement
● Proposed Solution
● Work Flow of the system
● Algorithm with Implementation details
● Experimental Set Up
● Data Set
● Performance Evaluation Parameters
● Validation with Test Cases
● Results & Discussion
● Conclusion
● References
28/05/2021 Quora Duplicate Question Pair Detection using Semantic Analysis 2
Introduction
• What is Quora?
28/05/2021 3
Quora Duplicate Question Pair Detection using Semantic Analysis
Current Scenario:
Quora uses Random Forest technique to identify duplicate
questions.
Let’s look at two hypothetical questions:
1. Is it true that time flies like an arrow?
2. Do fruit flies like a banana?
There are two common words in these questions, flies and
like.
4
28/05/2021 Quora Duplicate Question Pair Detection using Semantic Analysis 4
Let’s consider these
5
28/05/2021 Quora Duplicate Question Pair Detection using Semantic Analysis 5
Literature
• The paper[1] explores the Transformer based
Universal Sentence Encoder which relies on
attention mechanism.
• The paper[2] introduces Deep Averaging Network
which performs well with neural networks that model
semantic and syntactic compositionality.
6
28/05/2021 Quora Duplicate Question Pair Detection using Semantic Analysis
Literature
• The paper cited [3] explores the two variants of
Universal Sentence Encoder- the transformer and
the deep averaging network (DAN).
• The paper cited [4] analyses several neural network
designs and their variations for sentence pair
modelling and compare their performance
extensively across eight datasets, including
paraphrase identification, semantic textual similarity,
natural language inference, and question answering
tasks.
7
28/05/2021 Quora Duplicate Question Pair Detection using Semantic Analysis
Problem Statement
• On Quora, there may be people who might ask same
questions differently from an existing question. Solving
this problem will help to reduce the redundancy on the
platform and the manual task of identifying the questions
to match the correct answer for same. The task to identify
which questions asked on Quora are duplicates of
questions that have already been asked could be useful to
instantly provide answers of existing questions.
• A model created which can predict if the questions
entered are similar in meaning based on deep learning
approach using DAN & Transformer model.
28/05/2021 8
Quora Duplicate Question Pair Detection using Semantic Analysis
Proposed Solution
1. Pre Processing 3. Deep Learning Approach
(DAN & Transformer)
2. Sentence to Vector
Conversion (USE)
28/05/2021 Quora Duplicate Question Pair Detection using Semantic Analysis 9
Fig 1: Workflow of the System
Work Flow of the system
28/05/2021 10
Quora Duplicate Question Pair Detection using Semantic Analysis
Fig 2: Architecture Diagram
Algorithm with Implementation
Details
28/05/2021 11
Quora Duplicate Question Pair Detection using Semantic Analysis
Fig 3: Algorithm
Algorithm with Implementation
Details
28/05/2021 12
Quora Duplicate Question Pair Detection using Semantic Analysis
Fig 4: Implementation
Experimental Setup
28/05/2021 13
Quora Duplicate Question Pair Detection using Semantic Analysis
Fig 5: Dataset[5]
Experimental Setup
28/05/2021 14
Fig 6: Model accuracy of
Transformer
Fig 7: Model loss of
Transformer
Quora Duplicate Question Pair Detection using Semantic Analysis
Experimental Setup
28/05/2021 15
Fig 8: Model accuracy of DAN
Fig 9: Model loss of DAN
Quora Duplicate Question Pair Detection using Semantic Analysis
Validation with Test cases
28/05/2021 16
Quora Duplicate Question Pair Detection using Semantic Analysis
Results and Discussions
28/05/2021 17
Quora Duplicate Question Pair Detection using Semantic Analysis
Fig 10: Browse Questions
Results and Discussions
28/05/2021 18
Quora Duplicate Question Pair Detection using Semantic Analysis
Fig 11: Post Questions
Results and Discussions
28/05/2021 19
Quora Duplicate Question Pair Detection using Semantic Analysis
Fig 12: Results by DAN Model
Results and Discussions
28/05/2021 20
Quora Duplicate Question Pair Detection using Semantic Analysis
Fig 13: Results by Transformer Model
Conclusion
28/05/2021 21
Quora Duplicate Question Pair Detection using Semantic Analysis
Model Embedding technique
F1-score
weighted average
F1- Score macro
average
Logistic
Regression
Word2Vec, Similarity
scores
0.66 0.62
Random Forest
Word2Vec, Similarity
scores
0.70 0.69
Table 1:Accuracy of machine learning models
Conclusion
28/05/2021 22
Quora Duplicate Question Pair Detection using Semantic Analysis
Table 2:Accuracy of Deep learning models (DAN & Transformer)
Model
Embedding
technique
Epochs
Training
accuracy (%)
Validation
accuracy (%)
Neural
Network
Universal Sentence
Encoder (DAN)
20 88.63 86
Neural
Network
Universal Sentence
Encoder
(Transformer)
20 89.16 85
Conclusion
• Deep learning models using sentence level
embedding outperform the basic classification
model.
• DAN Model sometimes under performs with the
questions having double negation.
• Transformer based Universal Sentence Encoder can
be used.
28/05/2021 23
Quora Duplicate Question Pair Detection using Semantic Analysis
References
[1] Mueller J, Thyagarajan A. Siamese recurrent architectures for learning
sentence similarity. In: Proceedings of the thirtieth AAAI conference on artificial
intelligence. (2016)
[2] Eneko Agirre, Aitor Gonzalez-Agirre, Inigo Lopez-Gazpio, Montse Maritxalar,
German Rigau, and Larraitz Uria. Semeval-2016 task 2: Interpretable semantic
textual similarity. In: Proceedings of the 10th International Workshop on Semantic
Evaluation (2016).
[3] Vaswani, Ashish, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones,
Aidan N. Gomez, Łukasz Kaiser, and Illia Polosukhin. Attention is all you need.
Advances in neural information processing systems, pp. 5998-6008. 2017. (2017)
[4] Cer D, Yang Y, Kong S-Y, et al. Universal Sentence Encoder for English. In:
Proceedings of the 2018 Conference on Empirical Methods in Natural Language
Processing: System Demonstrations. doi: 10.18653/v1/d18-2029 (2018)
[5] https://www.kaggle.com/c/quora-question-pairs/data
28/05/2021 24
Quora Duplicate Question Pair Detection using Semantic Analysis
28/05/2021 25
Thank you
Quora Duplicate Question Pair Detection using Semantic Analysis

More Related Content

What's hot

Genetic algorithms vs Traditional algorithms
Genetic algorithms vs Traditional algorithmsGenetic algorithms vs Traditional algorithms
Genetic algorithms vs Traditional algorithmsDr. C.V. Suresh Babu
 
Hands on machine learning with scikit-learn and tensor flow by ahmed yousry
Hands on machine learning with scikit-learn and tensor flow by ahmed yousryHands on machine learning with scikit-learn and tensor flow by ahmed yousry
Hands on machine learning with scikit-learn and tensor flow by ahmed yousryAhmed Yousry
 
Word2vec algorithm
Word2vec algorithmWord2vec algorithm
Word2vec algorithmAndrew Koo
 
Performance Metrics for Machine Learning Algorithms
Performance Metrics for Machine Learning AlgorithmsPerformance Metrics for Machine Learning Algorithms
Performance Metrics for Machine Learning AlgorithmsKush Kulshrestha
 
WEKA Tutorial
WEKA TutorialWEKA Tutorial
WEKA Tutorialbutest
 
Artificial Intelligence techniques
Artificial Intelligence techniquesArtificial Intelligence techniques
Artificial Intelligence techniquesPavan Kumar Talla
 
Deep learning with Keras
Deep learning with KerasDeep learning with Keras
Deep learning with KerasQuantUniversity
 
Machine learning ppt.
Machine learning ppt.Machine learning ppt.
Machine learning ppt.ASHOK KUMAR
 
Autoencoders
AutoencodersAutoencoders
AutoencodersCloudxLab
 
Machine Learning Pipelines
Machine Learning PipelinesMachine Learning Pipelines
Machine Learning Pipelinesjeykottalam
 
Recurrent Neural Networks for Text Analysis
Recurrent Neural Networks for Text AnalysisRecurrent Neural Networks for Text Analysis
Recurrent Neural Networks for Text Analysisodsc
 
Autoencoders Tutorial | Autoencoders In Deep Learning | Tensorflow Training |...
Autoencoders Tutorial | Autoencoders In Deep Learning | Tensorflow Training |...Autoencoders Tutorial | Autoencoders In Deep Learning | Tensorflow Training |...
Autoencoders Tutorial | Autoencoders In Deep Learning | Tensorflow Training |...Edureka!
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine LearningRahul Jain
 
Data Analysis: Evaluation Metrics for Supervised Learning Models of Machine L...
Data Analysis: Evaluation Metrics for Supervised Learning Models of Machine L...Data Analysis: Evaluation Metrics for Supervised Learning Models of Machine L...
Data Analysis: Evaluation Metrics for Supervised Learning Models of Machine L...Md. Main Uddin Rony
 
Neural networks...
Neural networks...Neural networks...
Neural networks...Molly Chugh
 
House Sale Price Prediction
House Sale Price PredictionHouse Sale Price Prediction
House Sale Price Predictionsriram30691
 
Knowledge representation and Predicate logic
Knowledge representation and Predicate logicKnowledge representation and Predicate logic
Knowledge representation and Predicate logicAmey Kerkar
 
Differences Between Machine Learning Ml Artificial Intelligence Ai And Deep L...
Differences Between Machine Learning Ml Artificial Intelligence Ai And Deep L...Differences Between Machine Learning Ml Artificial Intelligence Ai And Deep L...
Differences Between Machine Learning Ml Artificial Intelligence Ai And Deep L...SlideTeam
 

What's hot (20)

Genetic algorithms vs Traditional algorithms
Genetic algorithms vs Traditional algorithmsGenetic algorithms vs Traditional algorithms
Genetic algorithms vs Traditional algorithms
 
Hands on machine learning with scikit-learn and tensor flow by ahmed yousry
Hands on machine learning with scikit-learn and tensor flow by ahmed yousryHands on machine learning with scikit-learn and tensor flow by ahmed yousry
Hands on machine learning with scikit-learn and tensor flow by ahmed yousry
 
Word2vec algorithm
Word2vec algorithmWord2vec algorithm
Word2vec algorithm
 
Performance Metrics for Machine Learning Algorithms
Performance Metrics for Machine Learning AlgorithmsPerformance Metrics for Machine Learning Algorithms
Performance Metrics for Machine Learning Algorithms
 
WEKA Tutorial
WEKA TutorialWEKA Tutorial
WEKA Tutorial
 
Artificial Intelligence techniques
Artificial Intelligence techniquesArtificial Intelligence techniques
Artificial Intelligence techniques
 
Deep learning with Keras
Deep learning with KerasDeep learning with Keras
Deep learning with Keras
 
Machine learning ppt.
Machine learning ppt.Machine learning ppt.
Machine learning ppt.
 
Autoencoders
AutoencodersAutoencoders
Autoencoders
 
Machine Learning Pipelines
Machine Learning PipelinesMachine Learning Pipelines
Machine Learning Pipelines
 
Recurrent Neural Networks for Text Analysis
Recurrent Neural Networks for Text AnalysisRecurrent Neural Networks for Text Analysis
Recurrent Neural Networks for Text Analysis
 
Federated Learning
Federated LearningFederated Learning
Federated Learning
 
Autoencoders Tutorial | Autoencoders In Deep Learning | Tensorflow Training |...
Autoencoders Tutorial | Autoencoders In Deep Learning | Tensorflow Training |...Autoencoders Tutorial | Autoencoders In Deep Learning | Tensorflow Training |...
Autoencoders Tutorial | Autoencoders In Deep Learning | Tensorflow Training |...
 
House price prediction
House price predictionHouse price prediction
House price prediction
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine Learning
 
Data Analysis: Evaluation Metrics for Supervised Learning Models of Machine L...
Data Analysis: Evaluation Metrics for Supervised Learning Models of Machine L...Data Analysis: Evaluation Metrics for Supervised Learning Models of Machine L...
Data Analysis: Evaluation Metrics for Supervised Learning Models of Machine L...
 
Neural networks...
Neural networks...Neural networks...
Neural networks...
 
House Sale Price Prediction
House Sale Price PredictionHouse Sale Price Prediction
House Sale Price Prediction
 
Knowledge representation and Predicate logic
Knowledge representation and Predicate logicKnowledge representation and Predicate logic
Knowledge representation and Predicate logic
 
Differences Between Machine Learning Ml Artificial Intelligence Ai And Deep L...
Differences Between Machine Learning Ml Artificial Intelligence Ai And Deep L...Differences Between Machine Learning Ml Artificial Intelligence Ai And Deep L...
Differences Between Machine Learning Ml Artificial Intelligence Ai And Deep L...
 

Similar to Quora Duplicate Question Detection Using Semantic Analysis

Naver learning to rank question answer pairs using hrde-ltc
Naver learning to rank question answer pairs using hrde-ltcNaver learning to rank question answer pairs using hrde-ltc
Naver learning to rank question answer pairs using hrde-ltcNAVER Engineering
 
JAVA 2013 IEEE DATAMINING PROJECT Comparable entity mining from comparative q...
JAVA 2013 IEEE DATAMINING PROJECT Comparable entity mining from comparative q...JAVA 2013 IEEE DATAMINING PROJECT Comparable entity mining from comparative q...
JAVA 2013 IEEE DATAMINING PROJECT Comparable entity mining from comparative q...IEEEGLOBALSOFTTECHNOLOGIES
 
Comparable entity mining from comparative questions
Comparable entity mining from comparative questionsComparable entity mining from comparative questions
Comparable entity mining from comparative questionsIEEEFINALYEARPROJECTS
 
Manta ray optimized deep contextualized bi-directional long short-term memor...
Manta ray optimized deep contextualized bi-directional long  short-term memor...Manta ray optimized deep contextualized bi-directional long  short-term memor...
Manta ray optimized deep contextualized bi-directional long short-term memor...IJECEIAES
 
Répondre à la question automatique avec le web
Répondre à la question automatique avec le webRépondre à la question automatique avec le web
Répondre à la question automatique avec le webAhmed Hammami
 
Cloudera Data Science Challenge 3 Solution by Doug Needham
Cloudera Data Science Challenge 3 Solution by Doug NeedhamCloudera Data Science Challenge 3 Solution by Doug Needham
Cloudera Data Science Challenge 3 Solution by Doug NeedhamDoug Needham
 
Record matching over multiple query result - Document
Record matching over multiple query result - DocumentRecord matching over multiple query result - Document
Record matching over multiple query result - DocumentNishna Ma
 
[DSC Europe 23] Dmitry Ustalov - Design and Evaluation of Large Language Models
[DSC Europe 23] Dmitry Ustalov - Design and Evaluation of Large Language Models[DSC Europe 23] Dmitry Ustalov - Design and Evaluation of Large Language Models
[DSC Europe 23] Dmitry Ustalov - Design and Evaluation of Large Language ModelsDataScienceConferenc1
 
Deep Learning Models for Question Answering
Deep Learning Models for Question AnsweringDeep Learning Models for Question Answering
Deep Learning Models for Question AnsweringSujit Pal
 
IRJET- Semantic Question Matching
IRJET- Semantic Question MatchingIRJET- Semantic Question Matching
IRJET- Semantic Question MatchingIRJET Journal
 
Eswc2009
Eswc2009Eswc2009
Eswc2009fanizzi
 
Nature Inspired Models And The Semantic Web
Nature Inspired Models And The Semantic WebNature Inspired Models And The Semantic Web
Nature Inspired Models And The Semantic WebStefan Ceriu
 
The Status of ML Algorithms for Structure-property Relationships Using Matb...
The Status of ML Algorithms for Structure-property Relationships Using Matb...The Status of ML Algorithms for Structure-property Relationships Using Matb...
The Status of ML Algorithms for Structure-property Relationships Using Matb...Anubhav Jain
 
A Comparison Of The Rule And Case-Based Reasoning Approaches For The Automati...
A Comparison Of The Rule And Case-Based Reasoning Approaches For The Automati...A Comparison Of The Rule And Case-Based Reasoning Approaches For The Automati...
A Comparison Of The Rule And Case-Based Reasoning Approaches For The Automati...Darian Pruitt
 
Question Retrieval in Community Question Answering via NON-Negative Matrix Fa...
Question Retrieval in Community Question Answering via NON-Negative Matrix Fa...Question Retrieval in Community Question Answering via NON-Negative Matrix Fa...
Question Retrieval in Community Question Answering via NON-Negative Matrix Fa...IRJET Journal
 
Predicting Fault-Prone Files using Machine Learning
Predicting Fault-Prone Files using Machine LearningPredicting Fault-Prone Files using Machine Learning
Predicting Fault-Prone Files using Machine LearningGuido A. Ciollaro
 
Verifications and Validations in Finite Element Analysis (FEA)
Verifications and Validations in Finite Element Analysis (FEA)Verifications and Validations in Finite Element Analysis (FEA)
Verifications and Validations in Finite Element Analysis (FEA)Kartik Srinivas
 

Similar to Quora Duplicate Question Detection Using Semantic Analysis (20)

Naver learning to rank question answer pairs using hrde-ltc
Naver learning to rank question answer pairs using hrde-ltcNaver learning to rank question answer pairs using hrde-ltc
Naver learning to rank question answer pairs using hrde-ltc
 
JAVA 2013 IEEE DATAMINING PROJECT Comparable entity mining from comparative q...
JAVA 2013 IEEE DATAMINING PROJECT Comparable entity mining from comparative q...JAVA 2013 IEEE DATAMINING PROJECT Comparable entity mining from comparative q...
JAVA 2013 IEEE DATAMINING PROJECT Comparable entity mining from comparative q...
 
Comparable entity mining from comparative questions
Comparable entity mining from comparative questionsComparable entity mining from comparative questions
Comparable entity mining from comparative questions
 
ISEC-2021-Presentation-Saikat-Mondal
ISEC-2021-Presentation-Saikat-MondalISEC-2021-Presentation-Saikat-Mondal
ISEC-2021-Presentation-Saikat-Mondal
 
Manta ray optimized deep contextualized bi-directional long short-term memor...
Manta ray optimized deep contextualized bi-directional long  short-term memor...Manta ray optimized deep contextualized bi-directional long  short-term memor...
Manta ray optimized deep contextualized bi-directional long short-term memor...
 
Répondre à la question automatique avec le web
Répondre à la question automatique avec le webRépondre à la question automatique avec le web
Répondre à la question automatique avec le web
 
Cloudera Data Science Challenge 3 Solution by Doug Needham
Cloudera Data Science Challenge 3 Solution by Doug NeedhamCloudera Data Science Challenge 3 Solution by Doug Needham
Cloudera Data Science Challenge 3 Solution by Doug Needham
 
Record matching over multiple query result - Document
Record matching over multiple query result - DocumentRecord matching over multiple query result - Document
Record matching over multiple query result - Document
 
[DSC Europe 23] Dmitry Ustalov - Design and Evaluation of Large Language Models
[DSC Europe 23] Dmitry Ustalov - Design and Evaluation of Large Language Models[DSC Europe 23] Dmitry Ustalov - Design and Evaluation of Large Language Models
[DSC Europe 23] Dmitry Ustalov - Design and Evaluation of Large Language Models
 
Deep Learning Models for Question Answering
Deep Learning Models for Question AnsweringDeep Learning Models for Question Answering
Deep Learning Models for Question Answering
 
IRJET- Semantic Question Matching
IRJET- Semantic Question MatchingIRJET- Semantic Question Matching
IRJET- Semantic Question Matching
 
Eswc2009
Eswc2009Eswc2009
Eswc2009
 
midterm_fa08.pdf
midterm_fa08.pdfmidterm_fa08.pdf
midterm_fa08.pdf
 
Nature Inspired Models And The Semantic Web
Nature Inspired Models And The Semantic WebNature Inspired Models And The Semantic Web
Nature Inspired Models And The Semantic Web
 
The Status of ML Algorithms for Structure-property Relationships Using Matb...
The Status of ML Algorithms for Structure-property Relationships Using Matb...The Status of ML Algorithms for Structure-property Relationships Using Matb...
The Status of ML Algorithms for Structure-property Relationships Using Matb...
 
A Comparison Of The Rule And Case-Based Reasoning Approaches For The Automati...
A Comparison Of The Rule And Case-Based Reasoning Approaches For The Automati...A Comparison Of The Rule And Case-Based Reasoning Approaches For The Automati...
A Comparison Of The Rule And Case-Based Reasoning Approaches For The Automati...
 
Ssbse12b.ppt
Ssbse12b.pptSsbse12b.ppt
Ssbse12b.ppt
 
Question Retrieval in Community Question Answering via NON-Negative Matrix Fa...
Question Retrieval in Community Question Answering via NON-Negative Matrix Fa...Question Retrieval in Community Question Answering via NON-Negative Matrix Fa...
Question Retrieval in Community Question Answering via NON-Negative Matrix Fa...
 
Predicting Fault-Prone Files using Machine Learning
Predicting Fault-Prone Files using Machine LearningPredicting Fault-Prone Files using Machine Learning
Predicting Fault-Prone Files using Machine Learning
 
Verifications and Validations in Finite Element Analysis (FEA)
Verifications and Validations in Finite Element Analysis (FEA)Verifications and Validations in Finite Element Analysis (FEA)
Verifications and Validations in Finite Element Analysis (FEA)
 

Recently uploaded

UNIT-II FMM-Flow Through Circular Conduits
UNIT-II FMM-Flow Through Circular ConduitsUNIT-II FMM-Flow Through Circular Conduits
UNIT-II FMM-Flow Through Circular Conduitsrknatarajan
 
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service NashikCall Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service NashikCall Girls in Nagpur High Profile
 
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...Call Girls in Nagpur High Profile
 
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSAPPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSKurinjimalarL3
 
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)Suman Mia
 
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130Suhani Kapoor
 
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat
 
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).pptssuser5c9d4b1
 
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
Introduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxIntroduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxupamatechverse
 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130Suhani Kapoor
 
SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )Tsuyoshi Horigome
 
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICSHARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICSRajkumarAkumalla
 
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Christo Ananth
 
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...ranjana rawat
 
Processing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxProcessing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxpranjaldaimarysona
 
UNIT-III FMM. DIMENSIONAL ANALYSIS
UNIT-III FMM.        DIMENSIONAL ANALYSISUNIT-III FMM.        DIMENSIONAL ANALYSIS
UNIT-III FMM. DIMENSIONAL ANALYSISrknatarajan
 
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...roncy bisnoi
 
KubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlyKubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlysanyuktamishra911
 

Recently uploaded (20)

UNIT-II FMM-Flow Through Circular Conduits
UNIT-II FMM-Flow Through Circular ConduitsUNIT-II FMM-Flow Through Circular Conduits
UNIT-II FMM-Flow Through Circular Conduits
 
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service NashikCall Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
 
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
 
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSAPPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
 
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)
 
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
 
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
 
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
 
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
 
Introduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxIntroduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptx
 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
 
SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )
 
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICSHARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
 
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
 
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
 
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
 
Processing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxProcessing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptx
 
UNIT-III FMM. DIMENSIONAL ANALYSIS
UNIT-III FMM.        DIMENSIONAL ANALYSISUNIT-III FMM.        DIMENSIONAL ANALYSIS
UNIT-III FMM. DIMENSIONAL ANALYSIS
 
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
 
KubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlyKubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghly
 

Quora Duplicate Question Detection Using Semantic Analysis

  • 1. Guided by- Ms. Safa Hamdare Group Members . Quora Duplicate Question Pair Detection Using Semantic Analysis Name Roll No. Jai Mulye 64 Anshul Pawaskar 87 Tannmay Redij 88 Akshata Talankar 89 St. Francis Institute of Technology Department of Computer Engineering Quora Duplicate Question Pair Detection using Semantic Analysis 1 28/05/2021
  • 2. Content ● Introduction ● Literature ● Problem Statement ● Proposed Solution ● Work Flow of the system ● Algorithm with Implementation details ● Experimental Set Up ● Data Set ● Performance Evaluation Parameters ● Validation with Test Cases ● Results & Discussion ● Conclusion ● References 28/05/2021 Quora Duplicate Question Pair Detection using Semantic Analysis 2
  • 3. Introduction • What is Quora? 28/05/2021 3 Quora Duplicate Question Pair Detection using Semantic Analysis
  • 4. Current Scenario: Quora uses Random Forest technique to identify duplicate questions. Let’s look at two hypothetical questions: 1. Is it true that time flies like an arrow? 2. Do fruit flies like a banana? There are two common words in these questions, flies and like. 4 28/05/2021 Quora Duplicate Question Pair Detection using Semantic Analysis 4
  • 5. Let’s consider these 5 28/05/2021 Quora Duplicate Question Pair Detection using Semantic Analysis 5
  • 6. Literature • The paper[1] explores the Transformer based Universal Sentence Encoder which relies on attention mechanism. • The paper[2] introduces Deep Averaging Network which performs well with neural networks that model semantic and syntactic compositionality. 6 28/05/2021 Quora Duplicate Question Pair Detection using Semantic Analysis
  • 7. Literature • The paper cited [3] explores the two variants of Universal Sentence Encoder- the transformer and the deep averaging network (DAN). • The paper cited [4] analyses several neural network designs and their variations for sentence pair modelling and compare their performance extensively across eight datasets, including paraphrase identification, semantic textual similarity, natural language inference, and question answering tasks. 7 28/05/2021 Quora Duplicate Question Pair Detection using Semantic Analysis
  • 8. Problem Statement • On Quora, there may be people who might ask same questions differently from an existing question. Solving this problem will help to reduce the redundancy on the platform and the manual task of identifying the questions to match the correct answer for same. The task to identify which questions asked on Quora are duplicates of questions that have already been asked could be useful to instantly provide answers of existing questions. • A model created which can predict if the questions entered are similar in meaning based on deep learning approach using DAN & Transformer model. 28/05/2021 8 Quora Duplicate Question Pair Detection using Semantic Analysis
  • 9. Proposed Solution 1. Pre Processing 3. Deep Learning Approach (DAN & Transformer) 2. Sentence to Vector Conversion (USE) 28/05/2021 Quora Duplicate Question Pair Detection using Semantic Analysis 9 Fig 1: Workflow of the System
  • 10. Work Flow of the system 28/05/2021 10 Quora Duplicate Question Pair Detection using Semantic Analysis Fig 2: Architecture Diagram
  • 11. Algorithm with Implementation Details 28/05/2021 11 Quora Duplicate Question Pair Detection using Semantic Analysis Fig 3: Algorithm
  • 12. Algorithm with Implementation Details 28/05/2021 12 Quora Duplicate Question Pair Detection using Semantic Analysis Fig 4: Implementation
  • 13. Experimental Setup 28/05/2021 13 Quora Duplicate Question Pair Detection using Semantic Analysis Fig 5: Dataset[5]
  • 14. Experimental Setup 28/05/2021 14 Fig 6: Model accuracy of Transformer Fig 7: Model loss of Transformer Quora Duplicate Question Pair Detection using Semantic Analysis
  • 15. Experimental Setup 28/05/2021 15 Fig 8: Model accuracy of DAN Fig 9: Model loss of DAN Quora Duplicate Question Pair Detection using Semantic Analysis
  • 16. Validation with Test cases 28/05/2021 16 Quora Duplicate Question Pair Detection using Semantic Analysis
  • 17. Results and Discussions 28/05/2021 17 Quora Duplicate Question Pair Detection using Semantic Analysis Fig 10: Browse Questions
  • 18. Results and Discussions 28/05/2021 18 Quora Duplicate Question Pair Detection using Semantic Analysis Fig 11: Post Questions
  • 19. Results and Discussions 28/05/2021 19 Quora Duplicate Question Pair Detection using Semantic Analysis Fig 12: Results by DAN Model
  • 20. Results and Discussions 28/05/2021 20 Quora Duplicate Question Pair Detection using Semantic Analysis Fig 13: Results by Transformer Model
  • 21. Conclusion 28/05/2021 21 Quora Duplicate Question Pair Detection using Semantic Analysis Model Embedding technique F1-score weighted average F1- Score macro average Logistic Regression Word2Vec, Similarity scores 0.66 0.62 Random Forest Word2Vec, Similarity scores 0.70 0.69 Table 1:Accuracy of machine learning models
  • 22. Conclusion 28/05/2021 22 Quora Duplicate Question Pair Detection using Semantic Analysis Table 2:Accuracy of Deep learning models (DAN & Transformer) Model Embedding technique Epochs Training accuracy (%) Validation accuracy (%) Neural Network Universal Sentence Encoder (DAN) 20 88.63 86 Neural Network Universal Sentence Encoder (Transformer) 20 89.16 85
  • 23. Conclusion • Deep learning models using sentence level embedding outperform the basic classification model. • DAN Model sometimes under performs with the questions having double negation. • Transformer based Universal Sentence Encoder can be used. 28/05/2021 23 Quora Duplicate Question Pair Detection using Semantic Analysis
  • 24. References [1] Mueller J, Thyagarajan A. Siamese recurrent architectures for learning sentence similarity. In: Proceedings of the thirtieth AAAI conference on artificial intelligence. (2016) [2] Eneko Agirre, Aitor Gonzalez-Agirre, Inigo Lopez-Gazpio, Montse Maritxalar, German Rigau, and Larraitz Uria. Semeval-2016 task 2: Interpretable semantic textual similarity. In: Proceedings of the 10th International Workshop on Semantic Evaluation (2016). [3] Vaswani, Ashish, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Łukasz Kaiser, and Illia Polosukhin. Attention is all you need. Advances in neural information processing systems, pp. 5998-6008. 2017. (2017) [4] Cer D, Yang Y, Kong S-Y, et al. Universal Sentence Encoder for English. In: Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing: System Demonstrations. doi: 10.18653/v1/d18-2029 (2018) [5] https://www.kaggle.com/c/quora-question-pairs/data 28/05/2021 24 Quora Duplicate Question Pair Detection using Semantic Analysis
  • 25. 28/05/2021 25 Thank you Quora Duplicate Question Pair Detection using Semantic Analysis