SlideShare a Scribd company logo
Final Report:
Construction through Deep RNN
Representation Learning
of Knowledge Bases
KoreaUniversity,
DepartmentofComputerScience&Radio
CommunicationEngineering
MASSIVEDATAMANAGEMENT
Professor JaewooKang
1
2015010661
2015011155
2016010646
MinhwanYu
YonghwaChoi
BumsooKim
MASSIVE DATA MANAGEMENT Presentation format
Contents
01.Introduction
02.PTransE
1-1.Whatdowewant?
1-3.TransH
2-1.Whatisdifferent?
2-2.RelationPathRepresentation
2-3.RecurrentNeuralNetwork
2
03.Ouralgorithm
3-1.Objectfunction
3-2.Activationfunction
3-3.LongShortTermMemory
04.Evaluation
4-1.Results
4-2.Comparison&Analysis
1-4.TransR
1-5.PTransE
1-2.TransE
MASSIVE DATA MANAGEMENT Presentation format
Introduction
1-2.TransE
1-3.TransH
3
1-4.TransR
1-5.PTransE
1-1.Whatdowewant?
MASSIVE DATA MANAGEMENT Presentation format
1. Introduction
4
1-1. What do we want?
Large-Scale
KnowledgeBases
Freebase
DBpedia
Yago
Realworld triples
KnowledgeBase(KB)
MASSIVE DATA MANAGEMENT Presentation format
1. Introduction
5
1-1. What do we want?
Large-Scale
KnowledgeBases
Freebase
Realworld triples
∴KBsareveryincomplete
Misrelation
1.Norelationfound
2.Wrongrelationfound
DBpedia
Yago
MASSIVE DATA MANAGEMENT Presentation format
1. Introduction
6
1-1. What do we want?
How doweaddress mis-relations?
MASSIVE DATA MANAGEMENT Presentation format
1. Introduction
7
1-1. What do we want?
Q.Howdoweaddressmisrelation?
A1. Addexternalsourcesfor
completion
Manualandintuitional
Requirestoomuchlabor
Inefficient,time-consuming
PerformancedropsasKBgetslarger
MASSIVE DATA MANAGEMENT Presentation format
1. Introduction
8
1-1. What do we want?
Q.Howdoweaddressmisrelation?
A1. Addexternalsourcesfor
completion
A2. Referencingothertuples
forcompletion
Powerful&Efficient
Largelyexpandable
MorepowerfulasKBgetslarger
MASSIVE DATA MANAGEMENT Presentation format
1. Introduction
9
1-1. What do we want?
Wetriedout…
TransE
TransH
TransR
PTransE
CompletionofKBbyaddressingmisrelationsthrough
co-entity,co-relationreference
MASSIVE DATA MANAGEMENT Presentation format
1. Introduction
10
1-2. TransE
relation#1Boy Girl
relation#1Queen
TransE
Representingdirectrelationswith
differencesbetweenentityvectors
Simple&Effective
reference:TransE(Bordesetal.NIPS2013)
MASSIVE DATA MANAGEMENT Presentation format
1. Introduction
11
1-2. TransE
TransE
Simple&Effective
relation#1Boy Girl
relation#1Queen King
Representingdirectrelationswith
differencesbetweenentityvectors
reference:TransE(Bordesetal.NIPS2013)
MASSIVE DATA MANAGEMENT Presentation format
1. Introduction
12
1-2. TransE
However…
TransEcan’trepresentmorethan
onerelationshipbetweenentities.
Inrealworld,weconstructmany
relationshipswithmanysubjects.
MASSIVE DATA MANAGEMENT Presentation format
1. Introduction
13
1-3. TransH
TransH
AbletorepresentM-to-Mrelations
Representingprojectedrelationswith
differencesbetweenentityvectors
Entitiesandrelationshavedifferentcharacteristics
However,theyarerepresentedinthesamespace
reference:TransH(Wangetal.AAAI2014)
MASSIVE DATA MANAGEMENT Presentation format
1. Introduction
14
1-4. TransR
Abletorepresentdifferentcharacteristics
betweenentitiesandrelations
Representingprojectedentitiesand
mappingthemintorelationspace‘r’
TransR
However,thesewereallnotgoodenoughfor
detectingandaddressingmisrelations!!
reference:TransR(AAAI2015)
MASSIVE DATA MANAGEMENT Presentation format
1. Introduction
15
1-5. PTransE
PTransE
Representingrelationsthroughcompositionof
relationsbetweenentityvectors
Widelyexpandable&powerfulmethod
h
t
e1
r1
r2
r=r1⋅r2
r1:father r1:father
reference:emnlpprocessing
MASSIVE DATA MANAGEMENT Presentation format
1. Introduction
16
1-5. PTransE
PTransE
Representingrelationsthroughcompositionof
relationsbetweenentityvectors
Widelyexpandable&powerfulmethod
h
t
e1
r1
r2
r=r1⋅r2
r:grandfather
reference:emnlpprocessing
MASSIVE DATA MANAGEMENT Presentation format
PTransE
2-2.RelationPathRepresentation
2-3.RecurrentNeuralNetwork
17
2-1.Whatisdifferent?
MASSIVE DATA MANAGEMENT Presentation format
2. PTransE
18
2-1. What is different?
PTransE
LearnsonlyonesinglematrixEachrelationneedsitsown
projectionmap.
TransR
50,000relationsneeds50,000
enormousprojectionmaps!
Learnsn-steprelationalpaths
Learnsrelationsthatevendon’t
haveenoughtrainingdataIfarelationdoesn’thaveenough
trainingdata,itwillsufferfromlow
performance.
Learnsrelationsonlyinthetraining
dataset.
Learnsrelationsthatarenotinthe
trainingdatasetthroughrelation
paths.(Zero-shotKBinference)
MASSIVE DATA MANAGEMENT Presentation format
2. PTransE
19
2-2. Relation Path Representation
MicrosoftisbasedinSeattle.
WhichcountryisMicrosoftlocatedin?
Microsoft Seattle
IsBasedInCountryIn
???
RelationNOTFOUND!(Misrelation)
MASSIVE DATA MANAGEMENT Presentation format
2. PTransE
20
2-2. Relation Path Representation
‘Microsoft’isbasedin‘Seattle’.
‘Seattle’islocatedinstate‘Washington’
Microsoft Seattle Washington USA
IsBasedIn
‘Washington’islocatedincountry‘USA’
StateIn CountryIn
MASSIVE DATA MANAGEMENT Presentation format
2. PTransE
21
2-2. Relation Path Representation
‘Microsoft’isbasedin‘Seattle’.
‘Seattle’islocatedinstate‘Washington’
Microsoft USA
‘Washington’islocatedincountry‘USA’
CountryLocatedIn
‘Microsoft’islocatedin‘USA’.
MASSIVE DATA MANAGEMENT Presentation format
2. PTransE
22
2-3. Recurrent Neural Network
Microsoft Seattle Washington
IsBasedIn StateIn CountryIn
USA
CountryLocatedIn
RNN
RNN
Learningamatrix 𝑾𝑾𝒓𝒓 for
constructingcomposition
vector
StateLocatedIn
p= 𝒇𝒇( 𝑾𝑾𝒓𝒓 𝒄𝒄𝟑𝟑;𝒄𝒄𝟒𝟒 ) = 𝒕𝒕𝒕𝒕𝒕𝒕𝒕𝒕( 𝑾𝑾𝒓𝒓 𝒄𝒄𝟑𝟑;𝒄𝒄𝟒𝟒 )
𝐜𝐜𝟑𝟑 = 𝒇𝒇( 𝑾𝑾𝒓𝒓 𝒄𝒄𝟏𝟏;𝒄𝒄𝟐𝟐 )
𝒄𝒄𝟏𝟏 𝒄𝒄𝟐𝟐
𝒄𝒄𝟒𝟒
reference:CompositionalVectorSpaceModelsforKnowledgeBaseInference
MASSIVE DATA MANAGEMENT Presentation format
Our Algorithm
3-2.Activationfunction
3-3.LongShortTermMemory
23
3-1.Objectivefunction
MASSIVE DATA MANAGEMENT Presentation format
3. Our Algorithm
24
3-1. Objective function
Me Father Grandfather
FatherOf FatherOf
RNN
Compositionvector
𝐩𝐩 = 𝒇𝒇( 𝑾𝑾𝒓𝒓 𝒄𝒄𝟏𝟏;𝒄𝒄𝟐𝟐 )
𝒄𝒄𝟏𝟏 𝒄𝒄𝟐𝟐
Me Grandfather
GrandfatherOf
Update 𝑾𝑾𝒓𝒓
MASSIVE DATA MANAGEMENT Presentation format
3. Our Algorithm
25
3-1. Objective function
Compositionvector
𝐩𝐩 = 𝒇𝒇( 𝑾𝑾𝒓𝒓 𝒄𝒄𝟏𝟏;𝒄𝒄𝟐𝟐 )GrandfatherOf
n-steprelationvector
𝒍𝒍𝒍𝒍𝒍𝒍𝒍𝒍
Updating 𝑾𝑾𝒓𝒓 …
𝒍𝒍𝒍𝒍𝒍𝒍𝒍𝒍BackPropagation
ForwardPropagation 𝒑𝒑𝒑𝒑𝒑𝒑𝒑𝒑
−𝒕𝒕𝒕𝒕𝒕𝒕𝒕𝒕𝒕𝒕𝒕𝒕𝑼𝑼𝑼𝑼𝑼𝑼𝑼𝑼𝑼𝑼𝑼𝑼
𝐖𝐖𝐫𝐫
MASSIVE DATA MANAGEMENT Presentation format
3. Our Algorithm
26
3-2. Activation function
However,NeuralNetworkisbasicallyalinearoperation
𝐩𝐩 = 𝒇𝒇( 𝑾𝑾𝒓𝒓 𝒄𝒄𝟏𝟏;𝒄𝒄𝟐𝟐 )
Applyinganon-linearaftereach
operationsinordertolearnnon-
lineardecisionboundary
tanhistheconventionaldefault
non-linearactivationfunctionfor
RNNmodels.
MASSIVE DATA MANAGEMENT Presentation format
3. Our Algorithm
27
3-2. Activation function
Then,what’swrongwithtanh?
Whenbackpropagatinggradients
inordertoupdatethematrix,
thesegradientstendtoconvergeto
0andeliminatesthegradientflow.
Therefore,nogradientflows
backward,andtheparameterstays
unmodified.
GradientVanishingProblem
MASSIVE DATA MANAGEMENT Presentation format
3. Our Algorithm
28
3-3. Long Short Term Memory
AddressesGradientVanishingproblems
HandleLongtermdependencies
ForgetGate
MASSIVE DATA MANAGEMENT Presentation format
Evaluation
4-2.Comparison&Analysis
29
4-1.Results
MASSIVE DATA MANAGEMENT Presentation format
4. Evaluation
30
4-1. Results
WeusedFB15K,andFB40Kdatasets
FB15K FB40K
# Relations 1,345 1,336
# Entities 14,951 39,528
# Training set 483,142 370,648
# Validation set 50,000 67,948
# Test set 59,071 96,678
MASSIVE DATA MANAGEMENT Presentation format
4. Evaluation
31
4-2. Comparisons & Analysis
Hit 10 (RAW) Hit 10 (FILTER)
TransE 34.9% 47.1%
TransH 45.7% 64.4%
TransR 43.8% 65.5%
Hit 10 (RAW) Hit 10 (FILTER)
PTransE(RNN)
(Original model)
50.6% 82.2%
PTransE(LSTM)
(Our model)
53.1% 86.6%
PTransE,RNNcomposition
Baseline
MASSIVE DATA MANAGEMENT Presentation format
4. Evaluation
32
4-2. Comparisons & Analysis
Hit 10 (RAW) Hit 10 (FILTER)
TransE 34.9% 47.1%
TransH 45.7% 64.4%
TransR 43.8% 65.5%
Hit 10 (RAW) Hit 10 (FILTER)
PTransE(RNN)
(Original model)
50.6% 82.2%
PTransE(LSTM)
(Our model)
53.1% 86.6%
PTransE,RNNcomposition
(+9.3%)
Baseline
MASSIVE DATA MANAGEMENT Presentation format
4. Evaluation
33
4-2. Comparisons & Analysis
Hit 10 (RAW) Hit 10 (FILTER)
TransE 34.9% 47.1%
TransH 45.7% 64.4%
TransR 43.8% 65.5%
Hit 10 (RAW) Hit 10 (FILTER)
PTransE(RNN)
(Original model)
50.6% 82.2%
PTransE(LSTM)
(Our model)
53.1% 86.6%
PTransE,RNNcomposition
(+21.1%)
Baseline
MASSIVE DATA MANAGEMENT Presentation format
4. Evaluation
34
4-2. Comparisons & Analysis
Hit 10 (RAW) Hit 10 (FILTER)
PTransE(ADD)
(Original model)
51.8% 83.4%
PTransE(ADD)
(Our model)
52.1% 84.1%
PTransE(MUL)
(Original model)
47.4% 77.7%
PTransE(MUL)
(Our model)
47.1% 77.2%
PTransE(RNN)
(Original model)
50.6% 82.2%
PTransE(LSTM)
(Our model)
53.1% 86.6%
PTransE,Compositioncomparison
Thank you for your attention!
35

More Related Content

What's hot

Presentation on "Mizan: A System for Dynamic Load Balancing in Large-scale Gr...
Presentation on "Mizan: A System for Dynamic Load Balancing in Large-scale Gr...Presentation on "Mizan: A System for Dynamic Load Balancing in Large-scale Gr...
Presentation on "Mizan: A System for Dynamic Load Balancing in Large-scale Gr...
Zuhair khayyat
 
Mathematics 08-00326
Mathematics 08-00326Mathematics 08-00326
Mathematics 08-00326
Hoopeer Hoopeer
 
Reactive/Proactive Connectivity Management in a Tactical Service-Oriented Inf...
Reactive/Proactive Connectivity Management in a Tactical Service-Oriented Inf...Reactive/Proactive Connectivity Management in a Tactical Service-Oriented Inf...
Reactive/Proactive Connectivity Management in a Tactical Service-Oriented Inf...
Roberto Rigolin F. Lopes
 
Segmentation of Images by using Fuzzy k-means clustering with ACO
Segmentation of Images by using Fuzzy k-means clustering with ACOSegmentation of Images by using Fuzzy k-means clustering with ACO
Segmentation of Images by using Fuzzy k-means clustering with ACO
IJTET Journal
 
Improvement of chaotic secure communication scheme based on steganographic me...
Improvement of chaotic secure communication scheme based on steganographic me...Improvement of chaotic secure communication scheme based on steganographic me...
Improvement of chaotic secure communication scheme based on steganographic me...
Massoud Khodadadzadeh
 
Cerebellar Model Articulation Controller
Cerebellar Model Articulation ControllerCerebellar Model Articulation Controller
Cerebellar Model Articulation Controller
Zahra Sadeghi
 
Electricity Demand Forecasting Using Fuzzy-Neural Network
Electricity Demand Forecasting Using Fuzzy-Neural NetworkElectricity Demand Forecasting Using Fuzzy-Neural Network
Electricity Demand Forecasting Using Fuzzy-Neural Network
Naren Chandra Kattla
 
Caravan insurance data mining prediction models
Caravan insurance data mining prediction modelsCaravan insurance data mining prediction models
Caravan insurance data mining prediction models
Muthu Kumaar Thangavelu
 
Reliability Prediction using the Fussel Algorithm
Reliability Prediction using the Fussel AlgorithmReliability Prediction using the Fussel Algorithm
Reliability Prediction using the Fussel Algorithm
IRJET Journal
 
11 construction productivity and cost estimation using artificial
11 construction productivity and cost estimation using artificial 11 construction productivity and cost estimation using artificial
11 construction productivity and cost estimation using artificial
Vivan17
 
Electricity Demand Forecasting Using ANN
Electricity Demand Forecasting Using ANNElectricity Demand Forecasting Using ANN
Electricity Demand Forecasting Using ANN
Naren Chandra Kattla
 
Traffic sign classification
Traffic sign classificationTraffic sign classification
Traffic sign classification
Bill Kromydas
 
IRJET- LS Chaotic based Image Encryption System Via Permutation Models
IRJET- LS Chaotic based Image Encryption System Via Permutation ModelsIRJET- LS Chaotic based Image Encryption System Via Permutation Models
IRJET- LS Chaotic based Image Encryption System Via Permutation Models
IRJET Journal
 

What's hot (13)

Presentation on "Mizan: A System for Dynamic Load Balancing in Large-scale Gr...
Presentation on "Mizan: A System for Dynamic Load Balancing in Large-scale Gr...Presentation on "Mizan: A System for Dynamic Load Balancing in Large-scale Gr...
Presentation on "Mizan: A System for Dynamic Load Balancing in Large-scale Gr...
 
Mathematics 08-00326
Mathematics 08-00326Mathematics 08-00326
Mathematics 08-00326
 
Reactive/Proactive Connectivity Management in a Tactical Service-Oriented Inf...
Reactive/Proactive Connectivity Management in a Tactical Service-Oriented Inf...Reactive/Proactive Connectivity Management in a Tactical Service-Oriented Inf...
Reactive/Proactive Connectivity Management in a Tactical Service-Oriented Inf...
 
Segmentation of Images by using Fuzzy k-means clustering with ACO
Segmentation of Images by using Fuzzy k-means clustering with ACOSegmentation of Images by using Fuzzy k-means clustering with ACO
Segmentation of Images by using Fuzzy k-means clustering with ACO
 
Improvement of chaotic secure communication scheme based on steganographic me...
Improvement of chaotic secure communication scheme based on steganographic me...Improvement of chaotic secure communication scheme based on steganographic me...
Improvement of chaotic secure communication scheme based on steganographic me...
 
Cerebellar Model Articulation Controller
Cerebellar Model Articulation ControllerCerebellar Model Articulation Controller
Cerebellar Model Articulation Controller
 
Electricity Demand Forecasting Using Fuzzy-Neural Network
Electricity Demand Forecasting Using Fuzzy-Neural NetworkElectricity Demand Forecasting Using Fuzzy-Neural Network
Electricity Demand Forecasting Using Fuzzy-Neural Network
 
Caravan insurance data mining prediction models
Caravan insurance data mining prediction modelsCaravan insurance data mining prediction models
Caravan insurance data mining prediction models
 
Reliability Prediction using the Fussel Algorithm
Reliability Prediction using the Fussel AlgorithmReliability Prediction using the Fussel Algorithm
Reliability Prediction using the Fussel Algorithm
 
11 construction productivity and cost estimation using artificial
11 construction productivity and cost estimation using artificial 11 construction productivity and cost estimation using artificial
11 construction productivity and cost estimation using artificial
 
Electricity Demand Forecasting Using ANN
Electricity Demand Forecasting Using ANNElectricity Demand Forecasting Using ANN
Electricity Demand Forecasting Using ANN
 
Traffic sign classification
Traffic sign classificationTraffic sign classification
Traffic sign classification
 
IRJET- LS Chaotic based Image Encryption System Via Permutation Models
IRJET- LS Chaotic based Image Encryption System Via Permutation ModelsIRJET- LS Chaotic based Image Encryption System Via Permutation Models
IRJET- LS Chaotic based Image Encryption System Via Permutation Models
 

Similar to Representation learning

24csit38.pdf
24csit38.pdf24csit38.pdf
24csit38.pdf
amine597585
 
BACKPROPAGATION LEARNING ALGORITHM BASED ON LEVENBERG MARQUARDT ALGORITHM
BACKPROPAGATION LEARNING ALGORITHM BASED ON LEVENBERG MARQUARDT ALGORITHMBACKPROPAGATION LEARNING ALGORITHM BASED ON LEVENBERG MARQUARDT ALGORITHM
BACKPROPAGATION LEARNING ALGORITHM BASED ON LEVENBERG MARQUARDT ALGORITHM
cscpconf
 
A COMPARATIVE STUDY OF BACKPROPAGATION ALGORITHMS IN FINANCIAL PREDICTION
A COMPARATIVE STUDY OF BACKPROPAGATION ALGORITHMS IN FINANCIAL PREDICTIONA COMPARATIVE STUDY OF BACKPROPAGATION ALGORITHMS IN FINANCIAL PREDICTION
A COMPARATIVE STUDY OF BACKPROPAGATION ALGORITHMS IN FINANCIAL PREDICTION
IJCSEA Journal
 
Neural Network Based Individual Classification System
Neural Network Based Individual Classification SystemNeural Network Based Individual Classification System
Neural Network Based Individual Classification System
IRJET Journal
 
Neural Network Implementation Control Mobile Robot
Neural Network Implementation Control Mobile RobotNeural Network Implementation Control Mobile Robot
Neural Network Implementation Control Mobile Robot
IRJET Journal
 
05 backpropagation automatic_differentiation
05 backpropagation automatic_differentiation05 backpropagation automatic_differentiation
05 backpropagation automatic_differentiation
Andres Mendez-Vazquez
 
Electricity Price Forecasting Using ELM-Tree Approach
Electricity Price Forecasting Using ELM-Tree ApproachElectricity Price Forecasting Using ELM-Tree Approach
Electricity Price Forecasting Using ELM-Tree Approach
IRJET Journal
 
Introduction to deep learning
Introduction to deep learningIntroduction to deep learning
Introduction to deep learning
Junaid Bhat
 
An Optimized Parallel Algorithm for Longest Common Subsequence Using Openmp –...
An Optimized Parallel Algorithm for Longest Common Subsequence Using Openmp –...An Optimized Parallel Algorithm for Longest Common Subsequence Using Openmp –...
An Optimized Parallel Algorithm for Longest Common Subsequence Using Openmp –...
IRJET Journal
 
A Connectionist Approach to Dynamic Resource Management for Virtualised Netwo...
A Connectionist Approach to Dynamic Resource Management for Virtualised Netwo...A Connectionist Approach to Dynamic Resource Management for Virtualised Netwo...
A Connectionist Approach to Dynamic Resource Management for Virtualised Netwo...
Rashid Mijumbi
 
Studies On Traffic Management Models for Wireless Communication Network
Studies On Traffic Management Models for Wireless Communication NetworkStudies On Traffic Management Models for Wireless Communication Network
Studies On Traffic Management Models for Wireless Communication Network
NeetaSingh38
 
220206 transformer interpretability beyond attention visualization
220206 transformer interpretability beyond attention visualization220206 transformer interpretability beyond attention visualization
220206 transformer interpretability beyond attention visualization
taeseon ryu
 
Manifold Blurring Mean Shift algorithms for manifold denoising, report, 2012
Manifold Blurring Mean Shift algorithms for manifold denoising, report, 2012Manifold Blurring Mean Shift algorithms for manifold denoising, report, 2012
Manifold Blurring Mean Shift algorithms for manifold denoising, report, 2012
Florent Renucci
 
Implementation of recurrent neural network for the forecasting of USD buy ra...
Implementation of recurrent neural network for the forecasting  of USD buy ra...Implementation of recurrent neural network for the forecasting  of USD buy ra...
Implementation of recurrent neural network for the forecasting of USD buy ra...
IJECEIAES
 
Analysis of Various Single Frame Super Resolution Techniques for better PSNR
Analysis of Various Single Frame Super Resolution Techniques for better PSNRAnalysis of Various Single Frame Super Resolution Techniques for better PSNR
Analysis of Various Single Frame Super Resolution Techniques for better PSNR
IRJET Journal
 
Paper review: Learned Optimizers that Scale and Generalize.
Paper review: Learned Optimizers that Scale and Generalize.Paper review: Learned Optimizers that Scale and Generalize.
Paper review: Learned Optimizers that Scale and Generalize.
Wuhyun Rico Shin
 
IRJET- RESULT:Wavelet Transform along with SPIHT Algorithm Used for Image Com...
IRJET- RESULT:Wavelet Transform along with SPIHT Algorithm Used for Image Com...IRJET- RESULT:Wavelet Transform along with SPIHT Algorithm Used for Image Com...
IRJET- RESULT:Wavelet Transform along with SPIHT Algorithm Used for Image Com...
IRJET Journal
 
I05745368
I05745368I05745368
I05745368
IOSR-JEN
 
Neural Network-Based Actuator Fault Diagnosis for a Non-Linear Multi-Tank System
Neural Network-Based Actuator Fault Diagnosis for a Non-Linear Multi-Tank SystemNeural Network-Based Actuator Fault Diagnosis for a Non-Linear Multi-Tank System
Neural Network-Based Actuator Fault Diagnosis for a Non-Linear Multi-Tank System
ISA Interchange
 
Jf3515881595
Jf3515881595Jf3515881595
Jf3515881595
IJERA Editor
 

Similar to Representation learning (20)

24csit38.pdf
24csit38.pdf24csit38.pdf
24csit38.pdf
 
BACKPROPAGATION LEARNING ALGORITHM BASED ON LEVENBERG MARQUARDT ALGORITHM
BACKPROPAGATION LEARNING ALGORITHM BASED ON LEVENBERG MARQUARDT ALGORITHMBACKPROPAGATION LEARNING ALGORITHM BASED ON LEVENBERG MARQUARDT ALGORITHM
BACKPROPAGATION LEARNING ALGORITHM BASED ON LEVENBERG MARQUARDT ALGORITHM
 
A COMPARATIVE STUDY OF BACKPROPAGATION ALGORITHMS IN FINANCIAL PREDICTION
A COMPARATIVE STUDY OF BACKPROPAGATION ALGORITHMS IN FINANCIAL PREDICTIONA COMPARATIVE STUDY OF BACKPROPAGATION ALGORITHMS IN FINANCIAL PREDICTION
A COMPARATIVE STUDY OF BACKPROPAGATION ALGORITHMS IN FINANCIAL PREDICTION
 
Neural Network Based Individual Classification System
Neural Network Based Individual Classification SystemNeural Network Based Individual Classification System
Neural Network Based Individual Classification System
 
Neural Network Implementation Control Mobile Robot
Neural Network Implementation Control Mobile RobotNeural Network Implementation Control Mobile Robot
Neural Network Implementation Control Mobile Robot
 
05 backpropagation automatic_differentiation
05 backpropagation automatic_differentiation05 backpropagation automatic_differentiation
05 backpropagation automatic_differentiation
 
Electricity Price Forecasting Using ELM-Tree Approach
Electricity Price Forecasting Using ELM-Tree ApproachElectricity Price Forecasting Using ELM-Tree Approach
Electricity Price Forecasting Using ELM-Tree Approach
 
Introduction to deep learning
Introduction to deep learningIntroduction to deep learning
Introduction to deep learning
 
An Optimized Parallel Algorithm for Longest Common Subsequence Using Openmp –...
An Optimized Parallel Algorithm for Longest Common Subsequence Using Openmp –...An Optimized Parallel Algorithm for Longest Common Subsequence Using Openmp –...
An Optimized Parallel Algorithm for Longest Common Subsequence Using Openmp –...
 
A Connectionist Approach to Dynamic Resource Management for Virtualised Netwo...
A Connectionist Approach to Dynamic Resource Management for Virtualised Netwo...A Connectionist Approach to Dynamic Resource Management for Virtualised Netwo...
A Connectionist Approach to Dynamic Resource Management for Virtualised Netwo...
 
Studies On Traffic Management Models for Wireless Communication Network
Studies On Traffic Management Models for Wireless Communication NetworkStudies On Traffic Management Models for Wireless Communication Network
Studies On Traffic Management Models for Wireless Communication Network
 
220206 transformer interpretability beyond attention visualization
220206 transformer interpretability beyond attention visualization220206 transformer interpretability beyond attention visualization
220206 transformer interpretability beyond attention visualization
 
Manifold Blurring Mean Shift algorithms for manifold denoising, report, 2012
Manifold Blurring Mean Shift algorithms for manifold denoising, report, 2012Manifold Blurring Mean Shift algorithms for manifold denoising, report, 2012
Manifold Blurring Mean Shift algorithms for manifold denoising, report, 2012
 
Implementation of recurrent neural network for the forecasting of USD buy ra...
Implementation of recurrent neural network for the forecasting  of USD buy ra...Implementation of recurrent neural network for the forecasting  of USD buy ra...
Implementation of recurrent neural network for the forecasting of USD buy ra...
 
Analysis of Various Single Frame Super Resolution Techniques for better PSNR
Analysis of Various Single Frame Super Resolution Techniques for better PSNRAnalysis of Various Single Frame Super Resolution Techniques for better PSNR
Analysis of Various Single Frame Super Resolution Techniques for better PSNR
 
Paper review: Learned Optimizers that Scale and Generalize.
Paper review: Learned Optimizers that Scale and Generalize.Paper review: Learned Optimizers that Scale and Generalize.
Paper review: Learned Optimizers that Scale and Generalize.
 
IRJET- RESULT:Wavelet Transform along with SPIHT Algorithm Used for Image Com...
IRJET- RESULT:Wavelet Transform along with SPIHT Algorithm Used for Image Com...IRJET- RESULT:Wavelet Transform along with SPIHT Algorithm Used for Image Com...
IRJET- RESULT:Wavelet Transform along with SPIHT Algorithm Used for Image Com...
 
I05745368
I05745368I05745368
I05745368
 
Neural Network-Based Actuator Fault Diagnosis for a Non-Linear Multi-Tank System
Neural Network-Based Actuator Fault Diagnosis for a Non-Linear Multi-Tank SystemNeural Network-Based Actuator Fault Diagnosis for a Non-Linear Multi-Tank System
Neural Network-Based Actuator Fault Diagnosis for a Non-Linear Multi-Tank System
 
Jf3515881595
Jf3515881595Jf3515881595
Jf3515881595
 

More from Brian Kim

FreeAnchor
FreeAnchorFreeAnchor
FreeAnchor
Brian Kim
 
20190708 bumsookim yolact
20190708 bumsookim yolact20190708 bumsookim yolact
20190708 bumsookim yolact
Brian Kim
 
20190718 bumsookim 2_attention
20190718 bumsookim 2_attention20190718 bumsookim 2_attention
20190718 bumsookim 2_attention
Brian Kim
 
Spectral cnn
Spectral cnnSpectral cnn
Spectral cnn
Brian Kim
 
Compressing neural language models by sparse word representation
Compressing neural language models by sparse word representationCompressing neural language models by sparse word representation
Compressing neural language models by sparse word representation
Brian Kim
 
Dcgan
DcganDcgan
Dcgan
Brian Kim
 
Google net
Google netGoogle net
Google net
Brian Kim
 

More from Brian Kim (7)

FreeAnchor
FreeAnchorFreeAnchor
FreeAnchor
 
20190708 bumsookim yolact
20190708 bumsookim yolact20190708 bumsookim yolact
20190708 bumsookim yolact
 
20190718 bumsookim 2_attention
20190718 bumsookim 2_attention20190718 bumsookim 2_attention
20190718 bumsookim 2_attention
 
Spectral cnn
Spectral cnnSpectral cnn
Spectral cnn
 
Compressing neural language models by sparse word representation
Compressing neural language models by sparse word representationCompressing neural language models by sparse word representation
Compressing neural language models by sparse word representation
 
Dcgan
DcganDcgan
Dcgan
 
Google net
Google netGoogle net
Google net
 

Recently uploaded

DSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelinesDSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelines
Timothy Spann
 
原版一比一多伦多大学毕业证(UofT毕业证书)如何办理
原版一比一多伦多大学毕业证(UofT毕业证书)如何办理原版一比一多伦多大学毕业证(UofT毕业证书)如何办理
原版一比一多伦多大学毕业证(UofT毕业证书)如何办理
mkkikqvo
 
原版制作(unimelb毕业证书)墨尔本大学毕业证Offer一模一样
原版制作(unimelb毕业证书)墨尔本大学毕业证Offer一模一样原版制作(unimelb毕业证书)墨尔本大学毕业证Offer一模一样
原版制作(unimelb毕业证书)墨尔本大学毕业证Offer一模一样
ihavuls
 
一比一原版(Unimelb毕业证书)墨尔本大学毕业证如何办理
一比一原版(Unimelb毕业证书)墨尔本大学毕业证如何办理一比一原版(Unimelb毕业证书)墨尔本大学毕业证如何办理
一比一原版(Unimelb毕业证书)墨尔本大学毕业证如何办理
xclpvhuk
 
原版一比一爱尔兰都柏林大学毕业证(UCD毕业证书)如何办理
原版一比一爱尔兰都柏林大学毕业证(UCD毕业证书)如何办理 原版一比一爱尔兰都柏林大学毕业证(UCD毕业证书)如何办理
原版一比一爱尔兰都柏林大学毕业证(UCD毕业证书)如何办理
tzu5xla
 
一比一原版英属哥伦比亚大学毕业证(UBC毕业证书)学历如何办理
一比一原版英属哥伦比亚大学毕业证(UBC毕业证书)学历如何办理一比一原版英属哥伦比亚大学毕业证(UBC毕业证书)学历如何办理
一比一原版英属哥伦比亚大学毕业证(UBC毕业证书)学历如何办理
z6osjkqvd
 
Building a Quantum Computer Neutral Atom.pdf
Building a Quantum Computer Neutral Atom.pdfBuilding a Quantum Computer Neutral Atom.pdf
Building a Quantum Computer Neutral Atom.pdf
cjimenez2581
 
一比一原版(曼大毕业证书)曼尼托巴大学毕业证如何办理
一比一原版(曼大毕业证书)曼尼托巴大学毕业证如何办理一比一原版(曼大毕业证书)曼尼托巴大学毕业证如何办理
一比一原版(曼大毕业证书)曼尼托巴大学毕业证如何办理
ytypuem
 
一比一原版英国赫特福德大学毕业证(hertfordshire毕业证书)如何办理
一比一原版英国赫特福德大学毕业证(hertfordshire毕业证书)如何办理一比一原版英国赫特福德大学毕业证(hertfordshire毕业证书)如何办理
一比一原版英国赫特福德大学毕业证(hertfordshire毕业证书)如何办理
nyvan3
 
End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024
Lars Albertsson
 
Jio cinema Retention & Engagement Strategy.pdf
Jio cinema Retention & Engagement Strategy.pdfJio cinema Retention & Engagement Strategy.pdf
Jio cinema Retention & Engagement Strategy.pdf
inaya7568
 
Experts live - Improving user adoption with AI
Experts live - Improving user adoption with AIExperts live - Improving user adoption with AI
Experts live - Improving user adoption with AI
jitskeb
 
Template xxxxxxxx ssssssssssss Sertifikat.pptx
Template xxxxxxxx ssssssssssss Sertifikat.pptxTemplate xxxxxxxx ssssssssssss Sertifikat.pptx
Template xxxxxxxx ssssssssssss Sertifikat.pptx
TeukuEriSyahputra
 
[VCOSA] Monthly Report - Cotton & Yarn Statistics March 2024
[VCOSA] Monthly Report - Cotton & Yarn Statistics March 2024[VCOSA] Monthly Report - Cotton & Yarn Statistics March 2024
[VCOSA] Monthly Report - Cotton & Yarn Statistics March 2024
Vietnam Cotton & Spinning Association
 
一比一原版澳洲西澳大学毕业证(uwa毕业证书)如何办理
一比一原版澳洲西澳大学毕业证(uwa毕业证书)如何办理一比一原版澳洲西澳大学毕业证(uwa毕业证书)如何办理
一比一原版澳洲西澳大学毕业证(uwa毕业证书)如何办理
aguty
 
一比一原版(UO毕业证)渥太华大学毕业证如何办理
一比一原版(UO毕业证)渥太华大学毕业证如何办理一比一原版(UO毕业证)渥太华大学毕业证如何办理
一比一原版(UO毕业证)渥太华大学毕业证如何办理
bmucuha
 
writing report business partner b1+ .pdf
writing report business partner b1+ .pdfwriting report business partner b1+ .pdf
writing report business partner b1+ .pdf
VyNguyen709676
 
Palo Alto Cortex XDR presentation .......
Palo Alto Cortex XDR presentation .......Palo Alto Cortex XDR presentation .......
Palo Alto Cortex XDR presentation .......
Sachin Paul
 
UofT毕业证如何办理
UofT毕业证如何办理UofT毕业证如何办理
UofT毕业证如何办理
exukyp
 
一比一原版南十字星大学毕业证(SCU毕业证书)学历如何办理
一比一原版南十字星大学毕业证(SCU毕业证书)学历如何办理一比一原版南十字星大学毕业证(SCU毕业证书)学历如何办理
一比一原版南十字星大学毕业证(SCU毕业证书)学历如何办理
slg6lamcq
 

Recently uploaded (20)

DSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelinesDSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelines
 
原版一比一多伦多大学毕业证(UofT毕业证书)如何办理
原版一比一多伦多大学毕业证(UofT毕业证书)如何办理原版一比一多伦多大学毕业证(UofT毕业证书)如何办理
原版一比一多伦多大学毕业证(UofT毕业证书)如何办理
 
原版制作(unimelb毕业证书)墨尔本大学毕业证Offer一模一样
原版制作(unimelb毕业证书)墨尔本大学毕业证Offer一模一样原版制作(unimelb毕业证书)墨尔本大学毕业证Offer一模一样
原版制作(unimelb毕业证书)墨尔本大学毕业证Offer一模一样
 
一比一原版(Unimelb毕业证书)墨尔本大学毕业证如何办理
一比一原版(Unimelb毕业证书)墨尔本大学毕业证如何办理一比一原版(Unimelb毕业证书)墨尔本大学毕业证如何办理
一比一原版(Unimelb毕业证书)墨尔本大学毕业证如何办理
 
原版一比一爱尔兰都柏林大学毕业证(UCD毕业证书)如何办理
原版一比一爱尔兰都柏林大学毕业证(UCD毕业证书)如何办理 原版一比一爱尔兰都柏林大学毕业证(UCD毕业证书)如何办理
原版一比一爱尔兰都柏林大学毕业证(UCD毕业证书)如何办理
 
一比一原版英属哥伦比亚大学毕业证(UBC毕业证书)学历如何办理
一比一原版英属哥伦比亚大学毕业证(UBC毕业证书)学历如何办理一比一原版英属哥伦比亚大学毕业证(UBC毕业证书)学历如何办理
一比一原版英属哥伦比亚大学毕业证(UBC毕业证书)学历如何办理
 
Building a Quantum Computer Neutral Atom.pdf
Building a Quantum Computer Neutral Atom.pdfBuilding a Quantum Computer Neutral Atom.pdf
Building a Quantum Computer Neutral Atom.pdf
 
一比一原版(曼大毕业证书)曼尼托巴大学毕业证如何办理
一比一原版(曼大毕业证书)曼尼托巴大学毕业证如何办理一比一原版(曼大毕业证书)曼尼托巴大学毕业证如何办理
一比一原版(曼大毕业证书)曼尼托巴大学毕业证如何办理
 
一比一原版英国赫特福德大学毕业证(hertfordshire毕业证书)如何办理
一比一原版英国赫特福德大学毕业证(hertfordshire毕业证书)如何办理一比一原版英国赫特福德大学毕业证(hertfordshire毕业证书)如何办理
一比一原版英国赫特福德大学毕业证(hertfordshire毕业证书)如何办理
 
End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024
 
Jio cinema Retention & Engagement Strategy.pdf
Jio cinema Retention & Engagement Strategy.pdfJio cinema Retention & Engagement Strategy.pdf
Jio cinema Retention & Engagement Strategy.pdf
 
Experts live - Improving user adoption with AI
Experts live - Improving user adoption with AIExperts live - Improving user adoption with AI
Experts live - Improving user adoption with AI
 
Template xxxxxxxx ssssssssssss Sertifikat.pptx
Template xxxxxxxx ssssssssssss Sertifikat.pptxTemplate xxxxxxxx ssssssssssss Sertifikat.pptx
Template xxxxxxxx ssssssssssss Sertifikat.pptx
 
[VCOSA] Monthly Report - Cotton & Yarn Statistics March 2024
[VCOSA] Monthly Report - Cotton & Yarn Statistics March 2024[VCOSA] Monthly Report - Cotton & Yarn Statistics March 2024
[VCOSA] Monthly Report - Cotton & Yarn Statistics March 2024
 
一比一原版澳洲西澳大学毕业证(uwa毕业证书)如何办理
一比一原版澳洲西澳大学毕业证(uwa毕业证书)如何办理一比一原版澳洲西澳大学毕业证(uwa毕业证书)如何办理
一比一原版澳洲西澳大学毕业证(uwa毕业证书)如何办理
 
一比一原版(UO毕业证)渥太华大学毕业证如何办理
一比一原版(UO毕业证)渥太华大学毕业证如何办理一比一原版(UO毕业证)渥太华大学毕业证如何办理
一比一原版(UO毕业证)渥太华大学毕业证如何办理
 
writing report business partner b1+ .pdf
writing report business partner b1+ .pdfwriting report business partner b1+ .pdf
writing report business partner b1+ .pdf
 
Palo Alto Cortex XDR presentation .......
Palo Alto Cortex XDR presentation .......Palo Alto Cortex XDR presentation .......
Palo Alto Cortex XDR presentation .......
 
UofT毕业证如何办理
UofT毕业证如何办理UofT毕业证如何办理
UofT毕业证如何办理
 
一比一原版南十字星大学毕业证(SCU毕业证书)学历如何办理
一比一原版南十字星大学毕业证(SCU毕业证书)学历如何办理一比一原版南十字星大学毕业证(SCU毕业证书)学历如何办理
一比一原版南十字星大学毕业证(SCU毕业证书)学历如何办理
 

Representation learning

  • 1. Final Report: Construction through Deep RNN Representation Learning of Knowledge Bases KoreaUniversity, DepartmentofComputerScience&Radio CommunicationEngineering MASSIVEDATAMANAGEMENT Professor JaewooKang 1 2015010661 2015011155 2016010646 MinhwanYu YonghwaChoi BumsooKim
  • 2. MASSIVE DATA MANAGEMENT Presentation format Contents 01.Introduction 02.PTransE 1-1.Whatdowewant? 1-3.TransH 2-1.Whatisdifferent? 2-2.RelationPathRepresentation 2-3.RecurrentNeuralNetwork 2 03.Ouralgorithm 3-1.Objectfunction 3-2.Activationfunction 3-3.LongShortTermMemory 04.Evaluation 4-1.Results 4-2.Comparison&Analysis 1-4.TransR 1-5.PTransE 1-2.TransE
  • 3. MASSIVE DATA MANAGEMENT Presentation format Introduction 1-2.TransE 1-3.TransH 3 1-4.TransR 1-5.PTransE 1-1.Whatdowewant?
  • 4. MASSIVE DATA MANAGEMENT Presentation format 1. Introduction 4 1-1. What do we want? Large-Scale KnowledgeBases Freebase DBpedia Yago Realworld triples KnowledgeBase(KB)
  • 5. MASSIVE DATA MANAGEMENT Presentation format 1. Introduction 5 1-1. What do we want? Large-Scale KnowledgeBases Freebase Realworld triples ∴KBsareveryincomplete Misrelation 1.Norelationfound 2.Wrongrelationfound DBpedia Yago
  • 6. MASSIVE DATA MANAGEMENT Presentation format 1. Introduction 6 1-1. What do we want? How doweaddress mis-relations?
  • 7. MASSIVE DATA MANAGEMENT Presentation format 1. Introduction 7 1-1. What do we want? Q.Howdoweaddressmisrelation? A1. Addexternalsourcesfor completion Manualandintuitional Requirestoomuchlabor Inefficient,time-consuming PerformancedropsasKBgetslarger
  • 8. MASSIVE DATA MANAGEMENT Presentation format 1. Introduction 8 1-1. What do we want? Q.Howdoweaddressmisrelation? A1. Addexternalsourcesfor completion A2. Referencingothertuples forcompletion Powerful&Efficient Largelyexpandable MorepowerfulasKBgetslarger
  • 9. MASSIVE DATA MANAGEMENT Presentation format 1. Introduction 9 1-1. What do we want? Wetriedout… TransE TransH TransR PTransE CompletionofKBbyaddressingmisrelationsthrough co-entity,co-relationreference
  • 10. MASSIVE DATA MANAGEMENT Presentation format 1. Introduction 10 1-2. TransE relation#1Boy Girl relation#1Queen TransE Representingdirectrelationswith differencesbetweenentityvectors Simple&Effective reference:TransE(Bordesetal.NIPS2013)
  • 11. MASSIVE DATA MANAGEMENT Presentation format 1. Introduction 11 1-2. TransE TransE Simple&Effective relation#1Boy Girl relation#1Queen King Representingdirectrelationswith differencesbetweenentityvectors reference:TransE(Bordesetal.NIPS2013)
  • 12. MASSIVE DATA MANAGEMENT Presentation format 1. Introduction 12 1-2. TransE However… TransEcan’trepresentmorethan onerelationshipbetweenentities. Inrealworld,weconstructmany relationshipswithmanysubjects.
  • 13. MASSIVE DATA MANAGEMENT Presentation format 1. Introduction 13 1-3. TransH TransH AbletorepresentM-to-Mrelations Representingprojectedrelationswith differencesbetweenentityvectors Entitiesandrelationshavedifferentcharacteristics However,theyarerepresentedinthesamespace reference:TransH(Wangetal.AAAI2014)
  • 14. MASSIVE DATA MANAGEMENT Presentation format 1. Introduction 14 1-4. TransR Abletorepresentdifferentcharacteristics betweenentitiesandrelations Representingprojectedentitiesand mappingthemintorelationspace‘r’ TransR However,thesewereallnotgoodenoughfor detectingandaddressingmisrelations!! reference:TransR(AAAI2015)
  • 15. MASSIVE DATA MANAGEMENT Presentation format 1. Introduction 15 1-5. PTransE PTransE Representingrelationsthroughcompositionof relationsbetweenentityvectors Widelyexpandable&powerfulmethod h t e1 r1 r2 r=r1⋅r2 r1:father r1:father reference:emnlpprocessing
  • 16. MASSIVE DATA MANAGEMENT Presentation format 1. Introduction 16 1-5. PTransE PTransE Representingrelationsthroughcompositionof relationsbetweenentityvectors Widelyexpandable&powerfulmethod h t e1 r1 r2 r=r1⋅r2 r:grandfather reference:emnlpprocessing
  • 17. MASSIVE DATA MANAGEMENT Presentation format PTransE 2-2.RelationPathRepresentation 2-3.RecurrentNeuralNetwork 17 2-1.Whatisdifferent?
  • 18. MASSIVE DATA MANAGEMENT Presentation format 2. PTransE 18 2-1. What is different? PTransE LearnsonlyonesinglematrixEachrelationneedsitsown projectionmap. TransR 50,000relationsneeds50,000 enormousprojectionmaps! Learnsn-steprelationalpaths Learnsrelationsthatevendon’t haveenoughtrainingdataIfarelationdoesn’thaveenough trainingdata,itwillsufferfromlow performance. Learnsrelationsonlyinthetraining dataset. Learnsrelationsthatarenotinthe trainingdatasetthroughrelation paths.(Zero-shotKBinference)
  • 19. MASSIVE DATA MANAGEMENT Presentation format 2. PTransE 19 2-2. Relation Path Representation MicrosoftisbasedinSeattle. WhichcountryisMicrosoftlocatedin? Microsoft Seattle IsBasedInCountryIn ??? RelationNOTFOUND!(Misrelation)
  • 20. MASSIVE DATA MANAGEMENT Presentation format 2. PTransE 20 2-2. Relation Path Representation ‘Microsoft’isbasedin‘Seattle’. ‘Seattle’islocatedinstate‘Washington’ Microsoft Seattle Washington USA IsBasedIn ‘Washington’islocatedincountry‘USA’ StateIn CountryIn
  • 21. MASSIVE DATA MANAGEMENT Presentation format 2. PTransE 21 2-2. Relation Path Representation ‘Microsoft’isbasedin‘Seattle’. ‘Seattle’islocatedinstate‘Washington’ Microsoft USA ‘Washington’islocatedincountry‘USA’ CountryLocatedIn ‘Microsoft’islocatedin‘USA’.
  • 22. MASSIVE DATA MANAGEMENT Presentation format 2. PTransE 22 2-3. Recurrent Neural Network Microsoft Seattle Washington IsBasedIn StateIn CountryIn USA CountryLocatedIn RNN RNN Learningamatrix 𝑾𝑾𝒓𝒓 for constructingcomposition vector StateLocatedIn p= 𝒇𝒇( 𝑾𝑾𝒓𝒓 𝒄𝒄𝟑𝟑;𝒄𝒄𝟒𝟒 ) = 𝒕𝒕𝒕𝒕𝒕𝒕𝒕𝒕( 𝑾𝑾𝒓𝒓 𝒄𝒄𝟑𝟑;𝒄𝒄𝟒𝟒 ) 𝐜𝐜𝟑𝟑 = 𝒇𝒇( 𝑾𝑾𝒓𝒓 𝒄𝒄𝟏𝟏;𝒄𝒄𝟐𝟐 ) 𝒄𝒄𝟏𝟏 𝒄𝒄𝟐𝟐 𝒄𝒄𝟒𝟒 reference:CompositionalVectorSpaceModelsforKnowledgeBaseInference
  • 23. MASSIVE DATA MANAGEMENT Presentation format Our Algorithm 3-2.Activationfunction 3-3.LongShortTermMemory 23 3-1.Objectivefunction
  • 24. MASSIVE DATA MANAGEMENT Presentation format 3. Our Algorithm 24 3-1. Objective function Me Father Grandfather FatherOf FatherOf RNN Compositionvector 𝐩𝐩 = 𝒇𝒇( 𝑾𝑾𝒓𝒓 𝒄𝒄𝟏𝟏;𝒄𝒄𝟐𝟐 ) 𝒄𝒄𝟏𝟏 𝒄𝒄𝟐𝟐 Me Grandfather GrandfatherOf Update 𝑾𝑾𝒓𝒓
  • 25. MASSIVE DATA MANAGEMENT Presentation format 3. Our Algorithm 25 3-1. Objective function Compositionvector 𝐩𝐩 = 𝒇𝒇( 𝑾𝑾𝒓𝒓 𝒄𝒄𝟏𝟏;𝒄𝒄𝟐𝟐 )GrandfatherOf n-steprelationvector 𝒍𝒍𝒍𝒍𝒍𝒍𝒍𝒍 Updating 𝑾𝑾𝒓𝒓 … 𝒍𝒍𝒍𝒍𝒍𝒍𝒍𝒍BackPropagation ForwardPropagation 𝒑𝒑𝒑𝒑𝒑𝒑𝒑𝒑 −𝒕𝒕𝒕𝒕𝒕𝒕𝒕𝒕𝒕𝒕𝒕𝒕𝑼𝑼𝑼𝑼𝑼𝑼𝑼𝑼𝑼𝑼𝑼𝑼 𝐖𝐖𝐫𝐫
  • 26. MASSIVE DATA MANAGEMENT Presentation format 3. Our Algorithm 26 3-2. Activation function However,NeuralNetworkisbasicallyalinearoperation 𝐩𝐩 = 𝒇𝒇( 𝑾𝑾𝒓𝒓 𝒄𝒄𝟏𝟏;𝒄𝒄𝟐𝟐 ) Applyinganon-linearaftereach operationsinordertolearnnon- lineardecisionboundary tanhistheconventionaldefault non-linearactivationfunctionfor RNNmodels.
  • 27. MASSIVE DATA MANAGEMENT Presentation format 3. Our Algorithm 27 3-2. Activation function Then,what’swrongwithtanh? Whenbackpropagatinggradients inordertoupdatethematrix, thesegradientstendtoconvergeto 0andeliminatesthegradientflow. Therefore,nogradientflows backward,andtheparameterstays unmodified. GradientVanishingProblem
  • 28. MASSIVE DATA MANAGEMENT Presentation format 3. Our Algorithm 28 3-3. Long Short Term Memory AddressesGradientVanishingproblems HandleLongtermdependencies ForgetGate
  • 29. MASSIVE DATA MANAGEMENT Presentation format Evaluation 4-2.Comparison&Analysis 29 4-1.Results
  • 30. MASSIVE DATA MANAGEMENT Presentation format 4. Evaluation 30 4-1. Results WeusedFB15K,andFB40Kdatasets FB15K FB40K # Relations 1,345 1,336 # Entities 14,951 39,528 # Training set 483,142 370,648 # Validation set 50,000 67,948 # Test set 59,071 96,678
  • 31. MASSIVE DATA MANAGEMENT Presentation format 4. Evaluation 31 4-2. Comparisons & Analysis Hit 10 (RAW) Hit 10 (FILTER) TransE 34.9% 47.1% TransH 45.7% 64.4% TransR 43.8% 65.5% Hit 10 (RAW) Hit 10 (FILTER) PTransE(RNN) (Original model) 50.6% 82.2% PTransE(LSTM) (Our model) 53.1% 86.6% PTransE,RNNcomposition Baseline
  • 32. MASSIVE DATA MANAGEMENT Presentation format 4. Evaluation 32 4-2. Comparisons & Analysis Hit 10 (RAW) Hit 10 (FILTER) TransE 34.9% 47.1% TransH 45.7% 64.4% TransR 43.8% 65.5% Hit 10 (RAW) Hit 10 (FILTER) PTransE(RNN) (Original model) 50.6% 82.2% PTransE(LSTM) (Our model) 53.1% 86.6% PTransE,RNNcomposition (+9.3%) Baseline
  • 33. MASSIVE DATA MANAGEMENT Presentation format 4. Evaluation 33 4-2. Comparisons & Analysis Hit 10 (RAW) Hit 10 (FILTER) TransE 34.9% 47.1% TransH 45.7% 64.4% TransR 43.8% 65.5% Hit 10 (RAW) Hit 10 (FILTER) PTransE(RNN) (Original model) 50.6% 82.2% PTransE(LSTM) (Our model) 53.1% 86.6% PTransE,RNNcomposition (+21.1%) Baseline
  • 34. MASSIVE DATA MANAGEMENT Presentation format 4. Evaluation 34 4-2. Comparisons & Analysis Hit 10 (RAW) Hit 10 (FILTER) PTransE(ADD) (Original model) 51.8% 83.4% PTransE(ADD) (Our model) 52.1% 84.1% PTransE(MUL) (Original model) 47.4% 77.7% PTransE(MUL) (Our model) 47.1% 77.2% PTransE(RNN) (Original model) 50.6% 82.2% PTransE(LSTM) (Our model) 53.1% 86.6% PTransE,Compositioncomparison
  • 35. Thank you for your attention! 35