SlideShare a Scribd company logo
1 of 11
Neuron level interpretation
of Deep NLP model
By Shreya Goyal (015251990)
Introduction
The proliferation of deep neural networks in various domains has seen an
increased need for the interpretability of these methods. A plethora of
research has been carried out to analyze and understand components of the
deep neural network models. Preliminary work done along these lines and
papers that surveyed such were focused on a more high-level representation
analysis.
What are neurons ?
State-of-the-art neural networks, such as RNNs or Transformer models consist of various
components such as blocks, layers, attention heads, gates/cells, etc. We use the term neuron
to refer to the output of a single dimension from any of the aforementioned neural network
components. For example, in the BERT base model, the output of a layer block has 768
neurons and the output of an attention head has 64 neurons. In the literature, neurons have
also been referred to as features, experts, and units.
The Objective of Neuron Analysis
Given a neural network model, neuron analysis aims to identify neurons that: i) learn a
specific concept, or ii) are important to the model. An important facet is that some methods
require a concept C as an additional input, in which case the importance is computed against
the concept, while other methods just require the model M, where importance is measured
with respect to the overall model and not a specific concept. For example, considering
gender as a concept, the goal of neuron analysis would be to extract neurons from a model
that encodes gender information.
Visualization
A simple way to discover the role of neurons is through visualizing their activations and
manually identifying patterns over a set of sentences. However given the large number of
neurons in a neural network model, it is cumbersome to visualize all of them.
Corpus Rank
Given a neuron, the Corpus Rank method sorts sentences based on the activation values. The
top sentences are then visualized to identify a concept the neuron is representing. For
example, In a survey, it is mentioned that words that maximally activate a neuron by
analyzing its top-k context from the corpus. The Corpus Rank method makes the process less
cumbersome by reducing the search space to only the top sentences, as opposed to the
Visualization method where the entire corpus or a random subset is analyzed by a human.
Corpus Generation
Corpus rank methods are efficacious in identifying neurons focusing on lexical concepts.
However, their space of analysis is limited to the underlying corpus. It is possible that a
neuron represents a diverse concept not featured in the corpus. The Corpus Generation
method addresses this problem by generating novel sentences that maximize a neuron’s
activations.
Mask based corpus selection
The corpus rank and corpus generation methods require humans in the loop to analyze the
concepts that are represented by the neurons. Now we discuss a set of methods that
automatically select a concept given a neuron. In a paper few researchers have proposed a
Masked-based Corpus Selection method to determine important neurons with respect to a
concept.
Neuron Probing
The visualization and corpus-based methods involve analyzing neurons from a data
perspective, relying on sentences from corpora or automatically generating them. The
Neuron Probing method trains a post-hoc classifier to identify neurons with respect to a pre-
defined concept. Specifically, given supervised data for a concept, say Noun, it extracts
activations of all the noun words in context.
Evaluation:
Ablation
While ablation has been used to discover salient neurons for the model, it has also been used
to evaluate the efficacy of the rankings. The idea is to ablate neurons in the model in the
order of their importance. For neuron ranking to be correct, removing the top neurons
should result in larger drop in performance compared to removing same amount of random
or bottom neurons from that list.
Classification Performance
Given salient neurons with respect to a concept, a simple method to evaluate their
correctness is to train a classifier using them as features and predict the concept of interest.
Applications
Controlling model behavior
By manipulating the values of the switch neuron at test time, researchers were able to successfully change
output translations from present to past tense. The authors additionally found neurons that capture
gender and number agreement concepts and manipulated them to control the system’s output.
Model Distillation and Efficiency
Someone devised an efficient feature-based transfer learning procedure, stemmed from their redundancy
analysis. By exploiting layer and neuron-specific redundancy in the transformer models, they were able to
reduce the feature set size to less than 100 neurons for several tasks while maintaining more than 97% of
the performance.

More Related Content

What's hot

Neural Networks for Pattern Recognition
Neural Networks for Pattern RecognitionNeural Networks for Pattern Recognition
Neural Networks for Pattern RecognitionVipra Singh
 
파이콘 한국 2020) 파이썬으로 구현하는 신경세포 기반의 인공 뇌 시뮬레이터
파이콘 한국 2020) 파이썬으로 구현하는 신경세포 기반의 인공 뇌 시뮬레이터파이콘 한국 2020) 파이썬으로 구현하는 신경세포 기반의 인공 뇌 시뮬레이터
파이콘 한국 2020) 파이썬으로 구현하는 신경세포 기반의 인공 뇌 시뮬레이터Seonghyun Kim
 
Artificial Neural Network
Artificial Neural NetworkArtificial Neural Network
Artificial Neural NetworkBurhan Muzafar
 
NEURAL NETWORKS
NEURAL NETWORKSNEURAL NETWORKS
NEURAL NETWORKSESCOM
 
Artificial Neural Network in Medical Diagnosis
Artificial Neural Network in Medical DiagnosisArtificial Neural Network in Medical Diagnosis
Artificial Neural Network in Medical DiagnosisAdityendra Kumar Singh
 
Artificial intelligence NEURAL NETWORKS
Artificial intelligence NEURAL NETWORKSArtificial intelligence NEURAL NETWORKS
Artificial intelligence NEURAL NETWORKSREHMAT ULLAH
 
Introduction Of Artificial neural network
Introduction Of Artificial neural networkIntroduction Of Artificial neural network
Introduction Of Artificial neural networkNagarajan
 
A framework for approaches to transfer of mind substrate
A framework for approaches to transfer of mind substrateA framework for approaches to transfer of mind substrate
A framework for approaches to transfer of mind substrateKarlos Svoboda
 
Theories of error back propagation in the brain review
Theories of error back propagation in the brain reviewTheories of error back propagation in the brain review
Theories of error back propagation in the brain reviewSeonghyun Kim
 
Artificial Neural Network
Artificial Neural NetworkArtificial Neural Network
Artificial Neural NetworkMuhammad Ishaq
 
Neuroscience Information Framework Ontologies: Nerve cells in Neurolex and NI...
Neuroscience Information Framework Ontologies: Nerve cells in Neurolex and NI...Neuroscience Information Framework Ontologies: Nerve cells in Neurolex and NI...
Neuroscience Information Framework Ontologies: Nerve cells in Neurolex and NI...Neuroscience Information Framework
 
Ai and neural networks
Ai and neural networksAi and neural networks
Ai and neural networksNikhil Kansari
 
Artificial Neural Network Paper Presentation
Artificial Neural Network Paper PresentationArtificial Neural Network Paper Presentation
Artificial Neural Network Paper Presentationguestac67362
 
Artifical neural networks
Artifical neural networksArtifical neural networks
Artifical neural networksalldesign
 
Artificial neural network
Artificial neural networkArtificial neural network
Artificial neural networkGauravPandey319
 
Artificial neural network
Artificial neural networkArtificial neural network
Artificial neural networkFaria Priya
 

What's hot (20)

Neural Networks for Pattern Recognition
Neural Networks for Pattern RecognitionNeural Networks for Pattern Recognition
Neural Networks for Pattern Recognition
 
파이콘 한국 2020) 파이썬으로 구현하는 신경세포 기반의 인공 뇌 시뮬레이터
파이콘 한국 2020) 파이썬으로 구현하는 신경세포 기반의 인공 뇌 시뮬레이터파이콘 한국 2020) 파이썬으로 구현하는 신경세포 기반의 인공 뇌 시뮬레이터
파이콘 한국 2020) 파이썬으로 구현하는 신경세포 기반의 인공 뇌 시뮬레이터
 
Artificial Neural Network
Artificial Neural NetworkArtificial Neural Network
Artificial Neural Network
 
NEURAL NETWORKS
NEURAL NETWORKSNEURAL NETWORKS
NEURAL NETWORKS
 
Artificial Neural Network in Medical Diagnosis
Artificial Neural Network in Medical DiagnosisArtificial Neural Network in Medical Diagnosis
Artificial Neural Network in Medical Diagnosis
 
Artificial intelligence NEURAL NETWORKS
Artificial intelligence NEURAL NETWORKSArtificial intelligence NEURAL NETWORKS
Artificial intelligence NEURAL NETWORKS
 
Introduction Of Artificial neural network
Introduction Of Artificial neural networkIntroduction Of Artificial neural network
Introduction Of Artificial neural network
 
A framework for approaches to transfer of mind substrate
A framework for approaches to transfer of mind substrateA framework for approaches to transfer of mind substrate
A framework for approaches to transfer of mind substrate
 
[IJET V2I2P20] Authors: Dr. Sanjeev S Sannakki, Ms.Anjanabhargavi A Kulkarni
[IJET V2I2P20] Authors: Dr. Sanjeev S Sannakki, Ms.Anjanabhargavi A Kulkarni[IJET V2I2P20] Authors: Dr. Sanjeev S Sannakki, Ms.Anjanabhargavi A Kulkarni
[IJET V2I2P20] Authors: Dr. Sanjeev S Sannakki, Ms.Anjanabhargavi A Kulkarni
 
Theories of error back propagation in the brain review
Theories of error back propagation in the brain reviewTheories of error back propagation in the brain review
Theories of error back propagation in the brain review
 
Artificial Neural Network
Artificial Neural NetworkArtificial Neural Network
Artificial Neural Network
 
Neuroscience Information Framework Ontologies: Nerve cells in Neurolex and NI...
Neuroscience Information Framework Ontologies: Nerve cells in Neurolex and NI...Neuroscience Information Framework Ontologies: Nerve cells in Neurolex and NI...
Neuroscience Information Framework Ontologies: Nerve cells in Neurolex and NI...
 
Ai and neural networks
Ai and neural networksAi and neural networks
Ai and neural networks
 
Artificial Neural Network Paper Presentation
Artificial Neural Network Paper PresentationArtificial Neural Network Paper Presentation
Artificial Neural Network Paper Presentation
 
Artifical neural networks
Artifical neural networksArtifical neural networks
Artifical neural networks
 
Neural networks
Neural networksNeural networks
Neural networks
 
Artificial neural network
Artificial neural networkArtificial neural network
Artificial neural network
 
VEU_CST499_FinalReport
VEU_CST499_FinalReportVEU_CST499_FinalReport
VEU_CST499_FinalReport
 
Artificial neural network
Artificial neural networkArtificial neural network
Artificial neural network
 
MaLAI_Hyderabad presentation
MaLAI_Hyderabad presentationMaLAI_Hyderabad presentation
MaLAI_Hyderabad presentation
 

Similar to Neuron level interpretation of deep nlp model

Computational neuropharmacology drug designing
Computational neuropharmacology drug designingComputational neuropharmacology drug designing
Computational neuropharmacology drug designingRevathi Boyina
 
AI Chapter VIIProblem Solving Using Searching .pptx
AI Chapter VIIProblem Solving Using Searching .pptxAI Chapter VIIProblem Solving Using Searching .pptx
AI Chapter VIIProblem Solving Using Searching .pptxwekineheshete
 
Dissertation character recognition - Report
Dissertation character recognition - ReportDissertation character recognition - Report
Dissertation character recognition - Reportsachinkumar Bharadva
 
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdfCCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdfAsst.prof M.Gokilavani
 
Topic Explain the theory of natural selection. Show you understand .docx
Topic Explain the theory of natural selection. Show you understand .docxTopic Explain the theory of natural selection. Show you understand .docx
Topic Explain the theory of natural selection. Show you understand .docxedwardmarivel
 
Summary Of Thesis
Summary Of ThesisSummary Of Thesis
Summary Of Thesisguestb452d6
 
NIFSTD and NeuroLex: A Comprehensive Ontology Development Based on Multiple B...
NIFSTD and NeuroLex: A Comprehensive Ontology Development Based on Multiple B...NIFSTD and NeuroLex: A Comprehensive Ontology Development Based on Multiple B...
NIFSTD and NeuroLex: A Comprehensive Ontology Development Based on Multiple B...Neuroscience Information Framework
 
Semester presentation
Semester presentationSemester presentation
Semester presentationkhush bakhat
 
Neural Network
Neural NetworkNeural Network
Neural NetworkSayyed Z
 
Neural networks are parallel computing devices.docx.pdf
Neural networks are parallel computing devices.docx.pdfNeural networks are parallel computing devices.docx.pdf
Neural networks are parallel computing devices.docx.pdfneelamsanjeevkumar
 
BASIC CONCEPT OF DEEP LEARNING.pptx
BASIC CONCEPT OF DEEP LEARNING.pptxBASIC CONCEPT OF DEEP LEARNING.pptx
BASIC CONCEPT OF DEEP LEARNING.pptxRiteshPandey184067
 
AR245-NE28-16 ARI 19 May 2005 1210An IntegrativeTheory .docx
AR245-NE28-16 ARI 19 May 2005 1210An IntegrativeTheory .docxAR245-NE28-16 ARI 19 May 2005 1210An IntegrativeTheory .docx
AR245-NE28-16 ARI 19 May 2005 1210An IntegrativeTheory .docxrossskuddershamus
 
Aspect oriented a candidate for neural networks and evolvable software
Aspect oriented a candidate for neural networks and evolvable softwareAspect oriented a candidate for neural networks and evolvable software
Aspect oriented a candidate for neural networks and evolvable softwareLinchuan Wang
 
CCS355 Neural Networks & Deep Learning Unit 1 PDF notes with Question bank .pdf
CCS355 Neural Networks & Deep Learning Unit 1 PDF notes with Question bank .pdfCCS355 Neural Networks & Deep Learning Unit 1 PDF notes with Question bank .pdf
CCS355 Neural Networks & Deep Learning Unit 1 PDF notes with Question bank .pdfAsst.prof M.Gokilavani
 
Artificial Neural Network report
Artificial Neural Network reportArtificial Neural Network report
Artificial Neural Network reportAnjali Agrawal
 

Similar to Neuron level interpretation of deep nlp model (20)

Unit+i
Unit+iUnit+i
Unit+i
 
Computational neuropharmacology drug designing
Computational neuropharmacology drug designingComputational neuropharmacology drug designing
Computational neuropharmacology drug designing
 
AI Chapter VIIProblem Solving Using Searching .pptx
AI Chapter VIIProblem Solving Using Searching .pptxAI Chapter VIIProblem Solving Using Searching .pptx
AI Chapter VIIProblem Solving Using Searching .pptx
 
Dissertation character recognition - Report
Dissertation character recognition - ReportDissertation character recognition - Report
Dissertation character recognition - Report
 
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdfCCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
 
Topic Explain the theory of natural selection. Show you understand .docx
Topic Explain the theory of natural selection. Show you understand .docxTopic Explain the theory of natural selection. Show you understand .docx
Topic Explain the theory of natural selection. Show you understand .docx
 
Summary Of Thesis
Summary Of ThesisSummary Of Thesis
Summary Of Thesis
 
NIFSTD and NeuroLex: A Comprehensive Ontology Development Based on Multiple B...
NIFSTD and NeuroLex: A Comprehensive Ontology Development Based on Multiple B...NIFSTD and NeuroLex: A Comprehensive Ontology Development Based on Multiple B...
NIFSTD and NeuroLex: A Comprehensive Ontology Development Based on Multiple B...
 
Semester presentation
Semester presentationSemester presentation
Semester presentation
 
AI Lesson 37
AI Lesson 37AI Lesson 37
AI Lesson 37
 
AS application
AS applicationAS application
AS application
 
Neural Network
Neural NetworkNeural Network
Neural Network
 
Neural Networks
Neural NetworksNeural Networks
Neural Networks
 
Neural networks are parallel computing devices.docx.pdf
Neural networks are parallel computing devices.docx.pdfNeural networks are parallel computing devices.docx.pdf
Neural networks are parallel computing devices.docx.pdf
 
BASIC CONCEPT OF DEEP LEARNING.pptx
BASIC CONCEPT OF DEEP LEARNING.pptxBASIC CONCEPT OF DEEP LEARNING.pptx
BASIC CONCEPT OF DEEP LEARNING.pptx
 
AR245-NE28-16 ARI 19 May 2005 1210An IntegrativeTheory .docx
AR245-NE28-16 ARI 19 May 2005 1210An IntegrativeTheory .docxAR245-NE28-16 ARI 19 May 2005 1210An IntegrativeTheory .docx
AR245-NE28-16 ARI 19 May 2005 1210An IntegrativeTheory .docx
 
Aspect oriented a candidate for neural networks and evolvable software
Aspect oriented a candidate for neural networks and evolvable softwareAspect oriented a candidate for neural networks and evolvable software
Aspect oriented a candidate for neural networks and evolvable software
 
bbbPaper
bbbPaperbbbPaper
bbbPaper
 
CCS355 Neural Networks & Deep Learning Unit 1 PDF notes with Question bank .pdf
CCS355 Neural Networks & Deep Learning Unit 1 PDF notes with Question bank .pdfCCS355 Neural Networks & Deep Learning Unit 1 PDF notes with Question bank .pdf
CCS355 Neural Networks & Deep Learning Unit 1 PDF notes with Question bank .pdf
 
Artificial Neural Network report
Artificial Neural Network reportArtificial Neural Network report
Artificial Neural Network report
 

Recently uploaded

Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...EduSkills OECD
 
MENTAL STATUS EXAMINATION format.docx
MENTAL     STATUS EXAMINATION format.docxMENTAL     STATUS EXAMINATION format.docx
MENTAL STATUS EXAMINATION format.docxPoojaSen20
 
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...M56BOOKSTORE PRODUCT/SERVICE
 
Concept of Vouching. B.Com(Hons) /B.Compdf
Concept of Vouching. B.Com(Hons) /B.CompdfConcept of Vouching. B.Com(Hons) /B.Compdf
Concept of Vouching. B.Com(Hons) /B.CompdfUmakantAnnand
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Krashi Coaching
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxiammrhaywood
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxpboyjonauth
 
Hybridoma Technology ( Production , Purification , and Application )
Hybridoma Technology  ( Production , Purification , and Application  ) Hybridoma Technology  ( Production , Purification , and Application  )
Hybridoma Technology ( Production , Purification , and Application ) Sakshi Ghasle
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxNirmalaLoungPoorunde1
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactdawncurless
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformChameera Dedduwage
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdfSoniaTolstoy
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introductionMaksud Ahmed
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTiammrhaywood
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxSayali Powar
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxGaneshChakor2
 

Recently uploaded (20)

Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
MENTAL STATUS EXAMINATION format.docx
MENTAL     STATUS EXAMINATION format.docxMENTAL     STATUS EXAMINATION format.docx
MENTAL STATUS EXAMINATION format.docx
 
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
 
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...
KSHARA STURA .pptx---KSHARA KARMA THERAPY (CAUSTIC THERAPY)————IMP.OF KSHARA ...
 
Staff of Color (SOC) Retention Efforts DDSD
Staff of Color (SOC) Retention Efforts DDSDStaff of Color (SOC) Retention Efforts DDSD
Staff of Color (SOC) Retention Efforts DDSD
 
Concept of Vouching. B.Com(Hons) /B.Compdf
Concept of Vouching. B.Com(Hons) /B.CompdfConcept of Vouching. B.Com(Hons) /B.Compdf
Concept of Vouching. B.Com(Hons) /B.Compdf
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptx
 
Hybridoma Technology ( Production , Purification , and Application )
Hybridoma Technology  ( Production , Purification , and Application  ) Hybridoma Technology  ( Production , Purification , and Application  )
Hybridoma Technology ( Production , Purification , and Application )
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptx
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy Reform
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
 
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdfTataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptx
 

Neuron level interpretation of deep nlp model

  • 1. Neuron level interpretation of Deep NLP model By Shreya Goyal (015251990)
  • 2. Introduction The proliferation of deep neural networks in various domains has seen an increased need for the interpretability of these methods. A plethora of research has been carried out to analyze and understand components of the deep neural network models. Preliminary work done along these lines and papers that surveyed such were focused on a more high-level representation analysis.
  • 3. What are neurons ? State-of-the-art neural networks, such as RNNs or Transformer models consist of various components such as blocks, layers, attention heads, gates/cells, etc. We use the term neuron to refer to the output of a single dimension from any of the aforementioned neural network components. For example, in the BERT base model, the output of a layer block has 768 neurons and the output of an attention head has 64 neurons. In the literature, neurons have also been referred to as features, experts, and units.
  • 4. The Objective of Neuron Analysis Given a neural network model, neuron analysis aims to identify neurons that: i) learn a specific concept, or ii) are important to the model. An important facet is that some methods require a concept C as an additional input, in which case the importance is computed against the concept, while other methods just require the model M, where importance is measured with respect to the overall model and not a specific concept. For example, considering gender as a concept, the goal of neuron analysis would be to extract neurons from a model that encodes gender information.
  • 5. Visualization A simple way to discover the role of neurons is through visualizing their activations and manually identifying patterns over a set of sentences. However given the large number of neurons in a neural network model, it is cumbersome to visualize all of them.
  • 6. Corpus Rank Given a neuron, the Corpus Rank method sorts sentences based on the activation values. The top sentences are then visualized to identify a concept the neuron is representing. For example, In a survey, it is mentioned that words that maximally activate a neuron by analyzing its top-k context from the corpus. The Corpus Rank method makes the process less cumbersome by reducing the search space to only the top sentences, as opposed to the Visualization method where the entire corpus or a random subset is analyzed by a human.
  • 7. Corpus Generation Corpus rank methods are efficacious in identifying neurons focusing on lexical concepts. However, their space of analysis is limited to the underlying corpus. It is possible that a neuron represents a diverse concept not featured in the corpus. The Corpus Generation method addresses this problem by generating novel sentences that maximize a neuron’s activations.
  • 8. Mask based corpus selection The corpus rank and corpus generation methods require humans in the loop to analyze the concepts that are represented by the neurons. Now we discuss a set of methods that automatically select a concept given a neuron. In a paper few researchers have proposed a Masked-based Corpus Selection method to determine important neurons with respect to a concept.
  • 9. Neuron Probing The visualization and corpus-based methods involve analyzing neurons from a data perspective, relying on sentences from corpora or automatically generating them. The Neuron Probing method trains a post-hoc classifier to identify neurons with respect to a pre- defined concept. Specifically, given supervised data for a concept, say Noun, it extracts activations of all the noun words in context.
  • 10. Evaluation: Ablation While ablation has been used to discover salient neurons for the model, it has also been used to evaluate the efficacy of the rankings. The idea is to ablate neurons in the model in the order of their importance. For neuron ranking to be correct, removing the top neurons should result in larger drop in performance compared to removing same amount of random or bottom neurons from that list. Classification Performance Given salient neurons with respect to a concept, a simple method to evaluate their correctness is to train a classifier using them as features and predict the concept of interest.
  • 11. Applications Controlling model behavior By manipulating the values of the switch neuron at test time, researchers were able to successfully change output translations from present to past tense. The authors additionally found neurons that capture gender and number agreement concepts and manipulated them to control the system’s output. Model Distillation and Efficiency Someone devised an efficient feature-based transfer learning procedure, stemmed from their redundancy analysis. By exploiting layer and neuron-specific redundancy in the transformer models, they were able to reduce the feature set size to less than 100 neurons for several tasks while maintaining more than 97% of the performance.