SlideShare a Scribd company logo
Application of Machine Learning
to Predict Outcome of
US Court of Appeals
Presented By:
Krishna Mohan Nitin Hosurkar Pradeepta Mishra
Thomson Reuters Arrow Electronics Ma Foi Analytics
Indian Institute of Science (IISc), Bangalore India
12/21/2016Indian Institute of Science, Bangalore
Background
Andrew Martin Theodore RugerPauline Kim
12/21/2016Indian Institute of Science, Bangalore
Process
12/21/2016Indian Institute of Science, Bangalore
US Court System
Source: http://www.slideshare.net/bmtoth/organization-of-us-court-system
U.S. Court of
Appeals
12/21/2016Indian Institute of Science, Bangalore
Problem Statement
Predict the outcome or treatment of a case by the
US Court of Appeals based on historical data
• Basic case characteristics
• Participants
• Nature of the case
• Judges
12/21/2016Indian Institute of Science, Bangalore
Data Source and Characteristics
Website: http://artsandsciences.sc.edu/poli/juri/appct.htm
Codebook: http://artsandsciences.sc.edu/poli/juri/KH_update_codebook.pdf
Data: http://www.cas.sc.edu/poli/juri/KH_update_stata.zip
• 2,160 Rows
• 244 Columns
• 12 possible outcomes
• Integers + Dates + Categories + Binaries + Complex variables
• Majority Categorical variables
• Multi-level Factor variables
• Coded Factor values
• Missing values
12/21/2016Indian Institute of Science, Bangalore
Project
Plan
12/21/2016Indian Institute of Science, Bangalore
Data Prep
MissingValues
• Remove variables with more than 15% missing values
244
204
Decompose Complex Fields
• Consists of 5-digit variables
• Each digit represents a categorical value
• As many as 12 sub-categories with different combinations
Change Column names for better understanding
Change levels from Numbers to Description
12/21/2016Indian Institute of Science, Bangalore
Schematic Data Organization
12/21/2016Indian Institute of Science, Bangalore
Dimensionality Reduction
Chi-Square HypothesisTesting
H0 =The predictor and the target variable (CaseTreatment) are independent of each other
HA =The predictor and the target variable (CaseTreatment) are dependent / related
12/21/2016Indian Institute of Science, Bangalore
Dimensionality Reduction
Feature Selection using Boruta Package in R
12/21/2016Indian Institute of Science, Bangalore
Model Selection Criteria
• Multi-variate output
• Categorical predictor and target variables
• Multi-level factors
• Size of dataset
Models selected
• Random Forest
• Neural Network
• XG Boost
• Ensemble
12/21/2016Indian Institute of Science, Bangalore
Neural
Network
Neural Network Model
Tuning
FullTraining Data
Accuracy: 0.62
TrimmedTraining Data
Accuracy: 0.64
Not much improvement
Training Data with
Oversampling
Accuracy: 0.65
Test Data
Accuracy: 0.85
Big Improvement
BalancedTrimmedTraining
Data
Accuracy: 0.82
Good Performance
12/21/2016Indian Institute of Science, Bangalore
XGBoostXG Boost ModelTuning
FullTraining Data
Accuracy: 0.69
TrimmedTraining Data
Accuracy: 0.7
Not much improvement
TrimmedTraining Data
with Oversampling
Accuracy: 0.94
Big improvement
Test Data
Accuracy: 0.91
Good Performance
12/21/2016Indian Institute of Science, Bangalore
Observations
Neural Network
XG Boost Misclassification
Petition Denied as Affirmed
Reversed as Affirmed
Vacated as Affirmed
Affirmed as Reversed
Isolate factors contributing to
misclassification
12/21/2016Indian Institute of Science, Bangalore
Observations & Conclusions
Adequate representation of all outcomes in
Multinomial Classification Models
Most SignificantVariables
• Circuit Court – Appeals court that is currently hearing the case
• Verdict of the previous court as ‘Not Ascertained’
• Nature of the Appellant – Natural citizen has greater significance
• Directionality of 3rd Judge - Assertion for broadest interpretation of First
Amendment protection – Freedom of Speech, Religion and Peaceful Protest
• Initiator –Who initiated the Appeal – original plaintiff or others
Important!!
12/21/2016Indian Institute of Science, Bangalore
THANK YOU
12/21/2016Indian Institute of Science, Bangalore

More Related Content

What's hot

Query-Based Retrieval of Annotated Document
Query-Based Retrieval of Annotated DocumentQuery-Based Retrieval of Annotated Document
Query-Based Retrieval of Annotated Document
IRJET Journal
 
Application of data mining tools for
Application of data mining tools forApplication of data mining tools for
Application of data mining tools for
IJDKP
 
IJET-V2I6P32
IJET-V2I6P32IJET-V2I6P32
Enhancement techniques for data warehouse staging area
Enhancement techniques for data warehouse staging areaEnhancement techniques for data warehouse staging area
Enhancement techniques for data warehouse staging area
IJDKP
 
Credit Scoring Using CART Algorithm and Binary Particle Swarm Optimization
Credit Scoring Using CART Algorithm and Binary Particle Swarm Optimization Credit Scoring Using CART Algorithm and Binary Particle Swarm Optimization
Credit Scoring Using CART Algorithm and Binary Particle Swarm Optimization
IJECEIAES
 
Ez36937941
Ez36937941Ez36937941
Ez36937941
IJERA Editor
 
Analysis on Student Admission Enquiry System
Analysis on Student Admission Enquiry SystemAnalysis on Student Admission Enquiry System
Analysis on Student Admission Enquiry System
IJSRD
 
A statistical data fusion technique in virtual data integration environment
A statistical data fusion technique in virtual data integration environmentA statistical data fusion technique in virtual data integration environment
A statistical data fusion technique in virtual data integration environment
IJDKP
 
Jane Howard
Jane HowardJane Howard
Jane Howard
Jane Howard
 
Quality Assurance in Knowledge Data Warehouse
Quality Assurance in Knowledge Data WarehouseQuality Assurance in Knowledge Data Warehouse
Quality Assurance in Knowledge Data Warehouse
Universitas Pembangunan Panca Budi
 
Introduction to feature subset selection method
Introduction to feature subset selection methodIntroduction to feature subset selection method
Introduction to feature subset selection method
IJSRD
 
Ijcatr04061009
Ijcatr04061009Ijcatr04061009
Ijcatr04061009
Editor IJCATR
 
Effective data mining for proper
Effective data mining for properEffective data mining for proper
Effective data mining for proper
IJDKP
 
IRJET- A Review of Data Cleaning and its Current Approaches
IRJET- A Review of Data Cleaning and its Current ApproachesIRJET- A Review of Data Cleaning and its Current Approaches
IRJET- A Review of Data Cleaning and its Current Approaches
IRJET Journal
 
INTEGRATED ASSOCIATIVE CLASSIFICATION AND NEURAL NETWORK MODEL ENHANCED BY US...
INTEGRATED ASSOCIATIVE CLASSIFICATION AND NEURAL NETWORK MODEL ENHANCED BY US...INTEGRATED ASSOCIATIVE CLASSIFICATION AND NEURAL NETWORK MODEL ENHANCED BY US...
INTEGRATED ASSOCIATIVE CLASSIFICATION AND NEURAL NETWORK MODEL ENHANCED BY US...
IJDKP
 
Pharma data analytics
Pharma data analyticsPharma data analytics
Pharma data analytics
Axon Lawyers
 
[IJET-V1I3P10] Authors : Kalaignanam.K, Aishwarya.M, Vasantharaj.K, Kumaresan...
[IJET-V1I3P10] Authors : Kalaignanam.K, Aishwarya.M, Vasantharaj.K, Kumaresan...[IJET-V1I3P10] Authors : Kalaignanam.K, Aishwarya.M, Vasantharaj.K, Kumaresan...
[IJET-V1I3P10] Authors : Kalaignanam.K, Aishwarya.M, Vasantharaj.K, Kumaresan...
IJET - International Journal of Engineering and Techniques
 
Digital webinar master deck final
Digital webinar master deck finalDigital webinar master deck final
Digital webinar master deck final
Pistoia Alliance
 
PREDICTION OF MALIGNANCY IN SUSPECTED THYROID TUMOUR PATIENTS BY THREE DIFFER...
PREDICTION OF MALIGNANCY IN SUSPECTED THYROID TUMOUR PATIENTS BY THREE DIFFER...PREDICTION OF MALIGNANCY IN SUSPECTED THYROID TUMOUR PATIENTS BY THREE DIFFER...
PREDICTION OF MALIGNANCY IN SUSPECTED THYROID TUMOUR PATIENTS BY THREE DIFFER...
cscpconf
 

What's hot (19)

Query-Based Retrieval of Annotated Document
Query-Based Retrieval of Annotated DocumentQuery-Based Retrieval of Annotated Document
Query-Based Retrieval of Annotated Document
 
Application of data mining tools for
Application of data mining tools forApplication of data mining tools for
Application of data mining tools for
 
IJET-V2I6P32
IJET-V2I6P32IJET-V2I6P32
IJET-V2I6P32
 
Enhancement techniques for data warehouse staging area
Enhancement techniques for data warehouse staging areaEnhancement techniques for data warehouse staging area
Enhancement techniques for data warehouse staging area
 
Credit Scoring Using CART Algorithm and Binary Particle Swarm Optimization
Credit Scoring Using CART Algorithm and Binary Particle Swarm Optimization Credit Scoring Using CART Algorithm and Binary Particle Swarm Optimization
Credit Scoring Using CART Algorithm and Binary Particle Swarm Optimization
 
Ez36937941
Ez36937941Ez36937941
Ez36937941
 
Analysis on Student Admission Enquiry System
Analysis on Student Admission Enquiry SystemAnalysis on Student Admission Enquiry System
Analysis on Student Admission Enquiry System
 
A statistical data fusion technique in virtual data integration environment
A statistical data fusion technique in virtual data integration environmentA statistical data fusion technique in virtual data integration environment
A statistical data fusion technique in virtual data integration environment
 
Jane Howard
Jane HowardJane Howard
Jane Howard
 
Quality Assurance in Knowledge Data Warehouse
Quality Assurance in Knowledge Data WarehouseQuality Assurance in Knowledge Data Warehouse
Quality Assurance in Knowledge Data Warehouse
 
Introduction to feature subset selection method
Introduction to feature subset selection methodIntroduction to feature subset selection method
Introduction to feature subset selection method
 
Ijcatr04061009
Ijcatr04061009Ijcatr04061009
Ijcatr04061009
 
Effective data mining for proper
Effective data mining for properEffective data mining for proper
Effective data mining for proper
 
IRJET- A Review of Data Cleaning and its Current Approaches
IRJET- A Review of Data Cleaning and its Current ApproachesIRJET- A Review of Data Cleaning and its Current Approaches
IRJET- A Review of Data Cleaning and its Current Approaches
 
INTEGRATED ASSOCIATIVE CLASSIFICATION AND NEURAL NETWORK MODEL ENHANCED BY US...
INTEGRATED ASSOCIATIVE CLASSIFICATION AND NEURAL NETWORK MODEL ENHANCED BY US...INTEGRATED ASSOCIATIVE CLASSIFICATION AND NEURAL NETWORK MODEL ENHANCED BY US...
INTEGRATED ASSOCIATIVE CLASSIFICATION AND NEURAL NETWORK MODEL ENHANCED BY US...
 
Pharma data analytics
Pharma data analyticsPharma data analytics
Pharma data analytics
 
[IJET-V1I3P10] Authors : Kalaignanam.K, Aishwarya.M, Vasantharaj.K, Kumaresan...
[IJET-V1I3P10] Authors : Kalaignanam.K, Aishwarya.M, Vasantharaj.K, Kumaresan...[IJET-V1I3P10] Authors : Kalaignanam.K, Aishwarya.M, Vasantharaj.K, Kumaresan...
[IJET-V1I3P10] Authors : Kalaignanam.K, Aishwarya.M, Vasantharaj.K, Kumaresan...
 
Digital webinar master deck final
Digital webinar master deck finalDigital webinar master deck final
Digital webinar master deck final
 
PREDICTION OF MALIGNANCY IN SUSPECTED THYROID TUMOUR PATIENTS BY THREE DIFFER...
PREDICTION OF MALIGNANCY IN SUSPECTED THYROID TUMOUR PATIENTS BY THREE DIFFER...PREDICTION OF MALIGNANCY IN SUSPECTED THYROID TUMOUR PATIENTS BY THREE DIFFER...
PREDICTION OF MALIGNANCY IN SUSPECTED THYROID TUMOUR PATIENTS BY THREE DIFFER...
 

Similar to ICBAI Presentation (2)

Towards Automatic Composition of Multicomponent Predictive Systems
Towards Automatic Composition of Multicomponent Predictive SystemsTowards Automatic Composition of Multicomponent Predictive Systems
Towards Automatic Composition of Multicomponent Predictive Systems
Manuel Martín
 
Paper ID 216@ ICMLBDA 2023.pptx
Paper ID 216@ ICMLBDA 2023.pptxPaper ID 216@ ICMLBDA 2023.pptx
Paper ID 216@ ICMLBDA 2023.pptx
KrishnaReddy717023
 
Classifier Model using Artificial Neural Network
Classifier Model using Artificial Neural NetworkClassifier Model using Artificial Neural Network
Classifier Model using Artificial Neural Network
AI Publications
 
Leverage Big Data Analytics to Enhance Clinical Trials from Planning to Execu...
Leverage Big Data Analytics to Enhance Clinical Trials from Planning to Execu...Leverage Big Data Analytics to Enhance Clinical Trials from Planning to Execu...
Leverage Big Data Analytics to Enhance Clinical Trials from Planning to Execu...
Saama
 
Using ID3 Decision Tree Algorithm to the Student Grade Analysis and Prediction
Using ID3 Decision Tree Algorithm to the Student Grade Analysis and PredictionUsing ID3 Decision Tree Algorithm to the Student Grade Analysis and Prediction
Using ID3 Decision Tree Algorithm to the Student Grade Analysis and Prediction
ijtsrd
 
CV_M. Nur-A-Alam
CV_M. Nur-A-AlamCV_M. Nur-A-Alam
CV_M. Nur-A-Alam
Engr. Md. Nur-A-Alam
 
Survey on Techniques for Predictive Analysis of Student Grades and Career
Survey on Techniques for Predictive Analysis of Student Grades and CareerSurvey on Techniques for Predictive Analysis of Student Grades and Career
Survey on Techniques for Predictive Analysis of Student Grades and Career
IRJET Journal
 
Analyzing the Fundamental Aspects and Developing a Forecasting Model to Enhan...
Analyzing the Fundamental Aspects and Developing a Forecasting Model to Enhan...Analyzing the Fundamental Aspects and Developing a Forecasting Model to Enhan...
Analyzing the Fundamental Aspects and Developing a Forecasting Model to Enhan...
hasnat1983
 
Weather Data Analysis And Prediction In Bangladesh Using Machine Learning
Weather Data Analysis And Prediction In Bangladesh Using Machine LearningWeather Data Analysis And Prediction In Bangladesh Using Machine Learning
Weather Data Analysis And Prediction In Bangladesh Using Machine Learning
Imran Risal
 
Assessments of a Cloud-Based Data Wallet for Personal Identity Management
Assessments of a Cloud-Based Data Wallet for Personal Identity ManagementAssessments of a Cloud-Based Data Wallet for Personal Identity Management
Assessments of a Cloud-Based Data Wallet for Personal Identity Management
FarzaneH Karegar
 
Correlation of artificial neural network classification and nfrs attribute fi...
Correlation of artificial neural network classification and nfrs attribute fi...Correlation of artificial neural network classification and nfrs attribute fi...
Correlation of artificial neural network classification and nfrs attribute fi...
eSAT Journals
 
Pharma statistic 2018
Pharma statistic 2018Pharma statistic 2018
Pharma statistic 2018
Majdi Ayoub
 
Prognostication of the placement of students applying machine learning algori...
Prognostication of the placement of students applying machine learning algori...Prognostication of the placement of students applying machine learning algori...
Prognostication of the placement of students applying machine learning algori...
BIJIAM Journal
 
Educational data mining using jmp
Educational data mining using jmpEducational data mining using jmp
Educational data mining using jmp
ijcsit
 
Perspectives on chemical composition and crystal structure representations fr...
Perspectives on chemical composition and crystal structure representations fr...Perspectives on chemical composition and crystal structure representations fr...
Perspectives on chemical composition and crystal structure representations fr...
Anubhav Jain
 
Machine learning in Data Science
Machine learning in Data ScienceMachine learning in Data Science
Machine learning in Data Science
Dr. Vaibhav Kumar
 
Performance evaluation of random forest with feature selection methods in pre...
Performance evaluation of random forest with feature selection methods in pre...Performance evaluation of random forest with feature selection methods in pre...
Performance evaluation of random forest with feature selection methods in pre...
IJECEIAES
 
IRJET- Survey of Estimation of Crop Yield using Agriculture Data
IRJET- Survey of Estimation of Crop Yield using Agriculture DataIRJET- Survey of Estimation of Crop Yield using Agriculture Data
IRJET- Survey of Estimation of Crop Yield using Agriculture Data
IRJET Journal
 
BATCH 1 FIRST REVIEW-1.pptx
BATCH 1 FIRST REVIEW-1.pptxBATCH 1 FIRST REVIEW-1.pptx
BATCH 1 FIRST REVIEW-1.pptx
SurajRavi16
 
The International Journal of Engineering and Science (The IJES)
The International Journal of Engineering and Science (The IJES)The International Journal of Engineering and Science (The IJES)
The International Journal of Engineering and Science (The IJES)
theijes
 

Similar to ICBAI Presentation (2) (20)

Towards Automatic Composition of Multicomponent Predictive Systems
Towards Automatic Composition of Multicomponent Predictive SystemsTowards Automatic Composition of Multicomponent Predictive Systems
Towards Automatic Composition of Multicomponent Predictive Systems
 
Paper ID 216@ ICMLBDA 2023.pptx
Paper ID 216@ ICMLBDA 2023.pptxPaper ID 216@ ICMLBDA 2023.pptx
Paper ID 216@ ICMLBDA 2023.pptx
 
Classifier Model using Artificial Neural Network
Classifier Model using Artificial Neural NetworkClassifier Model using Artificial Neural Network
Classifier Model using Artificial Neural Network
 
Leverage Big Data Analytics to Enhance Clinical Trials from Planning to Execu...
Leverage Big Data Analytics to Enhance Clinical Trials from Planning to Execu...Leverage Big Data Analytics to Enhance Clinical Trials from Planning to Execu...
Leverage Big Data Analytics to Enhance Clinical Trials from Planning to Execu...
 
Using ID3 Decision Tree Algorithm to the Student Grade Analysis and Prediction
Using ID3 Decision Tree Algorithm to the Student Grade Analysis and PredictionUsing ID3 Decision Tree Algorithm to the Student Grade Analysis and Prediction
Using ID3 Decision Tree Algorithm to the Student Grade Analysis and Prediction
 
CV_M. Nur-A-Alam
CV_M. Nur-A-AlamCV_M. Nur-A-Alam
CV_M. Nur-A-Alam
 
Survey on Techniques for Predictive Analysis of Student Grades and Career
Survey on Techniques for Predictive Analysis of Student Grades and CareerSurvey on Techniques for Predictive Analysis of Student Grades and Career
Survey on Techniques for Predictive Analysis of Student Grades and Career
 
Analyzing the Fundamental Aspects and Developing a Forecasting Model to Enhan...
Analyzing the Fundamental Aspects and Developing a Forecasting Model to Enhan...Analyzing the Fundamental Aspects and Developing a Forecasting Model to Enhan...
Analyzing the Fundamental Aspects and Developing a Forecasting Model to Enhan...
 
Weather Data Analysis And Prediction In Bangladesh Using Machine Learning
Weather Data Analysis And Prediction In Bangladesh Using Machine LearningWeather Data Analysis And Prediction In Bangladesh Using Machine Learning
Weather Data Analysis And Prediction In Bangladesh Using Machine Learning
 
Assessments of a Cloud-Based Data Wallet for Personal Identity Management
Assessments of a Cloud-Based Data Wallet for Personal Identity ManagementAssessments of a Cloud-Based Data Wallet for Personal Identity Management
Assessments of a Cloud-Based Data Wallet for Personal Identity Management
 
Correlation of artificial neural network classification and nfrs attribute fi...
Correlation of artificial neural network classification and nfrs attribute fi...Correlation of artificial neural network classification and nfrs attribute fi...
Correlation of artificial neural network classification and nfrs attribute fi...
 
Pharma statistic 2018
Pharma statistic 2018Pharma statistic 2018
Pharma statistic 2018
 
Prognostication of the placement of students applying machine learning algori...
Prognostication of the placement of students applying machine learning algori...Prognostication of the placement of students applying machine learning algori...
Prognostication of the placement of students applying machine learning algori...
 
Educational data mining using jmp
Educational data mining using jmpEducational data mining using jmp
Educational data mining using jmp
 
Perspectives on chemical composition and crystal structure representations fr...
Perspectives on chemical composition and crystal structure representations fr...Perspectives on chemical composition and crystal structure representations fr...
Perspectives on chemical composition and crystal structure representations fr...
 
Machine learning in Data Science
Machine learning in Data ScienceMachine learning in Data Science
Machine learning in Data Science
 
Performance evaluation of random forest with feature selection methods in pre...
Performance evaluation of random forest with feature selection methods in pre...Performance evaluation of random forest with feature selection methods in pre...
Performance evaluation of random forest with feature selection methods in pre...
 
IRJET- Survey of Estimation of Crop Yield using Agriculture Data
IRJET- Survey of Estimation of Crop Yield using Agriculture DataIRJET- Survey of Estimation of Crop Yield using Agriculture Data
IRJET- Survey of Estimation of Crop Yield using Agriculture Data
 
BATCH 1 FIRST REVIEW-1.pptx
BATCH 1 FIRST REVIEW-1.pptxBATCH 1 FIRST REVIEW-1.pptx
BATCH 1 FIRST REVIEW-1.pptx
 
The International Journal of Engineering and Science (The IJES)
The International Journal of Engineering and Science (The IJES)The International Journal of Engineering and Science (The IJES)
The International Journal of Engineering and Science (The IJES)
 

ICBAI Presentation (2)

  • 1. Application of Machine Learning to Predict Outcome of US Court of Appeals Presented By: Krishna Mohan Nitin Hosurkar Pradeepta Mishra Thomson Reuters Arrow Electronics Ma Foi Analytics Indian Institute of Science (IISc), Bangalore India
  • 2. 12/21/2016Indian Institute of Science, Bangalore
  • 3. Background Andrew Martin Theodore RugerPauline Kim 12/21/2016Indian Institute of Science, Bangalore
  • 5. US Court System Source: http://www.slideshare.net/bmtoth/organization-of-us-court-system U.S. Court of Appeals 12/21/2016Indian Institute of Science, Bangalore
  • 6. Problem Statement Predict the outcome or treatment of a case by the US Court of Appeals based on historical data • Basic case characteristics • Participants • Nature of the case • Judges 12/21/2016Indian Institute of Science, Bangalore
  • 7. Data Source and Characteristics Website: http://artsandsciences.sc.edu/poli/juri/appct.htm Codebook: http://artsandsciences.sc.edu/poli/juri/KH_update_codebook.pdf Data: http://www.cas.sc.edu/poli/juri/KH_update_stata.zip • 2,160 Rows • 244 Columns • 12 possible outcomes • Integers + Dates + Categories + Binaries + Complex variables • Majority Categorical variables • Multi-level Factor variables • Coded Factor values • Missing values 12/21/2016Indian Institute of Science, Bangalore
  • 9. Data Prep MissingValues • Remove variables with more than 15% missing values 244 204 Decompose Complex Fields • Consists of 5-digit variables • Each digit represents a categorical value • As many as 12 sub-categories with different combinations Change Column names for better understanding Change levels from Numbers to Description 12/21/2016Indian Institute of Science, Bangalore
  • 10. Schematic Data Organization 12/21/2016Indian Institute of Science, Bangalore
  • 11. Dimensionality Reduction Chi-Square HypothesisTesting H0 =The predictor and the target variable (CaseTreatment) are independent of each other HA =The predictor and the target variable (CaseTreatment) are dependent / related 12/21/2016Indian Institute of Science, Bangalore
  • 12. Dimensionality Reduction Feature Selection using Boruta Package in R 12/21/2016Indian Institute of Science, Bangalore
  • 13. Model Selection Criteria • Multi-variate output • Categorical predictor and target variables • Multi-level factors • Size of dataset Models selected • Random Forest • Neural Network • XG Boost • Ensemble 12/21/2016Indian Institute of Science, Bangalore
  • 14. Neural Network Neural Network Model Tuning FullTraining Data Accuracy: 0.62 TrimmedTraining Data Accuracy: 0.64 Not much improvement Training Data with Oversampling Accuracy: 0.65 Test Data Accuracy: 0.85 Big Improvement BalancedTrimmedTraining Data Accuracy: 0.82 Good Performance 12/21/2016Indian Institute of Science, Bangalore
  • 15. XGBoostXG Boost ModelTuning FullTraining Data Accuracy: 0.69 TrimmedTraining Data Accuracy: 0.7 Not much improvement TrimmedTraining Data with Oversampling Accuracy: 0.94 Big improvement Test Data Accuracy: 0.91 Good Performance 12/21/2016Indian Institute of Science, Bangalore
  • 16. Observations Neural Network XG Boost Misclassification Petition Denied as Affirmed Reversed as Affirmed Vacated as Affirmed Affirmed as Reversed Isolate factors contributing to misclassification 12/21/2016Indian Institute of Science, Bangalore
  • 17. Observations & Conclusions Adequate representation of all outcomes in Multinomial Classification Models Most SignificantVariables • Circuit Court – Appeals court that is currently hearing the case • Verdict of the previous court as ‘Not Ascertained’ • Nature of the Appellant – Natural citizen has greater significance • Directionality of 3rd Judge - Assertion for broadest interpretation of First Amendment protection – Freedom of Speech, Religion and Peaceful Protest • Initiator –Who initiated the Appeal – original plaintiff or others Important!! 12/21/2016Indian Institute of Science, Bangalore
  • 18. THANK YOU 12/21/2016Indian Institute of Science, Bangalore