1. AMAN RAJ
700 Health Sciences Drive, Chapin C 1044C, Stony Brook, New York 11790
(631) · 398 · 6540 amraj@cs.stonybrook.edu www.linkedin.com/in/amankaraj www.github.com/amankaraj
EDUCATION
Stony Brook University • New York December 2016
Master in Science, Computer Science - Data Science and Engineering Specialization GPA 3.84/4.0
Relevant Courses: Analysis of Algorithm; Artificial Intelligence; Probability and Statistics for Data Scientists;
Big Data Analytics, Data Visualization; Computational Biology; Fundamental of Computer Networks
Indian Institute of Information Technology • Allahabad, India June 2012
Bachelor of Technology in Information Technology GPA 8.25/10
Relevant Courses:Operating System; Databases; Natural Language Processing; Algorithms and Data Structures
SKILLS
Programming Languages Java, Python, JavaScript, SQL, Flex
Databases Oracle, MySQL, MongoDB
Tools Jira, Eclipse, Adobe Flex, SVN, Git, Tomcat,
Technologies Design Patterns, Spring, JPA, JUnit, WebLogic, REST APis
Libraries NLP Tools, NLTK, Pandas, Numpy, Sklearn, Mokito, D3.js
WORK EXPERIENCE
VMware June 2016 – August 2016
Member of Technical Staff - Intern(Distributed Firewall - NSX) Palo Alto, CA, US
• Firewall Rule Visibility - Automated identification of unreachable and shadowed firewall rules in Vmware NSX.
• What-If Impact analysis of Addition, Update or Deletion of a rule on the existing rule base before actual action.
• Identification and Merge suggestions of similar Firewall rules targeted at easy Maintenance and Efficiency.
Citigroup July 2012 – July 2015
Senior Software Developer Pune, India
• Responsible for User Interface & server-side design and development of core modules of a live financial appli-
cation Permitted Product List. This provides the standard products for trading at various desks.
• Researched-employed AGILE methodology to effectively implement procedures at stages of application lifecycle.
• Created an extensive testing environment using JUnit test cases and Mokito for performance and stress testing.
• Efficient Design and handling of the database for projects in Oracle using SQL Server.
• Received individual performance award for single handedly working on all aspects of Permitted Product List.
Carnegie Mellon University & IIIT-H, India Dec. 2010
• Created a Machine Learning model to classify sentences into descriptive, methodological or explanatory
• Worked on improvement of tutor agents using this Sentence Classification model.
PROJECTS • GITHUB HANDLE
• Syntactic Analysis of Social Media Data: In order to get deeper syntactic information from social media
text(Facebook messages), we created multiple customized NLP pipeline by integrating various kind of available
tools and analyzed the performance. Tools included various Tokenizer, Taggers, Normalizers and Parsers.
• Movie Recommendation: Created a Collaborative Filtering based Movie Recommendation System using
the Movielens 1M dataset. Performed Multiple Linear Regression and Clustering to get more insights.
• Decision Tree Classifier: Predict credit worthiness of Credit Card applicants.Overfitting and noise handled.
• Automated implementation of Sudoku Solver, Connect Four, Set Game and Peg Solitaire:Implemented
Iterative Deepening, A* Search, MinMax Tree, Alpha-Beta Pruning, Backtracking using Constraint Satisfaction
Methods like MRV, Forward Checking, Constraint Propogation for Artificial Intelligence course in Python
• Blog data analysis: Performed linguistic topics correlation with age and occupation, gender, using various
statistical tests. Correlate topics using Multiple Linear and Logistic Regression on Blog Authorship Corpus.
• Data Visualization: Developed various projects on Data Analysis & Visualization including Interactive
Histograms; Pie Charts; Force directed graphs and Collapsable Tree using the JavaScript library d3.js.
• Cellular Localization: Predicting location of a device using only Cellular Signal Strengths based on labelled
data and prior history of a user using Machine Learning classifiers and Hidden Markov Model.
• Page Load Time Analysis & Prediction: Built a Machine Learning Regression model to Predict the Page
Load Time(PLT) of websites. Analyzed the major factor affecting the PLT and how can we reduce the same.