Schema on read is obsolete. Welcome metaprogramming..pdf
Rijuta Wagh Resume
1. RIJUTA WAGH (617) 372-1509
wagh.ri@husky.neu.edu| LinkedIn | Tableau Public |Github 1163 Boylston St, Apt 31, Boston, MA-02215
EDUCATION
MS Information Systems: Northeastern University, Boston, USA Aug 2018
Coursework: Data Warehousing and Business Intelligence, Data Science, Data Analysis using Python, Big Data Technologies
BE Computer Engineering: University of Mumbai, Mumbai, India May 2016
SKILLS
• Programming: R 3.3.2, Python 3.5, Java, SQL, PLSQL
• Tools & IDEs: Jupyter Notebook, RStudio 1.0.136, Eclipse, Toad data Modeler 6.0, Git 2.11.0, Rational Rose,
SQL Server Management Studio, Microsoft Azure Machine Learning Studio, UMLet, Jira
• Data Technologies: MySQL, Oracle 11g, Postgre, Apache Hadoop, MapReduce, Hive, Apache Spark, Databricks
• BI and ETL tools: Tableau, PowerBI, Talend, Qlik Sense
WORK EXPERIENCE
Business Intelligence Analyst Co-op: Fidelity Investments Jan-June 2018
Technologies used: Qlik Sense, Tableau, Oracle
• Wrangled data from multiple sources in the PI Sensors data model relating to Cash Assets and Flows and created robust
datasets out of approximately 9 billion records using efficient SQL queries in Oracle
• Collaborated with the financial stakeholders and product owner across the organization to identify key performance
indicators, developed and presented dashboards for them that provided actionable insights and helped business partners
to address ad-hoc requests and drive strategic decisions
• Designed and developed intuitive visual representations impacting the business performance in Tableau and QlikSense
for various monthly and yearly financial reports, thereby reducing report processing time by 50%
• Performed data lineage analysis and documentation for various QlikSense reports to track metadata and source data
ACADEMIC PROJECTS
Contosa Retail Data Warehouse & Business Intelligence May-Aug 2017
Technologies used: Talend, Qlik Sense, Power BI, Tableau
• Configured ETL activities for a Retail data warehouse in Talend, also improved the performance of these jobs by 80%
• Proposed a solution to implement slow changing dimensions to track the changing product pricing and costing in a
retail warehouse that made more data available for analysis
• Created interactive and elegant dashboards using Tableau, PowerBI, QlikSense to answer the business queries
Data Science, Lending Club Loan Dataset Jan-Apr 2017
Technologies used: Python (numpy, pandas, scikit-learn, matplotlib/plotly), R, PowerBI
• Preprocessed data and performed exploratory data analysis using PowerBI and Python
• Built classification and prediction models using Linear Regression, Logistic Regression, Decision trees, Random Forest
and Neural Network models that generates a flag to determine whether a person should be granted a loan or not
• Segmented data into clusters using K-means clustering based on various features and built prediction models to predict
interest rate for each cluster
Sentiment Analysis of Amazon Fine Food Reviews Mar-Apr 2017
Technologies used: Python, R, PowerBI, Microsoft Azure Machine Learning Studio, Amazon Web Service, NLTK
• Automated web scraping to download data for amazon fine food reviews and preprocessed it using tokenization and
stemming
• Classified the reviews as positive, negative and neutral using classification algorithms such as Naïve Bayes, Logistic
Regression and Decision trees
• Deployed and implemented Logistic Regression model on Microsoft azure machine learning studio
Analysis of Yelp dataset | Technologies used: Apache Hadoop, Hive Aug-Dec 2017
• Analyzed Yelp data on various business attributes such as top 5 businesses by category and state, aggregated check-ins
over time, time series analysis of user’s ratings, sentiment analysis using various Hadoop MapReduce patterns and Hive
IMDB Database Modeling | Technologies used: SQL, Toad Data Modeler, Hive Sep-Dec 2016
• Designed and modeled a replica of IMDB database using Toad Data Modeler 6.1 and queried the data to answer
analytical questions in SQL Server and Oracle
• Generated DDL scripts using reverse engineering in Toad Data Modeler and scripted stored procedures, functions and
triggers using PLSQL to service various analytical queries an IMDB application user may request
ACHIVEMENTS (Business Intelligence Tool used: Tableau)
• Winner of INFORMS Data Visualization Hackathon held at Northeastern University Apr 2018