This document is a resume for Grace Han Yang that summarizes her education and experience in business analytics. She received a Master's degree in Business Analytics from University College London, and a Bachelor's degree in Business Intelligence and Information Management from Southwestern University of Finance and Economics in China. Her experience includes data analysis internships at the City Council of Croydon and EDF Energy, where she conducted analytics, visualization, and machine learning projects on large datasets.
Predictive analytics of students' academic performance can help decision makers take appropriate actions at the right moment and plan appropriate training in order to improve the student’s success rate.
This presentation introduces big data and explains how to generate actionable insights using analytics techniques. The deck explains general steps involved in a typical analytics project and provides a brief overview of the most commonly used predictive analytics methods and their business applications.
The science of statistics deals with the collection, analysis, interpretation, and presentation of data. We see and use data in our everyday lives. The measure of whether the results of research were due to chance. The more statistical significance assigned to an observation, the less likely the observation occurred by chance.
Predictive analytics of students' academic performance can help decision makers take appropriate actions at the right moment and plan appropriate training in order to improve the student’s success rate.
This presentation introduces big data and explains how to generate actionable insights using analytics techniques. The deck explains general steps involved in a typical analytics project and provides a brief overview of the most commonly used predictive analytics methods and their business applications.
The science of statistics deals with the collection, analysis, interpretation, and presentation of data. We see and use data in our everyday lives. The measure of whether the results of research were due to chance. The more statistical significance assigned to an observation, the less likely the observation occurred by chance.
Experienced Analyst with a demonstrated history of working with huge amounts of data. Skilled in R and python, SQL, Tableau, Microsoft Office, Leadership, Project Management. Strong research professional with a Masters in Statistics with specialization in Data Science from California State University - East Bay. Currently working part time as Data Analyst in the Office of Sustainability, Cal State East Bay.
Linking Heterogeneous Scholarly Data Sources in an Interoperable Setting: the...Platforma Otwartej Nauki
“Open Research Data: Implications for Science and Society”, Warsaw, Poland, May 28–29, 2015, conference organized by the Open Science Platform — an initiative of the Interdisciplinary Centre for Mathematical and Computational Modelling at the University of Warsaw. pon.edu.pl @OpenSciPlatform #ORD2015
It has been said that Mobiles +Cloud + Social + Big Data = Better Run The World. IBM has invested over $20 billion since 2005 to grow its analytics business, many companies will invest more than $120 billion by 2015 on analytics, hardware, software and services critical in almost every industry like ; Healthcare, media, sports, finance, government, etc.
It has been estimated that there is a shortage of 140,000 – 190,000 people with deep analytical skills to fill the demand of jobs in the U.S. by 2018.
Decoding the human genome originally took 10 years to process; now it can be achieved in one week with the power of Analytic and BI (Business Intelligence). This lecture’s Key Messages is that Analytics provide a competitive edge to individuals , companies and institutions and that Analytics and BI are often critical to the success of any organization.
Methodology used is to teach analytic techniques through real world examples and real data with this goal to convince audience of the Analytics Edge and power of BI, and inspire them to use analytics and BI in their career and their life.
This slide discuss predictive data analytics models and their applications in broader content. It gives simple examples of regression and classification.
Experienced Analyst with a demonstrated history of working with huge amounts of data. Skilled in R and python, SQL, Tableau, Microsoft Office, Leadership, Project Management. Strong research professional with a Masters in Statistics with specialization in Data Science from California State University - East Bay. Currently working part time as Data Analyst in the Office of Sustainability, Cal State East Bay.
Linking Heterogeneous Scholarly Data Sources in an Interoperable Setting: the...Platforma Otwartej Nauki
“Open Research Data: Implications for Science and Society”, Warsaw, Poland, May 28–29, 2015, conference organized by the Open Science Platform — an initiative of the Interdisciplinary Centre for Mathematical and Computational Modelling at the University of Warsaw. pon.edu.pl @OpenSciPlatform #ORD2015
It has been said that Mobiles +Cloud + Social + Big Data = Better Run The World. IBM has invested over $20 billion since 2005 to grow its analytics business, many companies will invest more than $120 billion by 2015 on analytics, hardware, software and services critical in almost every industry like ; Healthcare, media, sports, finance, government, etc.
It has been estimated that there is a shortage of 140,000 – 190,000 people with deep analytical skills to fill the demand of jobs in the U.S. by 2018.
Decoding the human genome originally took 10 years to process; now it can be achieved in one week with the power of Analytic and BI (Business Intelligence). This lecture’s Key Messages is that Analytics provide a competitive edge to individuals , companies and institutions and that Analytics and BI are often critical to the success of any organization.
Methodology used is to teach analytic techniques through real world examples and real data with this goal to convince audience of the Analytics Edge and power of BI, and inspire them to use analytics and BI in their career and their life.
This slide discuss predictive data analytics models and their applications in broader content. It gives simple examples of regression and classification.
This brief work is aimed in the direction of basics of data sciences and model building with focus on implementation on fairly sizable dataset. It focuses on cleaning the data, visualization, EDA, feature scaling, feature normalization, k-nearest neighbor, logistic regression, random forests, cross validation without delving too deep into any of them but giving a start to a new learner.
This presentation introduces big data and explains how to generate actionable insights using analytics techniques. The deck explains general steps involved in a typical analytics project and provides a brief overview of the most commonly used predictive analytics methods and their business applications.
Vijay Adamapure is a Data Science Enthusiast with extensive experience in the field of data mining, predictive modeling and machine learning. He has worked on numerous analytics projects ranging from healthcare, business analytics, renewable energy to IoT.
Vijay presented these slides during the Internet of Everything Meetup event 'Predictive Analytics - An Overview' that took place on Jan. 9, 2015 in Mumbai. To join the Meetup group, register here: http://bit.ly/1A7T0A1
Seeking a challenging position to utilize my quantitative and data interpretation skills complementing with my knowledge of Technology and Management to excel in areas of Analytics and Digital Marketing; which will nurture and bring forth the best I can offer to the organization, self & society
1. GRACE (HAN) YANG
han.yang.15@ucl.ac.uk (+44) 07719924933 49 Southbridge Road, Croydon, CR0 1AG
EDUCATION
UNIVERSITY COLLEGE LONDON, London, UK Sep 2015–Sep 2016
Master of Science in Business Analytics (with specialization in COMPUTER SCIENCE) GPA: 78.75/100 top 10%
CORE MODULES: Business Analytics, Data Analytics, Information Retrieval and Data Mining, Web Economics,
Natural Language Processing, Graphical Models, Applied Machine Learning, Supervised Learning
SOUTHWESTERN UNIVERSITY OF FINANCE AND ECONOMICS, Chengdu, China Sep 2011-Jul 2015
Bachelor of Science in Business Intelligence and Information Management GPA: 86.3/100 top 15%
CORE MODULES: Mathematical Analysis, Advanced Algebra, Probability, Statistics, Time Series Analysis,
Econometrics, Information System Management, Data Analysis and Business Intelligence, Database Management,
Data Mining, Microeconomics, Macroeconomics, Accounting, Intermediate Financial Accounting, Finance
INCLUDING: UNIVERSITY OF JYVÄSKYLÄ, Finland Sep 2014-Jun 2015
Exchange Student in Information Systems (sponsored by CHINA SCHOLARSHIP COUNCIL) top 10%
EXPERIENCE
CITY COUNCIL OF CROYDON, London, UK Oct 2016-present
Data Analyst Intern
Investigate data from various sources (e.g. welfare, legal system) and conduct advanced data visualization individually
using MS Power BI; generate well-written report to support the decision-making process of the ongoing projects.
EDF ENERGY, London, UK Jan 2016-Apr 2016
Data Analyst Intern
Worked in a group of 4 and developed ad-hoc machine-learning algorithms to predict the probability of a customer
renewing with EDF in Python on a 70077-records dataset; reported to non-technical EDF manager in weekly meeting.
l Visualized each variable and got exploratory insight into the customer dataset; recognized outliers and extracted their
common characteristics with SVM; proposed several renewal product recommendations based on the data insight.
l Identified customer segmentations using a method integrating K-means and K-modes to cluster mixed categorical
and numerical data, finally obtained 5 customer clusters and evaluated them on their customer values.
l Achieved final prediction accuracy of 72% within set deadline; generated a final report summarizing the project
implementation and presented to senior manager of our predictive models as well as the modeling results.
CHINESE HOUSEHOLD FINANCE RESEARCH CENTER, Chengdu, China Jul 2013-Aug 2013
Surveyor & Data Analyst Intern
Conducted research on the financial overview of Chinese households; attended data collection techniques training.
l Interviewed sample families those are randomly chosen by computer in Guizhou Province and collected their
financial data; wrote advanced queries to analyze the major payment behavior of 8000 households by MySQL.
l Identified a significant increasing trend of online payment; analyzed the ongoing payment behavior transformation.
PROJECTS
DETERMINE PROTEIN DOMAIN STRUCTURE USING MACHINE LEARNING, London, UK Jun 2016-Sep 2016
Data Analyst
Built machine-learning models to predict the protein domain structure directly from amino acid sequences; reported
weekly progress and discussed further work with supervisor; wrote up a final dissertation based on the modeling results.
l Integrated unstructured protein sequence data of various formats into one unified data file; created a novel chunk
prediction method for domain boundary and predicted domain number solely by global protein sequence features.
l Ensemble classifiers and logistic regression model (LR) were trained with the consolidated data to predict domain
boundary in Python; obtaining domain boundary and number prediction accuracy of 87% and 72% respectively.
CLICK-THROUGH RATE PREDICTION FOR ADVERTISEMENT REAL-TIME BIDDING, UK Feb 2016-Apr 2016
Group Leader & Data Analyst
Cooperated with 2 group members from math and engineering backgrounds to predict the click-through rate of each
auctioned ad impression in real-time bidding display advertising given a noisy dataset of 3M users’ clicking records.
l Cleaned and resampled the imbalanced dataset to train machine learning algorithms (e.g. boosted tree models, LR)
with various feature-engineering methods (e.g. log-odds), optimizing the trained models by parameter tuning.
l Obtained a converged AUC score 0.780 and finished ranking the FIRST place in the KAGGLE competition.
TECHNICAL SKILLS
Languages: Python, R, MATLAB, Java, SQL Tools: SPSS, SAS, Tableau, MS Power BI, MS Office, MS SQL server