Yu's resume

Yu Wang (Max)
2600 Waterview Pkwy ■ Richardson, Texas 75080 Visa Type: F-1 (have SSN)
Mobile: (972)-961-6952 LinkedIn: www.linkedin.com/in/0yuwang0
Email: yxw124430@utdallas.edu GitHub: https://github.com/MaxWang0
OBJECTIVE To obtain a challenging co-op/internship position (Summer/Fall 2016) in the field of Computer
Science.
SPECIALIZATION Web development with ASP.NET MVC and Node.js Big data development in Spark and MongoDB
& INTERESTS Software development using Java Data manipulation and data analysis using R, Perl and Python
Database design using MySQL and MSSQL
EDUCATION The University of Texas at Dallas, Richardson, TX
M.S. Computer Science, Anticipated Graduation – December 2016 GPA: 3.35
COMPUTER Programming Languages: Java, JDBC, JSP, Servlet, C#, C++, Perl, R, Python, Scala
SKILLS Big Data: Google Cloud, MongoDB, Spark, Hadoop, Pig, Hive, Cassandra, Mahout, Recommendation System
Web Design: JavaScript, ASP.NET MVC, PHP, JSON, AJAX, jQuery, SOA
Software/IDE/Frameworks: Eclipse, Android Studio, Vim, Visual Studio, MSSQL Server,
MySQL Workbench, ASP.NET MVC, Azure, IIS, Apache, AngularJS, React, Node.js
Scripting: Perl, Bash, Python, JavaScript
Database Systems: MongoDB, MySQL, Oracle, MSSQL, Stored Procedure
Operating Systems: Apache Linux server, Ubuntu Linux Shell scripting, Windows, Git
WORK EXPERIENCE
Web Big Data Engineer Ecomotto (startup), Richardson, TX December 2015 – Present
• Built a search engine for web data retrieval with Java and shell scripts on a Google Cloud server
• Assisted with web service integration by customizing SOAP requests and retrieving data from an FTP server
• Performed real-time data manipulation and analysis with Spark and MongoDB in Scala and SparkR
• Developed server-side programs for real-time data retrieval with Java, shell scripting and MongoDB
• Built UI and data visualizations with AngularJS, Meteor and React (D3.js)
• Designed a predictive model for real-time prediction with accumulative logistic regression in SparkR
Research Assistant on Data Analysis University of Texas at Dallas, Richardson, TX Fall 2012 – Summer 2015
• Retrieved sample data from a 2 TB data collection with a Perl mapping function executed via shell scripts on an Apache server.
• Developed predictive models to improve the detection rate of statistical bias in the data, using logistic regression, a Bayesian scheme,
HMM and the EM algorithm in R and C++.
• Validated accuracy with ROC curves plotted in R and used the results to revise the model parameters.
Research Assistant Beijing Genomics Institute (BGI), Shenzhen, China July 2011 – December 2011
• Developed an automated detection program in shell script and manipulated data with Perl and R on an Apache server
• Performed bench work on sample preparation for raw data production, including whole exome capture from patients with single gene
PROJECTS
Microsoft Malware Classification with Apache Spark (Microsoft Kaggle Challenge) Summer 2015
• Developed a predictive model with Scala in Spark to classify malware data (45 GB, bytes and asm files) into 9 classes based on content
patterns and characteristics, using machine learning algorithms such as Naïve Bayes, Decision Tree and Random Forest.
Technologies(Skill Sets): Spark, Scala, Python, Mahout, YARN, Naïve Bayes, Decision Tree, Random Forest, PCA
ChinaTea Retail Web Application Development Fall 2014
• Developed a Chinese tea eCommerce website (http://teahome.azurewebsites.net/) with the ASP.NET MVC framework in Visual C#
to display products and support shopping functions, including a shopping cart and checkout.
• Deployed a database in Windows Azure to store, retrieve and update information about tea categories and
users.
Technologies(Skill Sets): ASP.NET MVC, C#, HTML5, CSS3, MSSQL, JavaScript, jQuery, Visual Studio, Windows Azure

Online Library Management System with GUI Fall 2013
• Developed a library management system using Java Swing, MySQL Server and JDBC.
• Users can search for available books, check books in and out, and pay late fees online. The system emails
each user about the upcoming deadline to return their books.
Technologies(Skill Sets): Java, MySQL, Eclipse, MySQL Workbench
Android Mobile Contact Manager UI Development Fall 2016
• Designed and developed an Android Contact Manager application with a standard user interface and functions.
• Used Intents to pass data between activities and TabHost to create tabs; users can add, edit, delete and save contacts
locally as needed. Applied Android design principles throughout the application.
Technologies(Skill Sets): Java, XML, Android Studio
Hadoop Relational Operations on IMDB Datasets Summer 2015
• Implemented complex Pig Latin scripts, UDFs, Hive queries and Cassandra queries to analyze the IMDB movie
database.
• Used Hive and Pig to perform relational operations, including joins and co-groups, to analyze properties of the
IMDB data such as the movie preferences of male and female users.
Technologies(Skill Sets): Pig, Hive, Cassandra, Map-Reduce, Hortonworks
Big Data Analysis on Online Purchase Data Summer 2015
• Designed Hadoop Map-Reduce applications that run chained Map-Reduce jobs to derive statistics from the Online Purchase
Dataset, such as the top 10 most popular stores and the number of purchases for a particular product type.
Technologies(Skill Sets): Python, Hadoop Framework, Map-Reduce, Linux, HDFS, Cloudera
Deriving Statistics from the Yelp Dataset Summer 2015
• Designed a Hadoop Map-Reduce application in Java to retrieve the 10 businesses with the highest average rating in the Yelp Dataset.
• Implemented chaining of Map-Reduce jobs with both in-memory and reduce-side joins. Achieved the desired output using
secondary sorting and custom partitioning in the Map-Reduce job.
Technologies(Skill Sets): Java, Hadoop Framework, Map-Reduce, Linux, HDFS, Hortonworks
Netflix Recommendation System Summer 2015
• Implemented item-based collaborative filtering with Mahout's spark-itemsimilarity to generate business recommendations for
specific users.
Technologies(Skill Sets): Scala, Apache Spark, Mahout, YARN
Post Office Simulation with Multiple Threads Spring 2015
• Used Java threads and semaphores to model customer and employee behavior in a post office.
• Created threads to simulate customers and postal workers and used semaphores to coordinate
customer threads with postal worker threads. Mutual exclusion was kept to a minimum to allow the most concurrency.
Technologies(Skill Sets): Java, Linux, Vim, Eclipse
Computer System Simulation Spring 2015
• Simulated a computer system consisting of a CPU and memory as separate processes that model the computer instruction cycle.
The simulated computer can run programs written in a specific instruction set.
• Created 2 processes, one for the CPU and one for memory. The CPU has 6 registers and 1 cache; the memory has 1000 addresses. The two
processes communicate with each other: the CPU fetches instructions from memory and then performs calculations on the fetched
data.
Technologies(Skill Sets): Java, Linux, Vim, Eclipse
RELEVANT The University of Texas at Dallas - Erik Jonsson School of Engineering & Computer Science
COURSES Big Data Management and Analytics Database Design
Web Programming Languages UI Design and Mobile Application
Design and Analysis of Computer Algorithms Algorithm Analysis and Data Structures
Machine Learning Operating System Concepts
Cloud Computing Statistical Methods in Data Science
ACTIVITY Activity Designer of Friendship Association of Chinese Students and Scholars at UT Dallas
AVAILABILITY Summer /Fall 2016
