1. Yuanzhe Cai’s Curriculum Vitae The University of Texas at Arlington
CURRICULUM VITAE
General Information
Name: Dr. Yuanzhe Cai Gender: Male Age: 32
Address: D209 Via Lucca, Irvine, CA 92612
Email: yuanzhe.cai@gmail.com Mobile Telephone: (682) 240-5640
Objectives
Seeking a full-time software engineer position
Education
The University of Texas at Arlington (Texas) 2009-2014 (GPA: 3.78/4.0)
Ph.D. in Computer Science and Engineering Supervisor: Prof. Sharma Chakravarthy
Dissertation: Inferring answer quality, answerer expertise, and ranking in question/answer social
networks.
Renmin University of China (Beijing) 2005-2008 (GPA 3.78/4.0)
M.S. in Computer Science and Engineering Supervisor: Prof. Xiaoyong Du
Thesis: A method for the similarity calculation on the large scale documents
Xidian University (Xi’an) 2001-2005 (GPA 3.6/4.0)
B.S. in Software Engineering
Skills & Handled Instruments
Solid knowledge of database systems, data mining, and search engines
Hands-on experience with database kernel components for PostgreSQL (2)
Good knowledge of database optimization (table indexing, query analysis, performance tuning,
etc.) (5)
Good knowledge of big data: Hadoop framework (2), PostgreSQL 9.4 (NoSQL features) (1),
MongoDB (1), Spark (0.5)
Proficiency with data mining and information retrieval software: Weka (4) and Lucene (1)
Expertise in social network analysis: Q/A community analysis (5)
Expertise in recommendation systems: book and social tagging recommendation
Good knowledge of J2EE optimization (JMS optimization, JBoss server performance tuning,
Hibernate performance tuning, etc.) (1)
Proficiency with: Java (10), C (5), Matlab (5), SQL (9), PL/SQL (1), J2EE framework (3), VBA (0.5),
C++ (1), EJB (1), JBoss (1), MySQL (2)
Achievements
Database Kernel Development (PostgreSQL 8.3):
Result set cache development
Performance monitoring tools
Online Community Analysis (e.g., Facebook, Yahoo! Answers, Stack Overflow, etc.):
Crawl more than 30 GB of raw web data
Manage more than 10 million question-answer pairs
Design efficient and effective algorithms to analyze answerer behavior
Calculate users' expertise
Recommend questions to suitable users
Digital Library for Renmin University of China (2.0):
Retrieve books by keyword using Lucene
Recommend suitable books to each user (personalized recommendation)
This system was used in Renmin University Library
Healthcare Radiology Information System (RIS):
Full-Stack developer for J2EE framework.
System performance tuning (DB and JBoss server tuning).
Projects
06/2015- Radiology Information System (RIS) Re-Architecture (JBoss, EJB, Java, MySQL, etc.)
Full-stack developer (schedule, worklist, report)
Improve system performance (JMS tuning, DB tuning, JBoss tuning, etc.)
Data migration tool (migration from RIS v2.3 to RIS v2.8)
Import tool (imports Excel data into the RIS)
09/2009-05/2014 Identifying Expertise and Answer Quality in Q/A Social Networks (Java, Hadoop
framework, MongoDB, PostgreSQL and Matlab)
Answer quality prediction in Q/A social networks by leveraging temporal features
Expertise ranking of users in Q/A communities
Identify the specialists for a particular question or domain in the Q/A community
Social tagging recommendation using tensor decomposition in the Q/A community
02/2008-08/2009 Document Similarity Analysis (Java)
Develop an algorithm to calculate document similarity.
Reduce the running time of the SimRank algorithm from 2 days to 2 minutes on a citation
graph of 100 thousand nodes
09/2007-02/2008 Code system development (ontology management system) (Java, PL/SQL, Oracle)
Implement the different kinds of relationships, instances, and classes for the ontology.
This system was used in the Database & Intelligent Information Retrieval Lab.
02/2007-09/2007 Digital Library for Renmin University of China (Java, Lucene)
Develop the book retrieval system 2.0 using Lucene
Recommend suitable books to each user using item-based and user-based algorithms
07/2006-02/2007 Database Performance Monitoring (C and PostgreSQL)
Develop a set of database views to monitor database performance, such as I/O, buffer, file,
lock, event, and log information.
This monitor was used in Kingbase v4.1.
03/2006-07/2006 Cadre evaluation system of the CPC Central Committee (VB, PowerDesigner,
Kingbase 4.1 and VBA)
Develop the cadre evaluation system for the CPC Central Committee.
Implement the database design and the UI program.
This system was used for cadre elections in 23 provinces in China.
09/2005-03/2006 SQL result set cache (C and PostgreSQL)
Implement both a client-side memory cache and a shared-memory cache.
Evaluate the cache with the TPC-C benchmark.
This result set cache was used in Kingbase v4.1.
07/2005-09/2006 Japanese healthcare software system (Java, J2EE framework, Tomcat and
Oracle)
Build the J2EE framework using Struts + Spring + Hibernate
Implement the database design for the healthcare system
02/2005-06/2006 Business customer behavior analysis system (Java, JSP and Tomcat)
Use data mining techniques to analyze customer behavior
Implement data mining algorithms, such as the ID3 tree classifier, naive Bayes classifier, k-means
clustering, the Apriori algorithm, and preprocessing
Lead a five-person team to implement the system
Work Experience & Internship
06/15- Candelis Inc.
Software Engineer for the Backend Performance
09/14-06/15 University of Texas at Arlington
Lecturer for CSE 2320 (Data structure and Algorithm) and CSE 5311 (Algorithm)
09/09-05/14 University of Texas at Arlington
TA for C, Java, Database, Data Structure, Computer Architecture, etc.
09/05 - 01/07 Beijing BaseSoft Co., Ltd., PostgreSQL kernel development
Database Kernel Developer
07/05 - 09/05 Shanghai Xinyou Co., Ltd., Japanese healthcare software system
Software Engineer
02/05 - 06/05 Xi’an Software Park, Business customer behavior analysis system
Software Engineer
Honors & Scholarships
TA Fellowship at University of Texas, Arlington from fall 2009 to May 2014.
Three-year fellowship at Renmin University of China, 2005-2008
Third-class scholarship at Xidian University in 2004 and 2005
IBM WebSphere Certification
Third-class award in mathematical modeling at Xidian University in 2004
Patents
Xiaoyong Du, Hongyan Liu, Jun He, Yuanzhe Cai and Pei Li, A method of the document similarity
calculation, Patent Id. CN101576903B
Xiaoyong Du, Hongyan Liu, Jun He, Pei Li and Yuanzhe Cai, Efficient similarity calculation on a
graph using block structure, Patent Id. CN101576905B
Xiaoyong Du, Hongyan Liu, Jun He, Yuanzhe Cai and Xu Jia, Explore the power law distribution on
a graph for efficient similarity calculation, Patent Id. CN101853281A
Publications
[1] Yuanzhe Cai, Sharma Chakravarthy: Answer Quality Prediction in Q/A Social Networks by
Leveraging Temporal Features. International Journal of Next-Generation Computing, Volume 4,
2013
[2] Yuanzhe Cai, Sharma Chakravarthy: Expertise Ranking of Users in QA Community. Database
Systems for Advanced Applications, 18th International Conference, DASFAA 2013, Wuhan, China,
April 22-25, 2013
[3] Yuanzhe Cai, Sharma Chakravarthy: Pairwise Similarity Calculation of Information Networks. Data
Warehousing and Knowledge Discovery - 13th International Conference, DaWaK 2011, Toulouse,
France, August 29-September 2, 2011.
[4] Yuanzhe Cai, Miao Zhang, Dijun Luo, Chris H. Q. Ding, Sharma Chakravarthy: Low-order tensor
decompositions for social tagging recommendation. Proceedings of the Fourth International
Conference on Web Search and Web Data Mining, WSDM 2011, Hong Kong, China, February 9-12,
2011.
[5] Xu Jia, Hongyan Liu, Li Zou, Jun He, Xiaoyong Du, Yuanzhe Cai: Local Methods for Estimating
SimRank Score. Advances in Web Technologies and Applications, Proceedings of the 12th
Asia-Pacific Web Conference, APWeb 2010, Busan, Korea, 6-8 April 2010.
[6] Yuanzhe Cai, Miao Zhang, Chris H. Q. Ding, Sharma Chakravarthy: Closed form solution of
similarity algorithms. Proceeding of the 33rd International ACM SIGIR Conference on Research and
Development in Information Retrieval, SIGIR 2010, Geneva, Switzerland, July 19-23, 2010.
[7] Xu Jia, Yuanzhe Cai, Hongyan Liu, Jun He, Xiaoyong Du: Calculating Similarity Efficiently in a
Small World. Advanced Data Mining and Applications, 5th International Conference, ADMA 2009,
Beijing, China, August 17-19, 2009.
[8] Yuanzhe Cai, Hongyan Liu, Jun He, Xiaoyong Du, Xu Jia: An Adaptive Method for the Efficient
Similarity Calculation. Database Systems for Advanced Applications, 14th International Conference,
DASFAA 2009, Brisbane, Australia, April 21-23, 2009.
[9] Yuanzhe Cai, Gao Cong, Xu Jia, Hongyan Liu, Jun He, Jiaheng Lu, Xiaoyong Du: Efficient
Algorithm for Computing Link-Based Similarity in Real World Networks. ICDM 2009, The Ninth
IEEE International Conference on Data Mining, Miami, Florida, USA, 6-9 December 2009.
[10] Pei Li, Yuanzhe Cai, Hongyan Liu, Jun He, Xiaoyong Du: Exploiting the Block Structure of Link
Graph for Efficient Similarity Computation. Advances in Knowledge Discovery and Data Mining,
13th Pacific-Asia Conference, PAKDD 2009, Bangkok, Thailand, April 27-30, 2009, Proceedings.
[11] Yuanzhe Cai, Pei Li, Hongyan Liu, Jun He, Xiaoyong Du: S-SimRank: Combining Content and Link
Information to Cluster Papers Effectively and Efficiently. Advanced Data Mining and Applications,
4th International Conference, ADMA 2008, Chengdu, China, October 8-10, 2008.
[12] Yuanzhe Cai and Sharma Chakravarthy. Identifying Specialists for Concepts. 18th International
Conference on Extending Database Technology, March 23-27, 2015 - Brussels, Belgium
(submitted).
[13] Yuanzhe Cai and Sharma Chakravarthy. HITS vs. Non-negative Matrix Factorization. Technical
Report, 2014.
References
Dr. Sharma Chakravarthy, Professor, Department of Computer Science and Engineering, UT
Arlington, email: sharma@cse.uta.edu, Phone: (817) 272-2082
Dr. Chris Ding, Professor, Department of Computer Science and Engineering, UT Arlington, email:
CHQDing@uta.edu, Phone: (817) 272-7041
Dr. Deguang Kong, Senior Research Engineer, Samsung Electronics, email: doogkong@gmail.com,
Phone: (408) 718-4906
Ming Ge, RIS Project Manager, Candelis Inc., email: ming.ge@candelis.com, Phone: (917) 348-8560