Yuanzhe Cai’s Curriculum Vitae The University of Texas at Arlington
CURRICULUM VITAE
General Information
Name: Dr. Yuanzhe Cai Gender: Male Age: 32
Address: D209 Via Lucca, Irvine, CA 92612
Email: yuanzhe.cai@gmail.com Mobile Telephone: (682) 240-5640
Objectives
Seeking a full-time software engineer position
Education
The University of Texas at Arlington (Texas) 2009-2014 (GPA: 3.78/4.0)
Ph.D. in Computer Science and Engineering Supervisor: Prof. Sharma Chakravarthy
Dissertation: Inferring answer quality, answerer expertise, and ranking in question/answer social
networks.
Renmin University of China (Beijing) 2005-2008 (GPA 3.78/4.0)
M.S. in Computer Science and Engineering Supervisor: Prof. Xiaoyong Du
Thesis: A method for the similarity calculation on the large scale documents
Xidian University (Xi’an) 2001-2005 (GPA 3.6/4.0)
B.S. in Software Engineering
Skills & Handled Instruments
• Solid knowledge of database systems, data mining, and search engines
• Hands-on experience with database kernel components for PostgreSQL (2)
• Good knowledge of database optimization (table indexing, query analysis, performance tuning, etc.) (5)
• Good knowledge of big data: Hadoop Framework (2), PostgreSQL 9.4 (NoSQL Feature) (1), MongoDB (1), Spark (0.5)
• Proficiency with data mining and information retrieval software: Weka (4) and Lucene (1)
• Expertise in social network analysis: Q/A community analysis (5)
• Expertise in recommendation systems: book and social tagging recommendation systems
• Good knowledge of J2EE optimization (JMS optimization, JBoss server performance tuning, Hibernate performance tuning, etc.) (1)
• Proficiency with: Java (10), C (5), MATLAB (5), SQL (9), PL/SQL (1), J2EE Framework (3), VBA (0.5), C++ (1), EJB (1), JBoss (1), MySQL (2)
Achievements
• Database Kernel Development (PostgreSQL 8.3):
  Result set cache development
  Performance monitoring tools
• Online Community Analysis (e.g., Facebook, Yahoo! Answers, Stack Overflow):
  Crawled more than 30 GB of raw web data
  Managed more than 10 million question-answer pairs
  Designed efficient and effective algorithms to analyze answerers' behavior
  Calculated users' expertise
  Recommended questions to the appropriate users
• Digital Library for Renmin University of China (2.0):
  Retrieved books by keyword using Lucene
  Provided personalized book recommendations for users
  This system was used in the Renmin University Library
• Healthcare Radiology Information System (RIS):
  Full-stack developer for the J2EE framework
  System performance tuning (DB and JBoss server tuning)
Projects
• 06/2015- Radiology Information System (RIS) Re-Architecture (JBoss, EJB, Java, MySQL, etc.)
  - Full-stack developer (schedule, worklist, report)
  - Improve system performance (JMS tuning, DB tuning, JBoss tuning, etc.)
  - Data migration tool (migration from RIS v2.3 to RIS v2.8)
  - Import tool (imports Excel data into the RIS system)
 09/2009-05/2014 Identifying Expertise and Answer Quality in Q/A Social Networks (Java, Hadoop
framework, MongoDB, PostgreSQL and Matlab)
 Answer Quality Prediction in Q/A Social Network by Leveraging Temporal Feature
 Expertise Ranking of Users in Q/A Community
 Identify the specialist for a particular question or a domain in the Q/A community.
 Social tagging recommendation using the tensor decomposition in the Q/A community
 02/2008-08/2009 Document Similarity Analysis (Java)
 Develop an algorithm to calculate the document similarity.
 Improve SimRank algorithm from 2 days to 2 minutes (100 thousands nodes for the citation
graph)
 09/2007-02/2008 Code system development (ontology management system) (Java, PLSQL, Oracle)
 Implement the different kinds of relationship, instance and class for ontology.
 This system was used in the Database & Intelligent Information Retrieval Lab.
 02/2007-09/2007 Digital Library for for Renmin University of China (Java, Lucene)
 Develop the book retrieval system 2.0 using lucene
 Personally recommend a proper book for a user using item based and user based algorithms
 07/2006-02/2007 Database Performance Monitoring (C and PostgreSQL)
 Develop a group of database views to monitor the database performance, such as io, buffer, file,
lock, event, log information, etc.
 This monitor was used in Kingbase v.4.1.
 03/2006-07/2006 Cadre evaluation system of the CPC Central Committee (VB, PowerDesigner,
Kingbase 4.1 and VBA)
 Develop the cadre evaluation system for the CPC Central committee.
 Implement the database design and UI program
 This system was used for cadres’ election for 23 provinces in China.
 09/2005-03/2006 SQL result set cache (C and PostgreSQL)
 Implemented both client memory cache and share memory cache.
Yuanzhe Cai’s Curriculum Vitae The University of Texas at Arlington
 Take the TPC-C test.
 This result set cache was used in Kingbase v4.1.
 07/2005-09/2006 Japanese healthcare software system (Java, J2EE framework, Tomcat and
Oracle)
 Build the J2EE framework using Struts + Spring + Hibernate
 Implement the database design for the healthcare system
 02/2005-06/2006 Business customer behavior analysis system (Java, JSP and Tomcat)
 Using data mining technique to analyze the customer’s behaviors
 Implement data mining algorithms, such as ID3 tree classifier, naive bayes classifier, k-mean
cluster, apriori algorithm, and preprocess, etc.
 Direct a group (5 persons) to implement system
Work Experience & Internship
• 06/15- Candelis Inc.
  Software Engineer, Backend Performance
• 09/14-06/15 University of Texas at Arlington
  Lecturer for CSE 2320 (Data structure and Algorithm) and CSE 5311 (Algorithm)
• 09/09-05/14 University of Texas at Arlington
  TA for C, Java, Database, Data Structure, Computer Architecture, etc.
• 09/05 - 01/07 Beijing BaseSoft Co., Ltd., PostgreSQL kernel development
  Database Kernel Developer
• 07/05 - 09/05 Shanghai Xinyou Co., Ltd., Japanese healthcare software system
  Software Engineer
• 02/05 - 06/05 Xi’an Software Park, Business customer behavior analysis system
  Software Engineer
Honors & Scholarships
• TA Fellowship at the University of Texas at Arlington from fall 2009 to May 2014
• Three-year fellowship at Renmin University of China, 2005-2008
• Third-class scholarship at Xidian University in 2004 and 2005
• IBM WebSphere Certification
• Third-class Math Modeling award at Xidian University in 2004
Patents
• Xiaoyong Du, Hongyan Liu, Jun He, Yuanzhe Cai and Pei Li, A method of the document similarity calculation, Patent ID CN101576903B
• Xiaoyong Du, Hongyan Liu, Jun He, Pei Li and Yuanzhe Cai, Efficient similarity calculation on a graph using block structure, Patent ID CN101576905B
• Xiaoyong Du, Hongyan Liu, Jun He, Yuanzhe Cai and Xu Jia, Explore the power law distribution on a graph for efficient similarity calculation, Patent ID CN101853281A
Publications
[1] Yuanzhe Cai, Sharma Chakravarthy: Answer Quality Prediction in Q/A Social Networks by
Leveraging Temporal Features. International Journal of Next-Generation Computing, Volume 4,
2013
[2] Yuanzhe Cai, Sharma Chakravarthy: Expertise Ranking of Users in QA Community. Database
Systems for Advanced Applications, 18th International Conference, DASFAA 2013, Wuhan, China,
April 22-25, 2013
[3] Yuanzhe Cai, Sharma Chakravarthy: Pairwise Similarity Calculation of Information Networks. Data
Warehousing and Knowledge Discovery - 13th International Conference, DaWaK 2011, Toulouse,
France, August 29-September 2, 2011.
[4] Yuanzhe Cai, Miao Zhang, Dijun Luo, Chris H. Q. Ding, Sharma Chakravarthy: Low-order tensor
decompositions for social tagging recommendation. Proceedings of the Fourth International
Conference on Web Search and Web Data Mining, WSDM 2011, Hong Kong, China, February 9-12,
2011.
[5] Xu Jia, Hongyan Liu, Li Zou, Jun He, Xiaoyong Du, Yuanzhe Cai: Local Methods for Estimating
SimRank Score. Advances in Web Technologies and Applications, Proceedings of the 12th
Asia-Pacific Web Conference, APWeb 2010, Busan, Korea, 6-8 April 2010.
[6] Yuanzhe Cai, Miao Zhang, Chris H. Q. Ding, Sharma Chakravarthy: Closed form solution of
similarity algorithms. Proceeding of the 33rd International ACM SIGIR Conference on Research and
Development in Information Retrieval, SIGIR 2010, Geneva, Switzerland, July 19-23, 2010.
[7] Xu Jia, Yuanzhe Cai, Hongyan Liu, Jun He, Xiaoyong Du: Calculating Similarity Efficiently in a
Small World. Advanced Data Mining and Applications, 5th International Conference, ADMA 2009,
Beijing, China, August 17-19, 2009.
[8] Yuanzhe Cai, Hongyan Liu, Jun He, Xiaoyong Du, Xu Jia: An Adaptive Method for the Efficient
Similarity Calculation. Database Systems for Advanced Applications, 14th International Conference,
DASFAA 2009, Brisbane, Australia, April 21-23, 2009.
[9] Yuanzhe Cai, Gao Cong, Xu Jia, Hongyan Liu, Jun He, Jiaheng Lu, Xiaoyong Du: Efficient
Algorithm for Computing Link-Based Similarity in Real World Networks. ICDM 2009, The Ninth
IEEE International Conference on Data Mining, Miami, Florida, USA, 6-9 December 2009.
[10] Pei Li, Yuanzhe Cai, Hongyan Liu, Jun He, Xiaoyong Du: Exploiting the Block Structure of Link
Graph for Efficient Similarity Computation. Advances in Knowledge Discovery and Data Mining,
13th Pacific-Asia Conference, PAKDD 2009, Bangkok, Thailand, April 27-30, 2009, Proceedings.
[11] Yuanzhe Cai, Pei Li, Hongyan Liu, Jun He, Xiaoyong Du: S-SimRank: Combining Content and Link
Information to Cluster Papers Effectively and Efficiently. Advanced Data Mining and Applications,
4th International Conference, ADMA 2008, Chengdu, China, October 8-10, 2008.
[12] Yuanzhe Cai and Sharma Chakravarthy. Identifying Specialists for Concepts. 18th International
Conference on Extending Database Technology, March 23-27, 2015 - Brussels, Belgium
(submitted).
[13] Yuanzhe Cai and Sharma Chakravarthy. HITS vs. Non-negative Matrix Factorization. Technical
Report, 2014.
References
• Dr. Sharma Chakravarthy, Professor, Department of Computer Science and Engineering, UT Arlington, email: sharma@cse.uta.edu, Phone: (817) 272-2082
• Dr. Chris Ding, Professor, Department of Computer Science and Engineering, UT Arlington, email: CHQDing@uta.edu, Phone: (817) 272-7041
• Dr. Deguang Kong, Senior Research Engineer, Samsung Electronics, email: doogkong@gmail.com, Phone: (408) 718-4906
• Ming Ge, RIS Project Manager, Candelis Inc., email: ming.ge@candelis.com, Phone: (917) 348-8560

Resume

  • 1. Yuanzhe Cai’s Curriculum Vitae The University of Texas at Arlington CURRICULUM VITAE GGeenneerraall IInnffoorrmmaattiioonn Name: Dr. Yuanzhe Cai Gender: Male Age: 32 Address: D209 Via Lucca, Irvine, CA 92612 Email: yuanzhe.cai@gmail.com Mobile Telephone: (682) 240-5640 OObbjjeeccttiivveess Seek for a full time software engineer EEdduuccaattiioonn The University of Texas at Arlington (Texas) 2009-2014 (GPA: 3.78/4.0) Ph.D. in Computer Science and Engineering Supervisor: Prof. Sharma Chakravarthy Dissertation: Inferring answer quality, answerer expertise, and ranking in question/answer social networks. Renmin University of China (Beijing) 2005-2008 (GPA 3.78/4.0) M.S. in Computer Science and Engineering Supervisor: Prof. Xiaoyong Du Thesis: A method for the similarity calculation on the large scale documents Xidian University (Xi’an) 2001-2005 (GPA 3.6/4.0) B.S. in Software Engineer SSkkiillllss && HHaannddlleedd IInnssttrruummeennttss  Solid knowledge in database system, data mining and search engine  Hands-on database kernel components for PostgreSQL (2)  Good knowledge in Database Optimization (table indexing, query analyzing, performance tuning, etc.) (5)  Good knowledge in Big data: Hadoop Framework (2) , PostgreSQL 9.4 (NoSQL Feature) (1), MongoDB (1), Spark (0.5)  Proficiency with data mining tool and information retrieval software: weka (4) and lucene (1) system  Expertise in social networks analysis: Q/A community Analysis (5)  Expertise in recommendation system: book and social tagging recommendation system  Good knowledge in J2EE Optimization (JMS optimization, JBoss server performance tuning, Hibernate performance tuning, etc.) 
(1)  Proficiency with: Java (10), C (5), Matlab (5), SQL (9), PL/SQL (1), J2EE Framework (3), VBA (0.5), C++ (1), EJB (1), JBoss (1), MYSQL (2) AAcchhiieevveemmeennttss  Database Kernel Development (PostgreSQL 8.3): Result set cache development Performance monitoring tools  Online Community Analysis (e.g., Facebook, Yahoo! Answers, Stack Overflow, etc): Crawl more than 30G web original data Manage more than 10 million question and answers pairs
  • 2. Yuanzhe Cai’s Curriculum Vitae The University of Texas at Arlington Design efficiently and effectively algorithms to analyze answerer’s behavior Calculate user’s expertise Recommend the questions to the proper users  Digital Library for Renmin University of China (2.0): Retrieve the book according to the keywords using lucence system Personally recommend a proper book for a user This system was used in Renmin University Library  Healthcare Radiology Information System (RIS): Full-Stack developer for J2EE framework. System performance turning (DB and JBoss server tuning). PPrroojjeeccttss  06/2015- Radiology Information System (RIS) Re-Architecture (JBoss, EJB, JAVA, MYSQL etc)  Full-Stack developer (schedule, worklist, report)  Improve the system performance. (JMS tuning, DB tuning, JBoss tuning, etc)  Data Migration Tool. (Migration from RIS(v2.3) to RIS(2.8))  Import Tool. (Import excel data into RIS system)  09/2009-05/2014 Identifying Expertise and Answer Quality in Q/A Social Networks (Java, Hadoop framework, MongoDB, PostgreSQL and Matlab)  Answer Quality Prediction in Q/A Social Network by Leveraging Temporal Feature  Expertise Ranking of Users in Q/A Community  Identify the specialist for a particular question or a domain in the Q/A community.  Social tagging recommendation using the tensor decomposition in the Q/A community  02/2008-08/2009 Document Similarity Analysis (Java)  Develop an algorithm to calculate the document similarity.  Improve SimRank algorithm from 2 days to 2 minutes (100 thousands nodes for the citation graph)  09/2007-02/2008 Code system development (ontology management system) (Java, PLSQL, Oracle)  Implement the different kinds of relationship, instance and class for ontology.  This system was used in the Database & Intelligent Information Retrieval Lab. 
 02/2007-09/2007 Digital Library for for Renmin University of China (Java, Lucene)  Develop the book retrieval system 2.0 using lucene  Personally recommend a proper book for a user using item based and user based algorithms  07/2006-02/2007 Database Performance Monitoring (C and PostgreSQL)  Develop a group of database views to monitor the database performance, such as io, buffer, file, lock, event, log information, etc.  This monitor was used in Kingbase v.4.1.  03/2006-07/2006 Cadre evaluation system of the CPC Central Committee (VB, PowerDesigner, Kingbase 4.1 and VBA)  Develop the cadre evaluation system for the CPC Central committee.  Implement the database design and UI program  This system was used for cadres’ election for 23 provinces in China.  09/2005-03/2006 SQL result set cache (C and PostgreSQL)  Implemented both client memory cache and share memory cache.
  • 3. Yuanzhe Cai’s Curriculum Vitae The University of Texas at Arlington  Take the TPC-C test.  This result set cache was used in Kingbase v4.1.  07/2005-09/2006 Japanese healthcare software system (Java, J2EE framework, Tomcat and Oracle)  Build the J2EE framework using Struts + Spring + Hibernate  Implement the database design for the healthcare system  02/2005-06/2006 Business customer behavior analysis system (Java, JSP and Tomcat)  Using data mining technique to analyze the customer’s behaviors  Implement data mining algorithms, such as ID3 tree classifier, naive bayes classifier, k-mean cluster, apriori algorithm, and preprocess, etc.  Direct a group (5 persons) to implement system WWoorrkk EExxppeerriieennccee && IInntteerrnnsshhiipp  06/15- Candelis Inc. Software Engineer for the Backend Performance  09/14-06/15 University of Texas at Arlington Lecturer for CSE 2320 (Data structure and Algorithm) and CSE 5311 (Algorithm)  09/09-05/14 University of Texas at Arlington TA for C, Java, Database, Data Structure, Computer Architecture, etc.  09/05 - 01/07 Beijing BaseSoft Co., Ltd., PostgreSQL kernel development Database Kernel Developer  07/05 - 09/05 Shanghai Xinyou Co., Ltd., Japanese healthcare software system Software Engineer  02/05 - 06/05 Xi’an Software Park, Business customer behavior analysis system Software Engineer HHoonnoorrss && SScchhoollaarrsshhiippss  TA Fellowship at University of Texas, Arlington from fall 2009 to May 2014.  Three Years Fellowship in Renmin University of China from 2005-2008  The third-class Scholarship in Xidian University in 2004 and 2005  IBM Web Sphere Certification  The third-class Math Model in Xidian University in 2004 PPaatteennttss  Xiaoyong Du, Hongyan Liu, Jun He, Yuanzhe Cai and Pei Li, A method of the document similarity calculation, Patent Id. CN101576903B  Xiaoyong Du, Hongyan Liu, Jun He, Pei Li and Yuanzhe Cai, Efficient similarity calculation on a graph using block structure, Patent Id. 
CN101576905B  Xiaoyong Du, Hongyan Liu, Jun He, Yuanzhe Cai and Xu Jia, Explore the power law distribution on a graph for efficient similarity calculation, Patent Id. CN101853281A PPuubblliiccaattiioonnss [1] Yuanzhe Cai, Sharma Chakravarthy: Answer Quality Prediction in Q/A Social Networks by Leveraging Temporal Features. International Journal of Next-Generation Computing, Volume 4, 2013
[2] Yuanzhe Cai, Sharma Chakravarthy: Expertise Ranking of Users in QA Community. Database Systems for Advanced Applications, 18th International Conference, DASFAA 2013, Wuhan, China, April 22-25, 2013.
[3] Yuanzhe Cai, Sharma Chakravarthy: Pairwise Similarity Calculation of Information Networks. Data Warehousing and Knowledge Discovery, 13th International Conference, DaWaK 2011, Toulouse, France, August 29-September 2, 2011.
[4] Yuanzhe Cai, Miao Zhang, Dijun Luo, Chris H. Q. Ding, Sharma Chakravarthy: Low-order tensor decompositions for social tagging recommendation. Proceedings of the Fourth International Conference on Web Search and Web Data Mining, WSDM 2011, Hong Kong, China, February 9-12, 2011.
[5] Xu Jia, Hongyan Liu, Li Zou, Jun He, Xiaoyong Du, Yuanzhe Cai: Local Methods for Estimating SimRank Score. Advances in Web Technologies and Applications, Proceedings of the 12th Asia-Pacific Web Conference, APWeb 2010, Busan, Korea, April 6-8, 2010.
[6] Yuanzhe Cai, Miao Zhang, Chris H. Q. Ding, Sharma Chakravarthy: Closed form solution of similarity algorithms. Proceedings of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2010, Geneva, Switzerland, July 19-23, 2010.
[7] Xu Jia, Yuanzhe Cai, Hongyan Liu, Jun He, Xiaoyong Du: Calculating Similarity Efficiently in a Small World. Advanced Data Mining and Applications, 5th International Conference, ADMA 2009, Beijing, China, August 17-19, 2009.
[8] Yuanzhe Cai, Hongyan Liu, Jun He, Xiaoyong Du, Xu Jia: An Adaptive Method for the Efficient Similarity Calculation. Database Systems for Advanced Applications, 14th International Conference, DASFAA 2009, Brisbane, Australia, April 21-23, 2009.
[9] Yuanzhe Cai, Gao Cong, Xu Jia, Hongyan Liu, Jun He, Jiaheng Lu, Xiaoyong Du: Efficient Algorithm for Computing Link-Based Similarity in Real World Networks. ICDM 2009, The Ninth IEEE International Conference on Data Mining, Miami, Florida, USA, December 6-9, 2009.
[10] Pei Li, Yuanzhe Cai, Hongyan Liu, Jun He, Xiaoyong Du: Exploiting the Block Structure of Link Graph for Efficient Similarity Computation. Advances in Knowledge Discovery and Data Mining, 13th Pacific-Asia Conference, PAKDD 2009, Bangkok, Thailand, April 27-30, 2009, Proceedings.
[11] Yuanzhe Cai, Pei Li, Hongyan Liu, Jun He, Xiaoyong Du: S-SimRank: Combining Content and Link Information to Cluster Papers Effectively and Efficiently. Advanced Data Mining and Applications, 4th International Conference, ADMA 2008, Chengdu, China, October 8-10, 2008.
[12] Yuanzhe Cai, Sharma Chakravarthy: Identifying Specialists for Concepts. 18th International Conference on Extending Database Technology, Brussels, Belgium, March 23-27, 2015 (submitted).
[13] Yuanzhe Cai, Sharma Chakravarthy: HITS vs. Non-negative Matrix Factorization. Technical Report, 2014.

References
• Dr. Sharma Chakravarthy, Professor, Department of Computer Science and Engineering, UT Arlington, email: sharma@cse.uta.edu, Phone: (817) 272-2082
• Dr. Chris Ding, Professor, Department of Computer Science and Engineering, UT Arlington, email: CHQDing@uta.edu, Phone: (817) 272-7041
• Dr. Deguang Kong, Senior Research Engineer, Samsung Electronics, email: doogkong@gmail.com, Phone: (408) 718-4906
• Ming Ge, RIS Project Manager, Candelis Inc., email: ming.ge@candelis.com, Phone: (917) 348-8560