SlideShare a Scribd company logo
1 of 5
Download to read offline
YANDONG	
  WANG	
  
	
  
I.   CONTACT	
  INFO	
  
IBM Thomas J. Watson Research Center	
  
Cognitive System Analysis and Optimization Group
1101 Kitchawan Rd, Yorktown Height, NY 10598, USA
Office 10-134
Email: roydrax@gmail.com, Phone: +1-585-770-8705	
  
Website: researcher.ibm.com/researcher/view.php?person=us-yandong	
  
	
  
II.   INTERESTS	
  
Distributed Machine Learning; Computer Systems; big data analytics frameworks; In-memory Key-Value
stores; distributed machine learning, high-performance computing; cloud computing; data-center networking.	
  
	
  
III.   EDUCATION	
  
•   Ph.D. in Computer Science Sep. 2010 – May. 2014	
  
Auburn University, Auburn, Alabama, USA. 	
  
Advisor: Prof. Weikuan Yu
Dissertation: “Efficient Movement and Task Management in MapReduce for Fast Analytics of Big Data”.	
  
	
  
•   Master. in Computer Science Sep. 2008 – Aug. 2010 	
  
Rochester Institute of Technology, Rochester, New York, USA. 	
  
Advisor: Prof. Alan Kaminsky
Thesis: “Nvidia CUDA Architecture-based Parallel Incomplete SAT Solver”.	
  
	
  
•   Bachelor. in Traffic Engineering and Information Technology Sep. 2002 – Aug. 2006	
  
TongJi University (Shanghai, China)	
  
	
  
IV.   EMPLOYMENT	
  EXPERIENCE	
  
•   Research Staff Member March. 2014 – Present	
  
Cognitive System Analysis and Optimization Group
IBM Thomas J. Watson Research Center, Yorktown Height, NY, USA.
	
  
•   Research Assistant Sep. 2010 – May. 2014	
  
Parallel Architecture and System Lab Auburn University, AL	
  
Advisor: Prof. Weikuan Yu	
  
	
  
•   Internship at Lawrence Livermore National Lab May. 2013 – Aug. 2013	
  
	
   	
   HPC Strategic Planning and Architecture Livermore, CA	
  
	
  
•   Internship at IBM T.J Watson Research Center May. 2012 – Aug. 2012	
  
System Analysis and Optimization Group Hawthorn, NY	
  
	
  
•   Internship at Oak Ridge National Laboratory June. 2011 – Aug. 2011	
  
Technology Integration Group Knoxville, TN
	
  
V.   RESEARCH	
  EXPERIENCE	
  
•   Research Staff Member March. 2014 – Present	
  
IBM Thomas J. Watson Research Center, Yorktown Height
o   Analyzing	
  the	
  performance	
  characteristics	
  of	
  a	
  variety	
  of	
  big	
  data	
  analytics	
  frameworks.	
  
o   Designing	
  and	
  implementing	
  large-­‐‑scale	
  machine	
  learning	
  frameworks.	
  
o   Designing	
  and	
  implementing	
  next-­‐‑generation	
  in-­‐‑memory	
  key-­‐‑value	
  stores.	
  
o   Optimizing	
  cognitive	
  computing	
  platforms.	
  
o   Disseminating,	
  presenting	
  the	
  research	
  results	
  in	
  conferences	
  and	
  journals.	
  	
  
	
  
•   Research Intern at Lawrence Livermore National Lab May. 2013 – Aug. 2013	
  
o   Analyzing	
  and	
  optimizing	
  the	
  performance	
  of	
  Spark	
  on	
  HPC	
  platforms.	
  
o   Exploring	
  the	
  hybrid	
  memory	
  approaches	
  to	
  improving	
  Spark.	
  
	
  
•   Research Intern at IBM Thomas J. Watson Research May. 2012 – Aug. 2012	
  
o   Investigating	
  the	
  performance	
  of	
  Hadoop	
  MapReduce	
  schedulers.	
  
o   Designing	
  and	
  implementing	
  preemptive	
  reduce	
  task	
  scheduler.	
  
	
  
•   Research Intern at Oak Ridge National Lab June. 2011 – Aug. 2011	
  
o   Examining the performance of Lustre File System metadata management.	
  
o   Designing and implementing Lustre metadata simulator.	
  
	
  
•   Research Assistant at Auburn University Sep. 2010 – Mar. 2014	
  
o   Investigating the performance of MapReduce over high-performance networks.	
  
o   Investigating the performance of MapReduce resource management and contention management.	
  
o   Designing and implementing network levitated merge for Hadoop.	
  
o   Designing and implementing techniques to enhance the fault-tolerance for MapReduce systems.	
  
o   Examining and optimizing scientific applications on commodity cluster.	
  
	
  
VI.   PUBLICATIONS	
  
1.   [Eurosys	
  2016]:	
  Xingbo	
  Wu,	
  Li	
  Zhang,	
  Yandong	
  Wang,	
  Yufei	
  Ren,	
  Michel	
  Hack,	
  Song	
  Jiang,	
  
“zExpander:	
  a	
  Key-­‐‑Value	
  Cache	
  with	
  both	
  High	
  Performance	
  and	
  Fewer	
  Misses”.	
  The	
  11th	
  European	
  
Conference	
  on	
  Computer	
  Systems.	
  	
  
2.   [IPDPS	
  2016]:	
  Luna	
  Xu,	
  Min	
  Li,	
  Li	
  Zhang,	
  Ali	
  R.	
  Butt,	
  Yandong	
  Wang,	
  Zane	
  Hu,	
  “MEMTUNE:	
  Dynamic	
  
Memory	
  Management	
  for	
  In-­‐‑Memory	
  Data	
  Analytics	
  Platforms”.	
  30th	
  IEEE	
  International	
  Parallel	
  and	
  
Distributed	
  Processing	
  Symposium.	
  
3.   [Supercomputing	
  2015]:	
  Yandong	
  Wang,	
  Li	
  Zhang,	
  Jian	
  Tan,	
  Min	
  Li,	
  Yuqing	
  Gao,	
  Xavier	
  Guerin,	
  
Xiaoqiao	
  Meng,	
  Shicong	
  Meng,	
  “HydraDB:	
  A	
  Resilient	
  RDMA-­‐‑driven	
  Key-­‐‑Value	
  Middleware	
  for	
  In-­‐‑
Memory	
  Cluster	
  Computing”.	
  27th	
  IEEE/ACM	
  The	
  International	
  Conference	
  for	
  High	
  Performance	
  
Computing	
  Networking,	
  Storage	
  and	
  Analysis,	
  Nov	
  15-­‐‑20,	
  2015,	
  Austin,	
  TX.	
  
4.   [ACM	
  Sigmetrics	
  Performance	
  Evaluation	
  Review	
  2015]:	
  Jian	
  Tan,	
  Li	
  Zhang,	
  Yandong	
  Wang,	
  
“Miss	
  Behavior	
  for	
  Caching	
  with	
  Lease”.	
  
5.   [ACM	
  Sigmetrics	
  Performance	
  Evaluation	
  Review	
  2015]:	
  Jian	
  Tan,	
  Li	
  Zhang,	
  Min	
  Li,	
  Yandong	
  
Wang,	
  “Multi-­‐‑resource	
  Fair	
  Sharing	
  for	
  Multiclass	
  Workflows”.	
  
6.   [IPDPS	
  2015]:	
  Yandong	
  Wang,	
  Huansong	
  Fu,	
  Weikuan	
  Yu,	
  “Cracking	
  Down	
  MapReduce	
  Failure	
  
Amplification	
  through	
  Analytics	
  Logging	
  and	
  Migration”,	
  29th	
  IEEE	
  International	
  Parallel	
  and	
  
Distributed	
  Processing	
  Symposium,	
  May	
  25-­‐‑29,	
  2015,	
  Hyderabad,	
  India.	
  
7.   [CF	
  2015]:	
  	
  Paula	
  Austel,	
  Han	
  Chen,	
  Parijat	
  Dube,	
  Thomas	
  Mikalsen,	
  Isabelle	
  Rouvellou,	
  Upendra	
  
Sharma,	
  Ignacio	
  Silva-­‐‑Lepe,	
  Revathi	
  Subramanian,	
  Wei	
  Tan	
  and	
  Yangdong	
  Wang,	
  “A	
  PaaS	
  for	
  
Composite	
  Analytics	
  Solutions”,	
  The	
  Workshop	
  on	
  Analytics	
  Platforms	
  for	
  the	
  Cloud,	
  in	
  conjunction	
  
with	
  ACM	
  International	
  Conference	
  on	
  Computing	
  Frontiers,	
  May	
  18,	
  2015.	
  
8.   [CF	
  2015]:	
  Min	
  Li,	
  Jian	
  Tan,	
  Yandong	
  Wang,	
  Li	
  Zhang	
  and	
  Valentina	
  Salapura,	
  “SparkBench:	
  A	
  
Comprehensive	
  Benchmarking	
  Suite	
  for	
  In-­‐‑Memory	
  Data	
  Analytics	
  Platform	
  Spark”,	
  The	
  Workshop	
  
on	
  Analytics	
  Platforms	
  for	
  the	
  Cloud,	
  in	
  conjunction	
  with	
  ACM	
  International	
  Conference	
  on	
  
Computing	
  Frontiers,	
  May	
  18,	
  2015.	
  
9.   [TC	
  2015]:	
  Weikuan	
  Yu,	
  Yandong	
  Wang,	
  Xinyu	
  Que,	
  Cong	
  Xu,	
  “Virtual	
  Shuffling	
  for	
  Efficient	
  Data	
  
Movement	
  in	
  MapReduce”,	
  IEEE	
  Transactions	
  on	
  Computers	
  2015.	
  
10.   [SoCC	
  2014]:	
  Yandong	
  Wang,	
  Xiaoqiao	
  Meng,	
  Li	
  Zhang,	
  Jian	
  Tan,	
  “C-­‐‑Hint:	
  An	
  Effective	
  and	
  Reliable	
  
Cache	
  Management	
  for	
  RDMA-­‐‑Accelerated	
  Key-­‐‑Value	
  Stores”,	
  5th	
  ACM	
  Symposium	
  on	
  Cloud	
  
Computing,	
  Nov	
  3-­‐‑5,	
  2014,	
  Seattle,	
  WA,	
  USA.	
  
11.   [BigData	
  2014]:	
  Teng	
  Wang,	
  Sarp	
  Oral,	
  Yandong	
  Wang,	
  Brad	
  Settlemyer,	
  Scott	
  Atchley,	
  Weikuan	
  Yu,	
  
“Burstmem:	
  A	
  high-­‐‑performance	
  burst	
  buffer	
  system	
  for	
  scientific	
  applications”,	
  the	
  IEEE	
  
International	
  Conference	
  on	
  Big	
  Data	
  2014,	
  Oct	
  27-­‐‑30,	
  Washington	
  DC,	
  USA.	
  
12.   [Sigmetrics	
  2014]:	
  Jian	
  Tan,	
  Yandong	
  Wang,	
  Weikuan	
  Yu,	
  Li	
  Zhang,	
  “Non-­‐‑work-­‐‑conserving	
  Effects	
  
in	
  MapReduce:	
  Diffusion	
  Limit	
  and	
  Criticality”,	
  ACM	
  Sigmetrics	
  2014,	
  Jun	
  16-­‐‑20,	
  Austin,	
  TX,	
  USA.	
  
13.   [IPDPS	
  2014]:	
  Yandong	
  Wang,	
  Robin	
  Goldstone,	
  Weikuan	
  Yu,	
  Teng	
  Wang,	
  “Characterization	
  and	
  
Optimization	
  of	
  Memory-­‐‑Resident	
  MapReduce	
  on	
  HPC	
  Systems”,	
  28th	
  IEEE	
  International	
  Parallel	
  and	
  
Distributed	
  Processing	
  Symposium,	
  May	
  19-­‐‑23,	
  Phoenix,	
  AZ,	
  USA.	
  
14.   [TPDS	
  2014]:	
  Weikuan	
  Yu,	
  Yandong	
  Wang,	
  Xinyu	
  Que,	
  “Design	
  and	
  Evaluation	
  of	
  Network-­‐‑
Levitated	
  Merge	
  for	
  Hadoop	
  Acceleration”,	
  IEEE	
  Transactions	
  on	
  Parallel	
  and	
  Distributed	
  System,	
  
2014.	
  
15.   [WBDB’14]:	
  Yandong	
  Wang,	
  Yizheng	
  Jiao,	
  Cong	
  Xu,	
  Xiaobing	
  Li,	
  Teng	
  Wang,	
  Xinyu	
  Que,	
  Cristi	
  Cira,	
  
Bin	
  Wang,	
  Zhuo	
  Liu,	
  Bliss	
  Bailey,	
  Weikuan	
  Yu,	
  “Assessing	
  the	
  Performance	
  Impact	
  of	
  High-­‐‑Speed	
  
Interconnects	
  on	
  MapReduce	
  Programs”,	
  3rd	
  Workshop	
  on	
  Big	
  Data	
  Benchmarking	
  (Invited	
  Book	
  
Chapter),	
  Xi’an	
  China.	
  
16.   [Supercomputing	
  2013]:	
  Xiaobing	
  Li,	
  Yandong	
  Wang,	
  Yizheng	
  Jiao,	
  Cong	
  Xu,	
  Weikuan	
  Yu.	
  “CooMR:	
  
Cross-­‐‑Task	
  Coordination	
  for	
  Efficient	
  Data	
  Management	
  in	
  MapReduce	
  Programs”,	
  25th	
  IEEE/ACM	
  
The	
  International	
  Conference	
  for	
  High	
  Performance	
  Computing	
  Networking,	
  Storage	
  and	
  Analysis,	
  
Nov	
  17-­‐‑22,	
  2013,	
  Denver,	
  CO,	
  USA.	
  
17.   [ICCCN’13]:	
  Zhuo	
  Liu,	
  Bin	
  Wang,	
  Teng	
  Wang,	
  Yuan	
  Tian,	
  Cong	
  Xu,	
  Yandong	
  Wang,	
  Weikuan	
  Yu,	
  
“Profiling	
  and	
  Improving	
  I/O	
  Performance	
  of	
  a	
  Large-­‐‑Scale	
  Climate	
  Scientific	
  Application”,	
  22nd	
  
International	
  Conference	
  on	
  Computer	
  Communications	
  and	
  Networks,	
  July	
  30-­‐‑Aug	
  2,	
  2013,	
  Nassau,	
  
Bahamas.	
  
18.   [ICAC’13]:	
  Yandong.Wang,	
  JianTan,	
  Weikuan	
  Yu,	
  Li	
  Zhang,	
  Xiaoqiao	
  Meng.	
  “Preemptive	
  ReduceTask	
  
Scheduling	
  for	
  Fair	
  and	
  Fast	
  Job	
  Completion”.	
  10th	
  International	
  Conference	
  on	
  Autonomic	
  
Computing,	
  in	
  conjunction	
  with	
  2013	
  USENIX	
  Federated	
  Conferences.	
  Jun	
  26-­‐‑28,	
  2013,	
  San	
  Jose,	
  CA,	
  
USA.	
  
19.   [CCGRID'13]:	
  Cong	
  Xu,	
  Manjunath.	
  G.	
  Venkata,	
  Richard.	
  L.	
  Graham,	
  Yandong	
  Wang,	
  Zhuo	
  Liu,	
  
Weikuan	
  Yu.	
  “SLOAVx:	
  Scalable	
  LOgarithmic	
  AlltoallV	
  Algorithm	
  for	
  Hierarchical	
  Multicore	
  Systems”,	
  
The	
  13th	
  IEEE/ACM	
  International	
  Symposium	
  on	
  Cluster,	
  Cloud	
  and	
  Grid	
  Computing,	
  May	
  13-­‐‑
16,2013,	
  Delft,	
  Netherlands.	
  
20.   [IPDPS'13]:	
  Yandong.Wang,	
  Cong	
  Xu,	
  Xiaobing	
  Li,	
  Weikuan	
  Yu,	
  “JVM-­‐‑bypassing	
  Shuffling	
  for	
  Hadoop	
  
Acceleration”,	
  27th	
  IEEE	
  International	
  Parallel	
  and	
  Distributed	
  Processing	
  Symposium,	
  Boston,	
  MA,	
  
USA.	
  
21.   [MBDS'12]:	
  Xinyu	
  Que,	
  Yandong.Wang,	
  Cong	
  Xu,	
  Weikuan	
  Yu,	
  “Hierarchical	
  Merge	
  for	
  Scalable	
  
MapReduce”,	
  MBDS	
  in	
  conjunction	
  with	
  ICAC	
  2012,	
  San	
  Jose,	
  CA.	
  
22.   [SuperComputing	
  2011]:	
  Yandong.	
  Wang,	
  Xinyu	
  Que,	
  Weikuan	
  Yu,	
  Dror	
  Goldenberg,	
  Dhiraj.	
  Sehgal.	
  
“Hadoop	
  Acceleration	
  Through	
  Network	
  Levitated	
  Merge”.	
  23rd	
  IEEE/ACM	
  The	
  International	
  
Conference	
  for	
  High	
  Performance	
  Computing	
  Networking,	
  Storage	
  and	
  Analysis,	
  Nov	
  14-­‐‑17,	
  2011	
  
Seattle,	
  WA,	
  USA.	
  
23.   	
  [CLUSTER'11]:	
  Yuan	
  Tian,	
  Scott	
  Klasky,	
  Jay	
  Lofstead,	
  Ray	
  Grout,	
  Norbert	
  Podhorszki,	
  Qing	
  Liu,	
  
Yandong.	
  Wang,	
  Weikuan	
  Yu.	
  “EDO:	
  Improving	
  Read	
  Performance	
  for	
  Scientific	
  Applications	
  
Through	
  Elastic	
  Data	
  Organization”,	
  IEEE	
  International	
  Conference	
  on	
  Cluster	
  Computing,	
  Sep	
  26-­‐‑30,	
  
2011,	
  Austin,	
  Texas.	
  
	
  
24.   Ph.D.	
  Dissertation:	
  	
  
	
  
Efficient	
  Movement	
  and	
  Task	
  Management	
  in	
  MapReduce	
  for	
  Fast	
  Analytics	
  of	
  Big	
  Data	
  
	
  
25.  Master	
  Thesis:	
  
	
  
Nvidia	
  CUDA	
  Architecture-­‐‑based	
  Parallel	
  Incomplete	
  SAT	
  Solver	
  
	
  
VII.   PATENTS:	
  
•   A system and Method of Sharing Remote Pointers for RDMA-Enabled Key-Value Stores.
•   In-Memory Data Store Replication Through Remote Memory Sharing.
•   Cache Management for RDMA-Accelerated Data Stores.
•   A Method and System of Stage-Aware Performance Modeling for DAG-based Data Analytic Platforms.
•   Cache Management in RDMA-based Distributed Key-Value Store Using Atomic Operations.
•   Reducing Memory Consumption for In-Memory Data Stores Using Compression.
•   Restorable Memory Allocator.
•   A Method and System of Dynamic Memory and Cache Management in DAG-based Data Analytic Platforms.
•   A Method and System of Dynamic RDD Cache Management for In-Memory Data Analytics Platforms.
	
  
VIII.   RESEARCH	
  RECOGNITIONS	
  AND	
  AWARDS	
  
•   Outstanding Research Contribution to Hadoop Acceleration, 2011. Mellanox Technologies, Inc.	
  
•   Outstanding International Graduate Student Award from Auburn University.	
  
•   IPDPS 2013 Travel Award ($1000 award).	
  
•   SC 2011 Student Volunteer Award	
  
	
  
IX.   CONFERENCE	
  PRESENTATIONS	
  
•   [SuperComputing	
  2015]:	
  HydraDB:	
  A	
  Resilient	
  RDMA-­‐‑driven	
  Key-­‐‑Value	
  Middleware	
  for	
  Large-­‐‑scale	
  In-­‐‑
Memory	
  Cluster	
  Computing,	
  Nov	
  15-­‐‑20,	
  2015	
  
Links: http://sc15.supercomputing.org/schedule/event_detail?evid=pap231	
  	
  
•   [HPDC	
  2015]:	
  HydraDB:	
  A	
  Resilient	
  RDMA-­‐‑driven	
  Key-­‐‑Value	
  Middleware	
  for	
  In-­‐‑Memory	
  Cluster	
  
Computing,	
  Portland,	
  Oregon,	
  June	
  15-­‐‑19.	
  2015	
  
•   [SoCC 2014]: C-Hint: Cache Management for RDMA-Accelerated Key-Value Stores, ACM SoCC
Conference, Nov 3-5 2014, Seattle, WA, USA.	
  
Links: https://sites.google.com/site/2014socc/home/program
•   [USENIX ICAC 2013]: Preemptive ReduceTask Scheduling for Fair and Fast Job Completion, June 26-28
2013, San Jose, CA, USA.	
  
Links: https://www.usenix.org/conference/icac13/technical-sessions/presentation/wang_yandong
•   [IPDPS 2013]: JVM-bypass for Efficient Hadoop Shuffling, May 20-24 2013 Boston, MA, USA.	
  
	
  
X.   Invited	
  Talks	
  
•   2015 Computer Science Seminar: HydraDB: In-Memory Key-Value Store at IBM, Auburn University	
  
links: http://www.cyber.auburn.edu/calendar/2015/02/seminar-hydradb-in-memory-key-value-store-at-ibm.html
•   Oak Ridge National Lab Seminar: Optimizing MapReduce for Big Data Analytics, Aug 2013	
  
•   IBM Seminar: Hadoop Acceleration Through Network-Levitated Merging, Hawthorn, NY, May 2012	
  
•   IBM Seminar: Preemptive ReduceTask Scheduling For Fair and Fast Job Completion, Aug 2012	
  
	
  
XI.   PROFESSIONAL	
  ACTIVITY	
  
•   Member	
  of	
  Association	
  for	
  Computing	
  Machinery,	
  #5425288	
  
	
  
•   Journal	
  Reviewers:	
  	
  
1.   IEEE	
  Transactions	
  on	
  Parallel	
  and	
  Distributed	
  Systems.	
  
2.   International	
  Journal	
  of	
  High	
  Performance	
  Computing.	
  
	
  
•   Program	
  Committee:	
  
1.   2015	
  The	
  International	
  Workshop	
  on	
  Data-­‐‑Intensive	
  Scalable	
  Computing	
  Systems,	
  in	
  conjunction	
  with	
  SC15.	
  
2.   2015	
  1st	
  IEEE	
  Workshop	
  on	
  Internet	
  Scale	
  Cloud	
  Computing.	
  
3.   2014	
  The	
  International	
  Workshop	
  on	
  Data-­‐‑Intensive	
  Scalable	
  Computing	
  Systems,	
  in	
  conjunction	
  with	
  SC14.	
  
4.   2014	
  3rd	
  International	
  Workshop	
  on	
  Collaborative	
  Cloud.	
  
	
  
•   Conference	
  Reviewers:	
  
1.   2015	
  EuroSys	
  Conference.	
  
2.   2015	
  The	
  International	
  Workshop	
  on	
  Data-­‐‑Intensive	
  Scalable	
  Computing	
  Systems,	
  in	
  conjunction	
  with	
  SC15.	
  
3.   2015	
  1st	
  IEEE	
  Workshop	
  on	
  Internet	
  Scale	
  Cloud	
  Computing.	
  
4.   2014	
  The	
  International	
  Workshop	
  on	
  Data-­‐‑Intensive	
  Scalable	
  Computing	
  Systems,	
  in	
  conjunction	
  with	
  SC14.	
  
5.   2014	
  3rd	
  International	
  Workshop	
  on	
  Collaborative	
  Cloud.	
  
6.   2012	
  31st	
  IEEE	
  International	
  Performance	
  Computing	
  and	
  Communications	
  Conferences.	
  

More Related Content

Similar to YandongWang_Resume

YangHu-CV-Nov2016
YangHu-CV-Nov2016YangHu-CV-Nov2016
YangHu-CV-Nov2016Yang Hu
 
Benchmarking graph databases on the problem of community detection
Benchmarking graph databases on the problem of community detectionBenchmarking graph databases on the problem of community detection
Benchmarking graph databases on the problem of community detectionSotiris Beis
 
Benchmarking graph databases on the problem of community detection
Benchmarking graph databases on the problem of community detectionBenchmarking graph databases on the problem of community detection
Benchmarking graph databases on the problem of community detectionSymeon Papadopoulos
 
MACHINE LEARNING ON MAPREDUCE FRAMEWORK
MACHINE LEARNING ON MAPREDUCE FRAMEWORKMACHINE LEARNING ON MAPREDUCE FRAMEWORK
MACHINE LEARNING ON MAPREDUCE FRAMEWORKAbhi Jit
 
Zejia_CV_final
Zejia_CV_finalZejia_CV_final
Zejia_CV_finalZJ Zheng
 
RashmiTongeRResume
RashmiTongeRResumeRashmiTongeRResume
RashmiTongeRResumeRashmi Tonge
 
Cost savings from auto-scaling of network resources using machine learning
Cost savings from auto-scaling of network resources using machine learningCost savings from auto-scaling of network resources using machine learning
Cost savings from auto-scaling of network resources using machine learningSabidur Rahman
 
mapreduce_presentation
mapreduce_presentationmapreduce_presentation
mapreduce_presentationAdam Martini
 
manoj resume new version
manoj resume new versionmanoj resume new version
manoj resume new versionOswalt Manoj
 
Top cited articles 2020 - Advanced Computational Intelligence: An Internation...
Top cited articles 2020 - Advanced Computational Intelligence: An Internation...Top cited articles 2020 - Advanced Computational Intelligence: An Internation...
Top cited articles 2020 - Advanced Computational Intelligence: An Internation...aciijournal
 
research Paper face recognition attendance system
research Paper face recognition attendance systemresearch Paper face recognition attendance system
research Paper face recognition attendance systemAnkitRao82
 
Pankaj rajanresume2014
Pankaj rajanresume2014Pankaj rajanresume2014
Pankaj rajanresume2014Pankaj Rajan
 
sourabh_bajaj_resume
sourabh_bajaj_resumesourabh_bajaj_resume
sourabh_bajaj_resumeYipei Wang
 
A Maturing Role of Workflows in the Presence of Heterogenous Computing Archit...
A Maturing Role of Workflows in the Presence of Heterogenous Computing Archit...A Maturing Role of Workflows in the Presence of Heterogenous Computing Archit...
A Maturing Role of Workflows in the Presence of Heterogenous Computing Archit...Ilkay Altintas, Ph.D.
 
January 2024 - Top 10 Read Articles in International Journal of Artificial In...
January 2024 - Top 10 Read Articles in International Journal of Artificial In...January 2024 - Top 10 Read Articles in International Journal of Artificial In...
January 2024 - Top 10 Read Articles in International Journal of Artificial In...gerogepatton
 

Similar to YandongWang_Resume (20)

YangHu-CV-Nov2016
YangHu-CV-Nov2016YangHu-CV-Nov2016
YangHu-CV-Nov2016
 
CV
CVCV
CV
 
Benchmarking graph databases on the problem of community detection
Benchmarking graph databases on the problem of community detectionBenchmarking graph databases on the problem of community detection
Benchmarking graph databases on the problem of community detection
 
Benchmarking graph databases on the problem of community detection
Benchmarking graph databases on the problem of community detectionBenchmarking graph databases on the problem of community detection
Benchmarking graph databases on the problem of community detection
 
MACHINE LEARNING ON MAPREDUCE FRAMEWORK
MACHINE LEARNING ON MAPREDUCE FRAMEWORKMACHINE LEARNING ON MAPREDUCE FRAMEWORK
MACHINE LEARNING ON MAPREDUCE FRAMEWORK
 
Zejia_CV_final
Zejia_CV_finalZejia_CV_final
Zejia_CV_final
 
Observlets
Observlets Observlets
Observlets
 
RashmiTongeRResume
RashmiTongeRResumeRashmiTongeRResume
RashmiTongeRResume
 
Cost savings from auto-scaling of network resources using machine learning
Cost savings from auto-scaling of network resources using machine learningCost savings from auto-scaling of network resources using machine learning
Cost savings from auto-scaling of network resources using machine learning
 
CV.pdf
CV.pdfCV.pdf
CV.pdf
 
CV.pdf
CV.pdfCV.pdf
CV.pdf
 
mapreduce_presentation
mapreduce_presentationmapreduce_presentation
mapreduce_presentation
 
manoj resume new version
manoj resume new versionmanoj resume new version
manoj resume new version
 
Top cited articles 2020 - Advanced Computational Intelligence: An Internation...
Top cited articles 2020 - Advanced Computational Intelligence: An Internation...Top cited articles 2020 - Advanced Computational Intelligence: An Internation...
Top cited articles 2020 - Advanced Computational Intelligence: An Internation...
 
research Paper face recognition attendance system
research Paper face recognition attendance systemresearch Paper face recognition attendance system
research Paper face recognition attendance system
 
Pankaj rajanresume2014
Pankaj rajanresume2014Pankaj rajanresume2014
Pankaj rajanresume2014
 
sourabh_bajaj_resume
sourabh_bajaj_resumesourabh_bajaj_resume
sourabh_bajaj_resume
 
A Maturing Role of Workflows in the Presence of Heterogenous Computing Archit...
A Maturing Role of Workflows in the Presence of Heterogenous Computing Archit...A Maturing Role of Workflows in the Presence of Heterogenous Computing Archit...
A Maturing Role of Workflows in the Presence of Heterogenous Computing Archit...
 
Publication
PublicationPublication
Publication
 
January 2024 - Top 10 Read Articles in International Journal of Artificial In...
January 2024 - Top 10 Read Articles in International Journal of Artificial In...January 2024 - Top 10 Read Articles in International Journal of Artificial In...
January 2024 - Top 10 Read Articles in International Journal of Artificial In...
 

YandongWang_Resume

  • 1. YANDONG  WANG     I.   CONTACT  INFO   IBM Thomas J. Watson Research Center   Cognitive System Analysis and Optimization Group 1101 Kitchawan Rd, Yorktown Height, NY 10598, USA Office 10-134 Email: roydrax@gmail.com, Phone: +1-585-770-8705   Website: researcher.ibm.com/researcher/view.php?person=us-yandong     II.   INTERESTS   Distributed Machine Learning; Computer Systems; big data analytics frameworks; In-memory Key-Value stores; distributed machine learning, high-performance computing; cloud computing; data-center networking.     III.   EDUCATION   •   Ph.D. in Computer Science Sep. 2010 – May. 2014   Auburn University, Auburn, Alabama, USA.   Advisor: Prof. Weikuan Yu Dissertation: “Efficient Movement and Task Management in MapReduce for Fast Analytics of Big Data”.     •   Master. in Computer Science Sep. 2008 – Aug. 2010   Rochester Institute of Technology, Rochester, New York, USA.   Advisor: Prof. Alan Kaminsky Thesis: “Nvidia CUDA Architecture-based Parallel Incomplete SAT Solver”.     •   Bachelor. in Traffic Engineering and Information Technology Sep. 2002 – Aug. 2006   TongJi University (Shanghai, China)     IV.   EMPLOYMENT  EXPERIENCE   •   Research Staff Member March. 2014 – Present   Cognitive System Analysis and Optimization Group IBM Thomas J. Watson Research Center, Yorktown Height, NY, USA.   •   Research Assistant Sep. 2010 – May. 2014   Parallel Architecture and System Lab Auburn University, AL   Advisor: Prof. Weikuan Yu     •   Internship at Lawrence Livermore National Lab May. 2013 – Aug. 2013       HPC Strategic Planning and Architecture Livermore, CA     •   Internship at IBM T.J Watson Research Center May. 2012 – Aug. 2012   System Analysis and Optimization Group Hawthorn, NY     •   Internship at Oak Ridge National Laboratory June. 2011 – Aug. 2011   Technology Integration Group Knoxville, TN   V.   RESEARCH  EXPERIENCE   •   Research Staff Member March. 2014 – Present   IBM Thomas J. Watson Research Center, Yorktown Height o   Analyzing  the  performance  characteristics  of  a  variety  of  big  data  analytics  frameworks.   o   Designing  and  implementing  large-­‐‑scale  machine  learning  frameworks.   o   Designing  and  implementing  next-­‐‑generation  in-­‐‑memory  key-­‐‑value  stores.  
  • 2. o   Optimizing  cognitive  computing  platforms.   o   Disseminating,  presenting  the  research  results  in  conferences  and  journals.       •   Research Intern at Lawrence Livermore National Lab May. 2013 – Aug. 2013   o   Analyzing  and  optimizing  the  performance  of  Spark  on  HPC  platforms.   o   Exploring  the  hybrid  memory  approaches  to  improving  Spark.     •   Research Intern at IBM Thomas J. Watson Research May. 2012 – Aug. 2012   o   Investigating  the  performance  of  Hadoop  MapReduce  schedulers.   o   Designing  and  implementing  preemptive  reduce  task  scheduler.     •   Research Intern at Oak Ridge National Lab June. 2011 – Aug. 2011   o   Examining the performance of Lustre File System metadata management.   o   Designing and implementing Lustre metadata simulator.     •   Research Assistant at Auburn University Sep. 2010 – Mar. 2014   o   Investigating the performance of MapReduce over high-performance networks.   o   Investigating the performance of MapReduce resource management and contention management.   o   Designing and implementing network levitated merge for Hadoop.   o   Designing and implementing techniques to enhance the fault-tolerance for MapReduce systems.   o   Examining and optimizing scientific applications on commodity cluster.     VI.   PUBLICATIONS   1.   [Eurosys  2016]:  Xingbo  Wu,  Li  Zhang,  Yandong  Wang,  Yufei  Ren,  Michel  Hack,  Song  Jiang,   “zExpander:  a  Key-­‐‑Value  Cache  with  both  High  Performance  and  Fewer  Misses”.  The  11th  European   Conference  on  Computer  Systems.     2.   [IPDPS  2016]:  Luna  Xu,  Min  Li,  Li  Zhang,  Ali  R.  Butt,  Yandong  Wang,  Zane  Hu,  “MEMTUNE:  Dynamic   Memory  Management  for  In-­‐‑Memory  Data  Analytics  Platforms”.  30th  IEEE  International  Parallel  and   Distributed  Processing  Symposium.   3.   [Supercomputing  2015]:  Yandong  Wang,  Li  Zhang,  Jian  Tan,  Min  Li,  Yuqing  Gao,  Xavier  Guerin,   Xiaoqiao  Meng,  Shicong  Meng,  “HydraDB:  A  Resilient  RDMA-­‐‑driven  Key-­‐‑Value  Middleware  for  In-­‐‑ Memory  Cluster  Computing”.  27th  IEEE/ACM  The  International  Conference  for  High  Performance   Computing  Networking,  Storage  and  Analysis,  Nov  15-­‐‑20,  2015,  Austin,  TX.   4.   [ACM  Sigmetrics  Performance  Evaluation  Review  2015]:  Jian  Tan,  Li  Zhang,  Yandong  Wang,   “Miss  Behavior  for  Caching  with  Lease”.   5.   [ACM  Sigmetrics  Performance  Evaluation  Review  2015]:  Jian  Tan,  Li  Zhang,  Min  Li,  Yandong   Wang,  “Multi-­‐‑resource  Fair  Sharing  for  Multiclass  Workflows”.   6.   [IPDPS  2015]:  Yandong  Wang,  Huansong  Fu,  Weikuan  Yu,  “Cracking  Down  MapReduce  Failure   Amplification  through  Analytics  Logging  and  Migration”,  29th  IEEE  International  Parallel  and   Distributed  Processing  Symposium,  May  25-­‐‑29,  2015,  Hyderabad,  India.   7.   [CF  2015]:    Paula  Austel,  Han  Chen,  Parijat  Dube,  Thomas  Mikalsen,  Isabelle  Rouvellou,  Upendra   Sharma,  Ignacio  Silva-­‐‑Lepe,  Revathi  Subramanian,  Wei  Tan  and  Yangdong  Wang,  “A  PaaS  for   Composite  Analytics  Solutions”,  The  Workshop  on  Analytics  Platforms  for  the  Cloud,  in  conjunction   with  ACM  International  Conference  on  Computing  Frontiers,  May  18,  2015.   8.   [CF  2015]:  Min  Li,  Jian  Tan,  Yandong  Wang,  Li  Zhang  and  Valentina  Salapura,  “SparkBench:  A   Comprehensive  Benchmarking  Suite  for  In-­‐‑Memory  Data  Analytics  Platform  Spark”,  The  Workshop   on  Analytics  Platforms  for  the  Cloud,  in  conjunction  with  ACM  International  Conference  on   Computing  Frontiers,  May  18,  2015.   9.   [TC  2015]:  Weikuan  Yu,  Yandong  Wang,  Xinyu  Que,  Cong  Xu,  “Virtual  Shuffling  for  Efficient  Data   Movement  in  MapReduce”,  IEEE  Transactions  on  Computers  2015.   10.   [SoCC  2014]:  Yandong  Wang,  Xiaoqiao  Meng,  Li  Zhang,  Jian  Tan,  “C-­‐‑Hint:  An  Effective  and  Reliable   Cache  Management  for  RDMA-­‐‑Accelerated  Key-­‐‑Value  Stores”,  5th  ACM  Symposium  on  Cloud   Computing,  Nov  3-­‐‑5,  2014,  Seattle,  WA,  USA.  
  • 3. 11.   [BigData  2014]:  Teng  Wang,  Sarp  Oral,  Yandong  Wang,  Brad  Settlemyer,  Scott  Atchley,  Weikuan  Yu,   “Burstmem:  A  high-­‐‑performance  burst  buffer  system  for  scientific  applications”,  the  IEEE   International  Conference  on  Big  Data  2014,  Oct  27-­‐‑30,  Washington  DC,  USA.   12.   [Sigmetrics  2014]:  Jian  Tan,  Yandong  Wang,  Weikuan  Yu,  Li  Zhang,  “Non-­‐‑work-­‐‑conserving  Effects   in  MapReduce:  Diffusion  Limit  and  Criticality”,  ACM  Sigmetrics  2014,  Jun  16-­‐‑20,  Austin,  TX,  USA.   13.   [IPDPS  2014]:  Yandong  Wang,  Robin  Goldstone,  Weikuan  Yu,  Teng  Wang,  “Characterization  and   Optimization  of  Memory-­‐‑Resident  MapReduce  on  HPC  Systems”,  28th  IEEE  International  Parallel  and   Distributed  Processing  Symposium,  May  19-­‐‑23,  Phoenix,  AZ,  USA.   14.   [TPDS  2014]:  Weikuan  Yu,  Yandong  Wang,  Xinyu  Que,  “Design  and  Evaluation  of  Network-­‐‑ Levitated  Merge  for  Hadoop  Acceleration”,  IEEE  Transactions  on  Parallel  and  Distributed  System,   2014.   15.   [WBDB’14]:  Yandong  Wang,  Yizheng  Jiao,  Cong  Xu,  Xiaobing  Li,  Teng  Wang,  Xinyu  Que,  Cristi  Cira,   Bin  Wang,  Zhuo  Liu,  Bliss  Bailey,  Weikuan  Yu,  “Assessing  the  Performance  Impact  of  High-­‐‑Speed   Interconnects  on  MapReduce  Programs”,  3rd  Workshop  on  Big  Data  Benchmarking  (Invited  Book   Chapter),  Xi’an  China.   16.   [Supercomputing  2013]:  Xiaobing  Li,  Yandong  Wang,  Yizheng  Jiao,  Cong  Xu,  Weikuan  Yu.  “CooMR:   Cross-­‐‑Task  Coordination  for  Efficient  Data  Management  in  MapReduce  Programs”,  25th  IEEE/ACM   The  International  Conference  for  High  Performance  Computing  Networking,  Storage  and  Analysis,   Nov  17-­‐‑22,  2013,  Denver,  CO,  USA.   17.   [ICCCN’13]:  Zhuo  Liu,  Bin  Wang,  Teng  Wang,  Yuan  Tian,  Cong  Xu,  Yandong  Wang,  Weikuan  Yu,   “Profiling  and  Improving  I/O  Performance  of  a  Large-­‐‑Scale  Climate  Scientific  Application”,  22nd   International  Conference  on  Computer  Communications  and  Networks,  July  30-­‐‑Aug  2,  2013,  Nassau,   Bahamas.   18.   [ICAC’13]:  Yandong.Wang,  JianTan,  Weikuan  Yu,  Li  Zhang,  Xiaoqiao  Meng.  “Preemptive  ReduceTask   Scheduling  for  Fair  and  Fast  Job  Completion”.  10th  International  Conference  on  Autonomic   Computing,  in  conjunction  with  2013  USENIX  Federated  Conferences.  Jun  26-­‐‑28,  2013,  San  Jose,  CA,   USA.   19.   [CCGRID'13]:  Cong  Xu,  Manjunath.  G.  Venkata,  Richard.  L.  Graham,  Yandong  Wang,  Zhuo  Liu,   Weikuan  Yu.  “SLOAVx:  Scalable  LOgarithmic  AlltoallV  Algorithm  for  Hierarchical  Multicore  Systems”,   The  13th  IEEE/ACM  International  Symposium  on  Cluster,  Cloud  and  Grid  Computing,  May  13-­‐‑ 16,2013,  Delft,  Netherlands.   20.   [IPDPS'13]:  Yandong.Wang,  Cong  Xu,  Xiaobing  Li,  Weikuan  Yu,  “JVM-­‐‑bypassing  Shuffling  for  Hadoop   Acceleration”,  27th  IEEE  International  Parallel  and  Distributed  Processing  Symposium,  Boston,  MA,   USA.   21.   [MBDS'12]:  Xinyu  Que,  Yandong.Wang,  Cong  Xu,  Weikuan  Yu,  “Hierarchical  Merge  for  Scalable   MapReduce”,  MBDS  in  conjunction  with  ICAC  2012,  San  Jose,  CA.   22.   [SuperComputing  2011]:  Yandong.  Wang,  Xinyu  Que,  Weikuan  Yu,  Dror  Goldenberg,  Dhiraj.  Sehgal.   “Hadoop  Acceleration  Through  Network  Levitated  Merge”.  23rd  IEEE/ACM  The  International   Conference  for  High  Performance  Computing  Networking,  Storage  and  Analysis,  Nov  14-­‐‑17,  2011   Seattle,  WA,  USA.   23.    [CLUSTER'11]:  Yuan  Tian,  Scott  Klasky,  Jay  Lofstead,  Ray  Grout,  Norbert  Podhorszki,  Qing  Liu,   Yandong.  Wang,  Weikuan  Yu.  “EDO:  Improving  Read  Performance  for  Scientific  Applications   Through  Elastic  Data  Organization”,  IEEE  International  Conference  on  Cluster  Computing,  Sep  26-­‐‑30,   2011,  Austin,  Texas.     24.   Ph.D.  Dissertation:       Efficient  Movement  and  Task  Management  in  MapReduce  for  Fast  Analytics  of  Big  Data     25.  Master  Thesis:     Nvidia  CUDA  Architecture-­‐‑based  Parallel  Incomplete  SAT  Solver     VII.   PATENTS:   •   A system and Method of Sharing Remote Pointers for RDMA-Enabled Key-Value Stores.
  • 4. •   In-Memory Data Store Replication Through Remote Memory Sharing. •   Cache Management for RDMA-Accelerated Data Stores. •   A Method and System of Stage-Aware Performance Modeling for DAG-based Data Analytic Platforms. •   Cache Management in RDMA-based Distributed Key-Value Store Using Atomic Operations. •   Reducing Memory Consumption for In-Memory Data Stores Using Compression. •   Restorable Memory Allocator. •   A Method and System of Dynamic Memory and Cache Management in DAG-based Data Analytic Platforms. •   A Method and System of Dynamic RDD Cache Management for In-Memory Data Analytics Platforms.   VIII.   RESEARCH  RECOGNITIONS  AND  AWARDS   •   Outstanding Research Contribution to Hadoop Acceleration, 2011. Mellanox Technologies, Inc.   •   Outstanding International Graduate Student Award from Auburn University.   •   IPDPS 2013 Travel Award ($1000 award).   •   SC 2011 Student Volunteer Award     IX.   CONFERENCE  PRESENTATIONS   •   [SuperComputing  2015]:  HydraDB:  A  Resilient  RDMA-­‐‑driven  Key-­‐‑Value  Middleware  for  Large-­‐‑scale  In-­‐‑ Memory  Cluster  Computing,  Nov  15-­‐‑20,  2015   Links: http://sc15.supercomputing.org/schedule/event_detail?evid=pap231     •   [HPDC  2015]:  HydraDB:  A  Resilient  RDMA-­‐‑driven  Key-­‐‑Value  Middleware  for  In-­‐‑Memory  Cluster   Computing,  Portland,  Oregon,  June  15-­‐‑19.  2015   •   [SoCC 2014]: C-Hint: Cache Management for RDMA-Accelerated Key-Value Stores, ACM SoCC Conference, Nov 3-5 2014, Seattle, WA, USA.   Links: https://sites.google.com/site/2014socc/home/program •   [USENIX ICAC 2013]: Preemptive ReduceTask Scheduling for Fair and Fast Job Completion, June 26-28 2013, San Jose, CA, USA.   Links: https://www.usenix.org/conference/icac13/technical-sessions/presentation/wang_yandong •   [IPDPS 2013]: JVM-bypass for Efficient Hadoop Shuffling, May 20-24 2013 Boston, MA, USA.     X.   Invited  Talks   •   2015 Computer Science Seminar: HydraDB: In-Memory Key-Value Store at IBM, Auburn University   links: http://www.cyber.auburn.edu/calendar/2015/02/seminar-hydradb-in-memory-key-value-store-at-ibm.html •   Oak Ridge National Lab Seminar: Optimizing MapReduce for Big Data Analytics, Aug 2013   •   IBM Seminar: Hadoop Acceleration Through Network-Levitated Merging, Hawthorn, NY, May 2012   •   IBM Seminar: Preemptive ReduceTask Scheduling For Fair and Fast Job Completion, Aug 2012     XI.   PROFESSIONAL  ACTIVITY   •   Member  of  Association  for  Computing  Machinery,  #5425288     •   Journal  Reviewers:     1.   IEEE  Transactions  on  Parallel  and  Distributed  Systems.   2.   International  Journal  of  High  Performance  Computing.     •   Program  Committee:   1.   2015  The  International  Workshop  on  Data-­‐‑Intensive  Scalable  Computing  Systems,  in  conjunction  with  SC15.   2.   2015  1st  IEEE  Workshop  on  Internet  Scale  Cloud  Computing.   3.   2014  The  International  Workshop  on  Data-­‐‑Intensive  Scalable  Computing  Systems,  in  conjunction  with  SC14.   4.   2014  3rd  International  Workshop  on  Collaborative  Cloud.     •   Conference  Reviewers:   1.   2015  EuroSys  Conference.   2.   2015  The  International  Workshop  on  Data-­‐‑Intensive  Scalable  Computing  Systems,  in  conjunction  with  SC15.   3.   2015  1st  IEEE  Workshop  on  Internet  Scale  Cloud  Computing.   4.   2014  The  International  Workshop  on  Data-­‐‑Intensive  Scalable  Computing  Systems,  in  conjunction  with  SC14.  
  • 5. 5.   2014  3rd  International  Workshop  on  Collaborative  Cloud.   6.   2012  31st  IEEE  International  Performance  Computing  and  Communications  Conferences.