January 2024 - Top 10 Read Articles in International Journal of Artificial In...
YandongWang_Resume
1. YANDONG
WANG
I. CONTACT
INFO
IBM Thomas J. Watson Research Center
Cognitive System Analysis and Optimization Group
1101 Kitchawan Rd, Yorktown Height, NY 10598, USA
Office 10-134
Email: roydrax@gmail.com, Phone: +1-585-770-8705
Website: researcher.ibm.com/researcher/view.php?person=us-yandong
II. INTERESTS
Distributed Machine Learning; Computer Systems; big data analytics frameworks; In-memory Key-Value
stores; distributed machine learning, high-performance computing; cloud computing; data-center networking.
III. EDUCATION
• Ph.D. in Computer Science Sep. 2010 – May. 2014
Auburn University, Auburn, Alabama, USA.
Advisor: Prof. Weikuan Yu
Dissertation: “Efficient Movement and Task Management in MapReduce for Fast Analytics of Big Data”.
• Master. in Computer Science Sep. 2008 – Aug. 2010
Rochester Institute of Technology, Rochester, New York, USA.
Advisor: Prof. Alan Kaminsky
Thesis: “Nvidia CUDA Architecture-based Parallel Incomplete SAT Solver”.
• Bachelor. in Traffic Engineering and Information Technology Sep. 2002 – Aug. 2006
TongJi University (Shanghai, China)
IV. EMPLOYMENT
EXPERIENCE
• Research Staff Member March. 2014 – Present
Cognitive System Analysis and Optimization Group
IBM Thomas J. Watson Research Center, Yorktown Height, NY, USA.
• Research Assistant Sep. 2010 – May. 2014
Parallel Architecture and System Lab Auburn University, AL
Advisor: Prof. Weikuan Yu
• Internship at Lawrence Livermore National Lab May. 2013 – Aug. 2013
HPC Strategic Planning and Architecture Livermore, CA
• Internship at IBM T.J Watson Research Center May. 2012 – Aug. 2012
System Analysis and Optimization Group Hawthorn, NY
• Internship at Oak Ridge National Laboratory June. 2011 – Aug. 2011
Technology Integration Group Knoxville, TN
V. RESEARCH
EXPERIENCE
• Research Staff Member March. 2014 – Present
IBM Thomas J. Watson Research Center, Yorktown Height
o Analyzing
the
performance
characteristics
of
a
variety
of
big
data
analytics
frameworks.
o Designing
and
implementing
large-‐‑scale
machine
learning
frameworks.
o Designing
and
implementing
next-‐‑generation
in-‐‑memory
key-‐‑value
stores.
2. o Optimizing
cognitive
computing
platforms.
o Disseminating,
presenting
the
research
results
in
conferences
and
journals.
• Research Intern at Lawrence Livermore National Lab May. 2013 – Aug. 2013
o Analyzing
and
optimizing
the
performance
of
Spark
on
HPC
platforms.
o Exploring
the
hybrid
memory
approaches
to
improving
Spark.
• Research Intern at IBM Thomas J. Watson Research May. 2012 – Aug. 2012
o Investigating
the
performance
of
Hadoop
MapReduce
schedulers.
o Designing
and
implementing
preemptive
reduce
task
scheduler.
• Research Intern at Oak Ridge National Lab June. 2011 – Aug. 2011
o Examining the performance of Lustre File System metadata management.
o Designing and implementing Lustre metadata simulator.
• Research Assistant at Auburn University Sep. 2010 – Mar. 2014
o Investigating the performance of MapReduce over high-performance networks.
o Investigating the performance of MapReduce resource management and contention management.
o Designing and implementing network levitated merge for Hadoop.
o Designing and implementing techniques to enhance the fault-tolerance for MapReduce systems.
o Examining and optimizing scientific applications on commodity cluster.
VI. PUBLICATIONS
1. [Eurosys
2016]:
Xingbo
Wu,
Li
Zhang,
Yandong
Wang,
Yufei
Ren,
Michel
Hack,
Song
Jiang,
“zExpander:
a
Key-‐‑Value
Cache
with
both
High
Performance
and
Fewer
Misses”.
The
11th
European
Conference
on
Computer
Systems.
2. [IPDPS
2016]:
Luna
Xu,
Min
Li,
Li
Zhang,
Ali
R.
Butt,
Yandong
Wang,
Zane
Hu,
“MEMTUNE:
Dynamic
Memory
Management
for
In-‐‑Memory
Data
Analytics
Platforms”.
30th
IEEE
International
Parallel
and
Distributed
Processing
Symposium.
3. [Supercomputing
2015]:
Yandong
Wang,
Li
Zhang,
Jian
Tan,
Min
Li,
Yuqing
Gao,
Xavier
Guerin,
Xiaoqiao
Meng,
Shicong
Meng,
“HydraDB:
A
Resilient
RDMA-‐‑driven
Key-‐‑Value
Middleware
for
In-‐‑
Memory
Cluster
Computing”.
27th
IEEE/ACM
The
International
Conference
for
High
Performance
Computing
Networking,
Storage
and
Analysis,
Nov
15-‐‑20,
2015,
Austin,
TX.
4. [ACM
Sigmetrics
Performance
Evaluation
Review
2015]:
Jian
Tan,
Li
Zhang,
Yandong
Wang,
“Miss
Behavior
for
Caching
with
Lease”.
5. [ACM
Sigmetrics
Performance
Evaluation
Review
2015]:
Jian
Tan,
Li
Zhang,
Min
Li,
Yandong
Wang,
“Multi-‐‑resource
Fair
Sharing
for
Multiclass
Workflows”.
6. [IPDPS
2015]:
Yandong
Wang,
Huansong
Fu,
Weikuan
Yu,
“Cracking
Down
MapReduce
Failure
Amplification
through
Analytics
Logging
and
Migration”,
29th
IEEE
International
Parallel
and
Distributed
Processing
Symposium,
May
25-‐‑29,
2015,
Hyderabad,
India.
7. [CF
2015]:
Paula
Austel,
Han
Chen,
Parijat
Dube,
Thomas
Mikalsen,
Isabelle
Rouvellou,
Upendra
Sharma,
Ignacio
Silva-‐‑Lepe,
Revathi
Subramanian,
Wei
Tan
and
Yangdong
Wang,
“A
PaaS
for
Composite
Analytics
Solutions”,
The
Workshop
on
Analytics
Platforms
for
the
Cloud,
in
conjunction
with
ACM
International
Conference
on
Computing
Frontiers,
May
18,
2015.
8. [CF
2015]:
Min
Li,
Jian
Tan,
Yandong
Wang,
Li
Zhang
and
Valentina
Salapura,
“SparkBench:
A
Comprehensive
Benchmarking
Suite
for
In-‐‑Memory
Data
Analytics
Platform
Spark”,
The
Workshop
on
Analytics
Platforms
for
the
Cloud,
in
conjunction
with
ACM
International
Conference
on
Computing
Frontiers,
May
18,
2015.
9. [TC
2015]:
Weikuan
Yu,
Yandong
Wang,
Xinyu
Que,
Cong
Xu,
“Virtual
Shuffling
for
Efficient
Data
Movement
in
MapReduce”,
IEEE
Transactions
on
Computers
2015.
10. [SoCC
2014]:
Yandong
Wang,
Xiaoqiao
Meng,
Li
Zhang,
Jian
Tan,
“C-‐‑Hint:
An
Effective
and
Reliable
Cache
Management
for
RDMA-‐‑Accelerated
Key-‐‑Value
Stores”,
5th
ACM
Symposium
on
Cloud
Computing,
Nov
3-‐‑5,
2014,
Seattle,
WA,
USA.
3. 11. [BigData
2014]:
Teng
Wang,
Sarp
Oral,
Yandong
Wang,
Brad
Settlemyer,
Scott
Atchley,
Weikuan
Yu,
“Burstmem:
A
high-‐‑performance
burst
buffer
system
for
scientific
applications”,
the
IEEE
International
Conference
on
Big
Data
2014,
Oct
27-‐‑30,
Washington
DC,
USA.
12. [Sigmetrics
2014]:
Jian
Tan,
Yandong
Wang,
Weikuan
Yu,
Li
Zhang,
“Non-‐‑work-‐‑conserving
Effects
in
MapReduce:
Diffusion
Limit
and
Criticality”,
ACM
Sigmetrics
2014,
Jun
16-‐‑20,
Austin,
TX,
USA.
13. [IPDPS
2014]:
Yandong
Wang,
Robin
Goldstone,
Weikuan
Yu,
Teng
Wang,
“Characterization
and
Optimization
of
Memory-‐‑Resident
MapReduce
on
HPC
Systems”,
28th
IEEE
International
Parallel
and
Distributed
Processing
Symposium,
May
19-‐‑23,
Phoenix,
AZ,
USA.
14. [TPDS
2014]:
Weikuan
Yu,
Yandong
Wang,
Xinyu
Que,
“Design
and
Evaluation
of
Network-‐‑
Levitated
Merge
for
Hadoop
Acceleration”,
IEEE
Transactions
on
Parallel
and
Distributed
System,
2014.
15. [WBDB’14]:
Yandong
Wang,
Yizheng
Jiao,
Cong
Xu,
Xiaobing
Li,
Teng
Wang,
Xinyu
Que,
Cristi
Cira,
Bin
Wang,
Zhuo
Liu,
Bliss
Bailey,
Weikuan
Yu,
“Assessing
the
Performance
Impact
of
High-‐‑Speed
Interconnects
on
MapReduce
Programs”,
3rd
Workshop
on
Big
Data
Benchmarking
(Invited
Book
Chapter),
Xi’an
China.
16. [Supercomputing
2013]:
Xiaobing
Li,
Yandong
Wang,
Yizheng
Jiao,
Cong
Xu,
Weikuan
Yu.
“CooMR:
Cross-‐‑Task
Coordination
for
Efficient
Data
Management
in
MapReduce
Programs”,
25th
IEEE/ACM
The
International
Conference
for
High
Performance
Computing
Networking,
Storage
and
Analysis,
Nov
17-‐‑22,
2013,
Denver,
CO,
USA.
17. [ICCCN’13]:
Zhuo
Liu,
Bin
Wang,
Teng
Wang,
Yuan
Tian,
Cong
Xu,
Yandong
Wang,
Weikuan
Yu,
“Profiling
and
Improving
I/O
Performance
of
a
Large-‐‑Scale
Climate
Scientific
Application”,
22nd
International
Conference
on
Computer
Communications
and
Networks,
July
30-‐‑Aug
2,
2013,
Nassau,
Bahamas.
18. [ICAC’13]:
Yandong.Wang,
JianTan,
Weikuan
Yu,
Li
Zhang,
Xiaoqiao
Meng.
“Preemptive
ReduceTask
Scheduling
for
Fair
and
Fast
Job
Completion”.
10th
International
Conference
on
Autonomic
Computing,
in
conjunction
with
2013
USENIX
Federated
Conferences.
Jun
26-‐‑28,
2013,
San
Jose,
CA,
USA.
19. [CCGRID'13]:
Cong
Xu,
Manjunath.
G.
Venkata,
Richard.
L.
Graham,
Yandong
Wang,
Zhuo
Liu,
Weikuan
Yu.
“SLOAVx:
Scalable
LOgarithmic
AlltoallV
Algorithm
for
Hierarchical
Multicore
Systems”,
The
13th
IEEE/ACM
International
Symposium
on
Cluster,
Cloud
and
Grid
Computing,
May
13-‐‑
16,2013,
Delft,
Netherlands.
20. [IPDPS'13]:
Yandong.Wang,
Cong
Xu,
Xiaobing
Li,
Weikuan
Yu,
“JVM-‐‑bypassing
Shuffling
for
Hadoop
Acceleration”,
27th
IEEE
International
Parallel
and
Distributed
Processing
Symposium,
Boston,
MA,
USA.
21. [MBDS'12]:
Xinyu
Que,
Yandong.Wang,
Cong
Xu,
Weikuan
Yu,
“Hierarchical
Merge
for
Scalable
MapReduce”,
MBDS
in
conjunction
with
ICAC
2012,
San
Jose,
CA.
22. [SuperComputing
2011]:
Yandong.
Wang,
Xinyu
Que,
Weikuan
Yu,
Dror
Goldenberg,
Dhiraj.
Sehgal.
“Hadoop
Acceleration
Through
Network
Levitated
Merge”.
23rd
IEEE/ACM
The
International
Conference
for
High
Performance
Computing
Networking,
Storage
and
Analysis,
Nov
14-‐‑17,
2011
Seattle,
WA,
USA.
23.
[CLUSTER'11]:
Yuan
Tian,
Scott
Klasky,
Jay
Lofstead,
Ray
Grout,
Norbert
Podhorszki,
Qing
Liu,
Yandong.
Wang,
Weikuan
Yu.
“EDO:
Improving
Read
Performance
for
Scientific
Applications
Through
Elastic
Data
Organization”,
IEEE
International
Conference
on
Cluster
Computing,
Sep
26-‐‑30,
2011,
Austin,
Texas.
24. Ph.D.
Dissertation:
Efficient
Movement
and
Task
Management
in
MapReduce
for
Fast
Analytics
of
Big
Data
25. Master
Thesis:
Nvidia
CUDA
Architecture-‐‑based
Parallel
Incomplete
SAT
Solver
VII. PATENTS:
• A system and Method of Sharing Remote Pointers for RDMA-Enabled Key-Value Stores.
4. • In-Memory Data Store Replication Through Remote Memory Sharing.
• Cache Management for RDMA-Accelerated Data Stores.
• A Method and System of Stage-Aware Performance Modeling for DAG-based Data Analytic Platforms.
• Cache Management in RDMA-based Distributed Key-Value Store Using Atomic Operations.
• Reducing Memory Consumption for In-Memory Data Stores Using Compression.
• Restorable Memory Allocator.
• A Method and System of Dynamic Memory and Cache Management in DAG-based Data Analytic Platforms.
• A Method and System of Dynamic RDD Cache Management for In-Memory Data Analytics Platforms.
VIII. RESEARCH
RECOGNITIONS
AND
AWARDS
• Outstanding Research Contribution to Hadoop Acceleration, 2011. Mellanox Technologies, Inc.
• Outstanding International Graduate Student Award from Auburn University.
• IPDPS 2013 Travel Award ($1000 award).
• SC 2011 Student Volunteer Award
IX. CONFERENCE
PRESENTATIONS
• [SuperComputing
2015]:
HydraDB:
A
Resilient
RDMA-‐‑driven
Key-‐‑Value
Middleware
for
Large-‐‑scale
In-‐‑
Memory
Cluster
Computing,
Nov
15-‐‑20,
2015
Links: http://sc15.supercomputing.org/schedule/event_detail?evid=pap231
• [HPDC
2015]:
HydraDB:
A
Resilient
RDMA-‐‑driven
Key-‐‑Value
Middleware
for
In-‐‑Memory
Cluster
Computing,
Portland,
Oregon,
June
15-‐‑19.
2015
• [SoCC 2014]: C-Hint: Cache Management for RDMA-Accelerated Key-Value Stores, ACM SoCC
Conference, Nov 3-5 2014, Seattle, WA, USA.
Links: https://sites.google.com/site/2014socc/home/program
• [USENIX ICAC 2013]: Preemptive ReduceTask Scheduling for Fair and Fast Job Completion, June 26-28
2013, San Jose, CA, USA.
Links: https://www.usenix.org/conference/icac13/technical-sessions/presentation/wang_yandong
• [IPDPS 2013]: JVM-bypass for Efficient Hadoop Shuffling, May 20-24 2013 Boston, MA, USA.
X. Invited
Talks
• 2015 Computer Science Seminar: HydraDB: In-Memory Key-Value Store at IBM, Auburn University
links: http://www.cyber.auburn.edu/calendar/2015/02/seminar-hydradb-in-memory-key-value-store-at-ibm.html
• Oak Ridge National Lab Seminar: Optimizing MapReduce for Big Data Analytics, Aug 2013
• IBM Seminar: Hadoop Acceleration Through Network-Levitated Merging, Hawthorn, NY, May 2012
• IBM Seminar: Preemptive ReduceTask Scheduling For Fair and Fast Job Completion, Aug 2012
XI. PROFESSIONAL
ACTIVITY
• Member
of
Association
for
Computing
Machinery,
#5425288
• Journal
Reviewers:
1. IEEE
Transactions
on
Parallel
and
Distributed
Systems.
2. International
Journal
of
High
Performance
Computing.
• Program
Committee:
1. 2015
The
International
Workshop
on
Data-‐‑Intensive
Scalable
Computing
Systems,
in
conjunction
with
SC15.
2. 2015
1st
IEEE
Workshop
on
Internet
Scale
Cloud
Computing.
3. 2014
The
International
Workshop
on
Data-‐‑Intensive
Scalable
Computing
Systems,
in
conjunction
with
SC14.
4. 2014
3rd
International
Workshop
on
Collaborative
Cloud.
• Conference
Reviewers:
1. 2015
EuroSys
Conference.
2. 2015
The
International
Workshop
on
Data-‐‑Intensive
Scalable
Computing
Systems,
in
conjunction
with
SC15.
3. 2015
1st
IEEE
Workshop
on
Internet
Scale
Cloud
Computing.
4. 2014
The
International
Workshop
on
Data-‐‑Intensive
Scalable
Computing
Systems,
in
conjunction
with
SC14.
5. 5. 2014
3rd
International
Workshop
on
Collaborative
Cloud.
6. 2012
31st
IEEE
International
Performance
Computing
and
Communications
Conferences.