SlideShare a Scribd company logo
1 of 4
Yuan Yipeng (GPU Architect, SoC System, Linux Device driver, Testing
Engineer)
(086)15026966201
larkforsure@gmail.com
Profile:
Now everything around GPU.
Key Skills:
 Architecture building and early stage studying of Streaming
Multiprocessor(SM) plus Texture unit, the core of GPU: Focus on cycle
accurate Performance monitor. Also have knowledge of GPGPU, Graphics, flow
automation, RTL verification.
 Multi-Core System Algorithm Implementation and Optimization: From paper
reading to algorithm simulation, choosing, implementation, and optimization.
Familiar with both heterogeneous and homogeneous hardware systems.
 Embedded System Design and Implement: Linux kernel and Device-Driver
programming skills for advanced application. Familiar with kernel debugging,
profiling, Makefile, etc.
 Automation Testing: Testing framework, interface design and implementation.
Framework extension capability guaranteeing. Testcases reviewing, checking,
integration and debugging, Agreements compliance and DTS tracking. Model-
based Testing exploring. Nearly everything involved in this.
 Digital Logic Design: Familiar with Verilog, principles of synthesizable,
verifiable RTL coding style. Understanding the knowledge of timing, low power,
multiple clock domains, etc.
EXPERIENCE/ACCOMPLISHMENTS:
 Next generation GPU SM/Texture unit performance modeling and
verification (NVIDIA 2015, 2016)
C++ based performance simulator for early architecture studying, and pre-tapeout
stage verification;
CUDA on new architecture fast-modeling perf. simulator project owner;
GPGPU and 3D Graphics instruction level chip pipeline investigation and
optimization;
CUDA system level perf simulator integrated to SM level simulator, as equally
important for CUDA programs perf study.
Perf. tuning for GPU Architecture in development and exploring for GPU
Architecture in next generation;
Trace perf. and function verification for RTL;
SASS assembly code analyzing and debugging;
SASS threads level ( WARP ) flow control unit (CBU, CRS) simulation,
optimization, and corner case pipe-clean;
CUDA traces capturing and analyzing;
Texture function implement;
Traces with perf. data delivery on clients' requests;
Highlight:
Found, triage, and fixed one history bug in SM/TPC performance simulator,
which improved department's most essential trace deliver flow, from 2-7 days to
less than 1 hour per delivery;
"Very Good" performance evaluation for the first year;
 Automation Testing Framework Building and Directing (Huawei, Opto-
electronics)
Five months after the boarding, Huawei Opto-electronics depart. had a workload
improvement from nearly all manually test, to one third off to automation-based
test. Those ignored chains since of time restrictions before are now mostly
covered. Found two history bugs which existed for many years, and other bugs out
of counting.
The framework and interface are designed and implemented by himself alone,
characterized with usability, extendibility, stability. Directing testcases implement
of the whole depart. Responsible for submitted testcases, of all, reviewing,
checking, integration, and debugging. Also the bug location and DTS tracking
following.
He had no Ruby or testing experiences before.
 Vector Media Accelerator 2 Development (IBM 2008):
Responsible for the software specification implement of H.264 SOC in the early
stage; FPGA prototype verification, Linux device driver coding, software-
hardware cooperative architecture design and implement, SOC hardware profiling
in the later stage.
Cooperating with other team members, he had implemented both the software
version and the Linux Device-Driver version of the H.264 Main Profile features
(including Loopfilter, B-frame, CABAC, VLC), with the performance improved
in a large pace as a result of his optimization to the original H.264 algorithm,
which was mainly based on his good understanding of the standard.
 Vector Media Accelerator Component Development (IBM 2006):
Coding for the for the Vector Processor element which consisted of VLIW, SIMD
and pipeline programming skills.
He was responsible for the optimization of H.264 software for the Vector
Processor element which was integrated in that component. It could be taken as a
typical DSP based system developing process, which consisted of VLIW, SIMD
and pipeline software programming skills.
 Rochester Medical Imaging Project (Segmentation) on CELL BE (IBM 2007)
Collaborating with colleagues from worldwide to develop a prototype on the Cell
platform to demonstrate the capability of Cell in medical imaging processing. He
has written fully optimized image processing code on the SPE side, moreover, he
has also written communication and synchronizing code both on the PPE and SPE
sides.
 German Medical Imaging Projection B&P (Reconstruction) (IBM 2007)
Supporting German sales team in the B&P processing. Investigating several fast
Reconstruction algorithms and comparing them based on the speed, complexity,
reconstruction quality, bandwidth occupation, etc. Choosing one algorithm which
is the most suitable for CELL BE.
 CELL FOAK project for STTRI (Shanghai Telecommunication Technology
Research Institute) (IBM 2007)
Developing a prototype of media server based on IBM Cell to show the high
performance of CELL on multimedia. Investigating the architecture of next-
generation high performance media server. His part of job includes implementing
media libraries of several standards on a multi-core processor, namely CELL;
designing a framework involving media libraries which must be de-coupling and
re-useable so can easily move it to support other telecom operations; designing a
scalable interface between caller(application server) and media libraries so as to
accommodate future practical application scenario (dynamic 0~8000 routes of
calls simultaneously); designing the interface between media framework and
users’ through RTP protocol.
EDUCATION:
 MS. of Space Physics, Wuhan University, China (2006)
 BS. of Electronics Engineering, Wuhan University, China (2004)

More Related Content

What's hot

Dj Mack Resume.Doc
Dj Mack Resume.DocDj Mack Resume.Doc
Dj Mack Resume.DocDanial_Mack
 
Hardware-Software allocation specification of IMA systems for early simulation
Hardware-Software allocation specification of IMA systems for early simulationHardware-Software allocation specification of IMA systems for early simulation
Hardware-Software allocation specification of IMA systems for early simulationÁkos Horváth
 
sharath_resume_may2016
sharath_resume_may2016sharath_resume_may2016
sharath_resume_may2016Sharath Kulal
 
Ecd302 unit 01(investigate ecad systems)
Ecd302 unit 01(investigate ecad systems)Ecd302 unit 01(investigate ecad systems)
Ecd302 unit 01(investigate ecad systems)Xi Qiu
 
Trudell_Resume_2015
Trudell_Resume_2015Trudell_Resume_2015
Trudell_Resume_2015Bill Trudell
 
Background And An Architecture Example
Background And An Architecture ExampleBackground And An Architecture Example
Background And An Architecture ExampleGlen Wilson
 
NEHA JAIN_RESUME
NEHA JAIN_RESUMENEHA JAIN_RESUME
NEHA JAIN_RESUMENeha Jain
 
Chang Haiyue' Resume
Chang Haiyue' ResumeChang Haiyue' Resume
Chang Haiyue' ResumeDotaman
 
ECAD-MCAD Presentation 2011
ECAD-MCAD Presentation 2011ECAD-MCAD Presentation 2011
ECAD-MCAD Presentation 2011Altium
 
System on Chip Design and Modelling Dr. David J Greaves
System on Chip Design and Modelling   Dr. David J GreavesSystem on Chip Design and Modelling   Dr. David J Greaves
System on Chip Design and Modelling Dr. David J GreavesSatya Harish
 
Resume_AdyaJha
Resume_AdyaJhaResume_AdyaJha
Resume_AdyaJhaAdya Jha
 
p4pktgen: Automated Test Case Generation for P4 Programs
p4pktgen:  Automated Test Case  Generation for P4 Programsp4pktgen:  Automated Test Case  Generation for P4 Programs
p4pktgen: Automated Test Case Generation for P4 ProgramsAJAY KHARAT
 

What's hot (20)

Dj Mack Resume.Doc
Dj Mack Resume.DocDj Mack Resume.Doc
Dj Mack Resume.Doc
 
Hardware-Software allocation specification of IMA systems for early simulation
Hardware-Software allocation specification of IMA systems for early simulationHardware-Software allocation specification of IMA systems for early simulation
Hardware-Software allocation specification of IMA systems for early simulation
 
sharath_resume_may2016
sharath_resume_may2016sharath_resume_may2016
sharath_resume_may2016
 
EC302-Introduction
EC302-IntroductionEC302-Introduction
EC302-Introduction
 
Ecd302 unit 01(investigate ecad systems)
Ecd302 unit 01(investigate ecad systems)Ecd302 unit 01(investigate ecad systems)
Ecd302 unit 01(investigate ecad systems)
 
Trudell_Resume_2015
Trudell_Resume_2015Trudell_Resume_2015
Trudell_Resume_2015
 
Background And An Architecture Example
Background And An Architecture ExampleBackground And An Architecture Example
Background And An Architecture Example
 
JosephAnthonyEAlvarez_CV_2016
JosephAnthonyEAlvarez_CV_2016JosephAnthonyEAlvarez_CV_2016
JosephAnthonyEAlvarez_CV_2016
 
NEHA JAIN_RESUME
NEHA JAIN_RESUMENEHA JAIN_RESUME
NEHA JAIN_RESUME
 
Alex16_ic
Alex16_icAlex16_ic
Alex16_ic
 
Chris brown ti
Chris brown tiChris brown ti
Chris brown ti
 
Self Introduction_OwenChen
Self Introduction_OwenChenSelf Introduction_OwenChen
Self Introduction_OwenChen
 
Resume150721
Resume150721Resume150721
Resume150721
 
Verification strategies
Verification strategiesVerification strategies
Verification strategies
 
Chang Haiyue' Resume
Chang Haiyue' ResumeChang Haiyue' Resume
Chang Haiyue' Resume
 
ECAD-MCAD Presentation 2011
ECAD-MCAD Presentation 2011ECAD-MCAD Presentation 2011
ECAD-MCAD Presentation 2011
 
System on Chip Design and Modelling Dr. David J Greaves
System on Chip Design and Modelling   Dr. David J GreavesSystem on Chip Design and Modelling   Dr. David J Greaves
System on Chip Design and Modelling Dr. David J Greaves
 
Resume_AdyaJha
Resume_AdyaJhaResume_AdyaJha
Resume_AdyaJha
 
p4pktgen: Automated Test Case Generation for P4 Programs
p4pktgen:  Automated Test Case  Generation for P4 Programsp4pktgen:  Automated Test Case  Generation for P4 Programs
p4pktgen: Automated Test Case Generation for P4 Programs
 
Resume
ResumeResume
Resume
 

Viewers also liked

Sophis RISQUE Brochure
Sophis RISQUE BrochureSophis RISQUE Brochure
Sophis RISQUE Brochureyogwt
 
前端规范(初稿)
前端规范(初稿)前端规范(初稿)
前端规范(初稿)EnLei-Cai
 
CV_Shilidong
CV_ShilidongCV_Shilidong
CV_Shilidong?? ?
 
Resume_Yilun Chong_EN
Resume_Yilun Chong_ENResume_Yilun Chong_EN
Resume_Yilun Chong_ENYilun Chong
 
Wei_Zhao_Resume
Wei_Zhao_ResumeWei_Zhao_Resume
Wei_Zhao_ResumeWei Zhao
 
台湾趴趴走
台湾趴趴走台湾趴趴走
台湾趴趴走Limbo Wong
 
詹剑锋:Big databench—benchmarking big data systems
詹剑锋:Big databench—benchmarking big data systems詹剑锋:Big databench—benchmarking big data systems
詹剑锋:Big databench—benchmarking big data systemshdhappy001
 
Dpdk Validation - Liu, Yong
Dpdk Validation - Liu, YongDpdk Validation - Liu, Yong
Dpdk Validation - Liu, Yongharryvanhaaren
 
CVLinkedIn
CVLinkedInCVLinkedIn
CVLinkedInJun Ma
 
Research Park: Year in Review 2014
Research Park: Year in Review 2014Research Park: Year in Review 2014
Research Park: Year in Review 2014UIResearchPark
 
Introducing Ubuntu SDK
Introducing Ubuntu SDKIntroducing Ubuntu SDK
Introducing Ubuntu SDKShuduo Sang
 
Zejia_CV_final
Zejia_CV_finalZejia_CV_final
Zejia_CV_finalZJ Zheng
 
Kafka文件系统设计
Kafka文件系统设计Kafka文件系统设计
Kafka文件系统设计志涛 李
 
Xiaoli_Ma_developer_resume
Xiaoli_Ma_developer_resumeXiaoli_Ma_developer_resume
Xiaoli_Ma_developer_resumeXiaoli Ma
 
Stanford splash spring 2016 basic programming
Stanford splash spring 2016 basic programmingStanford splash spring 2016 basic programming
Stanford splash spring 2016 basic programmingYu-Sheng (Yosen) Chen
 

Viewers also liked (20)

Sophis RISQUE Brochure
Sophis RISQUE BrochureSophis RISQUE Brochure
Sophis RISQUE Brochure
 
Paradigm Shifts
Paradigm ShiftsParadigm Shifts
Paradigm Shifts
 
前端规范(初稿)
前端规范(初稿)前端规范(初稿)
前端规范(初稿)
 
CV_Shilidong
CV_ShilidongCV_Shilidong
CV_Shilidong
 
Resume_Yilun Chong_EN
Resume_Yilun Chong_ENResume_Yilun Chong_EN
Resume_Yilun Chong_EN
 
Wei_Zhao_Resume
Wei_Zhao_ResumeWei_Zhao_Resume
Wei_Zhao_Resume
 
Hung DO-DUY - Spikenet
Hung DO-DUY - Spikenet Hung DO-DUY - Spikenet
Hung DO-DUY - Spikenet
 
Sara Saile cv
Sara Saile cv Sara Saile cv
Sara Saile cv
 
台湾趴趴走
台湾趴趴走台湾趴趴走
台湾趴趴走
 
詹剑锋:Big databench—benchmarking big data systems
詹剑锋:Big databench—benchmarking big data systems詹剑锋:Big databench—benchmarking big data systems
詹剑锋:Big databench—benchmarking big data systems
 
CV-YacineRhalmi
CV-YacineRhalmiCV-YacineRhalmi
CV-YacineRhalmi
 
Dpdk Validation - Liu, Yong
Dpdk Validation - Liu, YongDpdk Validation - Liu, Yong
Dpdk Validation - Liu, Yong
 
Cv 12112015
Cv 12112015Cv 12112015
Cv 12112015
 
CVLinkedIn
CVLinkedInCVLinkedIn
CVLinkedIn
 
Research Park: Year in Review 2014
Research Park: Year in Review 2014Research Park: Year in Review 2014
Research Park: Year in Review 2014
 
Introducing Ubuntu SDK
Introducing Ubuntu SDKIntroducing Ubuntu SDK
Introducing Ubuntu SDK
 
Zejia_CV_final
Zejia_CV_finalZejia_CV_final
Zejia_CV_final
 
Kafka文件系统设计
Kafka文件系统设计Kafka文件系统设计
Kafka文件系统设计
 
Xiaoli_Ma_developer_resume
Xiaoli_Ma_developer_resumeXiaoli_Ma_developer_resume
Xiaoli_Ma_developer_resume
 
Stanford splash spring 2016 basic programming
Stanford splash spring 2016 basic programmingStanford splash spring 2016 basic programming
Stanford splash spring 2016 basic programming
 

Similar to YuanYiPeng_161017

Similar to YuanYiPeng_161017 (20)

Michael_Joshua_Validation
Michael_Joshua_ValidationMichael_Joshua_Validation
Michael_Joshua_Validation
 
Prasad_CTP
Prasad_CTPPrasad_CTP
Prasad_CTP
 
Kavita resume startup
Kavita resume startupKavita resume startup
Kavita resume startup
 
Surekha_haoop_exp
Surekha_haoop_expSurekha_haoop_exp
Surekha_haoop_exp
 
Kavita resume
Kavita resume Kavita resume
Kavita resume
 
Arun Prasad-R.DOCX
Arun Prasad-R.DOCXArun Prasad-R.DOCX
Arun Prasad-R.DOCX
 
Spellman Resume
Spellman ResumeSpellman Resume
Spellman Resume
 
Bhavani HS
Bhavani HSBhavani HS
Bhavani HS
 
Leland Hamilton120320
Leland Hamilton120320Leland Hamilton120320
Leland Hamilton120320
 
kavita_resume_3
kavita_resume_3kavita_resume_3
kavita_resume_3
 
Chirko, Kenneth Resume - long
Chirko, Kenneth Resume - longChirko, Kenneth Resume - long
Chirko, Kenneth Resume - long
 
Ankit sarin
Ankit sarinAnkit sarin
Ankit sarin
 
Host Simulation
Host SimulationHost Simulation
Host Simulation
 
Sudha Madhuri Yagnamurthy Resume 2 (5)
Sudha Madhuri Yagnamurthy Resume 2 (5)Sudha Madhuri Yagnamurthy Resume 2 (5)
Sudha Madhuri Yagnamurthy Resume 2 (5)
 
Chiranjeevi_QA Engg.
Chiranjeevi_QA Engg.Chiranjeevi_QA Engg.
Chiranjeevi_QA Engg.
 
Qiang Yu Resume
Qiang Yu Resume Qiang Yu Resume
Qiang Yu Resume
 
Eric Theis resume61.1
Eric Theis resume61.1Eric Theis resume61.1
Eric Theis resume61.1
 
FredMcLainResumeB
FredMcLainResumeBFredMcLainResumeB
FredMcLainResumeB
 
Resume_VenkataRakeshGudipalli Master - Copy
Resume_VenkataRakeshGudipalli Master - CopyResume_VenkataRakeshGudipalli Master - Copy
Resume_VenkataRakeshGudipalli Master - Copy
 
Resume_PraveenKumar
Resume_PraveenKumarResume_PraveenKumar
Resume_PraveenKumar
 

YuanYiPeng_161017

  • 1. Yuan Yipeng (GPU Architect, SoC System, Linux Device driver, Testing Engineer) (086)15026966201 larkforsure@gmail.com Profile: Now everything around GPU. Key Skills:  Architecture building and early stage studying of Streaming Multiprocessor(SM) plus Texture unit, the core of GPU: Focus on cycle accurate Performance monitor. Also have knowledge of GPGPU, Graphics, flow automation, RTL verification.  Multi-Core System Algorithm Implementation and Optimization: From paper reading to algorithm simulation, choosing, implementation, and optimization. Familiar with both heterogeneous and homogeneous hardware systems.  Embedded System Design and Implement: Linux kernel and Device-Driver programming skills for advanced application. Familiar with kernel debugging, profiling, Makefile, etc.  Automation Testing: Testing framework, interface design and implementation. Framework extension capability guaranteeing. Testcases reviewing, checking, integration and debugging, Agreements compliance and DTS tracking. Model- based Testing exploring. Nearly everything involved in this.  Digital Logic Design: Familiar with Verilog, principles of synthesizable, verifiable RTL coding style. Understanding the knowledge of timing, low power, multiple clock domains, etc. EXPERIENCE/ACCOMPLISHMENTS:  Next generation GPU SM/Texture unit performance modeling and verification (NVIDIA 2015, 2016)
  • 2. C++ based performance simulator for early architecture studying, and pre-tapeout stage verification; CUDA on new architecture fast-modeling perf. simulator project owner; GPGPU and 3D Graphics instruction level chip pipeline investigation and optimization; CUDA system level perf simulator integrated to SM level simulator, as equally important for CUDA programs perf study. Perf. tuning for GPU Architecture in development and exploring for GPU Architecture in next generation; Trace perf. and function verification for RTL; SASS assembly code analyzing and debugging; SASS threads level ( WARP ) flow control unit (CBU, CRS) simulation, optimization, and corner case pipe-clean; CUDA traces capturing and analyzing; Texture function implement; Traces with perf. data delivery on clients' requests; Highlight: Found, triage, and fixed one history bug in SM/TPC performance simulator, which improved department's most essential trace deliver flow, from 2-7 days to less than 1 hour per delivery; "Very Good" performance evaluation for the first year;  Automation Testing Framework Building and Directing (Huawei, Opto- electronics) Five months after the boarding, Huawei Opto-electronics depart. had a workload improvement from nearly all manually test, to one third off to automation-based test. Those ignored chains since of time restrictions before are now mostly covered. Found two history bugs which existed for many years, and other bugs out of counting. The framework and interface are designed and implemented by himself alone, characterized with usability, extendibility, stability. Directing testcases implement
  • 3. of the whole depart. Responsible for submitted testcases, of all, reviewing, checking, integration, and debugging. Also the bug location and DTS tracking following. He had no Ruby or testing experiences before.  Vector Media Accelerator 2 Development (IBM 2008): Responsible for the software specification implement of H.264 SOC in the early stage; FPGA prototype verification, Linux device driver coding, software- hardware cooperative architecture design and implement, SOC hardware profiling in the later stage. Cooperating with other team members, he had implemented both the software version and the Linux Device-Driver version of the H.264 Main Profile features (including Loopfilter, B-frame, CABAC, VLC), with the performance improved in a large pace as a result of his optimization to the original H.264 algorithm, which was mainly based on his good understanding of the standard.  Vector Media Accelerator Component Development (IBM 2006): Coding for the for the Vector Processor element which consisted of VLIW, SIMD and pipeline programming skills. He was responsible for the optimization of H.264 software for the Vector Processor element which was integrated in that component. It could be taken as a typical DSP based system developing process, which consisted of VLIW, SIMD and pipeline software programming skills.  Rochester Medical Imaging Project (Segmentation) on CELL BE (IBM 2007) Collaborating with colleagues from worldwide to develop a prototype on the Cell platform to demonstrate the capability of Cell in medical imaging processing. He has written fully optimized image processing code on the SPE side, moreover, he has also written communication and synchronizing code both on the PPE and SPE sides.  German Medical Imaging Projection B&P (Reconstruction) (IBM 2007) Supporting German sales team in the B&P processing. Investigating several fast Reconstruction algorithms and comparing them based on the speed, complexity,
  • 4. reconstruction quality, bandwidth occupation, etc. Choosing one algorithm which is the most suitable for CELL BE.  CELL FOAK project for STTRI (Shanghai Telecommunication Technology Research Institute) (IBM 2007) Developing a prototype of media server based on IBM Cell to show the high performance of CELL on multimedia. Investigating the architecture of next- generation high performance media server. His part of job includes implementing media libraries of several standards on a multi-core processor, namely CELL; designing a framework involving media libraries which must be de-coupling and re-useable so can easily move it to support other telecom operations; designing a scalable interface between caller(application server) and media libraries so as to accommodate future practical application scenario (dynamic 0~8000 routes of calls simultaneously); designing the interface between media framework and users’ through RTP protocol. EDUCATION:  MS. of Space Physics, Wuhan University, China (2006)  BS. of Electronics Engineering, Wuhan University, China (2004)