Ad
Ad
Ad
Ad
Ad
Ad
Ad
Ad
Ad
Ad
Ad
Exploring Distributed Caching
for Faster GPU Training with
NVMe, GDS and RDMA
Bin Fan
Founding Engineer, VP of Technology ...
Agenda
● Why I/O Matters in LLM Training: Challenges & Opportunities
● Three Options Handling Efficient I/O
● Designing a H...
LLM Performance is Driven by Compute, Data Size, Parameters
=> I/O Becomes Critical
Scaling Law: The performance of Large ...
1 of 36 Ad