alluxio data orchestration storage open source presto big data cloud spark file system distributed computing hybrid cloud machine learning summit data management memory tachyon project aws analytics alluxio day performance hadoop hdfs hive separation of compute and storage cloud computing distributed systems apache spark ai s3 caching data architecture data platform data engineering cloud storage aws s3 kubernetes distributed storage alluxio engineering data analytics multi cloud model training emr sql compute object store meetup deep learning data lake data locality data tachyon object stores tech talk release infrastructure intel fuse rocksdb llm architecture facebook cloud bursting google dataproc unified namespace uber posix orchestration tensorflow artificial intelligence gpu ml cache gpu analytics use case apache hudi apache ozone local cache raft office hour object storage scale hybrid cloud bursting overview compute storage separation computer metadata community tencent ceph memory centric pytorch data loading gpu utilization trino database product school zookeeper apache iceberg presto caching microsoft data lakes fluid alibaba datasapiens under file system zero copy bursting on-prem analytics zoo amazon emr nfs structured data management rakuten query engine data stores conference baidu data warehouse grpc data stack demo amazon web services data ecosystem jd kyligence olap memory-centric generative ai cv api model training software engineering software development devops transparent uri product release analytics and ai cloud migration cloud architecture twitter virtual file system apache ranger hybrid big data netapp bilibili data tagging open data platform metadata management shadow cache tiktok cache layer prometheus metrics grafana optane persistent memory raptorx disaggregated storage rapids accelerator data lake analytics dask aspect analytics webinar terraform eks t3go walkme unisound atlas starburst robinhood data catalog paypal gimel sql workloads
jd.com distributed applications ing tech dataproc google cloud hybrid data lake helixa comcast china unicom aunalytics hub hybrid shannondb storagequery s3 api analytic workloads public cloud deep learning applications high performance high-performance scalable metadata services structured data services catalog service spark workloads remote data software testing unified data zero copy hybrid bursting mapr cloud workloads dc/os object store analytics on-premise compute e-commerce datasets pipeline api usability concurrency iceberg netflix alibaba cloud gene computing structured data search queries ryte zero-copy burst distributed data caching distributed query walmartlabs global namespace multi-tiering 2.0 preview unified bigdata tutorial storage system security parquet amazon amplab pingo tachyon nexus elastic mapreduce developers developer datawarehouse etl financial services decoupling compute and storage data unification virtualization distributed system in-memory storage qiniu sogou business intelligence ctrip momo talking data nvidia mesosphere qunar strata