Spark 3.0 will resolve over 2000 issues and add several new features: a data source API with catalog support, adaptive query execution, dynamic partition pruning, ANSI SQL compliance, and better support for running on Kubernetes. Query optimization will also improve by combining rule-based and cost-based planning with information gathered at runtime. Koalas lets Python users work with big data through the pandas API, backed by Spark DataFrames. Delta Lake offers a standard way to build data lakes: a transactional storage layer on top of Parquet that ensures data reliability and supports both batch and streaming workloads.
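
As a rough illustration of how three of the Spark 3.0 features named above (adaptive query execution, dynamic partition pruning, ANSI SQL compliance) can be switched on, the sketch below collects their Spark SQL configuration keys and renders them as spark-submit arguments. The names SPARK3_FEATURE_CONF and to_submit_args are hypothetical helpers invented for this example and are not part of Spark; the values shown are illustrative, and default values differ between releases.

```python
# Spark SQL configuration keys for three Spark 3.0 features.
# The dict name and the helper below are hypothetical, for illustration only.
SPARK3_FEATURE_CONF = {
    "spark.sql.adaptive.enabled": "true",  # adaptive query execution
    "spark.sql.optimizer.dynamicPartitionPruning.enabled": "true",  # dynamic partition pruning
    "spark.sql.ansi.enabled": "true",  # ANSI SQL compliance mode
}


def to_submit_args(conf):
    """Render a config mapping as spark-submit `--conf key=value` arguments."""
    args = []
    for key, value in sorted(conf.items()):
        args.extend(["--conf", f"{key}={value}"])
    return args


if __name__ == "__main__":
    # Prints the flags in a form that can be appended to a spark-submit command.
    print(" ".join(to_submit_args(SPARK3_FEATURE_CONF)))
```

The same key/value pairs could instead be applied in PySpark with SparkSession.builder.config(key, value) before calling getOrCreate().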