This document describes Bubbles, a Python framework for data processing and quality probing. Bubbles focuses on representing data objects and defining operations that can be performed on those objects. Key aspects include:
- Data objects define the structure and representations of data without enforcing a specific storage format.
- Operations can be performed on data objects and are dispatched dynamically based on the objects' representations.
- A context stores available operations and handles dispatching.
- Stores provide interfaces for loading objects from, and saving them to, backends such as SQL databases and CSV files.
- Pipelines allow sequencing operations to transform and process objects from source to target stores.
- The framework includes common operations for filtering, joining, aggreg
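
The dispatch idea in the list above, where a context holds operations and picks an implementation based on an object's representations, can be sketched in plain Python. This is a minimal illustration under stated assumptions, not Bubbles' actual API: the names `DataObject`, `Context`, `register`, `call`, and the `limit` operation are all hypothetical.

```python
# Illustrative sketch only: every name here is hypothetical, not Bubbles' API.


class DataObject:
    """A data object that advertises which representations it can provide."""

    def __init__(self, rows=None, sql=None):
        self._rows = rows
        self._sql = sql

    def representations(self):
        # Ordered from most to least preferred.
        reps = []
        if self._sql is not None:
            reps.append("sql")
        if self._rows is not None:
            reps.append("rows")
        return reps

    def rows(self):
        return self._rows

    def sql_statement(self):
        return self._sql


class Context:
    """Stores operations keyed by (name, representation) and dispatches calls."""

    def __init__(self):
        self._operations = {}

    def register(self, name, representation):
        def decorator(func):
            self._operations[(name, representation)] = func
            return func
        return decorator

    def call(self, name, obj, *args, **kwargs):
        # Use the first implementation matching one of the object's representations.
        for rep in obj.representations():
            func = self._operations.get((name, rep))
            if func is not None:
                return func(obj, *args, **kwargs)
        raise LookupError(f"no operation {name!r} for {obj.representations()}")


context = Context()


@context.register("limit", "rows")
def limit_rows(obj, count):
    # Row-based variant: slice the in-memory list.
    return DataObject(rows=obj.rows()[:count])


@context.register("limit", "sql")
def limit_sql(obj, count):
    # SQL-based variant: compose a new statement instead of fetching data.
    statement = f"SELECT * FROM ({obj.sql_statement()}) AS s LIMIT {int(count)}"
    return DataObject(sql=statement)
```

In this sketch the same call, `context.call("limit", obj, 2)`, slices a list for a row-backed object and composes a new statement for a SQL-backed one, so the caller never has to know how the data is stored.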