1) Complex data pipelines can introduce bugs that compound as dependencies increase. Engineers manage complexity through encapsulation, clear APIs, and integration tests. 2) Data scientists require semantic correctness but making assumptions introduces risks. Sanity checks on fields like verifying formats and constraints help identify potential errors. 3) Defensive data science through data asserts maintains quality by clearly defining trust boundaries and assumptions. Checks should match expectations and be revisited regularly as upstream changes can impact pipelines.