The document discusses how public data hosted in the cloud can enable new forms of data analysis and debate. Key points:
- Many government agencies and organizations now publish public datasets on platforms like Amazon Web Services, allowing interactive analysis of data on-demand without having to download large datasets.
- This can encourage more transparent discussion of issues by letting people analyze the same source data and publish analyses for others to explore.
- Challenges include establishing standards for data formats and enabling interactive models that non-experts can explore to encourage broader participation in analysis and debate.
14. Why Public Data in the Cloud? Publish once, pay by the byte Users create private copies Analyze on demand Pay hourly for compute Publish back to S3
15. Help, I have a Mortgage! All Mortgage Data is 1-2TB and public Costs as little as $5k/mo to analyze The crisis was caused in part by obfuscation Huge opportunity for transparency today
16. “You can prove anything with statistics” What if we all looked at the same data and then debated the economics of healthcare? What if you published your analysis in the cloud?