Innovation is based on many components – a great idea, creativity, persistence, the right data, and technology tools. Amazon Web Services has an engine of innovation for the start-up community, and it’s now being used to power innovative solutions for big societal problems. As government data becomes more widely available, more people can use AWS computing and big data analytics tools to tackle problems that were, until recently, exclusively the domain of government projects. Scientists, developers, and curious citizens are more equipped than ever to find forward-thinking and entirely new solutions for some of the world’s biggest challenges. These opportunities for innovation are improving citizen services and creating opportunities for a new class of civic tech entrepreneur. This session will highlight real examples of open data enabling transformative innovation on the national and local levels. You will hear about GIS Open Data, NASA’s Citizen Science program and how the new Landsat AWS Public Data Set has supported the development of new applications for citizens and government, and other examples of how open data has helped drive citizen engagement.
Steve Sofian, Solution Architect, Amazon Web Services, WWPS, ASEAN
4. What is Open Data?
Open data is data that can be used by anyone for any purpose for free.
Many of our customers, such as Esri, the Weather Company, and the
Climate Corporation, rely on quality open data as much as they rely on our
computing, storage, and other web services.
5. Open Data on AWS
Amazon S3 lets
you store and
retrieve any amount
of data, at any time,
from anywhere on
the web.
Amazon Elastic
MapReduce
(Amazon EMR)
provides the Apache
Hadoop analytics
framework as an
easy-to-use
managed service.
Amazon
DynamoDB is a
fully-managed
NoSQL database
service that makes
it cost-effective to
store and retrieve
any amount of data.
Amazon API
Gateway is a fully
managed service
that makes it easy
to create, publish,
maintain, monitor
and secure APIs
Amazon Web Services provides a comprehensive toolkit for gathering,
storing, analyzing, and working with data at any scale.
6. The power of open data on AWS
Making data open on AWS enables more innovation by making data
available for rapid access to our flexible and low-cost computing
resources.
Amazon
EC2
Amazon
EMR
Amazon
Redshift
Amazon
DynamoDB
AWS
Lambda
Amazon
S3
7. The Weather Company saves $1 million per year running its
forecasting application on AWS
The Weather Company provides millions of people
with the world’s best weather forecasts,
content and data, every day.
Using AWS, TWC can scale as
necessary to handle constantly
changing workloads and maintain
our 11-millisecond response time.
Bryson Koehler
EVP, CTO, CIO, The Weather Company
”
“ • Needed a cost-effective, scalable
alternative to operating 13 data centers
with legacy systems.
• TWC ingests, stores, and analyzes
ingests 4 GB of weather data per
second from over 800 sources.
• Designed to handle more than 15 billion
API calls each day, at a rate of 150,000
per second.
• Reduced its on-premises IT
environment form 13 to 6 data centers.
8. Data Enrichment
Sensemaking
Data at Rest
(Object storage)
Basic APIs
Complex APIs
Consumer
applications
Algorithmic
policy
Data-driven
journalism
Data Catalogs
Focused data
dashboards
Predictive
modeling
Visualizations
Lower cost of knowledge
(Efficiency)
Open data as a platform
9. Data Creation Data Enrichment
Sensemaking
Data at Rest
(Object storage)
Basic APIs
Complex APIs
Consumer
applications
Algorithmic
policy
Data-driven
journalism
Data Catalogs
Focused data
dashboards
Predictive
modeling
Visualizations
Efficiency
Open data as a platform
13. OneMap
First government-wide national intelligent map portal
• Integrated map system for government agencies to deliver location-based
services and information to government agencies and citizens
• Powers over 100 government GIS websites and applications
• Reduced costs by 60%
“AWS has helped my organization
to provide better service availability
and handle higher traffic load at a
lower cost.”
—Chan Chin Wai, Chief Information Officer
Singapore Land Authority
15. Open Data Applications
Ministry of Social & Family Development SG Cares – Volunteering Opportunities
OneMotoring – traffic.smart Surround Network – Location Based Commerce
17. Public datasets on AWS
To enable more innovation, AWS hosts a selection of datasets that anyone
can access for free. Data in our public datasets is available for rapid
access to our flexible and low-cost computing resources.
Earth Science
NASA Earth Exchange
(NASA NEX)
Life Sciences
1000 Genomes Project
Internet Science
Common Crawl Corpus
18. Landsat
The Landsat program is a joint effort
of the U.S. Geological Survey and
NASA. It is the longest running
program to gather Earth imagery
from space and is considered the
gold standard for natural resources
satellite imagery.
19. Landsat is big open data
The Landsat program is a joint effort
of the U.S. Geological Survey and
NASA. It is the longest running
program to gather Earth imagery
from space and is considered the
gold standard for natural resources
satellite imagery.
It has traditionally been time-
consuming and expensive to
acquire, store, and analyze Landsat
data.
20. Landsat on AWS
We have committed to making up to
1 petabyte of Landsat imagery
readily available as objects on
Amazon S3.
Now, anyone can analyze Landsat
data at web scale with no significant
up-front investment of time or capital
expense.
21. Esri—Unlock Earth’s Secrets
Esri has created a tool to show how
ArcGIS Online can quickly visualize
Landsat data for live visualization and
analysis within the browser.
“These are not pre-generated cache
services limited to just visualization—
they are dynamic, high-performance
image services that perform on-the-
fly processing and dynamic
mosaicking of Landsat’s multi-
spectral and multi-temporal imagery.”
http://www.esri.com/landsatonaws
22. landsat-util
Landsat on AWS helped
Development Seed make
optimizations that make landsat-util
over 2× faster and allow for more
functionality.
https://developmentseed.org/blog/2015/03/19/aws-landsat-archive/
23. Landsat-live
Mapbox created Landsat-live, a map
that is constantly refreshed with the
latest satellite imagery from NASA’s
Landsat 8 satellite.
Creating a live Earth imagery
pipeline is possible because Landsat
imagery is available on Amazon S3
within hours of creation.
https://www.mapbox.com/blog/landsat-live-live/
24. MATLAB—Landsat8 Data Explorer
MathWorks created a freely
downloadable MATLAB based tool
for accessing, processing, and
visualizing Landsat 8 data.
The tool allows MATLAB users to
find Landsat 8 scenes, analyze
them, and combine them with other
sources of GIS data for new
visualizations.
http://blogs.mathworks.com/steve/2015/03/19/matlab-landsat-8-aws/
25. Summary
• Publicly available resources have reached critical mass
• Governments
• Citizens
• Businesses
• Consumers
• Seize upon open data to improve products and services
• Businesses begin leverage on open innovation