3. What is Spark?
Visit www.spark.tc for more information. IBM | Spark
• Spark is an application framework for highly iterative analysis that scales to large volumes of data.
• Spark provides a platform that brings application developers, data scientists, and data engineers together in a unified, easy-to-use environment.*
* read more from Rob Thomas on the STC blog: http://www.spark.tc/spark/
Apache Spark™ lowers the barrier to entry for building analytics applications by reducing the time
and complexity of developing analytic workflows.
4. What is the STC?
Bridge: The STC (Spark Technology Center) is an interface between IBM
and the Spark community.
Contribute: We contribute to the Apache
Spark project.
Build: We build applications & experiences
on the Spark technology platform.
5. STC Design Mission
Make data available to everyone.
• We are a multidisciplinary team of designers who collaborate with the Spark community to create data experiences that benefit real people, everywhere.
• We want to take Big Data out of the laboratory and put it in the hands of people who can use it to do amazing things.
Everything we do is open source and available to the world.
6. How do we work?
STC Design has three primary areas of focus:
Create: Build new data science tools for the community.
Consume: Build experiences that demonstrate the capabilities of Spark.
Engage: Build experiences in collaboration with clients & the community.
7. Focus #1: Create
Current tools for data scientists are crude and difficult to use, with a steep learning curve. It won’t always be this
way: STC Design aspires to create beautiful, powerful, easy-to-use tools for the community.
First project: redesign the Apache Zeppelin notebook to make it the best notebook available for data scientists.
8. Focus #2: Consume
Data science means nothing unless “real” people can benefit from the technology. “Consume” apps are built to
place the power of data in the hands of anyone: business people, doctors, students, regular people anywhere. The
first of these apps is RedRock, a simple yet powerful Twitter analysis tool for the iPad.
9. Focus #3: Engage
STC Design is an IBM Studio, a place to collaborate with clients, partners and the open source community to
create data experiences that benefit the world.
10. Engage: Mission
Engage with the open source community and promote big data technologies that leverage Spark.
We aspire to help people with great ideas, or to work together with them to solve problems in ways that can be extended to people and businesses everywhere.
11. Engage: Keywords
Explore
Innovate
Create
Open Source
Storytellers
Benefit people
Wow factor
Public stories
Awesome partners
Flexibility
Experiment
Committed partners
STC Portfolio
Solve real problems
Sexy products
Real products
Community
STC Blog
12. Engage: Decision Tree
We would love to collaborate on projects that are aligned with our mission.
We can help teams with any aspect of design thinking or the design process (from user research to ideation sessions and visual design creation).
[Flowchart: decision tree for evaluating engagement opportunities. Each question branches on YES or NO; the connections between nodes are not recoverable from the extracted text.]
Questions:
• Is it open source?
• Is it public?
• Can it go public?
• Is the partner an existing IBM client?
• Is it a core product for them?
• Does it use Spark?
• Is the partner willing to experiment?
• Does the product need design consulting?
• Does the product have a reasonable chance of being launched?
• Does the project allow flexible deadlines?
• Is the product going to have a strong impact in the community?
• Are we going to have people support from the partner?
Outcomes: New engage opportunity, Possible partnership, Case by case, Extinguish.
13. STC Design: milestone projects
RedRock is an app that analyzes Twitter data to help
people understand the response to any keyword:
-what are the unique conversations on the topic?
-how do people feel about the topic?
-when and from where have tweets originated?
-what are the professions of the people involved?
-what other topics are part of the conversation?
RedRock was kicked off the week after STC Design
launched, and the app was launched within a month.
RedRock
14. STC Design: milestone projects
RedRock v2 builds on the RedRock foundations, with better visualizations, far more data (2B
tweets), continuous data updates, and near-instantaneous performance.
RedRock was kicked off in September and launched in mid-October.
15. STC Design: milestone projects
Apache Zeppelin is the open source notebook that
forms the core user experience of IOP. Alongside the
IOP work of integrating Zeppelin into the platform, the
team is working on improvements to the open-source
version of the design. As of November, we have
already committed several bug fixes, and have core
improvements to UI functionality and design language
ready to submit.
IBM Analytics Platform
HILLS
Hill 1: A Data Scientist or Data Engineer will be able to produce data visualizations directly from a notebook and share these visualizations with stakeholders in standard formats (e.g. .pdf, .exl, .html).
Hill 2: Data Scientists and Data Engineers will be able to actively collaborate, using their language of choice, in the same notebook, in real time.
Hill 3: A member of the open source community will see IBM Design directly contributing assets to the Apache Zeppelin project and think of IBM Design as a leader in open source design.
PERSONAS
Susan, Data Scientist
Ben, Data Engineer
Diana, CEO
[Each persona is rated on Analytics Knowledge, Business Knowledge, and Programming Skills.]
MISSION
Introduce the first truly accessible exploratory data science tool to the open source community and establish IBM Design as a leader in the open source community.
COMPETITORS
Apache Zeppelin Notebook
16. Visit www.spark.tc for more information
and sign up for the newsletter
Thanks,
Valeria Montrucchio | valeriamon@us.ibm.com