SlideShare a Scribd company logo
1 of 59
Download to read offline
Treasure Data Cloud Strategy
Masahiro Nakagawa
July Tech Festa: Jul 14, 2013
Sunday, July 14, 13
Who are you?
§ Masahiro Nakagawa
• @repeatedly / masa@treasure-data.com
§ Treasure Data, Inc.
• Senior Software Engineer, since 2012/11
§ Open Source projects
• D Programming Language
• MessagePack: D, Python, etc...
• Fluentd: Core, Mongo, Logger, etc...
• etc...
2
Sunday, July 14, 13
Treasure Data overview
Sunday, July 14, 13
Company Overview
§ Silicon Valley-based Company
• All Founders are Japanese
• Hironobu Yoshikawa
• Kazuki Ohta
• Sadayuki Furuhashi
• About 20 people
• Over 3.5 million jobs
§ OSS Enthusiasts
• MessagePack, Fluentd, etc.
4
Sunday, July 14, 13
Investors
§ Bill Tai
§ Othman Laraki - Former VP Growth at Twitter
§ James Lindenbaum, Adam Wiggins, Orion Henry -
Heroku Founders
§ Anand Babu Periasamy, Hitesh Chellani - Gluster
Founders
§ Yukihiro “Matz” Matsumoto - Creator of Ruby
§ Dan Scheinman - Director of Arista Networks
§ Jerry Yang - Founder of Yahoo!
5
Sunday, July 14, 13
6
DataVolume
Cloud
Enterprise
RDBMSLightweight
RDBMS
DB2
1Bil entry
Or 10TB
Traditional
Data Warehouse
$10B
market
$34B
market
Database-as-a-service
Big Data-as-a-Service
On-Premise
© 2012 Forrester Research, Inc. Reproduction Prohibited
Treasure Data = Cloud + Big Data
Sunday, July 14, 13
The Problem with Other Solutions
7
Customer
Value
Time
Sign-up or PO
On-Premise
Solutions
Obsolescence
over time
Treasure Data
Fully integrated Big Data full-
stack service with simple
interface, low friction initial
engagement & continuous
technical upgrade
Need Upgrade
AWS
(or hosted Hadoops)EC2
EMR
RedShift
S3 Step-by-step manual
integrations
Maintain
NO SpecialistsTOO LONG to get Live
=
Complex Solutions
+
Data Collection
+
Sunday, July 14, 13
8
Big Data Adoption Stages
Intelligence Sophistication
Standard Reports
Ad-hoc Reports
Drill Down Query
Alerts
Statistical Analysis
Predictive Analysis
Optimization
What happened?
Where?
Where exactly?
Error?
Why?
What’s a trend?
What’s the best?
Analytics
Reporting
Sunday, July 14, 13
8
Big Data Adoption Stages
Intelligence Sophistication
Standard Reports
Ad-hoc Reports
Drill Down Query
Alerts
Statistical Analysis
Predictive Analysis
Optimization
What happened?
Where?
Where exactly?
Error?
Why?
What’s a trend?
What’s the best?
Analytics
Reporting
Treasure Data’s FOCUS
(80% of needs)
Sunday, July 14, 13
9
Full Stack Support for Big Data Reporting
Our best-in-class architecture
and operations team ensure the
integrity and availability of your
data.
Data from almost any source
can be securely and reliably
uploaded using td-agent in
streaming or batch mode.
Our SQL, REST, JDBC, ODBC
and command-line interfaces
support all major query tools
and approaches.
You can store gigabytes to
petabytes of data efficiently and
securely in our cloud-based
columnar datastore.
Sunday, July 14, 13
We are...
10
Big Data as a Service
not
Hadoop on Cloud
Sunday, July 14, 13
Columnar Storage
+
Hadoop
MapReduce
600 bil+ records
3.5 mil+ jobs
Product
11
Data Collection Data Warehouse Data Analysis
Open-Source
Log Collector
2,500+ companies
(incl. LinkedIn, etc)
Bulk Loader
CSV / TSV
MySQL,
Postgres
Oracle, etc.
Web Log
App Log
Sensor
RDBMS
CRM
ERP
Streaming Upload
60billion / month
BI Tools
Tableau, QlickView,
Pentaho, Excel, etc.
REST
JDBC / ODBC
SQL
(HiveQL)
or
Pig
Bulk Upload
Parallel Upload
Value Proposition:
“Time-to-Answer” 20bil+, 2 weeks,
UK/Austria
3bil+, 3 weeks
Singapore
2 weeks,
US
2 weeks,
US
3 weeks,
Japan
Dashboard
Custom App,
RDBMS, FTP, etc.
Result push
Multi-Tenant: Single Code for Everyone - Improving the Platform Faster (e.g. SFDC, Heroku)
Sunday, July 14, 13
12
Our Customers – 80 companies
http://docs.treasure-data.com/categories/success-stories
Sunday, July 14, 13
13
A case: “14 Days” from Signup to Success
1. Europe’s largest mobile ad
exchange.
2. Serving >20 billion imps/
month for >15,000 mobile
apps (Q1 2013)
3. Immediate need of analytics
infrastructure: ASAP!
4. With TD, MobFox got into
production only in 14 days,
by one engineer.
"Time is the most precious asset in our fast-moving business,
and Treasure Data saved us a lot of it."
Julian Zehetmayr, CEO & Founder
td-agent =
fluentd rpm/deb
Sunday, July 14, 13
14
A case: “Replace” in-house Hadoop to TD
1. Global “Hulu” - Online Video
Service with millions of users
2. Video contents are distributed
to over 150 languages.
3. Had hard time maintaining
Hadoop cluster
4. With TD, Viki deprecated their
in-house Hadoop cluster and
use engineer for core
businesses.
Before
After
“Treasure Data has always given us thorough and timely support
peppered with insightful tips to make the best use of their service."
Huy Nguyen, Software Engineer
Sunday, July 14, 13
15
A case: Treasure Data with BI Tool (Tableau)
1. World’s largest android
application market
2. Serving >3 billion app
downloads for >100 million
users
3. Only one engineer managing
the data infrastructure
4. With TD, the data engineer can
focus on analyzing data with
existing BI tool
"I will recommend Treasure Data to my friends in a heartbeat because it
benefits all three stakeholders: Operations, Engineering and Business."
Simon Dong, Principal Architect - Data Engineering
Sunday, July 14, 13
16
- Vision -
Single Analytics Platform for the World
http://www.chisite.org/initiatives/WGII
Sunday, July 14, 13
Treasure Data’s
Service Architecture
Sunday, July 14, 13
18
Treasure Data = Collect + Store + Query
Sunday, July 14, 13
19
Architecture Breakdown
Data Collection
• Increasing variety of
data sources
• No single data schema
• Lack of streaming data
collection method
• 60% of Big Data project
resource consumed
Data Store/Analytics
• Remaining complexity in
both traditional DWH
and Hadoop (very slow
time to market)
• Challenges in scaling
data volume and
expanding cost.
Connectivity
• Required to ensure
connectivity with
existing BI/visualization/
apps by JDBC, ODBC
and REST.
• Output ot other services,
e.g. S3, RDBMS, etc.
Sunday, July 14, 13
Product Philosophy
§ Data first, Schema later
• “Schema-on-Read”
• Both Batch and Query processing
§ Simple APIs
• Easy to use and powerful
§ Easy integration
• Log collecting, BI tools and etc...
20
Sunday, July 14, 13
Our technology stack
§ td-agent
• ETL part of Treasure Data
§ Plazma
• Big data processing infrastructure
• Columnar oriented storage
• Reliable data handling
§ Multi-tenant scheduler
• Robust distributed queue and scheduler
21
Sunday, July 14, 13
§ 60% of BI project resource is consumed here
§ Most ‘underestimated’ and ‘unsexy’ but MOST important
§ Fluentd: OSS lightweight but robust Log Collector
• http://fluentd.org/
1) Data Collection
22
Sunday, July 14, 13
Apache
App
App
Other data sources
td-agent
RDBMS
Treasure Data
columnar data
warehouse
Query
Processing
Cluster
Query
API
HIVE, PIG
JDBC, REST
User
td-command
BI apps
23
This!
Sunday, July 14, 13
fluentd.org
Fluentd
the missing log collector
24
Sunday, July 14, 13
Data Processing
Collect Store Process Visualize
Data source
Reporting
Monitoring
Sunday, July 14, 13
Store Process
Cloudera
Horton Works
Treasure Data
Collect Visualize
Tableau
Excel
R
easier & shorter time
???
Related Products
Sunday, July 14, 13
In short
§ Open sourced log collector written in Ruby
• Easy to use, reliable and well performance
• like streaming event processing
§ Using rubygems ecosystem for plugins
27
It’s like syslogd, but
uses JSON for log messages
Sunday, July 14, 13
tail
insert
event
buffering
127.0.0.1 - - [11/Dec/2012:07:26:27] "GET / ...
127.0.0.1 - - [11/Dec/2012:07:26:30] "GET / ...
127.0.0.1 - - [11/Dec/2012:07:26:32] "GET / ...
127.0.0.1 - - [11/Dec/2012:07:26:40] "GET / ...
127.0.0.1 - - [11/Dec/2012:07:27:01] "GET / ...
...
28
Fluentd
Web Server
Example (apache to monogdb)
2012-12-11 07:26:27
apache.log
{
"host": "127.0.0.1",
"method": "GET",
...
}
Sunday, July 14, 13
Application
・・・
Server2
Application
・・・
Server3
Application
・・・
Server1
FluentLog Server
High Latency!
must wait for a day...
29
Before Fluentd
Sunday, July 14, 13
Application
・・・
Server2
Application
・・・
Server3
Application
・・・
Server1
Fluentd Fluentd Fluentd
Fluentd Fluentd
In streaming!
30
After Fluentd
Sunday, July 14, 13
Buffer Output
Input
> Forward
> HTTP
> File tail
> dstat
> ...
> Forward
> File
> MongoDB
> ...
> File
> Memory
31
Pluggable architecture
Engine
Output
> rewrite
> ...
Pluggable Pluggable
Sunday, July 14, 13
Nagios
MongoDB
Hadoop
Alerting
Amazon S3
Analysis
Archiving
MySQL
Apache
Frontend
Access logs
syslogd
App logs
System logs
Backend
Databases
buffer / filter / routing
32
Sunday, July 14, 13
td-agent
§ Open sourced distribution package of Fluentd
• ETL part of Treasure Data
• rpm, deb and homebrew
§ Including useful components
• ruby, jemalloc, fluentd
• 3rd party gems: td, mongo, webhdfs, etc...
• td plugin is for Treasure Data
§ http://packages.treasure-data.com/
33
Sunday, July 14, 13
§ Remaining complexity in both DWH and Hadoop
§ Challenges in scaling data volume and expanding cost
§ Plazma: Hadoop eco system and own projects
2) Data Store / Analytics
34
Sunday, July 14, 13
Apache
App
App
Other data sources
td-agent
RDBMS
Treasure Data
columnar data
warehouse
Query
Processing
Cluster
Query
API
HIVE, PIG
JDBC, REST
User
td-command
BI apps
35
This!
Sunday, July 14, 13
AWS Component Dependencies (1)
§ RDS
• Store user information, job status, etc...
• Store metadata of our columnar database
• Queue worker / Scheduler
§ EC2
• API servers (Ruby on Rails 3)
• Hadoop clusters
• Job workers
• Using Chef to deploy
36
Sunday, July 14, 13
AWS Component Dependencies (2)
§ ELB
• Load balancing of API servers
• Load balancing of td-agents
§ S3
• Columnar storage built on top of S3
• MessagePack columnar format
• Realtime / Archive storage
• Our Result feature supports S3 output.
37
No EBS, EMR, SQS and other products !
Sunday, July 14, 13
Frontend
Queue
Worker
Hadoop
Fluentd
Applications push
metrics to Fluentd
(via local Fluentd)
Librato Metrics
for realtime analysis
Treasure
Data
for historical analysis
Fluentd sums up data minutes
(partial aggregation)
Treasure Data Service Processing Flow
38
Hadoop
Sunday, July 14, 13
39
Data Processing Flow
Sunday, July 14, 13
Structure of Columnar Storages
Realtime Storage
merge (every 1 hour)
2013-07-12 00:23:00 912ec80
2013-07-13 00:01:00 277a259
2013-07-14 00:02:00 d52c831
...
23c82b0ba3405d4c15aa85d2190e
6d7b1482412ab14f0332b8aee119
8a7bc848b2791b8fd603c719e54f
0e3d402b17638477c9a7977e7dab
...
SELECT ...
Archive Storage
Data import
40
Sunday, July 14, 13
Query Language
Query Execution
Columnar Data
Object Storage
41
Sunday, July 14, 13
1/4: Compile SQL into MapReduce
SELECT COUNT(DISTINCT ip) FROM tbl;
SQL Statement
Hive
SQL - to - MapReduce
42
+TD UDFs
Sunday, July 14, 13
2/4: MapReduce is executed in parallel
SELECT COUNT(DISTINCT ip) FROM tbl;
43
Sunday, July 14, 13
3/4: Columnar Data Access
Read ONLY the Required Part of Data
SELECT COUNT(DISTINCT ip) FROM tbl;
44
Sunday, July 14, 13
4/4: Object-based Storage
45
Sunday, July 14, 13
Apply Schema
{“user”:54, “name”:”test”, “value”:”120”, “host”:”local”}
Schema user:int name:string value:int
SELECT 54 (int)
Raw data(JSON)
“test” (string) 120 (int)
host:int
NULL
46
Sunday, July 14, 13
Multi-Tenancy
§ All customers share the Hadoop clusters (Multi Data Centers)
§ Resource Sharing (Burst Cores), Rapid Improvement, Ease of Upgrade
47
datacenter A
datacenter B
datacenter C
datacenter D
Local FairScheduler
Local FairScheduler
Local FairScheduler
Local FairScheduler
Global
Scheduler
On-Demand
Resouce Allocation
Job Submission
+ Plan Change
Sunday, July 14, 13
Trial and error on Cloud
§ Rapid development
• Change hardware
• New architecture testing
• Performance testing
• Change software
• Hadoop parameters
• etc...
§ Use git and chef for these purposes
• Easy to deploy and apply changes
• git for change history
48
Sunday, July 14, 13
§ Services
• CopperEgg
• Librato Metrics
• Logentries
• NewRelic
• PagerDuty
• Desk.com
• Olark
• HipChat
• Alerting
Our Operation Stack: Full Use of SaaS
49
§ Tools
• Hosted Chef (Opscode)
• Jenkins
• including integration test
44
Sunday, July 14, 13
Sunday, July 14, 13
Sunday, July 14, 13
Sunday, July 14, 13
53
3) Connectivity
§ Need to visualize the query result
§ Use metrics / graph for interactive comparison
§ Result: Export result and use existence tools
45
Sunday, July 14, 13
Apache
App
App
Other data sources
td-agent
RDBMS
Treasure Data
columnar data
warehouse
Query
Processing
Cluster
Query
API
HIVE, PIG
JDBC, REST
User
td-command
BI apps
54
This!
Sunday, July 14, 13
55
Pull and Push approaches
Query
(Pull)
Web App
MySQL
Treasure Data
Columnar Storage
Query
Processing
Cluster
Query
API
REST API
JDBC, ODBC Driver
td-command
BI apps
S3
Result
(Push)
…
Sunday, July 14, 13
Support list
56
§ Result
• Treasure Data
• MySQL
• PostgreSQL
• Google SpreadSheet
• REST API
• S3
• etc...
§ BI tool
• Pentaho
• Tableau
• JasperSoft
• Indicee
• Dr. Sum
• Metric Insight
• etc...
http://docs.treasure-data.com/categories/3rd-party-tools-overview
http://docs.treasure-data.com/categories/result
Sunday, July 14, 13
§ Treasure Data
• Cloud based Big-data analytics platform
• Provide Machete for Big data reporting
§ Big Data processing
• Collect / Store / Analytics / Visualization
§ Consider trade-off
• Cloud reinforces idea but not differentiator
• What is the strong point?
• Should focus own vision!
Conclusion
57
Our focus!
Sunday, July 14, 13
Big Data for the Rest of Us
www.treasure-data.com | @TreasureData
Sunday, July 14, 13

More Related Content

What's hot

July 2014 HUG : Pushing the limits of Realtime Analytics using Druid
July 2014 HUG : Pushing the limits of Realtime Analytics using DruidJuly 2014 HUG : Pushing the limits of Realtime Analytics using Druid
July 2014 HUG : Pushing the limits of Realtime Analytics using DruidYahoo Developer Network
 
Data Analytics and Processing at Snap - Druid Meetup LA - September 2018
Data Analytics and Processing at Snap - Druid Meetup LA - September 2018Data Analytics and Processing at Snap - Druid Meetup LA - September 2018
Data Analytics and Processing at Snap - Druid Meetup LA - September 2018Charles Allen
 
Druid: Under the Covers (Virtual Meetup)
Druid: Under the Covers (Virtual Meetup)Druid: Under the Covers (Virtual Meetup)
Druid: Under the Covers (Virtual Meetup)Imply
 
Programmatic Bidding Data Streams & Druid
Programmatic Bidding Data Streams & DruidProgrammatic Bidding Data Streams & Druid
Programmatic Bidding Data Streams & DruidCharles Allen
 
Treasure Data and AWS - Developers.io 2015
Treasure Data and AWS - Developers.io 2015Treasure Data and AWS - Developers.io 2015
Treasure Data and AWS - Developers.io 2015N Masahiro
 
Apache Druid®: A Dance of Distributed Processes
 Apache Druid®: A Dance of Distributed Processes Apache Druid®: A Dance of Distributed Processes
Apache Druid®: A Dance of Distributed ProcessesImply
 
Presto at Hadoop Summit 2016
Presto at Hadoop Summit 2016Presto at Hadoop Summit 2016
Presto at Hadoop Summit 2016kbajda
 
Building Data Applications with Apache Druid
Building Data Applications with Apache DruidBuilding Data Applications with Apache Druid
Building Data Applications with Apache DruidImply
 
Gregorry Letribot - Druid at Criteo - NoSQL matters 2015
Gregorry Letribot - Druid at Criteo - NoSQL matters 2015Gregorry Letribot - Druid at Criteo - NoSQL matters 2015
Gregorry Letribot - Druid at Criteo - NoSQL matters 2015NoSQLmatters
 
Practical Medium Data Analytics with Python (10 Things I Hate About pandas, P...
Practical Medium Data Analytics with Python (10 Things I Hate About pandas, P...Practical Medium Data Analytics with Python (10 Things I Hate About pandas, P...
Practical Medium Data Analytics with Python (10 Things I Hate About pandas, P...Wes McKinney
 
Presto - Hadoop Conference Japan 2014
Presto - Hadoop Conference Japan 2014Presto - Hadoop Conference Japan 2014
Presto - Hadoop Conference Japan 2014Sadayuki Furuhashi
 
August meetup - All about Apache Druid
August meetup - All about Apache Druid August meetup - All about Apache Druid
August meetup - All about Apache Druid Imply
 
Using PostgreSQL with Bibliographic Data
Using PostgreSQL with Bibliographic DataUsing PostgreSQL with Bibliographic Data
Using PostgreSQL with Bibliographic DataJimmy Angelakos
 
Web analytics at scale with Druid at naver.com
Web analytics at scale with Druid at naver.comWeb analytics at scale with Druid at naver.com
Web analytics at scale with Druid at naver.comJungsu Heo
 
Druid Adoption Tips and Tricks
Druid Adoption Tips and TricksDruid Adoption Tips and Tricks
Druid Adoption Tips and TricksImply
 
AWS big-data-demystified #1.1 | Big Data Architecture Lessons Learned | English
AWS big-data-demystified #1.1  | Big Data Architecture Lessons Learned | EnglishAWS big-data-demystified #1.1  | Big Data Architecture Lessons Learned | English
AWS big-data-demystified #1.1 | Big Data Architecture Lessons Learned | EnglishOmid Vahdaty
 
Introduction to Presto at Treasure Data
Introduction to Presto at Treasure DataIntroduction to Presto at Treasure Data
Introduction to Presto at Treasure DataTaro L. Saito
 
Apache Druid Vision and Roadmap
Apache Druid Vision and RoadmapApache Druid Vision and Roadmap
Apache Druid Vision and RoadmapImply
 

What's hot (20)

July 2014 HUG : Pushing the limits of Realtime Analytics using Druid
July 2014 HUG : Pushing the limits of Realtime Analytics using DruidJuly 2014 HUG : Pushing the limits of Realtime Analytics using Druid
July 2014 HUG : Pushing the limits of Realtime Analytics using Druid
 
Presto+MySQLで分散SQL
Presto+MySQLで分散SQLPresto+MySQLで分散SQL
Presto+MySQLで分散SQL
 
Data Analytics and Processing at Snap - Druid Meetup LA - September 2018
Data Analytics and Processing at Snap - Druid Meetup LA - September 2018Data Analytics and Processing at Snap - Druid Meetup LA - September 2018
Data Analytics and Processing at Snap - Druid Meetup LA - September 2018
 
Druid: Under the Covers (Virtual Meetup)
Druid: Under the Covers (Virtual Meetup)Druid: Under the Covers (Virtual Meetup)
Druid: Under the Covers (Virtual Meetup)
 
Programmatic Bidding Data Streams & Druid
Programmatic Bidding Data Streams & DruidProgrammatic Bidding Data Streams & Druid
Programmatic Bidding Data Streams & Druid
 
Treasure Data and AWS - Developers.io 2015
Treasure Data and AWS - Developers.io 2015Treasure Data and AWS - Developers.io 2015
Treasure Data and AWS - Developers.io 2015
 
Apache Druid®: A Dance of Distributed Processes
 Apache Druid®: A Dance of Distributed Processes Apache Druid®: A Dance of Distributed Processes
Apache Druid®: A Dance of Distributed Processes
 
Presto at Hadoop Summit 2016
Presto at Hadoop Summit 2016Presto at Hadoop Summit 2016
Presto at Hadoop Summit 2016
 
Building Data Applications with Apache Druid
Building Data Applications with Apache DruidBuilding Data Applications with Apache Druid
Building Data Applications with Apache Druid
 
Gregorry Letribot - Druid at Criteo - NoSQL matters 2015
Gregorry Letribot - Druid at Criteo - NoSQL matters 2015Gregorry Letribot - Druid at Criteo - NoSQL matters 2015
Gregorry Letribot - Druid at Criteo - NoSQL matters 2015
 
Presto
PrestoPresto
Presto
 
Practical Medium Data Analytics with Python (10 Things I Hate About pandas, P...
Practical Medium Data Analytics with Python (10 Things I Hate About pandas, P...Practical Medium Data Analytics with Python (10 Things I Hate About pandas, P...
Practical Medium Data Analytics with Python (10 Things I Hate About pandas, P...
 
Presto - Hadoop Conference Japan 2014
Presto - Hadoop Conference Japan 2014Presto - Hadoop Conference Japan 2014
Presto - Hadoop Conference Japan 2014
 
August meetup - All about Apache Druid
August meetup - All about Apache Druid August meetup - All about Apache Druid
August meetup - All about Apache Druid
 
Using PostgreSQL with Bibliographic Data
Using PostgreSQL with Bibliographic DataUsing PostgreSQL with Bibliographic Data
Using PostgreSQL with Bibliographic Data
 
Web analytics at scale with Druid at naver.com
Web analytics at scale with Druid at naver.comWeb analytics at scale with Druid at naver.com
Web analytics at scale with Druid at naver.com
 
Druid Adoption Tips and Tricks
Druid Adoption Tips and TricksDruid Adoption Tips and Tricks
Druid Adoption Tips and Tricks
 
AWS big-data-demystified #1.1 | Big Data Architecture Lessons Learned | English
AWS big-data-demystified #1.1  | Big Data Architecture Lessons Learned | EnglishAWS big-data-demystified #1.1  | Big Data Architecture Lessons Learned | English
AWS big-data-demystified #1.1 | Big Data Architecture Lessons Learned | English
 
Introduction to Presto at Treasure Data
Introduction to Presto at Treasure DataIntroduction to Presto at Treasure Data
Introduction to Presto at Treasure Data
 
Apache Druid Vision and Roadmap
Apache Druid Vision and RoadmapApache Druid Vision and Roadmap
Apache Druid Vision and Roadmap
 

Viewers also liked

トレジャーデータ流,データ分析の始め方
トレジャーデータ流,データ分析の始め方トレジャーデータ流,データ分析の始め方
トレジャーデータ流,データ分析の始め方Takahiro Inoue
 
並列データベースシステムの概念と原理
並列データベースシステムの概念と原理並列データベースシステムの概念と原理
並列データベースシステムの概念と原理Makoto Yui
 
글로벌 사례로 보는 데이터로 돈 버는 법 - 트레저데이터 (Treasure Data)
글로벌 사례로 보는 데이터로 돈 버는 법 - 트레저데이터 (Treasure Data)글로벌 사례로 보는 데이터로 돈 버는 법 - 트레저데이터 (Treasure Data)
글로벌 사례로 보는 데이터로 돈 버는 법 - 트레저데이터 (Treasure Data)Treasure Data, Inc.
 
社内勉強会資料(Varnish Module)
社内勉強会資料(Varnish Module)社内勉強会資料(Varnish Module)
社内勉強会資料(Varnish Module)Iwana Chan
 
[D36] Michael Stonebrakerが生み出した列指向データベースは何が凄いのか? ~Verticaを例に列指向データベースのアーキテクチャ...
[D36] Michael Stonebrakerが生み出した列指向データベースは何が凄いのか? ~Verticaを例に列指向データベースのアーキテクチャ...[D36] Michael Stonebrakerが生み出した列指向データベースは何が凄いのか? ~Verticaを例に列指向データベースのアーキテクチャ...
[D36] Michael Stonebrakerが生み出した列指向データベースは何が凄いのか? ~Verticaを例に列指向データベースのアーキテクチャ...Insight Technology, Inc.
 
[db tech showcase Sapporo 2015] B16:ビッグデータには、なぜ列指向が有効なのか? by 日本ヒューレット・パッカード株式...
[db tech showcase Sapporo 2015] B16:ビッグデータには、なぜ列指向が有効なのか? by 日本ヒューレット・パッカード株式...[db tech showcase Sapporo 2015] B16:ビッグデータには、なぜ列指向が有効なのか? by 日本ヒューレット・パッカード株式...
[db tech showcase Sapporo 2015] B16:ビッグデータには、なぜ列指向が有効なのか? by 日本ヒューレット・パッカード株式...Insight Technology, Inc.
 
Data scientist casual talk in 白金台
Data scientist casual talk in 白金台Data scientist casual talk in 白金台
Data scientist casual talk in 白金台Hiroko Onari
 
Packaging Ecosystems -Monki Gras 2017
Packaging Ecosystems -Monki Gras 2017Packaging Ecosystems -Monki Gras 2017
Packaging Ecosystems -Monki Gras 2017Treasure Data, Inc.
 
JP_Chaosmap 2013 Spring
JP_Chaosmap 2013 SpringJP_Chaosmap 2013 Spring
JP_Chaosmap 2013 SpringHiroshi Kondo
 
C4SAでFacebookアプリつくってみた
C4SAでFacebookアプリつくってみたC4SAでFacebookアプリつくってみた
C4SAでFacebookアプリつくってみたTomoaki Hosomi
 
初心者がC4SAでつくるかんたん神アプリ
初心者がC4SAでつくるかんたん神アプリ初心者がC4SAでつくるかんたん神アプリ
初心者がC4SAでつくるかんたん神アプリNobukazu Yoshii
 
Prestoで実現するインタラクティブクエリ - dbtech showcase 2014 Tokyo
Prestoで実現するインタラクティブクエリ - dbtech showcase 2014 TokyoPrestoで実現するインタラクティブクエリ - dbtech showcase 2014 Tokyo
Prestoで実現するインタラクティブクエリ - dbtech showcase 2014 TokyoTreasure Data, Inc.
 
Nifty cloud c4 sa meetup
Nifty cloud c4 sa meetupNifty cloud c4 sa meetup
Nifty cloud c4 sa meetupYuichi Saotome
 
The future of data by Doug Cutting #hcj2014
The future of data by Doug Cutting  #hcj2014The future of data by Doug Cutting  #hcj2014
The future of data by Doug Cutting #hcj2014Cloudera Japan
 
Toward Firefox OS
Toward Firefox OSToward Firefox OS
Toward Firefox OSdynamis
 
WebIntentsにより拓かれる次のWeb
WebIntentsにより拓かれる次のWebWebIntentsにより拓かれる次のWeb
WebIntentsにより拓かれる次のWebKensaku Komatsu
 
ドリコムの分析環境とデータサイエンス活用事例
ドリコムの分析環境とデータサイエンス活用事例ドリコムの分析環境とデータサイエンス活用事例
ドリコムの分析環境とデータサイエンス活用事例Yohei Sato
 
AWS Black Belt Online Seminar 2017 動画配信 on AWS
AWS Black Belt Online Seminar 2017 動画配信 on AWSAWS Black Belt Online Seminar 2017 動画配信 on AWS
AWS Black Belt Online Seminar 2017 動画配信 on AWSAmazon Web Services Japan
 

Viewers also liked (20)

トレジャーデータ流,データ分析の始め方
トレジャーデータ流,データ分析の始め方トレジャーデータ流,データ分析の始め方
トレジャーデータ流,データ分析の始め方
 
並列データベースシステムの概念と原理
並列データベースシステムの概念と原理並列データベースシステムの概念と原理
並列データベースシステムの概念と原理
 
글로벌 사례로 보는 데이터로 돈 버는 법 - 트레저데이터 (Treasure Data)
글로벌 사례로 보는 데이터로 돈 버는 법 - 트레저데이터 (Treasure Data)글로벌 사례로 보는 데이터로 돈 버는 법 - 트레저데이터 (Treasure Data)
글로벌 사례로 보는 데이터로 돈 버는 법 - 트레저데이터 (Treasure Data)
 
社内勉強会資料(Varnish Module)
社内勉強会資料(Varnish Module)社内勉強会資料(Varnish Module)
社内勉強会資料(Varnish Module)
 
Using Embulk at Treasure Data
Using Embulk at Treasure DataUsing Embulk at Treasure Data
Using Embulk at Treasure Data
 
[D36] Michael Stonebrakerが生み出した列指向データベースは何が凄いのか? ~Verticaを例に列指向データベースのアーキテクチャ...
[D36] Michael Stonebrakerが生み出した列指向データベースは何が凄いのか? ~Verticaを例に列指向データベースのアーキテクチャ...[D36] Michael Stonebrakerが生み出した列指向データベースは何が凄いのか? ~Verticaを例に列指向データベースのアーキテクチャ...
[D36] Michael Stonebrakerが生み出した列指向データベースは何が凄いのか? ~Verticaを例に列指向データベースのアーキテクチャ...
 
[db tech showcase Sapporo 2015] B16:ビッグデータには、なぜ列指向が有効なのか? by 日本ヒューレット・パッカード株式...
[db tech showcase Sapporo 2015] B16:ビッグデータには、なぜ列指向が有効なのか? by 日本ヒューレット・パッカード株式...[db tech showcase Sapporo 2015] B16:ビッグデータには、なぜ列指向が有効なのか? by 日本ヒューレット・パッカード株式...
[db tech showcase Sapporo 2015] B16:ビッグデータには、なぜ列指向が有効なのか? by 日本ヒューレット・パッカード株式...
 
Internals of Presto Service
Internals of Presto ServiceInternals of Presto Service
Internals of Presto Service
 
Data scientist casual talk in 白金台
Data scientist casual talk in 白金台Data scientist casual talk in 白金台
Data scientist casual talk in 白金台
 
Packaging Ecosystems -Monki Gras 2017
Packaging Ecosystems -Monki Gras 2017Packaging Ecosystems -Monki Gras 2017
Packaging Ecosystems -Monki Gras 2017
 
JP_Chaosmap 2013 Spring
JP_Chaosmap 2013 SpringJP_Chaosmap 2013 Spring
JP_Chaosmap 2013 Spring
 
C4SAでFacebookアプリつくってみた
C4SAでFacebookアプリつくってみたC4SAでFacebookアプリつくってみた
C4SAでFacebookアプリつくってみた
 
初心者がC4SAでつくるかんたん神アプリ
初心者がC4SAでつくるかんたん神アプリ初心者がC4SAでつくるかんたん神アプリ
初心者がC4SAでつくるかんたん神アプリ
 
Prestoで実現するインタラクティブクエリ - dbtech showcase 2014 Tokyo
Prestoで実現するインタラクティブクエリ - dbtech showcase 2014 TokyoPrestoで実現するインタラクティブクエリ - dbtech showcase 2014 Tokyo
Prestoで実現するインタラクティブクエリ - dbtech showcase 2014 Tokyo
 
Nifty cloud c4 sa meetup
Nifty cloud c4 sa meetupNifty cloud c4 sa meetup
Nifty cloud c4 sa meetup
 
The future of data by Doug Cutting #hcj2014
The future of data by Doug Cutting  #hcj2014The future of data by Doug Cutting  #hcj2014
The future of data by Doug Cutting #hcj2014
 
Toward Firefox OS
Toward Firefox OSToward Firefox OS
Toward Firefox OS
 
WebIntentsにより拓かれる次のWeb
WebIntentsにより拓かれる次のWebWebIntentsにより拓かれる次のWeb
WebIntentsにより拓かれる次のWeb
 
ドリコムの分析環境とデータサイエンス活用事例
ドリコムの分析環境とデータサイエンス活用事例ドリコムの分析環境とデータサイエンス活用事例
ドリコムの分析環境とデータサイエンス活用事例
 
AWS Black Belt Online Seminar 2017 動画配信 on AWS
AWS Black Belt Online Seminar 2017 動画配信 on AWSAWS Black Belt Online Seminar 2017 動画配信 on AWS
AWS Black Belt Online Seminar 2017 動画配信 on AWS
 

Similar to Treasure Data Cloud Strategy

情報処理学会 Exciting Coding! Treasure Data
情報処理学会 Exciting Coding! Treasure Data情報処理学会 Exciting Coding! Treasure Data
情報処理学会 Exciting Coding! Treasure DataTreasure Data, Inc.
 
Treasure Data Cloud Data Platform
Treasure Data Cloud Data PlatformTreasure Data Cloud Data Platform
Treasure Data Cloud Data Platforminside-BigData.com
 
Proud to be polyglot!
Proud to be polyglot!Proud to be polyglot!
Proud to be polyglot!NLJUG
 
Ledingkart Meetup #4: Data pipeline @ lk
Ledingkart Meetup #4: Data pipeline @ lkLedingkart Meetup #4: Data pipeline @ lk
Ledingkart Meetup #4: Data pipeline @ lkMukesh Singh
 
ODI11g, Hadoop and "Big Data" Sources
ODI11g, Hadoop and "Big Data" SourcesODI11g, Hadoop and "Big Data" Sources
ODI11g, Hadoop and "Big Data" SourcesMark Rittman
 
Introduction to NoSQL with Couchbase
Introduction to NoSQL with CouchbaseIntroduction to NoSQL with Couchbase
Introduction to NoSQL with CouchbaseTugdual Grall
 
Data Science with Hadoop - A primer
Data Science with Hadoop - A primerData Science with Hadoop - A primer
Data Science with Hadoop - A primerOfer Mendelevitch
 
Why and How to integrate Hadoop and NoSQL?
Why and How to integrate Hadoop and NoSQL?Why and How to integrate Hadoop and NoSQL?
Why and How to integrate Hadoop and NoSQL?Tugdual Grall
 
Tools and techniques for data science
Tools and techniques for data scienceTools and techniques for data science
Tools and techniques for data scienceAjay Ohri
 
Data Science with Hadoop: A Primer
Data Science with Hadoop: A PrimerData Science with Hadoop: A Primer
Data Science with Hadoop: A PrimerDataWorks Summit
 
Web Briefing: Unlock the power of Hadoop to enable interactive analytics
Web Briefing: Unlock the power of Hadoop to enable interactive analyticsWeb Briefing: Unlock the power of Hadoop to enable interactive analytics
Web Briefing: Unlock the power of Hadoop to enable interactive analyticsKognitio
 
Expand a Data warehouse with Hadoop and Big Data
Expand a Data warehouse with Hadoop and Big DataExpand a Data warehouse with Hadoop and Big Data
Expand a Data warehouse with Hadoop and Big Datajdijcks
 
Future of Data Intensive Applicaitons
Future of Data Intensive ApplicaitonsFuture of Data Intensive Applicaitons
Future of Data Intensive ApplicaitonsMilind Bhandarkar
 
Solving Data Discovery Challenges at Lyft with Amundsen, an Open-source Metad...
Solving Data Discovery Challenges at Lyft with Amundsen, an Open-source Metad...Solving Data Discovery Challenges at Lyft with Amundsen, an Open-source Metad...
Solving Data Discovery Challenges at Lyft with Amundsen, an Open-source Metad...Databricks
 
Big Data Architecture
Big Data ArchitectureBig Data Architecture
Big Data ArchitectureGuido Schmutz
 
Nyc web perf-final-july-23
Nyc web perf-final-july-23Nyc web perf-final-july-23
Nyc web perf-final-july-23Dan Boutin
 
Introduction to the Hadoop Ecosystem (IT-Stammtisch Darmstadt Edition)
Introduction to the Hadoop Ecosystem (IT-Stammtisch Darmstadt Edition)Introduction to the Hadoop Ecosystem (IT-Stammtisch Darmstadt Edition)
Introduction to the Hadoop Ecosystem (IT-Stammtisch Darmstadt Edition)Uwe Printz
 
Data Linage Solution in MOMO
Data Linage Solution in MOMOData Linage Solution in MOMO
Data Linage Solution in MOMONguyen Hai
 
The Value of the Modern Data Architecture with Apache Hadoop and Teradata
The Value of the Modern Data Architecture with Apache Hadoop and Teradata The Value of the Modern Data Architecture with Apache Hadoop and Teradata
The Value of the Modern Data Architecture with Apache Hadoop and Teradata Hortonworks
 

Similar to Treasure Data Cloud Strategy (20)

情報処理学会 Exciting Coding! Treasure Data
情報処理学会 Exciting Coding! Treasure Data情報処理学会 Exciting Coding! Treasure Data
情報処理学会 Exciting Coding! Treasure Data
 
Treasure Data Cloud Data Platform
Treasure Data Cloud Data PlatformTreasure Data Cloud Data Platform
Treasure Data Cloud Data Platform
 
Proud to be polyglot!
Proud to be polyglot!Proud to be polyglot!
Proud to be polyglot!
 
Spotify: Data center & Backend buildout
Spotify: Data center & Backend buildoutSpotify: Data center & Backend buildout
Spotify: Data center & Backend buildout
 
Ledingkart Meetup #4: Data pipeline @ lk
Ledingkart Meetup #4: Data pipeline @ lkLedingkart Meetup #4: Data pipeline @ lk
Ledingkart Meetup #4: Data pipeline @ lk
 
ODI11g, Hadoop and "Big Data" Sources
ODI11g, Hadoop and "Big Data" SourcesODI11g, Hadoop and "Big Data" Sources
ODI11g, Hadoop and "Big Data" Sources
 
Introduction to NoSQL with Couchbase
Introduction to NoSQL with CouchbaseIntroduction to NoSQL with Couchbase
Introduction to NoSQL with Couchbase
 
Data Science with Hadoop - A primer
Data Science with Hadoop - A primerData Science with Hadoop - A primer
Data Science with Hadoop - A primer
 
Why and How to integrate Hadoop and NoSQL?
Why and How to integrate Hadoop and NoSQL?Why and How to integrate Hadoop and NoSQL?
Why and How to integrate Hadoop and NoSQL?
 
Tools and techniques for data science
Tools and techniques for data scienceTools and techniques for data science
Tools and techniques for data science
 
Data Science with Hadoop: A Primer
Data Science with Hadoop: A PrimerData Science with Hadoop: A Primer
Data Science with Hadoop: A Primer
 
Web Briefing: Unlock the power of Hadoop to enable interactive analytics
Web Briefing: Unlock the power of Hadoop to enable interactive analyticsWeb Briefing: Unlock the power of Hadoop to enable interactive analytics
Web Briefing: Unlock the power of Hadoop to enable interactive analytics
 
Expand a Data warehouse with Hadoop and Big Data
Expand a Data warehouse with Hadoop and Big DataExpand a Data warehouse with Hadoop and Big Data
Expand a Data warehouse with Hadoop and Big Data
 
Future of Data Intensive Applicaitons
Future of Data Intensive ApplicaitonsFuture of Data Intensive Applicaitons
Future of Data Intensive Applicaitons
 
Solving Data Discovery Challenges at Lyft with Amundsen, an Open-source Metad...
Solving Data Discovery Challenges at Lyft with Amundsen, an Open-source Metad...Solving Data Discovery Challenges at Lyft with Amundsen, an Open-source Metad...
Solving Data Discovery Challenges at Lyft with Amundsen, an Open-source Metad...
 
Big Data Architecture
Big Data ArchitectureBig Data Architecture
Big Data Architecture
 
Nyc web perf-final-july-23
Nyc web perf-final-july-23Nyc web perf-final-july-23
Nyc web perf-final-july-23
 
Introduction to the Hadoop Ecosystem (IT-Stammtisch Darmstadt Edition)
Introduction to the Hadoop Ecosystem (IT-Stammtisch Darmstadt Edition)Introduction to the Hadoop Ecosystem (IT-Stammtisch Darmstadt Edition)
Introduction to the Hadoop Ecosystem (IT-Stammtisch Darmstadt Edition)
 
Data Linage Solution in MOMO
Data Linage Solution in MOMOData Linage Solution in MOMO
Data Linage Solution in MOMO
 
The Value of the Modern Data Architecture with Apache Hadoop and Teradata
The Value of the Modern Data Architecture with Apache Hadoop and Teradata The Value of the Modern Data Architecture with Apache Hadoop and Teradata
The Value of the Modern Data Architecture with Apache Hadoop and Teradata
 

More from Treasure Data, Inc.

GDPR: A Practical Guide for Marketers
GDPR: A Practical Guide for MarketersGDPR: A Practical Guide for Marketers
GDPR: A Practical Guide for MarketersTreasure Data, Inc.
 
AR and VR by the Numbers: A Data First Approach to the Technology and Market
AR and VR by the Numbers: A Data First Approach to the Technology and MarketAR and VR by the Numbers: A Data First Approach to the Technology and Market
AR and VR by the Numbers: A Data First Approach to the Technology and MarketTreasure Data, Inc.
 
Introduction to Customer Data Platforms
Introduction to Customer Data PlatformsIntroduction to Customer Data Platforms
Introduction to Customer Data PlatformsTreasure Data, Inc.
 
Hands-On: Managing Slowly Changing Dimensions Using TD Workflow
Hands-On: Managing Slowly Changing Dimensions Using TD WorkflowHands-On: Managing Slowly Changing Dimensions Using TD Workflow
Hands-On: Managing Slowly Changing Dimensions Using TD WorkflowTreasure Data, Inc.
 
Brand Analytics Management: Measuring CLV Across Platforms, Devices and Apps
Brand Analytics Management: Measuring CLV Across Platforms, Devices and AppsBrand Analytics Management: Measuring CLV Across Platforms, Devices and Apps
Brand Analytics Management: Measuring CLV Across Platforms, Devices and AppsTreasure Data, Inc.
 
How to Power Your Customer Experience with Data
How to Power Your Customer Experience with DataHow to Power Your Customer Experience with Data
How to Power Your Customer Experience with DataTreasure Data, Inc.
 
Why Your VR Game is Virtually Useless Without Data
Why Your VR Game is Virtually Useless Without DataWhy Your VR Game is Virtually Useless Without Data
Why Your VR Game is Virtually Useless Without DataTreasure Data, Inc.
 
Connecting the Customer Data Dots
Connecting the Customer Data DotsConnecting the Customer Data Dots
Connecting the Customer Data DotsTreasure Data, Inc.
 
Harnessing Data for Better Customer Experience and Company Success
Harnessing Data for Better Customer Experience and Company SuccessHarnessing Data for Better Customer Experience and Company Success
Harnessing Data for Better Customer Experience and Company SuccessTreasure Data, Inc.
 
Introduction to New features and Use cases of Hivemall
Introduction to New features and Use cases of HivemallIntroduction to New features and Use cases of Hivemall
Introduction to New features and Use cases of HivemallTreasure Data, Inc.
 
Scaling to Infinity - Open Source meets Big Data
Scaling to Infinity - Open Source meets Big DataScaling to Infinity - Open Source meets Big Data
Scaling to Infinity - Open Source meets Big DataTreasure Data, Inc.
 
Treasure Data: Move your data from MySQL to Redshift with (not much more tha...
Treasure Data:  Move your data from MySQL to Redshift with (not much more tha...Treasure Data:  Move your data from MySQL to Redshift with (not much more tha...
Treasure Data: Move your data from MySQL to Redshift with (not much more tha...Treasure Data, Inc.
 
Treasure Data From MySQL to Redshift
Treasure Data  From MySQL to RedshiftTreasure Data  From MySQL to Redshift
Treasure Data From MySQL to RedshiftTreasure Data, Inc.
 
Unifying Events and Logs into the Cloud
Unifying Events and Logs into the CloudUnifying Events and Logs into the Cloud
Unifying Events and Logs into the CloudTreasure Data, Inc.
 
Fluentd and Docker - running fluentd within a docker container
Fluentd and Docker - running fluentd within a docker containerFluentd and Docker - running fluentd within a docker container
Fluentd and Docker - running fluentd within a docker containerTreasure Data, Inc.
 
Building a system for machine and event-oriented data with Rocana
Building a system for machine and event-oriented data with RocanaBuilding a system for machine and event-oriented data with Rocana
Building a system for machine and event-oriented data with RocanaTreasure Data, Inc.
 
Augmenting Mongo DB with Treasure Data
Augmenting Mongo DB with Treasure DataAugmenting Mongo DB with Treasure Data
Augmenting Mongo DB with Treasure DataTreasure Data, Inc.
 

More from Treasure Data, Inc. (20)

GDPR: A Practical Guide for Marketers
GDPR: A Practical Guide for MarketersGDPR: A Practical Guide for Marketers
GDPR: A Practical Guide for Marketers
 
AR and VR by the Numbers: A Data First Approach to the Technology and Market
AR and VR by the Numbers: A Data First Approach to the Technology and MarketAR and VR by the Numbers: A Data First Approach to the Technology and Market
AR and VR by the Numbers: A Data First Approach to the Technology and Market
 
Introduction to Customer Data Platforms
Introduction to Customer Data PlatformsIntroduction to Customer Data Platforms
Introduction to Customer Data Platforms
 
Hands On: Javascript SDK
Hands On: Javascript SDKHands On: Javascript SDK
Hands On: Javascript SDK
 
Hands-On: Managing Slowly Changing Dimensions Using TD Workflow
Hands-On: Managing Slowly Changing Dimensions Using TD WorkflowHands-On: Managing Slowly Changing Dimensions Using TD Workflow
Hands-On: Managing Slowly Changing Dimensions Using TD Workflow
 
Brand Analytics Management: Measuring CLV Across Platforms, Devices and Apps
Brand Analytics Management: Measuring CLV Across Platforms, Devices and AppsBrand Analytics Management: Measuring CLV Across Platforms, Devices and Apps
Brand Analytics Management: Measuring CLV Across Platforms, Devices and Apps
 
How to Power Your Customer Experience with Data
How to Power Your Customer Experience with DataHow to Power Your Customer Experience with Data
How to Power Your Customer Experience with Data
 
Why Your VR Game is Virtually Useless Without Data
Why Your VR Game is Virtually Useless Without DataWhy Your VR Game is Virtually Useless Without Data
Why Your VR Game is Virtually Useless Without Data
 
Connecting the Customer Data Dots
Connecting the Customer Data DotsConnecting the Customer Data Dots
Connecting the Customer Data Dots
 
Harnessing Data for Better Customer Experience and Company Success
Harnessing Data for Better Customer Experience and Company SuccessHarnessing Data for Better Customer Experience and Company Success
Harnessing Data for Better Customer Experience and Company Success
 
Keynote - Fluentd meetup v14
Keynote - Fluentd meetup v14Keynote - Fluentd meetup v14
Keynote - Fluentd meetup v14
 
Introduction to New features and Use cases of Hivemall
Introduction to New features and Use cases of HivemallIntroduction to New features and Use cases of Hivemall
Introduction to New features and Use cases of Hivemall
 
Scalable Hadoop in the cloud
Scalable Hadoop in the cloudScalable Hadoop in the cloud
Scalable Hadoop in the cloud
 
Scaling to Infinity - Open Source meets Big Data
Scaling to Infinity - Open Source meets Big DataScaling to Infinity - Open Source meets Big Data
Scaling to Infinity - Open Source meets Big Data
 
Treasure Data: Move your data from MySQL to Redshift with (not much more tha...
Treasure Data:  Move your data from MySQL to Redshift with (not much more tha...Treasure Data:  Move your data from MySQL to Redshift with (not much more tha...
Treasure Data: Move your data from MySQL to Redshift with (not much more tha...
 
Treasure Data From MySQL to Redshift
Treasure Data  From MySQL to RedshiftTreasure Data  From MySQL to Redshift
Treasure Data From MySQL to Redshift
 
Unifying Events and Logs into the Cloud
Unifying Events and Logs into the CloudUnifying Events and Logs into the Cloud
Unifying Events and Logs into the Cloud
 
Fluentd and Docker - running fluentd within a docker container
Fluentd and Docker - running fluentd within a docker containerFluentd and Docker - running fluentd within a docker container
Fluentd and Docker - running fluentd within a docker container
 
Building a system for machine and event-oriented data with Rocana
Building a system for machine and event-oriented data with RocanaBuilding a system for machine and event-oriented data with Rocana
Building a system for machine and event-oriented data with Rocana
 
Augmenting Mongo DB with Treasure Data
Augmenting Mongo DB with Treasure DataAugmenting Mongo DB with Treasure Data
Augmenting Mongo DB with Treasure Data
 

Recently uploaded

Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfIngrid Airi González
 
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...Nikki Chapple
 
Infrared simulation and processing on Nvidia platforms
Infrared simulation and processing on Nvidia platformsInfrared simulation and processing on Nvidia platforms
Infrared simulation and processing on Nvidia platformsYoss Cohen
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesThousandEyes
 
Digital Tools & AI in Career Development
Digital Tools & AI in Career DevelopmentDigital Tools & AI in Career Development
Digital Tools & AI in Career DevelopmentMahmoud Rabie
 
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesAssure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesThousandEyes
 
Français Patch Tuesday - Avril
Français Patch Tuesday - AvrilFrançais Patch Tuesday - Avril
Français Patch Tuesday - AvrilIvanti
 
2024 April Patch Tuesday
2024 April Patch Tuesday2024 April Patch Tuesday
2024 April Patch TuesdayIvanti
 
Transcript: New from BookNet Canada for 2024: BNC SalesData and LibraryData -...
Transcript: New from BookNet Canada for 2024: BNC SalesData and LibraryData -...Transcript: New from BookNet Canada for 2024: BNC SalesData and LibraryData -...
Transcript: New from BookNet Canada for 2024: BNC SalesData and LibraryData -...BookNet Canada
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Farhan Tariq
 
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfpanagenda
 
Landscape Catalogue 2024 Australia-1.pdf
Landscape Catalogue 2024 Australia-1.pdfLandscape Catalogue 2024 Australia-1.pdf
Landscape Catalogue 2024 Australia-1.pdfAarwolf Industries LLC
 
Kuma Meshes Part I - The basics - A tutorial
Kuma Meshes Part I - The basics - A tutorialKuma Meshes Part I - The basics - A tutorial
Kuma Meshes Part I - The basics - A tutorialJoão Esperancinha
 
Microsoft 365 Copilot: How to boost your productivity with AI – Part two: Dat...
Microsoft 365 Copilot: How to boost your productivity with AI – Part two: Dat...Microsoft 365 Copilot: How to boost your productivity with AI – Part two: Dat...
Microsoft 365 Copilot: How to boost your productivity with AI – Part two: Dat...Nikki Chapple
 
Testing tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesTesting tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesKari Kakkonen
 
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...panagenda
 
Generative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptxGenerative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptxfnnc6jmgwh
 
QMMS Lesson 2 - Using MS Excel Formula.pdf
QMMS Lesson 2 - Using MS Excel Formula.pdfQMMS Lesson 2 - Using MS Excel Formula.pdf
QMMS Lesson 2 - Using MS Excel Formula.pdfROWELL MARQUINA
 
QCon London: Mastering long-running processes in modern architectures
QCon London: Mastering long-running processes in modern architecturesQCon London: Mastering long-running processes in modern architectures
QCon London: Mastering long-running processes in modern architecturesBernd Ruecker
 

Recently uploaded (20)

Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdf
 
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
 
Infrared simulation and processing on Nvidia platforms
Infrared simulation and processing on Nvidia platformsInfrared simulation and processing on Nvidia platforms
Infrared simulation and processing on Nvidia platforms
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
 
Digital Tools & AI in Career Development
Digital Tools & AI in Career DevelopmentDigital Tools & AI in Career Development
Digital Tools & AI in Career Development
 
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesAssure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
 
How Tech Giants Cut Corners to Harvest Data for A.I.
How Tech Giants Cut Corners to Harvest Data for A.I.How Tech Giants Cut Corners to Harvest Data for A.I.
How Tech Giants Cut Corners to Harvest Data for A.I.
 
Français Patch Tuesday - Avril
Français Patch Tuesday - AvrilFrançais Patch Tuesday - Avril
Français Patch Tuesday - Avril
 
2024 April Patch Tuesday
2024 April Patch Tuesday2024 April Patch Tuesday
2024 April Patch Tuesday
 
Transcript: New from BookNet Canada for 2024: BNC SalesData and LibraryData -...
Transcript: New from BookNet Canada for 2024: BNC SalesData and LibraryData -...Transcript: New from BookNet Canada for 2024: BNC SalesData and LibraryData -...
Transcript: New from BookNet Canada for 2024: BNC SalesData and LibraryData -...
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...
 
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
 
Landscape Catalogue 2024 Australia-1.pdf
Landscape Catalogue 2024 Australia-1.pdfLandscape Catalogue 2024 Australia-1.pdf
Landscape Catalogue 2024 Australia-1.pdf
 
Kuma Meshes Part I - The basics - A tutorial
Kuma Meshes Part I - The basics - A tutorialKuma Meshes Part I - The basics - A tutorial
Kuma Meshes Part I - The basics - A tutorial
 
Microsoft 365 Copilot: How to boost your productivity with AI – Part two: Dat...
Microsoft 365 Copilot: How to boost your productivity with AI – Part two: Dat...Microsoft 365 Copilot: How to boost your productivity with AI – Part two: Dat...
Microsoft 365 Copilot: How to boost your productivity with AI – Part two: Dat...
 
Testing tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesTesting tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examples
 
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
 
Generative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptxGenerative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptx
 
QMMS Lesson 2 - Using MS Excel Formula.pdf
QMMS Lesson 2 - Using MS Excel Formula.pdfQMMS Lesson 2 - Using MS Excel Formula.pdf
QMMS Lesson 2 - Using MS Excel Formula.pdf
 
QCon London: Mastering long-running processes in modern architectures
QCon London: Mastering long-running processes in modern architecturesQCon London: Mastering long-running processes in modern architectures
QCon London: Mastering long-running processes in modern architectures
 

Treasure Data Cloud Strategy

  • 1. Treasure Data Cloud Strategy Masahiro Nakagawa July Tech Festa: Jul 14, 2013 Sunday, July 14, 13
  • 2. Who are you? § Masahiro Nakagawa • @repeatedly / masa@treasure-data.com § Treasure Data, Inc. • Senior Software Engineer, since 2012/11 § Open Source projects • D Programming Language • MessagePack: D, Python, etc... • Fluentd: Core, Mongo, Logger, etc... • etc... 2 Sunday, July 14, 13
  • 4. Company Overview § Silicon Valley-based Company • All Founders are Japanese • Hironobu Yoshikawa • Kazuki Ohta • Sadayuki Furuhashi • About 20 people • Over 3.5 million jobs § OSS Enthusiasts • MessagePack, Fluentd, etc. 4 Sunday, July 14, 13
  • 5. Investors § Bill Tai § Othman Laraki - Former VP Growth at Twitter § James Lindenbaum, Adam Wiggins, Orion Henry - Heroku Founders § Anand Babu Periasamy, Hitesh Chellani - Gluster Founders § Yukihiro “Matz” Matsumoto - Creator of Ruby § Dan Scheinman - Director of Arista Networks § Jerry Yang - Founder of Yahoo! 5 Sunday, July 14, 13
  • 6. 6 DataVolume Cloud Enterprise RDBMSLightweight RDBMS DB2 1Bil entry Or 10TB Traditional Data Warehouse $10B market $34B market Database-as-a-service Big Data-as-a-Service On-Premise © 2012 Forrester Research, Inc. Reproduction Prohibited Treasure Data = Cloud + Big Data Sunday, July 14, 13
  • 7. The Problem with Other Solutions 7 Customer Value Time Sign-up or PO On-Premise Solutions Obsolescence over time Treasure Data Fully integrated Big Data full- stack service with simple interface, low friction initial engagement & continuous technical upgrade Need Upgrade AWS (or hosted Hadoops)EC2 EMR RedShift S3 Step-by-step manual integrations Maintain NO SpecialistsTOO LONG to get Live = Complex Solutions + Data Collection + Sunday, July 14, 13
  • 8. 8 Big Data Adoption Stages Intelligence Sophistication Standard Reports Ad-hoc Reports Drill Down Query Alerts Statistical Analysis Predictive Analysis Optimization What happened? Where? Where exactly? Error? Why? What’s a trend? What’s the best? Analytics Reporting Sunday, July 14, 13
  • 9. 8 Big Data Adoption Stages Intelligence Sophistication Standard Reports Ad-hoc Reports Drill Down Query Alerts Statistical Analysis Predictive Analysis Optimization What happened? Where? Where exactly? Error? Why? What’s a trend? What’s the best? Analytics Reporting Treasure Data’s FOCUS (80% of needs) Sunday, July 14, 13
  • 10. 9 Full Stack Support for Big Data Reporting Our best-in-class architecture and operations team ensure the integrity and availability of your data. Data from almost any source can be securely and reliably uploaded using td-agent in streaming or batch mode. Our SQL, REST, JDBC, ODBC and command-line interfaces support all major query tools and approaches. You can store gigabytes to petabytes of data efficiently and securely in our cloud-based columnar datastore. Sunday, July 14, 13
  • 11. We are... 10 Big Data as a Service not Hadoop on Cloud Sunday, July 14, 13
  • 12. Columnar Storage + Hadoop MapReduce 600 bil+ records 3.5 mil+ jobs Product 11 Data Collection Data Warehouse Data Analysis Open-Source Log Collector 2,500+ companies (incl. LinkedIn, etc) Bulk Loader CSV / TSV MySQL, Postgres Oracle, etc. Web Log App Log Sensor RDBMS CRM ERP Streaming Upload 60billion / month BI Tools Tableau, QlickView, Pentaho, Excel, etc. REST JDBC / ODBC SQL (HiveQL) or Pig Bulk Upload Parallel Upload Value Proposition: “Time-to-Answer” 20bil+, 2 weeks, UK/Austria 3bil+, 3 weeks Singapore 2 weeks, US 2 weeks, US 3 weeks, Japan Dashboard Custom App, RDBMS, FTP, etc. Result push Multi-Tenant: Single Code for Everyone - Improving the Platform Faster (e.g. SFDC, Heroku) Sunday, July 14, 13
  • 13. 12 Our Customers – 80 companies http://docs.treasure-data.com/categories/success-stories Sunday, July 14, 13
  • 14. 13 A case: “14 Days” from Signup to Success 1. Europe’s largest mobile ad exchange. 2. Serving >20 billion imps/ month for >15,000 mobile apps (Q1 2013) 3. Immediate need of analytics infrastructure: ASAP! 4. With TD, MobFox got into production only in 14 days, by one engineer. "Time is the most precious asset in our fast-moving business, and Treasure Data saved us a lot of it." Julian Zehetmayr, CEO & Founder td-agent = fluentd rpm/deb Sunday, July 14, 13
  • 15. 14 A case: “Replace” in-house Hadoop to TD 1. Global “Hulu” - Online Video Service with millions of users 2. Video contents are distributed to over 150 languages. 3. Had hard time maintaining Hadoop cluster 4. With TD, Viki deprecated their in-house Hadoop cluster and use engineer for core businesses. Before After “Treasure Data has always given us thorough and timely support peppered with insightful tips to make the best use of their service." Huy Nguyen, Software Engineer Sunday, July 14, 13
  • 16. 15 A case: Treasure Data with BI Tool (Tableau) 1. World’s largest android application market 2. Serving >3 billion app downloads for >100 million users 3. Only one engineer managing the data infrastructure 4. With TD, the data engineer can focus on analyzing data with existing BI tool "I will recommend Treasure Data to my friends in a heartbeat because it benefits all three stakeholders: Operations, Engineering and Business." Simon Dong, Principal Architect - Data Engineering Sunday, July 14, 13
  • 17. 16 - Vision - Single Analytics Platform for the World http://www.chisite.org/initiatives/WGII Sunday, July 14, 13
  • 19. 18 Treasure Data = Collect + Store + Query Sunday, July 14, 13
  • 20. 19 Architecture Breakdown Data Collection • Increasing variety of data sources • No single data schema • Lack of streaming data collection method • 60% of Big Data project resource consumed Data Store/Analytics • Remaining complexity in both traditional DWH and Hadoop (very slow time to market) • Challenges in scaling data volume and expanding cost. Connectivity • Required to ensure connectivity with existing BI/visualization/ apps by JDBC, ODBC and REST. • Output ot other services, e.g. S3, RDBMS, etc. Sunday, July 14, 13
  • 21. Product Philosophy § Data first, Schema later • “Schema-on-Read” • Both Batch and Query processing § Simple APIs • Easy to use and powerful § Easy integration • Log collecting, BI tools and etc... 20 Sunday, July 14, 13
  • 22. Our technology stack § td-agent • ETL part of Treasure Data § Plazma • Big data processing infrastructure • Columnar oriented storage • Reliable data handling § Multi-tenant scheduler • Robust distributed queue and scheduler 21 Sunday, July 14, 13
  • 23. § 60% of BI project resource is consumed here § Most ‘underestimated’ and ‘unsexy’ but MOST important § Fluentd: OSS lightweight but robust Log Collector • http://fluentd.org/ 1) Data Collection 22 Sunday, July 14, 13
  • 24. Apache App App Other data sources td-agent RDBMS Treasure Data columnar data warehouse Query Processing Cluster Query API HIVE, PIG JDBC, REST User td-command BI apps 23 This! Sunday, July 14, 13
  • 25. fluentd.org Fluentd the missing log collector 24 Sunday, July 14, 13
  • 26. Data Processing Collect Store Process Visualize Data source Reporting Monitoring Sunday, July 14, 13
  • 27. Store Process Cloudera Horton Works Treasure Data Collect Visualize Tableau Excel R easier & shorter time ??? Related Products Sunday, July 14, 13
  • 28. In short § Open sourced log collector written in Ruby • Easy to use, reliable and well performance • like streaming event processing § Using rubygems ecosystem for plugins 27 It’s like syslogd, but uses JSON for log messages Sunday, July 14, 13
  • 29. tail insert event buffering 127.0.0.1 - - [11/Dec/2012:07:26:27] "GET / ... 127.0.0.1 - - [11/Dec/2012:07:26:30] "GET / ... 127.0.0.1 - - [11/Dec/2012:07:26:32] "GET / ... 127.0.0.1 - - [11/Dec/2012:07:26:40] "GET / ... 127.0.0.1 - - [11/Dec/2012:07:27:01] "GET / ... ... 28 Fluentd Web Server Example (apache to monogdb) 2012-12-11 07:26:27 apache.log { "host": "127.0.0.1", "method": "GET", ... } Sunday, July 14, 13
  • 32. Buffer Output Input > Forward > HTTP > File tail > dstat > ... > Forward > File > MongoDB > ... > File > Memory 31 Pluggable architecture Engine Output > rewrite > ... Pluggable Pluggable Sunday, July 14, 13
  • 33. Nagios MongoDB Hadoop Alerting Amazon S3 Analysis Archiving MySQL Apache Frontend Access logs syslogd App logs System logs Backend Databases buffer / filter / routing 32 Sunday, July 14, 13
  • 34. td-agent § Open sourced distribution package of Fluentd • ETL part of Treasure Data • rpm, deb and homebrew § Including useful components • ruby, jemalloc, fluentd • 3rd party gems: td, mongo, webhdfs, etc... • td plugin is for Treasure Data § http://packages.treasure-data.com/ 33 Sunday, July 14, 13
  • 35. § Remaining complexity in both DWH and Hadoop § Challenges in scaling data volume and expanding cost § Plazma: Hadoop eco system and own projects 2) Data Store / Analytics 34 Sunday, July 14, 13
  • 36. Apache App App Other data sources td-agent RDBMS Treasure Data columnar data warehouse Query Processing Cluster Query API HIVE, PIG JDBC, REST User td-command BI apps 35 This! Sunday, July 14, 13
  • 37. AWS Component Dependencies (1) § RDS • Store user information, job status, etc... • Store metadata of our columnar database • Queue worker / Scheduler § EC2 • API servers (Ruby on Rails 3) • Hadoop clusters • Job workers • Using Chef to deploy 36 Sunday, July 14, 13
  • 38. AWS Component Dependencies (2) § ELB • Load balancing of API servers • Load balancing of td-agents § S3 • Columnar storage built on top of S3 • MessagePack columnar format • Realtime / Archive storage • Our Result feature supports S3 output. 37 No EBS, EMR, SQS and other products ! Sunday, July 14, 13
  • 39. Frontend Queue Worker Hadoop Fluentd Applications push metrics to Fluentd (via local Fluentd) Librato Metrics for realtime analysis Treasure Data for historical analysis Fluentd sums up data minutes (partial aggregation) Treasure Data Service Processing Flow 38 Hadoop Sunday, July 14, 13
  • 41. Structure of Columnar Storages Realtime Storage merge (every 1 hour) 2013-07-12 00:23:00 912ec80 2013-07-13 00:01:00 277a259 2013-07-14 00:02:00 d52c831 ... 23c82b0ba3405d4c15aa85d2190e 6d7b1482412ab14f0332b8aee119 8a7bc848b2791b8fd603c719e54f 0e3d402b17638477c9a7977e7dab ... SELECT ... Archive Storage Data import 40 Sunday, July 14, 13
  • 42. Query Language Query Execution Columnar Data Object Storage 41 Sunday, July 14, 13
  • 43. 1/4: Compile SQL into MapReduce SELECT COUNT(DISTINCT ip) FROM tbl; SQL Statement Hive SQL - to - MapReduce 42 +TD UDFs Sunday, July 14, 13
  • 44. 2/4: MapReduce is executed in parallel SELECT COUNT(DISTINCT ip) FROM tbl; 43 Sunday, July 14, 13
  • 45. 3/4: Columnar Data Access Read ONLY the Required Part of Data SELECT COUNT(DISTINCT ip) FROM tbl; 44 Sunday, July 14, 13
  • 47. Apply Schema {“user”:54, “name”:”test”, “value”:”120”, “host”:”local”} Schema user:int name:string value:int SELECT 54 (int) Raw data(JSON) “test” (string) 120 (int) host:int NULL 46 Sunday, July 14, 13
  • 48. Multi-Tenancy § All customers share the Hadoop clusters (Multi Data Centers) § Resource Sharing (Burst Cores), Rapid Improvement, Ease of Upgrade 47 datacenter A datacenter B datacenter C datacenter D Local FairScheduler Local FairScheduler Local FairScheduler Local FairScheduler Global Scheduler On-Demand Resouce Allocation Job Submission + Plan Change Sunday, July 14, 13
  • 49. Trial and error on Cloud § Rapid development • Change hardware • New architecture testing • Performance testing • Change software • Hadoop parameters • etc... § Use git and chef for these purposes • Easy to deploy and apply changes • git for change history 48 Sunday, July 14, 13
  • 50. § Services • CopperEgg • Librato Metrics • Logentries • NewRelic • PagerDuty • Desk.com • Olark • HipChat • Alerting Our Operation Stack: Full Use of SaaS 49 § Tools • Hosted Chef (Opscode) • Jenkins • including integration test 44 Sunday, July 14, 13
  • 54. 53 3) Connectivity § Need to visualize the query result § Use metrics / graph for interactive comparison § Result: Export result and use existence tools 45 Sunday, July 14, 13
  • 55. Apache App App Other data sources td-agent RDBMS Treasure Data columnar data warehouse Query Processing Cluster Query API HIVE, PIG JDBC, REST User td-command BI apps 54 This! Sunday, July 14, 13
  • 56. 55 Pull and Push approaches Query (Pull) Web App MySQL Treasure Data Columnar Storage Query Processing Cluster Query API REST API JDBC, ODBC Driver td-command BI apps S3 Result (Push) … Sunday, July 14, 13
  • 57. Support list 56 § Result • Treasure Data • MySQL • PostgreSQL • Google SpreadSheet • REST API • S3 • etc... § BI tool • Pentaho • Tableau • JasperSoft • Indicee • Dr. Sum • Metric Insight • etc... http://docs.treasure-data.com/categories/3rd-party-tools-overview http://docs.treasure-data.com/categories/result Sunday, July 14, 13
  • 58. § Treasure Data • Cloud based Big-data analytics platform • Provide Machete for Big data reporting § Big Data processing • Collect / Store / Analytics / Visualization § Consider trade-off • Cloud reinforces idea but not differentiator • What is the strong point? • Should focus own vision! Conclusion 57 Our focus! Sunday, July 14, 13
  • 59. Big Data for the Rest of Us www.treasure-data.com | @TreasureData Sunday, July 14, 13