Building a Sustainable Data Platform on AWS

SmartNews, Inc.
SmartNews, Inc.SmartNews, Inc.
••
Building a Sustainable
Data Platform on AWS
Takumi Sakamoto
2016.01.27
Takumi Sakamoto
@takus
šŸ˜ = ⚽ ✈ šŸ“·
http://bit.ly/1MCOyBX
JAWSDAYS 2015
Mentioned by @jeffbarr
https://twitter.com/jeffbarr/status/649575575787454464
http://www.slideshare.net/smartnews/smart-newss-journey-into-microservices
AWS Case Study
http://aws.amazon.com/solutions/case-studies/smartnews/
Data Platform at
SmartNews
What is SmartNews?
• News Discovery App
• Launched in 2012
• 15M+ Downloads in World Wide
https://www.smartnews.com/en/
Our Mission
the world's quality information?
the people who need it?
How?
Machine Learning
URLs Found
Structure Analysis
Semantics Analysis
Importance Estimation
Diversification
Internet
100,000+ /day
1000+ /day
Feedback
Deliver
Trending Stories
Data Platform Use Cases
• Product development
• track KPI such as DAU and MAU
• A/B test for new feature, on-boarding, etc...
• ad-hoc analysis
• Provide data to applications
• realtime re-ranking news articles
• CTR prediction of Ads system
• dashboard service for media partners
Data & Its Numbers
• User activities
• ~100 GBs per day (compressed)
• 60+ record types
• User demographics or configurations etc...
• 15M+ records
• Articles metadata
• 100K+ records per day
Sustainable
Data Platform?
Sustainable Data Platform
• Provide a reliable and scalable "Lambda Architecture"
• Minimize both operation & running cost
• Be open to uncertain future
Lambda Architecture
http://lambda-architecture.net/
Why Sustainable?
• Do a lot with a few engineers
• no one is a full-time maintainer
• avoid to waste too much time
• Empower brilliant engineers in SmartNews
• everything should be as self-serve as possible
• don't ask for permission, beg for forgiveness
System Design
Ī» Architecture at SmartNews
Input Batch Serving
Speed
Output
Design Principles
• Decoupled "Computation" and "Storage" layers
• multiple consumers can use the same data
• run consumers on Spot Instances
• prevent serious data lost with minimum effort
• Use the right tool for the job
• leverage AWS managed service as possible
• fill in the missing pieces by Presto & PipelineDB
An Example
Amazon EMR
AMI 3.x
Amazon S3
Amazon EMR
Hive
General
Users
Application
Engineer
I wanna
upgrade hive
Ad
Engineer
I wanna combine
news data with
ad data
Amazon EMR
AMI 4.x
Amazon EMR
Spark
We’re satisfied
with current
version
Data
Scientist
I wanna test my
algorithm with the
latest spark
Batch Layer
Run multiple EMR clusters for each usages
Kinesis
Stream
Spark
on EMR
AWS
Lambda
Data
Scientist
I wanna consume
streaming data by
Spark
Application
Engineer
I wanna add a
streaming monitor
by Lambda
Speed Layer
Consume the same data for each usages
• AWS managed services
• Replicated data into Multiple AZs
• High availability
Input Data
Collect Events by Fluentd
• Forwarder (running on each instances)
• store JSON events to S3
• forward events to aggregators
• collect metrics and post them to Datadog
• Aggregator
• input events into Kinesis & PipelineDB
• other reporting tasks (not mentioned today)
Forwards to S3
<source>
@type tail
format json
path /data/log/user_activity.log
pos_file /data/log/pos/user_activity.pos
tag smartnews.user_activity
time_key timestamp
</source>
<match smartnews.user_activity>
@type copy
<store>
@type relabel
@label @s3
</store>
<store>
@type forward
@label @forward
</store>
</match>
@include conf.d/s3.conf
@include conf.d/forward.conf
<label @s3>
<% node[:td_agent][:s3].each do |c| -%>
<match <%= c[:tag] %>>
@id s3.<%= c[:tag] %>
@type s3
...
path fluentd/<%= node[:env] %>/<%= node[:role] %>/<%= c[:tag] %>
time_slice_format dt=%Y-%m-%d/hh=%H
time_key timestamp
include_time_key
time_as_epoch
reduced_redundancy true
format json
utc
buffer_chunk_limit 2048m
</match>
<% end -%>
</label>
td-agent.conf conf.d/s3.conf
Capture DynamoDB Streams
<source>
type dynamodb_streams
stream_arn YOUR_DDB_STREAMS_ARN
pos_file /path/to/table.pos
fetch_interval 1
fetch_size 100
</source>
https://github.com/takus/fluent-plugin-dynamodb-streams
DynamoDB DynamoDB
Streams
Amazon S3
AWS
Lambda
Fluentd
Recommended Practices
• Make configuration simple as possible
• fluentd can cover everything, but shouldn't
• keep stateless
• Use v0.12 or later
• "Filter" : better performance
• "Label": eliminate 'output_tag' configuration
Monitor Fluentd Status
• Monitor traffic volume & retry count by Datadog
• Datadog's fluentd integration
• fluent-plugin-flowcounter
• fluent-plugin-dogstatsd
Archive to Amazon S3
• I have 2 recommended settings
• versioning
• enable to recover from human error
• lifecycle policy
• minify storage cost
Archives to IA or Gracier
xx days after the creation date
Keep previous versions xx days
Save you in the future!!
Batch Layer
Various ETL Tasks
• Extract
• dump MySQL records by Embulk
• make files on S3 readable to Hive
• Transform
• transform text files into columnar files (RCFile, ORC)
• generate features for machine learning
• aggregate records (by country, by channel)
• Load
• load aggregated metrics into Amazon Aurora
Hive
• Most popular project on Hadoop ecosystem
• famous for its lovely logo :)
• HiveQL and MapReduce
• convert SQL-like query into MR jobs
• Not adopt Tez engine yet
• Amazon EMR doesn't support now
• limited improvement to our queries
How to process JSON?
A. Transform into columnar table periodically
• required converting job
• better performance
B. Use JSON-SerDe for temporary analysis
• easy way for querying raw json text files
• required to "drop table" for change schema
• performance is not good
Transform Tables
-- Make S3 files readable by Hive
ALTER TABLE raw_activities ADD IF NOT EXISTS PARTITION
(dt='${DATE}', hh='${HOUR}');
-- Transform text files into columnar files (Flatten JSON)
INSERT OVERWRITE TABLE activities
PARTITION (dt='${DATE}', action)
SELECT
user_id, timestamp, os, country,
data,
action
FROM raw_activities
LATERAL VIEW json_tuple(
raw_activities.json,
'userId','timestamp','platform','country','action','data'
) a as user_id, timestamp, os, country, action, data
WHERE dt = '${DATE}'
CLUSTER BY os, country, action, user_id
;
JSON-SerDe
-- Define table with SERDE
CREATE TABLE json_table (
country string,
languages array<string>,
religions map<string,array<int>>
)
ROW FORMAT SERDE 'org.openx.data.jsonserde.JsonSerDe'
STORED AS TEXTFILE;
-- Result: 10
SELECT religions['catholic'][0] FROM json_table;
cf. hive-ruby-scripting
-- Define your ruby (JRuby) script
SET rb.script=
require 'json'
def parse (json)
j = JSON.load(json)
j['profile']['attribute1']
end
;
-- Use the script in HQL
SELECT rb_exec('&parse', json) FROM user;
https://github.com/gree/hive-ruby-scripting
Spark
http://www.slideshare.net/smartnews/aws-meetupapache-spark-on-emr
Self-Serve via AWS CLI
# Create EMR clusters that runs Hive & Spark & Ganglia
aws emr create-cluster 
--name "My Cluster" 
--release-label emr-4.2.0 
--applications Name=Hive Name=Spark Name=GANGLIA 
--ec2-attributes KeyName=myKey 
--instance-type c3.4xlarge 
--instance-count 4 
--use-default-roles
Minimize expenses
• Use Spot Instances as possible
• typically discount 50-90%
• select instance type with stable price
• C3 families spike often :(
• Dynamic cluster resizing
• x2 capacity during daily batch job
• 1/2 capacity during midnight
Handle Data Dependencies
Typical Anti-Pattern
5 * * * * app hive -f query_1.hql
15 * * * * app hive -f query_2.hql
30 * * * * app hive -f query_3.hql
0 * * * * app hive -f query_4.hql
1 * * * * app hive -f query_5.hql
Workflow Management
• Define dependencies
• task E is executed after finishing task C and task D
• Scheduling
• task A is kicked after 09:00 AM
• throttle concurrent running of the same task
• Monitoring
• notification in failure
• task C must finish before 01:00 PM (SLA)
cf. http://www.slideshare.net/taroleo/workflow-hacks-1-dots-tokyo
Airflow
• A workflow management systems
• define workflow by Python
• built in shiny UI & CLI
• pluggable architecture
http://nerds.airbnb.com/airflow/
Define Tasks
dag = DAG('tutorial', default_args=default_args)
t1 = BashOperator(
task_id='print_date',
bash_command='date',
dag=dag)
t2 = BashOperator(
task_id='sleep',
bash_command='sleep 5',
retries=3,
dag=dag)
t3 = BashOperator(
task_id='templated',
bash_command="""
{% for i in range(5) %}
echo "{{ ds }}"
echo "{{ macros.ds_add(ds, 7)}}"
echo "{{ params.my_param }}"
{% endfor %}
""",
params={'my_param': 'Parameter I passed in'},
dag=dag)
t2.set_upstream(t1)
t3.set_upstream(t1)
Task
Dependencies
Python code
DAG
Workflow as Code
Deploy codes automatically after merging into master
Visualize Dependencies
What is done or not?
Alerting to Slack
• SLA Violation
• task A should be done till 00:00 PM
• other team's task K has dependency into task A
• Output validation failure
• stop the following tasks if the output is doubtful
Retry from Web UI
Once clear histories, airflow scheduler back fill the histories
Retry from CLI
// Clear some histories from 2016-01-01
airflow clear etl_smartnews 
--task_regex user_ 
--downstream 
--start_date 2016-01-01
// Backfill uncompleted tasks
airflow backfill etl_smartnews 
--start_date 2016-01-01
Check Rendered Query
How Long Each Tasks?
Pluggable Architecture
• Built-in plugins
• operator: bash, hive, preto, mysql
• transfer: hive_to_mysql
• sensor: wait_hive_partition, wait_s3_file
• Written our own plugin
• mysql_partition
Examples
user_sensor = S3KeySensor(
task_id='wait_user',
bucket_name='smartnews',
bucket_key='user/dt={{ ds }}/dump.csv',
)
etl = HiveOperator(
task_id="task1",
hql="INSERT OVERWRITE INTO...."
)
etl.set_upstream(user_sensor)
import = HiveToMySqlTransfer(
task_id=name,
mysql_preoperator="DELETE FROM %s WHERE date = '{{ ds }}'" % table,
sql="SELECT country, count(*) FROM %s" % table,
mysql_table=table
)
import.set_upstream(etl)
Wait a S3 file creation
After the file is created,
Run ETL Query
After that,
Import into MySQL
Serving Layer
Provides batch views
in low-latency and ad-hoc way
Presto
• A distributed SQL query engine
• join multiple data sources (Hive + MySQL)
• support standard ANSI SQL
• designed to handle TBs or PBs scale data
cf. http://www.slideshare.net/frsyuki/presto-hadoop-conference-japan-2014
Presto Architecture
Amazon S3 Kinesis
Stream
Amazon
RDS
Amazon
Aurora
Presto
Worker
Presto
Worker
Presto
Worker
Presto
Worker
Presto
Worker
Presto
Worker
Presto
Coordinator
Client
1. Query with Standard SQL
4. Scan data concurrently
5. Aggregate data without disk I/O
6. Return result to client
2. Generate execution plan
3. Dispatch tasks into multiple workers
Amazon EMR
(Hive Metastore)
Provides Hive table metadata
(S3 access only)
※ https://github.com/qubole/presto-kinesis
※
Why Presto?
• Join multiple data sources
• skip large parts of ETL process
• enable to merge Hive/MySQL/Kinesis/PipelineDB
• Low latency
• ~30s to scan billions records in S3
• Low maintenance cost
• stateless, and easy to integrate with Auto Scaling
Use case: A/B Test
-- Suppose that this table exists
DESC hive.default.user_activities;
user_id bigint
action varchar
abtest array<map<varchar, bigint>>
url varchar
-- Summarize page view per A/B Test identifier
-- for comparing two algorithms v1 & v2
SELECT
Ā  dt,
Ā  t['behaviorId'],
Ā  count(*) as pv
FROM hive.default.user_activities CROSS JOIN UNNEST(abtest) AS t (t)
WHERE dt like '2016-01-%' AND action = 'viewArticle'
AND t['definitionId'] = 163
GROUP BY dt, t['behaviorId'] ORDER BY dt
;
2015-12-01 | algorithm_v1 | 40000
2015-12-01 | algorithm_v2 | 62000
Use case: Troubleshoot
-- Store access logs to S3, and query to them
-- Summarize access & 95pct response time by SQL
SELECT
from_unixtime(timestamp),
count(*) as access,
approx_percentile(reqtime, 0.95) as pct95_reqtime
FROM hive.default.access_log
WHERE dt = '2015-11-04' AND hh = '13' AND role = 'xxx'
GROUP BY timestamp ORDER BY timestamp
;
2015-11-04 22:00:00.000 | 6377 | 0.522
2015-11-04 22:00:01.000 | 3580 | 0.422
Scheduled Auto Scaling
$ aws autoscaling describe-scheduled-actions
{
"ScheduledUpdateGroupActions": [
{
"DesiredCapacity": 2,
"AutoScalingGroupName": "presto-worker-prd",
"Recurrence": "59 14 * * *",
"ScheduledActionName": "scalein-2359-jst"
},
{
"DesiredCapacity": 20,
"AutoScalingGroupName": "presto-worker-prd",
"Recurrence": "45 0 * * 1-5",
"ScheduledActionName": "scaleout-0945-jst"
}
]
}
Presto Covers Everything? No!
• Fixed system on Amazon Aurora (or other RDB)
• provides KPI for products & business
• require high availability & low latency
• has no flexibility
• Ad-hoc system on Presto
• provides access to all dataset on data platform
• require high scalability
• has flexibility (join various data sources)
Why Fixed vs Ad-hoc?
• Difficulties on the Ad-hoc only solution
• difficult to prevent heavy queries
• large distinct count exhausts computing resources
• decrease presto maintainability
Output Data
Chartio
• Dashboard as A Service
• helps businesses analyze and track their critical data
• one of AWS partners (※)
• Combine multiple data sources at one dashboard
• Presto, MySQL, Redshift, BigQuery, Elasticsearch ...
• enable to join BigQuery + MySQL internally
• Easy to use for every one
• everyone can make their own dashboard
• write SQL directly / generate query by drag & drop
※ http://www.aws-partner-directory.com/PartnerDirectory/PartnerDetail?id=8959
Creating dashboard
1. Building query
(Drag&Drop / SQL)
2. Add step
(filter态sort态modify)
3. Select visualize way
(table态graph)
Examples
Why Chartio?
• Chartio saves a lot of engineering resources
• before
• maintain in-house dashboard written by rails
• everyone got tired to maintain it
• after
• everyone can build their own dashboard easily
• Chartio's UI is cool
• very important factor for dashboard tool
Missing Pieces of Chartio
• No programable API provides
• need to edit dashboard / chart manually
• No rollback feature
• all changes are recorded, but not rollback to the
previous state
• work around : clone => edit => rename
Speed Layer
Why Speed is Matter?
Today’s News is Wrapping
Tomorrow’s Fish and Chips
↑
Yesterday's News
http://www.personalchefapproach.com/tomorrows-fish-n-chips-wrapper/
How News Behaves?
https://gdsdata.blog.gov.uk/2013/10/22/the-half-life-of-news/
Use cases
• Re-rank news articles by user feedback
• track user's positive/negative signal
• consider gender, age, location, interests
• Realtime article monitoring
• detect high bounce rate (may be broken?)
• make realtime reporting dashboard for A/B test
Realtime Re-Ranking
ref. Stream 処理 (Spark Streaming + Kinesis) と Offline 処理 (Hive) の統合
www.slideshare.net/smartnews/stremspark-streaming-kinesisofflinehive
Amazon
CloudSearch
Search
API
API
Gateway
Kinesis
Stream
Amazon S3
Amazon EMR
Amazon S3 Amazon EMR
DynamoDB
Realtime
Feedback
Re-rank
Articles
Article
Metadata
User
Interests
User
Behaviors
Offline Procees
by Hive / Spark
Realtime Monitoring
API
Gateway
Stream
Continuous
View
Continuous
View
Continuous
View
Discard raw record soon after
consumed by Continuous View
Incrementally
updated in realtime
PipelineDB Chartio
AWS
Lambda
Slack
Access Continuous View
by PostgreSQL Client
Record
※1
※1
Use cron on 26 Feb. 2016
Migrate it soon after supporting VPC
PipelineDB
• OSS & enterprise streaming SQL database
• PostgreSQL compatible
• connect to Chartio šŸ˜
• join stream to normal PostgreSQL table
• Support probabilistic data structures
• e.g. HyperLogLog
https://www.pipelinedb.com/
http://developer.smartnews.com/blog/2015/09/09/20150907pipelinedb/
Continuous View
-- Calculate unique users seen per media each day
-- Using only a constant amount of space (HyperLogLog)
CREATE CONTINUOUS VIEW uniques AS
SELECT
day(arrival_timestamp),
substring(url from '.*://([^/]*)') as hostname,
COUNT(DISTINCT user_id::integer)
FROM activity_stream GROUP BY day,hostname;
-- How many impressions have we served in the last five minutes?
CREATE CONTINUOUS VIEW imps WITH (max_age = '5 minutes') AS
SELECT COUNT(*) FROM imps_stream;
-- What are the 90th, 95th, 99th percentiles of request latency?
CREATE CONTINUOUS VIEW latency AS
SELECT
percentile_cont(array[90, 95, 99])
WITHIN GROUP (ORDER BY latency::integer)
FROM latency_stream;
Summary
Sustainable Data Platform
• build a reliable and scalable lambda architecture
• minimize operation & running cost
• be open to uncertain future
My Wishlist to AWS
• Support Reduced Redundancy Storage (RRS) on EMR
• Faster EMR Launch
• Set TTL to DynamoDB records
• Auto-scale Kinesis Stream
• Launch Kinesis Analytics in Tokyo region
Thank you!!
1 of 81

Recommended

AWS Black Belt Online Seminar 2016 Amazon EMR by
AWS Black Belt Online Seminar 2016 Amazon EMRAWS Black Belt Online Seminar 2016 Amazon EMR
AWS Black Belt Online Seminar 2016 Amazon EMRAmazon Web Services Japan
13.3K views•84 slides
AWS Black Belt Online Seminar 2017 Amazon DynamoDB by
AWS Black Belt Online Seminar 2017 Amazon DynamoDB AWS Black Belt Online Seminar 2017 Amazon DynamoDB
AWS Black Belt Online Seminar 2017 Amazon DynamoDB Amazon Web Services Japan
146.4K views•147 slides
AWS Black Belt Tech ć‚·ćƒŖćƒ¼ć‚ŗ 2015 - AWS Data Pipeline by
AWS Black Belt Tech ć‚·ćƒŖćƒ¼ć‚ŗ 2015 - AWS Data PipelineAWS Black Belt Tech ć‚·ćƒŖćƒ¼ć‚ŗ 2015 - AWS Data Pipeline
AWS Black Belt Tech ć‚·ćƒŖćƒ¼ć‚ŗ 2015 - AWS Data PipelineAmazon Web Services Japan
24.7K views•53 slides
20210126 AWS Black Belt Online Seminar AWS CodeDeploy by
20210126 AWS Black Belt Online Seminar AWS CodeDeploy20210126 AWS Black Belt Online Seminar AWS CodeDeploy
20210126 AWS Black Belt Online Seminar AWS CodeDeployAmazon Web Services Japan
4.9K views•107 slides
ć‚Ŗćƒ³ćƒ—ćƒ¬ćƒŸć‚¹RDBMS悒AWSćøē§»č”Œć™ć‚‹ę‰‹ę³• by
ć‚Ŗćƒ³ćƒ—ćƒ¬ćƒŸć‚¹RDBMS悒AWSćøē§»č”Œć™ć‚‹ę‰‹ę³•ć‚Ŗćƒ³ćƒ—ćƒ¬ćƒŸć‚¹RDBMS悒AWSćøē§»č”Œć™ć‚‹ę‰‹ę³•
ć‚Ŗćƒ³ćƒ—ćƒ¬ćƒŸć‚¹RDBMS悒AWSćøē§»č”Œć™ć‚‹ę‰‹ę³•Amazon Web Services Japan
30.6K views•46 slides
AWS Amplify å…„é–€ by
AWS Amplify å…„é–€AWS Amplify å…„é–€
AWS Amplify å…„é–€Hideaki Aoyagi
2K views•24 slides

More Related Content

What's hot

AWS Black Belt Online Seminar 2018 Amazon DynamoDB Advanced Design Pattern by
AWS Black Belt Online Seminar 2018 Amazon DynamoDB Advanced Design PatternAWS Black Belt Online Seminar 2018 Amazon DynamoDB Advanced Design Pattern
AWS Black Belt Online Seminar 2018 Amazon DynamoDB Advanced Design PatternAmazon Web Services Japan
58.1K views•73 slides
JAWS-UG ęƒ…ć‚·ć‚¹ę”ÆéƒØć®ēš†ę§˜å‘ć‘ Amazon Elastic File System (Amazon EFS) by
JAWS-UG ęƒ…ć‚·ć‚¹ę”ÆéƒØć®ēš†ę§˜å‘ć‘ Amazon Elastic File System (Amazon EFS)JAWS-UG ęƒ…ć‚·ć‚¹ę”ÆéƒØć®ēš†ę§˜å‘ć‘ Amazon Elastic File System (Amazon EFS)
JAWS-UG ęƒ…ć‚·ć‚¹ę”ÆéƒØć®ēš†ę§˜å‘ć‘ Amazon Elastic File System (Amazon EFS)Amazon Web Services Japan
10K views•68 slides
칓칓오 ź“‘ź³  ķ”Œėž«ķ¼ MSA 적용 사딀 ė° API Gateway와 ģøģ¦ źµ¬ķ˜„ģ— ėŒ€ķ•œ ģ†Œź°œ by
칓칓오 ź“‘ź³  ķ”Œėž«ķ¼ MSA 적용 사딀 ė° API Gateway와 ģøģ¦ źµ¬ķ˜„ģ— ėŒ€ķ•œ ģ†Œź°œģ¹“ģ¹“ģ˜¤ ź“‘ź³  ķ”Œėž«ķ¼ MSA 적용 사딀 ė° API Gateway와 ģøģ¦ źµ¬ķ˜„ģ— ėŒ€ķ•œ ģ†Œź°œ
칓칓오 ź“‘ź³  ķ”Œėž«ķ¼ MSA 적용 사딀 ė° API Gateway와 ģøģ¦ źµ¬ķ˜„ģ— ėŒ€ķ•œ ģ†Œź°œif kakao
18.8K views•86 slides
20190522 AWS Black Belt Online Seminar AWS Step Functions by
20190522 AWS Black Belt Online Seminar AWS Step Functions20190522 AWS Black Belt Online Seminar AWS Step Functions
20190522 AWS Black Belt Online Seminar AWS Step FunctionsAmazon Web Services Japan
97.4K views•54 slides
20220409 AWS BLEA é–‹ē™ŗć«ć‚ćŸć£ć¦ę¤œčØŽć—ćŸć“ćØ by
20220409 AWS BLEA é–‹ē™ŗć«ć‚ćŸć£ć¦ę¤œčØŽć—ćŸć“ćØ20220409 AWS BLEA é–‹ē™ŗć«ć‚ćŸć£ć¦ę¤œčØŽć—ćŸć“ćØ
20220409 AWS BLEA é–‹ē™ŗć«ć‚ćŸć£ć¦ę¤œčØŽć—ćŸć“ćØAmazon Web Services Japan
3.7K views•28 slides
20190911 AWS Black Belt Online Seminar AWS Batch by
20190911 AWS Black Belt Online Seminar AWS Batch20190911 AWS Black Belt Online Seminar AWS Batch
20190911 AWS Black Belt Online Seminar AWS BatchAmazon Web Services Japan
23K views•67 slides

What's hot(20)

AWS Black Belt Online Seminar 2018 Amazon DynamoDB Advanced Design Pattern by Amazon Web Services Japan
AWS Black Belt Online Seminar 2018 Amazon DynamoDB Advanced Design PatternAWS Black Belt Online Seminar 2018 Amazon DynamoDB Advanced Design Pattern
AWS Black Belt Online Seminar 2018 Amazon DynamoDB Advanced Design Pattern
Amazon Web Services Japan•58.1K views
JAWS-UG ęƒ…ć‚·ć‚¹ę”ÆéƒØć®ēš†ę§˜å‘ć‘ Amazon Elastic File System (Amazon EFS) by Amazon Web Services Japan
JAWS-UG ęƒ…ć‚·ć‚¹ę”ÆéƒØć®ēš†ę§˜å‘ć‘ Amazon Elastic File System (Amazon EFS)JAWS-UG ęƒ…ć‚·ć‚¹ę”ÆéƒØć®ēš†ę§˜å‘ć‘ Amazon Elastic File System (Amazon EFS)
JAWS-UG ęƒ…ć‚·ć‚¹ę”ÆéƒØć®ēš†ę§˜å‘ć‘ Amazon Elastic File System (Amazon EFS)
칓칓오 ź“‘ź³  ķ”Œėž«ķ¼ MSA 적용 사딀 ė° API Gateway와 ģøģ¦ źµ¬ķ˜„ģ— ėŒ€ķ•œ ģ†Œź°œ by if kakao
칓칓오 ź“‘ź³  ķ”Œėž«ķ¼ MSA 적용 사딀 ė° API Gateway와 ģøģ¦ źµ¬ķ˜„ģ— ėŒ€ķ•œ ģ†Œź°œģ¹“ģ¹“ģ˜¤ ź“‘ź³  ķ”Œėž«ķ¼ MSA 적용 사딀 ė° API Gateway와 ģøģ¦ źµ¬ķ˜„ģ— ėŒ€ķ•œ ģ†Œź°œ
칓칓오 ź“‘ź³  ķ”Œėž«ķ¼ MSA 적용 사딀 ė° API Gateway와 ģøģ¦ źµ¬ķ˜„ģ— ėŒ€ķ•œ ģ†Œź°œ
if kakao•18.8K views
20190522 AWS Black Belt Online Seminar AWS Step Functions by Amazon Web Services Japan
20190522 AWS Black Belt Online Seminar AWS Step Functions20190522 AWS Black Belt Online Seminar AWS Step Functions
20190522 AWS Black Belt Online Seminar AWS Step Functions
Amazon Web Services Japan•97.4K views
20220409 AWS BLEA é–‹ē™ŗć«ć‚ćŸć£ć¦ę¤œčØŽć—ćŸć“ćØ by Amazon Web Services Japan
20220409 AWS BLEA é–‹ē™ŗć«ć‚ćŸć£ć¦ę¤œčØŽć—ćŸć“ćØ20220409 AWS BLEA é–‹ē™ŗć«ć‚ćŸć£ć¦ę¤œčØŽć—ćŸć“ćØ
20220409 AWS BLEA é–‹ē™ŗć«ć‚ćŸć£ć¦ę¤œčØŽć—ćŸć“ćØ
Spring Boot 恮 Web ć‚¢ćƒ•ć‚šćƒŖć‚±ćƒ¼ć‚·ćƒ§ćƒ³ć‚’ Docker に載せて AWS ECS で動かしている話 by JustSystems Corporation
Spring Boot 恮 Web ć‚¢ćƒ•ć‚šćƒŖć‚±ćƒ¼ć‚·ćƒ§ćƒ³ć‚’ Docker に載せて AWS ECS で動かしている話Spring Boot 恮 Web ć‚¢ćƒ•ć‚šćƒŖć‚±ćƒ¼ć‚·ćƒ§ćƒ³ć‚’ Docker に載せて AWS ECS で動かしている話
Spring Boot 恮 Web ć‚¢ćƒ•ć‚šćƒŖć‚±ćƒ¼ć‚·ćƒ§ćƒ³ć‚’ Docker に載せて AWS ECS で動かしている話
JustSystems Corporation•16.5K views
20210127 AWS Black Belt Online Seminar Amazon Redshift 運用箔理 by Amazon Web Services Japan
20210127 AWS Black Belt Online Seminar Amazon Redshift 運用箔理20210127 AWS Black Belt Online Seminar Amazon Redshift 運用箔理
20210127 AWS Black Belt Online Seminar Amazon Redshift 運用箔理
Awsć‚’ć‚Ŗćƒ³ćƒ—ćƒ¬ćƒ‰ćƒ”ć‚³ćƒ³ć«é€£ęŗć•ć›ć‚‹ by Syuichi Murashima
Awsć‚’ć‚Ŗćƒ³ćƒ—ćƒ¬ćƒ‰ćƒ”ć‚³ćƒ³ć«é€£ęŗć•ć›ć‚‹Awsć‚’ć‚Ŗćƒ³ćƒ—ćƒ¬ćƒ‰ćƒ”ć‚³ćƒ³ć«é€£ęŗć•ć›ć‚‹
Awsć‚’ć‚Ŗćƒ³ćƒ—ćƒ¬ćƒ‰ćƒ”ć‚³ćƒ³ć«é€£ęŗć•ć›ć‚‹
Syuichi Murashima•5.2K views
Amazon S3ć‚’äø­åæƒćØć™ć‚‹ćƒ‡ćƒ¼ć‚æåˆ†ęžć®ćƒ™ć‚¹ćƒˆćƒ—ćƒ©ć‚Æćƒ†ć‚£ć‚¹ by Amazon Web Services Japan
Amazon S3ć‚’äø­åæƒćØć™ć‚‹ćƒ‡ćƒ¼ć‚æåˆ†ęžć®ćƒ™ć‚¹ćƒˆćƒ—ćƒ©ć‚Æćƒ†ć‚£ć‚¹Amazon S3ć‚’äø­åæƒćØć™ć‚‹ćƒ‡ćƒ¼ć‚æåˆ†ęžć®ćƒ™ć‚¹ćƒˆćƒ—ćƒ©ć‚Æćƒ†ć‚£ć‚¹
Amazon S3ć‚’äø­åæƒćØć™ć‚‹ćƒ‡ćƒ¼ć‚æåˆ†ęžć®ćƒ™ć‚¹ćƒˆćƒ—ćƒ©ć‚Æćƒ†ć‚£ć‚¹
Amazon Web Services Japan•15.2K views
20200212 AWS Black Belt Online Seminar AWS Systems Manager by Amazon Web Services Japan
20200212 AWS Black Belt Online Seminar AWS Systems Manager20200212 AWS Black Belt Online Seminar AWS Systems Manager
20200212 AWS Black Belt Online Seminar AWS Systems Manager
Amazon Web Services Japan•26.2K views
20190424 AWS Black Belt Online Seminar Amazon Aurora MySQL by Amazon Web Services Japan
20190424 AWS Black Belt Online Seminar Amazon Aurora MySQL20190424 AWS Black Belt Online Seminar Amazon Aurora MySQL
20190424 AWS Black Belt Online Seminar Amazon Aurora MySQL
Amazon Web Services Japan•29.7K views
AWS Black Belt Techć‚·ćƒŖćƒ¼ć‚ŗ Amazon Kinesis by Amazon Web Services Japan
AWS Black Belt Techć‚·ćƒŖćƒ¼ć‚ŗ  Amazon KinesisAWS Black Belt Techć‚·ćƒŖćƒ¼ć‚ŗ  Amazon Kinesis
AWS Black Belt Techć‚·ćƒŖćƒ¼ć‚ŗ Amazon Kinesis
Amazon Web Services Japan•19.7K views
AWS Blackbelt 2015ć‚·ćƒŖćƒ¼ć‚ŗ Amazon CloudWatch & Amazon CloudWatch Logs by Amazon Web Services Japan
AWS Blackbelt 2015ć‚·ćƒŖćƒ¼ć‚ŗ Amazon CloudWatch & Amazon CloudWatch LogsAWS Blackbelt 2015ć‚·ćƒŖćƒ¼ć‚ŗ Amazon CloudWatch & Amazon CloudWatch Logs
AWS Blackbelt 2015ć‚·ćƒŖćƒ¼ć‚ŗ Amazon CloudWatch & Amazon CloudWatch Logs
Amazon Web Services Japan•128.5K views
20190828 AWS Black Belt Online Seminar Amazon Aurora with PostgreSQL Compatib... by Amazon Web Services Japan
20190828 AWS Black Belt Online Seminar Amazon Aurora with PostgreSQL Compatib...20190828 AWS Black Belt Online Seminar Amazon Aurora with PostgreSQL Compatib...
20190828 AWS Black Belt Online Seminar Amazon Aurora with PostgreSQL Compatib...
Amazon Web Services Japan•17.2K views
AWS ķ™œģš©ķ•œ Data Lake źµ¬ģ„±ķ•˜źø° by Nak Joo Kwon
AWS ķ™œģš©ķ•œ Data Lake źµ¬ģ„±ķ•˜źø°AWS ķ™œģš©ķ•œ Data Lake źµ¬ģ„±ķ•˜źø°
AWS ķ™œģš©ķ•œ Data Lake źµ¬ģ„±ķ•˜źø°
Nak Joo Kwon•1.6K views

Viewers also liked

20170725 black belt_monitoring_on_aws by
20170725 black belt_monitoring_on_aws20170725 black belt_monitoring_on_aws
20170725 black belt_monitoring_on_awsAmazon Web Services Japan
10.3K views•56 slides
[ć‚ˆćć‚ć‹ć‚‹ć‚Æćƒ©ć‚¦ćƒ‰ćƒ‡ćƒ¼ć‚æćƒ™ćƒ¼ć‚¹] ćƒŖć‚Æćƒ«ćƒ¼ćƒˆć«ćŠć‘ć‚‹Redshiftå°Žå…„ćƒ»ę“»ē”Øäŗ‹ä¾‹ by
[ć‚ˆćć‚ć‹ć‚‹ć‚Æćƒ©ć‚¦ćƒ‰ćƒ‡ćƒ¼ć‚æćƒ™ćƒ¼ć‚¹] ćƒŖć‚Æćƒ«ćƒ¼ćƒˆć«ćŠć‘ć‚‹Redshiftå°Žå…„ćƒ»ę“»ē”Øäŗ‹ä¾‹[ć‚ˆćć‚ć‹ć‚‹ć‚Æćƒ©ć‚¦ćƒ‰ćƒ‡ćƒ¼ć‚æćƒ™ćƒ¼ć‚¹] ćƒŖć‚Æćƒ«ćƒ¼ćƒˆć«ćŠć‘ć‚‹Redshiftå°Žå…„ćƒ»ę“»ē”Øäŗ‹ä¾‹
[ć‚ˆćć‚ć‹ć‚‹ć‚Æćƒ©ć‚¦ćƒ‰ćƒ‡ćƒ¼ć‚æćƒ™ćƒ¼ć‚¹] ćƒŖć‚Æćƒ«ćƒ¼ćƒˆć«ćŠć‘ć‚‹Redshiftå°Žå…„ćƒ»ę“»ē”Øäŗ‹ä¾‹Amazon Web Services Japan
18.5K views•39 slides
(BDT314) A Big Data & Analytics App on Amazon EMR & Amazon Redshift by
(BDT314) A Big Data & Analytics App on Amazon EMR & Amazon Redshift(BDT314) A Big Data & Analytics App on Amazon EMR & Amazon Redshift
(BDT314) A Big Data & Analytics App on Amazon EMR & Amazon RedshiftAmazon Web Services
8K views•52 slides
20170726 black belt_stepfunctions by
20170726 black belt_stepfunctions20170726 black belt_stepfunctions
20170726 black belt_stepfunctionsAmazon Web Services Japan
28.3K views•44 slides
Amazon Redshiftć«ć‚ˆć‚‹ćƒŖć‚¢ćƒ«ć‚æć‚¤ćƒ åˆ†ęžć‚µćƒ¼ćƒ“ć‚¹ć®ę§‹ēÆ‰ by
Amazon Redshiftć«ć‚ˆć‚‹ćƒŖć‚¢ćƒ«ć‚æć‚¤ćƒ åˆ†ęžć‚µćƒ¼ćƒ“ć‚¹ć®ę§‹ēÆ‰Amazon Redshiftć«ć‚ˆć‚‹ćƒŖć‚¢ćƒ«ć‚æć‚¤ćƒ åˆ†ęžć‚µćƒ¼ćƒ“ć‚¹ć®ę§‹ēÆ‰
Amazon Redshiftć«ć‚ˆć‚‹ćƒŖć‚¢ćƒ«ć‚æć‚¤ćƒ åˆ†ęžć‚µćƒ¼ćƒ“ć‚¹ć®ę§‹ēÆ‰Minero Aoki
33.5K views•47 slides
AWS Black Belt Online Seminar 2017 Amazon Connect by
AWS Black Belt Online Seminar 2017 Amazon ConnectAWS Black Belt Online Seminar 2017 Amazon Connect
AWS Black Belt Online Seminar 2017 Amazon ConnectAmazon Web Services Japan
15.8K views•55 slides

Viewers also liked(20)

[ć‚ˆćć‚ć‹ć‚‹ć‚Æćƒ©ć‚¦ćƒ‰ćƒ‡ćƒ¼ć‚æćƒ™ćƒ¼ć‚¹] ćƒŖć‚Æćƒ«ćƒ¼ćƒˆć«ćŠć‘ć‚‹Redshiftå°Žå…„ćƒ»ę“»ē”Øäŗ‹ä¾‹ by Amazon Web Services Japan
[ć‚ˆćć‚ć‹ć‚‹ć‚Æćƒ©ć‚¦ćƒ‰ćƒ‡ćƒ¼ć‚æćƒ™ćƒ¼ć‚¹] ćƒŖć‚Æćƒ«ćƒ¼ćƒˆć«ćŠć‘ć‚‹Redshiftå°Žå…„ćƒ»ę“»ē”Øäŗ‹ä¾‹[ć‚ˆćć‚ć‹ć‚‹ć‚Æćƒ©ć‚¦ćƒ‰ćƒ‡ćƒ¼ć‚æćƒ™ćƒ¼ć‚¹] ćƒŖć‚Æćƒ«ćƒ¼ćƒˆć«ćŠć‘ć‚‹Redshiftå°Žå…„ćƒ»ę“»ē”Øäŗ‹ä¾‹
[ć‚ˆćć‚ć‹ć‚‹ć‚Æćƒ©ć‚¦ćƒ‰ćƒ‡ćƒ¼ć‚æćƒ™ćƒ¼ć‚¹] ćƒŖć‚Æćƒ«ćƒ¼ćƒˆć«ćŠć‘ć‚‹Redshiftå°Žå…„ćƒ»ę“»ē”Øäŗ‹ä¾‹
Amazon Web Services Japan•18.5K views
(BDT314) A Big Data & Analytics App on Amazon EMR & Amazon Redshift by Amazon Web Services
(BDT314) A Big Data & Analytics App on Amazon EMR & Amazon Redshift(BDT314) A Big Data & Analytics App on Amazon EMR & Amazon Redshift
(BDT314) A Big Data & Analytics App on Amazon EMR & Amazon Redshift
Amazon Web Services•8K views
Amazon Redshiftć«ć‚ˆć‚‹ćƒŖć‚¢ćƒ«ć‚æć‚¤ćƒ åˆ†ęžć‚µćƒ¼ćƒ“ć‚¹ć®ę§‹ēÆ‰ by Minero Aoki
Amazon Redshiftć«ć‚ˆć‚‹ćƒŖć‚¢ćƒ«ć‚æć‚¤ćƒ åˆ†ęžć‚µćƒ¼ćƒ“ć‚¹ć®ę§‹ēÆ‰Amazon Redshiftć«ć‚ˆć‚‹ćƒŖć‚¢ćƒ«ć‚æć‚¤ćƒ åˆ†ęžć‚µćƒ¼ćƒ“ć‚¹ć®ę§‹ēÆ‰
Amazon Redshiftć«ć‚ˆć‚‹ćƒŖć‚¢ćƒ«ć‚æć‚¤ćƒ åˆ†ęžć‚µćƒ¼ćƒ“ć‚¹ć®ę§‹ēÆ‰
Minero Aoki•33.5K views
(BDT305) Amazon EMR Deep Dive and Best Practices by Amazon Web Services
(BDT305) Amazon EMR Deep Dive and Best Practices(BDT305) Amazon EMR Deep Dive and Best Practices
(BDT305) Amazon EMR Deep Dive and Best Practices
Amazon Web Services•16.3K views
AWS Black Belt Online Seminar 2017 AWS Summit Tokyo 2017 まとめ by Amazon Web Services Japan
AWS Black Belt Online Seminar 2017 AWS Summit Tokyo 2017 まとめAWS Black Belt Online Seminar 2017 AWS Summit Tokyo 2017 まとめ
AWS Black Belt Online Seminar 2017 AWS Summit Tokyo 2017 まとめ
Amazon Web Services Japan•12.5K views
AWS Black Belt Online Seminar 2017 AWSćøć®ćƒćƒƒćƒˆćƒÆćƒ¼ć‚ÆęŽ„ē¶šćØAWSäøŠć®ćƒćƒƒćƒˆćƒÆćƒ¼ć‚Æå†…éƒØčØ­čØˆ by Amazon Web Services Japan
AWS Black Belt Online Seminar 2017 AWSćøć®ćƒćƒƒćƒˆćƒÆćƒ¼ć‚ÆęŽ„ē¶šćØAWSäøŠć®ćƒćƒƒćƒˆćƒÆćƒ¼ć‚Æå†…éƒØčØ­čØˆAWS Black Belt Online Seminar 2017 AWSćøć®ćƒćƒƒćƒˆćƒÆćƒ¼ć‚ÆęŽ„ē¶šćØAWSäøŠć®ćƒćƒƒćƒˆćƒÆćƒ¼ć‚Æå†…éƒØčØ­čØˆ
AWS Black Belt Online Seminar 2017 AWSćøć®ćƒćƒƒćƒˆćƒÆćƒ¼ć‚ÆęŽ„ē¶šćØAWSäøŠć®ćƒćƒƒćƒˆćƒÆćƒ¼ć‚Æå†…éƒØčØ­čØˆ
Amazon Web Services Japan•24.7K views
AWS Black Belt Online Seminar 2017 Amazon Pinpoint ć§å§‹ć‚ć‚‹ćƒ¢ćƒć‚¤ćƒ«ć‚¢ćƒ—ćƒŖć®ć‚°ćƒ­ćƒ¼ć‚¹ćƒćƒƒć‚Æ by Amazon Web Services Japan
AWS Black Belt Online Seminar 2017 Amazon Pinpoint ć§å§‹ć‚ć‚‹ćƒ¢ćƒć‚¤ćƒ«ć‚¢ćƒ—ćƒŖć®ć‚°ćƒ­ćƒ¼ć‚¹ćƒćƒƒć‚ÆAWS Black Belt Online Seminar 2017 Amazon Pinpoint ć§å§‹ć‚ć‚‹ćƒ¢ćƒć‚¤ćƒ«ć‚¢ćƒ—ćƒŖć®ć‚°ćƒ­ćƒ¼ć‚¹ćƒćƒƒć‚Æ
AWS Black Belt Online Seminar 2017 Amazon Pinpoint ć§å§‹ć‚ć‚‹ćƒ¢ćƒć‚¤ćƒ«ć‚¢ćƒ—ćƒŖć®ć‚°ćƒ­ćƒ¼ć‚¹ćƒćƒƒć‚Æ
Amazon Web Services Japan•39.6K views
Amazon Athena åˆåæƒč€…å‘ć‘ćƒćƒ³ć‚ŗć‚Ŗćƒ³ by Amazon Web Services Japan
Amazon Athena åˆåæƒč€…å‘ć‘ćƒćƒ³ć‚ŗć‚Ŗćƒ³Amazon Athena åˆåæƒč€…å‘ć‘ćƒćƒ³ć‚ŗć‚Ŗćƒ³
Amazon Athena åˆåæƒč€…å‘ć‘ćƒćƒ³ć‚ŗć‚Ŗćƒ³
Amazon Web Services Japan•25.7K views

Similar to Building a Sustainable Data Platform on AWS

Sf big analytics_2018_04_18: Evolution of the GoPro's data platform by
Sf big analytics_2018_04_18: Evolution of the GoPro's data platformSf big analytics_2018_04_18: Evolution of the GoPro's data platform
Sf big analytics_2018_04_18: Evolution of the GoPro's data platformChester Chen
436 views•62 slides
Automating Workflows for Analytics Pipelines by
Automating Workflows for Analytics PipelinesAutomating Workflows for Analytics Pipelines
Automating Workflows for Analytics PipelinesSadayuki Furuhashi
1.9K views•28 slides
Automate Your Big Data Workflows (SVC201) | AWS re:Invent 2013 by
Automate Your Big Data Workflows (SVC201) | AWS re:Invent 2013Automate Your Big Data Workflows (SVC201) | AWS re:Invent 2013
Automate Your Big Data Workflows (SVC201) | AWS re:Invent 2013Amazon Web Services
5.7K views•124 slides
XStream: stream processing platform at facebook by
XStream:  stream processing platform at facebookXStream:  stream processing platform at facebook
XStream: stream processing platform at facebookAniket Mokashi
454 views•22 slides
Migrating on premises workload to azure sql database by
Migrating on premises workload to azure sql databaseMigrating on premises workload to azure sql database
Migrating on premises workload to azure sql databasePARIKSHIT SAVJANI
649 views•25 slides
Running Airflow Workflows as ETL Processes on Hadoop by
Running Airflow Workflows as ETL Processes on HadoopRunning Airflow Workflows as ETL Processes on Hadoop
Running Airflow Workflows as ETL Processes on Hadoopclairvoyantllc
2K views•24 slides

Similar to Building a Sustainable Data Platform on AWS(20)

Sf big analytics_2018_04_18: Evolution of the GoPro's data platform by Chester Chen
Sf big analytics_2018_04_18: Evolution of the GoPro's data platformSf big analytics_2018_04_18: Evolution of the GoPro's data platform
Sf big analytics_2018_04_18: Evolution of the GoPro's data platform
Chester Chen•436 views
Automating Workflows for Analytics Pipelines by Sadayuki Furuhashi
Automating Workflows for Analytics PipelinesAutomating Workflows for Analytics Pipelines
Automating Workflows for Analytics Pipelines
Sadayuki Furuhashi•1.9K views
Automate Your Big Data Workflows (SVC201) | AWS re:Invent 2013 by Amazon Web Services
Automate Your Big Data Workflows (SVC201) | AWS re:Invent 2013Automate Your Big Data Workflows (SVC201) | AWS re:Invent 2013
Automate Your Big Data Workflows (SVC201) | AWS re:Invent 2013
Amazon Web Services•5.7K views
XStream: stream processing platform at facebook by Aniket Mokashi
XStream:  stream processing platform at facebookXStream:  stream processing platform at facebook
XStream: stream processing platform at facebook
Aniket Mokashi•454 views
Migrating on premises workload to azure sql database by PARIKSHIT SAVJANI
Migrating on premises workload to azure sql databaseMigrating on premises workload to azure sql database
Migrating on premises workload to azure sql database
PARIKSHIT SAVJANI•649 views
Running Airflow Workflows as ETL Processes on Hadoop by clairvoyantllc
Running Airflow Workflows as ETL Processes on HadoopRunning Airflow Workflows as ETL Processes on Hadoop
Running Airflow Workflows as ETL Processes on Hadoop
clairvoyantllc•2K views
AWS Step Functionsģ„ ķ™œģš©ķ•œ ģ„œė²„ė¦¬ģŠ¤ 앱 ģ˜¤ģ¼€ģŠ¤ķŠøė ˆģ“ģ…˜ by Amazon Web Services Korea
AWS Step Functionsģ„ ķ™œģš©ķ•œ ģ„œė²„ė¦¬ģŠ¤ 앱 ģ˜¤ģ¼€ģŠ¤ķŠøė ˆģ“ģ…˜AWS Step Functionsģ„ ķ™œģš©ķ•œ ģ„œė²„ė¦¬ģŠ¤ 앱 ģ˜¤ģ¼€ģŠ¤ķŠøė ˆģ“ģ…˜
AWS Step Functionsģ„ ķ™œģš©ķ•œ ģ„œė²„ė¦¬ģŠ¤ 앱 ģ˜¤ģ¼€ģŠ¤ķŠøė ˆģ“ģ…˜
Evolution of a cloud start up: From C# to Node.js by Steve Jamieson
Evolution of a cloud start up: From C# to Node.jsEvolution of a cloud start up: From C# to Node.js
Evolution of a cloud start up: From C# to Node.js
Steve Jamieson•1.1K views
Apache Samza 1.0 - What's New, What's Next by Prateek Maheshwari
Apache Samza 1.0 - What's New, What's NextApache Samza 1.0 - What's New, What's Next
Apache Samza 1.0 - What's New, What's Next
Prateek Maheshwari•292 views
MongoDB World 2018: Ch-Ch-Ch-Ch-Changes: Taking Your Stitch Application to th... by MongoDB
MongoDB World 2018: Ch-Ch-Ch-Ch-Changes: Taking Your Stitch Application to th...MongoDB World 2018: Ch-Ch-Ch-Ch-Changes: Taking Your Stitch Application to th...
MongoDB World 2018: Ch-Ch-Ch-Ch-Changes: Taking Your Stitch Application to th...
MongoDB•339 views
React state management with Redux and MobX by Darko Kukovec
React state management with Redux and MobXReact state management with Redux and MobX
React state management with Redux and MobX
Darko Kukovec•2K views
(MBL305) You Have Data from the Devices, Now What?: Getting the Value of the IoT by Amazon Web Services
(MBL305) You Have Data from the Devices, Now What?: Getting the Value of the IoT(MBL305) You Have Data from the Devices, Now What?: Getting the Value of the IoT
(MBL305) You Have Data from the Devices, Now What?: Getting the Value of the IoT
Amazon Web Services•14.7K views
Orchestrating complex workflows with aws step functions by Chris Shenton
Orchestrating complex workflows with aws step functionsOrchestrating complex workflows with aws step functions
Orchestrating complex workflows with aws step functions
Chris Shenton•82 views
In-memory ColumnStore Index by SolidQ
In-memory ColumnStore IndexIn-memory ColumnStore Index
In-memory ColumnStore Index
SolidQ•380 views
Using AWS Batch and AWS Step Functions to Design and Run High-Throughput Work... by Amazon Web Services
Using AWS Batch and AWS Step Functions to Design and Run High-Throughput Work...Using AWS Batch and AWS Step Functions to Design and Run High-Throughput Work...
Using AWS Batch and AWS Step Functions to Design and Run High-Throughput Work...
Amazon Web Services•6.4K views
Real time analytics on deep learning @ strata data 2019 by Zhenxiao Luo
Real time analytics on deep learning @ strata data 2019Real time analytics on deep learning @ strata data 2019
Real time analytics on deep learning @ strata data 2019
Zhenxiao Luo•1K views
20211028 ADDO Adapting to Covid with Serverless Craeg Strong Ariel Partners by Craeg Strong
20211028 ADDO Adapting to Covid with Serverless Craeg Strong Ariel Partners20211028 ADDO Adapting to Covid with Serverless Craeg Strong Ariel Partners
20211028 ADDO Adapting to Covid with Serverless Craeg Strong Ariel Partners
Craeg Strong•149 views
Analytics Metrics delivery and ML Feature visualization: Evolution of Data Pl... by Chester Chen
Analytics Metrics delivery and ML Feature visualization: Evolution of Data Pl...Analytics Metrics delivery and ML Feature visualization: Evolution of Data Pl...
Analytics Metrics delivery and ML Feature visualization: Evolution of Data Pl...
Chester Chen•339 views
MongoDB.local Austin 2018: Ch-Ch-Ch-Ch-Changes: Taking Your MongoDB Stitch A... by MongoDB
MongoDB.local Austin 2018:  Ch-Ch-Ch-Ch-Changes: Taking Your MongoDB Stitch A...MongoDB.local Austin 2018:  Ch-Ch-Ch-Ch-Changes: Taking Your MongoDB Stitch A...
MongoDB.local Austin 2018: Ch-Ch-Ch-Ch-Changes: Taking Your MongoDB Stitch A...
MongoDB•113 views

More from SmartNews, Inc.

SmartNewsć‚’ę”Æćˆć‚‹ćƒ‡ćƒ¼ć‚æćƒ‘ć‚¤ćƒ—ćƒ©ć‚¤ćƒ³ćØćć®é‹ē”Ø by
SmartNewsć‚’ę”Æćˆć‚‹ćƒ‡ćƒ¼ć‚æćƒ‘ć‚¤ćƒ—ćƒ©ć‚¤ćƒ³ćØćć®é‹ē”ØSmartNewsć‚’ę”Æćˆć‚‹ćƒ‡ćƒ¼ć‚æćƒ‘ć‚¤ćƒ—ćƒ©ć‚¤ćƒ³ćØćć®é‹ē”Ø
SmartNewsć‚’ę”Æćˆć‚‹ćƒ‡ćƒ¼ć‚æćƒ‘ć‚¤ćƒ—ćƒ©ć‚¤ćƒ³ćØćć®é‹ē”ØSmartNews, Inc.
3.7K views•10 slides
Spring ć§å®Ÿē¾ć™ć‚‹ SmartNews ć®ćƒ‹ćƒ„ćƒ¼ć‚¹é…äæ”åŸŗē›¤ by
Spring ć§å®Ÿē¾ć™ć‚‹ SmartNews ć®ćƒ‹ćƒ„ćƒ¼ć‚¹é…äæ”åŸŗē›¤Spring ć§å®Ÿē¾ć™ć‚‹ SmartNews ć®ćƒ‹ćƒ„ćƒ¼ć‚¹é…äæ”åŸŗē›¤
Spring ć§å®Ÿē¾ć™ć‚‹ SmartNews ć®ćƒ‹ćƒ„ćƒ¼ć‚¹é…äæ”åŸŗē›¤SmartNews, Inc.
3.1K views•80 slides
ć‚Øćƒ³ć‚·ć‚™ćƒ‹ć‚¢ć‹ć‚‰ćƒ—ćƒ­ćƒ€ć‚Æćƒˆćƒžćƒćƒ¼ć‚øćƒ£ćƒ¼ćø by
ć‚Øćƒ³ć‚·ć‚™ćƒ‹ć‚¢ć‹ć‚‰ćƒ—ćƒ­ćƒ€ć‚Æćƒˆćƒžćƒćƒ¼ć‚øćƒ£ćƒ¼ćøć‚Øćƒ³ć‚·ć‚™ćƒ‹ć‚¢ć‹ć‚‰ćƒ—ćƒ­ćƒ€ć‚Æćƒˆćƒžćƒćƒ¼ć‚øćƒ£ćƒ¼ćø
ć‚Øćƒ³ć‚·ć‚™ćƒ‹ć‚¢ć‹ć‚‰ćƒ—ćƒ­ćƒ€ć‚Æćƒˆćƒžćƒćƒ¼ć‚øćƒ£ćƒ¼ćøSmartNews, Inc.
9.7K views•17 slides
SpringOne Platform 2016 å ±å‘Šä¼šć€ŒA Lite Rx API for the JVM怍/ äŗ•å£ č² @ SmartNews, Inc. by
SpringOne Platform 2016 å ±å‘Šä¼šć€ŒA Lite Rx API for the JVM怍/ äŗ•å£ č² @ SmartNews, Inc.SpringOne Platform 2016 å ±å‘Šä¼šć€ŒA Lite Rx API for the JVM怍/ äŗ•å£ č² @ SmartNews, Inc.
SpringOne Platform 2016 å ±å‘Šä¼šć€ŒA Lite Rx API for the JVM怍/ äŗ•å£ č² @ SmartNews, Inc.SmartNews, Inc.
1.2K views•71 slides
SmartNewsć®ćƒ‹ćƒ„ćƒ¼ć‚¹é…äæ”ć‚’ę”Æćˆć‚‹ć‚µćƒ¼ćƒęŠ€č”“ / Kazhiro Sera @ SmartNews,Inc. #jjug_ccc by
SmartNewsć®ćƒ‹ćƒ„ćƒ¼ć‚¹é…äæ”ć‚’ę”Æćˆć‚‹ć‚µćƒ¼ćƒęŠ€č”“ / Kazhiro Sera @ SmartNews,Inc. #jjug_cccSmartNewsć®ćƒ‹ćƒ„ćƒ¼ć‚¹é…äæ”ć‚’ę”Æćˆć‚‹ć‚µćƒ¼ćƒęŠ€č”“ / Kazhiro Sera @ SmartNews,Inc. #jjug_ccc
SmartNewsć®ćƒ‹ćƒ„ćƒ¼ć‚¹é…äæ”ć‚’ę”Æćˆć‚‹ć‚µćƒ¼ćƒęŠ€č”“ / Kazhiro Sera @ SmartNews,Inc. #jjug_cccSmartNews, Inc.
10.6K views•53 slides
Stream Processing in SmartNews #jawsdays by
Stream Processing in SmartNews #jawsdaysStream Processing in SmartNews #jawsdays
Stream Processing in SmartNews #jawsdaysSmartNews, Inc.
22K views•45 slides

More from SmartNews, Inc.(19)

SmartNewsć‚’ę”Æćˆć‚‹ćƒ‡ćƒ¼ć‚æćƒ‘ć‚¤ćƒ—ćƒ©ć‚¤ćƒ³ćØćć®é‹ē”Ø by SmartNews, Inc.
SmartNewsć‚’ę”Æćˆć‚‹ćƒ‡ćƒ¼ć‚æćƒ‘ć‚¤ćƒ—ćƒ©ć‚¤ćƒ³ćØćć®é‹ē”ØSmartNewsć‚’ę”Æćˆć‚‹ćƒ‡ćƒ¼ć‚æćƒ‘ć‚¤ćƒ—ćƒ©ć‚¤ćƒ³ćØćć®é‹ē”Ø
SmartNewsć‚’ę”Æćˆć‚‹ćƒ‡ćƒ¼ć‚æćƒ‘ć‚¤ćƒ—ćƒ©ć‚¤ćƒ³ćØćć®é‹ē”Ø
SmartNews, Inc.•3.7K views
Spring ć§å®Ÿē¾ć™ć‚‹ SmartNews ć®ćƒ‹ćƒ„ćƒ¼ć‚¹é…äæ”åŸŗē›¤ by SmartNews, Inc.
Spring ć§å®Ÿē¾ć™ć‚‹ SmartNews ć®ćƒ‹ćƒ„ćƒ¼ć‚¹é…äæ”åŸŗē›¤Spring ć§å®Ÿē¾ć™ć‚‹ SmartNews ć®ćƒ‹ćƒ„ćƒ¼ć‚¹é…äæ”åŸŗē›¤
Spring ć§å®Ÿē¾ć™ć‚‹ SmartNews ć®ćƒ‹ćƒ„ćƒ¼ć‚¹é…äæ”åŸŗē›¤
SmartNews, Inc.•3.1K views
ć‚Øćƒ³ć‚·ć‚™ćƒ‹ć‚¢ć‹ć‚‰ćƒ—ćƒ­ćƒ€ć‚Æćƒˆćƒžćƒćƒ¼ć‚øćƒ£ćƒ¼ćø by SmartNews, Inc.
ć‚Øćƒ³ć‚·ć‚™ćƒ‹ć‚¢ć‹ć‚‰ćƒ—ćƒ­ćƒ€ć‚Æćƒˆćƒžćƒćƒ¼ć‚øćƒ£ćƒ¼ćøć‚Øćƒ³ć‚·ć‚™ćƒ‹ć‚¢ć‹ć‚‰ćƒ—ćƒ­ćƒ€ć‚Æćƒˆćƒžćƒćƒ¼ć‚øćƒ£ćƒ¼ćø
ć‚Øćƒ³ć‚·ć‚™ćƒ‹ć‚¢ć‹ć‚‰ćƒ—ćƒ­ćƒ€ć‚Æćƒˆćƒžćƒćƒ¼ć‚øćƒ£ćƒ¼ćø
SmartNews, Inc.•9.7K views
SpringOne Platform 2016 å ±å‘Šä¼šć€ŒA Lite Rx API for the JVM怍/ äŗ•å£ č² @ SmartNews, Inc. by SmartNews, Inc.
SpringOne Platform 2016 å ±å‘Šä¼šć€ŒA Lite Rx API for the JVM怍/ äŗ•å£ č² @ SmartNews, Inc.SpringOne Platform 2016 å ±å‘Šä¼šć€ŒA Lite Rx API for the JVM怍/ äŗ•å£ č² @ SmartNews, Inc.
SpringOne Platform 2016 å ±å‘Šä¼šć€ŒA Lite Rx API for the JVM怍/ äŗ•å£ č² @ SmartNews, Inc.
SmartNews, Inc.•1.2K views
SmartNewsć®ćƒ‹ćƒ„ćƒ¼ć‚¹é…äæ”ć‚’ę”Æćˆć‚‹ć‚µćƒ¼ćƒęŠ€č”“ / Kazhiro Sera @ SmartNews,Inc. #jjug_ccc by SmartNews, Inc.
SmartNewsć®ćƒ‹ćƒ„ćƒ¼ć‚¹é…äæ”ć‚’ę”Æćˆć‚‹ć‚µćƒ¼ćƒęŠ€č”“ / Kazhiro Sera @ SmartNews,Inc. #jjug_cccSmartNewsć®ćƒ‹ćƒ„ćƒ¼ć‚¹é…äæ”ć‚’ę”Æćˆć‚‹ć‚µćƒ¼ćƒęŠ€č”“ / Kazhiro Sera @ SmartNews,Inc. #jjug_ccc
SmartNewsć®ćƒ‹ćƒ„ćƒ¼ć‚¹é…äæ”ć‚’ę”Æćˆć‚‹ć‚µćƒ¼ćƒęŠ€č”“ / Kazhiro Sera @ SmartNews,Inc. #jjug_ccc
SmartNews, Inc.•10.6K views
Stream Processing in SmartNews #jawsdays by SmartNews, Inc.
Stream Processing in SmartNews #jawsdaysStream Processing in SmartNews #jawsdays
Stream Processing in SmartNews #jawsdays
SmartNews, Inc.•22K views
AWSć®é€²åŒ–ćØSmartNewsć®č£å“ by SmartNews, Inc.
AWSć®é€²åŒ–ćØSmartNewsć®č£å“AWSć®é€²åŒ–ćØSmartNewsć®č£å“
AWSć®é€²åŒ–ćØSmartNewsć®č£å“
SmartNews, Inc.•12.8K views
SmartNews TechNight Vol.5 : AD Data Engineering in practice: SmartNews Adsč£ć®ćƒ‡... by SmartNews, Inc.
SmartNews TechNight Vol.5 : AD Data Engineering in practice: SmartNews Adsč£ć®ćƒ‡...SmartNews TechNight Vol.5 : AD Data Engineering in practice: SmartNews Adsč£ć®ćƒ‡...
SmartNews TechNight Vol.5 : AD Data Engineering in practice: SmartNews Adsč£ć®ćƒ‡...
SmartNews, Inc.•20.5K views
SmartNews TechNight Vol.5 : SmartNews Ads ć®é…äæ”ęœ€é©åŒ–ć®ä»•ēµ„ćæćÆć©ć†ćŖć£ć¦ć‚‹ć®ļ¼Ÿ (ć‚Øćƒ³ć‚øćƒ‹ć‚¢ / SmartN... by SmartNews, Inc.
SmartNews TechNight Vol.5 : SmartNews Ads ć®é…äæ”ęœ€é©åŒ–ć®ä»•ēµ„ćæćÆć©ć†ćŖć£ć¦ć‚‹ć®ļ¼Ÿ (ć‚Øćƒ³ć‚øćƒ‹ć‚¢ / SmartN...SmartNews TechNight Vol.5 : SmartNews Ads ć®é…äæ”ęœ€é©åŒ–ć®ä»•ēµ„ćæćÆć©ć†ćŖć£ć¦ć‚‹ć®ļ¼Ÿ (ć‚Øćƒ³ć‚øćƒ‹ć‚¢ / SmartN...
SmartNews TechNight Vol.5 : SmartNews Ads ć®é…äæ”ęœ€é©åŒ–ć®ä»•ēµ„ćæćÆć©ć†ćŖć£ć¦ć‚‹ć®ļ¼Ÿ (ć‚Øćƒ³ć‚øćƒ‹ć‚¢ / SmartN...
SmartNews, Inc.•35.2K views
SmartNews TechNight Vol5 : SmartNews AdServer 解体新書 / ćƒć‚¹ćƒˆćƒ¢ćƒ¼ćƒ†ćƒ  by SmartNews, Inc.
SmartNews TechNight Vol5 : SmartNews AdServer 解体新書 / ćƒć‚¹ćƒˆćƒ¢ćƒ¼ćƒ†ćƒ SmartNews TechNight Vol5 : SmartNews AdServer 解体新書 / ćƒć‚¹ćƒˆćƒ¢ćƒ¼ćƒ†ćƒ 
SmartNews TechNight Vol5 : SmartNews AdServer 解体新書 / ćƒć‚¹ćƒˆćƒ¢ćƒ¼ćƒ†ćƒ 
SmartNews, Inc.•32K views
SmartNews TechNight vol5 SmartNews Ads大図解 by SmartNews, Inc.
SmartNews TechNight vol5 SmartNews Ads大図解SmartNews TechNight vol5 SmartNews Ads大図解
SmartNews TechNight vol5 SmartNews Ads大図解
SmartNews, Inc.•25.6K views
SmartNews's journey into microservices by SmartNews, Inc.
SmartNews's journey into microservicesSmartNews's journey into microservices
SmartNews's journey into microservices
SmartNews, Inc.•11.1K views
Strem処理(Spark Streaming + Kinesis)とOffline処理(Hive)の統合 by SmartNews, Inc.
Strem処理(Spark Streaming + Kinesis)とOffline処理(Hive)の統合Strem処理(Spark Streaming + Kinesis)とOffline処理(Hive)の統合
Strem処理(Spark Streaming + Kinesis)とOffline処理(Hive)の統合
SmartNews, Inc.•2.3K views
SmartNews 恮 Webmining ć‚’ę”Æćˆć‚‹ćƒ•ć‚šćƒ©ćƒƒćƒˆćƒ•ć‚©ćƒ¼ćƒ  by SmartNews, Inc.
SmartNews 恮 Webmining ć‚’ę”Æćˆć‚‹ćƒ•ć‚šćƒ©ćƒƒćƒˆćƒ•ć‚©ćƒ¼ćƒ SmartNews 恮 Webmining ć‚’ę”Æćˆć‚‹ćƒ•ć‚šćƒ©ćƒƒćƒˆćƒ•ć‚©ćƒ¼ćƒ 
SmartNews 恮 Webmining ć‚’ę”Æćˆć‚‹ćƒ•ć‚šćƒ©ćƒƒćƒˆćƒ•ć‚©ćƒ¼ćƒ 
SmartNews, Inc.•10.1K views
AWS meetup怌Apache Spark on EMR怍 by SmartNews, Inc.
AWS meetup怌Apache Spark on EMR怍AWS meetup怌Apache Spark on EMR怍
AWS meetup怌Apache Spark on EMR怍
SmartNews, Inc.•7.4K views
Smartnews Product Manager Night by SmartNews, Inc.
Smartnews Product Manager NightSmartnews Product Manager Night
Smartnews Product Manager Night
SmartNews, Inc.•12.7K views
SmartNews Ads System - AWS Summit Tokyo 2015 by SmartNews, Inc.
SmartNews Ads System - AWS Summit Tokyo 2015SmartNews Ads System - AWS Summit Tokyo 2015
SmartNews Ads System - AWS Summit Tokyo 2015
SmartNews, Inc.•16.9K views
ć‚¤ćƒ³ćƒ•ćƒ©å°‚ä»»ć‚Øćƒ³ć‚øćƒ‹ć‚¢ćŒäø€äŗŗć‚‚ć„ćŖć„SmartNewsć«ćŠć‘ć‚‹ć‚Æćƒ©ć‚¦ćƒ‰ę“»ē”Øę³• by SmartNews, Inc.
ć‚¤ćƒ³ćƒ•ćƒ©å°‚ä»»ć‚Øćƒ³ć‚øćƒ‹ć‚¢ćŒäø€äŗŗć‚‚ć„ćŖć„SmartNewsć«ćŠć‘ć‚‹ć‚Æćƒ©ć‚¦ćƒ‰ę“»ē”Øę³•ć‚¤ćƒ³ćƒ•ćƒ©å°‚ä»»ć‚Øćƒ³ć‚øćƒ‹ć‚¢ćŒäø€äŗŗć‚‚ć„ćŖć„SmartNewsć«ćŠć‘ć‚‹ć‚Æćƒ©ć‚¦ćƒ‰ę“»ē”Øę³•
ć‚¤ćƒ³ćƒ•ćƒ©å°‚ä»»ć‚Øćƒ³ć‚øćƒ‹ć‚¢ćŒäø€äŗŗć‚‚ć„ćŖć„SmartNewsć«ćŠć‘ć‚‹ć‚Æćƒ©ć‚¦ćƒ‰ę“»ē”Øę³•
SmartNews, Inc.•12.4K views

Recently uploaded

Voice Logger - Telephony Integration Solution at Aegis by
Voice Logger - Telephony Integration Solution at AegisVoice Logger - Telephony Integration Solution at Aegis
Voice Logger - Telephony Integration Solution at AegisNirmal Sharma
31 views•1 slide
Transcript: The Details of Description Techniques tips and tangents on altern... by
Transcript: The Details of Description Techniques tips and tangents on altern...Transcript: The Details of Description Techniques tips and tangents on altern...
Transcript: The Details of Description Techniques tips and tangents on altern...BookNet Canada
135 views•15 slides
PharoJS - Zürich Smalltalk Group Meetup November 2023 by
PharoJS - Zürich Smalltalk Group Meetup November 2023PharoJS - Zürich Smalltalk Group Meetup November 2023
PharoJS - Zürich Smalltalk Group Meetup November 2023Noury Bouraqadi
126 views•17 slides
Automating a World-Class Technology Conference; Behind the Scenes of CiscoLive by
Automating a World-Class Technology Conference; Behind the Scenes of CiscoLiveAutomating a World-Class Technology Conference; Behind the Scenes of CiscoLive
Automating a World-Class Technology Conference; Behind the Scenes of CiscoLiveNetwork Automation Forum
30 views•35 slides
Vertical User Stories by
Vertical User StoriesVertical User Stories
Vertical User StoriesMoisƩs Armani Ramƭrez
12 views•16 slides
AMAZON PRODUCT RESEARCH.pdf by
AMAZON PRODUCT RESEARCH.pdfAMAZON PRODUCT RESEARCH.pdf
AMAZON PRODUCT RESEARCH.pdfJerikkLaureta
19 views•13 slides

Recently uploaded(20)

Voice Logger - Telephony Integration Solution at Aegis by Nirmal Sharma
Voice Logger - Telephony Integration Solution at AegisVoice Logger - Telephony Integration Solution at Aegis
Voice Logger - Telephony Integration Solution at Aegis
Nirmal Sharma•31 views
Transcript: The Details of Description Techniques tips and tangents on altern... by BookNet Canada
Transcript: The Details of Description Techniques tips and tangents on altern...Transcript: The Details of Description Techniques tips and tangents on altern...
Transcript: The Details of Description Techniques tips and tangents on altern...
BookNet Canada•135 views
PharoJS - Zürich Smalltalk Group Meetup November 2023 by Noury Bouraqadi
PharoJS - Zürich Smalltalk Group Meetup November 2023PharoJS - Zürich Smalltalk Group Meetup November 2023
PharoJS - Zürich Smalltalk Group Meetup November 2023
Noury Bouraqadi•126 views
Automating a World-Class Technology Conference; Behind the Scenes of CiscoLive by Network Automation Forum
Automating a World-Class Technology Conference; Behind the Scenes of CiscoLiveAutomating a World-Class Technology Conference; Behind the Scenes of CiscoLive
Automating a World-Class Technology Conference; Behind the Scenes of CiscoLive
AMAZON PRODUCT RESEARCH.pdf by JerikkLaureta
AMAZON PRODUCT RESEARCH.pdfAMAZON PRODUCT RESEARCH.pdf
AMAZON PRODUCT RESEARCH.pdf
JerikkLaureta•19 views
Piloting & Scaling Successfully With Microsoft Viva by Richard Harbridge
Piloting & Scaling Successfully With Microsoft VivaPiloting & Scaling Successfully With Microsoft Viva
Piloting & Scaling Successfully With Microsoft Viva
Richard Harbridge•12 views
Tunable Laser (1).pptx by Hajira Mahmood
Tunable Laser (1).pptxTunable Laser (1).pptx
Tunable Laser (1).pptx
Hajira Mahmood•24 views
handbook for web 3 adoption.pdf by Liveplex
handbook for web 3 adoption.pdfhandbook for web 3 adoption.pdf
handbook for web 3 adoption.pdf
Liveplex•22 views
GDG Cloud Southlake 28 Brad Taylor and Shawn Augenstein Old Problems in the N... by James Anderson
GDG Cloud Southlake 28 Brad Taylor and Shawn Augenstein Old Problems in the N...GDG Cloud Southlake 28 Brad Taylor and Shawn Augenstein Old Problems in the N...
GDG Cloud Southlake 28 Brad Taylor and Shawn Augenstein Old Problems in the N...
James Anderson•66 views
Igniting Next Level Productivity with AI-Infused Data Integration Workflows by Safe Software
Igniting Next Level Productivity with AI-Infused Data Integration Workflows Igniting Next Level Productivity with AI-Infused Data Integration Workflows
Igniting Next Level Productivity with AI-Infused Data Integration Workflows
Safe Software•257 views
Case Study Copenhagen Energy and Business Central.pdf by Aitana
Case Study Copenhagen Energy and Business Central.pdfCase Study Copenhagen Energy and Business Central.pdf
Case Study Copenhagen Energy and Business Central.pdf
Aitana•16 views
Spesifikasi Lengkap ASUS Vivobook Go 14 by Dot Semarang
Spesifikasi Lengkap ASUS Vivobook Go 14Spesifikasi Lengkap ASUS Vivobook Go 14
Spesifikasi Lengkap ASUS Vivobook Go 14
Dot Semarang•37 views
Perth MeetUp November 2023 by Michael Price
Perth MeetUp November 2023 Perth MeetUp November 2023
Perth MeetUp November 2023
Michael Price•19 views
The details of description: Techniques, tips, and tangents on alternative tex... by BookNet Canada
The details of description: Techniques, tips, and tangents on alternative tex...The details of description: Techniques, tips, and tangents on alternative tex...
The details of description: Techniques, tips, and tangents on alternative tex...
BookNet Canada•126 views
From chaos to control: Managing migrations and Microsoft 365 with ShareGate! by sammart93
From chaos to control: Managing migrations and Microsoft 365 with ShareGate!From chaos to control: Managing migrations and Microsoft 365 with ShareGate!
From chaos to control: Managing migrations and Microsoft 365 with ShareGate!
sammart93•9 views
1st parposal presentation.pptx by i238212
1st parposal presentation.pptx1st parposal presentation.pptx
1st parposal presentation.pptx
i238212•9 views

Building a Sustainable Data Platform on AWS

  • 1. Building a Sustainable Data Platform on AWS Takumi Sakamoto 2016.01.27
  • 2. Takumi Sakamoto @takus šŸ˜ = ⚽ ✈ šŸ“·
  • 7. What is SmartNews? • News Discovery App • Launched in 2012 • 15M+ Downloads in World Wide https://www.smartnews.com/en/
  • 8. Our Mission the world's quality information? the people who need it? How?
  • 9. Machine Learning URLs Found Structure Analysis Semantics Analysis Importance Estimation Diversification Internet 100,000+ /day 1000+ /day Feedback Deliver Trending Stories
  • 10. Data Platform Use Cases • Product development • track KPI such as DAU and MAU • A/B test for new feature, on-boarding, etc... • ad-hoc analysis • Provide data to applications • realtime re-ranking news articles • CTR prediction of Ads system • dashboard service for media partners
  • 11. Data & Its Numbers • User activities • ~100 GBs per day (compressed) • 60+ record types • User demographics or configurations etc... • 15M+ records • Articles metadata • 100K+ records per day
  • 13. Sustainable Data Platform • Provide a reliable and scalable "Lambda Architecture" • Minimize both operation & running cost • Be open to uncertain future
  • 15. Why Sustainable? • Do a lot with a few engineers • no one is a full-time maintainer • avoid to waste too much time • Empower brilliant engineers in SmartNews • everything should be as self-serve as possible • don't ask for permission, beg for forgiveness
  • 17. Ī» Architecture at SmartNews Input Batch Serving Speed Output
  • 18. Design Principles • Decoupled "Computation" and "Storage" layers • multiple consumers can use the same data • run consumers on Spot Instances • prevent serious data lost with minimum effort • Use the right tool for the job • leverage AWS managed service as possible • fill in the missing pieces by Presto & PipelineDB
  • 19. An Example Amazon EMR AMI 3.x Amazon S3 Amazon EMR Hive General Users Application Engineer I wanna upgrade hive Ad Engineer I wanna combine news data with ad data Amazon EMR AMI 4.x Amazon EMR Spark We’re satisfied with current version Data Scientist I wanna test my algorithm with the latest spark Batch Layer Run multiple EMR clusters for each usages Kinesis Stream Spark on EMR AWS Lambda Data Scientist I wanna consume streaming data by Spark Application Engineer I wanna add a streaming monitor by Lambda Speed Layer Consume the same data for each usages • AWS managed services • Replicated data into Multiple AZs • High availability
  • 21. Collect Events by Fluentd • Forwarder (running on each instances) • store JSON events to S3 • forward events to aggregators • collect metrics and post them to Datadog • Aggregator • input events into Kinesis & PipelineDB • other reporting tasks (not mentioned today)
  • 22. Forwards to S3 <source> @type tail format json path /data/log/user_activity.log pos_file /data/log/pos/user_activity.pos tag smartnews.user_activity time_key timestamp </source> <match smartnews.user_activity> @type copy <store> @type relabel @label @s3 </store> <store> @type forward @label @forward </store> </match> @include conf.d/s3.conf @include conf.d/forward.conf <label @s3> <% node[:td_agent][:s3].each do |c| -%> <match <%= c[:tag] %>> @id s3.<%= c[:tag] %> @type s3 ... path fluentd/<%= node[:env] %>/<%= node[:role] %>/<%= c[:tag] %> time_slice_format dt=%Y-%m-%d/hh=%H time_key timestamp include_time_key time_as_epoch reduced_redundancy true format json utc buffer_chunk_limit 2048m </match> <% end -%> </label> td-agent.conf conf.d/s3.conf
  • 23. Capture DynamoDB Streams <source> type dynamodb_streams stream_arn YOUR_DDB_STREAMS_ARN pos_file /path/to/table.pos fetch_interval 1 fetch_size 100 </source> https://github.com/takus/fluent-plugin-dynamodb-streams DynamoDB DynamoDB Streams Amazon S3 AWS Lambda Fluentd
  • 24. Recommended Practices • Make configuration simple as possible • fluentd can cover everything, but shouldn't • keep stateless • Use v0.12 or later • "Filter" : better performance • "Label": eliminate 'output_tag' configuration
  • 25. Monitor Fluentd Status • Monitor traffic volume & retry count by Datadog • Datadog's fluentd integration • fluent-plugin-flowcounter • fluent-plugin-dogstatsd
  • 26. Archive to Amazon S3 • I have 2 recommended settings • versioning • enable to recover from human error • lifecycle policy • minify storage cost Archives to IA or Gracier xx days after the creation date Keep previous versions xx days Save you in the future!!
  • 28. Various ETL Tasks • Extract • dump MySQL records by Embulk • make files on S3 readable to Hive • Transform • transform text files into columnar files (RCFile, ORC) • generate features for machine learning • aggregate records (by country, by channel) • Load • load aggregated metrics into Amazon Aurora
  • 29. Hive • Most popular project on Hadoop ecosystem • famous for its lovely logo :) • HiveQL and MapReduce • convert SQL-like query into MR jobs • Not adopt Tez engine yet • Amazon EMR doesn't support now • limited improvement to our queries
  • 30. How to process JSON? A. Transform into columnar table periodically • required converting job • better performance B. Use JSON-SerDe for temporary analysis • easy way for querying raw json text files • required to "drop table" for change schema • performance is not good
  • 31. Transform Tables -- Make S3 files readable by Hive ALTER TABLE raw_activities ADD IF NOT EXISTS PARTITION (dt='${DATE}', hh='${HOUR}'); -- Transform text files into columnar files (Flatten JSON) INSERT OVERWRITE TABLE activities PARTITION (dt='${DATE}', action) SELECT user_id, timestamp, os, country, data, action FROM raw_activities LATERAL VIEW json_tuple( raw_activities.json, 'userId','timestamp','platform','country','action','data' ) a as user_id, timestamp, os, country, action, data WHERE dt = '${DATE}' CLUSTER BY os, country, action, user_id ;
  • 32. JSON-SerDe -- Define table with SERDE CREATE TABLE json_table ( country string, languages array<string>, religions map<string,array<int>> ) ROW FORMAT SERDE 'org.openx.data.jsonserde.JsonSerDe' STORED AS TEXTFILE; -- Result: 10 SELECT religions['catholic'][0] FROM json_table;
  • 33. cf. hive-ruby-scripting -- Define your ruby (JRuby) script SET rb.script= require 'json' def parse (json) j = JSON.load(json) j['profile']['attribute1'] end ; -- Use the script in HQL SELECT rb_exec('&parse', json) FROM user; https://github.com/gree/hive-ruby-scripting
  • 35. Self-Serve via AWS CLI # Create EMR clusters that runs Hive & Spark & Ganglia aws emr create-cluster --name "My Cluster" --release-label emr-4.2.0 --applications Name=Hive Name=Spark Name=GANGLIA --ec2-attributes KeyName=myKey --instance-type c3.4xlarge --instance-count 4 --use-default-roles
  • 36. Minimize expenses • Use Spot Instances as possible • typically discount 50-90% • select instance type with stable price • C3 families spike often :( • Dynamic cluster resizing • x2 capacity during daily batch job • 1/2 capacity during midnight
  • 38. Typical Anti-Pattern 5 * * * * app hive -f query_1.hql 15 * * * * app hive -f query_2.hql 30 * * * * app hive -f query_3.hql 0 * * * * app hive -f query_4.hql 1 * * * * app hive -f query_5.hql
  • 39. Workflow Management • Define dependencies • task E is executed after finishing task C and task D • Scheduling • task A is kicked after 09:00 AM • throttle concurrent running of the same task • Monitoring • notification in failure • task C must finish before 01:00 PM (SLA) cf. http://www.slideshare.net/taroleo/workflow-hacks-1-dots-tokyo
  • 40. Airflow • A workflow management systems • define workflow by Python • built in shiny UI & CLI • pluggable architecture http://nerds.airbnb.com/airflow/
  • 41. Define Tasks dag = DAG('tutorial', default_args=default_args) t1 = BashOperator( task_id='print_date', bash_command='date', dag=dag) t2 = BashOperator( task_id='sleep', bash_command='sleep 5', retries=3, dag=dag) t3 = BashOperator( task_id='templated', bash_command=""" {% for i in range(5) %} echo "{{ ds }}" echo "{{ macros.ds_add(ds, 7)}}" echo "{{ params.my_param }}" {% endfor %} """, params={'my_param': 'Parameter I passed in'}, dag=dag) t2.set_upstream(t1) t3.set_upstream(t1) Task Dependencies Python code DAG
  • 42. Workflow as Code Deploy codes automatically after merging into master
  • 44. What is done or not?
  • 45. Alerting to Slack • SLA Violation • task A should be done till 00:00 PM • other team's task K has dependency into task A • Output validation failure • stop the following tasks if the output is doubtful
  • 46. Retry from Web UI Once clear histories, airflow scheduler back fill the histories
  • 47. Retry from CLI // Clear some histories from 2016-01-01 airflow clear etl_smartnews --task_regex user_ --downstream --start_date 2016-01-01 // Backfill uncompleted tasks airflow backfill etl_smartnews --start_date 2016-01-01
  • 49. How Long Each Tasks?
  • 50. Pluggable Architecture • Built-in plugins • operator: bash, hive, preto, mysql • transfer: hive_to_mysql • sensor: wait_hive_partition, wait_s3_file • Written our own plugin • mysql_partition
  • 51. Examples user_sensor = S3KeySensor( task_id='wait_user', bucket_name='smartnews', bucket_key='user/dt={{ ds }}/dump.csv', ) etl = HiveOperator( task_id="task1", hql="INSERT OVERWRITE INTO...." ) etl.set_upstream(user_sensor) import = HiveToMySqlTransfer( task_id=name, mysql_preoperator="DELETE FROM %s WHERE date = '{{ ds }}'" % table, sql="SELECT country, count(*) FROM %s" % table, mysql_table=table ) import.set_upstream(etl) Wait a S3 file creation After the file is created, Run ETL Query After that, Import into MySQL
  • 53. Provides batch views in low-latency and ad-hoc way
  • 54. Presto • A distributed SQL query engine • join multiple data sources (Hive + MySQL) • support standard ANSI SQL • designed to handle TBs or PBs scale data cf. http://www.slideshare.net/frsyuki/presto-hadoop-conference-japan-2014
  • 55. Presto Architecture Amazon S3 Kinesis Stream Amazon RDS Amazon Aurora Presto Worker Presto Worker Presto Worker Presto Worker Presto Worker Presto Worker Presto Coordinator Client 1. Query with Standard SQL 4. Scan data concurrently 5. Aggregate data without disk I/O 6. Return result to client 2. Generate execution plan 3. Dispatch tasks into multiple workers Amazon EMR (Hive Metastore) Provides Hive table metadata (S3 access only) ※ https://github.com/qubole/presto-kinesis ※
  • 56. Why Presto? • Join multiple data sources • skip large parts of ETL process • enable to merge Hive/MySQL/Kinesis/PipelineDB • Low latency • ~30s to scan billions records in S3 • Low maintenance cost • stateless, and easy to integrate with Auto Scaling
  • 57. Use case: A/B Test -- Suppose that this table exists DESC hive.default.user_activities; user_id bigint action varchar abtest array<map<varchar, bigint>> url varchar -- Summarize page view per A/B Test identifier -- for comparing two algorithms v1 & v2 SELECT Ā  dt, Ā  t['behaviorId'], Ā  count(*) as pv FROM hive.default.user_activities CROSS JOIN UNNEST(abtest) AS t (t) WHERE dt like '2016-01-%' AND action = 'viewArticle' AND t['definitionId'] = 163 GROUP BY dt, t['behaviorId'] ORDER BY dt ; 2015-12-01 | algorithm_v1 | 40000 2015-12-01 | algorithm_v2 | 62000
  • 58. Use case: Troubleshoot -- Store access logs to S3, and query to them -- Summarize access & 95pct response time by SQL SELECT from_unixtime(timestamp), count(*) as access, approx_percentile(reqtime, 0.95) as pct95_reqtime FROM hive.default.access_log WHERE dt = '2015-11-04' AND hh = '13' AND role = 'xxx' GROUP BY timestamp ORDER BY timestamp ; 2015-11-04 22:00:00.000 | 6377 | 0.522 2015-11-04 22:00:01.000 | 3580 | 0.422
  • 59. Scheduled Auto Scaling $ aws autoscaling describe-scheduled-actions { "ScheduledUpdateGroupActions": [ { "DesiredCapacity": 2, "AutoScalingGroupName": "presto-worker-prd", "Recurrence": "59 14 * * *", "ScheduledActionName": "scalein-2359-jst" }, { "DesiredCapacity": 20, "AutoScalingGroupName": "presto-worker-prd", "Recurrence": "45 0 * * 1-5", "ScheduledActionName": "scaleout-0945-jst" } ] }
  • 60. Presto Covers Everything? No! • Fixed system on Amazon Aurora (or other RDB) • provides KPI for products & business • require high availability & low latency • has no flexibility • Ad-hoc system on Presto • provides access to all dataset on data platform • require high scalability • has flexibility (join various data sources)
  • 61. Why Fixed vs Ad-hoc? • Difficulties on the Ad-hoc only solution • difficult to prevent heavy queries • large distinct count exhausts computing resources • decrease presto maintainability
  • 63. Chartio • Dashboard as A Service • helps businesses analyze and track their critical data • one of AWS partners (※) • Combine multiple data sources at one dashboard • Presto, MySQL, Redshift, BigQuery, Elasticsearch ... • enable to join BigQuery + MySQL internally • Easy to use for every one • everyone can make their own dashboard • write SQL directly / generate query by drag & drop ※ http://www.aws-partner-directory.com/PartnerDirectory/PartnerDetail?id=8959
  • 64. Creating dashboard 1. Building query (Drag&Drop / SQL) 2. Add step (filter态sort态modify) 3. Select visualize way (table态graph)
  • 66. Why Chartio? • Chartio saves a lot of engineering resources • before • maintain in-house dashboard written by rails • everyone got tired to maintain it • after • everyone can build their own dashboard easily • Chartio's UI is cool • very important factor for dashboard tool
  • 67. Missing Pieces of Chartio • No programable API provides • need to edit dashboard / chart manually • No rollback feature • all changes are recorded, but not rollback to the previous state • work around : clone => edit => rename
  • 69. Why Speed is Matter?
  • 70. Today’s News is Wrapping Tomorrow’s Fish and Chips
  • 73. Use cases • Re-rank news articles by user feedback • track user's positive/negative signal • consider gender, age, location, interests • Realtime article monitoring • detect high bounce rate (may be broken?) • make realtime reporting dashboard for A/B test
  • 74. Realtime Re-Ranking ref. Stream 処理 (Spark Streaming + Kinesis) と Offline 処理 (Hive) の統合 www.slideshare.net/smartnews/stremspark-streaming-kinesisofflinehive Amazon CloudSearch Search API API Gateway Kinesis Stream Amazon S3 Amazon EMR Amazon S3 Amazon EMR DynamoDB Realtime Feedback Re-rank Articles Article Metadata User Interests User Behaviors Offline Procees by Hive / Spark
  • 75. Realtime Monitoring API Gateway Stream Continuous View Continuous View Continuous View Discard raw record soon after consumed by Continuous View Incrementally updated in realtime PipelineDB Chartio AWS Lambda Slack Access Continuous View by PostgreSQL Client Record ※1 ※1 Use cron on 26 Feb. 2016 Migrate it soon after supporting VPC
  • 76. PipelineDB • OSS & enterprise streaming SQL database • PostgreSQL compatible • connect to Chartio šŸ˜ • join stream to normal PostgreSQL table • Support probabilistic data structures • e.g. HyperLogLog https://www.pipelinedb.com/ http://developer.smartnews.com/blog/2015/09/09/20150907pipelinedb/
  • 77. Continuous View -- Calculate unique users seen per media each day -- Using only a constant amount of space (HyperLogLog) CREATE CONTINUOUS VIEW uniques AS SELECT day(arrival_timestamp), substring(url from '.*://([^/]*)') as hostname, COUNT(DISTINCT user_id::integer) FROM activity_stream GROUP BY day,hostname; -- How many impressions have we served in the last five minutes? CREATE CONTINUOUS VIEW imps WITH (max_age = '5 minutes') AS SELECT COUNT(*) FROM imps_stream; -- What are the 90th, 95th, 99th percentiles of request latency? CREATE CONTINUOUS VIEW latency AS SELECT percentile_cont(array[90, 95, 99]) WITHIN GROUP (ORDER BY latency::integer) FROM latency_stream;
  • 79. Sustainable Data Platform • build a reliable and scalable lambda architecture • minimize operation & running cost • be open to uncertain future
  • 80. My Wishlist to AWS • Support Reduced Redundancy Storage (RRS) on EMR • Faster EMR Launch • Set TTL to DynamoDB records • Auto-scale Kinesis Stream • Launch Kinesis Analytics in Tokyo region