SlideShare a Scribd company logo
1 of 29
Spark + Cassandra ๊ธฐ๋ฐ˜ Big Data๋ฅผ ํ™œ์šฉํ•œ
์ถ”์ฒœ์‹œ์Šคํ…œ ์„œ๋น™ ํŒŒ์ดํ”„๋ผ์ธ ์ตœ์ ํ™”
2020.11.26
SSG.COM
๋ฐ•์ˆ˜์„ฑ
CONTENTS
1. E-commerce Data Use case
2. Data Pipeline with Spark + Cassandra
3. Trouble Shooting & Optimization
4. Q&A
1. E-commerce Data Use case
- ๊ณ ๊ฐ์˜ ํ–‰๋™ (๋ฐฉ๋ฌธ, ๊ฒ€์ƒ‰, ์žฅ๋ฐ”๊ตฌ๋‹ˆ, ํด๋ฆญ, ๊ตฌ๋งค, ๋ฆฌ๋ทฐ ๋“ฑ) ๋ฐ์ดํ„ฐ๊ฐ€ ์ƒ์„ฑ
- ์ƒํ’ˆ ์ถ”์ฒœ, ์ˆ˜์š” ์˜ˆ์ธก, ํŠธ๋ Œ๋“œ ๋ถ„์„ ๋“ฑ์— ํ™œ์šฉ ๊ฐ€๋Šฅ
๊ณ ๊ฐ๋ณ„ ์ถ”์ฒœ ์ƒํ’ˆ๋ณ„ ์ถ”์ฒœ
๊ณ ๊ฐ๋ณ„๋กœ ๋‹ค๋ฅธ
์ƒํ’ˆ ์ถ”์ฒœ
Ex) ๋‹ค๋ฅธ ๊ณ ๊ฐ์˜
Path ์ฐธ๊ณ 
์ƒํ’ˆ๋ณ„๋กœ ๋‹ค๋ฅธ
์ƒํ’ˆ ์ถ”์ฒœ
Ex) ๋Œ€์ฒด์žฌ, ๋ณด์™„
์žฌ ๋“ฑ..
1. E-commerce Data Use case
- ์นด๋ ˆ์—ฌ์™•์„ ๊ตฌ๋งคํ•˜๋Š” ๊ณ ๊ฐ์—๊ฒŒ ๋ณ„๋„์˜ ๋ฌถ์Œ ์ƒํ’ˆ์„ ์ œ๊ณต
- ํ•ด๋‹น ์ƒํ’ˆ๊ณผ ํ•จ๊ป˜ ๋ฐฉ๋ฌธ/๊ตฌ๋งค/์žฅ๋ฐ”๊ตฌ๋‹ˆ ์•ก์…˜์ด ์ผ์–ด๋‚˜๋Š” ์ƒํ’ˆ ๋ฌถ์Œ์„ ๋…ธ์ถœํ•˜๋ฉด์„œ ์ฟ ํฐ&ํ• ์ธ ์ œ๊ณต
ํ•จ๊ป˜ ๊ตฌ๋งคํ•˜๋„๋ก ์œ ๋„
1. E-commerce Data Use case
- ์ˆ˜ ๋งŽ์€ ํ”„๋ ˆ์ž„์›Œํฌ์—์„œ๋Š” ์‰ฝ๊ฒŒ ์ ‘๊ทผ์ด ๊ฐ€๋Šฅํ•œ ์•Œ๊ณ ๋ฆฌ์ฆ˜์ด ์กด์žฌ
- ์ „์ฒ˜๋ฆฌ, ํ›„์ฒ˜๋ฆฌ, ํŒŒ๋ผ๋ฏธํ„ฐ ์กฐ์ ˆ ๋งŒ์œผ๋กœ๋„ ์‰ฝ๊ฒŒ ์ถ”์ฒœ ๋ฐ์ดํ„ฐ Set์„ ๋งŒ๋“ค์–ด๋‚ผ ์ˆ˜ ์žˆ์Œ.
Ex) MLlib: Main Guide
- Basic statistics
- Pipelines
- Extracting, transforming and selecting
features
- Classification and Regression
- Clustering
- Collaborative filtering
- Frequent Pattern Mining
- Model selection and tuning
Spark MLlib์˜ FP-Growth ์˜ˆ์ œ ์ฝ”๋“œ
- ๊ณผ๊ฑฐ์— ๋น„ํ•ด ๊ฐœ๋ฐœ์ž๋“ค๋„ ML์— ๋Œ€ํ•œ ์ ‘๊ทผ์ด ์‰ฌ์›Œ์ง€๊ณ  ์žˆ์Œ.
- ๋ถ„๋ช… ์—ฌ๋Ÿฌ ํ”„๋ ˆ์ž„์›Œํฌ์—์„œ ์ œ๊ณตํ•˜๋Š” ์•Œ๊ณ ๋ฆฌ์ฆ˜์— ๋ถ€์กฑํ•œ ๋ถ€๋ถ„์ด ์กด์žฌ.
- But ๊พธ์ค€ํžˆ ๋ฐœ์ „ ์ค‘์ด๊ณ  ์ƒˆ๋กœ์šด ๊ฒƒ์ด ๊ณ„์†ํ•ด์„œ ๋“ฑ์žฅ.
1. E-commerce Data Use case
- ์ข‹์€ ์•Œ๊ณ ๋ฆฌ์ฆ˜์œผ๋กœ ์ œํ’ˆ์„ ๋งŒ๋“ค์–ด๋‚ด๋Š” ๊ฒƒ๋„ ์ค‘์š”
- ํ•˜์ง€๋งŒ ๊ณ ๊ฐ์—๊ฒŒ ๋ฐ์ดํ„ฐ๋ฅผ ์„œ๋น™ํ•  ์ˆ˜ ์žˆ์–ด์•ผ ์˜๋ฏธ๊ฐ€ ์žˆ์Œ
- ์„œ๋น™ ํŒŒ์ดํ”„๋ผ์ธ ๊ตฌ์ถ•์€ ๊ฒฝํ—˜๊ณผ ๋…ธํ•˜์šฐ๊ฐ€ ํ•„์š”
1. E-commerce Data Use case
2. Data Pipeline with Spark + Cassandra
HDFS
Mesos
Spark
HDFS
Mesos
Spark
HDFS
Mesos
Spark
HDFS
Mesos
Spark
HDFS
Mesos
Spark
HDFS
Mesos
Spark
HDFS
Mesos
Spark
HDFS
Mesos
Spark
HDFS
Mesos
Spark
HDFS
Mesos
Spark
Kafka
Logstash
๋ฐ์ดํ„ฐ ์ €์žฅ&์ฒ˜๋ฆฌ ์ถ”์ฒœ์…‹ ์ €์žฅ
API Server
Database
Database
Database
Database
Database
Spring
Boots
์ˆ˜์ง‘์„œ๋ฒ„
์„œ๋น™ ๋ ˆ์ด์–ด ๊ฐ„์†Œํ™”๋ฅผ ์˜ˆ์‹œ
2. Data Pipeline with Spark + Cassandra
- ๋งค์ผ ์ƒˆ๋ฒฝ ์‹œ๊ฐ„๋Œ€์— ์ˆ˜๋ฐฑ GB์˜ ๋ฐ์ดํ„ฐ์…‹๋“ค์„ DB์— Insert
- ํŠน์ • ์‹ค์‹œ๊ฐ„ ๋กœ๊ทธ๋“ค ๋‹จ๊ฑด Insert
- ๋ณ‘๋ ฌ ์ˆ˜ํ–‰์„ ์œ„ํ•œ Spark์™€์˜ ๊ถํ•ฉ
- ๋‹จ์ˆœ Select๊ฐ€ ์ฃผ์š” ์ฟผ๋ฆฌ
- ํŠน์ • ํšŒ์›/์ƒํ’ˆ Skew
- DB Downtime์ด ๋ฐœ์ƒํ•˜๋”๋ผ๋„ ์ง€์†๊ฐ€๋Šฅํ•œ ์šด์˜
- ์ถ”ํ›„ ์‰ฝ๊ฒŒ ํ™•์žฅ ๊ฐ€๋Šฅํ•ด์•ผ ํ•จ.
- ๋ชจ๋‹ˆํ„ฐ๋ง ํ”„๋กœ์„ธ์Šค ํ•„์š”
2. Data Pipeline with Spark + Cassandra / Serving Layer Architecture
- ์ƒˆ๋ฒฝ ์‹œ๊ฐ„๋Œ€์— ์ˆ˜๋ฐฑ GB์˜ ๋ฐ์ดํ„ฐ์…‹๋“ค์„ ๋งค์ผ DB์— Insert
- ํŠน์ • ์‹ค์‹œ๊ฐ„ ๋กœ๊ทธ๋“ค ๋‹จ๊ฑด Insert
(๋ฐฐ์น˜ ์ปค์Šคํ„ฐ๋งˆ์ด์ง• ์˜์—ญ)
- ๋ณ‘๋ ฌ ์ˆ˜ํ–‰์„ ์œ„ํ•œ Spark์™€์˜ ๊ถํ•ฉ
(Cassandra ์—ญ์‹œ Apache Project)
- ๋‹จ์ˆœ Select๊ฐ€ ์ฃผ์š” ์ฟผ๋ฆฌ
(Cassandra๋Š” key ์กฐํšŒ์— ์•Œ๋งž์Œ.)
- ํŠน์ • ํšŒ์›/์ƒํ’ˆ Skew
(์ƒค๋”ฉ์— ๋Œ€ํ•œ ๊ณ ๋ ค ์•ˆํ•ด๋„ ๋จ)
- DB Downtime์ด ๋ฐœ์ƒํ•˜๋”๋ผ๋„ ์ง€์†๊ฐ€๋Šฅํ•œ ์šด์˜
(Ring ๊ตฌ์กฐ๋ผ์„œ ์‰ฝ๊ฒŒ Up/Down Serivce ๊ฐ€๋Šฅ)
- ์ถ”ํ›„ ์‰ฝ๊ฒŒ ํ™•์žฅ ๊ฐ€๋Šฅํ•ด์•ผ ํ•จ.
(Ring ๊ตฌ์กฐ์— ๋‹จ์ˆœํžˆ ์ถ”๊ฐ€ํ•˜๋Š” ๊ฒƒ์œผ๋กœ Scale Out ๊ฐ€๋Šฅ)
- ๋ชจ๋‹ˆํ„ฐ๋ง ํ”„๋กœ์„ธ์Šค ํ•„์š”
(JMX exporter๋ฅผ ํ†ตํ•ด Metric ์ •๋ณด๋ฅผ ์ˆ˜์ง‘&๋ชจ๋‹ˆํ„ฐ๋ง ๊ฐ€๋Šฅ)
What is Cassandra?
- No master & slaves
- distributed like a ring
- Scalability
- high availability
2. Data Pipeline with Spark + Cassandra / Serving Layer Architecture
Memory
2. Data Pipeline with Spark + Cassandra / Serving Layer Architecture
Memtable
1๋ฒˆ SSTable
2๋ฒˆ SSTable
3๋ฒˆ SSTable
4๋ฒˆ SSTable
n๋ฒˆ SSTable
ํ…Œ์ด๋ธ” A
Disk
Data Path
Spark
Driver
Spark Excutor
partition
Spark Excutor
partition
Spark Excutor
partition
Spark Excutor
partition
Spark Excutor
partition
Spark Excutor
partition
2. Data Pipeline with Spark + Cassandra
API Server
Spring
Boots
Spark
Streaming
์ผ๋ฐฐ์น˜์„ฑ ๋Œ€์šฉ๋Ÿ‰ ๋ฐ์ดํ„ฐ
์‹ค์‹œ๊ฐ„์„ฑ ์ €์šฉ๋Ÿ‰ ๋ฐ์ดํ„ฐ
3. Trouble Shooting & Optimization
๋ชจ๋“  ๋ฐฐ์น˜์„ฑ ๋ฐ์ดํ„ฐ, ์‹ค์‹œ๊ฐ„ ๋ฐ์ดํ„ฐ๋ฅผ memtable(In-memory) ๋ฐฉ์‹์œผ๋กœ Insert ํ•จ.
์‹ค์‹œ๊ฐ„ ๋ฐ์ดํ„ฐ๋Š” ์ž‘์ง€๋งŒ ๋ฐฐ์น˜์„ฑ ๋ฐ์ดํ„ฐ๋Š” ์ˆœ๊ฐ„ ์ˆ˜๋ฐฑ GB์˜ ๋ฐ์ดํ„ฐ๊ฐ€ ๋ฉ”๋ชจ๋ฆฌ์— ์˜ฌ๋ผ๊ฐ€๋Š” ํšจ๊ณผ.
๊ทธ ๊ฒฐ๊ณผ ์ฒ˜๋ฆฌ์œจ์— ๋”ฐ๋ผ CPU๊ฐ€ 100%์— ๋„๋‹ฌํ•˜๋ฉฐ ์ œ์‹œ๊ฐ„์— Response๋ฅผ ์ฃผ์ง€ ๋ชปํ•˜๋Š” ๋ฌธ์ œ๊ฐ€ ๋ฐœ์ƒ.
CPU 100%!!
60k/s
3. Trouble Shooting & Optimization
ํ•ด๋‹น ๋ฐ์ดํ„ฐ๊ฐ€ ๋„ˆ๋ฌด ํด ๊ฒฝ์šฐ Memtable(In-memory) -> SSTable(Disk)๋กœ ๋‚ด๋ฆฌ๋Š” ๊ณผ์ •์ธ Flush๊ฐ€ ์กด์žฌํ•จ.
Service Downtime์„ ๊ณ ๋ คํ•˜์—ฌ Replica๋ฅผ 3์œผ๋กœ ์žก์Œ.
๋”ฐ๋ผ์„œ Flush ๋ฐ Copy๋กœ ์ธํ•œ Batch ์‹œ๊ฐ„์€ ๊ธธ์–ด์ง€๊ณ  ์ด ๋ชจ๋“  ์‹œ๊ฐ„๋Œ€๋Š” ์žฅ์• ์ƒํ™ฉ์œผ๋กœ ๊ฐ„์ฃผ.
Connection Timeouts / Pending threads
3. Trouble Shooting & Optimization
Idea ?
- ๋Œ€์šฉ๋Ÿ‰์˜ ๋ฐ์ดํ„ฐ๋Š” Memtable -> SSTable ๋ฐฉ์‹์œผ๋กœ Insert ๋ณ€๊ฒฝ.
- ์‹ค์‹œ๊ฐ„์„ฑ ๋‹จ๊ฑด Insert ๋ฐ์ดํ„ฐ๋Š” Spark Streaming + Memtable ๋ฐฉ์‹์œผ๋กœ Insert ์œ ์ง€.
Spark
Streaming
Spark Cluster
์‹ค์‹œ๊ฐ„/๋‹จ๊ฑด
์ผ๋ฐฐ์น˜ ๋Œ€์šฉ๋Ÿ‰
Memtable Insert
SSTable File ์ƒ์„ฑ -> SSTable Bulk Load
Spark
Driver
Spark Excutor
partition
Spark Excutor
partition
Spark Excutor
partition
Spark Excutor
partition
Spark Excutor
partition
Spark Excutor
partition
API Server
Spring
Boots
SSTable Files SSTable Files SSTable Files SSTable Files SSTable FilesSSTable Files
3. Trouble Shooting & Optimization
Spark
Streaming
์ผ๋ฐฐ์น˜์„ฑ ๋Œ€์šฉ๋Ÿ‰ ๋ฐ์ดํ„ฐ
์‹ค์‹œ๊ฐ„์„ฑ ์ €์šฉ๋Ÿ‰ ๋ฐ์ดํ„ฐ
* ์ž‘์—… ์ˆœ์„œ
1. UUID๋ฅผ ํ™œ์šฉํ•˜์—ฌ SSTable Directory ์ƒ์„ฑ
2. Directory์— SSTable ์ƒ์„ฑ
3. SSTable Bulk Load To Cassandra
4. Delete Directory
3. Trouble Shooting & Optimization
* ๊ธฐ๋Œ€ํšจ๊ณผ
- ๊ฐ ์นด์‚ฐ๋“œ๋ผ ๋…ธ๋“œ๋Š” SSD์ด๊ธฐ ๋•Œ๋ฌธ์— ํšจ์œจ ์ฆ๊ฐ€
- CPU ์‚ฌ์šฉ์ด ๋ฏธ๋ฏธํ•  ๊ฒƒ์ด๊ธฐ ๋•Œ๋ฌธ์— ์šด์˜ ์ƒ์— ์˜ํ–ฅ ๋ฏธ๋ฏธ
- Network/Disk ์„ฑ๋Šฅ์ด ์ถฉ๋ถ„ํžˆ ๋ฐ›์ณ์ค€๋‹ค๋ฉด ๋” ๋งŽ์€ ๋ฐฐ์น˜๋ฅผ ๋™์‹œ์ˆ˜ํ–‰ ๊ฐ€๋Šฅ
3. Trouble Shooting & Optimization
๋ณ‘๋ ฌ์ˆ˜ํ–‰์„ ์œ„ํ•œ repartition
UUID๋ฅผ ํ™œ์šฉํ•ด์„œ ๋””๋ ‰ํ† ๋ฆฌ ์ƒ์„ฑ
SSTable ์ƒ์„ฑ
๋žœ๋ค์œผ๋กœ ์นด์‚ฐ๋“œ๋ผ ๋…ธ๋“œ ์„ ํƒ
Stream buffer size ์กฐ์ ˆ ๋ฐ ์ „์†ก
๋””๋ ‰ํ† ๋ฆฌ ์‚ญ์ œ
์กฐ์ ˆ ๊ฐ€๋Šฅํ•œ ๋ณ€์ˆ˜
1. partition_num
2. streamthrottlembits
3. Trouble Shooting & Optimization
Ex) ์ˆ˜์‹  ์ธก์ด Network Traffic์€ ์ตœ๋Œ€ 1Gbps๋ฅผ ๋ฐ›์„ ์ˆ˜ ์žˆ๊ณ  SSD๋Š” 2GBs. (Network Traffic๋งŒ ๊ณ ๋ ค)
Base Line์„ Max 35%์ธ 350Mbps๋งŒ ์‚ฌ์šฉํ•˜๋„๋ก ๊ฐ€์ •.
๋งŒ์•ฝ HDFS Size 1G๋‹น ํŒŒํ‹ฐ์…˜ 1๊ฐœ๋กœ ๊ณ ์ •ํ•œ๋‹ค๋ฉด 10GB๋ฅผ ์ „์†กํ•  ๊ฒฝ์šฐ 10๋Œ€๊ฐ€ ๋ณ‘๋ ฌ๋กœ ์ˆ˜ํ–‰.
Streamthrottlembits * partition_num = 350 (mbps)
350Mbps๋กœ ์ œํ•œํ•˜๋Š” ์˜ˆ์ œ ์ฝ”๋“œ
3. Trouble Shooting & Optimization
๊ธฐ๋Œ€ํ–ˆ๋˜ Base Line์ธ 350Mbps์ด ์ตœ๋Œ€์น˜์˜€์œผ๋ฉฐ ์ด๋Ÿฌํ•œ ๊ฒฝ์šฐ ๋ฐฐ์น˜๋ฅผ 2๊ฐœ๊นŒ์ง€ ์ˆ˜ํ–‰ ๊ฐ€๋Šฅ.(~70%)
์‹ค์ œ ์šด์˜ ์‹œ์—๋Š” ์žฅ๋น„๊ฐ€ ๋” ์ข‹์„ ๊ฒƒ์ด๋ฏ€๋กœ ๋” ํ—ค๋น„ํ•˜๊ฒŒ ์‚ฌ์šฉ๊ฐ€๋Šฅ
์ค‘์š”ํ•œ ๊ฒƒ์€ ๊ทธ ํŠธ๋ž˜ํ”ฝ์„ ์–ผ๋งˆ๋‚˜ ์ž˜ ์ œ์–ดํ•˜๊ณ , ๋ฐฐ์น˜๋ณ„๋กœ ์–ด๋–ป๊ฒŒ ๋™์ ์œผ๋กœ ๋ถ„๋ฐฐํ•  ์ง€๋ฅผ ๊ณ ๋ฏผ.
๊ฒฐ๊ณผ
์ •ํ™•ํžˆ 35%๋งŒ ์‚ฌ์šฉ!!
3. Trouble Shooting & Optimization
์ „ ํ›„
CPU, Memory ์‚ฌ์šฉ ๊ฐ์†Œ๋กœ ์ธํ•ด ๋™์‹œ์— ๋” ๋งŽ์€ ๋ฐฐ์น˜๊ฐ€ ์ˆ˜ํ–‰ ๊ฐ€๋Šฅํ•ด์ง (Network, Disk I/O ๊ณ ๋ ค)
60K write/sec
MAX CPU Usage 100%
60 write/sec
MAX CPU Usage 20%
- ๋ฐ์ดํ„ฐ์…‹ Size์— ๋”ฐ๋ผ ๊ฐ ๋…ธ๋“œ์—์„œ๋Š” ์ผ๋ณ„ ์ˆ˜๋ฐฑ~์ˆ˜์ฒœ๊ฐœ๊ฐ€ ์ƒ์„ฑ์ด ๋˜๊ณ  ๋งค์ผ ์Œ“์ด๋Š” ๊ตฌ์กฐ
- Cache์— ์˜ฌ๋ผ๊ฐ€๊ธฐ ์ „์— ์ˆ˜ ๋งŽ์€ SSTable์„ Readํ•˜๋ฉด์„œ ์„ฑ๋Šฅ ํ•˜๋ฝ์˜ ์›์ธ์ด ๋  ์ˆ˜ ์žˆ์Œ
-> ๋”ฐ๋ผ์„œ ๋งค์ผ Compaction์„ ์ˆ˜ํ–‰ํ•˜๋ฉด์„œ SSTable์„ ์ค„์—ฌ์ฃผ๋Š” ์ž‘์—… ์ง„ํ–‰
3. Trouble Shooting & Optimization
Compaction์„ ํ†ตํ•ด SSTable์ด ์ค„์–ด๋“œ๋Š” ๊ทธ๋ž˜ํ”„
* Azul Systems์˜ Zing ๋„์ž… ๋ฐฐ๊ฒฝ
- ์ถ”์ฒœ์…‹ ๋ฐฐ์น˜๊ฐ€ ์ ์  ๋” ๋งŽ์•„์ง
- ๋ฐ์ดํ„ฐ ์‚ฌ์ด์ฆˆ๊ฐ€ ๊ณ„์†ํ•ด์„œ ์ปค์ง
- ์ถ”๊ฐ€ ํ”„๋กœ์ ํŠธ ์ง„ํ–‰์‹œ ํŠธ๋ž˜ํ”ฝ์ด ๋ณด์ˆ˜์ ์œผ๋กœ 5๋ฐฐ๋Š” ๋Š˜์–ด๋‚  ๊ฒƒ์œผ๋กœ ์˜ˆ์ƒ
ํŠนํžˆ ์ถ”ํ›„ ํŠธ๋ž˜ํ”ฝ์ด ๋งŽ์•„์งˆ ๊ฒฝ์šฐ ์•ˆ์ •์ ์ธ response๋ฅผ ์œ„ํ•ด Zing GC ํ…Œ์ŠคํŠธ ํ•ด๋ณด๊ธฐ๋กœ ๊ฒฐ์ •
๏ƒ  Cassandra + Zing ์กฐํ•ฉ์ด ๊ตญ๋‚ด ๋ ˆํผ๋Ÿฐ์Šค๊ฐ€ ์—†์—ˆ๊ธฐ ๋•Œ๋ฌธ์— ๋‚ด๋ถ€์—์„œ ์ž์ฒด ํ…Œ์ŠคํŠธ๋ฅผ ์ง„ํ–‰
3. Trouble Shooting & Optimization
ZING GC
G1GC
HotSpot JVM vs Zing JVM ๋น„๊ต
3. Trouble Shooting & Optimization
ZING GC
G1GC
Zing GC(์œ„) vs G1GC (์•„๋ž˜)
- ์กฐ๊ธˆ ๋” ์•ˆ์ •์ ์œผ๋กœ ์‚ฌ์šฉํ•˜๊ธฐ ์œ„ํ•ด Zing ๋„์ž…
- Zing์˜ ๊ฒฝ์šฐ ๊พธ์ค€ํ•œ ํ•ญ์ƒ ์ผ์ •ํ•œ GC time ์œ ์ง€
- STW๋ฅผ ์‹ ๊ฒฝ์“ฐ์ง€ ์•Š๊ณ  ์•ˆ์ •์ ์ธ Response ๊ธฐ๋Œ€
- ํŠธ๋ž˜ํ”ฝ์ด ๋Š˜์–ด๋‚˜๊ณ , ๋” ํ—ค๋น„ ํ•˜๊ฒŒ ์‚ฌ์šฉํ•  ๊ฒฝ์šฐ
๋” ๊ทน์ ์ธ ํšจ๊ณผ ๊ธฐ๋Œ€
๊ฐ€์žฅ ๋งˆ์Œ์— ๋“ค์—ˆ๋˜ ๊ฒƒ์€ GC ํŠœ๋‹์ด ํ•„์š”๊ฐ€ ์—†์Œ!
Ex ) โ€“Xmx60g๋งŒ ์ฃผ๊ณ  ์‚ฌ์šฉ
3. Trouble Shooting & Optimization
Zing GC
G1GC
Prometheus + Grafana ์กฐํ•ฉ์œผ๋กœ ๋ชจ๋‹ˆํ„ฐ๋ง ์‹œ๊ฐํ™”
3. Trouble Shooting & Optimization
- CPU ์‚ฌ์šฉ๋Ÿ‰ 100% -> 20%๋กœ ๊ฐ์†Œํ•จ์œผ๋กœ์จ ์šด์˜ ์•ˆ์ •์„ฑ ํ™•๋ณด
- ์ฒ˜๋ฆฌ์œจ ์ฆ๊ฐ€๋กœ ์ธํ•ด ๋ฐฐ์น˜ ์†๋„ 2H -> 15M ์œผ๋กœ ๊ฐ์†Œ
- Network & Disk I/O ๋งŒ ๊ณ ๋ คํ•˜๋ฉด์„œ ๋‹ค์ค‘ ๋ฐฐ์น˜ ์ˆ˜ํ–‰ ๊ฐ€๋Šฅ
- Zing GC ๋„์ž… ํ›„ ์•ˆ์ •์ ์ธ Response ๋ฐ ๋ฉ”๋ชจ๋ฆฌ ๊ด€๋ฆฌ
- Full Data๊ฐ€ ์•„๋‹Œ Key Cache๋งŒ ๋ฉ”๋ชจ๋ฆฌ์— ์˜ฌ๋ ค๋†“์Œ์œผ๋กœ์จ ํšจ์œจ์  ์šด์˜
- ์‹ค์‹œ๊ฐ„/๋Œ€์šฉ๋Ÿ‰ ๋ฐฐ์น˜๋ฅผ ๋ถ„๋ฆฌ ์šด์˜ํ•จ์œผ๋กœ์จ ์•ˆ์ •์„ฑ ํ™•๋ณด
3. Trouble Shooting & Optimization
Q & A

More Related Content

What's hot

๊ด€๊ณ„ํ˜• ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค์˜ ์ƒˆ๋กœ์šด ํŒจ๋Ÿฌ๋‹ค์ž„ Amazon Aurora :: ๊น€์ƒํ•„ :: AWS Summit Seoul 2016
๊ด€๊ณ„ํ˜• ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค์˜ ์ƒˆ๋กœ์šด ํŒจ๋Ÿฌ๋‹ค์ž„ Amazon Aurora :: ๊น€์ƒํ•„ :: AWS Summit Seoul 2016๊ด€๊ณ„ํ˜• ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค์˜ ์ƒˆ๋กœ์šด ํŒจ๋Ÿฌ๋‹ค์ž„ Amazon Aurora :: ๊น€์ƒํ•„ :: AWS Summit Seoul 2016
๊ด€๊ณ„ํ˜• ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค์˜ ์ƒˆ๋กœ์šด ํŒจ๋Ÿฌ๋‹ค์ž„ Amazon Aurora :: ๊น€์ƒํ•„ :: AWS Summit Seoul 2016Amazon Web Services Korea
ย 
Hanghae99 FinalProject Moyeora!
Hanghae99 FinalProject Moyeora!Hanghae99 FinalProject Moyeora!
Hanghae99 FinalProject Moyeora!Young Woo Lee
ย 
AWS ํ™œ์šฉํ•œ Data Lake ๊ตฌ์„ฑํ•˜๊ธฐ
AWS ํ™œ์šฉํ•œ Data Lake ๊ตฌ์„ฑํ•˜๊ธฐAWS ํ™œ์šฉํ•œ Data Lake ๊ตฌ์„ฑํ•˜๊ธฐ
AWS ํ™œ์šฉํ•œ Data Lake ๊ตฌ์„ฑํ•˜๊ธฐNak Joo Kwon
ย 
Accelerate spring boot application with apache ignite
Accelerate spring boot application with apache igniteAccelerate spring boot application with apache ignite
Accelerate spring boot application with apache igniteYEON BOK LEE
ย 
Amazon Aurora 100% ํ™œ์šฉํ•˜๊ธฐ
Amazon Aurora 100% ํ™œ์šฉํ•˜๊ธฐAmazon Aurora 100% ํ™œ์šฉํ•˜๊ธฐ
Amazon Aurora 100% ํ™œ์šฉํ•˜๊ธฐAmazon Web Services Korea
ย 
Azure Databases for PostgreSQL MYSQL and MariaDB
Azure Databases for PostgreSQL MYSQL and MariaDBAzure Databases for PostgreSQL MYSQL and MariaDB
Azure Databases for PostgreSQL MYSQL and MariaDBrockplace
ย 
[Azure study group] azure์˜ ๋ถ€ํ•˜๋ถ„์‚ฐ
[Azure study group] azure์˜ ๋ถ€ํ•˜๋ถ„์‚ฐ[Azure study group] azure์˜ ๋ถ€ํ•˜๋ถ„์‚ฐ
[Azure study group] azure์˜ ๋ถ€ํ•˜๋ถ„์‚ฐ์„ธ์ค€ ๊น€
ย 
แ„‡แ…ฎแ†ซแ„‰แ…ฅแ†จแ„€แ…กแ„…แ…ณแ†ฏ แ„‹แ…ฑแ„’แ…กแ†ซ Aws แ„€แ…ตแ„‡แ…กแ†ซแ„‹แ…ด digital แ„‘แ…ณแ†ฏแ„…แ…ขแ†บแ„‘แ…ฉแ†ท แ„€แ…ฎแ„Žแ…ฎแ†จ
แ„‡แ…ฎแ†ซแ„‰แ…ฅแ†จแ„€แ…กแ„…แ…ณแ†ฏ แ„‹แ…ฑแ„’แ…กแ†ซ Aws แ„€แ…ตแ„‡แ…กแ†ซแ„‹แ…ด digital แ„‘แ…ณแ†ฏแ„…แ…ขแ†บแ„‘แ…ฉแ†ท แ„€แ…ฎแ„Žแ…ฎแ†จแ„‡แ…ฎแ†ซแ„‰แ…ฅแ†จแ„€แ…กแ„…แ…ณแ†ฏ แ„‹แ…ฑแ„’แ…กแ†ซ Aws แ„€แ…ตแ„‡แ…กแ†ซแ„‹แ…ด digital แ„‘แ…ณแ†ฏแ„…แ…ขแ†บแ„‘แ…ฉแ†ท แ„€แ…ฎแ„Žแ…ฎแ†จ
แ„‡แ…ฎแ†ซแ„‰แ…ฅแ†จแ„€แ…กแ„…แ…ณแ†ฏ แ„‹แ…ฑแ„’แ…กแ†ซ Aws แ„€แ…ตแ„‡แ…กแ†ซแ„‹แ…ด digital แ„‘แ…ณแ†ฏแ„…แ…ขแ†บแ„‘แ…ฉแ†ท แ„€แ…ฎแ„Žแ…ฎแ†จNak Joo Kwon
ย 
[AWSKRUG&JAWS-UG Meetup #1] ํƒœ์–‘๊ด‘๋ฐœ์ „์†Œ ์›๊ฒฉ ๊ฐ์‹œ ์‹œ์Šคํ…œ์˜ ๋Œ€๋Ÿ‰๋ฐ์ดํ„ฐ ํ•ด์„ใ€ๆ ชๅผไผš็คพfusicใ€‘
[AWSKRUG&JAWS-UG Meetup #1] ํƒœ์–‘๊ด‘๋ฐœ์ „์†Œ ์›๊ฒฉ ๊ฐ์‹œ ์‹œ์Šคํ…œ์˜  ๋Œ€๋Ÿ‰๋ฐ์ดํ„ฐ ํ•ด์„ใ€ๆ ชๅผไผš็คพfusicใ€‘[AWSKRUG&JAWS-UG Meetup #1] ํƒœ์–‘๊ด‘๋ฐœ์ „์†Œ ์›๊ฒฉ ๊ฐ์‹œ ์‹œ์Šคํ…œ์˜  ๋Œ€๋Ÿ‰๋ฐ์ดํ„ฐ ํ•ด์„ใ€ๆ ชๅผไผš็คพfusicใ€‘
[AWSKRUG&JAWS-UG Meetup #1] ํƒœ์–‘๊ด‘๋ฐœ์ „์†Œ ์›๊ฒฉ ๊ฐ์‹œ ์‹œ์Šคํ…œ์˜ ๋Œ€๋Ÿ‰๋ฐ์ดํ„ฐ ํ•ด์„ใ€ๆ ชๅผไผš็คพfusicใ€‘AWSKRUG - AWSํ•œ๊ตญ์‚ฌ์šฉ์ž๋ชจ์ž„
ย 
Scalable web architecture and distributed systems
Scalable web architecture and distributed systemsScalable web architecture and distributed systems
Scalable web architecture and distributed systemsํ˜„์ข… ๊น€
ย 
๋„ฅ์Šจ ๊ธ€๋กœ๋ฒŒ ํ”Œ๋žซํผ ๊ตฌ์ถ• ์ด์•ผ๊ธฐ : DB Migration case study (์ž„ํ˜„์ˆ˜ ํ”Œ๋žซํผ์ธํ”„๋ผ์‹ค Technical Manager, ๋„ฅ...
๋„ฅ์Šจ ๊ธ€๋กœ๋ฒŒ ํ”Œ๋žซํผ ๊ตฌ์ถ• ์ด์•ผ๊ธฐ : DB Migration case study (์ž„ํ˜„์ˆ˜ ํ”Œ๋žซํผ์ธํ”„๋ผ์‹ค Technical Manager, ๋„ฅ...๋„ฅ์Šจ ๊ธ€๋กœ๋ฒŒ ํ”Œ๋žซํผ ๊ตฌ์ถ• ์ด์•ผ๊ธฐ : DB Migration case study (์ž„ํ˜„์ˆ˜ ํ”Œ๋žซํผ์ธํ”„๋ผ์‹ค Technical Manager, ๋„ฅ...
๋„ฅ์Šจ ๊ธ€๋กœ๋ฒŒ ํ”Œ๋žซํผ ๊ตฌ์ถ• ์ด์•ผ๊ธฐ : DB Migration case study (์ž„ํ˜„์ˆ˜ ํ”Œ๋žซํผ์ธํ”„๋ผ์‹ค Technical Manager, ๋„ฅ...Amazon Web Services Korea
ย 
AWS 9์›” ์›จ๋น„๋‚˜ | Amazon Aurora Deep Dive
AWS 9์›” ์›จ๋น„๋‚˜ | Amazon Aurora Deep DiveAWS 9์›” ์›จ๋น„๋‚˜ | Amazon Aurora Deep Dive
AWS 9์›” ์›จ๋น„๋‚˜ | Amazon Aurora Deep DiveAmazon Web Services Korea
ย 
[์šฐ๋ฆฌ๊ฐ€ ๋ฐ์ดํ„ฐ๋ฅผ ์“ฐ๋Š” ๋ฒ•] ๋ชจ๋ฐ”์ผ ๊ฒŒ์ž„ ๋กœ๊ทธ ๋ฐ์ดํ„ฐ ๋ถ„์„ ์ด์•ผ๊ธฐ - ์—”ํ„ฐ๋ฉ”์ดํŠธ ๊ณต์‹ ๋ฐฐ ํŒ€์žฅ
[์šฐ๋ฆฌ๊ฐ€ ๋ฐ์ดํ„ฐ๋ฅผ ์“ฐ๋Š” ๋ฒ•] ๋ชจ๋ฐ”์ผ ๊ฒŒ์ž„ ๋กœ๊ทธ ๋ฐ์ดํ„ฐ ๋ถ„์„ ์ด์•ผ๊ธฐ - ์—”ํ„ฐ๋ฉ”์ดํŠธ ๊ณต์‹ ๋ฐฐ ํŒ€์žฅ[์šฐ๋ฆฌ๊ฐ€ ๋ฐ์ดํ„ฐ๋ฅผ ์“ฐ๋Š” ๋ฒ•] ๋ชจ๋ฐ”์ผ ๊ฒŒ์ž„ ๋กœ๊ทธ ๋ฐ์ดํ„ฐ ๋ถ„์„ ์ด์•ผ๊ธฐ - ์—”ํ„ฐ๋ฉ”์ดํŠธ ๊ณต์‹ ๋ฐฐ ํŒ€์žฅ
[์šฐ๋ฆฌ๊ฐ€ ๋ฐ์ดํ„ฐ๋ฅผ ์“ฐ๋Š” ๋ฒ•] ๋ชจ๋ฐ”์ผ ๊ฒŒ์ž„ ๋กœ๊ทธ ๋ฐ์ดํ„ฐ ๋ถ„์„ ์ด์•ผ๊ธฐ - ์—”ํ„ฐ๋ฉ”์ดํŠธ ๊ณต์‹ ๋ฐฐ ํŒ€์žฅDylan Ko
ย 
Webservice cache strategy
Webservice cache strategyWebservice cache strategy
Webservice cache strategyDaeMyung Kang
ย 
[Gaming on AWS] AWS์™€ ํ•จ๊ป˜ ํ•œ ์ฟ ํ‚ค๋Ÿฐ ์„œ๋ฒ„ Re-architecting ์‚ฌ๋ก€ - ๋ฐ๋ธŒ์‹œ์Šคํ„ฐ์ฆˆ
[Gaming on AWS] AWS์™€ ํ•จ๊ป˜ ํ•œ ์ฟ ํ‚ค๋Ÿฐ ์„œ๋ฒ„ Re-architecting ์‚ฌ๋ก€ - ๋ฐ๋ธŒ์‹œ์Šคํ„ฐ์ฆˆ[Gaming on AWS] AWS์™€ ํ•จ๊ป˜ ํ•œ ์ฟ ํ‚ค๋Ÿฐ ์„œ๋ฒ„ Re-architecting ์‚ฌ๋ก€ - ๋ฐ๋ธŒ์‹œ์Šคํ„ฐ์ฆˆ
[Gaming on AWS] AWS์™€ ํ•จ๊ป˜ ํ•œ ์ฟ ํ‚ค๋Ÿฐ ์„œ๋ฒ„ Re-architecting ์‚ฌ๋ก€ - ๋ฐ๋ธŒ์‹œ์Šคํ„ฐ์ฆˆAmazon Web Services Korea
ย 
20150125 AWS BlackBelt - Amazon RDS (Korean)
20150125 AWS BlackBelt - Amazon RDS (Korean)20150125 AWS BlackBelt - Amazon RDS (Korean)
20150125 AWS BlackBelt - Amazon RDS (Korean)Amazon Web Services Korea
ย 
AWS์— ๋Œ€ํ•ด ๊ฐ€์žฅ ๊ถ๊ธˆํ–ˆ๋˜ ์—ด ๊ฐ€์ง€ (์ •์šฐ๊ทผ) - AWS ์›จ๋น„๋‚˜ ์‹œ๋ฆฌ์ฆˆ
AWS์— ๋Œ€ํ•ด ๊ฐ€์žฅ ๊ถ๊ธˆํ–ˆ๋˜ ์—ด ๊ฐ€์ง€ (์ •์šฐ๊ทผ) - AWS ์›จ๋น„๋‚˜ ์‹œ๋ฆฌ์ฆˆAWS์— ๋Œ€ํ•ด ๊ฐ€์žฅ ๊ถ๊ธˆํ–ˆ๋˜ ์—ด ๊ฐ€์ง€ (์ •์šฐ๊ทผ) - AWS ์›จ๋น„๋‚˜ ์‹œ๋ฆฌ์ฆˆ
AWS์— ๋Œ€ํ•ด ๊ฐ€์žฅ ๊ถ๊ธˆํ–ˆ๋˜ ์—ด ๊ฐ€์ง€ (์ •์šฐ๊ทผ) - AWS ์›จ๋น„๋‚˜ ์‹œ๋ฆฌ์ฆˆAmazon Web Services Korea
ย 
2017 ์‹ ๋ผ๋Œ€ํ•™๊ต azure์—์„œ iaas ํ™œ์šฉํ•˜๊ธฐ
2017 ์‹ ๋ผ๋Œ€ํ•™๊ต azure์—์„œ iaas ํ™œ์šฉํ•˜๊ธฐ2017 ์‹ ๋ผ๋Œ€ํ•™๊ต azure์—์„œ iaas ํ™œ์šฉํ•˜๊ธฐ
2017 ์‹ ๋ผ๋Œ€ํ•™๊ต azure์—์„œ iaas ํ™œ์šฉํ•˜๊ธฐ์„ธ์ค€ ๊น€
ย 

What's hot (20)

๊ด€๊ณ„ํ˜• ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค์˜ ์ƒˆ๋กœ์šด ํŒจ๋Ÿฌ๋‹ค์ž„ Amazon Aurora :: ๊น€์ƒํ•„ :: AWS Summit Seoul 2016
๊ด€๊ณ„ํ˜• ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค์˜ ์ƒˆ๋กœ์šด ํŒจ๋Ÿฌ๋‹ค์ž„ Amazon Aurora :: ๊น€์ƒํ•„ :: AWS Summit Seoul 2016๊ด€๊ณ„ํ˜• ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค์˜ ์ƒˆ๋กœ์šด ํŒจ๋Ÿฌ๋‹ค์ž„ Amazon Aurora :: ๊น€์ƒํ•„ :: AWS Summit Seoul 2016
๊ด€๊ณ„ํ˜• ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค์˜ ์ƒˆ๋กœ์šด ํŒจ๋Ÿฌ๋‹ค์ž„ Amazon Aurora :: ๊น€์ƒํ•„ :: AWS Summit Seoul 2016
ย 
Hanghae99 FinalProject Moyeora!
Hanghae99 FinalProject Moyeora!Hanghae99 FinalProject Moyeora!
Hanghae99 FinalProject Moyeora!
ย 
AWS ํ™œ์šฉํ•œ Data Lake ๊ตฌ์„ฑํ•˜๊ธฐ
AWS ํ™œ์šฉํ•œ Data Lake ๊ตฌ์„ฑํ•˜๊ธฐAWS ํ™œ์šฉํ•œ Data Lake ๊ตฌ์„ฑํ•˜๊ธฐ
AWS ํ™œ์šฉํ•œ Data Lake ๊ตฌ์„ฑํ•˜๊ธฐ
ย 
1711 azure-live
1711 azure-live1711 azure-live
1711 azure-live
ย 
Accelerate spring boot application with apache ignite
Accelerate spring boot application with apache igniteAccelerate spring boot application with apache ignite
Accelerate spring boot application with apache ignite
ย 
Amazon Aurora 100% ํ™œ์šฉํ•˜๊ธฐ
Amazon Aurora 100% ํ™œ์šฉํ•˜๊ธฐAmazon Aurora 100% ํ™œ์šฉํ•˜๊ธฐ
Amazon Aurora 100% ํ™œ์šฉํ•˜๊ธฐ
ย 
Azure Databases for PostgreSQL MYSQL and MariaDB
Azure Databases for PostgreSQL MYSQL and MariaDBAzure Databases for PostgreSQL MYSQL and MariaDB
Azure Databases for PostgreSQL MYSQL and MariaDB
ย 
[Azure study group] azure์˜ ๋ถ€ํ•˜๋ถ„์‚ฐ
[Azure study group] azure์˜ ๋ถ€ํ•˜๋ถ„์‚ฐ[Azure study group] azure์˜ ๋ถ€ํ•˜๋ถ„์‚ฐ
[Azure study group] azure์˜ ๋ถ€ํ•˜๋ถ„์‚ฐ
ย 
แ„‡แ…ฎแ†ซแ„‰แ…ฅแ†จแ„€แ…กแ„…แ…ณแ†ฏ แ„‹แ…ฑแ„’แ…กแ†ซ Aws แ„€แ…ตแ„‡แ…กแ†ซแ„‹แ…ด digital แ„‘แ…ณแ†ฏแ„…แ…ขแ†บแ„‘แ…ฉแ†ท แ„€แ…ฎแ„Žแ…ฎแ†จ
แ„‡แ…ฎแ†ซแ„‰แ…ฅแ†จแ„€แ…กแ„…แ…ณแ†ฏ แ„‹แ…ฑแ„’แ…กแ†ซ Aws แ„€แ…ตแ„‡แ…กแ†ซแ„‹แ…ด digital แ„‘แ…ณแ†ฏแ„…แ…ขแ†บแ„‘แ…ฉแ†ท แ„€แ…ฎแ„Žแ…ฎแ†จแ„‡แ…ฎแ†ซแ„‰แ…ฅแ†จแ„€แ…กแ„…แ…ณแ†ฏ แ„‹แ…ฑแ„’แ…กแ†ซ Aws แ„€แ…ตแ„‡แ…กแ†ซแ„‹แ…ด digital แ„‘แ…ณแ†ฏแ„…แ…ขแ†บแ„‘แ…ฉแ†ท แ„€แ…ฎแ„Žแ…ฎแ†จ
แ„‡แ…ฎแ†ซแ„‰แ…ฅแ†จแ„€แ…กแ„…แ…ณแ†ฏ แ„‹แ…ฑแ„’แ…กแ†ซ Aws แ„€แ…ตแ„‡แ…กแ†ซแ„‹แ…ด digital แ„‘แ…ณแ†ฏแ„…แ…ขแ†บแ„‘แ…ฉแ†ท แ„€แ…ฎแ„Žแ…ฎแ†จ
ย 
[AWSKRUG&JAWS-UG Meetup #1] ํƒœ์–‘๊ด‘๋ฐœ์ „์†Œ ์›๊ฒฉ ๊ฐ์‹œ ์‹œ์Šคํ…œ์˜ ๋Œ€๋Ÿ‰๋ฐ์ดํ„ฐ ํ•ด์„ใ€ๆ ชๅผไผš็คพfusicใ€‘
[AWSKRUG&JAWS-UG Meetup #1] ํƒœ์–‘๊ด‘๋ฐœ์ „์†Œ ์›๊ฒฉ ๊ฐ์‹œ ์‹œ์Šคํ…œ์˜  ๋Œ€๋Ÿ‰๋ฐ์ดํ„ฐ ํ•ด์„ใ€ๆ ชๅผไผš็คพfusicใ€‘[AWSKRUG&JAWS-UG Meetup #1] ํƒœ์–‘๊ด‘๋ฐœ์ „์†Œ ์›๊ฒฉ ๊ฐ์‹œ ์‹œ์Šคํ…œ์˜  ๋Œ€๋Ÿ‰๋ฐ์ดํ„ฐ ํ•ด์„ใ€ๆ ชๅผไผš็คพfusicใ€‘
[AWSKRUG&JAWS-UG Meetup #1] ํƒœ์–‘๊ด‘๋ฐœ์ „์†Œ ์›๊ฒฉ ๊ฐ์‹œ ์‹œ์Šคํ…œ์˜ ๋Œ€๋Ÿ‰๋ฐ์ดํ„ฐ ํ•ด์„ใ€ๆ ชๅผไผš็คพfusicใ€‘
ย 
Scalable web architecture and distributed systems
Scalable web architecture and distributed systemsScalable web architecture and distributed systems
Scalable web architecture and distributed systems
ย 
๋„ฅ์Šจ ๊ธ€๋กœ๋ฒŒ ํ”Œ๋žซํผ ๊ตฌ์ถ• ์ด์•ผ๊ธฐ : DB Migration case study (์ž„ํ˜„์ˆ˜ ํ”Œ๋žซํผ์ธํ”„๋ผ์‹ค Technical Manager, ๋„ฅ...
๋„ฅ์Šจ ๊ธ€๋กœ๋ฒŒ ํ”Œ๋žซํผ ๊ตฌ์ถ• ์ด์•ผ๊ธฐ : DB Migration case study (์ž„ํ˜„์ˆ˜ ํ”Œ๋žซํผ์ธํ”„๋ผ์‹ค Technical Manager, ๋„ฅ...๋„ฅ์Šจ ๊ธ€๋กœ๋ฒŒ ํ”Œ๋žซํผ ๊ตฌ์ถ• ์ด์•ผ๊ธฐ : DB Migration case study (์ž„ํ˜„์ˆ˜ ํ”Œ๋žซํผ์ธํ”„๋ผ์‹ค Technical Manager, ๋„ฅ...
๋„ฅ์Šจ ๊ธ€๋กœ๋ฒŒ ํ”Œ๋žซํผ ๊ตฌ์ถ• ์ด์•ผ๊ธฐ : DB Migration case study (์ž„ํ˜„์ˆ˜ ํ”Œ๋žซํผ์ธํ”„๋ผ์‹ค Technical Manager, ๋„ฅ...
ย 
AWS 9์›” ์›จ๋น„๋‚˜ | Amazon Aurora Deep Dive
AWS 9์›” ์›จ๋น„๋‚˜ | Amazon Aurora Deep DiveAWS 9์›” ์›จ๋น„๋‚˜ | Amazon Aurora Deep Dive
AWS 9์›” ์›จ๋น„๋‚˜ | Amazon Aurora Deep Dive
ย 
[์šฐ๋ฆฌ๊ฐ€ ๋ฐ์ดํ„ฐ๋ฅผ ์“ฐ๋Š” ๋ฒ•] ๋ชจ๋ฐ”์ผ ๊ฒŒ์ž„ ๋กœ๊ทธ ๋ฐ์ดํ„ฐ ๋ถ„์„ ์ด์•ผ๊ธฐ - ์—”ํ„ฐ๋ฉ”์ดํŠธ ๊ณต์‹ ๋ฐฐ ํŒ€์žฅ
[์šฐ๋ฆฌ๊ฐ€ ๋ฐ์ดํ„ฐ๋ฅผ ์“ฐ๋Š” ๋ฒ•] ๋ชจ๋ฐ”์ผ ๊ฒŒ์ž„ ๋กœ๊ทธ ๋ฐ์ดํ„ฐ ๋ถ„์„ ์ด์•ผ๊ธฐ - ์—”ํ„ฐ๋ฉ”์ดํŠธ ๊ณต์‹ ๋ฐฐ ํŒ€์žฅ[์šฐ๋ฆฌ๊ฐ€ ๋ฐ์ดํ„ฐ๋ฅผ ์“ฐ๋Š” ๋ฒ•] ๋ชจ๋ฐ”์ผ ๊ฒŒ์ž„ ๋กœ๊ทธ ๋ฐ์ดํ„ฐ ๋ถ„์„ ์ด์•ผ๊ธฐ - ์—”ํ„ฐ๋ฉ”์ดํŠธ ๊ณต์‹ ๋ฐฐ ํŒ€์žฅ
[์šฐ๋ฆฌ๊ฐ€ ๋ฐ์ดํ„ฐ๋ฅผ ์“ฐ๋Š” ๋ฒ•] ๋ชจ๋ฐ”์ผ ๊ฒŒ์ž„ ๋กœ๊ทธ ๋ฐ์ดํ„ฐ ๋ถ„์„ ์ด์•ผ๊ธฐ - ์—”ํ„ฐ๋ฉ”์ดํŠธ ๊ณต์‹ ๋ฐฐ ํŒ€์žฅ
ย 
1611 azure-live-์„ธ์…˜-2
1611 azure-live-์„ธ์…˜-21611 azure-live-์„ธ์…˜-2
1611 azure-live-์„ธ์…˜-2
ย 
Webservice cache strategy
Webservice cache strategyWebservice cache strategy
Webservice cache strategy
ย 
[Gaming on AWS] AWS์™€ ํ•จ๊ป˜ ํ•œ ์ฟ ํ‚ค๋Ÿฐ ์„œ๋ฒ„ Re-architecting ์‚ฌ๋ก€ - ๋ฐ๋ธŒ์‹œ์Šคํ„ฐ์ฆˆ
[Gaming on AWS] AWS์™€ ํ•จ๊ป˜ ํ•œ ์ฟ ํ‚ค๋Ÿฐ ์„œ๋ฒ„ Re-architecting ์‚ฌ๋ก€ - ๋ฐ๋ธŒ์‹œ์Šคํ„ฐ์ฆˆ[Gaming on AWS] AWS์™€ ํ•จ๊ป˜ ํ•œ ์ฟ ํ‚ค๋Ÿฐ ์„œ๋ฒ„ Re-architecting ์‚ฌ๋ก€ - ๋ฐ๋ธŒ์‹œ์Šคํ„ฐ์ฆˆ
[Gaming on AWS] AWS์™€ ํ•จ๊ป˜ ํ•œ ์ฟ ํ‚ค๋Ÿฐ ์„œ๋ฒ„ Re-architecting ์‚ฌ๋ก€ - ๋ฐ๋ธŒ์‹œ์Šคํ„ฐ์ฆˆ
ย 
20150125 AWS BlackBelt - Amazon RDS (Korean)
20150125 AWS BlackBelt - Amazon RDS (Korean)20150125 AWS BlackBelt - Amazon RDS (Korean)
20150125 AWS BlackBelt - Amazon RDS (Korean)
ย 
AWS์— ๋Œ€ํ•ด ๊ฐ€์žฅ ๊ถ๊ธˆํ–ˆ๋˜ ์—ด ๊ฐ€์ง€ (์ •์šฐ๊ทผ) - AWS ์›จ๋น„๋‚˜ ์‹œ๋ฆฌ์ฆˆ
AWS์— ๋Œ€ํ•ด ๊ฐ€์žฅ ๊ถ๊ธˆํ–ˆ๋˜ ์—ด ๊ฐ€์ง€ (์ •์šฐ๊ทผ) - AWS ์›จ๋น„๋‚˜ ์‹œ๋ฆฌ์ฆˆAWS์— ๋Œ€ํ•ด ๊ฐ€์žฅ ๊ถ๊ธˆํ–ˆ๋˜ ์—ด ๊ฐ€์ง€ (์ •์šฐ๊ทผ) - AWS ์›จ๋น„๋‚˜ ์‹œ๋ฆฌ์ฆˆ
AWS์— ๋Œ€ํ•ด ๊ฐ€์žฅ ๊ถ๊ธˆํ–ˆ๋˜ ์—ด ๊ฐ€์ง€ (์ •์šฐ๊ทผ) - AWS ์›จ๋น„๋‚˜ ์‹œ๋ฆฌ์ฆˆ
ย 
2017 ์‹ ๋ผ๋Œ€ํ•™๊ต azure์—์„œ iaas ํ™œ์šฉํ•˜๊ธฐ
2017 ์‹ ๋ผ๋Œ€ํ•™๊ต azure์—์„œ iaas ํ™œ์šฉํ•˜๊ธฐ2017 ์‹ ๋ผ๋Œ€ํ•™๊ต azure์—์„œ iaas ํ™œ์šฉํ•˜๊ธฐ
2017 ์‹ ๋ผ๋Œ€ํ•™๊ต azure์—์„œ iaas ํ™œ์šฉํ•˜๊ธฐ
ย 

Similar to Spark+Cassandra Data pipeline optimazation at recommend system for recommend system

๋น…๋ฐ์ดํ„ฐ ๊ธฐ์ˆ  ํ˜„ํ™ฉ๊ณผ ์‹œ์žฅ ์ „๋ง(2014)
๋น…๋ฐ์ดํ„ฐ ๊ธฐ์ˆ  ํ˜„ํ™ฉ๊ณผ ์‹œ์žฅ ์ „๋ง(2014)๋น…๋ฐ์ดํ„ฐ ๊ธฐ์ˆ  ํ˜„ํ™ฉ๊ณผ ์‹œ์žฅ ์ „๋ง(2014)
๋น…๋ฐ์ดํ„ฐ ๊ธฐ์ˆ  ํ˜„ํ™ฉ๊ณผ ์‹œ์žฅ ์ „๋ง(2014)Channy Yun
ย 
[2015 07-06-์œค์„์ค€] Oracle ์„ฑ๋Šฅ ์ตœ์ ํ™” ๋ฐ ํ’ˆ์งˆ ๊ณ ๋„ํ™” 4
[2015 07-06-์œค์„์ค€] Oracle ์„ฑ๋Šฅ ์ตœ์ ํ™” ๋ฐ ํ’ˆ์งˆ ๊ณ ๋„ํ™” 4[2015 07-06-์œค์„์ค€] Oracle ์„ฑ๋Šฅ ์ตœ์ ํ™” ๋ฐ ํ’ˆ์งˆ ๊ณ ๋„ํ™” 4
[2015 07-06-์œค์„์ค€] Oracle ์„ฑ๋Šฅ ์ตœ์ ํ™” ๋ฐ ํ’ˆ์งˆ ๊ณ ๋„ํ™” 4Seok-joon Yun
ย 
Spark streaming tutorial
Spark streaming tutorialSpark streaming tutorial
Spark streaming tutorialMinho Kim
ย 
Apache spark ์†Œ๊ฐœ ๋ฐ ์‹ค์Šต
Apache spark ์†Œ๊ฐœ ๋ฐ ์‹ค์ŠตApache spark ์†Œ๊ฐœ ๋ฐ ์‹ค์Šต
Apache spark ์†Œ๊ฐœ ๋ฐ ์‹ค์Šต๋™ํ˜„ ๊ฐ•
ย 
๋ฐ๋ธŒ์‹œ์Šคํ„ฐ์ฆˆ ๋ฐ์ดํ„ฐ ๋ ˆ์ดํฌ ๊ตฌ์ถ• ์ด์•ผ๊ธฐ : Data Lake architecture case study (๋ฐ•์ฃผํ™ ๋ฐ์ดํ„ฐ ๋ถ„์„ ๋ฐ ์ธํ”„๋ผ ํŒ€...
๋ฐ๋ธŒ์‹œ์Šคํ„ฐ์ฆˆ ๋ฐ์ดํ„ฐ ๋ ˆ์ดํฌ ๊ตฌ์ถ• ์ด์•ผ๊ธฐ : Data Lake architecture case study (๋ฐ•์ฃผํ™ ๋ฐ์ดํ„ฐ ๋ถ„์„ ๋ฐ ์ธํ”„๋ผ ํŒ€...๋ฐ๋ธŒ์‹œ์Šคํ„ฐ์ฆˆ ๋ฐ์ดํ„ฐ ๋ ˆ์ดํฌ ๊ตฌ์ถ• ์ด์•ผ๊ธฐ : Data Lake architecture case study (๋ฐ•์ฃผํ™ ๋ฐ์ดํ„ฐ ๋ถ„์„ ๋ฐ ์ธํ”„๋ผ ํŒ€...
๋ฐ๋ธŒ์‹œ์Šคํ„ฐ์ฆˆ ๋ฐ์ดํ„ฐ ๋ ˆ์ดํฌ ๊ตฌ์ถ• ์ด์•ผ๊ธฐ : Data Lake architecture case study (๋ฐ•์ฃผํ™ ๋ฐ์ดํ„ฐ ๋ถ„์„ ๋ฐ ์ธํ”„๋ผ ํŒ€...Amazon Web Services Korea
ย 
SQream DB, GPU-accelerated data warehouse
SQream DB, GPU-accelerated data warehouseSQream DB, GPU-accelerated data warehouse
SQream DB, GPU-accelerated data warehouseNAVER Engineering
ย 
MySQL_SQL_Tunning_v0.1.3.docx
MySQL_SQL_Tunning_v0.1.3.docxMySQL_SQL_Tunning_v0.1.3.docx
MySQL_SQL_Tunning_v0.1.3.docxNeoClova
ย 
AWS CLOUD 2017 - Amazon Redshift ๊ธฐ๋ฐ˜ DW ์™€ ๋น„์ง€๋‹ˆ์Šค ์ธํ…”๋ฆฌ์ „์Šค ๊ตฌํ˜„ ๋ฐฉ๋ฒ• (๊น€์ผํ˜ธ ์†”๋ฃจ์…˜์ฆˆ ์•„ํ‚คํ…ํŠธ)
AWS CLOUD 2017 - Amazon Redshift ๊ธฐ๋ฐ˜ DW ์™€ ๋น„์ง€๋‹ˆ์Šค ์ธํ…”๋ฆฌ์ „์Šค ๊ตฌํ˜„ ๋ฐฉ๋ฒ• (๊น€์ผํ˜ธ ์†”๋ฃจ์…˜์ฆˆ ์•„ํ‚คํ…ํŠธ)AWS CLOUD 2017 - Amazon Redshift ๊ธฐ๋ฐ˜ DW ์™€ ๋น„์ง€๋‹ˆ์Šค ์ธํ…”๋ฆฌ์ „์Šค ๊ตฌํ˜„ ๋ฐฉ๋ฒ• (๊น€์ผํ˜ธ ์†”๋ฃจ์…˜์ฆˆ ์•„ํ‚คํ…ํŠธ)
AWS CLOUD 2017 - Amazon Redshift ๊ธฐ๋ฐ˜ DW ์™€ ๋น„์ง€๋‹ˆ์Šค ์ธํ…”๋ฆฌ์ „์Šค ๊ตฌํ˜„ ๋ฐฉ๋ฒ• (๊น€์ผํ˜ธ ์†”๋ฃจ์…˜์ฆˆ ์•„ํ‚คํ…ํŠธ)Amazon Web Services Korea
ย 
S3 Select๋ฅผ ํ†ตํ•œ ๋น ๋ฅธ ๋ฐ์ดํ„ฐ ๋ถ„์„ํ•˜๊ธฐ - ํŠธ๋ž™2, Community Day 2018 re:Invent ํŠน์ง‘
S3 Select๋ฅผ ํ†ตํ•œ ๋น ๋ฅธ ๋ฐ์ดํ„ฐ ๋ถ„์„ํ•˜๊ธฐ - ํŠธ๋ž™2, Community Day 2018 re:Invent ํŠน์ง‘S3 Select๋ฅผ ํ†ตํ•œ ๋น ๋ฅธ ๋ฐ์ดํ„ฐ ๋ถ„์„ํ•˜๊ธฐ - ํŠธ๋ž™2, Community Day 2018 re:Invent ํŠน์ง‘
S3 Select๋ฅผ ํ†ตํ•œ ๋น ๋ฅธ ๋ฐ์ดํ„ฐ ๋ถ„์„ํ•˜๊ธฐ - ํŠธ๋ž™2, Community Day 2018 re:Invent ํŠน์ง‘AWSKRUG - AWSํ•œ๊ตญ์‚ฌ์šฉ์ž๋ชจ์ž„
ย 
๋ ˆ์ฝ”๋ฒจ์˜ ์ถ”์ฒœ ์„œ๋น„์Šค ๊ณ ๊ตฐ ๋ถ„ํˆฌ๊ธฐ - AWS Summit Seoul 2017
๋ ˆ์ฝ”๋ฒจ์˜ ์ถ”์ฒœ ์„œ๋น„์Šค ๊ณ ๊ตฐ ๋ถ„ํˆฌ๊ธฐ - AWS Summit Seoul 2017๋ ˆ์ฝ”๋ฒจ์˜ ์ถ”์ฒœ ์„œ๋น„์Šค ๊ณ ๊ตฐ ๋ถ„ํˆฌ๊ธฐ - AWS Summit Seoul 2017
๋ ˆ์ฝ”๋ฒจ์˜ ์ถ”์ฒœ ์„œ๋น„์Šค ๊ณ ๊ตฐ ๋ถ„ํˆฌ๊ธฐ - AWS Summit Seoul 2017Amazon Web Services Korea
ย 
Amazon Aurora ์‹ ๊ทœ ์„œ๋น„์Šค ์•Œ์•„๋ณด๊ธฐ::์ตœ์œ ์ •::AWS Summit Seoul 2018
Amazon Aurora ์‹ ๊ทœ ์„œ๋น„์Šค ์•Œ์•„๋ณด๊ธฐ::์ตœ์œ ์ •::AWS Summit Seoul 2018Amazon Aurora ์‹ ๊ทœ ์„œ๋น„์Šค ์•Œ์•„๋ณด๊ธฐ::์ตœ์œ ์ •::AWS Summit Seoul 2018
Amazon Aurora ์‹ ๊ทœ ์„œ๋น„์Šค ์•Œ์•„๋ณด๊ธฐ::์ตœ์œ ์ •::AWS Summit Seoul 2018Amazon Web Services Korea
ย 
AWS๊ธฐ๋ฐ˜ ์„œ๋ฒ„๋ฆฌ์Šค ๋ฐ์ดํ„ฐ๋ ˆ์ดํฌ ๊ตฌ์ถ•ํ•˜๊ธฐ - ๊น€์ง„์›… (SK C&C) :: AWS Community Day 2020
AWS๊ธฐ๋ฐ˜ ์„œ๋ฒ„๋ฆฌ์Šค ๋ฐ์ดํ„ฐ๋ ˆ์ดํฌ ๊ตฌ์ถ•ํ•˜๊ธฐ - ๊น€์ง„์›… (SK C&C) :: AWS Community Day 2020 AWS๊ธฐ๋ฐ˜ ์„œ๋ฒ„๋ฆฌ์Šค ๋ฐ์ดํ„ฐ๋ ˆ์ดํฌ ๊ตฌ์ถ•ํ•˜๊ธฐ - ๊น€์ง„์›… (SK C&C) :: AWS Community Day 2020
AWS๊ธฐ๋ฐ˜ ์„œ๋ฒ„๋ฆฌ์Šค ๋ฐ์ดํ„ฐ๋ ˆ์ดํฌ ๊ตฌ์ถ•ํ•˜๊ธฐ - ๊น€์ง„์›… (SK C&C) :: AWS Community Day 2020 AWSKRUG - AWSํ•œ๊ตญ์‚ฌ์šฉ์ž๋ชจ์ž„
ย 
AWS๊ธฐ๋ฐ˜ ์„œ๋ฒ„๋ฆฌ์Šค ๋ฐ์ดํ„ฐ๋ ˆ์ดํฌ ๊ตฌ์ถ•ํ•˜๊ธฐ - ๊น€์ง„์›… (SK C&C) :: AWS Community Day 2020
AWS๊ธฐ๋ฐ˜ ์„œ๋ฒ„๋ฆฌ์Šค ๋ฐ์ดํ„ฐ๋ ˆ์ดํฌ ๊ตฌ์ถ•ํ•˜๊ธฐ - ๊น€์ง„์›… (SK C&C) :: AWS Community Day 2020AWS๊ธฐ๋ฐ˜ ์„œ๋ฒ„๋ฆฌ์Šค ๋ฐ์ดํ„ฐ๋ ˆ์ดํฌ ๊ตฌ์ถ•ํ•˜๊ธฐ - ๊น€์ง„์›… (SK C&C) :: AWS Community Day 2020
AWS๊ธฐ๋ฐ˜ ์„œ๋ฒ„๋ฆฌ์Šค ๋ฐ์ดํ„ฐ๋ ˆ์ดํฌ ๊ตฌ์ถ•ํ•˜๊ธฐ - ๊น€์ง„์›… (SK C&C) :: AWS Community Day 2020Jinwoong Kim
ย 
AWS๋ฅผ ํ†ตํ•œ ๋น…๋ฐ์ดํ„ฐ ๊ธฐ๋ฐ˜ ๋น„์ง€๋‹ˆ์Šค ์ธํ…”๋ฆฌ์ „์Šค ๊ตฌ์ถ•- AWS Summit Seoul 2017
AWS๋ฅผ ํ†ตํ•œ ๋น…๋ฐ์ดํ„ฐ ๊ธฐ๋ฐ˜ ๋น„์ง€๋‹ˆ์Šค ์ธํ…”๋ฆฌ์ „์Šค ๊ตฌ์ถ•- AWS Summit Seoul 2017AWS๋ฅผ ํ†ตํ•œ ๋น…๋ฐ์ดํ„ฐ ๊ธฐ๋ฐ˜ ๋น„์ง€๋‹ˆ์Šค ์ธํ…”๋ฆฌ์ „์Šค ๊ตฌ์ถ•- AWS Summit Seoul 2017
AWS๋ฅผ ํ†ตํ•œ ๋น…๋ฐ์ดํ„ฐ ๊ธฐ๋ฐ˜ ๋น„์ง€๋‹ˆ์Šค ์ธํ…”๋ฆฌ์ „์Šค ๊ตฌ์ถ•- AWS Summit Seoul 2017Amazon Web Services Korea
ย 
Cloud dw benchmark using tpd-ds( Snowflake vs Redshift vs EMR Hive )
Cloud dw benchmark using tpd-ds( Snowflake vs Redshift vs EMR Hive )Cloud dw benchmark using tpd-ds( Snowflake vs Redshift vs EMR Hive )
Cloud dw benchmark using tpd-ds( Snowflake vs Redshift vs EMR Hive )SANG WON PARK
ย 
ํฌํ„ธ ๊ฒ€์ƒ‰์–ด ์ˆœ์œ„ ์ˆ˜์ง‘ ๋ฐ ๋ถ„์„ ํ›„๊ธฐ
ํฌํ„ธ ๊ฒ€์ƒ‰์–ด ์ˆœ์œ„ ์ˆ˜์ง‘ ๋ฐ ๋ถ„์„ ํ›„๊ธฐํฌํ„ธ ๊ฒ€์ƒ‰์–ด ์ˆœ์œ„ ์ˆ˜์ง‘ ๋ฐ ๋ถ„์„ ํ›„๊ธฐ
ํฌํ„ธ ๊ฒ€์ƒ‰์–ด ์ˆœ์œ„ ์ˆ˜์ง‘ ๋ฐ ๋ถ„์„ ํ›„๊ธฐKyuhyun Byun
ย 
SK Telecom - ๋ง๊ด€๋ฆฌ ํ”„๋กœ์ ํŠธ TANGO์˜ ์˜คํ”ˆ์†Œ์Šค ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค ์ „ํ™˜ ์—ฌ์ • - ๋ฐœํ‘œ์ž : ๋ฐ•์Šน์ „, Project Manager, ...
SK Telecom - ๋ง๊ด€๋ฆฌ ํ”„๋กœ์ ํŠธ TANGO์˜ ์˜คํ”ˆ์†Œ์Šค ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค ์ „ํ™˜ ์—ฌ์ • - ๋ฐœํ‘œ์ž : ๋ฐ•์Šน์ „, Project Manager, ...SK Telecom - ๋ง๊ด€๋ฆฌ ํ”„๋กœ์ ํŠธ TANGO์˜ ์˜คํ”ˆ์†Œ์Šค ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค ์ „ํ™˜ ์—ฌ์ • - ๋ฐœํ‘œ์ž : ๋ฐ•์Šน์ „, Project Manager, ...
SK Telecom - ๋ง๊ด€๋ฆฌ ํ”„๋กœ์ ํŠธ TANGO์˜ ์˜คํ”ˆ์†Œ์Šค ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค ์ „ํ™˜ ์—ฌ์ • - ๋ฐœํ‘œ์ž : ๋ฐ•์Šน์ „, Project Manager, ...Amazon Web Services Korea
ย 
๊ฒŒ์ž„ ๊ณ ๊ฐ ์‚ฌ๋ก€๋ฅผ ํ†ตํ•ด ์‚ดํŽด๋ณด๋Š” AWS ํ™œ์šฉ ์ „๋žต :: ์ด๊ฒฝ์•ˆ :: AWS Summit Seoul 2016
๊ฒŒ์ž„ ๊ณ ๊ฐ ์‚ฌ๋ก€๋ฅผ ํ†ตํ•ด ์‚ดํŽด๋ณด๋Š” AWS ํ™œ์šฉ ์ „๋žต :: ์ด๊ฒฝ์•ˆ :: AWS Summit Seoul 2016๊ฒŒ์ž„ ๊ณ ๊ฐ ์‚ฌ๋ก€๋ฅผ ํ†ตํ•ด ์‚ดํŽด๋ณด๋Š” AWS ํ™œ์šฉ ์ „๋žต :: ์ด๊ฒฝ์•ˆ :: AWS Summit Seoul 2016
๊ฒŒ์ž„ ๊ณ ๊ฐ ์‚ฌ๋ก€๋ฅผ ํ†ตํ•ด ์‚ดํŽด๋ณด๋Š” AWS ํ™œ์šฉ ์ „๋žต :: ์ด๊ฒฝ์•ˆ :: AWS Summit Seoul 2016Amazon Web Services Korea
ย 
์‚ฌ์—… ์‹ค์ 
์‚ฌ์—… ์‹ค์ ์‚ฌ์—… ์‹ค์ 
์‚ฌ์—… ์‹ค์ mobigen
ย 
๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค ์šด์˜, ์„œ๋ฒ„๋ฆฌ์Šค๋กœ ๊ฑฑ์ • ๋! - ์œค์„์ฐฌ, AWS ํ…Œํฌ์—๋ฐ˜์ ค๋ฆฌ์ŠคํŠธ - AWS Builders Online Series
๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค ์šด์˜, ์„œ๋ฒ„๋ฆฌ์Šค๋กœ ๊ฑฑ์ • ๋! - ์œค์„์ฐฌ, AWS ํ…Œํฌ์—๋ฐ˜์ ค๋ฆฌ์ŠคํŠธ - AWS Builders Online Series๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค ์šด์˜, ์„œ๋ฒ„๋ฆฌ์Šค๋กœ ๊ฑฑ์ • ๋! - ์œค์„์ฐฌ, AWS ํ…Œํฌ์—๋ฐ˜์ ค๋ฆฌ์ŠคํŠธ - AWS Builders Online Series
๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค ์šด์˜, ์„œ๋ฒ„๋ฆฌ์Šค๋กœ ๊ฑฑ์ • ๋! - ์œค์„์ฐฌ, AWS ํ…Œํฌ์—๋ฐ˜์ ค๋ฆฌ์ŠคํŠธ - AWS Builders Online SeriesAmazon Web Services Korea
ย 

Similar to Spark+Cassandra Data pipeline optimazation at recommend system for recommend system (20)

๋น…๋ฐ์ดํ„ฐ ๊ธฐ์ˆ  ํ˜„ํ™ฉ๊ณผ ์‹œ์žฅ ์ „๋ง(2014)
๋น…๋ฐ์ดํ„ฐ ๊ธฐ์ˆ  ํ˜„ํ™ฉ๊ณผ ์‹œ์žฅ ์ „๋ง(2014)๋น…๋ฐ์ดํ„ฐ ๊ธฐ์ˆ  ํ˜„ํ™ฉ๊ณผ ์‹œ์žฅ ์ „๋ง(2014)
๋น…๋ฐ์ดํ„ฐ ๊ธฐ์ˆ  ํ˜„ํ™ฉ๊ณผ ์‹œ์žฅ ์ „๋ง(2014)
ย 
[2015 07-06-์œค์„์ค€] Oracle ์„ฑ๋Šฅ ์ตœ์ ํ™” ๋ฐ ํ’ˆ์งˆ ๊ณ ๋„ํ™” 4
[2015 07-06-์œค์„์ค€] Oracle ์„ฑ๋Šฅ ์ตœ์ ํ™” ๋ฐ ํ’ˆ์งˆ ๊ณ ๋„ํ™” 4[2015 07-06-์œค์„์ค€] Oracle ์„ฑ๋Šฅ ์ตœ์ ํ™” ๋ฐ ํ’ˆ์งˆ ๊ณ ๋„ํ™” 4
[2015 07-06-์œค์„์ค€] Oracle ์„ฑ๋Šฅ ์ตœ์ ํ™” ๋ฐ ํ’ˆ์งˆ ๊ณ ๋„ํ™” 4
ย 
Spark streaming tutorial
Spark streaming tutorialSpark streaming tutorial
Spark streaming tutorial
ย 
Apache spark ์†Œ๊ฐœ ๋ฐ ์‹ค์Šต
Apache spark ์†Œ๊ฐœ ๋ฐ ์‹ค์ŠตApache spark ์†Œ๊ฐœ ๋ฐ ์‹ค์Šต
Apache spark ์†Œ๊ฐœ ๋ฐ ์‹ค์Šต
ย 
๋ฐ๋ธŒ์‹œ์Šคํ„ฐ์ฆˆ ๋ฐ์ดํ„ฐ ๋ ˆ์ดํฌ ๊ตฌ์ถ• ์ด์•ผ๊ธฐ : Data Lake architecture case study (๋ฐ•์ฃผํ™ ๋ฐ์ดํ„ฐ ๋ถ„์„ ๋ฐ ์ธํ”„๋ผ ํŒ€...
๋ฐ๋ธŒ์‹œ์Šคํ„ฐ์ฆˆ ๋ฐ์ดํ„ฐ ๋ ˆ์ดํฌ ๊ตฌ์ถ• ์ด์•ผ๊ธฐ : Data Lake architecture case study (๋ฐ•์ฃผํ™ ๋ฐ์ดํ„ฐ ๋ถ„์„ ๋ฐ ์ธํ”„๋ผ ํŒ€...๋ฐ๋ธŒ์‹œ์Šคํ„ฐ์ฆˆ ๋ฐ์ดํ„ฐ ๋ ˆ์ดํฌ ๊ตฌ์ถ• ์ด์•ผ๊ธฐ : Data Lake architecture case study (๋ฐ•์ฃผํ™ ๋ฐ์ดํ„ฐ ๋ถ„์„ ๋ฐ ์ธํ”„๋ผ ํŒ€...
๋ฐ๋ธŒ์‹œ์Šคํ„ฐ์ฆˆ ๋ฐ์ดํ„ฐ ๋ ˆ์ดํฌ ๊ตฌ์ถ• ์ด์•ผ๊ธฐ : Data Lake architecture case study (๋ฐ•์ฃผํ™ ๋ฐ์ดํ„ฐ ๋ถ„์„ ๋ฐ ์ธํ”„๋ผ ํŒ€...
ย 
SQream DB, GPU-accelerated data warehouse
SQream DB, GPU-accelerated data warehouseSQream DB, GPU-accelerated data warehouse
SQream DB, GPU-accelerated data warehouse
ย 
MySQL_SQL_Tunning_v0.1.3.docx
MySQL_SQL_Tunning_v0.1.3.docxMySQL_SQL_Tunning_v0.1.3.docx
MySQL_SQL_Tunning_v0.1.3.docx
ย 
AWS CLOUD 2017 - Amazon Redshift ๊ธฐ๋ฐ˜ DW ์™€ ๋น„์ง€๋‹ˆ์Šค ์ธํ…”๋ฆฌ์ „์Šค ๊ตฌํ˜„ ๋ฐฉ๋ฒ• (๊น€์ผํ˜ธ ์†”๋ฃจ์…˜์ฆˆ ์•„ํ‚คํ…ํŠธ)
AWS CLOUD 2017 - Amazon Redshift ๊ธฐ๋ฐ˜ DW ์™€ ๋น„์ง€๋‹ˆ์Šค ์ธํ…”๋ฆฌ์ „์Šค ๊ตฌํ˜„ ๋ฐฉ๋ฒ• (๊น€์ผํ˜ธ ์†”๋ฃจ์…˜์ฆˆ ์•„ํ‚คํ…ํŠธ)AWS CLOUD 2017 - Amazon Redshift ๊ธฐ๋ฐ˜ DW ์™€ ๋น„์ง€๋‹ˆ์Šค ์ธํ…”๋ฆฌ์ „์Šค ๊ตฌํ˜„ ๋ฐฉ๋ฒ• (๊น€์ผํ˜ธ ์†”๋ฃจ์…˜์ฆˆ ์•„ํ‚คํ…ํŠธ)
AWS CLOUD 2017 - Amazon Redshift ๊ธฐ๋ฐ˜ DW ์™€ ๋น„์ง€๋‹ˆ์Šค ์ธํ…”๋ฆฌ์ „์Šค ๊ตฌํ˜„ ๋ฐฉ๋ฒ• (๊น€์ผํ˜ธ ์†”๋ฃจ์…˜์ฆˆ ์•„ํ‚คํ…ํŠธ)
ย 
S3 Select๋ฅผ ํ†ตํ•œ ๋น ๋ฅธ ๋ฐ์ดํ„ฐ ๋ถ„์„ํ•˜๊ธฐ - ํŠธ๋ž™2, Community Day 2018 re:Invent ํŠน์ง‘
S3 Select๋ฅผ ํ†ตํ•œ ๋น ๋ฅธ ๋ฐ์ดํ„ฐ ๋ถ„์„ํ•˜๊ธฐ - ํŠธ๋ž™2, Community Day 2018 re:Invent ํŠน์ง‘S3 Select๋ฅผ ํ†ตํ•œ ๋น ๋ฅธ ๋ฐ์ดํ„ฐ ๋ถ„์„ํ•˜๊ธฐ - ํŠธ๋ž™2, Community Day 2018 re:Invent ํŠน์ง‘
S3 Select๋ฅผ ํ†ตํ•œ ๋น ๋ฅธ ๋ฐ์ดํ„ฐ ๋ถ„์„ํ•˜๊ธฐ - ํŠธ๋ž™2, Community Day 2018 re:Invent ํŠน์ง‘
ย 
๋ ˆ์ฝ”๋ฒจ์˜ ์ถ”์ฒœ ์„œ๋น„์Šค ๊ณ ๊ตฐ ๋ถ„ํˆฌ๊ธฐ - AWS Summit Seoul 2017
๋ ˆ์ฝ”๋ฒจ์˜ ์ถ”์ฒœ ์„œ๋น„์Šค ๊ณ ๊ตฐ ๋ถ„ํˆฌ๊ธฐ - AWS Summit Seoul 2017๋ ˆ์ฝ”๋ฒจ์˜ ์ถ”์ฒœ ์„œ๋น„์Šค ๊ณ ๊ตฐ ๋ถ„ํˆฌ๊ธฐ - AWS Summit Seoul 2017
๋ ˆ์ฝ”๋ฒจ์˜ ์ถ”์ฒœ ์„œ๋น„์Šค ๊ณ ๊ตฐ ๋ถ„ํˆฌ๊ธฐ - AWS Summit Seoul 2017
ย 
Amazon Aurora ์‹ ๊ทœ ์„œ๋น„์Šค ์•Œ์•„๋ณด๊ธฐ::์ตœ์œ ์ •::AWS Summit Seoul 2018
Amazon Aurora ์‹ ๊ทœ ์„œ๋น„์Šค ์•Œ์•„๋ณด๊ธฐ::์ตœ์œ ์ •::AWS Summit Seoul 2018Amazon Aurora ์‹ ๊ทœ ์„œ๋น„์Šค ์•Œ์•„๋ณด๊ธฐ::์ตœ์œ ์ •::AWS Summit Seoul 2018
Amazon Aurora ์‹ ๊ทœ ์„œ๋น„์Šค ์•Œ์•„๋ณด๊ธฐ::์ตœ์œ ์ •::AWS Summit Seoul 2018
ย 
AWS๊ธฐ๋ฐ˜ ์„œ๋ฒ„๋ฆฌ์Šค ๋ฐ์ดํ„ฐ๋ ˆ์ดํฌ ๊ตฌ์ถ•ํ•˜๊ธฐ - ๊น€์ง„์›… (SK C&C) :: AWS Community Day 2020
AWS๊ธฐ๋ฐ˜ ์„œ๋ฒ„๋ฆฌ์Šค ๋ฐ์ดํ„ฐ๋ ˆ์ดํฌ ๊ตฌ์ถ•ํ•˜๊ธฐ - ๊น€์ง„์›… (SK C&C) :: AWS Community Day 2020 AWS๊ธฐ๋ฐ˜ ์„œ๋ฒ„๋ฆฌ์Šค ๋ฐ์ดํ„ฐ๋ ˆ์ดํฌ ๊ตฌ์ถ•ํ•˜๊ธฐ - ๊น€์ง„์›… (SK C&C) :: AWS Community Day 2020
AWS๊ธฐ๋ฐ˜ ์„œ๋ฒ„๋ฆฌ์Šค ๋ฐ์ดํ„ฐ๋ ˆ์ดํฌ ๊ตฌ์ถ•ํ•˜๊ธฐ - ๊น€์ง„์›… (SK C&C) :: AWS Community Day 2020
ย 
AWS๊ธฐ๋ฐ˜ ์„œ๋ฒ„๋ฆฌ์Šค ๋ฐ์ดํ„ฐ๋ ˆ์ดํฌ ๊ตฌ์ถ•ํ•˜๊ธฐ - ๊น€์ง„์›… (SK C&C) :: AWS Community Day 2020
AWS๊ธฐ๋ฐ˜ ์„œ๋ฒ„๋ฆฌ์Šค ๋ฐ์ดํ„ฐ๋ ˆ์ดํฌ ๊ตฌ์ถ•ํ•˜๊ธฐ - ๊น€์ง„์›… (SK C&C) :: AWS Community Day 2020AWS๊ธฐ๋ฐ˜ ์„œ๋ฒ„๋ฆฌ์Šค ๋ฐ์ดํ„ฐ๋ ˆ์ดํฌ ๊ตฌ์ถ•ํ•˜๊ธฐ - ๊น€์ง„์›… (SK C&C) :: AWS Community Day 2020
AWS๊ธฐ๋ฐ˜ ์„œ๋ฒ„๋ฆฌ์Šค ๋ฐ์ดํ„ฐ๋ ˆ์ดํฌ ๊ตฌ์ถ•ํ•˜๊ธฐ - ๊น€์ง„์›… (SK C&C) :: AWS Community Day 2020
ย 
AWS๋ฅผ ํ†ตํ•œ ๋น…๋ฐ์ดํ„ฐ ๊ธฐ๋ฐ˜ ๋น„์ง€๋‹ˆ์Šค ์ธํ…”๋ฆฌ์ „์Šค ๊ตฌ์ถ•- AWS Summit Seoul 2017
AWS๋ฅผ ํ†ตํ•œ ๋น…๋ฐ์ดํ„ฐ ๊ธฐ๋ฐ˜ ๋น„์ง€๋‹ˆ์Šค ์ธํ…”๋ฆฌ์ „์Šค ๊ตฌ์ถ•- AWS Summit Seoul 2017AWS๋ฅผ ํ†ตํ•œ ๋น…๋ฐ์ดํ„ฐ ๊ธฐ๋ฐ˜ ๋น„์ง€๋‹ˆ์Šค ์ธํ…”๋ฆฌ์ „์Šค ๊ตฌ์ถ•- AWS Summit Seoul 2017
AWS๋ฅผ ํ†ตํ•œ ๋น…๋ฐ์ดํ„ฐ ๊ธฐ๋ฐ˜ ๋น„์ง€๋‹ˆ์Šค ์ธํ…”๋ฆฌ์ „์Šค ๊ตฌ์ถ•- AWS Summit Seoul 2017
ย 
Cloud dw benchmark using tpd-ds( Snowflake vs Redshift vs EMR Hive )
Cloud dw benchmark using tpd-ds( Snowflake vs Redshift vs EMR Hive )Cloud dw benchmark using tpd-ds( Snowflake vs Redshift vs EMR Hive )
Cloud dw benchmark using tpd-ds( Snowflake vs Redshift vs EMR Hive )
ย 
ํฌํ„ธ ๊ฒ€์ƒ‰์–ด ์ˆœ์œ„ ์ˆ˜์ง‘ ๋ฐ ๋ถ„์„ ํ›„๊ธฐ
ํฌํ„ธ ๊ฒ€์ƒ‰์–ด ์ˆœ์œ„ ์ˆ˜์ง‘ ๋ฐ ๋ถ„์„ ํ›„๊ธฐํฌํ„ธ ๊ฒ€์ƒ‰์–ด ์ˆœ์œ„ ์ˆ˜์ง‘ ๋ฐ ๋ถ„์„ ํ›„๊ธฐ
ํฌํ„ธ ๊ฒ€์ƒ‰์–ด ์ˆœ์œ„ ์ˆ˜์ง‘ ๋ฐ ๋ถ„์„ ํ›„๊ธฐ
ย 
SK Telecom - ๋ง๊ด€๋ฆฌ ํ”„๋กœ์ ํŠธ TANGO์˜ ์˜คํ”ˆ์†Œ์Šค ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค ์ „ํ™˜ ์—ฌ์ • - ๋ฐœํ‘œ์ž : ๋ฐ•์Šน์ „, Project Manager, ...
SK Telecom - ๋ง๊ด€๋ฆฌ ํ”„๋กœ์ ํŠธ TANGO์˜ ์˜คํ”ˆ์†Œ์Šค ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค ์ „ํ™˜ ์—ฌ์ • - ๋ฐœํ‘œ์ž : ๋ฐ•์Šน์ „, Project Manager, ...SK Telecom - ๋ง๊ด€๋ฆฌ ํ”„๋กœ์ ํŠธ TANGO์˜ ์˜คํ”ˆ์†Œ์Šค ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค ์ „ํ™˜ ์—ฌ์ • - ๋ฐœํ‘œ์ž : ๋ฐ•์Šน์ „, Project Manager, ...
SK Telecom - ๋ง๊ด€๋ฆฌ ํ”„๋กœ์ ํŠธ TANGO์˜ ์˜คํ”ˆ์†Œ์Šค ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค ์ „ํ™˜ ์—ฌ์ • - ๋ฐœํ‘œ์ž : ๋ฐ•์Šน์ „, Project Manager, ...
ย 
๊ฒŒ์ž„ ๊ณ ๊ฐ ์‚ฌ๋ก€๋ฅผ ํ†ตํ•ด ์‚ดํŽด๋ณด๋Š” AWS ํ™œ์šฉ ์ „๋žต :: ์ด๊ฒฝ์•ˆ :: AWS Summit Seoul 2016
๊ฒŒ์ž„ ๊ณ ๊ฐ ์‚ฌ๋ก€๋ฅผ ํ†ตํ•ด ์‚ดํŽด๋ณด๋Š” AWS ํ™œ์šฉ ์ „๋žต :: ์ด๊ฒฝ์•ˆ :: AWS Summit Seoul 2016๊ฒŒ์ž„ ๊ณ ๊ฐ ์‚ฌ๋ก€๋ฅผ ํ†ตํ•ด ์‚ดํŽด๋ณด๋Š” AWS ํ™œ์šฉ ์ „๋žต :: ์ด๊ฒฝ์•ˆ :: AWS Summit Seoul 2016
๊ฒŒ์ž„ ๊ณ ๊ฐ ์‚ฌ๋ก€๋ฅผ ํ†ตํ•ด ์‚ดํŽด๋ณด๋Š” AWS ํ™œ์šฉ ์ „๋žต :: ์ด๊ฒฝ์•ˆ :: AWS Summit Seoul 2016
ย 
์‚ฌ์—… ์‹ค์ 
์‚ฌ์—… ์‹ค์ ์‚ฌ์—… ์‹ค์ 
์‚ฌ์—… ์‹ค์ 
ย 
๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค ์šด์˜, ์„œ๋ฒ„๋ฆฌ์Šค๋กœ ๊ฑฑ์ • ๋! - ์œค์„์ฐฌ, AWS ํ…Œํฌ์—๋ฐ˜์ ค๋ฆฌ์ŠคํŠธ - AWS Builders Online Series
๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค ์šด์˜, ์„œ๋ฒ„๋ฆฌ์Šค๋กœ ๊ฑฑ์ • ๋! - ์œค์„์ฐฌ, AWS ํ…Œํฌ์—๋ฐ˜์ ค๋ฆฌ์ŠคํŠธ - AWS Builders Online Series๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค ์šด์˜, ์„œ๋ฒ„๋ฆฌ์Šค๋กœ ๊ฑฑ์ • ๋! - ์œค์„์ฐฌ, AWS ํ…Œํฌ์—๋ฐ˜์ ค๋ฆฌ์ŠคํŠธ - AWS Builders Online Series
๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค ์šด์˜, ์„œ๋ฒ„๋ฆฌ์Šค๋กœ ๊ฑฑ์ • ๋! - ์œค์„์ฐฌ, AWS ํ…Œํฌ์—๋ฐ˜์ ค๋ฆฌ์ŠคํŠธ - AWS Builders Online Series
ย 

Spark+Cassandra Data pipeline optimazation at recommend system for recommend system

  • 1. Spark + Cassandra ๊ธฐ๋ฐ˜ Big Data๋ฅผ ํ™œ์šฉํ•œ ์ถ”์ฒœ์‹œ์Šคํ…œ ์„œ๋น™ ํŒŒ์ดํ”„๋ผ์ธ ์ตœ์ ํ™” 2020.11.26 SSG.COM ๋ฐ•์ˆ˜์„ฑ
  • 2. CONTENTS 1. E-commerce Data Use case 2. Data Pipeline with Spark + Cassandra 3. Trouble Shooting & Optimization 4. Q&A
  • 3. 1. E-commerce Data Use case - ๊ณ ๊ฐ์˜ ํ–‰๋™ (๋ฐฉ๋ฌธ, ๊ฒ€์ƒ‰, ์žฅ๋ฐ”๊ตฌ๋‹ˆ, ํด๋ฆญ, ๊ตฌ๋งค, ๋ฆฌ๋ทฐ ๋“ฑ) ๋ฐ์ดํ„ฐ๊ฐ€ ์ƒ์„ฑ - ์ƒํ’ˆ ์ถ”์ฒœ, ์ˆ˜์š” ์˜ˆ์ธก, ํŠธ๋ Œ๋“œ ๋ถ„์„ ๋“ฑ์— ํ™œ์šฉ ๊ฐ€๋Šฅ ๊ณ ๊ฐ๋ณ„ ์ถ”์ฒœ ์ƒํ’ˆ๋ณ„ ์ถ”์ฒœ ๊ณ ๊ฐ๋ณ„๋กœ ๋‹ค๋ฅธ ์ƒํ’ˆ ์ถ”์ฒœ Ex) ๋‹ค๋ฅธ ๊ณ ๊ฐ์˜ Path ์ฐธ๊ณ  ์ƒํ’ˆ๋ณ„๋กœ ๋‹ค๋ฅธ ์ƒํ’ˆ ์ถ”์ฒœ Ex) ๋Œ€์ฒด์žฌ, ๋ณด์™„ ์žฌ ๋“ฑ..
  • 4. 1. E-commerce Data Use case - ์นด๋ ˆ์—ฌ์™•์„ ๊ตฌ๋งคํ•˜๋Š” ๊ณ ๊ฐ์—๊ฒŒ ๋ณ„๋„์˜ ๋ฌถ์Œ ์ƒํ’ˆ์„ ์ œ๊ณต - ํ•ด๋‹น ์ƒํ’ˆ๊ณผ ํ•จ๊ป˜ ๋ฐฉ๋ฌธ/๊ตฌ๋งค/์žฅ๋ฐ”๊ตฌ๋‹ˆ ์•ก์…˜์ด ์ผ์–ด๋‚˜๋Š” ์ƒํ’ˆ ๋ฌถ์Œ์„ ๋…ธ์ถœํ•˜๋ฉด์„œ ์ฟ ํฐ&ํ• ์ธ ์ œ๊ณต ํ•จ๊ป˜ ๊ตฌ๋งคํ•˜๋„๋ก ์œ ๋„
  • 5. 1. E-commerce Data Use case - ์ˆ˜ ๋งŽ์€ ํ”„๋ ˆ์ž„์›Œํฌ์—์„œ๋Š” ์‰ฝ๊ฒŒ ์ ‘๊ทผ์ด ๊ฐ€๋Šฅํ•œ ์•Œ๊ณ ๋ฆฌ์ฆ˜์ด ์กด์žฌ - ์ „์ฒ˜๋ฆฌ, ํ›„์ฒ˜๋ฆฌ, ํŒŒ๋ผ๋ฏธํ„ฐ ์กฐ์ ˆ ๋งŒ์œผ๋กœ๋„ ์‰ฝ๊ฒŒ ์ถ”์ฒœ ๋ฐ์ดํ„ฐ Set์„ ๋งŒ๋“ค์–ด๋‚ผ ์ˆ˜ ์žˆ์Œ. Ex) MLlib: Main Guide - Basic statistics - Pipelines - Extracting, transforming and selecting features - Classification and Regression - Clustering - Collaborative filtering - Frequent Pattern Mining - Model selection and tuning Spark MLlib์˜ FP-Growth ์˜ˆ์ œ ์ฝ”๋“œ
  • 6. - ๊ณผ๊ฑฐ์— ๋น„ํ•ด ๊ฐœ๋ฐœ์ž๋“ค๋„ ML์— ๋Œ€ํ•œ ์ ‘๊ทผ์ด ์‰ฌ์›Œ์ง€๊ณ  ์žˆ์Œ. - ๋ถ„๋ช… ์—ฌ๋Ÿฌ ํ”„๋ ˆ์ž„์›Œํฌ์—์„œ ์ œ๊ณตํ•˜๋Š” ์•Œ๊ณ ๋ฆฌ์ฆ˜์— ๋ถ€์กฑํ•œ ๋ถ€๋ถ„์ด ์กด์žฌ. - But ๊พธ์ค€ํžˆ ๋ฐœ์ „ ์ค‘์ด๊ณ  ์ƒˆ๋กœ์šด ๊ฒƒ์ด ๊ณ„์†ํ•ด์„œ ๋“ฑ์žฅ. 1. E-commerce Data Use case
  • 7. - ์ข‹์€ ์•Œ๊ณ ๋ฆฌ์ฆ˜์œผ๋กœ ์ œํ’ˆ์„ ๋งŒ๋“ค์–ด๋‚ด๋Š” ๊ฒƒ๋„ ์ค‘์š” - ํ•˜์ง€๋งŒ ๊ณ ๊ฐ์—๊ฒŒ ๋ฐ์ดํ„ฐ๋ฅผ ์„œ๋น™ํ•  ์ˆ˜ ์žˆ์–ด์•ผ ์˜๋ฏธ๊ฐ€ ์žˆ์Œ - ์„œ๋น™ ํŒŒ์ดํ”„๋ผ์ธ ๊ตฌ์ถ•์€ ๊ฒฝํ—˜๊ณผ ๋…ธํ•˜์šฐ๊ฐ€ ํ•„์š” 1. E-commerce Data Use case
  • 8. 2. Data Pipeline with Spark + Cassandra HDFS Mesos Spark HDFS Mesos Spark HDFS Mesos Spark HDFS Mesos Spark HDFS Mesos Spark HDFS Mesos Spark HDFS Mesos Spark HDFS Mesos Spark HDFS Mesos Spark HDFS Mesos Spark Kafka Logstash ๋ฐ์ดํ„ฐ ์ €์žฅ&์ฒ˜๋ฆฌ ์ถ”์ฒœ์…‹ ์ €์žฅ API Server Database Database Database Database Database Spring Boots ์ˆ˜์ง‘์„œ๋ฒ„ ์„œ๋น™ ๋ ˆ์ด์–ด ๊ฐ„์†Œํ™”๋ฅผ ์˜ˆ์‹œ
  • 9. 2. Data Pipeline with Spark + Cassandra - ๋งค์ผ ์ƒˆ๋ฒฝ ์‹œ๊ฐ„๋Œ€์— ์ˆ˜๋ฐฑ GB์˜ ๋ฐ์ดํ„ฐ์…‹๋“ค์„ DB์— Insert - ํŠน์ • ์‹ค์‹œ๊ฐ„ ๋กœ๊ทธ๋“ค ๋‹จ๊ฑด Insert - ๋ณ‘๋ ฌ ์ˆ˜ํ–‰์„ ์œ„ํ•œ Spark์™€์˜ ๊ถํ•ฉ - ๋‹จ์ˆœ Select๊ฐ€ ์ฃผ์š” ์ฟผ๋ฆฌ - ํŠน์ • ํšŒ์›/์ƒํ’ˆ Skew - DB Downtime์ด ๋ฐœ์ƒํ•˜๋”๋ผ๋„ ์ง€์†๊ฐ€๋Šฅํ•œ ์šด์˜ - ์ถ”ํ›„ ์‰ฝ๊ฒŒ ํ™•์žฅ ๊ฐ€๋Šฅํ•ด์•ผ ํ•จ. - ๋ชจ๋‹ˆํ„ฐ๋ง ํ”„๋กœ์„ธ์Šค ํ•„์š”
  • 10. 2. Data Pipeline with Spark + Cassandra / Serving Layer Architecture - ์ƒˆ๋ฒฝ ์‹œ๊ฐ„๋Œ€์— ์ˆ˜๋ฐฑ GB์˜ ๋ฐ์ดํ„ฐ์…‹๋“ค์„ ๋งค์ผ DB์— Insert - ํŠน์ • ์‹ค์‹œ๊ฐ„ ๋กœ๊ทธ๋“ค ๋‹จ๊ฑด Insert (๋ฐฐ์น˜ ์ปค์Šคํ„ฐ๋งˆ์ด์ง• ์˜์—ญ) - ๋ณ‘๋ ฌ ์ˆ˜ํ–‰์„ ์œ„ํ•œ Spark์™€์˜ ๊ถํ•ฉ (Cassandra ์—ญ์‹œ Apache Project) - ๋‹จ์ˆœ Select๊ฐ€ ์ฃผ์š” ์ฟผ๋ฆฌ (Cassandra๋Š” key ์กฐํšŒ์— ์•Œ๋งž์Œ.) - ํŠน์ • ํšŒ์›/์ƒํ’ˆ Skew (์ƒค๋”ฉ์— ๋Œ€ํ•œ ๊ณ ๋ ค ์•ˆํ•ด๋„ ๋จ) - DB Downtime์ด ๋ฐœ์ƒํ•˜๋”๋ผ๋„ ์ง€์†๊ฐ€๋Šฅํ•œ ์šด์˜ (Ring ๊ตฌ์กฐ๋ผ์„œ ์‰ฝ๊ฒŒ Up/Down Serivce ๊ฐ€๋Šฅ) - ์ถ”ํ›„ ์‰ฝ๊ฒŒ ํ™•์žฅ ๊ฐ€๋Šฅํ•ด์•ผ ํ•จ. (Ring ๊ตฌ์กฐ์— ๋‹จ์ˆœํžˆ ์ถ”๊ฐ€ํ•˜๋Š” ๊ฒƒ์œผ๋กœ Scale Out ๊ฐ€๋Šฅ) - ๋ชจ๋‹ˆํ„ฐ๋ง ํ”„๋กœ์„ธ์Šค ํ•„์š” (JMX exporter๋ฅผ ํ†ตํ•ด Metric ์ •๋ณด๋ฅผ ์ˆ˜์ง‘&๋ชจ๋‹ˆํ„ฐ๋ง ๊ฐ€๋Šฅ)
  • 11. What is Cassandra? - No master & slaves - distributed like a ring - Scalability - high availability 2. Data Pipeline with Spark + Cassandra / Serving Layer Architecture
  • 12. Memory 2. Data Pipeline with Spark + Cassandra / Serving Layer Architecture Memtable 1๋ฒˆ SSTable 2๋ฒˆ SSTable 3๋ฒˆ SSTable 4๋ฒˆ SSTable n๋ฒˆ SSTable ํ…Œ์ด๋ธ” A Disk Data Path
  • 13. Spark Driver Spark Excutor partition Spark Excutor partition Spark Excutor partition Spark Excutor partition Spark Excutor partition Spark Excutor partition 2. Data Pipeline with Spark + Cassandra API Server Spring Boots Spark Streaming ์ผ๋ฐฐ์น˜์„ฑ ๋Œ€์šฉ๋Ÿ‰ ๋ฐ์ดํ„ฐ ์‹ค์‹œ๊ฐ„์„ฑ ์ €์šฉ๋Ÿ‰ ๋ฐ์ดํ„ฐ
  • 14. 3. Trouble Shooting & Optimization ๋ชจ๋“  ๋ฐฐ์น˜์„ฑ ๋ฐ์ดํ„ฐ, ์‹ค์‹œ๊ฐ„ ๋ฐ์ดํ„ฐ๋ฅผ memtable(In-memory) ๋ฐฉ์‹์œผ๋กœ Insert ํ•จ. ์‹ค์‹œ๊ฐ„ ๋ฐ์ดํ„ฐ๋Š” ์ž‘์ง€๋งŒ ๋ฐฐ์น˜์„ฑ ๋ฐ์ดํ„ฐ๋Š” ์ˆœ๊ฐ„ ์ˆ˜๋ฐฑ GB์˜ ๋ฐ์ดํ„ฐ๊ฐ€ ๋ฉ”๋ชจ๋ฆฌ์— ์˜ฌ๋ผ๊ฐ€๋Š” ํšจ๊ณผ. ๊ทธ ๊ฒฐ๊ณผ ์ฒ˜๋ฆฌ์œจ์— ๋”ฐ๋ผ CPU๊ฐ€ 100%์— ๋„๋‹ฌํ•˜๋ฉฐ ์ œ์‹œ๊ฐ„์— Response๋ฅผ ์ฃผ์ง€ ๋ชปํ•˜๋Š” ๋ฌธ์ œ๊ฐ€ ๋ฐœ์ƒ. CPU 100%!! 60k/s
  • 15. 3. Trouble Shooting & Optimization ํ•ด๋‹น ๋ฐ์ดํ„ฐ๊ฐ€ ๋„ˆ๋ฌด ํด ๊ฒฝ์šฐ Memtable(In-memory) -> SSTable(Disk)๋กœ ๋‚ด๋ฆฌ๋Š” ๊ณผ์ •์ธ Flush๊ฐ€ ์กด์žฌํ•จ. Service Downtime์„ ๊ณ ๋ คํ•˜์—ฌ Replica๋ฅผ 3์œผ๋กœ ์žก์Œ. ๋”ฐ๋ผ์„œ Flush ๋ฐ Copy๋กœ ์ธํ•œ Batch ์‹œ๊ฐ„์€ ๊ธธ์–ด์ง€๊ณ  ์ด ๋ชจ๋“  ์‹œ๊ฐ„๋Œ€๋Š” ์žฅ์• ์ƒํ™ฉ์œผ๋กœ ๊ฐ„์ฃผ. Connection Timeouts / Pending threads
  • 16. 3. Trouble Shooting & Optimization Idea ? - ๋Œ€์šฉ๋Ÿ‰์˜ ๋ฐ์ดํ„ฐ๋Š” Memtable -> SSTable ๋ฐฉ์‹์œผ๋กœ Insert ๋ณ€๊ฒฝ. - ์‹ค์‹œ๊ฐ„์„ฑ ๋‹จ๊ฑด Insert ๋ฐ์ดํ„ฐ๋Š” Spark Streaming + Memtable ๋ฐฉ์‹์œผ๋กœ Insert ์œ ์ง€. Spark Streaming Spark Cluster ์‹ค์‹œ๊ฐ„/๋‹จ๊ฑด ์ผ๋ฐฐ์น˜ ๋Œ€์šฉ๋Ÿ‰ Memtable Insert SSTable File ์ƒ์„ฑ -> SSTable Bulk Load
  • 17. Spark Driver Spark Excutor partition Spark Excutor partition Spark Excutor partition Spark Excutor partition Spark Excutor partition Spark Excutor partition API Server Spring Boots SSTable Files SSTable Files SSTable Files SSTable Files SSTable FilesSSTable Files 3. Trouble Shooting & Optimization Spark Streaming ์ผ๋ฐฐ์น˜์„ฑ ๋Œ€์šฉ๋Ÿ‰ ๋ฐ์ดํ„ฐ ์‹ค์‹œ๊ฐ„์„ฑ ์ €์šฉ๋Ÿ‰ ๋ฐ์ดํ„ฐ
  • 18. * ์ž‘์—… ์ˆœ์„œ 1. UUID๋ฅผ ํ™œ์šฉํ•˜์—ฌ SSTable Directory ์ƒ์„ฑ 2. Directory์— SSTable ์ƒ์„ฑ 3. SSTable Bulk Load To Cassandra 4. Delete Directory 3. Trouble Shooting & Optimization * ๊ธฐ๋Œ€ํšจ๊ณผ - ๊ฐ ์นด์‚ฐ๋“œ๋ผ ๋…ธ๋“œ๋Š” SSD์ด๊ธฐ ๋•Œ๋ฌธ์— ํšจ์œจ ์ฆ๊ฐ€ - CPU ์‚ฌ์šฉ์ด ๋ฏธ๋ฏธํ•  ๊ฒƒ์ด๊ธฐ ๋•Œ๋ฌธ์— ์šด์˜ ์ƒ์— ์˜ํ–ฅ ๋ฏธ๋ฏธ - Network/Disk ์„ฑ๋Šฅ์ด ์ถฉ๋ถ„ํžˆ ๋ฐ›์ณ์ค€๋‹ค๋ฉด ๋” ๋งŽ์€ ๋ฐฐ์น˜๋ฅผ ๋™์‹œ์ˆ˜ํ–‰ ๊ฐ€๋Šฅ
  • 19. 3. Trouble Shooting & Optimization ๋ณ‘๋ ฌ์ˆ˜ํ–‰์„ ์œ„ํ•œ repartition UUID๋ฅผ ํ™œ์šฉํ•ด์„œ ๋””๋ ‰ํ† ๋ฆฌ ์ƒ์„ฑ SSTable ์ƒ์„ฑ ๋žœ๋ค์œผ๋กœ ์นด์‚ฐ๋“œ๋ผ ๋…ธ๋“œ ์„ ํƒ Stream buffer size ์กฐ์ ˆ ๋ฐ ์ „์†ก ๋””๋ ‰ํ† ๋ฆฌ ์‚ญ์ œ ์กฐ์ ˆ ๊ฐ€๋Šฅํ•œ ๋ณ€์ˆ˜ 1. partition_num 2. streamthrottlembits
  • 20. 3. Trouble Shooting & Optimization Ex) ์ˆ˜์‹  ์ธก์ด Network Traffic์€ ์ตœ๋Œ€ 1Gbps๋ฅผ ๋ฐ›์„ ์ˆ˜ ์žˆ๊ณ  SSD๋Š” 2GBs. (Network Traffic๋งŒ ๊ณ ๋ ค) Base Line์„ Max 35%์ธ 350Mbps๋งŒ ์‚ฌ์šฉํ•˜๋„๋ก ๊ฐ€์ •. ๋งŒ์•ฝ HDFS Size 1G๋‹น ํŒŒํ‹ฐ์…˜ 1๊ฐœ๋กœ ๊ณ ์ •ํ•œ๋‹ค๋ฉด 10GB๋ฅผ ์ „์†กํ•  ๊ฒฝ์šฐ 10๋Œ€๊ฐ€ ๋ณ‘๋ ฌ๋กœ ์ˆ˜ํ–‰. Streamthrottlembits * partition_num = 350 (mbps) 350Mbps๋กœ ์ œํ•œํ•˜๋Š” ์˜ˆ์ œ ์ฝ”๋“œ
  • 21. 3. Trouble Shooting & Optimization ๊ธฐ๋Œ€ํ–ˆ๋˜ Base Line์ธ 350Mbps์ด ์ตœ๋Œ€์น˜์˜€์œผ๋ฉฐ ์ด๋Ÿฌํ•œ ๊ฒฝ์šฐ ๋ฐฐ์น˜๋ฅผ 2๊ฐœ๊นŒ์ง€ ์ˆ˜ํ–‰ ๊ฐ€๋Šฅ.(~70%) ์‹ค์ œ ์šด์˜ ์‹œ์—๋Š” ์žฅ๋น„๊ฐ€ ๋” ์ข‹์„ ๊ฒƒ์ด๋ฏ€๋กœ ๋” ํ—ค๋น„ํ•˜๊ฒŒ ์‚ฌ์šฉ๊ฐ€๋Šฅ ์ค‘์š”ํ•œ ๊ฒƒ์€ ๊ทธ ํŠธ๋ž˜ํ”ฝ์„ ์–ผ๋งˆ๋‚˜ ์ž˜ ์ œ์–ดํ•˜๊ณ , ๋ฐฐ์น˜๋ณ„๋กœ ์–ด๋–ป๊ฒŒ ๋™์ ์œผ๋กœ ๋ถ„๋ฐฐํ•  ์ง€๋ฅผ ๊ณ ๋ฏผ. ๊ฒฐ๊ณผ ์ •ํ™•ํžˆ 35%๋งŒ ์‚ฌ์šฉ!!
  • 22. 3. Trouble Shooting & Optimization ์ „ ํ›„ CPU, Memory ์‚ฌ์šฉ ๊ฐ์†Œ๋กœ ์ธํ•ด ๋™์‹œ์— ๋” ๋งŽ์€ ๋ฐฐ์น˜๊ฐ€ ์ˆ˜ํ–‰ ๊ฐ€๋Šฅํ•ด์ง (Network, Disk I/O ๊ณ ๋ ค) 60K write/sec MAX CPU Usage 100% 60 write/sec MAX CPU Usage 20%
  • 23. - ๋ฐ์ดํ„ฐ์…‹ Size์— ๋”ฐ๋ผ ๊ฐ ๋…ธ๋“œ์—์„œ๋Š” ์ผ๋ณ„ ์ˆ˜๋ฐฑ~์ˆ˜์ฒœ๊ฐœ๊ฐ€ ์ƒ์„ฑ์ด ๋˜๊ณ  ๋งค์ผ ์Œ“์ด๋Š” ๊ตฌ์กฐ - Cache์— ์˜ฌ๋ผ๊ฐ€๊ธฐ ์ „์— ์ˆ˜ ๋งŽ์€ SSTable์„ Readํ•˜๋ฉด์„œ ์„ฑ๋Šฅ ํ•˜๋ฝ์˜ ์›์ธ์ด ๋  ์ˆ˜ ์žˆ์Œ -> ๋”ฐ๋ผ์„œ ๋งค์ผ Compaction์„ ์ˆ˜ํ–‰ํ•˜๋ฉด์„œ SSTable์„ ์ค„์—ฌ์ฃผ๋Š” ์ž‘์—… ์ง„ํ–‰ 3. Trouble Shooting & Optimization Compaction์„ ํ†ตํ•ด SSTable์ด ์ค„์–ด๋“œ๋Š” ๊ทธ๋ž˜ํ”„
  • 24. * Azul Systems์˜ Zing ๋„์ž… ๋ฐฐ๊ฒฝ - ์ถ”์ฒœ์…‹ ๋ฐฐ์น˜๊ฐ€ ์ ์  ๋” ๋งŽ์•„์ง - ๋ฐ์ดํ„ฐ ์‚ฌ์ด์ฆˆ๊ฐ€ ๊ณ„์†ํ•ด์„œ ์ปค์ง - ์ถ”๊ฐ€ ํ”„๋กœ์ ํŠธ ์ง„ํ–‰์‹œ ํŠธ๋ž˜ํ”ฝ์ด ๋ณด์ˆ˜์ ์œผ๋กœ 5๋ฐฐ๋Š” ๋Š˜์–ด๋‚  ๊ฒƒ์œผ๋กœ ์˜ˆ์ƒ ํŠนํžˆ ์ถ”ํ›„ ํŠธ๋ž˜ํ”ฝ์ด ๋งŽ์•„์งˆ ๊ฒฝ์šฐ ์•ˆ์ •์ ์ธ response๋ฅผ ์œ„ํ•ด Zing GC ํ…Œ์ŠคํŠธ ํ•ด๋ณด๊ธฐ๋กœ ๊ฒฐ์ • ๏ƒ  Cassandra + Zing ์กฐํ•ฉ์ด ๊ตญ๋‚ด ๋ ˆํผ๋Ÿฐ์Šค๊ฐ€ ์—†์—ˆ๊ธฐ ๋•Œ๋ฌธ์— ๋‚ด๋ถ€์—์„œ ์ž์ฒด ํ…Œ์ŠคํŠธ๋ฅผ ์ง„ํ–‰ 3. Trouble Shooting & Optimization ZING GC G1GC
  • 25. HotSpot JVM vs Zing JVM ๋น„๊ต 3. Trouble Shooting & Optimization ZING GC G1GC
  • 26. Zing GC(์œ„) vs G1GC (์•„๋ž˜) - ์กฐ๊ธˆ ๋” ์•ˆ์ •์ ์œผ๋กœ ์‚ฌ์šฉํ•˜๊ธฐ ์œ„ํ•ด Zing ๋„์ž… - Zing์˜ ๊ฒฝ์šฐ ๊พธ์ค€ํ•œ ํ•ญ์ƒ ์ผ์ •ํ•œ GC time ์œ ์ง€ - STW๋ฅผ ์‹ ๊ฒฝ์“ฐ์ง€ ์•Š๊ณ  ์•ˆ์ •์ ์ธ Response ๊ธฐ๋Œ€ - ํŠธ๋ž˜ํ”ฝ์ด ๋Š˜์–ด๋‚˜๊ณ , ๋” ํ—ค๋น„ ํ•˜๊ฒŒ ์‚ฌ์šฉํ•  ๊ฒฝ์šฐ ๋” ๊ทน์ ์ธ ํšจ๊ณผ ๊ธฐ๋Œ€ ๊ฐ€์žฅ ๋งˆ์Œ์— ๋“ค์—ˆ๋˜ ๊ฒƒ์€ GC ํŠœ๋‹์ด ํ•„์š”๊ฐ€ ์—†์Œ! Ex ) โ€“Xmx60g๋งŒ ์ฃผ๊ณ  ์‚ฌ์šฉ 3. Trouble Shooting & Optimization Zing GC G1GC
  • 27. Prometheus + Grafana ์กฐํ•ฉ์œผ๋กœ ๋ชจ๋‹ˆํ„ฐ๋ง ์‹œ๊ฐํ™” 3. Trouble Shooting & Optimization
  • 28. - CPU ์‚ฌ์šฉ๋Ÿ‰ 100% -> 20%๋กœ ๊ฐ์†Œํ•จ์œผ๋กœ์จ ์šด์˜ ์•ˆ์ •์„ฑ ํ™•๋ณด - ์ฒ˜๋ฆฌ์œจ ์ฆ๊ฐ€๋กœ ์ธํ•ด ๋ฐฐ์น˜ ์†๋„ 2H -> 15M ์œผ๋กœ ๊ฐ์†Œ - Network & Disk I/O ๋งŒ ๊ณ ๋ คํ•˜๋ฉด์„œ ๋‹ค์ค‘ ๋ฐฐ์น˜ ์ˆ˜ํ–‰ ๊ฐ€๋Šฅ - Zing GC ๋„์ž… ํ›„ ์•ˆ์ •์ ์ธ Response ๋ฐ ๋ฉ”๋ชจ๋ฆฌ ๊ด€๋ฆฌ - Full Data๊ฐ€ ์•„๋‹Œ Key Cache๋งŒ ๋ฉ”๋ชจ๋ฆฌ์— ์˜ฌ๋ ค๋†“์Œ์œผ๋กœ์จ ํšจ์œจ์  ์šด์˜ - ์‹ค์‹œ๊ฐ„/๋Œ€์šฉ๋Ÿ‰ ๋ฐฐ์น˜๋ฅผ ๋ถ„๋ฆฌ ์šด์˜ํ•จ์œผ๋กœ์จ ์•ˆ์ •์„ฑ ํ™•๋ณด 3. Trouble Shooting & Optimization
  • 29. Q & A