SlideShare a Scribd company logo
1 of 85
Download to read offline
3 3 1 0
1 2 ,
์›จ๋น„๋‚˜
0 2 ,
๋ฃจํ‚ค
ํ”„๋กœ
๋งˆ์Šคํ„ฐ
๋ฃจํ‚ค ํ”„๋กœ ๋งˆ์Šคํ„ฐ
๋ชจ๋‘ ๋งˆ์Šคํ„ฐ๊ฐ€ ๋˜์–ด๋ณด์„ธ์š”! J
๊ฐ•์—ฐ ์ค‘ ์งˆ๋ฌธํ•˜๋Š” ๋ฐฉ๋ฒ• AWS Builders
Go to Webinar โ€œQuestionsโ€ ์ฐฝ์— ์ž์‹ ์ด ์งˆ๋ฌธํ•œ
๋‚ด์—ญ์ด ํ‘œ์‹œ๋ฉ๋‹ˆ๋‹ค. ๊ธฐ๋ณธ์ ์œผ๋กœ ๋ชจ๋“  ์งˆ๋ฌธ์€
๊ณต๊ฐœ๋กœ ๋‹ต๋ณ€ ๋ฉ๋‹ˆ๋‹ค๋งŒ ๋ณธ์ธ๋งŒ ๋‹ต๋ณ€์„ ๋ฐ›๊ณ 
์‹ถ์œผ๋ฉด (๋น„๊ณต๊ฐœ)๋ผ๊ณ  ํ•˜๊ณ  ์งˆ๋ฌธํ•ด ์ฃผ์‹œ๋ฉด ๋ฉ๋‹ˆ๋‹ค.
๋ณธ ์ปจํ…์ธ ๋Š” ๊ณ ๊ฐ์˜ ํŽธ์˜๋ฅผ ์œ„ํ•ด AWS ์„œ๋น„์Šค ์„ค๋ช…์„ ์œ„ํ•ด ์˜จ๋ผ์ธ ์„ธ๋ฏธ๋‚˜์šฉ์œผ๋กœ ๋ณ„๋„๋กœ ์ œ์ž‘, ์ œ๊ณต๋œ ๊ฒƒ์ž…๋‹ˆ๋‹ค. ๋งŒ์•ฝ AWS
์‚ฌ์ดํŠธ์™€ ์ปจํ…์ธ  ์ƒ์—์„œ ์ฐจ์ด๋‚˜ ๋ถˆ์ผ์น˜๊ฐ€ ์žˆ์„ ๊ฒฝ์šฐ, AWS ์‚ฌ์ดํŠธ(aws.amazon.com)๊ฐ€ ์šฐ์„ ํ•ฉ๋‹ˆ๋‹ค. ๋˜ํ•œ AWS ์‚ฌ์ดํŠธ
์ƒ์—์„œ ํ•œ๊ธ€ ๋ฒˆ์—ญ๋ฌธ๊ณผ ์˜์–ด ์›๋ฌธ์— ์ฐจ์ด๋‚˜ ๋ถˆ์ผ์น˜๊ฐ€ ์žˆ์„ ๊ฒฝ์šฐ(๋ฒˆ์—ญ์˜ ์ง€์ฒด๋กœ ์ธํ•œ ๊ฒฝ์šฐ ๋“ฑ ํฌํ•จ), ์˜์–ด ์›๋ฌธ์ด ์šฐ์„ ํ•ฉ๋‹ˆ๋‹ค.
AWS๋Š” ๋ณธ ์ปจํ…์ธ ์— ํฌํ•จ๋˜๊ฑฐ๋‚˜ ์ปจํ…์ธ ๋ฅผ ํ†ตํ•˜์—ฌ ๊ณ ๊ฐ์—๊ฒŒ ์ œ๊ณต๋œ ์ผ์ฒด์˜ ์ •๋ณด, ์ฝ˜ํ…์ธ , ์ž๋ฃŒ, ์ œํ’ˆ(์†Œํ”„ํŠธ์›จ์–ด ํฌํ•จ) ๋˜๋Š” ์„œ๋น„์Šค๋ฅผ ์ด์šฉํ•จ์œผ๋กœ ์ธํ•˜์—ฌ ๋ฐœ์ƒํ•˜๋Š” ์—ฌํ•˜ํ•œ ์ข…๋ฅ˜์˜ ์†ํ•ด์—
๋Œ€ํ•˜์—ฌ ์–ด๋– ํ•œ ์ฑ…์ž„๋„ ์ง€์ง€ ์•„๋‹ˆํ•˜๋ฉฐ, ์ด๋Š” ์ง์ ‘ ์†ํ•ด, ๊ฐ„์ ‘ ์†ํ•ด, ๋ถ€์ˆ˜์  ์†ํ•ด, ์ง•๋ฒŒ์  ์†ํ•ด ๋ฐ ๊ฒฐ๊ณผ์  ์†ํ•ด๋ฅผ ํฌํ•จํ•˜๋˜ ์ด์— ํ•œ์ •๋˜์ง€ ์•„๋‹ˆํ•ฉ๋‹ˆ๋‹ค.
๊ณ ์ง€ ์‚ฌํ•ญ(Disclaimer)
ยง Introdution
ยง Glue internal
ยง Items
ยง Item1: Processing lots of small files
ยง Item2: Processing a few large files
ยง Item3: Optimizing parallelism
ยง Item4: JDBC partitions
ยง Item5: Python udf & performance
ยง Item6: Scheduler
ยง Item7: Python shell
ยง QnA
Introduction
Fully-managed, serverless extract-transform-load (ETL) service
for developers, built by developers
1000s of customers and jobs
A year ago โ€ฆ
AWS Glue
Serverless data catalog & ETL service
Data Catalog
ETL Job
authoring
Discover data and
extract schema
Auto-generates
customizable ETL code
in Python and Scala
Automatically discovers data and stores schema
Data searchable, and available for ETL
Generates customizable code
Schedules and runs your ETL jobs
Serverless, flexible, and built on open standards
Putting it together - data lake with AWS Glue
Amazon S3
(Raw data)
Amazon S3
(Staging
data)
Amazon S3
(Processed
data)
AWS Glue Data Catalog
Crawlers Crawlers Crawlers
Select AWS Glue customers
AWS Glue
Serverless data catalog & ETL service
Data Catalog
ETL Job
authoring
Discover data and
extract schema
Auto-generates
customizable ETL code
in Python and Scala
Automatically discovers data and stores schema
Data searchable, and available for ETL
Generates customizable code
Schedules and runs your ETL jobs
Serverless, flexible, and built on open standards
Glue internal
Programming Environment
โ€ข ETL in Python
โ€ข Python 2.7
โ€ข Boto 3
โ€ข ETL in Scala
โ€ข Scala 2.11
โ€ข Spark Cluster
โ€ข Spark 2.2.1
Programming Environment
โ€ข 1 DPU (Data Processing Unit)
โ€ข 1 m4.xlarge node
โ€ข 4vCPU
โ€ข 16G RAM
โ€ข 2 executors
โ€ข 1 Executor
โ€ข 5G RAM
โ€ข 4 Tasks
Driver
Executors
Programming Environment
โ€ข Glue Job
โ€ข Minimum DPU: 2
โ€ข Default DPU: 10
โ€ข Ex) 10 DPU Job
โ€ข 10 node cluster
โ€ข 1 Master + 9 Core Nodes
โ€ข 18 executors
โ€ข 1 driver
โ€ข 17 executors
Programming Environment
โ€ข Internal argument to AWS Glue
โ€ข --conf
โ€ข --debug
โ€ข --mode
โ€ข --JOB_NAME
Basics of ETL Job Programming
1. Initialize
2. Read
3. Transform data
4. Write
## Initialize
glueContext = GlueContext(SparkContext.getOrCreate())
## Create DynamicFrame and retrieve data from source
ds0 = glueContext.create_dynamic_frame.from_catalog (
database = "mysql", table_name = "customer",
transformation_ctx = "ds0")
## Implement data transformation here
ds1 = ds0 ...
## Write DynamicFrame from Catalog
ds2 = glueContext.write_dynamic_frame.from_catalog (
frame = ds1, database = "redshift",
table_name = "customer_dim",
redshift_tmp_dir = args["TempDir"],
transformation_ctx = "ds2")
What is Apache Spark?
Parallel, scale-out data processing engine
Fault-tolerance built-in
Flexible interface: Python scripting, SQL
Rich eco-system: ML, Graph, analytics, โ€ฆ
Apache Spark and AWS Glue ETL
Spark core: RDDs
SparkSQL
Dataframes DynamicFrames
AWS Glue ETL
AWS Glue ETL libraries
Integration: Data Catalog, job orchestration,
code-generation, job bookmarks, S3, RDS
ETL transforms, more connectors & formats
New data structure: DynamicFrames
Dataframes
Core data structure for SparkSQL
Like structured tables
Need schema up-front
Each row has same structure
Suited for SQL-like analytics
Dataframes and Dynamic Frames
Dynamic Frames
Like dataframes for ETL
Designed for processing semi-structured data,
e.g. JSON, Avro, Apache logs ...
Public GitHub timeline is โ€ฆ
35+ event types
semi-structured
payload structure
and size varies by
event type
ยฉ 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
schema per-record, no up-front schema needed
Easy to restructure, tag, modify
Can be more compact than dataframe rows
Many flows can be done in single-pass
ยง {โ€œidโ€:โ€2489โ€, โ€œtypeโ€:
โ€CreateEventโ€,
โ€payloadโ€: {โ€œcreatorโ€:โ€ฆ}, โ€ฆ}
Dynamic Records
typeid typeid
Dynamic Frame Schema
typeid
Dynamic Frame internals
{โ€œidโ€:4391, โ€œtypeโ€: โ€œPullEventโ€,
โ€payloadโ€: {โ€œassetsโ€:โ€ฆ}, โ€ฆ}
typeid
{โ€œidโ€:โ€6510โ€, โ€œtypeโ€: โ€œPushEventโ€,
โ€payloadโ€: {โ€œpusherโ€:โ€ฆ}, โ€ฆ}
id
ResolveChoice() B B B
project
B
cast
B
separate into cols
B B
ApplyMapping() A
X Y
A X Y
C
15+ transforms out-of-the box
Dynamic Frame transforms
Semi-structured schema Relational schema
FKA B B C.X C.Y
PK ValueOffset
A C D [ ]
X Y
B B
Transforms and adds new columns, types, and tables on-the-fly
Tracks keys and foreign keys across runs
SQL on the relational schema is orders of magnitude faster than JSON processing
Relationalize() transform
toDF(): Convert to a Dataframe
fromDF(): Convert from a Dataframe
Spigot(): Sample data of any Dynamic Frame to S3
Unbox(): Parse string column as given format into Dynamic Frame
Filter(), Map(): Apply Python UDFs to Dynamic Frames
Join(): Join two Dynamic Frames
And more โ€ฆ.
Useful AWS Glue transforms
0
200
400
600
800
1000
1200
1400
1600
1800
Day Month Year
GitHub Timeline ETL Performance
DynamicFrames DataFrames
Time(sec)
On average: 2x performance
improvement
Data size (# files)
24 744 8699
Performance: AWS Glue ETL
Configuration
10 DPUs
Apache Spark 2.1.1
Workload
JSON to CSV
Filter for Pull events
(lower is better)
Lots of small files, e.g. Kinesis Firehose
Vanilla Apache Spark (2.1.1) overheads
Must reconstruct partitions (2-pass)
Too many tasks: task per file
Scheduling & memory overheads
AWS Glue Dynamic Frames
Integration with Data Catalog
Automatically group files per task
Rely on crawler statistics
Performance: Lots of small files
0
1000
2000
3000
4000
5000
6000
7000
8000
1:2K 20:40K 40:80K 80:160K 160:320K 320:640K 640: 1280K
AWS Glue ETL small file scalability
Spark Glue
1.2 Million Files
Spark
Out-Of-Memory
>= 320: 640K files
Grouping
Time(sec)
# partitions : # files
AWS Glue execution model: data partitions
โ€ข Apache Spark and AWS Glue
are data parallel.
โ€ข Data is divided into partitions
that are processed
concurrently.
โ€ข A stage is a set of parallel
tasks โ€“ one task per partition
Driver
Executors Overall throughput is limited
by the number of partitions
AWS Glue execution model: jobs and stages
AWS Glue execution model: jobs and stages
Actions
AWS Glue execution model: jobs and stages
Jobs
AWS Glue execution model: jobs and stages
Repartition
FilterRead
Drop
Nulls
Write
Read Show
Job 1
Job 2
Stage 1
Stage 2
Stage 1
Apply
Mapping
Filter
Apply
Mapping Jobs
โ€ข How is your dataset
partitioned?
โ€ข How is your application
divided into jobs and
stages?
โ€ข Data is divided into
partitions that are
processed concurrently
AWS Glue performance: key questions
Enabling job metrics
Item1: Processing lots of small files
Example: Processing lots of small files
โ€ข Let's look at a straightforward JSON to Parquet conversion job
โ€ข 1.28 million JSON files in 640 partitions:
Example: Processing lots of small files
โ€ข First try: use a standard SparkSQL job
Example: Processing lots of small files
Example: Processing lots of small files
Example: Processing lots of small files
โ€ข Driver memory use is growing fast and approaching the 5g max.
Example: Processing lots of small files
โ€ข Case 2: Run using AWS Glue DynamicFrames.
Example: Processing lots of small files
Example: Processing lots of small files
Driver memory remains below 50%
for the entire duration of execution.
Example: Processing lots of small files
Example: Processing lots of small files
Options for grouping files
โ€ข groupFiles
โ€ข inPartition: within a partition.
โ€ข acrossPartition: from different partitions.
โ€ข groupSize
โ€ข Target size of each group.
Example: Aggressively grouping files
โ€ข Execution is much slower, but hasn't crashed.
"groupFiles": "acrossPartition"
Example: Aggressively grouping files
Executor memory is higher than driver. Only one executor is active.
Item2: Processing a few large files
Example: Processing a few large files
โ€ข Let's see how this looks on a sample dataset of 5 large csv files.
โ€ข Each file is
โ€ข 12.5 GB uncompressed
โ€ข 1.6 GB gzip
โ€ข 1.3 GB bzip2
โ€ข Script converts data to Parquet.
Example: Processing a few large gzip files
โ€ข We only have 5 partitions โ€“ one for each file.
โ€ข Job fails after 2 hours.
Example: Processing a few large bzip2 files
โ€ข Bzip2 files can be split into blocks, so we see up to 104 tasks.
โ€ข Job completes in 18 minutes.
Example: Processing a few large bzip2 files
โ€ข With 15 DPU, the number of active executors closely tracks the maximum needed
number of executors.
Example: Processing a few large uncompressed files
โ€ข Uncompressed files can be split into lines, so we construct 64MB partitions.
โ€ข Job completes in 12 minutes.
Example: Processing a few large files
โ€ข If you have a choice of compression type, prefer bzip2.
โ€ข If you are using gzip, make sure you have enough files to fully utilize your resources.
โ€ข Bandwidth is rarely the bottleneck for AWS Glue jobs, so consider leaving files
uncompressed.
Item3: Optimizing parallelism
Example: optimizing parallelism
Processing large, split-able bzip2 files.
With 10 DPU, metric maximum needed executors shows room for scaling.
ยง 17 Executors (Maximum Allocated Executors)
ยง 10 DPU = 10 Node Cluster = 1 Master + 9 Core Node
ยง 9 Core Node = 18 Executors = 1 Driver + 17 Executors
ยง 27 Executors (Maximum Needed Executors)
ยง 1 Driver + 27 Executors = 28 Executors = 14 Core Node
ยง 14 Core Node + 1 Master = 15 Node Cluster = 15 DPU
DPU
Example: optimizing parallelism
With 15 DPU, active executors closely tracks maximum needed executors.
Item4: JDBC partitions
AWS Glue JDBC partitions
โ€ข For JDBC sources, by default each table is read as a single partition.
โ€ข AWS Glue automatically partitions datasets with fewer than 10
partitions after the data has been loaded.
Reading JDBC partitions
Reading JDBC partitions
Reading JDBC partitions
A single executor is used
for the JDBC query
Data is repartitioned for
the rest of the job.
Options for reading database tables in parallel
โ€ข hashexpression โ€“ Integer expression to use for distribution.
โ€ข hashfield โ€“ Single column to use for distribution.
โ€ข hashpartitions โ€“ Number of parallel queries to make. Default is 7.
โ€ข Turns into a collection of queries of the form
Options for reading database tables in parallel
โ€ข Guidelines for picking distribution keys.
โ€ข For hashexpression, choose a column that is evenly distributed across values. A primary key works well.
โ€ข If no such field exists, use hashfield to define one.
โ€ข Example: The taxi dataset does not have a primary key, so we set hashfield to
partition based on day of the month:
datasource0 = glueContext.create_dynamic_frame.from_catalog(
database = "nyctaxi",
table_name = "green-mysql-large",
additional_options={'hashfield': 'day(lpep_pickup_datetime)',
'hashpartitions': 15})
Options for reading database tables in parallel
โ€ข Four executors can process 16 partitions concurrently.
Options for reading database tables in parallel
โ€ข Make sure to understand impact to database engine.
Job Bookmarks for JDBC Queries
โ€ข Job bookmarks only work when the source table has an ordered
primary key.
โ€ข Updates are not handled today.
Item5: Python performance
Python performance
โ€ข Using map and filter in Python
is expensive for large data sets.
โ€ข All data is serialized and sent
between the JVM and Python.
โ€ข Alternatives
โ€ข Use AWS Glue Scala SDK.
โ€ข Convert to DataFrame and use Spark
SQL expressions.
Spark JVM
Python VM
Item6: Scheduler
Glue
Glue
Boto3
Jenkins with boto3
Oozie with boto3
Airflow with boto3
Item7: Python shell
Announcing a new job type: Python shell
A new cost-effective ETL primitive for small to
medium tasks
Python
shell 3rd party
service
AWS Glue Python shell specs
Python 2.7 environment with
boto3, awscli, numpy, scipy, pandas, scikit-learn, PyGreSQL, โ€ฆ
cold spin-up: < 20 sec, support for VPCs, no runtime limit
sizes: 1 DPU (includes 16GB), and 1/16 DPU (includes 1GB)
pricing: $0.44 per DPU-hour, 1-min minimum, per-second billing
Python shell collaborative filtering example
Amazon customer reviews dataset (s3://amazon-reviews-pds)
Video category
Compute low-rank approx of (Customer x Product) ratings using SVD
uses scipy sparse matrix and SVD library
Step Time (sec)
Redshift COPY 13
Extract ratings 5
Generate matrix 1552
SVD (k=1000) 2575
Total 4145
69 min
$0.60
๋” ๋‚˜์€ ์„ธ๋ฏธ๋‚˜๋ฅผ ์œ„ํ•ด
์—ฌ๋Ÿฌ๋ถ„์˜ ์˜๊ฒฌ์„ ๋‚จ๊ฒจ์ฃผ์„ธ์š”!
โ–ถ ์งˆ๋ฌธ์— ๋Œ€ํ•œ ๋‹ต๋ณ€ ๋“œ๋ฆฝ๋‹ˆ๋‹ค.
โ–ถ ๋ฐœํ‘œ์ž๋ฃŒ/๋…นํ™”์˜์ƒ์„ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค.
http://bit.ly/awskr-webinar

More Related Content

What's hot

AWS Summit Seoul 2023 | ์‚ผ์„ฑ์ „์ž/์ฟ ํŒก์˜ ๋Œ€๊ทœ๋ชจ ํŠธ๋ž˜ํ”ฝ ์ฒ˜๋ฆฌ๋ฅผ ์œ„ํ•œ ํด๋ผ์šฐ๋“œ ๋„ค์ดํ‹ฐ๋ธŒ ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค ํ™œ์šฉ
AWS Summit Seoul 2023 | ์‚ผ์„ฑ์ „์ž/์ฟ ํŒก์˜ ๋Œ€๊ทœ๋ชจ ํŠธ๋ž˜ํ”ฝ ์ฒ˜๋ฆฌ๋ฅผ ์œ„ํ•œ ํด๋ผ์šฐ๋“œ ๋„ค์ดํ‹ฐ๋ธŒ ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค ํ™œ์šฉAWS Summit Seoul 2023 | ์‚ผ์„ฑ์ „์ž/์ฟ ํŒก์˜ ๋Œ€๊ทœ๋ชจ ํŠธ๋ž˜ํ”ฝ ์ฒ˜๋ฆฌ๋ฅผ ์œ„ํ•œ ํด๋ผ์šฐ๋“œ ๋„ค์ดํ‹ฐ๋ธŒ ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค ํ™œ์šฉ
AWS Summit Seoul 2023 | ์‚ผ์„ฑ์ „์ž/์ฟ ํŒก์˜ ๋Œ€๊ทœ๋ชจ ํŠธ๋ž˜ํ”ฝ ์ฒ˜๋ฆฌ๋ฅผ ์œ„ํ•œ ํด๋ผ์šฐ๋“œ ๋„ค์ดํ‹ฐ๋ธŒ ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค ํ™œ์šฉ
Amazon Web Services Korea
ย 
AWS EMR Cost optimization
AWS EMR Cost optimizationAWS EMR Cost optimization
AWS EMR Cost optimization
SANG WON PARK
ย 
ํšจ์œจ์ ์ธ ๋น…๋ฐ์ดํ„ฐ ๋ถ„์„ ๋ฐ ์ฒ˜๋ฆฌ๋ฅผ ์œ„ํ•œ Glue, EMR ํ™œ์šฉ - ๊น€ํƒœํ˜„ ์†”๋ฃจ์…˜์ฆˆ ์•„ํ‚คํ…ํŠธ, AWS :: AWS Summit Seoul 2019
ํšจ์œจ์ ์ธ ๋น…๋ฐ์ดํ„ฐ ๋ถ„์„ ๋ฐ ์ฒ˜๋ฆฌ๋ฅผ ์œ„ํ•œ Glue, EMR ํ™œ์šฉ - ๊น€ํƒœํ˜„ ์†”๋ฃจ์…˜์ฆˆ ์•„ํ‚คํ…ํŠธ, AWS :: AWS Summit Seoul 2019ํšจ์œจ์ ์ธ ๋น…๋ฐ์ดํ„ฐ ๋ถ„์„ ๋ฐ ์ฒ˜๋ฆฌ๋ฅผ ์œ„ํ•œ Glue, EMR ํ™œ์šฉ - ๊น€ํƒœํ˜„ ์†”๋ฃจ์…˜์ฆˆ ์•„ํ‚คํ…ํŠธ, AWS :: AWS Summit Seoul 2019
ํšจ์œจ์ ์ธ ๋น…๋ฐ์ดํ„ฐ ๋ถ„์„ ๋ฐ ์ฒ˜๋ฆฌ๋ฅผ ์œ„ํ•œ Glue, EMR ํ™œ์šฉ - ๊น€ํƒœํ˜„ ์†”๋ฃจ์…˜์ฆˆ ์•„ํ‚คํ…ํŠธ, AWS :: AWS Summit Seoul 2019
Amazon Web Services Korea
ย 
ํšจ๊ณผ์ ์ธ NoSQL (Elasticahe / DynamoDB) ๋””์ž์ธ ๋ฐ ํ™œ์šฉ ๋ฐฉ์•ˆ (์ตœ์œ ์ • & ์ตœํ™์‹, AWS ์†”๋ฃจ์…˜์ฆˆ ์•„ํ‚คํ…ํŠธ) :: ...
ํšจ๊ณผ์ ์ธ NoSQL (Elasticahe / DynamoDB) ๋””์ž์ธ ๋ฐ ํ™œ์šฉ ๋ฐฉ์•ˆ (์ตœ์œ ์ • & ์ตœํ™์‹, AWS ์†”๋ฃจ์…˜์ฆˆ ์•„ํ‚คํ…ํŠธ) :: ...ํšจ๊ณผ์ ์ธ NoSQL (Elasticahe / DynamoDB) ๋””์ž์ธ ๋ฐ ํ™œ์šฉ ๋ฐฉ์•ˆ (์ตœ์œ ์ • & ์ตœํ™์‹, AWS ์†”๋ฃจ์…˜์ฆˆ ์•„ํ‚คํ…ํŠธ) :: ...
ํšจ๊ณผ์ ์ธ NoSQL (Elasticahe / DynamoDB) ๋””์ž์ธ ๋ฐ ํ™œ์šฉ ๋ฐฉ์•ˆ (์ตœ์œ ์ • & ์ตœํ™์‹, AWS ์†”๋ฃจ์…˜์ฆˆ ์•„ํ‚คํ…ํŠธ) :: ...
Amazon Web Services Korea
ย 
Oracle DB๋ฅผ AWS๋กœ ์ด๊ด€ํ•˜๋Š” ๋ฐฉ๋ฒ•๋“ค - ์„œํ˜ธ์„ ํด๋ผ์šฐ๋“œ ์‚ฌ์—…๋ถ€/์ปจ์„คํŒ…ํŒ€ ์ด์‚ฌ, ์˜์šฐ๋””์ง€ํƒˆ :: AWS Summit Seoul 2021
Oracle DB๋ฅผ AWS๋กœ ์ด๊ด€ํ•˜๋Š” ๋ฐฉ๋ฒ•๋“ค - ์„œํ˜ธ์„ ํด๋ผ์šฐ๋“œ ์‚ฌ์—…๋ถ€/์ปจ์„คํŒ…ํŒ€ ์ด์‚ฌ, ์˜์šฐ๋””์ง€ํƒˆ :: AWS Summit Seoul 2021Oracle DB๋ฅผ AWS๋กœ ์ด๊ด€ํ•˜๋Š” ๋ฐฉ๋ฒ•๋“ค - ์„œํ˜ธ์„ ํด๋ผ์šฐ๋“œ ์‚ฌ์—…๋ถ€/์ปจ์„คํŒ…ํŒ€ ์ด์‚ฌ, ์˜์šฐ๋””์ง€ํƒˆ :: AWS Summit Seoul 2021
Oracle DB๋ฅผ AWS๋กœ ์ด๊ด€ํ•˜๋Š” ๋ฐฉ๋ฒ•๋“ค - ์„œํ˜ธ์„ ํด๋ผ์šฐ๋“œ ์‚ฌ์—…๋ถ€/์ปจ์„คํŒ…ํŒ€ ์ด์‚ฌ, ์˜์šฐ๋””์ง€ํƒˆ :: AWS Summit Seoul 2021
Amazon Web Services Korea
ย 
AWS๋ฅผ ํ™œ์šฉํ•œ Digital Manufacturing ์‹คํ˜„ ๋ฐฉ๋ฒ• ๋ฐ ์‚ฌ๋ก€ ์†Œ๊ฐœ - Douglas Bellin, ์›”๋“œ์™€์ด๋“œ ์ œ์กฐ ์†”๋ฃจ์…˜ ๋‹ด...
AWS๋ฅผ ํ™œ์šฉํ•œ Digital Manufacturing ์‹คํ˜„ ๋ฐฉ๋ฒ• ๋ฐ ์‚ฌ๋ก€ ์†Œ๊ฐœ - Douglas Bellin, ์›”๋“œ์™€์ด๋“œ ์ œ์กฐ ์†”๋ฃจ์…˜ ๋‹ด...AWS๋ฅผ ํ™œ์šฉํ•œ Digital Manufacturing ์‹คํ˜„ ๋ฐฉ๋ฒ• ๋ฐ ์‚ฌ๋ก€ ์†Œ๊ฐœ - Douglas Bellin, ์›”๋“œ์™€์ด๋“œ ์ œ์กฐ ์†”๋ฃจ์…˜ ๋‹ด...
AWS๋ฅผ ํ™œ์šฉํ•œ Digital Manufacturing ์‹คํ˜„ ๋ฐฉ๋ฒ• ๋ฐ ์‚ฌ๋ก€ ์†Œ๊ฐœ - Douglas Bellin, ์›”๋“œ์™€์ด๋“œ ์ œ์กฐ ์†”๋ฃจ์…˜ ๋‹ด...
Amazon Web Services Korea
ย 
Amazon OpenSearch Deep dive - ๋‚ด๋ถ€๊ตฌ์กฐ, ์„ฑ๋Šฅ์ตœ์ ํ™” ๊ทธ๋ฆฌ๊ณ  ์Šค์ผ€์ผ๋ง
Amazon OpenSearch Deep dive - ๋‚ด๋ถ€๊ตฌ์กฐ, ์„ฑ๋Šฅ์ตœ์ ํ™” ๊ทธ๋ฆฌ๊ณ  ์Šค์ผ€์ผ๋งAmazon OpenSearch Deep dive - ๋‚ด๋ถ€๊ตฌ์กฐ, ์„ฑ๋Šฅ์ตœ์ ํ™” ๊ทธ๋ฆฌ๊ณ  ์Šค์ผ€์ผ๋ง
Amazon OpenSearch Deep dive - ๋‚ด๋ถ€๊ตฌ์กฐ, ์„ฑ๋Šฅ์ตœ์ ํ™” ๊ทธ๋ฆฌ๊ณ  ์Šค์ผ€์ผ๋ง
Amazon Web Services Korea
ย 
AWS Summit Seoul 2023 | Snowflake: ๋ชจ๋“  ๋ฐ์ดํ„ฐ ์›Œํฌ๋กœ๋“œ๋ฅผ ์œ„ํ•œ ํ•˜๋‚˜์˜ ํด๋ผ์šฐ๋“œ ๋ฐ์ดํ„ฐ ํ”Œ๋žซํผ
AWS Summit Seoul 2023 | Snowflake: ๋ชจ๋“  ๋ฐ์ดํ„ฐ ์›Œํฌ๋กœ๋“œ๋ฅผ ์œ„ํ•œ ํ•˜๋‚˜์˜ ํด๋ผ์šฐ๋“œ ๋ฐ์ดํ„ฐ ํ”Œ๋žซํผAWS Summit Seoul 2023 | Snowflake: ๋ชจ๋“  ๋ฐ์ดํ„ฐ ์›Œํฌ๋กœ๋“œ๋ฅผ ์œ„ํ•œ ํ•˜๋‚˜์˜ ํด๋ผ์šฐ๋“œ ๋ฐ์ดํ„ฐ ํ”Œ๋žซํผ
AWS Summit Seoul 2023 | Snowflake: ๋ชจ๋“  ๋ฐ์ดํ„ฐ ์›Œํฌ๋กœ๋“œ๋ฅผ ์œ„ํ•œ ํ•˜๋‚˜์˜ ํด๋ผ์šฐ๋“œ ๋ฐ์ดํ„ฐ ํ”Œ๋žซํผ
Amazon Web Services Korea
ย 
๊ธ€๋กœ๋ฒŒ ๊ธฐ์—…๋“ค์˜ ํšจ๊ณผ์ ์ธ ๋ฐ์ดํ„ฐ ๋ถ„์„์„ ์œ„ํ•œ Data Lake ๊ตฌ์ถ• ๋ฐ ๋ถ„์„ ์‚ฌ๋ก€ - ๊น€์ค€ํ˜• (AWS ์†”๋ฃจ์…˜์ฆˆ ์•„ํ‚คํ…ํŠธ)
๊ธ€๋กœ๋ฒŒ ๊ธฐ์—…๋“ค์˜ ํšจ๊ณผ์ ์ธ ๋ฐ์ดํ„ฐ ๋ถ„์„์„ ์œ„ํ•œ Data Lake ๊ตฌ์ถ• ๋ฐ ๋ถ„์„ ์‚ฌ๋ก€ - ๊น€์ค€ํ˜• (AWS ์†”๋ฃจ์…˜์ฆˆ ์•„ํ‚คํ…ํŠธ)๊ธ€๋กœ๋ฒŒ ๊ธฐ์—…๋“ค์˜ ํšจ๊ณผ์ ์ธ ๋ฐ์ดํ„ฐ ๋ถ„์„์„ ์œ„ํ•œ Data Lake ๊ตฌ์ถ• ๋ฐ ๋ถ„์„ ์‚ฌ๋ก€ - ๊น€์ค€ํ˜• (AWS ์†”๋ฃจ์…˜์ฆˆ ์•„ํ‚คํ…ํŠธ)
๊ธ€๋กœ๋ฒŒ ๊ธฐ์—…๋“ค์˜ ํšจ๊ณผ์ ์ธ ๋ฐ์ดํ„ฐ ๋ถ„์„์„ ์œ„ํ•œ Data Lake ๊ตฌ์ถ• ๋ฐ ๋ถ„์„ ์‚ฌ๋ก€ - ๊น€์ค€ํ˜• (AWS ์†”๋ฃจ์…˜์ฆˆ ์•„ํ‚คํ…ํŠธ)
Amazon Web Services Korea
ย 

What's hot (20)

AWS Glue - let's get stuck in!
AWS Glue - let's get stuck in!AWS Glue - let's get stuck in!
AWS Glue - let's get stuck in!
ย 
AWS Summit Seoul 2023 | ์‚ผ์„ฑ์ „์ž/์ฟ ํŒก์˜ ๋Œ€๊ทœ๋ชจ ํŠธ๋ž˜ํ”ฝ ์ฒ˜๋ฆฌ๋ฅผ ์œ„ํ•œ ํด๋ผ์šฐ๋“œ ๋„ค์ดํ‹ฐ๋ธŒ ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค ํ™œ์šฉ
AWS Summit Seoul 2023 | ์‚ผ์„ฑ์ „์ž/์ฟ ํŒก์˜ ๋Œ€๊ทœ๋ชจ ํŠธ๋ž˜ํ”ฝ ์ฒ˜๋ฆฌ๋ฅผ ์œ„ํ•œ ํด๋ผ์šฐ๋“œ ๋„ค์ดํ‹ฐ๋ธŒ ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค ํ™œ์šฉAWS Summit Seoul 2023 | ์‚ผ์„ฑ์ „์ž/์ฟ ํŒก์˜ ๋Œ€๊ทœ๋ชจ ํŠธ๋ž˜ํ”ฝ ์ฒ˜๋ฆฌ๋ฅผ ์œ„ํ•œ ํด๋ผ์šฐ๋“œ ๋„ค์ดํ‹ฐ๋ธŒ ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค ํ™œ์šฉ
AWS Summit Seoul 2023 | ์‚ผ์„ฑ์ „์ž/์ฟ ํŒก์˜ ๋Œ€๊ทœ๋ชจ ํŠธ๋ž˜ํ”ฝ ์ฒ˜๋ฆฌ๋ฅผ ์œ„ํ•œ ํด๋ผ์šฐ๋“œ ๋„ค์ดํ‹ฐ๋ธŒ ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค ํ™œ์šฉ
ย 
Deep Dive on Amazon Athena - AWS Online Tech Talks
Deep Dive on Amazon Athena - AWS Online Tech TalksDeep Dive on Amazon Athena - AWS Online Tech Talks
Deep Dive on Amazon Athena - AWS Online Tech Talks
ย 
AWS EMR Cost optimization
AWS EMR Cost optimizationAWS EMR Cost optimization
AWS EMR Cost optimization
ย 
Amazon EMR Masterclass
Amazon EMR MasterclassAmazon EMR Masterclass
Amazon EMR Masterclass
ย 
ํšจ์œจ์ ์ธ ๋น…๋ฐ์ดํ„ฐ ๋ถ„์„ ๋ฐ ์ฒ˜๋ฆฌ๋ฅผ ์œ„ํ•œ Glue, EMR ํ™œ์šฉ - ๊น€ํƒœํ˜„ ์†”๋ฃจ์…˜์ฆˆ ์•„ํ‚คํ…ํŠธ, AWS :: AWS Summit Seoul 2019
ํšจ์œจ์ ์ธ ๋น…๋ฐ์ดํ„ฐ ๋ถ„์„ ๋ฐ ์ฒ˜๋ฆฌ๋ฅผ ์œ„ํ•œ Glue, EMR ํ™œ์šฉ - ๊น€ํƒœํ˜„ ์†”๋ฃจ์…˜์ฆˆ ์•„ํ‚คํ…ํŠธ, AWS :: AWS Summit Seoul 2019ํšจ์œจ์ ์ธ ๋น…๋ฐ์ดํ„ฐ ๋ถ„์„ ๋ฐ ์ฒ˜๋ฆฌ๋ฅผ ์œ„ํ•œ Glue, EMR ํ™œ์šฉ - ๊น€ํƒœํ˜„ ์†”๋ฃจ์…˜์ฆˆ ์•„ํ‚คํ…ํŠธ, AWS :: AWS Summit Seoul 2019
ํšจ์œจ์ ์ธ ๋น…๋ฐ์ดํ„ฐ ๋ถ„์„ ๋ฐ ์ฒ˜๋ฆฌ๋ฅผ ์œ„ํ•œ Glue, EMR ํ™œ์šฉ - ๊น€ํƒœํ˜„ ์†”๋ฃจ์…˜์ฆˆ ์•„ํ‚คํ…ํŠธ, AWS :: AWS Summit Seoul 2019
ย 
Introduction to Amazon Athena
Introduction to Amazon AthenaIntroduction to Amazon Athena
Introduction to Amazon Athena
ย 
ํšจ๊ณผ์ ์ธ NoSQL (Elasticahe / DynamoDB) ๋””์ž์ธ ๋ฐ ํ™œ์šฉ ๋ฐฉ์•ˆ (์ตœ์œ ์ • & ์ตœํ™์‹, AWS ์†”๋ฃจ์…˜์ฆˆ ์•„ํ‚คํ…ํŠธ) :: ...
ํšจ๊ณผ์ ์ธ NoSQL (Elasticahe / DynamoDB) ๋””์ž์ธ ๋ฐ ํ™œ์šฉ ๋ฐฉ์•ˆ (์ตœ์œ ์ • & ์ตœํ™์‹, AWS ์†”๋ฃจ์…˜์ฆˆ ์•„ํ‚คํ…ํŠธ) :: ...ํšจ๊ณผ์ ์ธ NoSQL (Elasticahe / DynamoDB) ๋””์ž์ธ ๋ฐ ํ™œ์šฉ ๋ฐฉ์•ˆ (์ตœ์œ ์ • & ์ตœํ™์‹, AWS ์†”๋ฃจ์…˜์ฆˆ ์•„ํ‚คํ…ํŠธ) :: ...
ํšจ๊ณผ์ ์ธ NoSQL (Elasticahe / DynamoDB) ๋””์ž์ธ ๋ฐ ํ™œ์šฉ ๋ฐฉ์•ˆ (์ตœ์œ ์ • & ์ตœํ™์‹, AWS ์†”๋ฃจ์…˜์ฆˆ ์•„ํ‚คํ…ํŠธ) :: ...
ย 
Introduction to AWS Glue: Data Analytics Week at the SF Loft
Introduction to AWS Glue: Data Analytics Week at the SF LoftIntroduction to AWS Glue: Data Analytics Week at the SF Loft
Introduction to AWS Glue: Data Analytics Week at the SF Loft
ย 
Migrating Databases to the Cloud: Introduction to AWS DMS - SRV215 - Chicago ...
Migrating Databases to the Cloud: Introduction to AWS DMS - SRV215 - Chicago ...Migrating Databases to the Cloud: Introduction to AWS DMS - SRV215 - Chicago ...
Migrating Databases to the Cloud: Introduction to AWS DMS - SRV215 - Chicago ...
ย 
์›Œํฌ๋กœ๋“œ ํŠน์„ฑ์— ๋”ฐ๋ฅธ ์•ˆ์ „ํ•˜๊ณ  ํšจ์œจ์ ์ธ Data Lake ์šด์˜ ๋ฐฉ์•ˆ
์›Œํฌ๋กœ๋“œ ํŠน์„ฑ์— ๋”ฐ๋ฅธ ์•ˆ์ „ํ•˜๊ณ  ํšจ์œจ์ ์ธ Data Lake ์šด์˜ ๋ฐฉ์•ˆ์›Œํฌ๋กœ๋“œ ํŠน์„ฑ์— ๋”ฐ๋ฅธ ์•ˆ์ „ํ•˜๊ณ  ํšจ์œจ์ ์ธ Data Lake ์šด์˜ ๋ฐฉ์•ˆ
์›Œํฌ๋กœ๋“œ ํŠน์„ฑ์— ๋”ฐ๋ฅธ ์•ˆ์ „ํ•˜๊ณ  ํšจ์œจ์ ์ธ Data Lake ์šด์˜ ๋ฐฉ์•ˆ
ย 
Amazon DocumentDB - Architecture ๋ฐ Best Practice (Level 200) - ๋ฐœํ‘œ์ž: ์žฅ๋™ํ›ˆ, Sr. ...
Amazon DocumentDB - Architecture ๋ฐ Best Practice (Level 200) - ๋ฐœํ‘œ์ž: ์žฅ๋™ํ›ˆ, Sr. ...Amazon DocumentDB - Architecture ๋ฐ Best Practice (Level 200) - ๋ฐœํ‘œ์ž: ์žฅ๋™ํ›ˆ, Sr. ...
Amazon DocumentDB - Architecture ๋ฐ Best Practice (Level 200) - ๋ฐœํ‘œ์ž: ์žฅ๋™ํ›ˆ, Sr. ...
ย 
Oracle DB๋ฅผ AWS๋กœ ์ด๊ด€ํ•˜๋Š” ๋ฐฉ๋ฒ•๋“ค - ์„œํ˜ธ์„ ํด๋ผ์šฐ๋“œ ์‚ฌ์—…๋ถ€/์ปจ์„คํŒ…ํŒ€ ์ด์‚ฌ, ์˜์šฐ๋””์ง€ํƒˆ :: AWS Summit Seoul 2021
Oracle DB๋ฅผ AWS๋กœ ์ด๊ด€ํ•˜๋Š” ๋ฐฉ๋ฒ•๋“ค - ์„œํ˜ธ์„ ํด๋ผ์šฐ๋“œ ์‚ฌ์—…๋ถ€/์ปจ์„คํŒ…ํŒ€ ์ด์‚ฌ, ์˜์šฐ๋””์ง€ํƒˆ :: AWS Summit Seoul 2021Oracle DB๋ฅผ AWS๋กœ ์ด๊ด€ํ•˜๋Š” ๋ฐฉ๋ฒ•๋“ค - ์„œํ˜ธ์„ ํด๋ผ์šฐ๋“œ ์‚ฌ์—…๋ถ€/์ปจ์„คํŒ…ํŒ€ ์ด์‚ฌ, ์˜์šฐ๋””์ง€ํƒˆ :: AWS Summit Seoul 2021
Oracle DB๋ฅผ AWS๋กœ ์ด๊ด€ํ•˜๋Š” ๋ฐฉ๋ฒ•๋“ค - ์„œํ˜ธ์„ ํด๋ผ์šฐ๋“œ ์‚ฌ์—…๋ถ€/์ปจ์„คํŒ…ํŒ€ ์ด์‚ฌ, ์˜์šฐ๋””์ง€ํƒˆ :: AWS Summit Seoul 2021
ย 
AWS๋ฅผ ํ™œ์šฉํ•œ Digital Manufacturing ์‹คํ˜„ ๋ฐฉ๋ฒ• ๋ฐ ์‚ฌ๋ก€ ์†Œ๊ฐœ - Douglas Bellin, ์›”๋“œ์™€์ด๋“œ ์ œ์กฐ ์†”๋ฃจ์…˜ ๋‹ด...
AWS๋ฅผ ํ™œ์šฉํ•œ Digital Manufacturing ์‹คํ˜„ ๋ฐฉ๋ฒ• ๋ฐ ์‚ฌ๋ก€ ์†Œ๊ฐœ - Douglas Bellin, ์›”๋“œ์™€์ด๋“œ ์ œ์กฐ ์†”๋ฃจ์…˜ ๋‹ด...AWS๋ฅผ ํ™œ์šฉํ•œ Digital Manufacturing ์‹คํ˜„ ๋ฐฉ๋ฒ• ๋ฐ ์‚ฌ๋ก€ ์†Œ๊ฐœ - Douglas Bellin, ์›”๋“œ์™€์ด๋“œ ์ œ์กฐ ์†”๋ฃจ์…˜ ๋‹ด...
AWS๋ฅผ ํ™œ์šฉํ•œ Digital Manufacturing ์‹คํ˜„ ๋ฐฉ๋ฒ• ๋ฐ ์‚ฌ๋ก€ ์†Œ๊ฐœ - Douglas Bellin, ์›”๋“œ์™€์ด๋“œ ์ œ์กฐ ์†”๋ฃจ์…˜ ๋‹ด...
ย 
Amazon OpenSearch Deep dive - ๋‚ด๋ถ€๊ตฌ์กฐ, ์„ฑ๋Šฅ์ตœ์ ํ™” ๊ทธ๋ฆฌ๊ณ  ์Šค์ผ€์ผ๋ง
Amazon OpenSearch Deep dive - ๋‚ด๋ถ€๊ตฌ์กฐ, ์„ฑ๋Šฅ์ตœ์ ํ™” ๊ทธ๋ฆฌ๊ณ  ์Šค์ผ€์ผ๋งAmazon OpenSearch Deep dive - ๋‚ด๋ถ€๊ตฌ์กฐ, ์„ฑ๋Šฅ์ตœ์ ํ™” ๊ทธ๋ฆฌ๊ณ  ์Šค์ผ€์ผ๋ง
Amazon OpenSearch Deep dive - ๋‚ด๋ถ€๊ตฌ์กฐ, ์„ฑ๋Šฅ์ตœ์ ํ™” ๊ทธ๋ฆฌ๊ณ  ์Šค์ผ€์ผ๋ง
ย 
AWS Application Discovery Service
AWS Application Discovery ServiceAWS Application Discovery Service
AWS Application Discovery Service
ย 
Building Serverless ETL Pipelines with AWS Glue
Building Serverless ETL Pipelines with AWS GlueBuilding Serverless ETL Pipelines with AWS Glue
Building Serverless ETL Pipelines with AWS Glue
ย 
AWS Summit Seoul 2023 | Snowflake: ๋ชจ๋“  ๋ฐ์ดํ„ฐ ์›Œํฌ๋กœ๋“œ๋ฅผ ์œ„ํ•œ ํ•˜๋‚˜์˜ ํด๋ผ์šฐ๋“œ ๋ฐ์ดํ„ฐ ํ”Œ๋žซํผ
AWS Summit Seoul 2023 | Snowflake: ๋ชจ๋“  ๋ฐ์ดํ„ฐ ์›Œํฌ๋กœ๋“œ๋ฅผ ์œ„ํ•œ ํ•˜๋‚˜์˜ ํด๋ผ์šฐ๋“œ ๋ฐ์ดํ„ฐ ํ”Œ๋žซํผAWS Summit Seoul 2023 | Snowflake: ๋ชจ๋“  ๋ฐ์ดํ„ฐ ์›Œํฌ๋กœ๋“œ๋ฅผ ์œ„ํ•œ ํ•˜๋‚˜์˜ ํด๋ผ์šฐ๋“œ ๋ฐ์ดํ„ฐ ํ”Œ๋žซํผ
AWS Summit Seoul 2023 | Snowflake: ๋ชจ๋“  ๋ฐ์ดํ„ฐ ์›Œํฌ๋กœ๋“œ๋ฅผ ์œ„ํ•œ ํ•˜๋‚˜์˜ ํด๋ผ์šฐ๋“œ ๋ฐ์ดํ„ฐ ํ”Œ๋žซํผ
ย 
๊ธ€๋กœ๋ฒŒ ๊ธฐ์—…๋“ค์˜ ํšจ๊ณผ์ ์ธ ๋ฐ์ดํ„ฐ ๋ถ„์„์„ ์œ„ํ•œ Data Lake ๊ตฌ์ถ• ๋ฐ ๋ถ„์„ ์‚ฌ๋ก€ - ๊น€์ค€ํ˜• (AWS ์†”๋ฃจ์…˜์ฆˆ ์•„ํ‚คํ…ํŠธ)
๊ธ€๋กœ๋ฒŒ ๊ธฐ์—…๋“ค์˜ ํšจ๊ณผ์ ์ธ ๋ฐ์ดํ„ฐ ๋ถ„์„์„ ์œ„ํ•œ Data Lake ๊ตฌ์ถ• ๋ฐ ๋ถ„์„ ์‚ฌ๋ก€ - ๊น€์ค€ํ˜• (AWS ์†”๋ฃจ์…˜์ฆˆ ์•„ํ‚คํ…ํŠธ)๊ธ€๋กœ๋ฒŒ ๊ธฐ์—…๋“ค์˜ ํšจ๊ณผ์ ์ธ ๋ฐ์ดํ„ฐ ๋ถ„์„์„ ์œ„ํ•œ Data Lake ๊ตฌ์ถ• ๋ฐ ๋ถ„์„ ์‚ฌ๋ก€ - ๊น€์ค€ํ˜• (AWS ์†”๋ฃจ์…˜์ฆˆ ์•„ํ‚คํ…ํŠธ)
๊ธ€๋กœ๋ฒŒ ๊ธฐ์—…๋“ค์˜ ํšจ๊ณผ์ ์ธ ๋ฐ์ดํ„ฐ ๋ถ„์„์„ ์œ„ํ•œ Data Lake ๊ตฌ์ถ• ๋ฐ ๋ถ„์„ ์‚ฌ๋ก€ - ๊น€์ค€ํ˜• (AWS ์†”๋ฃจ์…˜์ฆˆ ์•„ํ‚คํ…ํŠธ)
ย 
Introduction to AWS Glue
Introduction to AWS GlueIntroduction to AWS Glue
Introduction to AWS Glue
ย 

Similar to [AWS Builders] Effective AWS Glue

AWS Lambda and Serverless framework: lessons learned while building a serverl...
AWS Lambda and Serverless framework: lessons learned while building a serverl...AWS Lambda and Serverless framework: lessons learned while building a serverl...
AWS Lambda and Serverless framework: lessons learned while building a serverl...
Luciano Mammino
ย 

Similar to [AWS Builders] Effective AWS Glue (20)

Aws-What You Need to Know_Simon Elisha
Aws-What You Need to Know_Simon ElishaAws-What You Need to Know_Simon Elisha
Aws-What You Need to Know_Simon Elisha
ย 
Transactional writes to cloud storage with Eric Liang
Transactional writes to cloud storage with Eric LiangTransactional writes to cloud storage with Eric Liang
Transactional writes to cloud storage with Eric Liang
ย 
PyConIE 2017 Writing and deploying serverless python applications
PyConIE 2017 Writing and deploying serverless python applicationsPyConIE 2017 Writing and deploying serverless python applications
PyConIE 2017 Writing and deploying serverless python applications
ย 
Writing and deploying serverless python applications
Writing and deploying serverless python applicationsWriting and deploying serverless python applications
Writing and deploying serverless python applications
ย 
AWS Lambda and Serverless framework: lessons learned while building a serverl...
AWS Lambda and Serverless framework: lessons learned while building a serverl...AWS Lambda and Serverless framework: lessons learned while building a serverl...
AWS Lambda and Serverless framework: lessons learned while building a serverl...
ย 
PyConIT 2018 Writing and deploying serverless python applications
PyConIT 2018 Writing and deploying serverless python applicationsPyConIT 2018 Writing and deploying serverless python applications
PyConIT 2018 Writing and deploying serverless python applications
ย 
I Just Want to Run My Code: Waypoint, Nomad, and Other Things
I Just Want to Run My Code: Waypoint, Nomad, and Other ThingsI Just Want to Run My Code: Waypoint, Nomad, and Other Things
I Just Want to Run My Code: Waypoint, Nomad, and Other Things
ย 
BDA311 Introduction to AWS Glue
BDA311 Introduction to AWS GlueBDA311 Introduction to AWS Glue
BDA311 Introduction to AWS Glue
ย 
Why Scale Matters and How the Cloud is Really Different (at scale)
Why Scale Matters and How the Cloud is Really Different (at scale)Why Scale Matters and How the Cloud is Really Different (at scale)
Why Scale Matters and How the Cloud is Really Different (at scale)
ย 
Serverless Computing
Serverless ComputingServerless Computing
Serverless Computing
ย 
AWS Batch: Simplifying Batch Computing in the Cloud
AWS Batch: Simplifying Batch Computing in the CloudAWS Batch: Simplifying Batch Computing in the Cloud
AWS Batch: Simplifying Batch Computing in the Cloud
ย 
AWS Batch: Simplifying batch computing in the cloud
AWS Batch: Simplifying batch computing in the cloudAWS Batch: Simplifying batch computing in the cloud
AWS Batch: Simplifying batch computing in the cloud
ย 
From a student to an apache committer practice of apache io tdb
From a student to an apache committer  practice of apache io tdbFrom a student to an apache committer  practice of apache io tdb
From a student to an apache committer practice of apache io tdb
ย 
Second Skin: Real-Time Retheming a Legacy Web Application with Diazo in the C...
Second Skin: Real-Time Retheming a Legacy Web Application with Diazo in the C...Second Skin: Real-Time Retheming a Legacy Web Application with Diazo in the C...
Second Skin: Real-Time Retheming a Legacy Web Application with Diazo in the C...
ย 
Devops continuousintegration and deployment onaws puttingmoneybackintoyourmis...
Devops continuousintegration and deployment onaws puttingmoneybackintoyourmis...Devops continuousintegration and deployment onaws puttingmoneybackintoyourmis...
Devops continuousintegration and deployment onaws puttingmoneybackintoyourmis...
ย 
DevOps, Continuous Integration and Deployment on AWS: Putting Money Back into...
DevOps, Continuous Integration and Deployment on AWS: Putting Money Back into...DevOps, Continuous Integration and Deployment on AWS: Putting Money Back into...
DevOps, Continuous Integration and Deployment on AWS: Putting Money Back into...
ย 
Serverless for High Performance Computing
Serverless for High Performance ComputingServerless for High Performance Computing
Serverless for High Performance Computing
ย 
DataTalks.Club - Building Scalable End-to-End Deep Learning Pipelines in the ...
DataTalks.Club - Building Scalable End-to-End Deep Learning Pipelines in the ...DataTalks.Club - Building Scalable End-to-End Deep Learning Pipelines in the ...
DataTalks.Club - Building Scalable End-to-End Deep Learning Pipelines in the ...
ย 
202201 AWS Black Belt Online Seminar Apache Spark Performnace Tuning for AWS ...
202201 AWS Black Belt Online Seminar Apache Spark Performnace Tuning for AWS ...202201 AWS Black Belt Online Seminar Apache Spark Performnace Tuning for AWS ...
202201 AWS Black Belt Online Seminar Apache Spark Performnace Tuning for AWS ...
ย 
Demo 0.9.4
Demo 0.9.4Demo 0.9.4
Demo 0.9.4
ย 

More from Amazon Web Services Korea

[Keynote] ์Šฌ๊ธฐ๋กœ์šด AWS ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค ์„ ํƒํ•˜๊ธฐ - ๋ฐœํ‘œ์ž: ๊ฐ•๋ฏผ์„, Korea Database SA Manager, WWSO, A...
[Keynote] ์Šฌ๊ธฐ๋กœ์šด AWS ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค ์„ ํƒํ•˜๊ธฐ - ๋ฐœํ‘œ์ž: ๊ฐ•๋ฏผ์„, Korea Database SA Manager, WWSO, A...[Keynote] ์Šฌ๊ธฐ๋กœ์šด AWS ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค ์„ ํƒํ•˜๊ธฐ - ๋ฐœํ‘œ์ž: ๊ฐ•๋ฏผ์„, Korea Database SA Manager, WWSO, A...
[Keynote] ์Šฌ๊ธฐ๋กœ์šด AWS ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค ์„ ํƒํ•˜๊ธฐ - ๋ฐœํ‘œ์ž: ๊ฐ•๋ฏผ์„, Korea Database SA Manager, WWSO, A...
Amazon Web Services Korea
ย 
Demystify Streaming on AWS - ๋ฐœํ‘œ์ž: ์ด์ข…ํ˜, Sr Analytics Specialist, WWSO, AWS :::...
Demystify Streaming on AWS - ๋ฐœํ‘œ์ž: ์ด์ข…ํ˜, Sr Analytics Specialist, WWSO, AWS :::...Demystify Streaming on AWS - ๋ฐœํ‘œ์ž: ์ด์ข…ํ˜, Sr Analytics Specialist, WWSO, AWS :::...
Demystify Streaming on AWS - ๋ฐœํ‘œ์ž: ์ด์ข…ํ˜, Sr Analytics Specialist, WWSO, AWS :::...
Amazon Web Services Korea
ย 
Amazon EMR - Enhancements on Cost/Performance, Serverless - ๋ฐœํ‘œ์ž: ๊น€๊ธฐ์˜, Sr Anal...
Amazon EMR - Enhancements on Cost/Performance, Serverless - ๋ฐœํ‘œ์ž: ๊น€๊ธฐ์˜, Sr Anal...Amazon EMR - Enhancements on Cost/Performance, Serverless - ๋ฐœํ‘œ์ž: ๊น€๊ธฐ์˜, Sr Anal...
Amazon EMR - Enhancements on Cost/Performance, Serverless - ๋ฐœํ‘œ์ž: ๊น€๊ธฐ์˜, Sr Anal...
Amazon Web Services Korea
ย 
Enabling Agility with Data Governance - ๋ฐœํ‘œ์ž: ๊น€์„ฑ์—ฐ, Analytics Specialist, WWSO,...
Enabling Agility with Data Governance - ๋ฐœํ‘œ์ž: ๊น€์„ฑ์—ฐ, Analytics Specialist, WWSO,...Enabling Agility with Data Governance - ๋ฐœํ‘œ์ž: ๊น€์„ฑ์—ฐ, Analytics Specialist, WWSO,...
Enabling Agility with Data Governance - ๋ฐœํ‘œ์ž: ๊น€์„ฑ์—ฐ, Analytics Specialist, WWSO,...
Amazon Web Services Korea
ย 
From Insights to Action, How to build and maintain a Data Driven Organization...
From Insights to Action, How to build and maintain a Data Driven Organization...From Insights to Action, How to build and maintain a Data Driven Organization...
From Insights to Action, How to build and maintain a Data Driven Organization...
Amazon Web Services Korea
ย 
[Keynote] Accelerating Business Outcomes with AWS Data - ๋ฐœํ‘œ์ž: Saeed Gharadagh...
[Keynote] Accelerating Business Outcomes with AWS Data - ๋ฐœํ‘œ์ž: Saeed Gharadagh...[Keynote] Accelerating Business Outcomes with AWS Data - ๋ฐœํ‘œ์ž: Saeed Gharadagh...
[Keynote] Accelerating Business Outcomes with AWS Data - ๋ฐœํ‘œ์ž: Saeed Gharadagh...
Amazon Web Services Korea
ย 
LG์ „์ž - Amazon Aurora ๋ฐ RDS ๋ธ”๋ฃจ/๊ทธ๋ฆฐ ๋ฐฐํฌ๋ฅผ ์ด์šฉํ•œ ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค ์—…๊ทธ๋ ˆ์ด๋“œ ์•ˆ์ •์„ฑ ํ™•๋ณด - ๋ฐœํ‘œ์ž: ์ด์€๊ฒฝ ์ฑ…์ž„, L...
LG์ „์ž - Amazon Aurora ๋ฐ RDS ๋ธ”๋ฃจ/๊ทธ๋ฆฐ ๋ฐฐํฌ๋ฅผ ์ด์šฉํ•œ ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค ์—…๊ทธ๋ ˆ์ด๋“œ ์•ˆ์ •์„ฑ ํ™•๋ณด - ๋ฐœํ‘œ์ž: ์ด์€๊ฒฝ ์ฑ…์ž„, L...LG์ „์ž - Amazon Aurora ๋ฐ RDS ๋ธ”๋ฃจ/๊ทธ๋ฆฐ ๋ฐฐํฌ๋ฅผ ์ด์šฉํ•œ ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค ์—…๊ทธ๋ ˆ์ด๋“œ ์•ˆ์ •์„ฑ ํ™•๋ณด - ๋ฐœํ‘œ์ž: ์ด์€๊ฒฝ ์ฑ…์ž„, L...
LG์ „์ž - Amazon Aurora ๋ฐ RDS ๋ธ”๋ฃจ/๊ทธ๋ฆฐ ๋ฐฐํฌ๋ฅผ ์ด์šฉํ•œ ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค ์—…๊ทธ๋ ˆ์ด๋“œ ์•ˆ์ •์„ฑ ํ™•๋ณด - ๋ฐœํ‘œ์ž: ์ด์€๊ฒฝ ์ฑ…์ž„, L...
Amazon Web Services Korea
ย 
KB๊ตญ๋ฏผ์นด๋“œ - ํด๋ผ์šฐ๋“œ ๊ธฐ๋ฐ˜ ๋ถ„์„ ํ”Œ๋žซํผ ํ˜์‹  ์—ฌ์ • - ๋ฐœํ‘œ์ž: ๋ฐ•์ฐฝ์šฉ ๊ณผ์žฅ, ๋ฐ์ดํ„ฐ์ „๋žต๋ณธ๋ถ€, AIํ˜์‹ ๋ถ€, KB์นด๋“œโ”‚๊ฐ•๋ณ‘์–ต, Soluti...
KB๊ตญ๋ฏผ์นด๋“œ - ํด๋ผ์šฐ๋“œ ๊ธฐ๋ฐ˜ ๋ถ„์„ ํ”Œ๋žซํผ ํ˜์‹  ์—ฌ์ • - ๋ฐœํ‘œ์ž: ๋ฐ•์ฐฝ์šฉ ๊ณผ์žฅ, ๋ฐ์ดํ„ฐ์ „๋žต๋ณธ๋ถ€, AIํ˜์‹ ๋ถ€, KB์นด๋“œโ”‚๊ฐ•๋ณ‘์–ต, Soluti...KB๊ตญ๋ฏผ์นด๋“œ - ํด๋ผ์šฐ๋“œ ๊ธฐ๋ฐ˜ ๋ถ„์„ ํ”Œ๋žซํผ ํ˜์‹  ์—ฌ์ • - ๋ฐœํ‘œ์ž: ๋ฐ•์ฐฝ์šฉ ๊ณผ์žฅ, ๋ฐ์ดํ„ฐ์ „๋žต๋ณธ๋ถ€, AIํ˜์‹ ๋ถ€, KB์นด๋“œโ”‚๊ฐ•๋ณ‘์–ต, Soluti...
KB๊ตญ๋ฏผ์นด๋“œ - ํด๋ผ์šฐ๋“œ ๊ธฐ๋ฐ˜ ๋ถ„์„ ํ”Œ๋žซํผ ํ˜์‹  ์—ฌ์ • - ๋ฐœํ‘œ์ž: ๋ฐ•์ฐฝ์šฉ ๊ณผ์žฅ, ๋ฐ์ดํ„ฐ์ „๋žต๋ณธ๋ถ€, AIํ˜์‹ ๋ถ€, KB์นด๋“œโ”‚๊ฐ•๋ณ‘์–ต, Soluti...
Amazon Web Services Korea
ย 
SK Telecom - ๋ง๊ด€๋ฆฌ ํ”„๋กœ์ ํŠธ TANGO์˜ ์˜คํ”ˆ์†Œ์Šค ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค ์ „ํ™˜ ์—ฌ์ • - ๋ฐœํ‘œ์ž : ๋ฐ•์Šน์ „, Project Manager, ...
SK Telecom - ๋ง๊ด€๋ฆฌ ํ”„๋กœ์ ํŠธ TANGO์˜ ์˜คํ”ˆ์†Œ์Šค ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค ์ „ํ™˜ ์—ฌ์ • - ๋ฐœํ‘œ์ž : ๋ฐ•์Šน์ „, Project Manager, ...SK Telecom - ๋ง๊ด€๋ฆฌ ํ”„๋กœ์ ํŠธ TANGO์˜ ์˜คํ”ˆ์†Œ์Šค ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค ์ „ํ™˜ ์—ฌ์ • - ๋ฐœํ‘œ์ž : ๋ฐ•์Šน์ „, Project Manager, ...
SK Telecom - ๋ง๊ด€๋ฆฌ ํ”„๋กœ์ ํŠธ TANGO์˜ ์˜คํ”ˆ์†Œ์Šค ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค ์ „ํ™˜ ์—ฌ์ • - ๋ฐœํ‘œ์ž : ๋ฐ•์Šน์ „, Project Manager, ...
Amazon Web Services Korea
ย 
์ฝ”๋ฆฌ์•ˆ๋ฆฌ - ๋ฐ์ดํ„ฐ ๋ถ„์„ ํ”Œ๋žซํผ ๊ตฌ์ถ• ์—ฌ์ •, ๊ทธ ์‹œ์ž‘๊ณผ ๊ณผ์ œ - ๋ฐœํ‘œ์ž: ๊น€์„๊ธฐ ๊ทธ๋ฃน์žฅ, ๋ฐ์ดํ„ฐ๋น„์ฆˆ๋‹ˆ์Šค์„ผํ„ฐ, ๋ฉ”๊ฐ€์กดํด๋ผ์šฐ๋“œ ::: AWS ...
์ฝ”๋ฆฌ์•ˆ๋ฆฌ - ๋ฐ์ดํ„ฐ ๋ถ„์„ ํ”Œ๋žซํผ ๊ตฌ์ถ• ์—ฌ์ •, ๊ทธ ์‹œ์ž‘๊ณผ ๊ณผ์ œ - ๋ฐœํ‘œ์ž: ๊น€์„๊ธฐ ๊ทธ๋ฃน์žฅ, ๋ฐ์ดํ„ฐ๋น„์ฆˆ๋‹ˆ์Šค์„ผํ„ฐ, ๋ฉ”๊ฐ€์กดํด๋ผ์šฐ๋“œ ::: AWS ...์ฝ”๋ฆฌ์•ˆ๋ฆฌ - ๋ฐ์ดํ„ฐ ๋ถ„์„ ํ”Œ๋žซํผ ๊ตฌ์ถ• ์—ฌ์ •, ๊ทธ ์‹œ์ž‘๊ณผ ๊ณผ์ œ - ๋ฐœํ‘œ์ž: ๊น€์„๊ธฐ ๊ทธ๋ฃน์žฅ, ๋ฐ์ดํ„ฐ๋น„์ฆˆ๋‹ˆ์Šค์„ผํ„ฐ, ๋ฉ”๊ฐ€์กดํด๋ผ์šฐ๋“œ ::: AWS ...
์ฝ”๋ฆฌ์•ˆ๋ฆฌ - ๋ฐ์ดํ„ฐ ๋ถ„์„ ํ”Œ๋žซํผ ๊ตฌ์ถ• ์—ฌ์ •, ๊ทธ ์‹œ์ž‘๊ณผ ๊ณผ์ œ - ๋ฐœํ‘œ์ž: ๊น€์„๊ธฐ ๊ทธ๋ฃน์žฅ, ๋ฐ์ดํ„ฐ๋น„์ฆˆ๋‹ˆ์Šค์„ผํ„ฐ, ๋ฉ”๊ฐ€์กดํด๋ผ์šฐ๋“œ ::: AWS ...
Amazon Web Services Korea
ย 
LG ์ด๋…ธํ… - Amazon Redshift Serverless๋ฅผ ํ™œ์šฉํ•œ ๋ฐ์ดํ„ฐ ๋ถ„์„ ํ”Œ๋žซํผ ํ˜์‹  ๊ณผ์ • - ๋ฐœํ‘œ์ž: ์œ ์žฌ์ƒ ์„ ์ž„, LG์ด๋…ธ...
LG ์ด๋…ธํ… - Amazon Redshift Serverless๋ฅผ ํ™œ์šฉํ•œ ๋ฐ์ดํ„ฐ ๋ถ„์„ ํ”Œ๋žซํผ ํ˜์‹  ๊ณผ์ • - ๋ฐœํ‘œ์ž: ์œ ์žฌ์ƒ ์„ ์ž„, LG์ด๋…ธ...LG ์ด๋…ธํ… - Amazon Redshift Serverless๋ฅผ ํ™œ์šฉํ•œ ๋ฐ์ดํ„ฐ ๋ถ„์„ ํ”Œ๋žซํผ ํ˜์‹  ๊ณผ์ • - ๋ฐœํ‘œ์ž: ์œ ์žฌ์ƒ ์„ ์ž„, LG์ด๋…ธ...
LG ์ด๋…ธํ… - Amazon Redshift Serverless๋ฅผ ํ™œ์šฉํ•œ ๋ฐ์ดํ„ฐ ๋ถ„์„ ํ”Œ๋žซํผ ํ˜์‹  ๊ณผ์ • - ๋ฐœํ‘œ์ž: ์œ ์žฌ์ƒ ์„ ์ž„, LG์ด๋…ธ...
Amazon Web Services Korea
ย 
[Keynote] Data Driven Organizations with AWS Data - ๋ฐœํ‘œ์ž: Agnes Panosian, Head...
[Keynote] Data Driven Organizations with AWS Data - ๋ฐœํ‘œ์ž: Agnes Panosian, Head...[Keynote] Data Driven Organizations with AWS Data - ๋ฐœํ‘œ์ž: Agnes Panosian, Head...
[Keynote] Data Driven Organizations with AWS Data - ๋ฐœํ‘œ์ž: Agnes Panosian, Head...
Amazon Web Services Korea
ย 

More from Amazon Web Services Korea (20)

AWS Modern Infra with Storage Roadshow 2023 - Day 2
AWS Modern Infra with Storage Roadshow 2023 - Day 2AWS Modern Infra with Storage Roadshow 2023 - Day 2
AWS Modern Infra with Storage Roadshow 2023 - Day 2
ย 
AWS Modern Infra with Storage Roadshow 2023 - Day 1
AWS Modern Infra with Storage Roadshow 2023 - Day 1AWS Modern Infra with Storage Roadshow 2023 - Day 1
AWS Modern Infra with Storage Roadshow 2023 - Day 1
ย 
์‚ฌ๋ก€๋กœ ์•Œ์•„๋ณด๋Š” Database Migration Service : ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค ๋ฐ ๋ฐ์ดํ„ฐ ์ด๊ด€, ํ†ตํ•ฉ, ๋ถ„๋ฆฌ, ๋ถ„์„์˜ ๋„๊ตฌ - ๋ฐœํ‘œ์ž: ...
์‚ฌ๋ก€๋กœ ์•Œ์•„๋ณด๋Š” Database Migration Service : ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค ๋ฐ ๋ฐ์ดํ„ฐ ์ด๊ด€, ํ†ตํ•ฉ, ๋ถ„๋ฆฌ, ๋ถ„์„์˜ ๋„๊ตฌ - ๋ฐœํ‘œ์ž: ...์‚ฌ๋ก€๋กœ ์•Œ์•„๋ณด๋Š” Database Migration Service : ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค ๋ฐ ๋ฐ์ดํ„ฐ ์ด๊ด€, ํ†ตํ•ฉ, ๋ถ„๋ฆฌ, ๋ถ„์„์˜ ๋„๊ตฌ - ๋ฐœํ‘œ์ž: ...
์‚ฌ๋ก€๋กœ ์•Œ์•„๋ณด๋Š” Database Migration Service : ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค ๋ฐ ๋ฐ์ดํ„ฐ ์ด๊ด€, ํ†ตํ•ฉ, ๋ถ„๋ฆฌ, ๋ถ„์„์˜ ๋„๊ตฌ - ๋ฐœํ‘œ์ž: ...
ย 
Amazon Elasticache - Fully managed, Redis & Memcached Compatible Service (Lev...
Amazon Elasticache - Fully managed, Redis & Memcached Compatible Service (Lev...Amazon Elasticache - Fully managed, Redis & Memcached Compatible Service (Lev...
Amazon Elasticache - Fully managed, Redis & Memcached Compatible Service (Lev...
ย 
Internal Architecture of Amazon Aurora (Level 400) - ๋ฐœํ‘œ์ž: ์ •๋‹ฌ์˜, APAC RDS Speci...
Internal Architecture of Amazon Aurora (Level 400) - ๋ฐœํ‘œ์ž: ์ •๋‹ฌ์˜, APAC RDS Speci...Internal Architecture of Amazon Aurora (Level 400) - ๋ฐœํ‘œ์ž: ์ •๋‹ฌ์˜, APAC RDS Speci...
Internal Architecture of Amazon Aurora (Level 400) - ๋ฐœํ‘œ์ž: ์ •๋‹ฌ์˜, APAC RDS Speci...
ย 
[Keynote] ์Šฌ๊ธฐ๋กœ์šด AWS ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค ์„ ํƒํ•˜๊ธฐ - ๋ฐœํ‘œ์ž: ๊ฐ•๋ฏผ์„, Korea Database SA Manager, WWSO, A...
[Keynote] ์Šฌ๊ธฐ๋กœ์šด AWS ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค ์„ ํƒํ•˜๊ธฐ - ๋ฐœํ‘œ์ž: ๊ฐ•๋ฏผ์„, Korea Database SA Manager, WWSO, A...[Keynote] ์Šฌ๊ธฐ๋กœ์šด AWS ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค ์„ ํƒํ•˜๊ธฐ - ๋ฐœํ‘œ์ž: ๊ฐ•๋ฏผ์„, Korea Database SA Manager, WWSO, A...
[Keynote] ์Šฌ๊ธฐ๋กœ์šด AWS ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค ์„ ํƒํ•˜๊ธฐ - ๋ฐœํ‘œ์ž: ๊ฐ•๋ฏผ์„, Korea Database SA Manager, WWSO, A...
ย 
Demystify Streaming on AWS - ๋ฐœํ‘œ์ž: ์ด์ข…ํ˜, Sr Analytics Specialist, WWSO, AWS :::...
Demystify Streaming on AWS - ๋ฐœํ‘œ์ž: ์ด์ข…ํ˜, Sr Analytics Specialist, WWSO, AWS :::...Demystify Streaming on AWS - ๋ฐœํ‘œ์ž: ์ด์ข…ํ˜, Sr Analytics Specialist, WWSO, AWS :::...
Demystify Streaming on AWS - ๋ฐœํ‘œ์ž: ์ด์ข…ํ˜, Sr Analytics Specialist, WWSO, AWS :::...
ย 
Amazon EMR - Enhancements on Cost/Performance, Serverless - ๋ฐœํ‘œ์ž: ๊น€๊ธฐ์˜, Sr Anal...
Amazon EMR - Enhancements on Cost/Performance, Serverless - ๋ฐœํ‘œ์ž: ๊น€๊ธฐ์˜, Sr Anal...Amazon EMR - Enhancements on Cost/Performance, Serverless - ๋ฐœํ‘œ์ž: ๊น€๊ธฐ์˜, Sr Anal...
Amazon EMR - Enhancements on Cost/Performance, Serverless - ๋ฐœํ‘œ์ž: ๊น€๊ธฐ์˜, Sr Anal...
ย 
Amazon OpenSearch - Use Cases, Security/Observability, Serverless and Enhance...
Amazon OpenSearch - Use Cases, Security/Observability, Serverless and Enhance...Amazon OpenSearch - Use Cases, Security/Observability, Serverless and Enhance...
Amazon OpenSearch - Use Cases, Security/Observability, Serverless and Enhance...
ย 
Enabling Agility with Data Governance - ๋ฐœํ‘œ์ž: ๊น€์„ฑ์—ฐ, Analytics Specialist, WWSO,...
Enabling Agility with Data Governance - ๋ฐœํ‘œ์ž: ๊น€์„ฑ์—ฐ, Analytics Specialist, WWSO,...Enabling Agility with Data Governance - ๋ฐœํ‘œ์ž: ๊น€์„ฑ์—ฐ, Analytics Specialist, WWSO,...
Enabling Agility with Data Governance - ๋ฐœํ‘œ์ž: ๊น€์„ฑ์—ฐ, Analytics Specialist, WWSO,...
ย 
Amazon Redshift Deep Dive - Serverless, Streaming, ML, Auto Copy (New feature...
Amazon Redshift Deep Dive - Serverless, Streaming, ML, Auto Copy (New feature...Amazon Redshift Deep Dive - Serverless, Streaming, ML, Auto Copy (New feature...
Amazon Redshift Deep Dive - Serverless, Streaming, ML, Auto Copy (New feature...
ย 
From Insights to Action, How to build and maintain a Data Driven Organization...
From Insights to Action, How to build and maintain a Data Driven Organization...From Insights to Action, How to build and maintain a Data Driven Organization...
From Insights to Action, How to build and maintain a Data Driven Organization...
ย 
[Keynote] Accelerating Business Outcomes with AWS Data - ๋ฐœํ‘œ์ž: Saeed Gharadagh...
[Keynote] Accelerating Business Outcomes with AWS Data - ๋ฐœํ‘œ์ž: Saeed Gharadagh...[Keynote] Accelerating Business Outcomes with AWS Data - ๋ฐœํ‘œ์ž: Saeed Gharadagh...
[Keynote] Accelerating Business Outcomes with AWS Data - ๋ฐœํ‘œ์ž: Saeed Gharadagh...
ย 
Amazon DynamoDB - Use Cases and Cost Optimization - ๋ฐœํ‘œ์ž: ์ดํ˜, DynamoDB Special...
Amazon DynamoDB - Use Cases and Cost Optimization - ๋ฐœํ‘œ์ž: ์ดํ˜, DynamoDB Special...Amazon DynamoDB - Use Cases and Cost Optimization - ๋ฐœํ‘œ์ž: ์ดํ˜, DynamoDB Special...
Amazon DynamoDB - Use Cases and Cost Optimization - ๋ฐœํ‘œ์ž: ์ดํ˜, DynamoDB Special...
ย 
LG์ „์ž - Amazon Aurora ๋ฐ RDS ๋ธ”๋ฃจ/๊ทธ๋ฆฐ ๋ฐฐํฌ๋ฅผ ์ด์šฉํ•œ ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค ์—…๊ทธ๋ ˆ์ด๋“œ ์•ˆ์ •์„ฑ ํ™•๋ณด - ๋ฐœํ‘œ์ž: ์ด์€๊ฒฝ ์ฑ…์ž„, L...
LG์ „์ž - Amazon Aurora ๋ฐ RDS ๋ธ”๋ฃจ/๊ทธ๋ฆฐ ๋ฐฐํฌ๋ฅผ ์ด์šฉํ•œ ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค ์—…๊ทธ๋ ˆ์ด๋“œ ์•ˆ์ •์„ฑ ํ™•๋ณด - ๋ฐœํ‘œ์ž: ์ด์€๊ฒฝ ์ฑ…์ž„, L...LG์ „์ž - Amazon Aurora ๋ฐ RDS ๋ธ”๋ฃจ/๊ทธ๋ฆฐ ๋ฐฐํฌ๋ฅผ ์ด์šฉํ•œ ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค ์—…๊ทธ๋ ˆ์ด๋“œ ์•ˆ์ •์„ฑ ํ™•๋ณด - ๋ฐœํ‘œ์ž: ์ด์€๊ฒฝ ์ฑ…์ž„, L...
LG์ „์ž - Amazon Aurora ๋ฐ RDS ๋ธ”๋ฃจ/๊ทธ๋ฆฐ ๋ฐฐํฌ๋ฅผ ์ด์šฉํ•œ ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค ์—…๊ทธ๋ ˆ์ด๋“œ ์•ˆ์ •์„ฑ ํ™•๋ณด - ๋ฐœํ‘œ์ž: ์ด์€๊ฒฝ ์ฑ…์ž„, L...
ย 
KB๊ตญ๋ฏผ์นด๋“œ - ํด๋ผ์šฐ๋“œ ๊ธฐ๋ฐ˜ ๋ถ„์„ ํ”Œ๋žซํผ ํ˜์‹  ์—ฌ์ • - ๋ฐœํ‘œ์ž: ๋ฐ•์ฐฝ์šฉ ๊ณผ์žฅ, ๋ฐ์ดํ„ฐ์ „๋žต๋ณธ๋ถ€, AIํ˜์‹ ๋ถ€, KB์นด๋“œโ”‚๊ฐ•๋ณ‘์–ต, Soluti...
KB๊ตญ๋ฏผ์นด๋“œ - ํด๋ผ์šฐ๋“œ ๊ธฐ๋ฐ˜ ๋ถ„์„ ํ”Œ๋žซํผ ํ˜์‹  ์—ฌ์ • - ๋ฐœํ‘œ์ž: ๋ฐ•์ฐฝ์šฉ ๊ณผ์žฅ, ๋ฐ์ดํ„ฐ์ „๋žต๋ณธ๋ถ€, AIํ˜์‹ ๋ถ€, KB์นด๋“œโ”‚๊ฐ•๋ณ‘์–ต, Soluti...KB๊ตญ๋ฏผ์นด๋“œ - ํด๋ผ์šฐ๋“œ ๊ธฐ๋ฐ˜ ๋ถ„์„ ํ”Œ๋žซํผ ํ˜์‹  ์—ฌ์ • - ๋ฐœํ‘œ์ž: ๋ฐ•์ฐฝ์šฉ ๊ณผ์žฅ, ๋ฐ์ดํ„ฐ์ „๋žต๋ณธ๋ถ€, AIํ˜์‹ ๋ถ€, KB์นด๋“œโ”‚๊ฐ•๋ณ‘์–ต, Soluti...
KB๊ตญ๋ฏผ์นด๋“œ - ํด๋ผ์šฐ๋“œ ๊ธฐ๋ฐ˜ ๋ถ„์„ ํ”Œ๋žซํผ ํ˜์‹  ์—ฌ์ • - ๋ฐœํ‘œ์ž: ๋ฐ•์ฐฝ์šฉ ๊ณผ์žฅ, ๋ฐ์ดํ„ฐ์ „๋žต๋ณธ๋ถ€, AIํ˜์‹ ๋ถ€, KB์นด๋“œโ”‚๊ฐ•๋ณ‘์–ต, Soluti...
ย 
SK Telecom - ๋ง๊ด€๋ฆฌ ํ”„๋กœ์ ํŠธ TANGO์˜ ์˜คํ”ˆ์†Œ์Šค ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค ์ „ํ™˜ ์—ฌ์ • - ๋ฐœํ‘œ์ž : ๋ฐ•์Šน์ „, Project Manager, ...
SK Telecom - ๋ง๊ด€๋ฆฌ ํ”„๋กœ์ ํŠธ TANGO์˜ ์˜คํ”ˆ์†Œ์Šค ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค ์ „ํ™˜ ์—ฌ์ • - ๋ฐœํ‘œ์ž : ๋ฐ•์Šน์ „, Project Manager, ...SK Telecom - ๋ง๊ด€๋ฆฌ ํ”„๋กœ์ ํŠธ TANGO์˜ ์˜คํ”ˆ์†Œ์Šค ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค ์ „ํ™˜ ์—ฌ์ • - ๋ฐœํ‘œ์ž : ๋ฐ•์Šน์ „, Project Manager, ...
SK Telecom - ๋ง๊ด€๋ฆฌ ํ”„๋กœ์ ํŠธ TANGO์˜ ์˜คํ”ˆ์†Œ์Šค ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค ์ „ํ™˜ ์—ฌ์ • - ๋ฐœํ‘œ์ž : ๋ฐ•์Šน์ „, Project Manager, ...
ย 
์ฝ”๋ฆฌ์•ˆ๋ฆฌ - ๋ฐ์ดํ„ฐ ๋ถ„์„ ํ”Œ๋žซํผ ๊ตฌ์ถ• ์—ฌ์ •, ๊ทธ ์‹œ์ž‘๊ณผ ๊ณผ์ œ - ๋ฐœํ‘œ์ž: ๊น€์„๊ธฐ ๊ทธ๋ฃน์žฅ, ๋ฐ์ดํ„ฐ๋น„์ฆˆ๋‹ˆ์Šค์„ผํ„ฐ, ๋ฉ”๊ฐ€์กดํด๋ผ์šฐ๋“œ ::: AWS ...
์ฝ”๋ฆฌ์•ˆ๋ฆฌ - ๋ฐ์ดํ„ฐ ๋ถ„์„ ํ”Œ๋žซํผ ๊ตฌ์ถ• ์—ฌ์ •, ๊ทธ ์‹œ์ž‘๊ณผ ๊ณผ์ œ - ๋ฐœํ‘œ์ž: ๊น€์„๊ธฐ ๊ทธ๋ฃน์žฅ, ๋ฐ์ดํ„ฐ๋น„์ฆˆ๋‹ˆ์Šค์„ผํ„ฐ, ๋ฉ”๊ฐ€์กดํด๋ผ์šฐ๋“œ ::: AWS ...์ฝ”๋ฆฌ์•ˆ๋ฆฌ - ๋ฐ์ดํ„ฐ ๋ถ„์„ ํ”Œ๋žซํผ ๊ตฌ์ถ• ์—ฌ์ •, ๊ทธ ์‹œ์ž‘๊ณผ ๊ณผ์ œ - ๋ฐœํ‘œ์ž: ๊น€์„๊ธฐ ๊ทธ๋ฃน์žฅ, ๋ฐ์ดํ„ฐ๋น„์ฆˆ๋‹ˆ์Šค์„ผํ„ฐ, ๋ฉ”๊ฐ€์กดํด๋ผ์šฐ๋“œ ::: AWS ...
์ฝ”๋ฆฌ์•ˆ๋ฆฌ - ๋ฐ์ดํ„ฐ ๋ถ„์„ ํ”Œ๋žซํผ ๊ตฌ์ถ• ์—ฌ์ •, ๊ทธ ์‹œ์ž‘๊ณผ ๊ณผ์ œ - ๋ฐœํ‘œ์ž: ๊น€์„๊ธฐ ๊ทธ๋ฃน์žฅ, ๋ฐ์ดํ„ฐ๋น„์ฆˆ๋‹ˆ์Šค์„ผํ„ฐ, ๋ฉ”๊ฐ€์กดํด๋ผ์šฐ๋“œ ::: AWS ...
ย 
LG ์ด๋…ธํ… - Amazon Redshift Serverless๋ฅผ ํ™œ์šฉํ•œ ๋ฐ์ดํ„ฐ ๋ถ„์„ ํ”Œ๋žซํผ ํ˜์‹  ๊ณผ์ • - ๋ฐœํ‘œ์ž: ์œ ์žฌ์ƒ ์„ ์ž„, LG์ด๋…ธ...
LG ์ด๋…ธํ… - Amazon Redshift Serverless๋ฅผ ํ™œ์šฉํ•œ ๋ฐ์ดํ„ฐ ๋ถ„์„ ํ”Œ๋žซํผ ํ˜์‹  ๊ณผ์ • - ๋ฐœํ‘œ์ž: ์œ ์žฌ์ƒ ์„ ์ž„, LG์ด๋…ธ...LG ์ด๋…ธํ… - Amazon Redshift Serverless๋ฅผ ํ™œ์šฉํ•œ ๋ฐ์ดํ„ฐ ๋ถ„์„ ํ”Œ๋žซํผ ํ˜์‹  ๊ณผ์ • - ๋ฐœํ‘œ์ž: ์œ ์žฌ์ƒ ์„ ์ž„, LG์ด๋…ธ...
LG ์ด๋…ธํ… - Amazon Redshift Serverless๋ฅผ ํ™œ์šฉํ•œ ๋ฐ์ดํ„ฐ ๋ถ„์„ ํ”Œ๋žซํผ ํ˜์‹  ๊ณผ์ • - ๋ฐœํ‘œ์ž: ์œ ์žฌ์ƒ ์„ ์ž„, LG์ด๋…ธ...
ย 
[Keynote] Data Driven Organizations with AWS Data - ๋ฐœํ‘œ์ž: Agnes Panosian, Head...
[Keynote] Data Driven Organizations with AWS Data - ๋ฐœํ‘œ์ž: Agnes Panosian, Head...[Keynote] Data Driven Organizations with AWS Data - ๋ฐœํ‘œ์ž: Agnes Panosian, Head...
[Keynote] Data Driven Organizations with AWS Data - ๋ฐœํ‘œ์ž: Agnes Panosian, Head...
ย 

Recently uploaded

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
ย 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Victor Rentea
ย 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
ย 

Recently uploaded (20)

Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
ย 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
ย 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering Developers
ย 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
ย 
JohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptxJohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptx
ย 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
ย 
AI in Action: Real World Use Cases by Anitaraj
AI in Action: Real World Use Cases by AnitarajAI in Action: Real World Use Cases by Anitaraj
AI in Action: Real World Use Cases by Anitaraj
ย 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
ย 
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUKSpring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
ย 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
ย 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
ย 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
ย 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
ย 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
ย 
Mcleodganj Call Girls ๐Ÿฅฐ 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls ๐Ÿฅฐ 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls ๐Ÿฅฐ 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls ๐Ÿฅฐ 8617370543 Service Offer VIP Hot Model
ย 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
ย 
Introduction to use of FHIR Documents in ABDM
Introduction to use of FHIR Documents in ABDMIntroduction to use of FHIR Documents in ABDM
Introduction to use of FHIR Documents in ABDM
ย 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
ย 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
ย 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
ย 

[AWS Builders] Effective AWS Glue

  • 1. 3 3 1 0
  • 2.
  • 7. ๊ฐ•์—ฐ ์ค‘ ์งˆ๋ฌธํ•˜๋Š” ๋ฐฉ๋ฒ• AWS Builders Go to Webinar โ€œQuestionsโ€ ์ฐฝ์— ์ž์‹ ์ด ์งˆ๋ฌธํ•œ ๋‚ด์—ญ์ด ํ‘œ์‹œ๋ฉ๋‹ˆ๋‹ค. ๊ธฐ๋ณธ์ ์œผ๋กœ ๋ชจ๋“  ์งˆ๋ฌธ์€ ๊ณต๊ฐœ๋กœ ๋‹ต๋ณ€ ๋ฉ๋‹ˆ๋‹ค๋งŒ ๋ณธ์ธ๋งŒ ๋‹ต๋ณ€์„ ๋ฐ›๊ณ  ์‹ถ์œผ๋ฉด (๋น„๊ณต๊ฐœ)๋ผ๊ณ  ํ•˜๊ณ  ์งˆ๋ฌธํ•ด ์ฃผ์‹œ๋ฉด ๋ฉ๋‹ˆ๋‹ค. ๋ณธ ์ปจํ…์ธ ๋Š” ๊ณ ๊ฐ์˜ ํŽธ์˜๋ฅผ ์œ„ํ•ด AWS ์„œ๋น„์Šค ์„ค๋ช…์„ ์œ„ํ•ด ์˜จ๋ผ์ธ ์„ธ๋ฏธ๋‚˜์šฉ์œผ๋กœ ๋ณ„๋„๋กœ ์ œ์ž‘, ์ œ๊ณต๋œ ๊ฒƒ์ž…๋‹ˆ๋‹ค. ๋งŒ์•ฝ AWS ์‚ฌ์ดํŠธ์™€ ์ปจํ…์ธ  ์ƒ์—์„œ ์ฐจ์ด๋‚˜ ๋ถˆ์ผ์น˜๊ฐ€ ์žˆ์„ ๊ฒฝ์šฐ, AWS ์‚ฌ์ดํŠธ(aws.amazon.com)๊ฐ€ ์šฐ์„ ํ•ฉ๋‹ˆ๋‹ค. ๋˜ํ•œ AWS ์‚ฌ์ดํŠธ ์ƒ์—์„œ ํ•œ๊ธ€ ๋ฒˆ์—ญ๋ฌธ๊ณผ ์˜์–ด ์›๋ฌธ์— ์ฐจ์ด๋‚˜ ๋ถˆ์ผ์น˜๊ฐ€ ์žˆ์„ ๊ฒฝ์šฐ(๋ฒˆ์—ญ์˜ ์ง€์ฒด๋กœ ์ธํ•œ ๊ฒฝ์šฐ ๋“ฑ ํฌํ•จ), ์˜์–ด ์›๋ฌธ์ด ์šฐ์„ ํ•ฉ๋‹ˆ๋‹ค. AWS๋Š” ๋ณธ ์ปจํ…์ธ ์— ํฌํ•จ๋˜๊ฑฐ๋‚˜ ์ปจํ…์ธ ๋ฅผ ํ†ตํ•˜์—ฌ ๊ณ ๊ฐ์—๊ฒŒ ์ œ๊ณต๋œ ์ผ์ฒด์˜ ์ •๋ณด, ์ฝ˜ํ…์ธ , ์ž๋ฃŒ, ์ œํ’ˆ(์†Œํ”„ํŠธ์›จ์–ด ํฌํ•จ) ๋˜๋Š” ์„œ๋น„์Šค๋ฅผ ์ด์šฉํ•จ์œผ๋กœ ์ธํ•˜์—ฌ ๋ฐœ์ƒํ•˜๋Š” ์—ฌํ•˜ํ•œ ์ข…๋ฅ˜์˜ ์†ํ•ด์— ๋Œ€ํ•˜์—ฌ ์–ด๋– ํ•œ ์ฑ…์ž„๋„ ์ง€์ง€ ์•„๋‹ˆํ•˜๋ฉฐ, ์ด๋Š” ์ง์ ‘ ์†ํ•ด, ๊ฐ„์ ‘ ์†ํ•ด, ๋ถ€์ˆ˜์  ์†ํ•ด, ์ง•๋ฒŒ์  ์†ํ•ด ๋ฐ ๊ฒฐ๊ณผ์  ์†ํ•ด๋ฅผ ํฌํ•จํ•˜๋˜ ์ด์— ํ•œ์ •๋˜์ง€ ์•„๋‹ˆํ•ฉ๋‹ˆ๋‹ค. ๊ณ ์ง€ ์‚ฌํ•ญ(Disclaimer)
  • 8. ยง Introdution ยง Glue internal ยง Items ยง Item1: Processing lots of small files ยง Item2: Processing a few large files ยง Item3: Optimizing parallelism ยง Item4: JDBC partitions ยง Item5: Python udf & performance ยง Item6: Scheduler ยง Item7: Python shell ยง QnA
  • 10. Fully-managed, serverless extract-transform-load (ETL) service for developers, built by developers 1000s of customers and jobs A year ago โ€ฆ
  • 11. AWS Glue Serverless data catalog & ETL service Data Catalog ETL Job authoring Discover data and extract schema Auto-generates customizable ETL code in Python and Scala Automatically discovers data and stores schema Data searchable, and available for ETL Generates customizable code Schedules and runs your ETL jobs Serverless, flexible, and built on open standards
  • 12. Putting it together - data lake with AWS Glue Amazon S3 (Raw data) Amazon S3 (Staging data) Amazon S3 (Processed data) AWS Glue Data Catalog Crawlers Crawlers Crawlers
  • 13. Select AWS Glue customers
  • 14. AWS Glue Serverless data catalog & ETL service Data Catalog ETL Job authoring Discover data and extract schema Auto-generates customizable ETL code in Python and Scala Automatically discovers data and stores schema Data searchable, and available for ETL Generates customizable code Schedules and runs your ETL jobs Serverless, flexible, and built on open standards
  • 16. Programming Environment โ€ข ETL in Python โ€ข Python 2.7 โ€ข Boto 3 โ€ข ETL in Scala โ€ข Scala 2.11 โ€ข Spark Cluster โ€ข Spark 2.2.1
  • 17. Programming Environment โ€ข 1 DPU (Data Processing Unit) โ€ข 1 m4.xlarge node โ€ข 4vCPU โ€ข 16G RAM โ€ข 2 executors โ€ข 1 Executor โ€ข 5G RAM โ€ข 4 Tasks Driver Executors
  • 18. Programming Environment โ€ข Glue Job โ€ข Minimum DPU: 2 โ€ข Default DPU: 10 โ€ข Ex) 10 DPU Job โ€ข 10 node cluster โ€ข 1 Master + 9 Core Nodes โ€ข 18 executors โ€ข 1 driver โ€ข 17 executors
  • 19. Programming Environment โ€ข Internal argument to AWS Glue โ€ข --conf โ€ข --debug โ€ข --mode โ€ข --JOB_NAME
  • 20. Basics of ETL Job Programming 1. Initialize 2. Read 3. Transform data 4. Write ## Initialize glueContext = GlueContext(SparkContext.getOrCreate()) ## Create DynamicFrame and retrieve data from source ds0 = glueContext.create_dynamic_frame.from_catalog ( database = "mysql", table_name = "customer", transformation_ctx = "ds0") ## Implement data transformation here ds1 = ds0 ... ## Write DynamicFrame from Catalog ds2 = glueContext.write_dynamic_frame.from_catalog ( frame = ds1, database = "redshift", table_name = "customer_dim", redshift_tmp_dir = args["TempDir"], transformation_ctx = "ds2")
  • 21. What is Apache Spark? Parallel, scale-out data processing engine Fault-tolerance built-in Flexible interface: Python scripting, SQL Rich eco-system: ML, Graph, analytics, โ€ฆ Apache Spark and AWS Glue ETL Spark core: RDDs SparkSQL Dataframes DynamicFrames AWS Glue ETL AWS Glue ETL libraries Integration: Data Catalog, job orchestration, code-generation, job bookmarks, S3, RDS ETL transforms, more connectors & formats New data structure: DynamicFrames
  • 22. Dataframes Core data structure for SparkSQL Like structured tables Need schema up-front Each row has same structure Suited for SQL-like analytics Dataframes and Dynamic Frames Dynamic Frames Like dataframes for ETL Designed for processing semi-structured data, e.g. JSON, Avro, Apache logs ...
  • 23. Public GitHub timeline is โ€ฆ 35+ event types semi-structured payload structure and size varies by event type
  • 24. ยฉ 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. schema per-record, no up-front schema needed Easy to restructure, tag, modify Can be more compact than dataframe rows Many flows can be done in single-pass ยง {โ€œidโ€:โ€2489โ€, โ€œtypeโ€: โ€CreateEventโ€, โ€payloadโ€: {โ€œcreatorโ€:โ€ฆ}, โ€ฆ} Dynamic Records typeid typeid Dynamic Frame Schema typeid Dynamic Frame internals {โ€œidโ€:4391, โ€œtypeโ€: โ€œPullEventโ€, โ€payloadโ€: {โ€œassetsโ€:โ€ฆ}, โ€ฆ} typeid {โ€œidโ€:โ€6510โ€, โ€œtypeโ€: โ€œPushEventโ€, โ€payloadโ€: {โ€œpusherโ€:โ€ฆ}, โ€ฆ} id
  • 25. ResolveChoice() B B B project B cast B separate into cols B B ApplyMapping() A X Y A X Y C 15+ transforms out-of-the box Dynamic Frame transforms
  • 26. Semi-structured schema Relational schema FKA B B C.X C.Y PK ValueOffset A C D [ ] X Y B B Transforms and adds new columns, types, and tables on-the-fly Tracks keys and foreign keys across runs SQL on the relational schema is orders of magnitude faster than JSON processing Relationalize() transform
  • 27. toDF(): Convert to a Dataframe fromDF(): Convert from a Dataframe Spigot(): Sample data of any Dynamic Frame to S3 Unbox(): Parse string column as given format into Dynamic Frame Filter(), Map(): Apply Python UDFs to Dynamic Frames Join(): Join two Dynamic Frames And more โ€ฆ. Useful AWS Glue transforms
  • 28. 0 200 400 600 800 1000 1200 1400 1600 1800 Day Month Year GitHub Timeline ETL Performance DynamicFrames DataFrames Time(sec) On average: 2x performance improvement Data size (# files) 24 744 8699 Performance: AWS Glue ETL Configuration 10 DPUs Apache Spark 2.1.1 Workload JSON to CSV Filter for Pull events (lower is better)
  • 29. Lots of small files, e.g. Kinesis Firehose Vanilla Apache Spark (2.1.1) overheads Must reconstruct partitions (2-pass) Too many tasks: task per file Scheduling & memory overheads AWS Glue Dynamic Frames Integration with Data Catalog Automatically group files per task Rely on crawler statistics Performance: Lots of small files 0 1000 2000 3000 4000 5000 6000 7000 8000 1:2K 20:40K 40:80K 80:160K 160:320K 320:640K 640: 1280K AWS Glue ETL small file scalability Spark Glue 1.2 Million Files Spark Out-Of-Memory >= 320: 640K files Grouping Time(sec) # partitions : # files
  • 30. AWS Glue execution model: data partitions โ€ข Apache Spark and AWS Glue are data parallel. โ€ข Data is divided into partitions that are processed concurrently. โ€ข A stage is a set of parallel tasks โ€“ one task per partition Driver Executors Overall throughput is limited by the number of partitions
  • 31. AWS Glue execution model: jobs and stages
  • 32. AWS Glue execution model: jobs and stages Actions
  • 33. AWS Glue execution model: jobs and stages Jobs
  • 34. AWS Glue execution model: jobs and stages Repartition FilterRead Drop Nulls Write Read Show Job 1 Job 2 Stage 1 Stage 2 Stage 1 Apply Mapping Filter Apply Mapping Jobs
  • 35. โ€ข How is your dataset partitioned? โ€ข How is your application divided into jobs and stages? โ€ข Data is divided into partitions that are processed concurrently AWS Glue performance: key questions
  • 37. Item1: Processing lots of small files
  • 38. Example: Processing lots of small files โ€ข Let's look at a straightforward JSON to Parquet conversion job โ€ข 1.28 million JSON files in 640 partitions:
  • 39. Example: Processing lots of small files โ€ข First try: use a standard SparkSQL job
  • 40. Example: Processing lots of small files
  • 41. Example: Processing lots of small files
  • 42. Example: Processing lots of small files โ€ข Driver memory use is growing fast and approaching the 5g max.
  • 43. Example: Processing lots of small files โ€ข Case 2: Run using AWS Glue DynamicFrames.
  • 44. Example: Processing lots of small files
  • 45. Example: Processing lots of small files Driver memory remains below 50% for the entire duration of execution.
  • 46. Example: Processing lots of small files
  • 47. Example: Processing lots of small files
  • 48. Options for grouping files โ€ข groupFiles โ€ข inPartition: within a partition. โ€ข acrossPartition: from different partitions. โ€ข groupSize โ€ข Target size of each group.
  • 49. Example: Aggressively grouping files โ€ข Execution is much slower, but hasn't crashed. "groupFiles": "acrossPartition"
  • 50. Example: Aggressively grouping files Executor memory is higher than driver. Only one executor is active.
  • 51. Item2: Processing a few large files
  • 52. Example: Processing a few large files โ€ข Let's see how this looks on a sample dataset of 5 large csv files. โ€ข Each file is โ€ข 12.5 GB uncompressed โ€ข 1.6 GB gzip โ€ข 1.3 GB bzip2 โ€ข Script converts data to Parquet.
  • 53. Example: Processing a few large gzip files โ€ข We only have 5 partitions โ€“ one for each file. โ€ข Job fails after 2 hours.
  • 54. Example: Processing a few large bzip2 files โ€ข Bzip2 files can be split into blocks, so we see up to 104 tasks. โ€ข Job completes in 18 minutes.
  • 55. Example: Processing a few large bzip2 files โ€ข With 15 DPU, the number of active executors closely tracks the maximum needed number of executors.
  • 56. Example: Processing a few large uncompressed files โ€ข Uncompressed files can be split into lines, so we construct 64MB partitions. โ€ข Job completes in 12 minutes.
  • 57. Example: Processing a few large files โ€ข If you have a choice of compression type, prefer bzip2. โ€ข If you are using gzip, make sure you have enough files to fully utilize your resources. โ€ข Bandwidth is rarely the bottleneck for AWS Glue jobs, so consider leaving files uncompressed.
  • 59. Example: optimizing parallelism Processing large, split-able bzip2 files. With 10 DPU, metric maximum needed executors shows room for scaling.
  • 60. ยง 17 Executors (Maximum Allocated Executors) ยง 10 DPU = 10 Node Cluster = 1 Master + 9 Core Node ยง 9 Core Node = 18 Executors = 1 Driver + 17 Executors ยง 27 Executors (Maximum Needed Executors) ยง 1 Driver + 27 Executors = 28 Executors = 14 Core Node ยง 14 Core Node + 1 Master = 15 Node Cluster = 15 DPU DPU
  • 61. Example: optimizing parallelism With 15 DPU, active executors closely tracks maximum needed executors.
  • 63. AWS Glue JDBC partitions โ€ข For JDBC sources, by default each table is read as a single partition. โ€ข AWS Glue automatically partitions datasets with fewer than 10 partitions after the data has been loaded.
  • 66. Reading JDBC partitions A single executor is used for the JDBC query Data is repartitioned for the rest of the job.
  • 67. Options for reading database tables in parallel โ€ข hashexpression โ€“ Integer expression to use for distribution. โ€ข hashfield โ€“ Single column to use for distribution. โ€ข hashpartitions โ€“ Number of parallel queries to make. Default is 7. โ€ข Turns into a collection of queries of the form
  • 68. Options for reading database tables in parallel โ€ข Guidelines for picking distribution keys. โ€ข For hashexpression, choose a column that is evenly distributed across values. A primary key works well. โ€ข If no such field exists, use hashfield to define one. โ€ข Example: The taxi dataset does not have a primary key, so we set hashfield to partition based on day of the month: datasource0 = glueContext.create_dynamic_frame.from_catalog( database = "nyctaxi", table_name = "green-mysql-large", additional_options={'hashfield': 'day(lpep_pickup_datetime)', 'hashpartitions': 15})
  • 69. Options for reading database tables in parallel โ€ข Four executors can process 16 partitions concurrently.
  • 70. Options for reading database tables in parallel โ€ข Make sure to understand impact to database engine.
  • 71. Job Bookmarks for JDBC Queries โ€ข Job bookmarks only work when the source table has an ordered primary key. โ€ข Updates are not handled today.
  • 73. Python performance โ€ข Using map and filter in Python is expensive for large data sets. โ€ข All data is serialized and sent between the JVM and Python. โ€ข Alternatives โ€ข Use AWS Glue Scala SDK. โ€ข Convert to DataFrame and use Spark SQL expressions. Spark JVM Python VM
  • 75. Glue
  • 76. Glue
  • 77. Boto3
  • 82. Announcing a new job type: Python shell A new cost-effective ETL primitive for small to medium tasks Python shell 3rd party service
  • 83. AWS Glue Python shell specs Python 2.7 environment with boto3, awscli, numpy, scipy, pandas, scikit-learn, PyGreSQL, โ€ฆ cold spin-up: < 20 sec, support for VPCs, no runtime limit sizes: 1 DPU (includes 16GB), and 1/16 DPU (includes 1GB) pricing: $0.44 per DPU-hour, 1-min minimum, per-second billing
  • 84. Python shell collaborative filtering example Amazon customer reviews dataset (s3://amazon-reviews-pds) Video category Compute low-rank approx of (Customer x Product) ratings using SVD uses scipy sparse matrix and SVD library Step Time (sec) Redshift COPY 13 Extract ratings 5 Generate matrix 1552 SVD (k=1000) 2575 Total 4145 69 min $0.60
  • 85. ๋” ๋‚˜์€ ์„ธ๋ฏธ๋‚˜๋ฅผ ์œ„ํ•ด ์—ฌ๋Ÿฌ๋ถ„์˜ ์˜๊ฒฌ์„ ๋‚จ๊ฒจ์ฃผ์„ธ์š”! โ–ถ ์งˆ๋ฌธ์— ๋Œ€ํ•œ ๋‹ต๋ณ€ ๋“œ๋ฆฝ๋‹ˆ๋‹ค. โ–ถ ๋ฐœํ‘œ์ž๋ฃŒ/๋…นํ™”์˜์ƒ์„ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค. http://bit.ly/awskr-webinar