YouTube video: https://www.youtube.com/watch?v=PliThXFP3dM
Yes, it matters! (we are talking about databases, don’t worry).
Learn how to combine some of the most amazing AWS services to build a petabyte scale database able to store and retrieve billions of records per day without spending a lot of money.
Best of all, we won’t need a single server running!
10. Lucas Ceballos
Size does matter@ Dataops 2019
SIZEDOESMATTER 9
NEWUSERS
USERACQUISITION
NEWORDERS
USERRE-ENGAGEMENT
11. Lucas Ceballos
Size does matter@ Dataops 2019
SIZEDOESMATTER 10
IMPRESSION
+50K / SEG
CLICK
+5K / SEG
CONVERSION
+15K / HR
AD LIFE CYCLE
12. Lucas Ceballos
Size does matter@ Dataops 2019
SIZEDOESMATTER
Click IDImpression ID
10
IMPRESSION
+50K / SEG
CLICK
+5K / SEG
CONVERSION
+15K / HR
AD LIFE CYCLE
13. Lucas Ceballos
Size does matter@ Dataops 2019
SIZEDOESMATTER 10
IMPRESSION
+50K / SEG
CLICK
+5K / SEG
CONVERSION
+15K / HR
AD LIFE CYCLE
△t△t
14. Lucas Ceballos
Size does matter@ Dataops 2019
SIZEDOESMATTER 10
IMPRESSION
+50K / SEG
CLICK
+5K / SEG
CONVERSION
+15K / HR
AD LIFE CYCLE
△t△t
<1min 1-∞min
15. Lucas Ceballos
Size does matter@ Dataops 2019
SIZEDOESMATTER 10
IMPRESSION
+50K / SEG
CLICK
+5K / SEG
CONVERSION
+15K / HR
AD LIFE CYCLE
△t△t
<1min 1-∞min
16. Lucas Ceballos
Size does matter@ Dataops 2019
SIZEDOESMATTER 10
IMPRESSION
+50K / SEG
CLICK
+5K / SEG
CONVERSION
+15K / HR
AD LIFE CYCLE
△t△t
<1min AttributionWindow
17. Lucas Ceballos
Size does matter@ Dataops 2019
SIZEDOESMATTER 11
Intheworldofdigitalattribution,anattributionwindow
isatimeperiodduringwhicheligibleconversionevents
canbeclaimed.Theconversioneventscanbeanapp
install,addedtocart,orpurchase.
ATTRIBUTIONWINDOW:
18. Lucas Ceballos
Size does matter@ Dataops 2019
SIZEDOESMATTER 12
UNDER THE HOOD
EC2
EC2
EC2
EC2
EC2
EC2
EC2
EC2
Incoming
Clicks
Incoming
Conversions
W R
attribution window
19. Lucas Ceballos
Size does matter@ Dataops 2019
SIZEDOESMATTER 12
UNDER THE HOOD
EC2
EC2
EC2
EC2
EC2
EC2
EC2
EC2
Incoming
Clicks
Incoming
Conversions
W R
attribution window
20. Lucas Ceballos
Size does matter@ Dataops 2019
SIZEDOESMATTER 12
UNDER THE HOOD
EC2
EC2
EC2
EC2
EC2
EC2
EC2
EC2
Incoming
Clicks
Incoming
Conversions
W R
attribution window
21. Lucas Ceballos
Size does matter@ Dataops 2019
SIZEDOESMATTER 12
UNDER THE HOOD
EC2
EC2
EC2
EC2
EC2
EC2
EC2
EC2
Incoming
Clicks
Incoming
Conversions
W R
attribution window
22. Lucas Ceballos
Size does matter@ Dataops 2019
SIZEDOESMATTER 12
UNDER THE HOOD
EC2
EC2
EC2
EC2
EC2
EC2
EC2
EC2
Incoming
Clicks
Incoming
Conversions
W R
attribution window
23. Lucas Ceballos
Size does matter@ Dataops 2019
SIZEDOESMATTER 13
1808210149a523932a4ceaed
CLICK ID
Date &Time (up to minute) Random Hexadecimal String
27. Lucas Ceballos
Size does matter@ Dataops 2019
SIZEDOESMATTER 17
UNDER THE HOOD
EC2
EC2
EC2
EC2
EC2
EC2
EC2
EC2
Incoming
Clicks
Incoming
Conversions
W R
attribution window
28. Lucas Ceballos
Size does matter@ Dataops 2019
SIZEDOESMATTER 18
500.000.000
x 650
x 30
Dailyclicks
Avg. click size
Days/month
~9 TB/month
29. Lucas Ceballos
Size does matter@ Dataops 2019
SIZEDOESMATTER 19
EC2
EC2
EC2
EC2
EC2
EC2
EC2
EC2
Incoming
Clicks
Incoming
Events
W R
54. Lucas Ceballos
Size does matter@ Dataops 2019
SIZEDOESMATTER 32
t1 t2
attribution window
1H
70%<1H
30%>1H
55. Lucas Ceballos
Size does matter@ Dataops 2019
SIZEDOESMATTER 33
“DIVIDEANDCONQUER”
FINAL CHAPTER
56. Lucas Ceballos
Size does matter@ Dataops 2019
SIZEDOESMATTER 34
H T
Last hourclicks
C LD
> 1 hourclicks
FREQUENTLY
ACCESSED
SMALL
NUMBER
INFREQUENTLY
ACCESSED
HUGE
NUMBER
+ +
= =
57. Lucas Ceballos
Size does matter@ Dataops 2019
SIZEDOESMATTER 34
H T
Last hourclicks
C LD
> 1 hourclicks
+ +
= =
Fridge
58. Lucas Ceballos
Size does matter@ Dataops 2019
SIZEDOESMATTER 35
UNDER THE HOOD
EC2
EC2
EC2
EC2
Incoming
Clicks
EC2
EC2
EC2
EC2
Incoming
Conversions
H T H T
C LDC LD
Fridge
59. Lucas Ceballos
Size does matter@ Dataops 2019
SIZEDOESMATTER 36
H T
EC2
EC2
EC2
EC2
Incoming
Clicks
CSV
EC2
EC2
EC2
EC2
Incoming
Conversions
(450b)
1 hour~ 9GB
Elasticache
HADES
60. Lucas Ceballos
Size does matter@ Dataops 2019
SIZEDOESMATTER 37
EC2
EC2
EC2
EC2
Incoming
Clicks
C LD
JSON
1m BUFFERING
PARQUET
Fridge
ALFREDDO
61. Lucas Ceballos
Size does matter@ Dataops 2019
SIZEDOESMATTER 38
EC2
EC2
EC2
EC2
Incoming
Conversions
C LD
SQS Athena Clicks
H TSQS
Retry
Fridge
62. Lucas Ceballos
Size does matter@ Dataops 2019
SIZEDOESMATTER 38
EC2
EC2
EC2
EC2
Incoming
Conversions
C LD
SQS Athena Clicks
H TSQS
Retry
Fridge
63. Lucas Ceballos
Size does matter@ Dataops 2019
SIZEDOESMATTER 38
EC2
EC2
EC2
EC2
Incoming
Conversions
C LD
SQS Athena Clicks
H TSQS
Retry
Fridge
64. Lucas Ceballos
Size does matter@ Dataops 2019
SIZEDOESMATTER 39
JSON
PARQUET
Athena
PRESTO dt=2018-08-21-01-51
dt=2018-08-21-01-50
dt=2018-08-21-01-49
dt=2018-08-21-01-48
dt=2018-08-21-01-47
Fridge