Personalisering av tv.nrk.no

•Download as PPTX, PDF•

0 likes•470 views

This document discusses strategies for scaling applications and services across multiple data centers and cloud regions. It provides examples of patterns for queue-based load leveling and publishing/subscribing to channels. It also outlines the structure of a Redis cache with different data types to store user profiles and activity. Scaling considerations for DocumentDB are reviewed based on request units.

Data & Analytics

Personalisering av
http://tv.nrk.no
Harald Schult Ulriksen
@hsulriksen
http://aurum.no

Bunntekst/Presentasjonstittel 7CC BY Gemma Longman
Ytelse Oppetid Tilstand Sikkerhet

Topic
V2
West Europe v1
West Europe v2
North Europe V2
North Europe V1
WorkerRole
WorkerRole
WorkerRole
WorkerRole

Patterns
• Command Query Responsibility Segregation
• Queue-based Load Leveling
• Competing consumers
• Publish subscribe channel
• Content based router
• Deadletter
• Message store

API
Redis
Cache
Redis
Cache
API
Kø
Datasenter 1
Datasenter 2
Kø
Datasenter 1
Datasenter 2
Kø leser
DocumentDb
Kø leser
DocumentDb
Datasenter 1 Datasenter 2
1
3
2
1
2
1
1
1
2
3

Scaling DocumentDb
Request units
S2
S1
S0 250
1000 + 1000 + 1000 + 1000 + 1000 + 1000 = 6000

Request units
S0 250 kr 177,00
S1 1000 kr 353,00
S2 2500 kr 706,00
Sekunder mellom hver ping S1 RU's 1000
120,00 Antall 10,00 kr 3 530,00
API klienter Request pr. minutt Pr. sekund MS pr melding RU's needed RU's total RU Balance
100 000,00 50 000,00 833,33 1,20 10 791,67 10 000,00 -791,67
50 000,00 25 000,00 416,67 2,40 5 395,83 10 000,00 4 604,17
20 000,00 10 000,00 166,67 6,00 2 158,33 10 000,00 7 841,67
10 000,00 5 000,00 83,33 12,00 1 079,17 10 000,00 8 920,83
Request charge
Save recently watched Document lookup 2,28
Replace existing 10,67
Total 12,95

$Redis cache structure Name Type Key / Score Data Plo Hashset ProgramId Program list item sp_{seriesId} Set ProgramIds uf_{userId} Hashset ProgramId Serialized data usf_{userId} SortedSet Unix epoch date added ProgramId uh_{userId} Hashset ProgramId Serialized data ush_{userId} SortedSet Unix epoch, last watched ProgramId User loaded ul_{userId} Bool User last write time uw_{userId} Date time$

Similar to Personalisering av tv.nrk.no

Resilient Kafka: How DNS Traffic Management and Client Wrappers Ensure Availa...VanessaVuibert1

Resilient Kafka: How DNS Traffic Management and Client Wrappers Ensure Availa...Vanessa Vuibert

Spark Summit EU talk by Sebastian Schroeder and Ralf SigmundSpark Summit

Big datadc skyfall_preso_v2abramsm

OSCON Data 2011 -- NoSQL @ Netflix, Part 2Sid Anand

Navigate Data Service using AWSArno Broekhof

MQTC V2.0.1.3 - WMQ & TCP Buffers – Size DOES Matter! (pps)Art Schanz

XMPP/Jingle(VoIP)/Perl Ocean 2012/03Lyo Kato

Automotive network and gateway simulationDeepak Shankar

Massive Data Processing in Adobe Using Delta LakeDatabricks

AF Ceph: Ceph Performance Analysis and Improvement on FlashCeph Community

Druid at naver.com - part 1Jungsu Heo

Bloomreach - BloomStore Compute Cloud Infrastructure bloomreacheng

In Flux Limiting for a multi-tenant logging serviceDataWorks Summit/Hadoop Summit

2017 Microservices Practitioner Virtual Summit: Microservices at Squarespace ...Ambassador Labs

Efail: Breaking S/MIME and OpenPGP Email Encryption using Exfiltration ChannelsPriyanka Aash

Cloud Native Patterns Meetup 2019-11-20RegisWilson1

Rackspace: Email's Solution for Indexing 50K Documents per Second: Presented ...Lucidworks

Leveraging the Power of Solr with Spark: Presented by Johannes Weigend, QAwareLucidworks

Leveraging the Power of Solr with SparkQAware GmbH

Similar to Personalisering av tv.nrk.no (20)

Resilient Kafka: How DNS Traffic Management and Client Wrappers Ensure Availa...

Spark Summit EU talk by Sebastian Schroeder and Ralf Sigmund

Big datadc skyfall_preso_v2

OSCON Data 2011 -- NoSQL @ Netflix, Part 2

Navigate Data Service using AWS

MQTC V2.0.1.3 - WMQ & TCP Buffers – Size DOES Matter! (pps)

XMPP/Jingle(VoIP)/Perl Ocean 2012/03

Automotive network and gateway simulation

Massive Data Processing in Adobe Using Delta Lake

AF Ceph: Ceph Performance Analysis and Improvement on Flash

Druid at naver.com - part 1

Bloomreach - BloomStore Compute Cloud Infrastructure

In Flux Limiting for a multi-tenant logging service

2017 Microservices Practitioner Virtual Summit: Microservices at Squarespace ...

Efail: Breaking S/MIME and OpenPGP Email Encryption using Exfiltration Channels

Cloud Native Patterns Meetup 2019-11-20

Rackspace: Email's Solution for Indexing 50K Documents per Second: Presented ...

Leveraging the Power of Solr with Spark: Presented by Johannes Weigend, QAware

Leveraging the Power of Solr with Spark

Recently uploaded

{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...Pooja Nehwal

Schema on read is obsolete. Welcome metaprogramming..pdfLars Albertsson

Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiSuhani Kapoor

Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfSocial Samosa

04242024_CCC TUG_Joins and Relationshipsccctableauusergroup

꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Callshivangimorya083

Data Science Project: Advancements in Fetal Health ClassificationBoston Institute of Analytics

VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiSuhani Kapoor

Russian Call Girls Dwarka Sector 15 💓 Delhi 9999965857 @Sabina Modi VVIP MODE...Call Girls In Delhi Whatsup 9873940964 Enjoy Unlimited Pleasure

Full night 🥵 Call Girls Delhi New Friends Colony {9711199171} Sanya Reddy ✌️o...shivangimorya083

E-Commerce Order PredictionShraddha Kamble.pptxBoston Institute of Analytics

Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Sapana Sha

Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...dajasot375

Spark3's new memory model/managementakshesh doshi

B2 Creative Industry Response Evaluation.docxStephen266013

Brighton SEO | April 2024 | Data StorytellingNeil Barnes

Call Girls In Mahipalpur O9654467111 Escorts ServiceSapana Sha

Ukraine War presentation: KNOW THE BASICSAishani27

Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfRachmat Ramadhan H

Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝soniya singh

Recently uploaded (20)

{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...

Schema on read is obsolete. Welcome metaprogramming..pdf

Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai

Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf

04242024_CCC TUG_Joins and Relationships

꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call

Data Science Project: Advancements in Fetal Health Classification

VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati

Russian Call Girls Dwarka Sector 15 💓 Delhi 9999965857 @Sabina Modi VVIP MODE...

Full night 🥵 Call Girls Delhi New Friends Colony {9711199171} Sanya Reddy ✌️o...

E-Commerce Order PredictionShraddha Kamble.pptx

Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...

Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...

Spark3's new memory model/management

B2 Creative Industry Response Evaluation.docx

Brighton SEO | April 2024 | Data Storytelling

Call Girls In Mahipalpur O9654467111 Escorts Service

Ukraine War presentation: KNOW THE BASICS

Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf

Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝

Personalisering av tv.nrk.no

1. Personalisering av http://tv.nrk.no Harald Schult Ulriksen @hsulriksen http://aurum.no

3. Utgangspunkt

4. Utgangspunkt

7. Bunntekst/Presentasjonstittel 7CC BY Gemma Longman Ytelse Oppetid Tilstand Sikkerhet

9. Kø

10. Kø

11. Kø

12. Topic Subscription A Subscription B

13. Topic V2 West Europe v1 West Europe v2 North Europe V2 North Europe V1 WorkerRole WorkerRole WorkerRole WorkerRole

14. Patterns • Command Query Responsibility Segregation • Queue-based Load Leveling • Competing consumers • Publish subscribe channel • Content based router • Deadletter • Message store

15. API Redis Cache Redis Cache API Kø Datasenter 1 Datasenter 2 Kø Datasenter 1 Datasenter 2 Kø leser DocumentDb Kø leser DocumentDb Datasenter 1 Datasenter 2 1 3 2 1 2 1 1 1 2 3

16. DocumentDb

17. Scaling DocumentDb Request units S2 S1 S0 250 1000 + 1000 + 1000 + 1000 + 1000 + 1000 = 6000

18.

19. Request units S0 250 kr 177,00 S1 1000 kr 353,00 S2 2500 kr 706,00 Sekunder mellom hver ping S1 RU's 1000 120,00 Antall 10,00 kr 3 530,00 API klienter Request pr. minutt Pr. sekund MS pr melding RU's needed RU's total RU Balance 100 000,00 50 000,00 833,33 1,20 10 791,67 10 000,00 -791,67 50 000,00 25 000,00 416,67 2,40 5 395,83 10 000,00 4 604,17 20 000,00 10 000,00 166,67 6,00 2 158,33 10 000,00 7 841,67 10 000,00 5 000,00 83,33 12,00 1 079,17 10 000,00 8 920,83 Request charge Save recently watched Document lookup 2,28 Replace existing 10,67 Total 12,95

20.

21. Redis cache structure Name Type Key / Score Data Plo Hashset ProgramId Program list item sp_{seriesId} Set ProgramIds uf_{userId} Hashset ProgramId Serialized data usf_{userId} SortedSet Unix epoch date added ProgramId uh_{userId} Hashset ProgramId Serialized data ush_{userId} SortedSet Unix epoch, last watched ProgramId User loaded ul_{userId} Bool User last write time uw_{userId} Date time

22. Script to load user data

Editor's Notes

Hva er personalisering for NRK. Favoritter, mine programmer. Analyse, kobles til innholdsplakat.
20-40K API requests / minutt Median 13-15ms 95 persentil 150-200ms 99 persentil 600-800ms TV, Radio og Klipp. Read only
Ingen «kontroll» på når data endres. Web, App,
100K samtidige seere on-demand Oppdateringsfrekvens Play start, delayed Hvert 2 minutt Pause Onbeforeunload Program slutt ca 55K request / minutt Separate API kall / Resources Berikelse av eksisterende kall Mengden data øker over tid, hvem er master.
Presentasjon av personaliserte data Synkronisering mellom to datasentre Caching Skalering 8 fallacies of distributed computing The network is reliable. Latency is zero. Bandwidth is infinite. The network is secure. Topology doesn't change. There is one administrator. Transport cost is zero. The network is homogeneous.
Command Query Responsibility Segregation Naturlig med forskjellig skriveløp, gir mulighet å skalere de to individuelt.
Queue based load leveling
Ujevn ytelse
Competing consumers. - mulighet å øke ytelsen på skrivelaget ved behov. Reliability, feiler en så fortsetter resten. En treg melding vil ikke forstyrre resten.
Publish subscribe – fra observer. Her melding dupliseres og behandles individuelt.
Content based router - filtrering
Command Query Responsibility Segregation Naturlig med forskjellig skriveløp, gir mulighet å skalere de to individuelt.
Partitioning component of Amazon's storage system Dynamo[4][5] Data partitioning in Apache Cassandra[6] Data Partitioning in Voldemort[7] Akka's consistent hashing router[8]
Forbedringer: Egen kø hvis noe må sendes på nytt for en worker, topic sender alltid alt til alle. Cassandra – data senter aware storage.Eventhub -> Stream analytics – multiple datacenter

Personalisering av tv.nrk.no

Recommended

Recommended

More Related Content

Similar to Personalisering av tv.nrk.no

Similar to Personalisering av tv.nrk.no (20)

Recently uploaded

Recently uploaded (20)

Personalisering av tv.nrk.no

Editor's Notes