[DSC Adria 23] Miro MIljanic Telco Data Pipelines in the Cloud Architecture and Use Cases.pptx

May. 30, 2023
[DSC Adria 23] Miro MIljanic Telco Data Pipelines in the Cloud Architecture and Use Cases.pptx
[DSC Adria 23] Miro MIljanic Telco Data Pipelines in the Cloud Architecture and Use Cases.pptx
[DSC Adria 23] Miro MIljanic Telco Data Pipelines in the Cloud Architecture and Use Cases.pptx
[DSC Adria 23] Miro MIljanic Telco Data Pipelines in the Cloud Architecture and Use Cases.pptx
[DSC Adria 23] Miro MIljanic Telco Data Pipelines in the Cloud Architecture and Use Cases.pptx
[DSC Adria 23] Miro MIljanic Telco Data Pipelines in the Cloud Architecture and Use Cases.pptx
[DSC Adria 23] Miro MIljanic Telco Data Pipelines in the Cloud Architecture and Use Cases.pptx
[DSC Adria 23] Miro MIljanic Telco Data Pipelines in the Cloud Architecture and Use Cases.pptx
[DSC Adria 23] Miro MIljanic Telco Data Pipelines in the Cloud Architecture and Use Cases.pptx
[DSC Adria 23] Miro MIljanic Telco Data Pipelines in the Cloud Architecture and Use Cases.pptx
[DSC Adria 23] Miro MIljanic Telco Data Pipelines in the Cloud Architecture and Use Cases.pptx
[DSC Adria 23] Miro MIljanic Telco Data Pipelines in the Cloud Architecture and Use Cases.pptx
[DSC Adria 23] Miro MIljanic Telco Data Pipelines in the Cloud Architecture and Use Cases.pptx
[DSC Adria 23] Miro MIljanic Telco Data Pipelines in the Cloud Architecture and Use Cases.pptx
[DSC Adria 23] Miro MIljanic Telco Data Pipelines in the Cloud Architecture and Use Cases.pptx
[DSC Adria 23] Miro MIljanic Telco Data Pipelines in the Cloud Architecture and Use Cases.pptx
[DSC Adria 23] Miro MIljanic Telco Data Pipelines in the Cloud Architecture and Use Cases.pptx
[DSC Adria 23] Miro MIljanic Telco Data Pipelines in the Cloud Architecture and Use Cases.pptx
[DSC Adria 23] Miro MIljanic Telco Data Pipelines in the Cloud Architecture and Use Cases.pptx
[DSC Adria 23] Miro MIljanic Telco Data Pipelines in the Cloud Architecture and Use Cases.pptx
[DSC Adria 23] Miro MIljanic Telco Data Pipelines in the Cloud Architecture and Use Cases.pptx
[DSC Adria 23] Miro MIljanic Telco Data Pipelines in the Cloud Architecture and Use Cases.pptx
[DSC Adria 23] Miro MIljanic Telco Data Pipelines in the Cloud Architecture and Use Cases.pptx
[DSC Adria 23] Miro MIljanic Telco Data Pipelines in the Cloud Architecture and Use Cases.pptx
[DSC Adria 23] Miro MIljanic Telco Data Pipelines in the Cloud Architecture and Use Cases.pptx
[DSC Adria 23] Miro MIljanic Telco Data Pipelines in the Cloud Architecture and Use Cases.pptx
[DSC Adria 23] Miro MIljanic Telco Data Pipelines in the Cloud Architecture and Use Cases.pptx
[DSC Adria 23] Miro MIljanic Telco Data Pipelines in the Cloud Architecture and Use Cases.pptx
[DSC Adria 23] Miro MIljanic Telco Data Pipelines in the Cloud Architecture and Use Cases.pptx
[DSC Adria 23] Miro MIljanic Telco Data Pipelines in the Cloud Architecture and Use Cases.pptx
1 of 30

More Related Content

Similar to [DSC Adria 23] Miro MIljanic Telco Data Pipelines in the Cloud Architecture and Use Cases.pptx

How Celtra Optimizes its Advertising Platformwith DatabricksHow Celtra Optimizes its Advertising Platformwith Databricks
How Celtra Optimizes its Advertising Platform with DatabricksGrega Kespret
Why retail companies can't afford database downtimeWhy retail companies can't afford database downtime
Why retail companies can't afford database downtimeDBmaestro - Database DevOps
Virtual Data :  Eliminating the data constraint in Application DevelopmentVirtual Data :  Eliminating the data constraint in Application Development
Virtual Data : Eliminating the data constraint in Application DevelopmentKyle Hailey
Scaling Systems: Architectures that growScaling Systems: Architectures that grow
Scaling Systems: Architectures that growGibraltar Software
Kscope 14 Presentation : Virtual Data PlatformKscope 14 Presentation : Virtual Data Platform
Kscope 14 Presentation : Virtual Data PlatformKyle Hailey
Data Lineage, Property Based Testing & Neo4j Data Lineage, Property Based Testing & Neo4j
Data Lineage, Property Based Testing & Neo4j Neo4j

Similar to [DSC Adria 23] Miro MIljanic Telco Data Pipelines in the Cloud Architecture and Use Cases.pptx(20)

More from DataScienceConferenc1

[DSC Adria 23] Sinisa Sljepcevic - The Japanese Secret for a Joyful Life- Can...[DSC Adria 23] Sinisa Sljepcevic - The Japanese Secret for a Joyful Life- Can...
[DSC Adria 23] Sinisa Sljepcevic - The Japanese Secret for a Joyful Life- Can...DataScienceConferenc1
[DSC Adria 23]Goran Paulin Procedural Generation Of Synthsets For Computer Vi...[DSC Adria 23]Goran Paulin Procedural Generation Of Synthsets For Computer Vi...
[DSC Adria 23]Goran Paulin Procedural Generation Of Synthsets For Computer Vi...DataScienceConferenc1
[DSC Adria 23]Nino Pozar What Are We Shipping.pptx[DSC Adria 23]Nino Pozar What Are We Shipping.pptx
[DSC Adria 23]Nino Pozar What Are We Shipping.pptxDataScienceConferenc1
[DSC Adria 23] Muthu Ramachandran AI Ethics Framework for Generative AI such ...[DSC Adria 23] Muthu Ramachandran AI Ethics Framework for Generative AI such ...
[DSC Adria 23] Muthu Ramachandran AI Ethics Framework for Generative AI such ...DataScienceConferenc1
[DSC Adria 23]Dino Pitoski - Distribution of international migrants across Cr...[DSC Adria 23]Dino Pitoski - Distribution of international migrants across Cr...
[DSC Adria 23]Dino Pitoski - Distribution of international migrants across Cr...DataScienceConferenc1
[DSC Adria 23] Mirjana Pejic Bach Data mining approach to internal fraud in a...[DSC Adria 23] Mirjana Pejic Bach Data mining approach to internal fraud in a...
[DSC Adria 23] Mirjana Pejic Bach Data mining approach to internal fraud in a...DataScienceConferenc1

More from DataScienceConferenc1(20)

Recently uploaded

apidays London 2023 - Overengineering Weakens your API Security, Dr. David Va...apidays London 2023 - Overengineering Weakens your API Security, Dr. David Va...
apidays London 2023 - Overengineering Weakens your API Security, Dr. David Va...apidays
apidays London 2023 - API Metrics matters in APIOps, Ludovic Pourrat,  Lombar...apidays London 2023 - API Metrics matters in APIOps, Ludovic Pourrat,  Lombar...
apidays London 2023 - API Metrics matters in APIOps, Ludovic Pourrat, Lombar...apidays
apidays London 2023 - Open Standards, AI and Data for better business decisio...apidays London 2023 - Open Standards, AI and Data for better business decisio...
apidays London 2023 - Open Standards, AI and Data for better business decisio...apidays
Introduction to Cypher Introduction to Cypher
Introduction to Cypher Neo4j
Essential numpy before you start your Machine Learning journey in python.pdfEssential numpy before you start your Machine Learning journey in python.pdf
Essential numpy before you start your Machine Learning journey in python.pdfSmrati Kumar Katiyar
OCTRI PSS Simulations in R Seminar.pdfOCTRI PSS Simulations in R Seminar.pdf
OCTRI PSS Simulations in R Seminar.pdfssuser84c78e

[DSC Adria 23] Miro MIljanic Telco Data Pipelines in the Cloud Architecture and Use Cases.pptx

Editor's Notes

  1. Thank You all for coming today, my name is Miro Miljanić and I’m Data Architect in CROZ and I’m currently responsible for managing Cloud data initiatives. In CROZ, we have a long history of Data and SW engineering and consulting, but we’ve only in last few years began to gain significant experience in Cloud. The talk today is about several of our experiences with data and AI related Cloud initiatives. Although it has a Telco in its name, not all of the examples were built for Telco companies, but, they are legitimate use cases which could be used in any Telecommunication company.
  2. In Act One, the Setup, the main protagonist is introduced, its everyday life and the incident - introduction of love interest or a challenge. In our case this the initial project or POC (pi ou si) description and its drive. Act Two: Conflict - contains the pursue of the main character towards the goal and complications that arise. At one point, the main character seems to have reached its goal, but it is the false victory, there are some deeper issues or the victory is short-lived. It is the Emotional low point of our character where he learns an important lesson and re-evaluate their priorities. In our case this is a turning point – the situation that arouse and changed the course of action. Act Three: Resolution, it’s all about happy ending. Our hero overcomes all its obstacles, wins his love, becomes a better person and they live happily ever after. So, in the third part, I’ll explain what we did to solve the problem.
  3. Single view ment than there should be replication and delta replication from several (different engine) databases into one Cloud DB that should be used not only for corporate reporting, but for other analysis, also. This would be the main purpose of this DB, the applications, data processing and ETL logic will remain on on-prem databases. Reimplementation ment rewriting reporting logic from legacy DB code, views, packages, procedures and legacy reporting tool to new reporting tool. And initial Governace ment that there should be catologization and lineage of data together with data access and security model. As at every change implementation – this was the right time to enforce
  4. So, since this was a large scope of work we began thorough analysis of the requirements, and multiple source systems which contained more than 10k objects and their code. We also started working on architecture and several POCs (pi ou sis) regarding various scenarios: Delta load – what we have to implement on source to propagate of only changes to target DB DB code reimplementation scenarios – how to handle it, where to do it and not to affect application logic. How to manage reporting logic reimplementation, reporting optimizations, data maintenance scenarios and so on
  5. Things were going pretty well, we had a good relationship with the customer and understanding of complexity of the task, analysis and architecture setup was on the track, when we received a following response from the customer: That’s all nice, but this is a bit too much for us now, what could we get for X days? And Yes, the X was not nearly the number we anticipated for the whole solution.
  6. Things were going pretty well, …
  7. So, what did we do?
  8. Anonymization will provide a way to remove private data from the reviews without deleting them, and it will allow to keep the reviews in the database so that it could be used for other purposes in the future e.g. -topic extraction, sentiment analysis
  9. Example of the original message, Regular Anonymization and Synthetic replacement The scope of this projects covers NAMED ENTITY RECOGNITION machine learning problem
  10. Anonymization – Rule engine for rule based anonymization – PII that could be recognized with an algorithm, by specific format e.g. for e-mail, phone number. PII detection puts labels together with label confidence, for each label.
  11. Things were going pretty well, … AWS, Databricks problem
  12. 1. Human in the loop - Label studio – open source application – integrated in our solution, deployed as app service on Azure 2. Human in the loop enhancement – Human in the loop is used as gold label – new training data
  13. AWS - 2006, S3, EC2 Azure - 2008, commercial release - 2010 Google - 2010