Building data pipelines
from simple to more advanced - hands-on
Sergii Khomenko, Data Scientist
sergii.khomenko@stylight.com, @lc0d3r
CrunchConf - October 29, 2015
Sergii Khomenko
Data scientist at Stylight, one of the biggest fashion communities.
Data analysis and visualisation hobbyist who works on data problems both during and outside working hours, for fun and for personal data visualisations.
Speaker at Berlin Buzzwords 2014, ApacheCon Europe 2014, Puppet Camp London, Berlin Buzzwords 2015, Tableau Conference on Tour and Budapest BI Forum 2015.
Profitable Leads
Stylight provides its partners with high-quality leads, enabling partner shops to leverage Stylight as an ROI-positive traffic channel.
Inspiration
Stylight offers shoppable inspiration that makes it easy to know what to buy and how to style it.
Branding & Reach
Stylight offers a unique opportunity for brands to reach an audience that is actively looking for style online.
Shopping
Stylight helps users search and shop fashion and lifestyle products smarter across hundreds of shops.
Stylight – Make Style Happen
Core Target Group
Stylight helps aspiring women between 18 and 35 evolve their style through shoppable inspiration.
Stylight – acting on a global scale
Experienced & Ambitious Team
Innovative cross-functional organisation with flat hierarchy builds a unique team spirit.
• +200 employees
• 40 PhDs/Engineers
• 28 years average age
• 63% female
• 23 nationalities
• 0 suits
Agenda
• The Good, The Bad And The Legacy
• Open Source stack
• Amazon AWS
• Google Cloud
• Tips, tricks and best practices
In computing, a pipeline is a set of data processing elements connected in series, where the output of one element is the input of the next one.
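A minimal sketch of that definition in Python. The three stages (`parse`, `only_purchases`, `total_cents`) and the `user,action,amount_cents` line format are hypothetical and not taken from the talk; the point is only that each stage consumes what the previous one produced.

```python
from functools import reduce
from typing import Callable, Iterable, Iterator, Sequence


def parse(lines: Iterable[str]) -> Iterator[dict]:
    """Stage 1: turn 'user,action,amount_cents' lines into dictionaries."""
    for line in lines:
        user, action, amount = line.strip().split(",")
        yield {"user": user, "action": action, "amount_cents": int(amount)}


def only_purchases(events: Iterable[dict]) -> Iterator[dict]:
    """Stage 2: keep purchase events only."""
    return (e for e in events if e["action"] == "purchase")


def total_cents(events: Iterable[dict]) -> int:
    """Stage 3: reduce the stream to a single number."""
    return sum(e["amount_cents"] for e in events)


def run_pipeline(data, stages: Sequence[Callable]):
    """Connect the stages in series: the output of one is the input of the next."""
    return reduce(lambda acc, stage: stage(acc), stages, data)


# Hypothetical input data, for illustration only.
raw = ["anna,view,0", "anna,purchase,4990", "ben,purchase,2010"]
result = run_pipeline(raw, [parse, only_purchases, total_cents])
print(result)
```

Because the first two stages are generators, events flow through one at a time; nothing is materialised until the last stage sums them.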
The Good, The Bad And The Legacy
Sources of data:
• Web tracking
• Metrics tracking
• Behaviour tracking
• Business intelligence ETL
• Internal Services
• ML tagging service
Access patterns
• Real-time
• Nearly real-time
• Daily batches
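One way to read the three access patterns is by how many events are grouped before they are processed. The sketch below is an illustration under assumptions: the event shape (`ts` as Unix seconds, `type`) and the 60-second window are hypothetical, and a real streaming system would not collect everything in memory first.

```python
from collections import defaultdict
from datetime import datetime, timezone

# Hypothetical events; timestamps are Unix seconds in UTC.
events = [
    {"ts": 1446105600, "type": "view"},      # 2015-10-29 08:00:00
    {"ts": 1446105630, "type": "click"},     # 2015-10-29 08:00:30
    {"ts": 1446105700, "type": "view"},      # 2015-10-29 08:01:40
    {"ts": 1446192000, "type": "purchase"},  # 2015-10-30 08:00:00
]


def real_time(stream, handle):
    """Real-time: handle every event as soon as it arrives."""
    for event in stream:
        handle(event)


def micro_batches(stream, window_seconds=60):
    """Nearly real-time: group events into short fixed time windows."""
    batches = defaultdict(list)
    for event in stream:
        batches[event["ts"] // window_seconds].append(event)
    return [batches[key] for key in sorted(batches)]


def daily_batches(stream):
    """Daily batches: group events by UTC calendar day."""
    days = defaultdict(list)
    for event in stream:
        day = datetime.fromtimestamp(event["ts"], tz=timezone.utc).date()
        days[day.isoformat()].append(event)
    return dict(days)


seen = []
real_time(events, seen.append)
minute_sizes = [len(batch) for batch in micro_batches(events)]
day_sizes = {day: len(batch) for day, batch in daily_batches(events).items()}
print(len(seen), minute_sizes, day_sizes)
```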
Properties
• Data consistency
• Doesn’t scale
• Hard to add new sources
• Complex system
• Many interfaces
• As lean and legacy as possible
• No need for special services
Streaming
Open Source Stack
http://lambda-architecture.net/
Apache Kafka is publish-subscribe messaging rethought as a distributed commit log.
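To make the "commit log" idea concrete, here is a toy, single-process sketch. It is not the Kafka API and leaves out distribution, partitions and persistence; `CommitLog` and `Consumer` are hypothetical names. What it shows is the core idea: records are only ever appended, and each subscriber tracks its own offset, so several consumers can read the same log independently.

```python
class CommitLog:
    """Toy in-memory append-only log (illustration only, not Kafka)."""

    def __init__(self):
        self._records = []

    def append(self, record) -> int:
        """Append a record and return its offset in the log."""
        self._records.append(record)
        return len(self._records) - 1

    def read(self, offset: int, max_records: int = 10) -> list:
        """Return up to max_records records starting at offset."""
        return self._records[offset:offset + max_records]


class Consumer:
    """Each consumer remembers how far it has read; the log is never mutated."""

    def __init__(self, log: CommitLog):
        self._log = log
        self.offset = 0

    def poll(self, max_records: int = 10) -> list:
        records = self._log.read(self.offset, max_records)
        self.offset += len(records)
        return records


log = CommitLog()
for event in ("page_view", "add_to_cart", "checkout"):
    log.append(event)

bi = Consumer(log)  # e.g. a reporting job
ml = Consumer(log)  # e.g. a tagging job reading the same data

first = bi.poll(max_records=2)
everything = ml.poll()
rest = bi.poll()
print(first, everything, rest)
```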
http://www.ipponusa.com/wp-content/uploads/2014/10/spark-architecture.jpg
Results
• Scalable
• Flexible
• High costs of maintenance
• Not so easy to set up
A programming language is low level when its programs require attention to the irrelevant.
Alan Jay Perlis / Epigrams on Programming
Amazon AWS
Kinesis Streams
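A hedged sketch of how a website event could be written to a Kinesis stream. `put_record` with `StreamName`, `Data` and `PartitionKey` is the boto3 Kinesis client call; everything else is an assumption for illustration: the stream name `website-events`, the event fields, and `FakeKinesisClient`, which stands in for the real client so the example runs without AWS credentials.

```python
import json


def to_kinesis_record(event: dict, stream_name: str) -> dict:
    """Build the keyword arguments for a Kinesis put_record call."""
    return {
        "StreamName": stream_name,
        "Data": json.dumps(event, sort_keys=True).encode("utf-8"),
        # Records with the same partition key go to the same shard.
        "PartitionKey": str(event["user_id"]),
    }


def send_event(client, event: dict, stream_name: str = "website-events") -> dict:
    """Send one event; `client` is anything with a boto3-style put_record."""
    return client.put_record(**to_kinesis_record(event, stream_name))


class FakeKinesisClient:
    """Hypothetical stand-in that records calls instead of contacting AWS."""

    def __init__(self):
        self.calls = []

    def put_record(self, **kwargs):
        self.calls.append(kwargs)
        return {"ShardId": "shardId-000000000000",
                "SequenceNumber": str(len(self.calls))}


# With boto3 installed and credentials configured, the client would be
# boto3.client("kinesis") instead of the fake one.
client = FakeKinesisClient()
response = send_event(client, {"user_id": 42, "type": "page_view"})
print(response["ShardId"])
```

Using the user id as partition key is a design choice of this sketch: it keeps one user's events ordered within a shard, at the cost of uneven load if a few users are very active.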
[Diagram labels: business development & finance; website events; enrichment; Business Intelligence]
Kinesis Firehose
Kinesis Analytics
[Diagram labels: custom unification pipeline; Product Processing; Business Intelligence; ML/Tagging; Product events (variety of event types and structures)]
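The labels above mention a custom unification pipeline for events that come in a variety of types and structures. One possible shape for such a step is sketched below; it is not Stylight's implementation. The two source formats, the field names and the `NORMALIZERS` registry are all hypothetical.

```python
def from_web_tracking(raw: dict) -> dict:
    """Normalise a hypothetical flat web-tracking event."""
    return {
        "source": "web",
        "product_id": str(raw["pid"]),
        "event": raw["action"],
        "ts": int(raw["timestamp"]),
    }


def from_tagging_service(raw: dict) -> dict:
    """Normalise a hypothetical nested event from an ML tagging service."""
    return {
        "source": "ml_tagging",
        "product_id": str(raw["product"]["id"]),
        "event": "tagged",
        "ts": int(raw["created_at"]),
    }


# One normaliser per source; adding a source means adding one entry.
NORMALIZERS = {
    "web": from_web_tracking,
    "ml_tagging": from_tagging_service,
}


def unify(source: str, raw: dict) -> dict:
    """Map a source-specific event onto the common schema."""
    try:
        normalize = NORMALIZERS[source]
    except KeyError:
        raise ValueError(f"unknown source: {source}") from None
    return normalize(raw)


unified = [
    unify("web", {"pid": 7, "action": "click", "timestamp": "1446105600"}),
    unify("ml_tagging", {"product": {"id": "7"}, "tags": ["dress"],
                         "created_at": 1446105660}),
]
print(unified)
```

Downstream consumers then only need to understand the four common fields, regardless of where an event originated.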
AWS Data Pipeline
Google Cloud
Tips, tricks and best practices
Cross-Functional Team
Department: mission-oriented team with all resources and the fewest dependencies
Product Team: builds the software the department or its customers use
Squad: team that executes the product development
[Org chart labels: Department; Product Team; Squad; PO; Engineer; Engineer; Designer; Data Scientist; Head of; Business Role; Business Role]
Cross-Functional Team
• You build it - you run it
• You check your numbers (domain knowledge)
• You provide your data as interface layer
• Data report comes after data tracking
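One possible reading of "you provide your data as interface layer" and "you check your numbers" is that the owning team publishes a contract for its data and validates records against it before others consume them. The sketch below is an interpretation, not something the slides specify; `EVENT_CONTRACT` and its fields are hypothetical.

```python
# Hypothetical contract: field name -> expected Python type.
EVENT_CONTRACT = {"user_id": int, "event": str, "amount_cents": int}


def violations(record: dict, contract: dict = EVENT_CONTRACT) -> list:
    """Return human-readable contract violations (empty list if none)."""
    problems = []
    for field, expected in contract.items():
        if field not in record:
            problems.append(f"missing field: {field}")
        elif not isinstance(record[field], expected):
            actual = type(record[field]).__name__
            problems.append(f"{field}: expected {expected.__name__}, got {actual}")
    return problems


good = {"user_id": 1, "event": "purchase", "amount_cents": 4990}
bad = {"user_id": "1", "event": "purchase"}

print(violations(good))
print(violations(bad))
```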
I think that it's extraordinarily important that we in computer science keep fun in computing. When it started out, it was an awful lot of fun.
Alan Jay Perlis / The Structure and Interpretation of Computer Programs
www.stylight.com
sergii.khomenko@stylight.com
@lc0d3r
Related talks
• Helping Data Teams with Puppet / Puppet Camp London
• Secure Data Scalability at Stylight with Tableau Online and Amazon Redshift / Tableau Conference on Tour - Berlin
• Google Cloud Dataflow Two Worlds Become a Much Better One

Building data pipelines: from simple to more advanced - hands-on experience / CrunchConf - Oct 29, 2015