SlideShare a Scribd company logo
1 of 42
Download to read offline
BACKING UP OUR TABLEAU -
STEPS TOWARDS A
RELIABLE REPORTING SOLUTION
Sergii Khomenko, Data Scientist
Dr. Konstantin Wemhöner, Head of Business Intelligence
WHAT
IS
STYLIGHT
?
A S H O R T
I N T R O D U C T I O N
STYLIGHT.de Seite 2 / 42
CONTENT meets COMMERCE
N
E
W
The best place to discover & shop fashion.
STYLIGHT.DE
STYLIGHT.de Seite 3 / 42
GLOBAL INSPIRATION – LOCAL COMMERCE
A V A I L A B L E I N 1 4 C O U N T R I E S
Germany, Austria, Switzerland, Netherlands, France, Italy, Sweden, UK,
Spain, Australia, Brazil, US, Norway, Belgium
STYLIGHT.de Seite 4 / 42
STYLIGHT ON THE GO
W H E N E V E R .
W H E R E V E R .
STYLIGHT.de Seite 5 / 42
PROUD TO BLEED PURPLE
•	 Founded: 2008 in Munich
•	 OFFICES: Munich, London, New York
•	 Investors: Holtzbrinck Ventures,
	 Tengelmann Ventures, Seven Ventures
•	 Business Partners: 350+ partner shops
	 worldwide with 6000+ brands
•	 Total Employees — 160+
	 (over 19 nationalities from 4 continents)
F A C T S A N D F I G U R E S
TOTALN°OFEMPLOYEES
50
100
150
201520142013201220112010
STYLIGHT.de Seite 6 / 42
GROSS MERCHANDISE VALUE
$360 MILLION
$2 70 MILLION
$175 MILLION
$50 MILL2011
2012
2013
2014
STYLIGHT.de Seite 7 / 42
BI SETUP BEFORE OUTAGE
SETUP OF OUR REPORTING WORKFLOW
Department
Business Intelligence
STYLIGHT.de Seite 9 / 42
OUR FIRST “DWH”: TABLEAU ONLINE
STYLIGHT.de Seite 10 / 42
WHY WE CHOSE TABLEAU ONLINE?
•	 Easy to start using
•	 Works for free
•	 All data sources in one place
•	 Unified routine
STYLIGHT.de Seite 11 / 42
WHY WE CHOSE TABLEAU ONLINE?
•	 combination of local and online/cloud sources 	
		 (Google Analytics, JDBC…)
•	 Sharing cross-continents - instantaneous
•	 easy distribution of reports with Tabcmd
STYLIGHT.de Seite 12 / 42
SAMPLE REPORTS FROM OUR
ENVIRONMENT
STYLIGHT.de Seite 13 / 42
SAMPLE REPORTS FROM OUR
ENVIRONMENT
STYLIGHT.de Seite 14 / 42
3 TYPES OF DATA SOURCES
static snapshot incremental
STYLIGHT.de Seite 15 / 42
HOW DATA SOURCES WERE UPDATED
STYLIGHT.de Seite 16 / 42
LOADING AND MONITORING
BEFORE OUTAGE
•	 25 workbooks online with 119 views from
		 80 data sources
•	 Scheduled mails
•	 All refreshes scheduled manually
STYLIGHT.de Seite 17 / 42
AND THEN IT CRASHED ...
STYLIGHT.de Seite 18 / 42
BACK FROM CHRISTMAS …
AND EVERYTHING CRASHED
STYLIGHT.de Seite 19 / 42
SERVER OUTAGE JANUARY 2015
•	 Started with empty scheduled mail reports
		 (9th Jan)
•	 Monday >80% of views not working
•	 No clear communication from Tableau
•	 Server outage during our scheduled refreshes
STYLIGHT.de Seite 20 / 42
SERIOUS DOWNTIME OF REPORTING
INFRASTRUCTURE
STYLIGHT.de Seite 21 / 42
3 TYPES OF DATA SOURCES AND HOW THEY
WERE AFFECTED
STYLIGHT.de Seite 22 / 42
FIRST THINGS FIRST: FIREFIGHTING
Replacement of all data sources in workbooks
Open Local copy New extract Replace
STYLIGHT.de Seite 23 / 42
HOW TO REBUILD A BROKEN DATA SOURCE?
Biggest Issue: Workbooks could not be opened
due to broken data source
Understand how a Tableau data extract is build
Find a way to extract and recreate the essential parts
of a TDE
STYLIGHT.de Seite 24 / 42
STYLIGHT.de Seite 25 / 42
THE INSIDE OF A TABLEAU DATA SOURCE FILE
STYLIGHT.de Seite 26 / 42
RECREATING THE FILES CONTENT
STYLIGHT.de Seite 27 / 42
ISSUES, PLANS
•	 We have all DS accessible
•	 We know where data comes from
•	 Structure re-creation
•	 Migration without any manual input
STYLIGHT.de Seite 28 / 42
DWH WITH
AMAZON REDSHIFT
IMPROVING OUR TECHNICAL SETUP
STYLIGHT.de Seite 30 / 42
GENERATION OF DATA INPUT FOR REDSHIFT
STYLIGHT.de Seite 31 / 42
SERVER-SIDE MONITORING OF DATA REFRESHES
STYLIGHT.de Seite 32 / 42
TRACKING DWH PERFORMANCE
STYLIGHT.de Seite 33 / 42
BENEFITS
•	 Control over backups
•	 Control over refreshes
•	 Scale DWH up to petabyte scale
•	 Easy to add new ETL stages (EMR)
•	 More open for new challenges
STYLIGHT.de Seite 34 / 42
MONITORING INSTALLED FOR ALL REFRESHING
DATA SOURCES
STYLIGHT.de Seite 35 / 42
ADDITIONAL ERROR LOGGING WITH LOGGLY
STYLIGHT.de Seite 36 / 42
OUTCOME &
FUTURE PLANS
POSITIVE OUTCOMES
•	 Number of data sources reduced by 30%
•	 Speed increase by using RedShift factor >100
•	 Scalable infrastructure for growing company
•	 More flexible connection of tables via RedShift
STYLIGHT.de Seite 38 / 42
IMPROVING IT TO THE NEXT LEVEL!
•	 Open Source our Python tools
•	 Internal DWH mapping server
•	 Flexible to integrate new things
•	 Google Spreadsheet integration
STYLIGHT.de Seite 39 / 42
HOW TO REACH US
T O O L S , T U G M U N I C H
Sergii Khomenko
sergii.khomenko@stylight.com
@lc0d3r
G E N E R A L I N F O , B I J O B S
Dr. Konstantin Wemhöner
konstantin.wemhoener@stylight.com
@kwarks85
STYLIGHT Engineering: @CodeTailors
STYLIGHT.de Seite 40 / 42
STYLIGHT.de Seite 41 / 42
STYLIGHT
Nymphenburger Straße 86
80636 Munich, Germany
Join us on Facebook: facebook.com/stylight
Follow us on Twitter: twitter.com/stylight
Follow us on Instagram: instagram.com/stylight
STYLIGHT.de Seite 42 / 42

More Related Content

Similar to Secure Data Scalability at Stylight with Tableau Online and Amazon Redshift / Tableau Conference on Tour - Berlin - Jun 9, 2015

Building a Data Ingestion & Processing Pipeline with Spark & Airflow
Building a Data Ingestion & Processing Pipeline with Spark & AirflowBuilding a Data Ingestion & Processing Pipeline with Spark & Airflow
Building a Data Ingestion & Processing Pipeline with Spark & AirflowTom Lous
 
2015-11-24-pepite-data-analytics
2015-11-24-pepite-data-analytics2015-11-24-pepite-data-analytics
2015-11-24-pepite-data-analyticsSirris
 
Designing high performance datawarehouse
Designing high performance datawarehouseDesigning high performance datawarehouse
Designing high performance datawarehouseUday Kothari
 
Chris Day VP IT Transformation and Office of the CIO at AstraZeneca
Chris Day VP IT Transformation and Office of the CIO at AstraZenecaChris Day VP IT Transformation and Office of the CIO at AstraZeneca
Chris Day VP IT Transformation and Office of the CIO at AstraZenecaSteve Ashton
 
Metrics 4 faster feedback
Metrics 4 faster feedbackMetrics 4 faster feedback
Metrics 4 faster feedbackKris Buytaert
 
DIADEM: domain-centric intelligent automated data extraction methodology Pres...
DIADEM: domain-centric intelligent automated data extraction methodology Pres...DIADEM: domain-centric intelligent automated data extraction methodology Pres...
DIADEM: domain-centric intelligent automated data extraction methodology Pres...DBOnto
 
Diadem DBOnto Kick Off meeting
Diadem DBOnto Kick Off meetingDiadem DBOnto Kick Off meeting
Diadem DBOnto Kick Off meetingDBOnto
 
Modern SSO Using the MEAN Stack
Modern SSO Using the MEAN StackModern SSO Using the MEAN Stack
Modern SSO Using the MEAN StackMongoDB
 
Industry AHEAD: Empower and boost aftermarket sales
Industry AHEAD: Empower and boost aftermarket salesIndustry AHEAD: Empower and boost aftermarket sales
Industry AHEAD: Empower and boost aftermarket salesCreuna
 
Overcoming Digital Transformation Pain Points
Overcoming Digital Transformation Pain PointsOvercoming Digital Transformation Pain Points
Overcoming Digital Transformation Pain PointsInductive Automation
 
TUI Nordic: Ensuring great user experience for 80 million site visitors
TUI Nordic: Ensuring great user experience for 80 million site visitorsTUI Nordic: Ensuring great user experience for 80 million site visitors
TUI Nordic: Ensuring great user experience for 80 million site visitorsDynatrace
 
Pivoting event streaming, from PROJECTS to a PLATFORM
Pivoting event streaming, from PROJECTS to a PLATFORMPivoting event streaming, from PROJECTS to a PLATFORM
Pivoting event streaming, from PROJECTS to a PLATFORMconfluent
 
Innovative Data Strategies for Advanced Analytics Solutions and the Role of D...
Innovative Data Strategies for Advanced Analytics Solutions and the Role of D...Innovative Data Strategies for Advanced Analytics Solutions and the Role of D...
Innovative Data Strategies for Advanced Analytics Solutions and the Role of D...Denodo
 
Big Data Warsaw v 4 I "Startups: Lifeguards of the Corporate Data Lake" - Fel...
Big Data Warsaw v 4 I "Startups: Lifeguards of the Corporate Data Lake" - Fel...Big Data Warsaw v 4 I "Startups: Lifeguards of the Corporate Data Lake" - Fel...
Big Data Warsaw v 4 I "Startups: Lifeguards of the Corporate Data Lake" - Fel...Dataconomy Media
 
Taming the Data Lake with Scalable Metrics Model Framework
Taming the Data Lake with Scalable Metrics Model FrameworkTaming the Data Lake with Scalable Metrics Model Framework
Taming the Data Lake with Scalable Metrics Model FrameworkRamkumar Ravichandran
 
Data, Interconnectedness & The Internet of Things
Data, Interconnectedness & The Internet of Things Data, Interconnectedness & The Internet of Things
Data, Interconnectedness & The Internet of Things Software AG
 
How to drive LightsOutPlanning by bluecrux
How to drive LightsOutPlanning by bluecruxHow to drive LightsOutPlanning by bluecrux
How to drive LightsOutPlanning by bluecruxBluecrux
 

Similar to Secure Data Scalability at Stylight with Tableau Online and Amazon Redshift / Tableau Conference on Tour - Berlin - Jun 9, 2015 (20)

Building a Data Ingestion & Processing Pipeline with Spark & Airflow
Building a Data Ingestion & Processing Pipeline with Spark & AirflowBuilding a Data Ingestion & Processing Pipeline with Spark & Airflow
Building a Data Ingestion & Processing Pipeline with Spark & Airflow
 
2015-11-24-pepite-data-analytics
2015-11-24-pepite-data-analytics2015-11-24-pepite-data-analytics
2015-11-24-pepite-data-analytics
 
Designing high performance datawarehouse
Designing high performance datawarehouseDesigning high performance datawarehouse
Designing high performance datawarehouse
 
Chris Day VP IT Transformation and Office of the CIO at AstraZeneca
Chris Day VP IT Transformation and Office of the CIO at AstraZenecaChris Day VP IT Transformation and Office of the CIO at AstraZeneca
Chris Day VP IT Transformation and Office of the CIO at AstraZeneca
 
Metrics 4 faster feedback
Metrics 4 faster feedbackMetrics 4 faster feedback
Metrics 4 faster feedback
 
DIADEM: domain-centric intelligent automated data extraction methodology Pres...
DIADEM: domain-centric intelligent automated data extraction methodology Pres...DIADEM: domain-centric intelligent automated data extraction methodology Pres...
DIADEM: domain-centric intelligent automated data extraction methodology Pres...
 
Diadem DBOnto Kick Off meeting
Diadem DBOnto Kick Off meetingDiadem DBOnto Kick Off meeting
Diadem DBOnto Kick Off meeting
 
Modern SSO Using the MEAN Stack
Modern SSO Using the MEAN StackModern SSO Using the MEAN Stack
Modern SSO Using the MEAN Stack
 
Industry AHEAD: Empower and boost aftermarket sales
Industry AHEAD: Empower and boost aftermarket salesIndustry AHEAD: Empower and boost aftermarket sales
Industry AHEAD: Empower and boost aftermarket sales
 
Overcoming Digital Transformation Pain Points
Overcoming Digital Transformation Pain PointsOvercoming Digital Transformation Pain Points
Overcoming Digital Transformation Pain Points
 
TUI Nordic: Ensuring great user experience for 80 million site visitors
TUI Nordic: Ensuring great user experience for 80 million site visitorsTUI Nordic: Ensuring great user experience for 80 million site visitors
TUI Nordic: Ensuring great user experience for 80 million site visitors
 
Connected Business Days 2017
Connected Business Days 2017Connected Business Days 2017
Connected Business Days 2017
 
Pivoting event streaming, from PROJECTS to a PLATFORM
Pivoting event streaming, from PROJECTS to a PLATFORMPivoting event streaming, from PROJECTS to a PLATFORM
Pivoting event streaming, from PROJECTS to a PLATFORM
 
Innovative Data Strategies for Advanced Analytics Solutions and the Role of D...
Innovative Data Strategies for Advanced Analytics Solutions and the Role of D...Innovative Data Strategies for Advanced Analytics Solutions and the Role of D...
Innovative Data Strategies for Advanced Analytics Solutions and the Role of D...
 
Big Data Warsaw v 4 I "Startups: Lifeguards of the Corporate Data Lake" - Fel...
Big Data Warsaw v 4 I "Startups: Lifeguards of the Corporate Data Lake" - Fel...Big Data Warsaw v 4 I "Startups: Lifeguards of the Corporate Data Lake" - Fel...
Big Data Warsaw v 4 I "Startups: Lifeguards of the Corporate Data Lake" - Fel...
 
Taming data lake - scalable metrics model
Taming data lake - scalable metrics modelTaming data lake - scalable metrics model
Taming data lake - scalable metrics model
 
Solid Works
Solid WorksSolid Works
Solid Works
 
Taming the Data Lake with Scalable Metrics Model Framework
Taming the Data Lake with Scalable Metrics Model FrameworkTaming the Data Lake with Scalable Metrics Model Framework
Taming the Data Lake with Scalable Metrics Model Framework
 
Data, Interconnectedness & The Internet of Things
Data, Interconnectedness & The Internet of Things Data, Interconnectedness & The Internet of Things
Data, Interconnectedness & The Internet of Things
 
How to drive LightsOutPlanning by bluecrux
How to drive LightsOutPlanning by bluecruxHow to drive LightsOutPlanning by bluecrux
How to drive LightsOutPlanning by bluecrux
 

More from Sergii Khomenko

From Data Science to Production - deploy, scale, enjoy! / PyData Amsterdam - ...
From Data Science to Production - deploy, scale, enjoy! / PyData Amsterdam - ...From Data Science to Production - deploy, scale, enjoy! / PyData Amsterdam - ...
From Data Science to Production - deploy, scale, enjoy! / PyData Amsterdam - ...Sergii Khomenko
 
Building Data applications with Go: from Bloom filters to Data pipelines / FO...
Building Data applications with Go: from Bloom filters to Data pipelines / FO...Building Data applications with Go: from Bloom filters to Data pipelines / FO...
Building Data applications with Go: from Bloom filters to Data pipelines / FO...Sergii Khomenko
 
Building data pipelines: from simple to more advanced - hands-on experience /...
Building data pipelines: from simple to more advanced - hands-on experience /...Building data pipelines: from simple to more advanced - hands-on experience /...
Building data pipelines: from simple to more advanced - hands-on experience /...Sergii Khomenko
 
Scaling up Business Intelligence from the scratch and to 15 countries worldwi...
Scaling up Business Intelligence from the scratch and to 15 countries worldwi...Scaling up Business Intelligence from the scratch and to 15 countries worldwi...
Scaling up Business Intelligence from the scratch and to 15 countries worldwi...Sergii Khomenko
 
Helping Data Teams with Puppet / Puppet Camp London - Apr 13, 2015
Helping Data Teams with Puppet / Puppet Camp London - Apr 13, 2015Helping Data Teams with Puppet / Puppet Camp London - Apr 13, 2015
Helping Data Teams with Puppet / Puppet Camp London - Apr 13, 2015Sergii Khomenko
 
Scaling your Tableau - Migrating from Tableau Online to a proper DWH solution...
Scaling your Tableau - Migrating from Tableau Online to a proper DWH solution...Scaling your Tableau - Migrating from Tableau Online to a proper DWH solution...
Scaling your Tableau - Migrating from Tableau Online to a proper DWH solution...Sergii Khomenko
 
Building Ranking Infrastructure: Data-Driven, Lean, Flexible - Sergii Khomenk...
Building Ranking Infrastructure: Data-Driven, Lean, Flexible - Sergii Khomenk...Building Ranking Infrastructure: Data-Driven, Lean, Flexible - Sergii Khomenk...
Building Ranking Infrastructure: Data-Driven, Lean, Flexible - Sergii Khomenk...Sergii Khomenko
 
From simple to more advanced: Lessons learned in 13 months with Tableau
From simple to more advanced: Lessons learned in 13 months with TableauFrom simple to more advanced: Lessons learned in 13 months with Tableau
From simple to more advanced: Lessons learned in 13 months with TableauSergii Khomenko
 
Crunching data with go: Tips, tricks, use-cases
Crunching data with go: Tips, tricks, use-casesCrunching data with go: Tips, tricks, use-cases
Crunching data with go: Tips, tricks, use-casesSergii Khomenko
 
Lean Ranking infrastructure with Solr
Lean Ranking infrastructure with SolrLean Ranking infrastructure with Solr
Lean Ranking infrastructure with SolrSergii Khomenko
 
Data Visualization with R
Data Visualization with RData Visualization with R
Data Visualization with RSergii Khomenko
 

More from Sergii Khomenko (11)

From Data Science to Production - deploy, scale, enjoy! / PyData Amsterdam - ...
From Data Science to Production - deploy, scale, enjoy! / PyData Amsterdam - ...From Data Science to Production - deploy, scale, enjoy! / PyData Amsterdam - ...
From Data Science to Production - deploy, scale, enjoy! / PyData Amsterdam - ...
 
Building Data applications with Go: from Bloom filters to Data pipelines / FO...
Building Data applications with Go: from Bloom filters to Data pipelines / FO...Building Data applications with Go: from Bloom filters to Data pipelines / FO...
Building Data applications with Go: from Bloom filters to Data pipelines / FO...
 
Building data pipelines: from simple to more advanced - hands-on experience /...
Building data pipelines: from simple to more advanced - hands-on experience /...Building data pipelines: from simple to more advanced - hands-on experience /...
Building data pipelines: from simple to more advanced - hands-on experience /...
 
Scaling up Business Intelligence from the scratch and to 15 countries worldwi...
Scaling up Business Intelligence from the scratch and to 15 countries worldwi...Scaling up Business Intelligence from the scratch and to 15 countries worldwi...
Scaling up Business Intelligence from the scratch and to 15 countries worldwi...
 
Helping Data Teams with Puppet / Puppet Camp London - Apr 13, 2015
Helping Data Teams with Puppet / Puppet Camp London - Apr 13, 2015Helping Data Teams with Puppet / Puppet Camp London - Apr 13, 2015
Helping Data Teams with Puppet / Puppet Camp London - Apr 13, 2015
 
Scaling your Tableau - Migrating from Tableau Online to a proper DWH solution...
Scaling your Tableau - Migrating from Tableau Online to a proper DWH solution...Scaling your Tableau - Migrating from Tableau Online to a proper DWH solution...
Scaling your Tableau - Migrating from Tableau Online to a proper DWH solution...
 
Building Ranking Infrastructure: Data-Driven, Lean, Flexible - Sergii Khomenk...
Building Ranking Infrastructure: Data-Driven, Lean, Flexible - Sergii Khomenk...Building Ranking Infrastructure: Data-Driven, Lean, Flexible - Sergii Khomenk...
Building Ranking Infrastructure: Data-Driven, Lean, Flexible - Sergii Khomenk...
 
From simple to more advanced: Lessons learned in 13 months with Tableau
From simple to more advanced: Lessons learned in 13 months with TableauFrom simple to more advanced: Lessons learned in 13 months with Tableau
From simple to more advanced: Lessons learned in 13 months with Tableau
 
Crunching data with go: Tips, tricks, use-cases
Crunching data with go: Tips, tricks, use-casesCrunching data with go: Tips, tricks, use-cases
Crunching data with go: Tips, tricks, use-cases
 
Lean Ranking infrastructure with Solr
Lean Ranking infrastructure with SolrLean Ranking infrastructure with Solr
Lean Ranking infrastructure with Solr
 
Data Visualization with R
Data Visualization with RData Visualization with R
Data Visualization with R
 

Recently uploaded

Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...Bertram Ludäscher
 
Data Analyst Tasks to do the internship.pdf
Data Analyst Tasks to do the internship.pdfData Analyst Tasks to do the internship.pdf
Data Analyst Tasks to do the internship.pdftheeltifs
 
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制vexqp
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Valters Lauzums
 
Dubai Call Girls Peeing O525547819 Call Girls Dubai
Dubai Call Girls Peeing O525547819 Call Girls DubaiDubai Call Girls Peeing O525547819 Call Girls Dubai
Dubai Call Girls Peeing O525547819 Call Girls Dubaikojalkojal131
 
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...nirzagarg
 
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...Elaine Werffeli
 
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...nirzagarg
 
7. Epi of Chronic respiratory diseases.ppt
7. Epi of Chronic respiratory diseases.ppt7. Epi of Chronic respiratory diseases.ppt
7. Epi of Chronic respiratory diseases.pptibrahimabdi22
 
Switzerland Constitution 2002.pdf.........
Switzerland Constitution 2002.pdf.........Switzerland Constitution 2002.pdf.........
Switzerland Constitution 2002.pdf.........EfruzAsilolu
 
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi ArabiaIn Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabiaahmedjiabur940
 
Aspirational Block Program Block Syaldey District - Almora
Aspirational Block Program Block Syaldey District - AlmoraAspirational Block Program Block Syaldey District - Almora
Aspirational Block Program Block Syaldey District - AlmoraGovindSinghDasila
 
Lecture_2_Deep_Learning_Overview-newone1
Lecture_2_Deep_Learning_Overview-newone1Lecture_2_Deep_Learning_Overview-newone1
Lecture_2_Deep_Learning_Overview-newone1ranjankumarbehera14
 
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...gajnagarg
 
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteedamy56318795
 
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...Klinik kandungan
 
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...nirzagarg
 
Vadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book now
Vadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book nowVadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book now
Vadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book nowgargpaaro
 

Recently uploaded (20)

Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
 
Data Analyst Tasks to do the internship.pdf
Data Analyst Tasks to do the internship.pdfData Analyst Tasks to do the internship.pdf
Data Analyst Tasks to do the internship.pdf
 
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
 
Cytotec in Jeddah+966572737505) get unwanted pregnancy kit Riyadh
Cytotec in Jeddah+966572737505) get unwanted pregnancy kit RiyadhCytotec in Jeddah+966572737505) get unwanted pregnancy kit Riyadh
Cytotec in Jeddah+966572737505) get unwanted pregnancy kit Riyadh
 
Dubai Call Girls Peeing O525547819 Call Girls Dubai
Dubai Call Girls Peeing O525547819 Call Girls DubaiDubai Call Girls Peeing O525547819 Call Girls Dubai
Dubai Call Girls Peeing O525547819 Call Girls Dubai
 
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
 
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
SAC 25 Final National, Regional & Local Angel Group Investing Insights 2024 0...
 
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
 
7. Epi of Chronic respiratory diseases.ppt
7. Epi of Chronic respiratory diseases.ppt7. Epi of Chronic respiratory diseases.ppt
7. Epi of Chronic respiratory diseases.ppt
 
Switzerland Constitution 2002.pdf.........
Switzerland Constitution 2002.pdf.........Switzerland Constitution 2002.pdf.........
Switzerland Constitution 2002.pdf.........
 
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi ArabiaIn Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
 
Aspirational Block Program Block Syaldey District - Almora
Aspirational Block Program Block Syaldey District - AlmoraAspirational Block Program Block Syaldey District - Almora
Aspirational Block Program Block Syaldey District - Almora
 
Sequential and reinforcement learning for demand side management by Margaux B...
Sequential and reinforcement learning for demand side management by Margaux B...Sequential and reinforcement learning for demand side management by Margaux B...
Sequential and reinforcement learning for demand side management by Margaux B...
 
Lecture_2_Deep_Learning_Overview-newone1
Lecture_2_Deep_Learning_Overview-newone1Lecture_2_Deep_Learning_Overview-newone1
Lecture_2_Deep_Learning_Overview-newone1
 
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
 
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
 
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
 
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
 
Vadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book now
Vadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book nowVadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book now
Vadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book now
 

Secure Data Scalability at Stylight with Tableau Online and Amazon Redshift / Tableau Conference on Tour - Berlin - Jun 9, 2015

  • 1. BACKING UP OUR TABLEAU - STEPS TOWARDS A RELIABLE REPORTING SOLUTION Sergii Khomenko, Data Scientist Dr. Konstantin Wemhöner, Head of Business Intelligence
  • 2. WHAT IS STYLIGHT ? A S H O R T I N T R O D U C T I O N STYLIGHT.de Seite 2 / 42
  • 3. CONTENT meets COMMERCE N E W The best place to discover & shop fashion. STYLIGHT.DE STYLIGHT.de Seite 3 / 42
  • 4. GLOBAL INSPIRATION – LOCAL COMMERCE A V A I L A B L E I N 1 4 C O U N T R I E S Germany, Austria, Switzerland, Netherlands, France, Italy, Sweden, UK, Spain, Australia, Brazil, US, Norway, Belgium STYLIGHT.de Seite 4 / 42
  • 5. STYLIGHT ON THE GO W H E N E V E R . W H E R E V E R . STYLIGHT.de Seite 5 / 42
  • 6. PROUD TO BLEED PURPLE • Founded: 2008 in Munich • OFFICES: Munich, London, New York • Investors: Holtzbrinck Ventures, Tengelmann Ventures, Seven Ventures • Business Partners: 350+ partner shops worldwide with 6000+ brands • Total Employees — 160+ (over 19 nationalities from 4 continents) F A C T S A N D F I G U R E S TOTALN°OFEMPLOYEES 50 100 150 201520142013201220112010 STYLIGHT.de Seite 6 / 42
  • 7. GROSS MERCHANDISE VALUE $360 MILLION $2 70 MILLION $175 MILLION $50 MILL2011 2012 2013 2014 STYLIGHT.de Seite 7 / 42
  • 9. SETUP OF OUR REPORTING WORKFLOW Department Business Intelligence STYLIGHT.de Seite 9 / 42
  • 10. OUR FIRST “DWH”: TABLEAU ONLINE STYLIGHT.de Seite 10 / 42
  • 11. WHY WE CHOSE TABLEAU ONLINE? • Easy to start using • Works for free • All data sources in one place • Unified routine STYLIGHT.de Seite 11 / 42
  • 12. WHY WE CHOSE TABLEAU ONLINE? • combination of local and online/cloud sources (Google Analytics, JDBC…) • Sharing cross-continents - instantaneous • easy distribution of reports with Tabcmd STYLIGHT.de Seite 12 / 42
  • 13. SAMPLE REPORTS FROM OUR ENVIRONMENT STYLIGHT.de Seite 13 / 42
  • 14. SAMPLE REPORTS FROM OUR ENVIRONMENT STYLIGHT.de Seite 14 / 42
  • 15. 3 TYPES OF DATA SOURCES static snapshot incremental STYLIGHT.de Seite 15 / 42
  • 16. HOW DATA SOURCES WERE UPDATED STYLIGHT.de Seite 16 / 42
  • 17. LOADING AND MONITORING BEFORE OUTAGE • 25 workbooks online with 119 views from 80 data sources • Scheduled mails • All refreshes scheduled manually STYLIGHT.de Seite 17 / 42
  • 18. AND THEN IT CRASHED ... STYLIGHT.de Seite 18 / 42
  • 19. BACK FROM CHRISTMAS … AND EVERYTHING CRASHED STYLIGHT.de Seite 19 / 42
  • 20. SERVER OUTAGE JANUARY 2015 • Started with empty scheduled mail reports (9th Jan) • Monday >80% of views not working • No clear communication from Tableau • Server outage during our scheduled refreshes STYLIGHT.de Seite 20 / 42
  • 21. SERIOUS DOWNTIME OF REPORTING INFRASTRUCTURE STYLIGHT.de Seite 21 / 42
  • 22. 3 TYPES OF DATA SOURCES AND HOW THEY WERE AFFECTED STYLIGHT.de Seite 22 / 42
  • 23. FIRST THINGS FIRST: FIREFIGHTING Replacement of all data sources in workbooks Open Local copy New extract Replace STYLIGHT.de Seite 23 / 42
  • 24. HOW TO REBUILD A BROKEN DATA SOURCE? Biggest Issue: Workbooks could not be opened due to broken data source Understand how a Tableau data extract is build Find a way to extract and recreate the essential parts of a TDE STYLIGHT.de Seite 24 / 42
  • 26. THE INSIDE OF A TABLEAU DATA SOURCE FILE STYLIGHT.de Seite 26 / 42
  • 27. RECREATING THE FILES CONTENT STYLIGHT.de Seite 27 / 42
  • 28. ISSUES, PLANS • We have all DS accessible • We know where data comes from • Structure re-creation • Migration without any manual input STYLIGHT.de Seite 28 / 42
  • 30. IMPROVING OUR TECHNICAL SETUP STYLIGHT.de Seite 30 / 42
  • 31. GENERATION OF DATA INPUT FOR REDSHIFT STYLIGHT.de Seite 31 / 42
  • 32. SERVER-SIDE MONITORING OF DATA REFRESHES STYLIGHT.de Seite 32 / 42
  • 34. BENEFITS • Control over backups • Control over refreshes • Scale DWH up to petabyte scale • Easy to add new ETL stages (EMR) • More open for new challenges STYLIGHT.de Seite 34 / 42
  • 35. MONITORING INSTALLED FOR ALL REFRESHING DATA SOURCES STYLIGHT.de Seite 35 / 42
  • 36. ADDITIONAL ERROR LOGGING WITH LOGGLY STYLIGHT.de Seite 36 / 42
  • 38. POSITIVE OUTCOMES • Number of data sources reduced by 30% • Speed increase by using RedShift factor >100 • Scalable infrastructure for growing company • More flexible connection of tables via RedShift STYLIGHT.de Seite 38 / 42
  • 39. IMPROVING IT TO THE NEXT LEVEL! • Open Source our Python tools • Internal DWH mapping server • Flexible to integrate new things • Google Spreadsheet integration STYLIGHT.de Seite 39 / 42
  • 40. HOW TO REACH US T O O L S , T U G M U N I C H Sergii Khomenko sergii.khomenko@stylight.com @lc0d3r G E N E R A L I N F O , B I J O B S Dr. Konstantin Wemhöner konstantin.wemhoener@stylight.com @kwarks85 STYLIGHT Engineering: @CodeTailors STYLIGHT.de Seite 40 / 42
  • 42. STYLIGHT Nymphenburger Straße 86 80636 Munich, Germany Join us on Facebook: facebook.com/stylight Follow us on Twitter: twitter.com/stylight Follow us on Instagram: instagram.com/stylight STYLIGHT.de Seite 42 / 42