SlideShare a Scribd company logo
1 of 61
Extreme Analytics @ eBay
Evolution of GovernedSelf Service Analytics
Agenda
•eBay Today
•Big Data @ eBay
•The HOW?
•Q&A
PRESENTATION TITLE GOES HERE 2
OUR
BUSINESS
Most Powerful
Selling Platform
For business
sellers: the
potential to drive
profitable sales
and build a brand
For consumer
sellers: an easy
way to declutter,
sell and make
money
A partnership not a
competition
Best Choice
Providing the
greatest selection
of inventory for
our buyers
From new,
everyday items to
rare and unique
goods
And incredible
deals only found
on eBay
Most Relevance
A shopping
experience that is
simple, data-driven
and personalized
Enabling buyers to
easily find, compare
and purchase items
they need and want
Highlighting the
unique value that
eBay brings
OUR
STRATEGY
EBAY INC AT A GLANCE
$2.1B
Revenue in Q1 2016
$20.5B
GMV in Q1 2016
162M
Global Active Buyers
57%
International
revenue
Q1 2016 data
$9B
Mobile Volume
314M
App downloads
EBAY MARKETPLACE AT A GLANCE
$19.6B
GMV in Q1 2016
9.5M
New listings added via
mobile per week
300M
Searches each day
63%
Transactions that
ship for free
(in US, UK, DE)
79%
Items sold as new
Q1 2016 data
~900M
Live listings
One of the world’s largest and most vibrant marketplaces
VELOCITY STATS
US
3 car parts or accessories are sold every
A smartphone is sold every
A dress is sold every
1 sec
4 sec
6 sec
UK
A necklace is sold every
A make-up product is sold every
A Lego product is sold every
10 sec
3 sec
19 sec
GERMANY
A truck or car is sold every
A pair of women’s jeans is sold every
A video game is sold every
5 min
4 sec
11 sec
AUSTRALIA
A pair of men’s sunglasses is sold every
A home décor item is sold every
A car or truck part is sold every
1 min
12 sec
4 sec
MOBILE VELOCITY STATS
US
A woman’s handbag is sold every
A car or truck is sold every
An action figure is sold every
10 sec
5 min
10 sec
UK
A tablet is sold every
A cookware item is sold every
A car is sold every
1 min
6 sec
2 min
GERMANY
A pair of women’s shoes is sold every
A watch is sold every
A tire or car part is sold every
20 sec
48 sec
35 sec
AUSTRALIA
A piece of jewelry is sold every
A baby clothing item is sold every
A motorcycle part is sold every
12 sec
46 sec
51 sec
THREE KEY TRENDS ARE
REDEFINING COMMERCE
Smart CommerceSeamless Commerce True Global Commerce
SEAMLESS
COMMERCE
TRUE GLOBAL
COMMERCE of eBay’s business
is international57%
of commercial
sellers engage in
exporting*
95%
languages
8
*Sellers with $10,000 or
more/year in sales
SMART
COMMERCE
Identify an
interesting set of
candidate items,
trends, events, etc.
Personalize
the results
Inspiration
at scale!
PRESENTATION TITLE GOES HERE 13
Volume
Variety
Velocity
VALUE
PRESENTATION TITLE GOES HERE 17
PRESENTATION TITLE GOES HERE 18
Big Data @ eBay
BIG Data VVC
20
>50 TB/day new data
>100 PB/day
>100 Trillion pairs of information
Millionsof queries/day
>7500
business users & analysts
>50k chains of logic
24x7x365
99.98+%Availability
turning over a TB every second
Active/Active
Near-Real-time
>100kdata elements
Always online
Processed
>1.5 x 1012
new records/day
21
TECHNOLOGY
EDW
Analytics
Application
Analysts and Data
Scientists
Management Integrators Business
Owners
Application
Servers
Data Processing
Clusters
Aggregation &
Summarization
Visualization &
Reporting
ClicktoInsights
Kylin
OVER Billings EVENTS FROM 162M EBAY BUYERS
CAPTURED, TRANSFORMED, SYNTHESIZED TO PROVIDE
ACTIONABLE INSIGHTS
22
eBay has one of the largest most active
data platforms in the world.
eBay has one of the largest most active data
platforms in the world with a diverse set of
users.
Product Families
23
CURATED
DATA
PLATFORMS
SQL/RM/SAS/R
TXN
IN
V
BEH
CUS
DATA
EXPLORATION
&
COLLABORATI
ON:
ALATION
ADVANCED
DATA TOOLS on
iHub:
NOUS,GRO,DNA
,M1, SPA, SD,
CC, Adv …
ENTERPRISE
DATA
SERVICE/Feed:
RETAIL
SIGNAL,IDENTI
TY, SS, SH…
REALTIME
DATA
STREAM:
TORA
24
FIVE STAGES
24
4
3
2
1
BEHAVIORALDATA
PLATFORM
ANALYTICAL
APPLICATION
PLATFORM
COLLABORATIVE
IDEATION
PLATFORM
AGILEDATA
WAREHOUSE
AUTONOMOUS
DECISIONING
PLATFORM
5
25
Agile Data Warehousing
EDWs + VDMs1
Semi-Structured SQL++Structured SQL
Low End Enterprise-class System
Contextual-complex analytics, deep,
seasonal, consumable datasets
Production data warehousing,
large concurrent user base
Discover & Explore
Analyze & Report
Enterprise-class System
Unstructured JAVA / C
Structure the unstructured,
detect patterns
Commodity Hardware System
Singularity HadoopTeradata
Enterprise Data Warehouse
DISCOVER & EXPLOREANALYZE & REPORT
26
page
27
Biggest complexity drivers are
 Maintaining separate databases
 weekly/daily/hourly data transfers
 Data inconsistencies
 Data duplication
 Increased complexity
 Loss of centralized viz & control
DMs
A data mart cannot be cheap enough to justify its existence
PRESENTATION TITLE GOES HERE 28
...the wrong way
Data Marts in the Cloud
Customer
Customer
Customer
Customer
Customer
Product
Customer
Product
Customer
Product
Customer
Product
Trx
Customer
Product
Trx
Customer
Product
Trx
Customer
Product
Trx
PRESENTATION TITLE GOES HERE 29
Virtual Data Marts
Customer
Product Transactio
n
Behavior
Virtual
DataMart Virtual
DataMart Virtual
DataMart
Virtual
DataMart
Virtual
DataMart
Virtual
DataMartVirtual
DataMart
Virtual
DataMart
Virtual
DataMart
30
Deep Data Platforms
Hadoop + ddDBMS2
Semi-Structured SQL++Structured SQL
Low End Enterprise-class System
Contextual-complex analytics, deep,
seasonal, consumable datasets
Production data warehousing,
large concurrent user base
Discover & Explore
Analyze & Report
Enterprise-class System
Unstructured JAVA / C
Structure the unstructured,
detect patterns
Commodity Hardware System
Singularity HadoopTeradata
Deep-Data Platforms
DISCOVER & EXPLOREANALYZE & REPORT
31
Behavioral Data Centric
32
The Data Hub
Collaborative Analytics3
33
Collaborative Analytics
Compose
Write and discover queries with ease; understand and reuse
code
easily; drives time and savings.
Catalog & Govern
Document and discover data and concepts; structured and crowd
sourced tagging of content in a stewarded environment.
Answers
Fast, trusted answers for everyone; search for analytic products
(metrics, reports, KPIs).
Forensics
Insightful IT and operational data to expose and
eliminateredundancies
Experts / Stewards
Govern
Simple Data
Management
Analyst
Compose
Better,Faster Queries
Business
Users
Answers
Google for your Data
IT
Forensics
Intelligence about your
data
Wiki + metadata repository
Alation SQL Assistant
Metadata repository
+ +
Storytelling
Mixing textual analysis with graphs
WHAT IT’S LIKECOLLABORATION TOOL
2013
2009
2014 AnswerHub Discussion forum moderated by support
DataHub + for data2010
COLLABORATION JOURNEY
2014
35
The App Platform
Analytics Application Platform4
ENTERPRISE DATA PLATFORM
36
Data Warehouse Data Streams
Batch
Humans
Sets of data
Streams
Systems
Sets of data
Data Services
Services
Applications
Specific calls
Populated
Used by
How
Enterprise
Populated
Used by
How
DQRecon
Data Processing Ecosystem
37
Curated
Streams
Applications
Data Services
ApplicationAnalytics
Data
Scientists
Analysts
BU/PD
Leaders
Site DBs
Real-Time
Data
Sources
External
Data
Sources
ETL
Enterprise
Data
Warehouse
Deep Data
Analytics
Platform
Hadoop
Engineers
Stream
Processing
Caching
DOE
DQFirewall
Buyers/Sellers
38
ENTERPRISE
DATA STREAMS AND SERVICES
39
Automated Decision Support
Signal Detection @ Scale5
Automated Signal Detection
40
Prediction – anomaly signal detection
Massively scalable and automated signal
detection and prediction
 Phase 1: Signal detection
 Phase 2: Root Cause analysis
41
ANALYTICS IN EBAY
Measure Everything
Embedded in our daily life
Bottom-up & Top-down
Think and Live Analytics
Always
But know when to avoid Analysis Paralysis!
Analytics DNA
page
43
IKEA Job Interview
Please have a seat
page
44
Analytics at eBay
Go use data
45
The Diverse User Community
page
46
Diverse User Community
Data
Scientists
Financial
Planning &
Analytics
Site Analysts
Business
Analysts
Consumers
One-off
Analysis
Descriptive, Predictive &
Prescriptive Modeling
Experimentation &
Mining
Standard
Reports
Dashboards
Hadoop
R/SAS/SQL on
Teradata
Excel
Tableau
MicroStrategy
, Diverse Needs& Diverse Tools
47
The Analytics Environment at eBay
 Direct SQL access
 User datasets
 MicroStrategy
 Tableau
 Web based App
 1000+ files
 10,000+ tables
 5000+ reports
 10,000+
 100+ named apps
 Tough to find the right metrics and reports
 Hard to build new metrics and reports
 Impossible to know which metrics and
reports are correct vs old
48
“We can’t solve problems
by using the same kind of
thinking we used when
we created them.”
• - Albert Einstein
51
Organizing for Success
Governed Self Service0
Self-service Strategy changes
everything…
52
The data user experience is….
Incoherent
Isolated
Disjointed
Uncertain
Consolidates all knowledge about data for “Just-in-
Time” use
Unifies a consistent set of Data Products on the hub
Makes it easy to find and trace the path from Business
Insights and summaries to the underlying SQL, metrics
and metadata
Delivers transparency and build trust with Data
Governance
and Stewardship
Comprehensive & Documented -- Self-directed Experience
Insights Hub
ONE portal , ONE framework, ONE analytics app Store
Targeted & Simplified -- Self-service Experiences
SQL Writer Search Collaberation
Knowledge Management
Subject Matter Expert (SME)
Directory and Subject Domain
pages
Business Metrics Glossary
Certified data assets,
endorsements, descriptions.
MoreDetailedMoreSummary
TechnicalAnalysisBusinessInsight
Self Service Strategy, Governed Exploration for Analysis and
Business Insight
DATA GOVERNANCE
54
Business Glossary – Managed articles about logic and language.
Knowledge: What should it be?
Data Asset Certification
Trust: Is this the right view? Who says so? As of when?
Well Managed – Quality checks, release notes, load updates
Trust: Is it ok to use RIGHT NOW?
DATA GOVERNANCE
55
Business GlossaryData Asset Certification Well Managed
What we do: Data knowledge management and data
stewardship
Goals:
• Demystify our data warehouse of tens of thousands of
datasets
• Increase trust in data by increasing transparency
• Save analysts’ time and reduce their opportunities for error
56
Value
Generation
Governed Self Service0
57
Organizing for Success
Purified Data Science6
Data Prep
Data Science
58
59
Data
ScienceData
+
60
A COMPLETE VIEW OF OUR CUSTOMERS
Behavior Demographics & Interests
AttitudeValue to eBay
61
 DATA SCIENCE
Data Data
Science
Business
ImpactData Data Science Data Science Data Science
Business
Impact
Insights
Customer Insights used to make
decisions and set strategy
Predictive Models
Models that predict outcomes
to achieve optimal targeting
Segments
New ways to assess value and
attitudes of our customers
DNA
62
CONVERSION MODEL
User Category Probability
111602**** 1564** 10.1%
111602**** 1562** 6.54%
111602**** 1569** 5.67%
111602**** 3564** 4.33%
111602**** 1397** 1.19%
111602**** 3877** 1.11%
111602**** 9282** 1.01%
111602**** 3607** 0.91%
111602**** 1040** 0.81%
111602**** 1564** 0.76%
111602**** 1040** 0.66%
111602**** 4250** 0.01%
111602**** 5235** 0.01%
• Cart data
• Watch data
• Mobile watch
• Search pages
• Browse data
• Purchase history
Models
Thanks!
• ALEX LIANG
• hliang@ebay.com
• http://www.linkedin.com/in/alexlianghu

More Related Content

What's hot

UF4 Gestor d'arxius web
UF4 Gestor d'arxius webUF4 Gestor d'arxius web
UF4 Gestor d'arxius webRuben Mur
 
Apache Sentry for Hadoop security
Apache Sentry for Hadoop securityApache Sentry for Hadoop security
Apache Sentry for Hadoop securitybigdatagurus_meetup
 
Talend Open Studio for Big Data | Talend Open Studio Tutorial | Talend Online...
Talend Open Studio for Big Data | Talend Open Studio Tutorial | Talend Online...Talend Open Studio for Big Data | Talend Open Studio Tutorial | Talend Online...
Talend Open Studio for Big Data | Talend Open Studio Tutorial | Talend Online...Edureka!
 
Data Visualization Using PowerBI.pdf
Data Visualization Using PowerBI.pdfData Visualization Using PowerBI.pdf
Data Visualization Using PowerBI.pdfR20BBA040
 
Oracle Active Data Guard and Global Data Services in Action!
Oracle Active Data Guard and Global Data Services in Action!Oracle Active Data Guard and Global Data Services in Action!
Oracle Active Data Guard and Global Data Services in Action!Ludovico Caldara
 
Apache Hive Tutorial
Apache Hive TutorialApache Hive Tutorial
Apache Hive TutorialSandeep Patil
 
Data Virtualization: An Introduction
Data Virtualization: An IntroductionData Virtualization: An Introduction
Data Virtualization: An IntroductionDenodo
 
The Hidden Value of Hadoop Migration
The Hidden Value of Hadoop MigrationThe Hidden Value of Hadoop Migration
The Hidden Value of Hadoop MigrationDatabricks
 
Design Principles for a Modern Data Warehouse
Design Principles for a Modern Data WarehouseDesign Principles for a Modern Data Warehouse
Design Principles for a Modern Data WarehouseRob Winters
 
Big Data and Architectural Patterns on AWS - Pop-up Loft Tel Aviv
Big Data and Architectural Patterns on AWS - Pop-up Loft Tel AvivBig Data and Architectural Patterns on AWS - Pop-up Loft Tel Aviv
Big Data and Architectural Patterns on AWS - Pop-up Loft Tel AvivAmazon Web Services
 
Top-10-Features-In-MySQL-8.0 - Vinoth Kanna RS - Mydbops Team
Top-10-Features-In-MySQL-8.0 - Vinoth Kanna RS - Mydbops TeamTop-10-Features-In-MySQL-8.0 - Vinoth Kanna RS - Mydbops Team
Top-10-Features-In-MySQL-8.0 - Vinoth Kanna RS - Mydbops TeamMydbops
 
Ceph Object Storage Performance Secrets and Ceph Data Lake Solution
Ceph Object Storage Performance Secrets and Ceph Data Lake SolutionCeph Object Storage Performance Secrets and Ceph Data Lake Solution
Ceph Object Storage Performance Secrets and Ceph Data Lake SolutionKaran Singh
 
Oracle Cloud Storage Service & Oracle Database Backup Cloud Service
Oracle Cloud Storage Service & Oracle Database Backup Cloud ServiceOracle Cloud Storage Service & Oracle Database Backup Cloud Service
Oracle Cloud Storage Service & Oracle Database Backup Cloud ServiceJean-Philippe PINTE
 
Oracle Data Integrator 12c - Getting Started
Oracle Data Integrator 12c - Getting StartedOracle Data Integrator 12c - Getting Started
Oracle Data Integrator 12c - Getting StartedMichael Rainey
 
Enterprise guide to building a Data Mesh
Enterprise guide to building a Data MeshEnterprise guide to building a Data Mesh
Enterprise guide to building a Data MeshSion Smith
 
PostgreSQL and Benchmarks
PostgreSQL and BenchmarksPostgreSQL and Benchmarks
PostgreSQL and BenchmarksJignesh Shah
 
Introduction of Big data, NoSQL & Hadoop
Introduction of Big data, NoSQL & HadoopIntroduction of Big data, NoSQL & Hadoop
Introduction of Big data, NoSQL & HadoopSavvycom Savvycom
 

What's hot (20)

data warehouse vs data lake
data warehouse vs data lakedata warehouse vs data lake
data warehouse vs data lake
 
UF4 Gestor d'arxius web
UF4 Gestor d'arxius webUF4 Gestor d'arxius web
UF4 Gestor d'arxius web
 
Apache Sentry for Hadoop security
Apache Sentry for Hadoop securityApache Sentry for Hadoop security
Apache Sentry for Hadoop security
 
Talend Open Studio for Big Data | Talend Open Studio Tutorial | Talend Online...
Talend Open Studio for Big Data | Talend Open Studio Tutorial | Talend Online...Talend Open Studio for Big Data | Talend Open Studio Tutorial | Talend Online...
Talend Open Studio for Big Data | Talend Open Studio Tutorial | Talend Online...
 
Data Visualization Using PowerBI.pdf
Data Visualization Using PowerBI.pdfData Visualization Using PowerBI.pdf
Data Visualization Using PowerBI.pdf
 
Oracle Active Data Guard and Global Data Services in Action!
Oracle Active Data Guard and Global Data Services in Action!Oracle Active Data Guard and Global Data Services in Action!
Oracle Active Data Guard and Global Data Services in Action!
 
Apache Hive Tutorial
Apache Hive TutorialApache Hive Tutorial
Apache Hive Tutorial
 
Data Virtualization: An Introduction
Data Virtualization: An IntroductionData Virtualization: An Introduction
Data Virtualization: An Introduction
 
The Hidden Value of Hadoop Migration
The Hidden Value of Hadoop MigrationThe Hidden Value of Hadoop Migration
The Hidden Value of Hadoop Migration
 
Design Principles for a Modern Data Warehouse
Design Principles for a Modern Data WarehouseDesign Principles for a Modern Data Warehouse
Design Principles for a Modern Data Warehouse
 
Big Data and Architectural Patterns on AWS - Pop-up Loft Tel Aviv
Big Data and Architectural Patterns on AWS - Pop-up Loft Tel AvivBig Data and Architectural Patterns on AWS - Pop-up Loft Tel Aviv
Big Data and Architectural Patterns on AWS - Pop-up Loft Tel Aviv
 
Top-10-Features-In-MySQL-8.0 - Vinoth Kanna RS - Mydbops Team
Top-10-Features-In-MySQL-8.0 - Vinoth Kanna RS - Mydbops TeamTop-10-Features-In-MySQL-8.0 - Vinoth Kanna RS - Mydbops Team
Top-10-Features-In-MySQL-8.0 - Vinoth Kanna RS - Mydbops Team
 
Technical tips for secure Apache Hadoop cluster #ApacheConAsia #ApacheCon
Technical tips for secure Apache Hadoop cluster #ApacheConAsia #ApacheConTechnical tips for secure Apache Hadoop cluster #ApacheConAsia #ApacheCon
Technical tips for secure Apache Hadoop cluster #ApacheConAsia #ApacheCon
 
Ceph Object Storage Performance Secrets and Ceph Data Lake Solution
Ceph Object Storage Performance Secrets and Ceph Data Lake SolutionCeph Object Storage Performance Secrets and Ceph Data Lake Solution
Ceph Object Storage Performance Secrets and Ceph Data Lake Solution
 
Oracle Cloud Storage Service & Oracle Database Backup Cloud Service
Oracle Cloud Storage Service & Oracle Database Backup Cloud ServiceOracle Cloud Storage Service & Oracle Database Backup Cloud Service
Oracle Cloud Storage Service & Oracle Database Backup Cloud Service
 
Hadoop seminar
Hadoop seminarHadoop seminar
Hadoop seminar
 
Oracle Data Integrator 12c - Getting Started
Oracle Data Integrator 12c - Getting StartedOracle Data Integrator 12c - Getting Started
Oracle Data Integrator 12c - Getting Started
 
Enterprise guide to building a Data Mesh
Enterprise guide to building a Data MeshEnterprise guide to building a Data Mesh
Enterprise guide to building a Data Mesh
 
PostgreSQL and Benchmarks
PostgreSQL and BenchmarksPostgreSQL and Benchmarks
PostgreSQL and Benchmarks
 
Introduction of Big data, NoSQL & Hadoop
Introduction of Big data, NoSQL & HadoopIntroduction of Big data, NoSQL & Hadoop
Introduction of Big data, NoSQL & Hadoop
 

Viewers also liked

eBay Pulsar: Real-time analytics platform
eBay Pulsar: Real-time analytics platformeBay Pulsar: Real-time analytics platform
eBay Pulsar: Real-time analytics platformKyoungMo Yang
 
QCon San Francisco 2011: Agility in eBay
QCon San Francisco 2011: Agility in eBayQCon San Francisco 2011: Agility in eBay
QCon San Francisco 2011: Agility in eBayDeepak Nadig
 
Optimizing eBay - Improving customer experience at the world’s online marketp...
Optimizing eBay - Improving customer experience at the world’s online marketp...Optimizing eBay - Improving customer experience at the world’s online marketp...
Optimizing eBay - Improving customer experience at the world’s online marketp...Deepak Nadig
 
Di itek big data task force – skab datadrevet forretningsudvikling
Di itek big data task force – skab datadrevet forretningsudviklingDi itek big data task force – skab datadrevet forretningsudvikling
Di itek big data task force – skab datadrevet forretningsudviklingChristian Graversen
 
The Changing Role of Healthcare Data Analysts
The Changing Role of Healthcare Data AnalystsThe Changing Role of Healthcare Data Analysts
The Changing Role of Healthcare Data AnalystsHealth Catalyst
 
Big Data in Retail - Examples in Action
Big Data in Retail - Examples in ActionBig Data in Retail - Examples in Action
Big Data in Retail - Examples in ActionDavid Pittman
 
4 Best Practices for Analyzing Healthcare Data
4 Best Practices for Analyzing Healthcare Data4 Best Practices for Analyzing Healthcare Data
4 Best Practices for Analyzing Healthcare DataHealth Catalyst
 
Amazon.com: the Hidden Empire - Update 2013
Amazon.com: the Hidden Empire - Update 2013Amazon.com: the Hidden Empire - Update 2013
Amazon.com: the Hidden Empire - Update 2013Fabernovel
 

Viewers also liked (9)

eBay Pulsar: Real-time analytics platform
eBay Pulsar: Real-time analytics platformeBay Pulsar: Real-time analytics platform
eBay Pulsar: Real-time analytics platform
 
QCon San Francisco 2011: Agility in eBay
QCon San Francisco 2011: Agility in eBayQCon San Francisco 2011: Agility in eBay
QCon San Francisco 2011: Agility in eBay
 
Big Data Analytics at Vestas Wind Systems
Big Data Analytics at Vestas Wind SystemsBig Data Analytics at Vestas Wind Systems
Big Data Analytics at Vestas Wind Systems
 
Optimizing eBay - Improving customer experience at the world’s online marketp...
Optimizing eBay - Improving customer experience at the world’s online marketp...Optimizing eBay - Improving customer experience at the world’s online marketp...
Optimizing eBay - Improving customer experience at the world’s online marketp...
 
Di itek big data task force – skab datadrevet forretningsudvikling
Di itek big data task force – skab datadrevet forretningsudviklingDi itek big data task force – skab datadrevet forretningsudvikling
Di itek big data task force – skab datadrevet forretningsudvikling
 
The Changing Role of Healthcare Data Analysts
The Changing Role of Healthcare Data AnalystsThe Changing Role of Healthcare Data Analysts
The Changing Role of Healthcare Data Analysts
 
Big Data in Retail - Examples in Action
Big Data in Retail - Examples in ActionBig Data in Retail - Examples in Action
Big Data in Retail - Examples in Action
 
4 Best Practices for Analyzing Healthcare Data
4 Best Practices for Analyzing Healthcare Data4 Best Practices for Analyzing Healthcare Data
4 Best Practices for Analyzing Healthcare Data
 
Amazon.com: the Hidden Empire - Update 2013
Amazon.com: the Hidden Empire - Update 2013Amazon.com: the Hidden Empire - Update 2013
Amazon.com: the Hidden Empire - Update 2013
 

Similar to Extreme Analytics Evolution at eBay

Six Things To Make Analytics Work - Exponea
Six Things To Make Analytics Work - ExponeaSix Things To Make Analytics Work - Exponea
Six Things To Make Analytics Work - ExponeaNexteria
 
6 Things to Make Analytics Work
6 Things to Make Analytics Work6 Things to Make Analytics Work
6 Things to Make Analytics WorkExponea
 
Smarter Analytics: Supporting the Enterprise with Automation
Smarter Analytics: Supporting the Enterprise with AutomationSmarter Analytics: Supporting the Enterprise with Automation
Smarter Analytics: Supporting the Enterprise with AutomationInside Analysis
 
Deep.bi - Real-time, Deep Data Analytics Platform For Ecommerce
Deep.bi - Real-time, Deep Data Analytics Platform For EcommerceDeep.bi - Real-time, Deep Data Analytics Platform For Ecommerce
Deep.bi - Real-time, Deep Data Analytics Platform For EcommerceDeep.BI
 
In-Memory Computing Webcast. Market Predictions 2017
In-Memory Computing Webcast. Market Predictions 2017In-Memory Computing Webcast. Market Predictions 2017
In-Memory Computing Webcast. Market Predictions 2017SingleStore
 
How to Capitalize on Big Data with Oracle Analytics Cloud
How to Capitalize on Big Data with Oracle Analytics CloudHow to Capitalize on Big Data with Oracle Analytics Cloud
How to Capitalize on Big Data with Oracle Analytics CloudPerficient, Inc.
 
SAP Forum Ankara 2017 - "Verinin Merkezine Seyahat"
SAP Forum Ankara 2017 - "Verinin Merkezine Seyahat"SAP Forum Ankara 2017 - "Verinin Merkezine Seyahat"
SAP Forum Ankara 2017 - "Verinin Merkezine Seyahat"MDS ap
 
Retail Analytics and BI with Looker, BigQuery, GCP & Leigha Jarett
Retail Analytics and BI with Looker, BigQuery, GCP & Leigha JarettRetail Analytics and BI with Looker, BigQuery, GCP & Leigha Jarett
Retail Analytics and BI with Looker, BigQuery, GCP & Leigha JarettDaniel Zivkovic
 
Data democratisation: Liberate your data to unlock its true value
Data democratisation: Liberate your data to unlock its true valueData democratisation: Liberate your data to unlock its true value
Data democratisation: Liberate your data to unlock its true valueAT Internet
 
DDMA 14 mei 2009 Business Intelligence case Ahold
DDMA 14 mei 2009 Business Intelligence case Ahold DDMA 14 mei 2009 Business Intelligence case Ahold
DDMA 14 mei 2009 Business Intelligence case Ahold DDMA
 
Turning Big Data into Better Business Outcomes
Turning Big Data into Better Business OutcomesTurning Big Data into Better Business Outcomes
Turning Big Data into Better Business OutcomesCisco Canada
 
Penser Analytics - Company Profile
Penser Analytics - Company ProfilePenser Analytics - Company Profile
Penser Analytics - Company ProfileSwarochish Chekuri
 
Not Tooling Around: How The Home Depot Uses Machine Learning for Vendor Accou...
Not Tooling Around: How The Home Depot Uses Machine Learning for Vendor Accou...Not Tooling Around: How The Home Depot Uses Machine Learning for Vendor Accou...
Not Tooling Around: How The Home Depot Uses Machine Learning for Vendor Accou...National Retail Federation
 
AWS re:Invent 2016: Leveraging Amazon Machine Learning, Amazon Redshift, and ...
AWS re:Invent 2016: Leveraging Amazon Machine Learning, Amazon Redshift, and ...AWS re:Invent 2016: Leveraging Amazon Machine Learning, Amazon Redshift, and ...
AWS re:Invent 2016: Leveraging Amazon Machine Learning, Amazon Redshift, and ...Amazon Web Services
 
Intelligence Data Day 2020
Intelligence Data Day 2020Intelligence Data Day 2020
Intelligence Data Day 2020Patrick Deglon
 
AWS Webcast - Sales Productivity Solutions with MicroStrategy and Redshift
AWS Webcast - Sales Productivity Solutions with MicroStrategy and RedshiftAWS Webcast - Sales Productivity Solutions with MicroStrategy and Redshift
AWS Webcast - Sales Productivity Solutions with MicroStrategy and RedshiftAmazon Web Services
 

Similar to Extreme Analytics Evolution at eBay (20)

Six Things To Make Analytics Work - Exponea
Six Things To Make Analytics Work - ExponeaSix Things To Make Analytics Work - Exponea
Six Things To Make Analytics Work - Exponea
 
6 Things to Make Analytics Work
6 Things to Make Analytics Work6 Things to Make Analytics Work
6 Things to Make Analytics Work
 
Smarter Analytics: Supporting the Enterprise with Automation
Smarter Analytics: Supporting the Enterprise with AutomationSmarter Analytics: Supporting the Enterprise with Automation
Smarter Analytics: Supporting the Enterprise with Automation
 
Big Data en Retail
Big Data en RetailBig Data en Retail
Big Data en Retail
 
Deep.bi - Real-time, Deep Data Analytics Platform For Ecommerce
Deep.bi - Real-time, Deep Data Analytics Platform For EcommerceDeep.bi - Real-time, Deep Data Analytics Platform For Ecommerce
Deep.bi - Real-time, Deep Data Analytics Platform For Ecommerce
 
In-Memory Computing Webcast. Market Predictions 2017
In-Memory Computing Webcast. Market Predictions 2017In-Memory Computing Webcast. Market Predictions 2017
In-Memory Computing Webcast. Market Predictions 2017
 
How to Capitalize on Big Data with Oracle Analytics Cloud
How to Capitalize on Big Data with Oracle Analytics CloudHow to Capitalize on Big Data with Oracle Analytics Cloud
How to Capitalize on Big Data with Oracle Analytics Cloud
 
SAP Forum Ankara 2017 - "Verinin Merkezine Seyahat"
SAP Forum Ankara 2017 - "Verinin Merkezine Seyahat"SAP Forum Ankara 2017 - "Verinin Merkezine Seyahat"
SAP Forum Ankara 2017 - "Verinin Merkezine Seyahat"
 
Big Data and E-Commerce
Big Data and E-CommerceBig Data and E-Commerce
Big Data and E-Commerce
 
Retail Analytics and BI with Looker, BigQuery, GCP & Leigha Jarett
Retail Analytics and BI with Looker, BigQuery, GCP & Leigha JarettRetail Analytics and BI with Looker, BigQuery, GCP & Leigha Jarett
Retail Analytics and BI with Looker, BigQuery, GCP & Leigha Jarett
 
Data democratisation: Liberate your data to unlock its true value
Data democratisation: Liberate your data to unlock its true valueData democratisation: Liberate your data to unlock its true value
Data democratisation: Liberate your data to unlock its true value
 
DDMA 14 mei 2009 Business Intelligence case Ahold
DDMA 14 mei 2009 Business Intelligence case Ahold DDMA 14 mei 2009 Business Intelligence case Ahold
DDMA 14 mei 2009 Business Intelligence case Ahold
 
Turning Big Data into Better Business Outcomes
Turning Big Data into Better Business OutcomesTurning Big Data into Better Business Outcomes
Turning Big Data into Better Business Outcomes
 
Penser Analytics - Company Profile
Penser Analytics - Company ProfilePenser Analytics - Company Profile
Penser Analytics - Company Profile
 
Not Tooling Around: How The Home Depot Uses Machine Learning for Vendor Accou...
Not Tooling Around: How The Home Depot Uses Machine Learning for Vendor Accou...Not Tooling Around: How The Home Depot Uses Machine Learning for Vendor Accou...
Not Tooling Around: How The Home Depot Uses Machine Learning for Vendor Accou...
 
Taming data lake - scalable metrics model
Taming data lake - scalable metrics modelTaming data lake - scalable metrics model
Taming data lake - scalable metrics model
 
A6 big data_in_the_cloud
A6 big data_in_the_cloudA6 big data_in_the_cloud
A6 big data_in_the_cloud
 
AWS re:Invent 2016: Leveraging Amazon Machine Learning, Amazon Redshift, and ...
AWS re:Invent 2016: Leveraging Amazon Machine Learning, Amazon Redshift, and ...AWS re:Invent 2016: Leveraging Amazon Machine Learning, Amazon Redshift, and ...
AWS re:Invent 2016: Leveraging Amazon Machine Learning, Amazon Redshift, and ...
 
Intelligence Data Day 2020
Intelligence Data Day 2020Intelligence Data Day 2020
Intelligence Data Day 2020
 
AWS Webcast - Sales Productivity Solutions with MicroStrategy and Redshift
AWS Webcast - Sales Productivity Solutions with MicroStrategy and RedshiftAWS Webcast - Sales Productivity Solutions with MicroStrategy and Redshift
AWS Webcast - Sales Productivity Solutions with MicroStrategy and Redshift
 

More from DataWorks Summit/Hadoop Summit

Unleashing the Power of Apache Atlas with Apache Ranger
Unleashing the Power of Apache Atlas with Apache RangerUnleashing the Power of Apache Atlas with Apache Ranger
Unleashing the Power of Apache Atlas with Apache RangerDataWorks Summit/Hadoop Summit
 
Enabling Digital Diagnostics with a Data Science Platform
Enabling Digital Diagnostics with a Data Science PlatformEnabling Digital Diagnostics with a Data Science Platform
Enabling Digital Diagnostics with a Data Science PlatformDataWorks Summit/Hadoop Summit
 
Double Your Hadoop Performance with Hortonworks SmartSense
Double Your Hadoop Performance with Hortonworks SmartSenseDouble Your Hadoop Performance with Hortonworks SmartSense
Double Your Hadoop Performance with Hortonworks SmartSenseDataWorks Summit/Hadoop Summit
 
Building a Large-Scale, Adaptive Recommendation Engine with Apache Flink and ...
Building a Large-Scale, Adaptive Recommendation Engine with Apache Flink and ...Building a Large-Scale, Adaptive Recommendation Engine with Apache Flink and ...
Building a Large-Scale, Adaptive Recommendation Engine with Apache Flink and ...DataWorks Summit/Hadoop Summit
 
Real-Time Anomaly Detection using LSTM Auto-Encoders with Deep Learning4J on ...
Real-Time Anomaly Detection using LSTM Auto-Encoders with Deep Learning4J on ...Real-Time Anomaly Detection using LSTM Auto-Encoders with Deep Learning4J on ...
Real-Time Anomaly Detection using LSTM Auto-Encoders with Deep Learning4J on ...DataWorks Summit/Hadoop Summit
 
Mool - Automated Log Analysis using Data Science and ML
Mool - Automated Log Analysis using Data Science and MLMool - Automated Log Analysis using Data Science and ML
Mool - Automated Log Analysis using Data Science and MLDataWorks Summit/Hadoop Summit
 
The Challenge of Driving Business Value from the Analytics of Things (AOT)
The Challenge of Driving Business Value from the Analytics of Things (AOT)The Challenge of Driving Business Value from the Analytics of Things (AOT)
The Challenge of Driving Business Value from the Analytics of Things (AOT)DataWorks Summit/Hadoop Summit
 
From Regulatory Process Verification to Predictive Maintenance and Beyond wit...
From Regulatory Process Verification to Predictive Maintenance and Beyond wit...From Regulatory Process Verification to Predictive Maintenance and Beyond wit...
From Regulatory Process Verification to Predictive Maintenance and Beyond wit...DataWorks Summit/Hadoop Summit
 

More from DataWorks Summit/Hadoop Summit (20)

Running Apache Spark & Apache Zeppelin in Production
Running Apache Spark & Apache Zeppelin in ProductionRunning Apache Spark & Apache Zeppelin in Production
Running Apache Spark & Apache Zeppelin in Production
 
State of Security: Apache Spark & Apache Zeppelin
State of Security: Apache Spark & Apache ZeppelinState of Security: Apache Spark & Apache Zeppelin
State of Security: Apache Spark & Apache Zeppelin
 
Unleashing the Power of Apache Atlas with Apache Ranger
Unleashing the Power of Apache Atlas with Apache RangerUnleashing the Power of Apache Atlas with Apache Ranger
Unleashing the Power of Apache Atlas with Apache Ranger
 
Enabling Digital Diagnostics with a Data Science Platform
Enabling Digital Diagnostics with a Data Science PlatformEnabling Digital Diagnostics with a Data Science Platform
Enabling Digital Diagnostics with a Data Science Platform
 
Revolutionize Text Mining with Spark and Zeppelin
Revolutionize Text Mining with Spark and ZeppelinRevolutionize Text Mining with Spark and Zeppelin
Revolutionize Text Mining with Spark and Zeppelin
 
Double Your Hadoop Performance with Hortonworks SmartSense
Double Your Hadoop Performance with Hortonworks SmartSenseDouble Your Hadoop Performance with Hortonworks SmartSense
Double Your Hadoop Performance with Hortonworks SmartSense
 
Hadoop Crash Course
Hadoop Crash CourseHadoop Crash Course
Hadoop Crash Course
 
Data Science Crash Course
Data Science Crash CourseData Science Crash Course
Data Science Crash Course
 
Apache Spark Crash Course
Apache Spark Crash CourseApache Spark Crash Course
Apache Spark Crash Course
 
Dataflow with Apache NiFi
Dataflow with Apache NiFiDataflow with Apache NiFi
Dataflow with Apache NiFi
 
Schema Registry - Set you Data Free
Schema Registry - Set you Data FreeSchema Registry - Set you Data Free
Schema Registry - Set you Data Free
 
Building a Large-Scale, Adaptive Recommendation Engine with Apache Flink and ...
Building a Large-Scale, Adaptive Recommendation Engine with Apache Flink and ...Building a Large-Scale, Adaptive Recommendation Engine with Apache Flink and ...
Building a Large-Scale, Adaptive Recommendation Engine with Apache Flink and ...
 
Real-Time Anomaly Detection using LSTM Auto-Encoders with Deep Learning4J on ...
Real-Time Anomaly Detection using LSTM Auto-Encoders with Deep Learning4J on ...Real-Time Anomaly Detection using LSTM Auto-Encoders with Deep Learning4J on ...
Real-Time Anomaly Detection using LSTM Auto-Encoders with Deep Learning4J on ...
 
Mool - Automated Log Analysis using Data Science and ML
Mool - Automated Log Analysis using Data Science and MLMool - Automated Log Analysis using Data Science and ML
Mool - Automated Log Analysis using Data Science and ML
 
How Hadoop Makes the Natixis Pack More Efficient
How Hadoop Makes the Natixis Pack More Efficient How Hadoop Makes the Natixis Pack More Efficient
How Hadoop Makes the Natixis Pack More Efficient
 
HBase in Practice
HBase in Practice HBase in Practice
HBase in Practice
 
The Challenge of Driving Business Value from the Analytics of Things (AOT)
The Challenge of Driving Business Value from the Analytics of Things (AOT)The Challenge of Driving Business Value from the Analytics of Things (AOT)
The Challenge of Driving Business Value from the Analytics of Things (AOT)
 
Breaking the 1 Million OPS/SEC Barrier in HOPS Hadoop
Breaking the 1 Million OPS/SEC Barrier in HOPS HadoopBreaking the 1 Million OPS/SEC Barrier in HOPS Hadoop
Breaking the 1 Million OPS/SEC Barrier in HOPS Hadoop
 
From Regulatory Process Verification to Predictive Maintenance and Beyond wit...
From Regulatory Process Verification to Predictive Maintenance and Beyond wit...From Regulatory Process Verification to Predictive Maintenance and Beyond wit...
From Regulatory Process Verification to Predictive Maintenance and Beyond wit...
 
Backup and Disaster Recovery in Hadoop
Backup and Disaster Recovery in Hadoop Backup and Disaster Recovery in Hadoop
Backup and Disaster Recovery in Hadoop
 

Recently uploaded

Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piececharlottematthew16
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024The Digital Insurer
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 

Recently uploaded (20)

Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 

Extreme Analytics Evolution at eBay

  • 1. Extreme Analytics @ eBay Evolution of GovernedSelf Service Analytics
  • 2. Agenda •eBay Today •Big Data @ eBay •The HOW? •Q&A PRESENTATION TITLE GOES HERE 2
  • 4. Most Powerful Selling Platform For business sellers: the potential to drive profitable sales and build a brand For consumer sellers: an easy way to declutter, sell and make money A partnership not a competition Best Choice Providing the greatest selection of inventory for our buyers From new, everyday items to rare and unique goods And incredible deals only found on eBay Most Relevance A shopping experience that is simple, data-driven and personalized Enabling buyers to easily find, compare and purchase items they need and want Highlighting the unique value that eBay brings OUR STRATEGY
  • 5. EBAY INC AT A GLANCE $2.1B Revenue in Q1 2016 $20.5B GMV in Q1 2016 162M Global Active Buyers 57% International revenue Q1 2016 data $9B Mobile Volume 314M App downloads
  • 6. EBAY MARKETPLACE AT A GLANCE $19.6B GMV in Q1 2016 9.5M New listings added via mobile per week 300M Searches each day 63% Transactions that ship for free (in US, UK, DE) 79% Items sold as new Q1 2016 data ~900M Live listings One of the world’s largest and most vibrant marketplaces
  • 7. VELOCITY STATS US 3 car parts or accessories are sold every A smartphone is sold every A dress is sold every 1 sec 4 sec 6 sec UK A necklace is sold every A make-up product is sold every A Lego product is sold every 10 sec 3 sec 19 sec GERMANY A truck or car is sold every A pair of women’s jeans is sold every A video game is sold every 5 min 4 sec 11 sec AUSTRALIA A pair of men’s sunglasses is sold every A home décor item is sold every A car or truck part is sold every 1 min 12 sec 4 sec
  • 8. MOBILE VELOCITY STATS US A woman’s handbag is sold every A car or truck is sold every An action figure is sold every 10 sec 5 min 10 sec UK A tablet is sold every A cookware item is sold every A car is sold every 1 min 6 sec 2 min GERMANY A pair of women’s shoes is sold every A watch is sold every A tire or car part is sold every 20 sec 48 sec 35 sec AUSTRALIA A piece of jewelry is sold every A baby clothing item is sold every A motorcycle part is sold every 12 sec 46 sec 51 sec
  • 9. THREE KEY TRENDS ARE REDEFINING COMMERCE Smart CommerceSeamless Commerce True Global Commerce
  • 11. TRUE GLOBAL COMMERCE of eBay’s business is international57% of commercial sellers engage in exporting* 95% languages 8 *Sellers with $10,000 or more/year in sales
  • 12. SMART COMMERCE Identify an interesting set of candidate items, trends, events, etc. Personalize the results Inspiration at scale!
  • 17. Big Data @ eBay
  • 18. BIG Data VVC 20 >50 TB/day new data >100 PB/day >100 Trillion pairs of information Millionsof queries/day >7500 business users & analysts >50k chains of logic 24x7x365 99.98+%Availability turning over a TB every second Active/Active Near-Real-time >100kdata elements Always online Processed >1.5 x 1012 new records/day
  • 19. 21 TECHNOLOGY EDW Analytics Application Analysts and Data Scientists Management Integrators Business Owners Application Servers Data Processing Clusters Aggregation & Summarization Visualization & Reporting ClicktoInsights Kylin OVER Billings EVENTS FROM 162M EBAY BUYERS CAPTURED, TRANSFORMED, SYNTHESIZED TO PROVIDE ACTIONABLE INSIGHTS
  • 20. 22 eBay has one of the largest most active data platforms in the world. eBay has one of the largest most active data platforms in the world with a diverse set of users.
  • 21. Product Families 23 CURATED DATA PLATFORMS SQL/RM/SAS/R TXN IN V BEH CUS DATA EXPLORATION & COLLABORATI ON: ALATION ADVANCED DATA TOOLS on iHub: NOUS,GRO,DNA ,M1, SPA, SD, CC, Adv … ENTERPRISE DATA SERVICE/Feed: RETAIL SIGNAL,IDENTI TY, SS, SH… REALTIME DATA STREAM: TORA
  • 24. Semi-Structured SQL++Structured SQL Low End Enterprise-class System Contextual-complex analytics, deep, seasonal, consumable datasets Production data warehousing, large concurrent user base Discover & Explore Analyze & Report Enterprise-class System Unstructured JAVA / C Structure the unstructured, detect patterns Commodity Hardware System Singularity HadoopTeradata Enterprise Data Warehouse DISCOVER & EXPLOREANALYZE & REPORT 26
  • 25. page 27 Biggest complexity drivers are  Maintaining separate databases  weekly/daily/hourly data transfers  Data inconsistencies  Data duplication  Increased complexity  Loss of centralized viz & control DMs A data mart cannot be cheap enough to justify its existence
  • 26. PRESENTATION TITLE GOES HERE 28 ...the wrong way Data Marts in the Cloud Customer Customer Customer Customer Customer Product Customer Product Customer Product Customer Product Trx Customer Product Trx Customer Product Trx Customer Product Trx
  • 27. PRESENTATION TITLE GOES HERE 29 Virtual Data Marts Customer Product Transactio n Behavior Virtual DataMart Virtual DataMart Virtual DataMart Virtual DataMart Virtual DataMart Virtual DataMartVirtual DataMart Virtual DataMart Virtual DataMart
  • 29. Semi-Structured SQL++Structured SQL Low End Enterprise-class System Contextual-complex analytics, deep, seasonal, consumable datasets Production data warehousing, large concurrent user base Discover & Explore Analyze & Report Enterprise-class System Unstructured JAVA / C Structure the unstructured, detect patterns Commodity Hardware System Singularity HadoopTeradata Deep-Data Platforms DISCOVER & EXPLOREANALYZE & REPORT 31 Behavioral Data Centric
  • 31. 33 Collaborative Analytics Compose Write and discover queries with ease; understand and reuse code easily; drives time and savings. Catalog & Govern Document and discover data and concepts; structured and crowd sourced tagging of content in a stewarded environment. Answers Fast, trusted answers for everyone; search for analytic products (metrics, reports, KPIs). Forensics Insightful IT and operational data to expose and eliminateredundancies Experts / Stewards Govern Simple Data Management Analyst Compose Better,Faster Queries Business Users Answers Google for your Data IT Forensics Intelligence about your data
  • 32. Wiki + metadata repository Alation SQL Assistant Metadata repository + + Storytelling Mixing textual analysis with graphs WHAT IT’S LIKECOLLABORATION TOOL 2013 2009 2014 AnswerHub Discussion forum moderated by support DataHub + for data2010 COLLABORATION JOURNEY 2014
  • 33. 35 The App Platform Analytics Application Platform4
  • 34. ENTERPRISE DATA PLATFORM 36 Data Warehouse Data Streams Batch Humans Sets of data Streams Systems Sets of data Data Services Services Applications Specific calls Populated Used by How Enterprise Populated Used by How
  • 35. DQRecon Data Processing Ecosystem 37 Curated Streams Applications Data Services ApplicationAnalytics Data Scientists Analysts BU/PD Leaders Site DBs Real-Time Data Sources External Data Sources ETL Enterprise Data Warehouse Deep Data Analytics Platform Hadoop Engineers Stream Processing Caching DOE DQFirewall Buyers/Sellers
  • 38. Automated Signal Detection 40 Prediction – anomaly signal detection Massively scalable and automated signal detection and prediction  Phase 1: Signal detection  Phase 2: Root Cause analysis
  • 39. 41 ANALYTICS IN EBAY Measure Everything Embedded in our daily life Bottom-up & Top-down Think and Live Analytics Always But know when to avoid Analysis Paralysis! Analytics DNA
  • 40.
  • 43. 45 The Diverse User Community
  • 44. page 46 Diverse User Community Data Scientists Financial Planning & Analytics Site Analysts Business Analysts Consumers One-off Analysis Descriptive, Predictive & Prescriptive Modeling Experimentation & Mining Standard Reports Dashboards Hadoop R/SAS/SQL on Teradata Excel Tableau MicroStrategy , Diverse Needs& Diverse Tools
  • 45. 47 The Analytics Environment at eBay  Direct SQL access  User datasets  MicroStrategy  Tableau  Web based App  1000+ files  10,000+ tables  5000+ reports  10,000+  100+ named apps  Tough to find the right metrics and reports  Hard to build new metrics and reports  Impossible to know which metrics and reports are correct vs old
  • 46. 48
  • 47.
  • 48. “We can’t solve problems by using the same kind of thinking we used when we created them.” • - Albert Einstein
  • 50. Self-service Strategy changes everything… 52 The data user experience is…. Incoherent Isolated Disjointed Uncertain Consolidates all knowledge about data for “Just-in- Time” use Unifies a consistent set of Data Products on the hub Makes it easy to find and trace the path from Business Insights and summaries to the underlying SQL, metrics and metadata Delivers transparency and build trust with Data Governance and Stewardship
  • 51. Comprehensive & Documented -- Self-directed Experience Insights Hub ONE portal , ONE framework, ONE analytics app Store Targeted & Simplified -- Self-service Experiences SQL Writer Search Collaberation Knowledge Management Subject Matter Expert (SME) Directory and Subject Domain pages Business Metrics Glossary Certified data assets, endorsements, descriptions. MoreDetailedMoreSummary TechnicalAnalysisBusinessInsight Self Service Strategy, Governed Exploration for Analysis and Business Insight
  • 52. DATA GOVERNANCE 54 Business Glossary – Managed articles about logic and language. Knowledge: What should it be? Data Asset Certification Trust: Is this the right view? Who says so? As of when? Well Managed – Quality checks, release notes, load updates Trust: Is it ok to use RIGHT NOW?
  • 53. DATA GOVERNANCE 55 Business GlossaryData Asset Certification Well Managed What we do: Data knowledge management and data stewardship Goals: • Demystify our data warehouse of tens of thousands of datasets • Increase trust in data by increasing transparency • Save analysts’ time and reduce their opportunities for error
  • 58. 60 A COMPLETE VIEW OF OUR CUSTOMERS Behavior Demographics & Interests AttitudeValue to eBay
  • 59. 61  DATA SCIENCE Data Data Science Business ImpactData Data Science Data Science Data Science Business Impact Insights Customer Insights used to make decisions and set strategy Predictive Models Models that predict outcomes to achieve optimal targeting Segments New ways to assess value and attitudes of our customers DNA
  • 60. 62 CONVERSION MODEL User Category Probability 111602**** 1564** 10.1% 111602**** 1562** 6.54% 111602**** 1569** 5.67% 111602**** 3564** 4.33% 111602**** 1397** 1.19% 111602**** 3877** 1.11% 111602**** 9282** 1.01% 111602**** 3607** 0.91% 111602**** 1040** 0.81% 111602**** 1564** 0.76% 111602**** 1040** 0.66% 111602**** 4250** 0.01% 111602**** 5235** 0.01% • Cart data • Watch data • Mobile watch • Search pages • Browse data • Purchase history Models
  • 61. Thanks! • ALEX LIANG • hliang@ebay.com • http://www.linkedin.com/in/alexlianghu

Editor's Notes

  1. eBay is the world’s most vibrant marketplace where the world goes to shop, sell, and give. Whether you are buying something new or used, luxurious or modest, rare or commonplace, trendy or one-of-a-kind – if it exists in the world, it’s probably for sale on eBay. Our mission is to be the world’s favorite destination for discovering great value and unique selection. eBay connects millions of buyers and sellers around the globe, empowering people and creating opportunity. Our vision for commerce is one that is enabled by people, powered by technology, and open to everyone. We give sellers the platform, solutions, and support they need to grow their businesses and thrive, but we never compete with them. We measure our success by our customers' success.
  2. Our vision for commerce is one that is enabled by people, powered by technology, and open to everyone. Our strategy is to drive the best choice, have the most relevance, and deliver the most powerful selling platform.
  3. eBay Inc. is a global commerce leader including our Marketplace, StubHub and Classifieds platforms. Collectively, we connect millions of buyers and sellers around the world. The technologies and services that power our platforms are designed to enable sellers worldwide to organize and offer their inventory for sale and buyers to find and buy it virtually anytime and anywhere. eBay Inc. employs approximately 11,600 people globally (as of Dec. 31, 2015)
  4. Today’s eBay isn’t what it used to be - many people think of us only as an auction site, but that perception hasn’t kept up with reality. The reality is that 79% of what is sold on eBay is new merchandise, available for purchase immediately. We have more than 900 million items listed for sale and 162 million active buyers, effectively making us the world’s biggest shopping destination.
  5. From our vantage point, we believe the impact of these three trends will transform the commerce landscape.
  6. Seamless commerce is much more than a mobile experience. To engage with consumers in the “new retail”, brands must take a multi-screen approach. We must stop thinking of experiences across individual devices - and start thinking of holistic shopping experiences, where consumers can seamlessly engage with your brand across multiple screens, literally from wherever they are. Brands also recognize that online and offline are not mutually exclusive. Consumers want the best of both worlds, shopping online and across multiple devices, and offline in-store. The continued proliferation of mobile will deliver a richer consumer experience that help shoppers navigate seamlessly between the digital and physical worlds. At eBay, we’re finding that the multi-screen consumer is more highly engaged. They visit sites more frequently, and they buy significantly more when online. Multiscreen is device agnostic, which means every screen is shoppable. Because we can’t predict what the next great device will be, we must focus on providing customers with the best possible experience - regardless of the device – so consumers can shop when they want, for what they want. At eBay, we are innovating across devices, creating seamless buying and selling experiences for iOS, Android, desktop, and even wearables to make sure our customers can engage at every touch point. We are also allowing people to shop the way they want: online, offline and mobile are coming together in services like Click & Collect offered by eBay with Argos in the UK, which allows buyers to pick-up their purchase in-store if they choose.
  7. Consumers are increasingly able to shop the world. Their market, or where they shop, is no longer defined by borders. They go online to explore the world – interests/likes come to life in different places. Because the brands and products they love can be difficult to find in their markets, they’re willing to shop foreign websites. They tolerate friction in buying in order to access the selection that a global marketplace has to offer. At eBay, 57% of our business is international and 95% of our commercial sellers engage in exporting. The eBay app is available in 190 countries, we host 25 localized websites across the globe and are available in 8 languages. We are offering innovative approaches to eliminate friction points in global shopping, such as programs like the Global Shipping Program – which enables sellers to more easily ship to 64 countries around the world.
  8. Consumers are overwhelmed by the number of choices they face day-to-day. Smart brands are using data to surface inventory to their consumers in ways that feel relevant, helpful and familiar. At eBay, we are curating and simplifying content in ways that align to users’ stated (and sometimes unstated) preferences, serving up content in new, simplified interfaces that surprise and delight them. We are also experimenting with machine learning to help bridge the gap between intent and understanding.
  9. 大数据是一个数量级大于你习惯的数据, Grasshopper
  10. This one take more time. Big Data – size, complexity, velocity. Intersection of #products with customers and activity cause huge volumes All of this needs to be loaded and maintained – daily, hourly, 15-minutes, near-real-time These users generate millions of requests per day, Add HA. Make a big deal about 24x7, no place for batch or query windows. We are a global company with analysts and users all over the world. We load and process and query 24x7. If we take a backup, it has to happen with everything else. 100 PB/day, processed by our systems, going over data over and over against to find new patterns, etc. Vivaldi touches a TB/second itself. That’s 86 PB/day on one system. Its easy to build a large PB store for 10s even 100s of PBs. But, accessing that data and use it in a meaningful way is the challenge. We design our systems for extremely high usage.
  11. Our technology is proprietary – but leverages a lot of Open Source Stack Most of the Data Processing heavy lifting happens on Hadoop Clusters. Majority of it – MR jobs. We also leverage Scala/Scoobi and have a custom built framework (Cascading Based) through a host of libraries, all internally customized. Our approach to reporting – is very ‘democratic”. Since a large part of the analytics are for internal consumption, we have to deal with a wide variety of data customers with different degrees of data knowledge and data handling maturity. A strategy that has worked very well for us is to provide a top line Analytical Tool ( combination of reports and dashboards), depending on the use case and then, provide curated data sets – to allow for interactive querying and analysis.
  12. What exactly do my teams do? Data engineering and technology development at scale You can’t see/touch/feel most of what we do. We build and manage platforms, used by over 10K distinct users in the last year. . Fully integrated DP with history back to beginning.   RJ saying it's our most powerful weapon Emphasize engineering org and expertise. - scale and complexity,    Search science and best match       Include detail slides in deck, Advertising buildout example.- Ilari     And/or Buildout what is required to make trending campaign work in nous and customer Dna     Real time PLA Data management slide    Similar to commerce OS for site Dev we do for data   Use product slide to answer the question -- but what exactly do you do -- emohaisze most resources are working on the platform -- some of what we finis very visible, but most is not as its a platform that enalrd others.  Thanks, Darren
  13. 5 Stages to something they refer to as the SENTIENT ENTERPRISE Framework for maximizing speed/value/agility of investments in Data Data Management at eBay roughly follows a 5-stage model developed by OLIVER  RATZESBERGER | Teradata MOHAN SAWHNEY | Kellogg School of Management I’ve discussed how agile businesses create a balance between imposing and loosening structure – centralizing the definition of data rules but decentralizing use cases to drive innovation. Agile businesses are able to make more strategic decisions based on higher levels of both breadth and depth of data.
  14. The Agile Data Warehouse moves traditional central DW structures to a balanced decentralized framework built for agility. Centralized data – decentralized access. Data Labs that support experimentation and self service Promotion process for VDM to Prod.
  15. We avoid federation of DMs – no pooling, redundant data, inconsistencies, HC to manage, probably 10x more expensive than they appear
  16. But DMs provide agility and speed. The way we do it is completely different. The right way to do cloud for analytics. When we provision a virtual DM, yes it has 100 GB of empty space, but it also has access to PB of reusable company data instantly. Also used for interative development and test.
  17. Behavioral Data Platform From Transactional to Behavioral Data. Value comes from behaviors rather than transactions
  18. LinkedIn for Analytics Harnessing the power of social & crowed sourcing to empower the enterprise to collaborate on analytics as scacle Share, Follow, Like
  19. Working with 3rd party vendor on building out our collaborative data hub
  20. But these are just the initial steps towards an even more exciting future for the enterprise. Our efforts today around creating agile and transparent data architectures and systems will enable us to create the sentient enterprise. What do I mean by “sentient”? Do I want companies to have feelings? It’s true that the word “sentient’ is derived from the Latin word “sentīre,” meaning “to feel,” and it refers to any entity that can feel or perceive things. This is key to why we think it’s the perfect descriptor. At Teradata we work on helping companies on building a corporation that can sense when something’s wrong and report it to the humans in charge of fixing it. In the sentient enterprise, an entire company operates like a single organism, where the left hand knows what the right hand is doing, and where human beings can get signals and suggestions that inform and guide their critical business decision In the sentient enterprise, your data talks to you, like it has a brain of its own. In the sentient enterprise, the CFO could have answers within hours instead of days. In fact, imagine if long before she received her weekly revenue trends report, the CFO could get an alert from the enterprise pointing her to the root cause so she could do something about it. Or, better yet, what if she didn’t get an alert at all – because the revenue dip was prevented in the first place? ns.
  21. Analytical Application Platform Analytical Apps. From static applications and ETL to agile Self Service Apps. From Extraction of Data to Enterprise Listening. From centralized ETL heavy static code to agile frameworks (if you want your data integrated, conform to this service API) From manual data extraction after the fact to real-time Data Listening – Streaming Transactions + Pulsar
  22. Introduce EDP We’ve been on the path of the agile data warehouse for a long time We have the requisite user created data labs, and support experimentation and testing as well as highly integrated core (production) data Our recent change has been the addition of enterprise data streams Enterprise data streams are near real time streaming data designed to mirror the critical core data from the EDW but in real time – enhanced with history from the EDW so that actions and recommendations can be made based upon new (live) actions while taking into account rich context Without this key enabler we could not progress into stage 5: autonomous decision making Reusing what we learned from EDW in the other two areas
  23. doe.corp.ebay.com
  24. Enterprise data streams and real time services
  25. Automated Decisioning Platform Predictive Technologies and Algorithms. From 10% of time on decision making and 90% sifting through data to 90% on decision making with the help of automated algorithms. Implementing Predictive Technologies and Algorithms at scale and operationalizing them throughout the enterprise Let systems deal with the ever increasing combinations and intersections of data Focus the human brain on making decisions
  26. 50K intersection points across our customer experiences modeled each day Example: total GMV in Fashion, New listings in Electronics
  27. 5:00 - 2 minutes with next – 5:02 This joke reminded me of the situation at eBay in terms of how to use data. It’s very confusing, just like putting together an IKEA piece of furniture.
  28. 5:00 - 2 minutes with previous– 5:02 Same situation at eBay when it comes to finding/using data Transition 1 Reality its much more confusing…also note no instructions
  29. 5:02 - 2 minutes with next – 5:04 Before I talk about how we are transforming analytics at eBay, let me tell you more about our analytic environment. At eBay, a lot of what we do – the decisions we make – are driven by data. We have a large and diverse community of analysts, and we have an even larger and diverse community of data consumers – from Executives all the way through to Business users.
  30. 5:02 - 2 minutes with previous – 5:04 We have a diverse user community… Transition 1 Diverse types of individuals that use data in their day-to-day job, from Data Scientists through traditional FP&A, from Site and Business Analysts to the Exec & Business consumer. Transition 2 This diverse community has a diverse set of needs in using data – from one-off analysis to statistical modeling; from A B experimentation comparisons and deep data mining of unstructured data to standard reports and dashboards using transaction data. Transition 3 And we have a number of tools to enable all of this – from Hadoop to R, SAS & SQL on the large Teradata stores that I have showed, reporting from Excel to visualization tools like Tableau and enterprise class tools like MicroStrategy. But such diversity of Users, needs and tools… Transition 4 …does cause some chaos.
  31. 5:04 - 3 minutes – 5:07 Whaddya mean Chaos? Users can have direct SQL access to the biggest data systems on the plant They can create their own datasets They have enterprise class tools like MicroStrategy And slick visualization builders like Tableau and Excel Transition 1 What’s the problem? ? Why is there a need to “transform analytics”? Transition 2 Well, it is a classic problem that I am sure many of you share or recognize. Transition 3 There are hundreds of files sent via email or squirreled away on SharePoint or shared network drives There are thousands of tables built without consideration of reuse, retention or rationalization There are over 5,000 reports in MicroStrategy with little capability to know if they are relevant or if the data/thinking is stale And there are tens of thousands of workbooks in Tableau that share these same issues Transition 4 So, like I said earlier, at eBay you are expected to “go use data” but it is tough to find existing metrics and reports – there are so many If you are adept at SQL and finding data, it is easy to build metrics – but you might be adding to the chaos And in an unstructured environment, it is tough to know which metrics and reports are the right ones to use, or which ones are old or stale
  32. So we need to think differently about solving these big data problems.