VP of Data Engineering & Analytics, Bill Loconzolo talks about building a scalable, secure data platform at the 2016 Strata Hadoop Conference. If you missed his presentation, check it out. http://intuit.me/22T5Beh
5. Intuit’s big vision for data
Transform the lives of our customers by unleashing the
power of data
Customer
Inspired
6. Era of Windows Era of Web Era of the CloudEra of DOS
Spanning three decades of data
Compliant data
Mobile First
1980s 1990s 2000s
• Employees: 150
• Customers: 1.3M customers
• Revenue: $33M
• Employees: 4,500
• Customers: 5.6M
• Revenue: $1.04B
• Employees: 7,700
• Customers: 37M
• Revenue: $4.2B
20162010
Regulatory data Transactional data Batch data Real time data Complex, secure data
8. The Intuit Analytics Cloud (IAC)
Data-Driven Users Data-Driven Products
CG SBG Generic Data products
that enable
value to be
derived from
the IAC
IntuitAnalyticsCloud
Real Time Ingest Batch Ingest
IAC = Data + Infrastructure + Foundational Services
Business Lookup Unified Profile Personalization A/B Testing ……
Real Time Data Layer
Intermediate Data
Source Data
Online Accountant
9. Declare a big vision, but ground it in reality1.
Know how to say “no”2.
Constant communication is not over-rated3.
Move fast, but with built-in rigor4.
Beware of those who believe in big data black magic5.
If I knew then, what I know now
21. Declare a big vision, but ground it in reality1.
Know how to say “no”2.
Constant communication is not over-rated3.
Move fast, but with built-in rigor4.
Beware of those who believe in big data black magic5.
If I knew then, what I know now
Data is a strategy – requires persistence & pace
Like a baseball season – play for the end game (Ups and Downs)
It’s a team sport – No 1 team
33yrs
My learning's I believe are applicable for a startup to large organizations
OK, help me out, need to know who I’m talking to
How many of you consume data capabilities / BI / Warehouse / ML environments
Raise your hands if you create / operate / support data systems for your company / organization
Transformation is a big word….
We have been around for decades - Data needs to be Product, not application Use Cases then data
We need to move from what happened yesterday (BATCH) to in session insights (REALTIME)
6
We are a central team -- we had been around for about a decade
A central capability data team needs to be clear who you are, who you are there to deliver for, and how to celebrate success
Inheriting a team, creating a new vision, and defining & driving the culture and organizational health is THE TOUGHTEST JOB
We built a platform for Products (realtime) and People
We built a lot of things nobody asked for in Products (Streaming capabilities, Unified Profile – 360 Customer, Simple lookup services, Web/Mobile SDK’s, A/B testing framework)
This is hard to explain, why would you build it…
There is a lot of tension
It took longer to gain alignment than it did to execute, access, and iterate
Cant hit a home run every time
You goal may be to win the world series, but you need to win on average 94 games to make it into the playoffs
A data vision requires a multi year objective with many wins and losses, but a clear objective where you are going and what success looks like
Know what your data sources are. -
Understand that BATCH is still king even though we HATE IT
Realize that little to NONE of the sources you integrate with though Data was important
Expect failures in data – no CDC columns. INSERTS, DELETS, UPDATES, ah, why bother who cares
You may have great individuals but are they up for the task to deliver the vision?
Is that what they were hired for
LevelUp story
TEAMS WIN, not All Stars
We had a goal towards our vision 12in12
Some called it the Death March
Some said if we do that there will be nothing left for us to do after…
Invest in your teams, teach, coach them, mentor, share your strengths and areas of oppertunities – Model the behavior you want to see
Many sources, many DB’s
Tell the story of Trinity
People & Products we solved for
What is the ask, how are they accretive to the vision ?
Help set expectations up front
Tell the story of Vertica and not setting expectations
Share your principles
Enable your team to use the principles and apply judgment
Tell Data Stewardship principles story and external data