Q&A: New Tools Mean Big Data Dives Can Yield Fast Results
Users can explore and analyze even massive data sets quickly with new tools and platforms now available
By Linda L. Briggs
4.15.2014
Analytic tools that can tap into raw, unstructured machine data are becoming increasingly important
and valuable, enabling organizations to explore unstructured data on the fly without fixed schemas
or specialized skill sets. In this interview, Brett Sheppard, director of big data product marketing at
Splunk, explains how new tools and analytic platforms enable even non-technical users to explore
massive data sets, completing analysis of unstructured data in Hadoop in hours instead of weeks or
months. Based in San Francisco, Sheppard has been a data analyst at both Gartner and the U.S.
Department of Defense and is a certified Hadoop system administrator.
BI This Week: Let's start with a fairly basic question. With all this talk of more and more data, what
are some of the newer sources for all this raw data? Where's it coming from?
Brett Sheppard: Social, cloud, and sensors all contribute to big data. For social media as well as
e-commerce transactions, every click on a Web site generates some type of clickstream data, as does
every interaction on social media and every tweet. There's significant value for organizations in having
an ongoing dialog with their customers and prospects and with their communities.
Likewise, every application in the cloud generates sets of data, whether it's from the hardware, the
applications, the security systems, the IT operations for data, or apps in the cloud. Finally, we're
seeing a wealth of very actionable data generated by sensors, ranging from automobiles and
airplanes to building sensors.
For example, Eglin Air Force Base in Florida is reducing its base-wide energy costs by 10 percent or
more by taking data from the heating, ventilation, and air conditioning systems, the hardware
systems -- basically anything in a building that generates data -- and using that sensor data to find
where inefficiencies are, whether it's lights on in the middle of the night or a center that's running air
conditioners too much. They are thus able to uncover energy waste and to find the right balance of
heating and cooling.
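To make the idea concrete, here is a minimal Python sketch of that kind of check: scan building sensor readings and flag lights left on, or air conditioning running hard, in the middle of the night. Everything in it is an assumption for illustration (the reading format, the building names, the night-time window, and the 50 percent load threshold); none of it describes the systems actually used at Eglin.

```python
from datetime import datetime

# Hypothetical readings: (timestamp, building, sensor, value).
# None of these names or numbers come from a real deployment.
READINGS = [
    ("2014-04-01T02:15:00", "bldg-12", "lights_on", 1),
    ("2014-04-01T02:15:00", "bldg-14", "lights_on", 0),
    ("2014-04-01T14:30:00", "bldg-12", "lights_on", 1),
    ("2014-04-01T03:00:00", "bldg-14", "hvac_load_pct", 85),
    ("2014-04-01T03:00:00", "bldg-12", "hvac_load_pct", 20),
]

NIGHT_HOURS = range(0, 5)    # assumed: 00:00-04:59 counts as "middle of the night"
HVAC_NIGHT_LIMIT_PCT = 50    # assumed threshold for "running too much"


def find_night_waste(readings):
    """Return (building, sensor, timestamp) tuples that look wasteful at night."""
    flagged = []
    for ts, building, sensor, value in readings:
        # Daytime readings are ignored in this simplified check.
        if datetime.fromisoformat(ts).hour not in NIGHT_HOURS:
            continue
        if sensor == "lights_on" and value == 1:
            flagged.append((building, sensor, ts))
        elif sensor == "hvac_load_pct" and value > HVAC_NIGHT_LIMIT_PCT:
            flagged.append((building, sensor, ts))
    return flagged


if __name__ == "__main__":
    for building, sensor, ts in find_night_waste(READINGS):
        print(f"{building}: {sensor} flagged at {ts}")
```

A real deployment would presumably compare against occupancy schedules and weather rather than fixed thresholds; the sketch only shows the shape of the question being asked of the data.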
That's just one example. Basically, those three sources of social, cloud, and sensor data are
contributing to a significant increase in the volume and variety of data as well as the need to address
it in as real-time a manner as possible.
We're also seeing a lot of variability in data formats. For example, we work with Ford Motor
Company on data from cars. Ford has proposed an open data standard, OpenXC, for automobile makers
regarding sensors. When we work with Ford, we're able to use that standard, but it isn't yet shared by
the rest of the industry; the other auto makers haven't standardized on OpenXC. Accordingly, there's
a lot of variability in working with car sensor data. That's something we see in a lot of industries
right now.
How many companies out there are successfully capturing big data yet? How many have actually
deployed an HDFS cluster, for example, and populated it with big data?
Well, I would distinguish between big data as a whole and specific technologies. There is a great deal
of interest in Hadoop and HDFS, and it's focused on three areas right now. We see a lot of Hadoop
use in the federal government, within Internet companies, and in Fortune 500 enterprises. Beyond
Hadoop, there are a variety of NoSQL data stores ... and some organizations have big data stored in
relational databases.
What's happened is [that] organizations are able now to store these data types so inexpensively ...
using a variety of storage methods such as Hadoop that the opportunity cost of throwing the data
away is actually higher than the cost of storing it.
How many companies are performing meaningful analytics against all that big data?
The challenge is getting actionable insights from that data because the data is in so many different
formats. Typically, it's raw or unstructured data that doesn't fit very well in either a
relational database or a data intelligence tool, which tends to require an extreme amount of ETL
processing, so it can start to look like a Rube Goldberg project in a way. There are all these steps to
go from raw data to business insight.
That's what organizations are struggling with and why the failure rate of big data
projects is actually quite high. Companies spend six months or more with five or 10 people working
on a big data project, then find at the end that they just aren't able to get the actionable insights that
they wanted from that raw, unstructured big data.
What are some of the challenges for companies of working directly with Hadoop and MapReduce?
Why is it so hard to get value from data in Hadoop?
Beyond Splunk-specific offerings, there are three approaches today to extract value from data in
Hadoop. All three generate value, but they also have significant disadvantages.
First is MapReduce and Apache Pig, which is the way most organizations get started. You can run
searches of data, but it's very slow. Unlike a relational database, where you can have deterministic
query response times, Hadoop does not offer that. A job can run indefinitely -- it could take minutes,
it could take hours. That consumes a lot of resources in the cluster.
Because of that, most organizations also try one of the other two options. The second approach is
Apache Hive or SQL on Hadoop. That works very well if you have a narrow set of questions to ask
because it requires fixed schemas. If an organization wants to replace an existing ETL framework with
something in Hadoop, and it's for, say, static reporting on a small number of data sources, the SQL on
Hadoop or Apache Hive approach can work quite well.
Where that approach runs into challenges is with exploratory analytics across an entire Hadoop
cluster, where it's impossible to define fixed schemas. A knowledge worker in that organization may
want to iterate, ask questions, see the results, and ask follow-up questions. They need to be able to
look at all the data that they have access to within their role-based access controls without having to
pre-define schemas and be limited to the data returned from those schemas.
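The contrast with a fixed schema can be sketched in a few lines of Python. The example below is only an illustration of the general idea of extracting fields at query time; it is not how Hive, Hunk, or any Splunk product is implemented, and the sample log lines, field names, and the `search` helper are all hypothetical.

```python
import re

# Hypothetical raw events in differing formats; not real Splunk or Hadoop output.
RAW_EVENTS = [
    "2014-04-15 10:02:11 status=200 path=/cart user=alice",
    "2014-04-15 10:02:13 status=500 path=/checkout user=bob latency_ms=930",
    "sensor=rpm value=3100 vehicle=truck-7",
]

KEY_VALUE = re.compile(r"(\w+)=(\S+)")


def extract_fields(line):
    """Pull key=value pairs out of a raw line at query time (no predefined schema)."""
    return dict(KEY_VALUE.findall(line))


def search(events, **criteria):
    """Return the raw events whose extracted fields match every criterion."""
    matches = []
    for line in events:
        fields = extract_fields(line)
        # Extracted values are strings, so compare against str(v).
        if all(fields.get(k) == str(v) for k, v in criteria.items()):
            matches.append(line)
    return matches


if __name__ == "__main__":
    # First question: which requests failed?
    print(search(RAW_EVENTS, status=500))
    # Follow-up question on a field no table definition anticipated.
    print(search(RAW_EVENTS, sensor="rpm"))
```

Because nothing is declared up front, a follow-up question on a new field needs no schema change; the trade-off is that every query pays the cost of parsing the raw data again.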
Finally, the third approach is to extract data out of Hadoop and into an in-memory store. This could
be Tableau Software, or SAP HANA, or a variety of in-memory data stores. That approach works
really well if Hadoop is basically doing batch ETL -- where you're taking raw data, you're creating a
set of results that can be interpreted in a row-and-column format in a relational database, and
you're able to export it out of Hadoop.
Customers come to Splunk when they don't want to move the data out of Hadoop. They essentially
want Hadoop to be a data lake where they keep the data at rest. Organizations may have security
concerns about moving data around too much or they don't want to have to set up data marts. In
those cases, organizations are using Hadoop as the data lake, where they persist that data for many
months or years, and use software such as Hunk to ask and answer questions about that data.
Working with big data can also introduce skills challenges, correct? There just aren't enough
people around who understand the technologies.
Absolutely. In fact, that's the single biggest limitation today for Hadoop adoption. Hadoop is
maturing as a technology for storing data in a variety of formats and from many sources, but what's
limiting organizations today is the need for rare, specialized skill sets to do that. ...
There's also a need to mask Hadoop's complexity so that non-specialists can ask questions of the
data. At the same time, data scientists who are fluent in the dozen-plus projects and sub-projects in
the Hadoop system can focus their skills on advanced statistics and advanced algorithms that really
benefit from their knowledge.
Unfortunately, today many data scientists end up wasting their time as "data butlers," where they
have colleagues in the line of business or corporate departments who have analytics tasks. Those
colleagues don't have the advanced skill sets needed to ask and get answers to questions in Hadoop.
Accordingly, data scientists are basically setting up access for their non-specialist colleagues rather
than spending their time doing what they are really there for, which is advanced statistics and
algorithms that really do require a custom, personalized approach.
You mentioned role-based access controls. Why are they so important?
That's one of the challenges to address with big data. Along with rare, specialized skill sets,
role-based access controls are needed to protect non-public information that may be stored in that
Hadoop data lake.
That's a weakness of Hadoop, which was designed for shared-use clusters. Organizations such as
Yahoo that were storing clickstream data in Hadoop had relatively few security concerns. The data
didn't contain non-public information -- it was simply rankings of public Web sites. Accordingly,
Hadoop was not designed with the role-based access controls that anyone familiar with relational
databases would be conversant in.
For working with big data the way most companies need to, though, it's important to have a
technology that can mask some data from some users, offering role-based access to select data in
Hadoop.
In fact, that issue is part of what's held the average size of a Hadoop cluster to 40 nodes rather
than, say, 400 or 4,000 nodes. Organizations have to restrict the number of users who have access
because in traditional HDFS and MapReduce, once someone has access to the cluster, they can do
anything they want within the cluster. They can see all the data, they can delete the data, there is
limited auditability, and at best you're able to see what someone has done after they've done it. If
you're able to define access by role, you can prevent people from accessing, either maliciously or
inadvertently, data that is beyond their role.
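As a rough illustration of what defining access by role means in practice, the following Python sketch checks a role's policy before returning any data and masks fields the role should not see. The roles, data set names, field names, and the `read_records` helper are hypothetical; this is not the access model of Hadoop, Hunk, or any specific product.

```python
# Hypothetical roles, data sets, and field names, for illustration only.
ROLE_POLICY = {
    "marketing_analyst": {
        "datasets": {"clickstream"},
        "masked_fields": {"email"},
    },
    "security_admin": {
        "datasets": {"clickstream", "auth_logs"},
        "masked_fields": set(),
    },
}


class AccessDenied(Exception):
    """Raised when a role asks for a data set outside its policy."""


def read_records(role, dataset, records):
    """Check access before any data is returned, then mask restricted fields."""
    policy = ROLE_POLICY.get(role)
    if policy is None or dataset not in policy["datasets"]:
        raise AccessDenied(f"role {role!r} may not read {dataset!r}")
    return [
        {k: ("***" if k in policy["masked_fields"] else v) for k, v in rec.items()}
        for rec in records
    ]


if __name__ == "__main__":
    rows = [{"user": "alice", "email": "alice@example.com"}]
    print(read_records("marketing_analyst", "clickstream", rows))
    try:
        read_records("marketing_analyst", "auth_logs", rows)
    except AccessDenied as err:
        print("denied:", err)
```

The point of the sketch is that the check happens before the read, so an out-of-role request is refused up front rather than discovered in an audit log afterward.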
Splunk's analytics solution for Hadoop is Hunk, which you introduced last year. Can you talk about
what Hunk brings to the big data equation?
We're very excited about Hunk as part of the Splunk portfolio for big data. Splunk has 7,000
customers today who find data in real time with historical context. Many organizations, though,
want to persist big data for many months or years, and although Hadoop is a convenient and
inexpensive data lake for long-time historical storage, at the same time, organizations want to be
able to ask and answer questions of that data in Hadoop.
The benefit of Hunk is that an organization can explore, analyze, and visualize data at rest in Hadoop
without a need for specialized skill sets. Many organizations that have tested Hunk have found that
they've been able to go from the free trial to searching data in Hadoop within an hour or less. That's
possible because of the Splunk architecture, which has schema on the fly. There's no need to apply
fixed schemas, and there's no need to migrate data out of Hadoop into a fixed store, so the time to
value is significantly faster. You can also expose the data through ODBC drivers to existing business
intelligence dashboards.
About the Author

Linda L. Briggs writes about technology in corporate, education, and government markets. She is
based in San Diego.

lbriggs@lindabriggs.com

More Related Content

What's hot

Graph Data Modeling in Four Dimensions – Outline, Differences, Artisanship, A...
Graph Data Modeling in Four Dimensions – Outline, Differences, Artisanship, A...Graph Data Modeling in Four Dimensions – Outline, Differences, Artisanship, A...
Graph Data Modeling in Four Dimensions – Outline, Differences, Artisanship, A...DATAVERSITY
 
How to identify the Return on Investment of Big Data
How to identify the Return on Investment of Big DataHow to identify the Return on Investment of Big Data
How to identify the Return on Investment of Big DataJose Pablo Fernandez
 
From Big Data to Business Value
From Big Data to Business ValueFrom Big Data to Business Value
From Big Data to Business ValueGib Bassett
 
Big data 4 4 the art of the possible 4-en-web
Big data 4 4 the art of the possible 4-en-webBig data 4 4 the art of the possible 4-en-web
Big data 4 4 the art of the possible 4-en-webRick Bouter
 
Data Science Growth Accelerator
Data Science Growth AcceleratorData Science Growth Accelerator
Data Science Growth AcceleratorKanika Khanna
 
Big data 2 4 - big-social-predicting-behavior-with-big-data
Big data 2 4 - big-social-predicting-behavior-with-big-dataBig data 2 4 - big-social-predicting-behavior-with-big-data
Big data 2 4 - big-social-predicting-behavior-with-big-dataRick Bouter
 
Vint big data research privacy technology and the law
Vint big data research privacy technology and the lawVint big data research privacy technology and the law
Vint big data research privacy technology and the lawKarlos Svoboda
 
Data Modeling Techniques
Data Modeling TechniquesData Modeling Techniques
Data Modeling TechniquesDATAVERSITY
 
5 Big Data Visualization Maps that Will Make Your HEAD EXPLODE
5 Big Data Visualization Maps that Will Make Your HEAD EXPLODE5 Big Data Visualization Maps that Will Make Your HEAD EXPLODE
5 Big Data Visualization Maps that Will Make Your HEAD EXPLODEBI Brainz
 
Sogeti big data - no more secrets with big data analytics
Sogeti   big data - no more secrets with big data analyticsSogeti   big data - no more secrets with big data analytics
Sogeti big data - no more secrets with big data analyticsRick Bouter
 
Big Data and The Future of Insight - Future Foundation
Big Data and The Future of Insight - Future FoundationBig Data and The Future of Insight - Future Foundation
Big Data and The Future of Insight - Future FoundationForesight Factory
 
Sogeti on big data creating clarity
Sogeti on big data creating claritySogeti on big data creating clarity
Sogeti on big data creating clarityYann SESE
 
Econsultancy Future Trends Briefing
Econsultancy Future Trends Briefing Econsultancy Future Trends Briefing
Econsultancy Future Trends Briefing Econsultancy
 
The Human Side of Data By Colin Strong
The Human Side of Data By Colin StrongThe Human Side of Data By Colin Strong
The Human Side of Data By Colin StrongMarTech Conference
 
Strategy Report for NextGen BI
Strategy Report for NextGen BIStrategy Report for NextGen BI
Strategy Report for NextGen BINeil Raden
 

What's hot (18)

Graph Data Modeling in Four Dimensions – Outline, Differences, Artisanship, A...
Graph Data Modeling in Four Dimensions – Outline, Differences, Artisanship, A...Graph Data Modeling in Four Dimensions – Outline, Differences, Artisanship, A...
Graph Data Modeling in Four Dimensions – Outline, Differences, Artisanship, A...
 
How to identify the Return on Investment of Big Data
How to identify the Return on Investment of Big DataHow to identify the Return on Investment of Big Data
How to identify the Return on Investment of Big Data
 
From Big Data to Business Value
From Big Data to Business ValueFrom Big Data to Business Value
From Big Data to Business Value
 
Ds article ppt
Ds article pptDs article ppt
Ds article ppt
 
Big data 4 4 the art of the possible 4-en-web
Big data 4 4 the art of the possible 4-en-webBig data 4 4 the art of the possible 4-en-web
Big data 4 4 the art of the possible 4-en-web
 
Data Science Growth Accelerator
Data Science Growth AcceleratorData Science Growth Accelerator
Data Science Growth Accelerator
 
Big data 2 4 - big-social-predicting-behavior-with-big-data
Big data 2 4 - big-social-predicting-behavior-with-big-dataBig data 2 4 - big-social-predicting-behavior-with-big-data
Big data 2 4 - big-social-predicting-behavior-with-big-data
 
Vint big data research privacy technology and the law
Vint big data research privacy technology and the lawVint big data research privacy technology and the law
Vint big data research privacy technology and the law
 
Data Modeling Techniques
Data Modeling TechniquesData Modeling Techniques
Data Modeling Techniques
 
Keynote Dubai
Keynote DubaiKeynote Dubai
Keynote Dubai
 
Get Data Smart
Get Data Smart Get Data Smart
Get Data Smart
 
5 Big Data Visualization Maps that Will Make Your HEAD EXPLODE
5 Big Data Visualization Maps that Will Make Your HEAD EXPLODE5 Big Data Visualization Maps that Will Make Your HEAD EXPLODE
5 Big Data Visualization Maps that Will Make Your HEAD EXPLODE
 
Sogeti big data - no more secrets with big data analytics
Sogeti   big data - no more secrets with big data analyticsSogeti   big data - no more secrets with big data analytics
Sogeti big data - no more secrets with big data analytics
 
Big Data and The Future of Insight - Future Foundation
Big Data and The Future of Insight - Future FoundationBig Data and The Future of Insight - Future Foundation
Big Data and The Future of Insight - Future Foundation
 
Sogeti on big data creating clarity
Sogeti on big data creating claritySogeti on big data creating clarity
Sogeti on big data creating clarity
 
Econsultancy Future Trends Briefing
Econsultancy Future Trends Briefing Econsultancy Future Trends Briefing
Econsultancy Future Trends Briefing
 
The Human Side of Data By Colin Strong
The Human Side of Data By Colin StrongThe Human Side of Data By Colin Strong
The Human Side of Data By Colin Strong
 
Strategy Report for NextGen BI
Strategy Report for NextGen BIStrategy Report for NextGen BI
Strategy Report for NextGen BI
 

Similar to Tdwi brett-sheppard-interview-april-2014

Welcome to Data Science
Welcome to Data ScienceWelcome to Data Science
Welcome to Data ScienceNyraSehgal
 
Big Data Tools: A Deep Dive into Essential Tools
Big Data Tools: A Deep Dive into Essential ToolsBig Data Tools: A Deep Dive into Essential Tools
Big Data Tools: A Deep Dive into Essential ToolsFredReynolds2
 
Big data (word file)
Big data  (word file)Big data  (word file)
Big data (word file)Shahbaz Anjam
 
Paradigm4 Research Report: Leaving Data on the table
Paradigm4 Research Report: Leaving Data on the tableParadigm4 Research Report: Leaving Data on the table
Paradigm4 Research Report: Leaving Data on the tableParadigm4
 
Big data's impact on online marketing
Big data's impact on online marketingBig data's impact on online marketing
Big data's impact on online marketingPros Global Inc
 
Tools for Unstructured Data Analytics
Tools for Unstructured Data AnalyticsTools for Unstructured Data Analytics
Tools for Unstructured Data AnalyticsRavi Teja
 
Making Sense of your data - eLearning Network April 2014
Making Sense of your data - eLearning Network April 2014Making Sense of your data - eLearning Network April 2014
Making Sense of your data - eLearning Network April 2014Andy Wooler
 
Exploring the impact and evolution of Advanced Analytics Tools.pdf
Exploring the impact and evolution of Advanced Analytics Tools.pdfExploring the impact and evolution of Advanced Analytics Tools.pdf
Exploring the impact and evolution of Advanced Analytics Tools.pdfStats Statswork
 
Exploring the impact and evolution of Advanced Analytics Tools.pdf
Exploring the impact and evolution of Advanced Analytics Tools.pdfExploring the impact and evolution of Advanced Analytics Tools.pdf
Exploring the impact and evolution of Advanced Analytics Tools.pdfStats Statswork
 
Getting down to business on Big Data analytics
Getting down to business on Big Data analyticsGetting down to business on Big Data analytics
Getting down to business on Big Data analyticsThe Marketing Distillery
 
Big data analytics
Big data analyticsBig data analytics
Big data analyticsRavi Teja
 
Making a Successful Career in Data Science
Making a Successful Career in Data ScienceMaking a Successful Career in Data Science
Making a Successful Career in Data ScienceSamidha Takle
 
BDS14 Big Data Analytics to the masses
BDS14 Big Data Analytics to the massesBDS14 Big Data Analytics to the masses
BDS14 Big Data Analytics to the massesJose Luis Lopez Pino
 

Similar to Tdwi brett-sheppard-interview-april-2014 (20)

What is big data
What is big dataWhat is big data
What is big data
 
Welcome to Data Science
Welcome to Data ScienceWelcome to Data Science
Welcome to Data Science
 
Big data
Big dataBig data
Big data
 
Big data analytics
Big data analyticsBig data analytics
Big data analytics
 
Big Data Tools: A Deep Dive into Essential Tools
Big Data Tools: A Deep Dive into Essential ToolsBig Data Tools: A Deep Dive into Essential Tools
Big Data Tools: A Deep Dive into Essential Tools
 
KOHN.ppt
KOHN.pptKOHN.ppt
KOHN.ppt
 
KOHN.ppt
KOHN.pptKOHN.ppt
KOHN.ppt
 
Big data (word file)
Big data  (word file)Big data  (word file)
Big data (word file)
 
Paradigm4 Research Report: Leaving Data on the table
Paradigm4 Research Report: Leaving Data on the tableParadigm4 Research Report: Leaving Data on the table
Paradigm4 Research Report: Leaving Data on the table
 
Big data's impact on online marketing
Big data's impact on online marketingBig data's impact on online marketing
Big data's impact on online marketing
 
Tools for Unstructured Data Analytics
Tools for Unstructured Data AnalyticsTools for Unstructured Data Analytics
Tools for Unstructured Data Analytics
 
Making Sense of your data - eLearning Network April 2014
Making Sense of your data - eLearning Network April 2014Making Sense of your data - eLearning Network April 2014
Making Sense of your data - eLearning Network April 2014
 
Exploring the impact and evolution of Advanced Analytics Tools.pdf
Exploring the impact and evolution of Advanced Analytics Tools.pdfExploring the impact and evolution of Advanced Analytics Tools.pdf
Exploring the impact and evolution of Advanced Analytics Tools.pdf
 
Exploring the impact and evolution of Advanced Analytics Tools.pdf
Exploring the impact and evolution of Advanced Analytics Tools.pdfExploring the impact and evolution of Advanced Analytics Tools.pdf
Exploring the impact and evolution of Advanced Analytics Tools.pdf
 
Getting down to business on Big Data analytics
Getting down to business on Big Data analyticsGetting down to business on Big Data analytics
Getting down to business on Big Data analytics
 
Big data analytics
Big data analyticsBig data analytics
Big data analytics
 
Making a Successful Career in Data Science
Making a Successful Career in Data ScienceMaking a Successful Career in Data Science
Making a Successful Career in Data Science
 
BDS14 Big Data Analytics to the masses
BDS14 Big Data Analytics to the massesBDS14 Big Data Analytics to the masses
BDS14 Big Data Analytics to the masses
 
Does big data really mean better decisions?
Does big data really mean better decisions?Does big data really mean better decisions?
Does big data really mean better decisions?
 
Unlocking big data
Unlocking big dataUnlocking big data
Unlocking big data
 

More from Brett Sheppard

5 ways-to-improve-your-security-with-splunk
5 ways-to-improve-your-security-with-splunk5 ways-to-improve-your-security-with-splunk
5 ways-to-improve-your-security-with-splunkBrett Sheppard
 
Sample Google Paid campaign results
Sample Google Paid campaign resultsSample Google Paid campaign results
Sample Google Paid campaign resultsBrett Sheppard
 
Summary of Made to Stick book
Summary of Made to Stick bookSummary of Made to Stick book
Summary of Made to Stick bookBrett Sheppard
 
Shift from manual to interactive reporting
Shift from manual to interactive reportingShift from manual to interactive reporting
Shift from manual to interactive reportingBrett Sheppard
 
Datadog APM Product Launch
Datadog APM Product LaunchDatadog APM Product Launch
Datadog APM Product LaunchBrett Sheppard
 
Brett Sheppard Sample Portfolio
Brett Sheppard Sample PortfolioBrett Sheppard Sample Portfolio
Brett Sheppard Sample PortfolioBrett Sheppard
 
Idc datadog-expands-into-apm
Idc datadog-expands-into-apmIdc datadog-expands-into-apm
Idc datadog-expands-into-apmBrett Sheppard
 
How Comcast Turns Big Data into Real Time Operational Insights: Winter Olympi...
How Comcast Turns Big Data into Real Time Operational Insights: Winter Olympi...How Comcast Turns Big Data into Real Time Operational Insights: Winter Olympi...
How Comcast Turns Big Data into Real Time Operational Insights: Winter Olympi...Brett Sheppard
 
SEO Checklist For Rapid-growth Startups
SEO Checklist For Rapid-growth StartupsSEO Checklist For Rapid-growth Startups
SEO Checklist For Rapid-growth StartupsBrett Sheppard
 
GigaOM Putting Big Data to Work by Brett Sheppard
GigaOM Putting Big Data to Work by Brett SheppardGigaOM Putting Big Data to Work by Brett Sheppard
GigaOM Putting Big Data to Work by Brett SheppardBrett Sheppard
 
DxContinuum Forrester Webinar
DxContinuum Forrester WebinarDxContinuum Forrester Webinar
DxContinuum Forrester WebinarBrett Sheppard
 
Yahoo Enabling Exploratory Analytics of Data in Shared-service Hadoop Clusters
Yahoo Enabling Exploratory Analytics of Data in Shared-service Hadoop ClustersYahoo Enabling Exploratory Analytics of Data in Shared-service Hadoop Clusters
Yahoo Enabling Exploratory Analytics of Data in Shared-service Hadoop ClustersBrett Sheppard
 

More from Brett Sheppard (15)

5 ways-to-improve-your-security-with-splunk
5 ways-to-improve-your-security-with-splunk5 ways-to-improve-your-security-with-splunk
5 ways-to-improve-your-security-with-splunk
 
Sample Google Paid campaign results
Sample Google Paid campaign resultsSample Google Paid campaign results
Sample Google Paid campaign results
 
Summary of Made to Stick book
Summary of Made to Stick bookSummary of Made to Stick book
Summary of Made to Stick book
 
Shift from manual to interactive reporting
Shift from manual to interactive reportingShift from manual to interactive reporting
Shift from manual to interactive reporting
 
Datadog APM Product Launch
Datadog APM Product LaunchDatadog APM Product Launch
Datadog APM Product Launch
 
Brett Sheppard Sample Portfolio
Brett Sheppard Sample PortfolioBrett Sheppard Sample Portfolio
Brett Sheppard Sample Portfolio
 
Idc datadog-expands-into-apm
Idc datadog-expands-into-apmIdc datadog-expands-into-apm
Idc datadog-expands-into-apm
 
How Comcast Turns Big Data into Real Time Operational Insights: Winter Olympi...
How Comcast Turns Big Data into Real Time Operational Insights: Winter Olympi...How Comcast Turns Big Data into Real Time Operational Insights: Winter Olympi...
How Comcast Turns Big Data into Real Time Operational Insights: Winter Olympi...
 
Datadog brief
Datadog briefDatadog brief
Datadog brief
 
SEO Checklist For Rapid-growth Startups
SEO Checklist For Rapid-growth StartupsSEO Checklist For Rapid-growth Startups
SEO Checklist For Rapid-growth Startups
 
Rapid-fire BI
Rapid-fire BIRapid-fire BI
Rapid-fire BI
 
GigaOM Putting Big Data to Work by Brett Sheppard
GigaOM Putting Big Data to Work by Brett SheppardGigaOM Putting Big Data to Work by Brett Sheppard
GigaOM Putting Big Data to Work by Brett Sheppard
 
DxContinuum Forrester Webinar
DxContinuum Forrester WebinarDxContinuum Forrester Webinar
DxContinuum Forrester Webinar
 
Cloudera Hunk
Cloudera HunkCloudera Hunk
Cloudera Hunk
 
Yahoo Enabling Exploratory Analytics of Data in Shared-service Hadoop Clusters
Yahoo Enabling Exploratory Analytics of Data in Shared-service Hadoop ClustersYahoo Enabling Exploratory Analytics of Data in Shared-service Hadoop Clusters
Yahoo Enabling Exploratory Analytics of Data in Shared-service Hadoop Clusters
 

Recently uploaded

From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...Product School
 
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...Thierry Lestable
 
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualitySoftware Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualityInflectra
 
Measures in SQL (a talk at SF Distributed Systems meetup, 2024-05-22)
Measures in SQL (a talk at SF Distributed Systems meetup, 2024-05-22)Measures in SQL (a talk at SF Distributed Systems meetup, 2024-05-22)
Measures in SQL (a talk at SF Distributed Systems meetup, 2024-05-22)Julian Hyde
 
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...Product School
 
Introduction to Open Source RAG and RAG Evaluation
Introduction to Open Source RAG and RAG EvaluationIntroduction to Open Source RAG and RAG Evaluation
Introduction to Open Source RAG and RAG EvaluationZilliz
 
Speed Wins: From Kafka to APIs in Minutes
Speed Wins: From Kafka to APIs in MinutesSpeed Wins: From Kafka to APIs in Minutes
Speed Wins: From Kafka to APIs in Minutesconfluent
 
ODC, Data Fabric and Architecture User Group
ODC, Data Fabric and Architecture User GroupODC, Data Fabric and Architecture User Group
ODC, Data Fabric and Architecture User GroupCatarinaPereira64715
 
Integrating Telephony Systems with Salesforce: Insights and Considerations, B... (CzechDreamin)
Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu... (CzechDreamin)
How world-class product teams are winning in the AI era by CEO and Founder, P... (Product School)
Optimizing NoSQL Performance Through Observability (ScyllaDB)
Free and Effective: Making Flows Publicly Accessible, Yumi Ibrahimzade (CzechDreamin)
Search and Society: Reimagining Information Access for Radical Futures (Bhaskar Mitra)
IOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptx (Abida Shariff)
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do... (UiPathCommunity)
Essentials of Automations: Optimizing FME Workflows with Parameters (Safe Software)
SOQL 201 for Admins & Developers: Slice & Dice Your Org’s Data With Aggregate... (CzechDreamin)
When stars align: studies in data quality, knowledge graphs, and machine lear... (Elena Simperl)
Demystifying gRPC in .Net by John Staveley (John Staveley)


Tdwi brett-sheppard-interview-april-2014