The Briefing Room
How to Identify, Train or Become a Data Scientist
Twitter Tag: #briefr The Briefing Room
Welcome
Host:
Eric Kavanagh
eric.kavanagh@bloorgroup.com
Mission
•  Reveal the essential characteristics of enterprise software, good and bad
•  Provide a forum for detailed analysis of today’s innovative technologies
•  Give vendors a chance to explain their product to savvy analysts
•  Allow audience members to pose serious questions... and get answers!
Topics
This Month: ANALYTICS
October: DATA PROCESSING
November: DATA DISCOVERY &
VISUALIZATION
Analytics
Analyst: Neil Raden
Neil Raden is the founder and Principal Analyst
at Hired Brains Research. He is the co-author,
with James Taylor, of “Smart (Enough) Systems:
How To Deliver Competitive Advantage by
Automating Hidden Decisions.” With 30 years
of experience, he is a widely published writer,
well-known speaker, analyst and consultant,
having personally designed and implemented
dozens of large analytical applications in
finance, marketing, distribution, logistics,
actuarial, intelligence, scientific, statistical and
consumer products. As an industry analyst, he
has published over 40 white papers, hundreds of
articles, blogs and research reports. He
welcomes your comments and can be reached
at nraden@hiredbrains.com.
Actian
•  Actian is a database and software development company
•  Actian offers the ParAccel DataFlow Engine, a scalable parallel platform that provides visual access to complex data flows
•  The DataFlow Engine is designed to reduce cluster complexity, manage multiple petabytes of data, and scale with the size and dimensionality of the data
Guest: John Santaferraro
John Santaferraro is the Vice President of Product
Marketing at Actian. Prior to joining Actian, Santaferraro
was an independent industry analyst in the business
intelligence and analytics market. Before that he
developed and executed a vertical market strategy for
Hewlett Packard's BI group, focusing on energy,
communications, retail, healthcare and financial services;
he was also instrumental in helping establish HP’s new BI
business group with a combination of solutions, products
and consulting. In 2000, John founded a marketing and
sales consulting company, Ferraro Consulting, providing
business acceleration strategy for technology companies.
Enabling the Business Scientist
John Santaferraro
Vice President of Marketing, ParAccel Platform Group
September 3, 2013
What is a “business scientist”?
Requirements of a “business scientist”
Tools of a “business scientist”
Creating a culture of “business science”
© 2013 Actian Corporation
The “Moneyball” Effect
§ Analytics Go Mainstream
•  Major League Baseball
§  Hire the best team
•  NSA and Big Data
§  ???????????????
•  Target and Pregnancy
§  Predicting pregnancies
What is a Data Scientist?
A data scientist “…incorporates
varying elements and builds on
techniques and theories from
many fields, including
mathematics, statistics, data
engineering, pattern recognition
and learning, advanced
computing, visualization,
uncertainty modeling, data
warehousing, and high
performance computing with the
goal of extracting meaning from
data and creating data
products.”
What is a Data Scientist?
Figure created by Calvin Andrus; it depicts a mash-up of the disciplines
from which data science is derived (13 July 2012)
http://en.wikipedia.org/wiki/Data_science
What is a Business Scientist?
“A business scientist is an expert in the
science of business, sitting between the
business analyst and the data scientist,
pulling together cross-functional expertise
from data science, analytics, business
applications, business processes, and
business strategy.”
Business Science Practice Areas
Business Science
•  Sales
•  Marketing
•  Supply Chain
•  Logistics
•  Finance
•  Human Resources
•  Risk
•  Fraud
Business Science Skillset
Understand How Analytics Work
Understand Emerging Data Types
Understand Business Operations & Strategy
Learn Quickly
Think Outside the Box
Tell Compelling Stories
The Tools of the Business Scientist
§  Libraries of Analytic Functions Run at Extreme Speed
•  Transformational Analytics
•  Statistical Analytics
•  Machine Learning Analytics
•  Clustering Analytics
•  Discovery Analytics
§  Visual Framework for Data Discovery, Preparation and Analytics
•  Drag and Drop Interaction
•  Libraries of Data Preparation Operators
•  Libraries of Analytic Operators
•  High-Performance, Parallel Processing on Hadoop (or other file systems)
ParAccel Platform – Unconstrained Analytics
[Architecture diagram. Labels: Business Intelligence and Reporting Tools; Advanced Analytics; Analytic Applications; Machine Data; Operational Data; 3rd Party Info Provider; Streaming Data; Logs; On-Demand Integration; On Demand Integration Services; Enterprise Data Warehouse; Hadoop; Big Data Apps; Embedded Analytics; In-Database Analytics]
Accelerate Time to Value with Libraries of Analytic Functions
•  Statistical: Standard Deviation, Correlation, Covariance, etc.
•  Univariate: Gamma distribution, Maxwell distribution, Weibull, etc.
•  Multivariate: Normal Copula, Hypothesis Testing, Gumbel Copula, etc.
•  Data Mining: K-Means, Logistic Regression, Neural Networks, etc.
•  Mathematical: Trigonometric, Permutation / Combination, Exponential / Logarithm, etc.
•  Corporate Finance: Present Value Analysis, Stock Valuation, Asset Valuation, etc.
•  Options / Derivatives: Risk neutral valuation (with/without Black-Scholes), Greeks, etc.
•  Portfolio Management: Currency / Cross-currency derivatives, Merton Models, etc.
•  Fixed Income: Price and Yield, Duration, Convexity, etc.
•  Time Series Analysis: ARMA / ARIMA models, ARCH/GARCH model, Regime Switch, etc.
§  100+ pre-loaded SQL, window, and mathematical functions
§  500+ advanced analytics available for purchase
Business Analyst to Business Scientist
Unconstrained Analytics
Load and Go
Run Ad Hoc Queries
Query Any Time
Query Any Data
Query All Data
Run Any Analytics
Execute Sophisticated Analytics
Return Results Quickly
Iterate Quickly Through Discovery
Share Workloads With Any Platform
Support All Analysts
Run Many Applications
Create Analytic Services
ParAccel Dataflow & Hadoop Analytics
A visual framework for high-performance data provisioning, ETL, and analytics on Hadoop (or other file systems) without any knowledge of MapReduce or parallel programming
Connect
›  On-demand integration
›  Data and Application Integration
Prepare
›  In-flight preparation
›  In-Hadoop preparation
Optimize
›  Dataflow optimizations
›  Hadoop optimizations
Analyze
›  In-Hadoop analytics
›  Non-Hadoop analytics
[Diagram, from DATA to VALUE. Labels: Business Intelligence; Analytics; Enterprise; Social; New Data; Applications; DW; www; Mobile; Machine Data; High-Performance BI; High-Performance Analytics]
ParAccel Dataflow – Designer
§  Single UI for Data Preparation and Advanced Analytics
Dataflow Operator Libraries
HDFS
ParAccel Platform in Action
ParAccel Platform
Read Write Prepare Analyze Read Write Analyze Read Write
Creating a Culture for Business Science
Create Educational Opportunities
Provide Incentives for Participants
Reorganize to Support Business Science
Deploy Infrastructure to Support Analytics
Contact me at…
john.santaferraro@actian.com
408.373.7500
Visit Actian at…
www.actian.com
Perceptions & Questions
Analyst:
Neil Raden
Analytic Types and Roles
Neil Raden
Founder, Hired Brains Research
Twitter: NeilRaden
Blog: http://hiredbrains.wordpress.com
Website: http://www.hiredbrains.com
Mail: nraden@hiredbrains.com
LinkedIn: http://www.linkedin.com/in/neilraden
Copyright 2013 Neil Raden and Hired Brains Research LLC
No More Managing from Scarcity
Even Big Data Doesn’t Speak for Itself
•  Incomplete!
•  Behaviors under-represented!
•  Anonymizing disasters!
•  Selection!
•  ML still needs analyst!
Not a crystal ball
Anscombe’s Quartet
Mean of x = 9
Variance of x = 11
Mean of y = 7.50
Variance of y = 4.122
Correlation between x and y = 0.816
Linear regression line y = 3.00 + 0.500x
Analytic Types
Type I: Quantitative R&D
•  Quantitative Sophistication/Numeracy: PhD or equivalent
•  Sample Roles: Creation of theory, development of algorithms. Academic/research. Work in business/government for very specialized roles
Type II: Data Scientist or Quantitative Analyst
•  Quantitative Sophistication/Numeracy: Advanced Math/Stat, not necessarily PhD
•  Sample Roles: Internal expert in statistical and mathematical modelling and development, with solid business domain knowledge.
Type III: Operational Analytics
•  Quantitative Sophistication/Numeracy: Good business domain, background in statistics optional
•  Sample Roles: Running and managing analytical models. Strong skills in and/or project management of analytical systems implementation
Type IV: Business Intelligence/Discovery
•  Quantitative Sophistication/Numeracy: Data and numbers oriented, but no special advanced statistical skills
•  Sample Roles: Reporting, dashboard, OLAP and visualization, some design, posterior analysis of results from quantitative methods. Spreadsheets, “business discovery tools”
Types of Analysis
Questions
•  How would you describe the difference between a data scientist and a business scientist?
•  What tools are needed to support a business analyst?
•  What’s the career path for a business analyst?
•  Is big data suffering from hype?
•  Why do you think people are afraid of math?
•  Should universities prepare people for business science or should industry?
Upcoming Topics
www.insideanalysis.com
September: ANALYTICS
October: DATA PROCESSING
November: DATA DISCOVERY &
VISUALIZATION
Thank You
for Your
Attention

More Related Content

What's hot

How to become a Data Scientist?
How to become a Data Scientist? How to become a Data Scientist?
How to become a Data Scientist? HackerEarth
 
Data science e machine learning
Data science e machine learningData science e machine learning
Data science e machine learningGiuseppe Manco
 
Introduction to Python for Data Science
Introduction to Python for Data ScienceIntroduction to Python for Data Science
Introduction to Python for Data ScienceArc & Codementor
 
Life of a data scientist (pub)
Life of a data scientist (pub)Life of a data scientist (pub)
Life of a data scientist (pub)Buhwan Jeong
 
Introduction to Data Science and Large-scale Machine Learning
Introduction to Data Science and Large-scale Machine LearningIntroduction to Data Science and Large-scale Machine Learning
Introduction to Data Science and Large-scale Machine LearningNik Spirin
 
Data+Science : A First Course
Data+Science : A First CourseData+Science : A First Course
Data+Science : A First CourseArnab Majumdar
 
How To Become a Data Scientist in Iran Marketplace
How To Become a Data Scientist in Iran Marketplace How To Become a Data Scientist in Iran Marketplace
How To Become a Data Scientist in Iran Marketplace Mohamadreza Mohtat
 
Myths and Mathemagical Superpowers of Data Scientists
Myths and Mathemagical Superpowers of Data ScientistsMyths and Mathemagical Superpowers of Data Scientists
Myths and Mathemagical Superpowers of Data ScientistsDavid Pittman
 
Data Science-Why?What?How? By Hari Prasad
Data Science-Why?What?How? By Hari PrasadData Science-Why?What?How? By Hari Prasad
Data Science-Why?What?How? By Hari PrasadHari Prasad
 
The path to be a data scientist
The path to be a data scientistThe path to be a data scientist
The path to be a data scientistPoo Kuan Hoong
 
Claudia Gold: Learning Data Science Online
Claudia Gold: Learning Data Science OnlineClaudia Gold: Learning Data Science Online
Claudia Gold: Learning Data Science Onlinesfdatascience
 
Data Science presentation for elementary school students
Data Science presentation for elementary school studentsData Science presentation for elementary school students
Data Science presentation for elementary school studentsMelanie Manning, CFA
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data ScienceNiko Vuokko
 
Data Science 101
Data Science 101Data Science 101
Data Science 101odsc
 
How to Hire Data Scientists
How to Hire Data ScientistsHow to Hire Data Scientists
How to Hire Data ScientistsGalvanize
 
Demystifying Data Science with an introduction to Machine Learning
Demystifying Data Science with an introduction to Machine LearningDemystifying Data Science with an introduction to Machine Learning
Demystifying Data Science with an introduction to Machine LearningJulian Bright
 
Data science
Data scienceData science
Data science9diov
 

What's hot (20)

How to become a Data Scientist?
How to become a Data Scientist? How to become a Data Scientist?
How to become a Data Scientist?
 
Data science e machine learning
Data science e machine learningData science e machine learning
Data science e machine learning
 
Introduction to Python for Data Science
Introduction to Python for Data ScienceIntroduction to Python for Data Science
Introduction to Python for Data Science
 
Life of a data scientist (pub)
Life of a data scientist (pub)Life of a data scientist (pub)
Life of a data scientist (pub)
 
Data Science: Past, Present, and Future
Data Science: Past, Present, and FutureData Science: Past, Present, and Future
Data Science: Past, Present, and Future
 
Introduction to Data Science and Large-scale Machine Learning
Introduction to Data Science and Large-scale Machine LearningIntroduction to Data Science and Large-scale Machine Learning
Introduction to Data Science and Large-scale Machine Learning
 
Data+Science : A First Course
Data+Science : A First CourseData+Science : A First Course
Data+Science : A First Course
 
How To Become a Data Scientist in Iran Marketplace
How To Become a Data Scientist in Iran Marketplace How To Become a Data Scientist in Iran Marketplace
How To Become a Data Scientist in Iran Marketplace
 
Myths and Mathemagical Superpowers of Data Scientists
Myths and Mathemagical Superpowers of Data ScientistsMyths and Mathemagical Superpowers of Data Scientists
Myths and Mathemagical Superpowers of Data Scientists
 
Data Science-Why?What?How? By Hari Prasad
Data Science-Why?What?How? By Hari PrasadData Science-Why?What?How? By Hari Prasad
Data Science-Why?What?How? By Hari Prasad
 
The path to be a data scientist
The path to be a data scientistThe path to be a data scientist
The path to be a data scientist
 
Claudia Gold: Learning Data Science Online
Claudia Gold: Learning Data Science OnlineClaudia Gold: Learning Data Science Online
Claudia Gold: Learning Data Science Online
 
Data Science presentation for elementary school students
Data Science presentation for elementary school studentsData Science presentation for elementary school students
Data Science presentation for elementary school students
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Data science
Data scienceData science
Data science
 
Data Science 101
Data Science 101Data Science 101
Data Science 101
 
How to Hire Data Scientists
How to Hire Data ScientistsHow to Hire Data Scientists
How to Hire Data Scientists
 
Intro to Data Science by DatalentTeam at Data Science Clinic#11
Intro to Data Science by DatalentTeam at Data Science Clinic#11Intro to Data Science by DatalentTeam at Data Science Clinic#11
Intro to Data Science by DatalentTeam at Data Science Clinic#11
 
Demystifying Data Science with an introduction to Machine Learning
Demystifying Data Science with an introduction to Machine LearningDemystifying Data Science with an introduction to Machine Learning
Demystifying Data Science with an introduction to Machine Learning
 
Data science
Data scienceData science
Data science
 

Similar to How to Identify, Train or Become a Data Scientist

Analytic Excellence - Saying Goodbye to Old Constraints
Analytic Excellence - Saying Goodbye to Old ConstraintsAnalytic Excellence - Saying Goodbye to Old Constraints
Analytic Excellence - Saying Goodbye to Old ConstraintsInside Analysis
 
Are you getting the most out of your data?
Are you getting the most out of your data?Are you getting the most out of your data?
Are you getting the most out of your data?SAS Canada
 
Creating your Center of Excellence (CoE) for data driven use cases
Creating your Center of Excellence (CoE) for data driven use casesCreating your Center of Excellence (CoE) for data driven use cases
Creating your Center of Excellence (CoE) for data driven use casesFrank Vullers
 
Big Data Discovery
Big Data DiscoveryBig Data Discovery
Big Data DiscoveryHarald Erb
 
Keyrus US Information
Keyrus US InformationKeyrus US Information
Keyrus US InformationJulian Tong
 
SIMPosium presentation_Bardess Qlik
SIMPosium presentation_Bardess QlikSIMPosium presentation_Bardess Qlik
SIMPosium presentation_Bardess QlikBardess Group
 
Revolution in Business Analytics-Zika Virus Example
Revolution in Business Analytics-Zika Virus ExampleRevolution in Business Analytics-Zika Virus Example
Revolution in Business Analytics-Zika Virus ExampleBardess Group
 
How Can Analytics Improve Business?
How Can Analytics Improve Business?How Can Analytics Improve Business?
How Can Analytics Improve Business?Inside Analysis
 
Expert Big Data Tips
Expert Big Data TipsExpert Big Data Tips
Expert Big Data TipsQubole
 
See the Whole Story: The Case for a Visualization Platform
See the Whole Story: The Case for a Visualization PlatformSee the Whole Story: The Case for a Visualization Platform
See the Whole Story: The Case for a Visualization PlatformEric Kavanagh
 
Riding the wave of analytics revolution
Riding the wave of analytics revolutionRiding the wave of analytics revolution
Riding the wave of analytics revolutionTanuj Poddar
 
How to make your data scientists happy
How to make your data scientists happy How to make your data scientists happy
How to make your data scientists happy Hussain Sultan
 
Put Alternative Data to Use in Capital Markets

Put Alternative Data to Use in Capital Markets
Put Alternative Data to Use in Capital Markets

Put Alternative Data to Use in Capital Markets
Cloudera, Inc.
 
The Agile Analyst: Solving the Data Problem with Virtualization
The Agile Analyst: Solving the Data Problem with VirtualizationThe Agile Analyst: Solving the Data Problem with Virtualization
The Agile Analyst: Solving the Data Problem with VirtualizationInside Analysis
 
Knowledge Graphs Webinar- 11/7/2017
Knowledge Graphs Webinar- 11/7/2017Knowledge Graphs Webinar- 11/7/2017
Knowledge Graphs Webinar- 11/7/2017Neo4j
 
Connecting the Dots with Data Mashups
Connecting the Dots with Data MashupsConnecting the Dots with Data Mashups
Connecting the Dots with Data MashupsInside Analysis
 
Building the Artificially Intelligent Enterprise
Building the Artificially Intelligent EnterpriseBuilding the Artificially Intelligent Enterprise
Building the Artificially Intelligent EnterpriseDatabricks
 
Agile data science
Agile data scienceAgile data science
Agile data scienceJoel Horwitz
 
The Analytic Platform: Empowering the Business Now
The Analytic Platform: Empowering the Business NowThe Analytic Platform: Empowering the Business Now
The Analytic Platform: Empowering the Business NowInside Analysis
 

Similar to How to Identify, Train or Become a Data Scientist (20)

Analytic Excellence - Saying Goodbye to Old Constraints
Analytic Excellence - Saying Goodbye to Old ConstraintsAnalytic Excellence - Saying Goodbye to Old Constraints
Analytic Excellence - Saying Goodbye to Old Constraints
 
Are you getting the most out of your data?
Are you getting the most out of your data?Are you getting the most out of your data?
Are you getting the most out of your data?
 
Creating your Center of Excellence (CoE) for data driven use cases
Creating your Center of Excellence (CoE) for data driven use casesCreating your Center of Excellence (CoE) for data driven use cases
Creating your Center of Excellence (CoE) for data driven use cases
 
Big Data Discovery
Big Data DiscoveryBig Data Discovery
Big Data Discovery
 
Keyrus US Information
Keyrus US InformationKeyrus US Information
Keyrus US Information
 
Keyrus US Information
Keyrus US InformationKeyrus US Information
Keyrus US Information
 
SIMPosium presentation_Bardess Qlik
SIMPosium presentation_Bardess QlikSIMPosium presentation_Bardess Qlik
SIMPosium presentation_Bardess Qlik
 
Revolution in Business Analytics-Zika Virus Example
Revolution in Business Analytics-Zika Virus ExampleRevolution in Business Analytics-Zika Virus Example
Revolution in Business Analytics-Zika Virus Example
 
How Can Analytics Improve Business?
How Can Analytics Improve Business?How Can Analytics Improve Business?
How Can Analytics Improve Business?
 
Expert Big Data Tips
Expert Big Data TipsExpert Big Data Tips
Expert Big Data Tips
 
See the Whole Story: The Case for a Visualization Platform
See the Whole Story: The Case for a Visualization PlatformSee the Whole Story: The Case for a Visualization Platform
See the Whole Story: The Case for a Visualization Platform
 
Riding the wave of analytics revolution
Riding the wave of analytics revolutionRiding the wave of analytics revolution
Riding the wave of analytics revolution
 
How to make your data scientists happy
How to make your data scientists happy How to make your data scientists happy
How to make your data scientists happy
 
Put Alternative Data to Use in Capital Markets

Put Alternative Data to Use in Capital Markets
Put Alternative Data to Use in Capital Markets

Put Alternative Data to Use in Capital Markets

 
The Agile Analyst: Solving the Data Problem with Virtualization
The Agile Analyst: Solving the Data Problem with VirtualizationThe Agile Analyst: Solving the Data Problem with Virtualization
The Agile Analyst: Solving the Data Problem with Virtualization
 
Knowledge Graphs Webinar- 11/7/2017
Knowledge Graphs Webinar- 11/7/2017Knowledge Graphs Webinar- 11/7/2017
Knowledge Graphs Webinar- 11/7/2017
 
Connecting the Dots with Data Mashups
Connecting the Dots with Data MashupsConnecting the Dots with Data Mashups
Connecting the Dots with Data Mashups
 
Building the Artificially Intelligent Enterprise
Building the Artificially Intelligent EnterpriseBuilding the Artificially Intelligent Enterprise
Building the Artificially Intelligent Enterprise
 
Agile data science
Agile data scienceAgile data science
Agile data science
 
The Analytic Platform: Empowering the Business Now
The Analytic Platform: Empowering the Business NowThe Analytic Platform: Empowering the Business Now
The Analytic Platform: Empowering the Business Now
 

More from Inside Analysis

An Ounce of Prevention: Forging Healthy BI
An Ounce of Prevention: Forging Healthy BIAn Ounce of Prevention: Forging Healthy BI
An Ounce of Prevention: Forging Healthy BIInside Analysis
 
Agile, Automated, Aware: How to Model for Success
Agile, Automated, Aware: How to Model for SuccessAgile, Automated, Aware: How to Model for Success
Agile, Automated, Aware: How to Model for SuccessInside Analysis
 
First in Class: Optimizing the Data Lake for Tighter Integration
First in Class: Optimizing the Data Lake for Tighter IntegrationFirst in Class: Optimizing the Data Lake for Tighter Integration
First in Class: Optimizing the Data Lake for Tighter IntegrationInside Analysis
 
Fit For Purpose: Preventing a Big Data Letdown
Fit For Purpose: Preventing a Big Data LetdownFit For Purpose: Preventing a Big Data Letdown
Fit For Purpose: Preventing a Big Data LetdownInside Analysis
 
To Serve and Protect: Making Sense of Hadoop Security
To Serve and Protect: Making Sense of Hadoop Security To Serve and Protect: Making Sense of Hadoop Security
To Serve and Protect: Making Sense of Hadoop Security Inside Analysis
 
The Hadoop Guarantee: Keeping Analytics Running On Time
The Hadoop Guarantee: Keeping Analytics Running On TimeThe Hadoop Guarantee: Keeping Analytics Running On Time
The Hadoop Guarantee: Keeping Analytics Running On TimeInside Analysis
 
Introducing: A Complete Algebra of Data
Introducing: A Complete Algebra of DataIntroducing: A Complete Algebra of Data
Introducing: A Complete Algebra of DataInside Analysis
 
The Role of Data Wrangling in Driving Hadoop Adoption
The Role of Data Wrangling in Driving Hadoop AdoptionThe Role of Data Wrangling in Driving Hadoop Adoption
The Role of Data Wrangling in Driving Hadoop AdoptionInside Analysis
 
Ahead of the Stream: How to Future-Proof Real-Time Analytics
Ahead of the Stream: How to Future-Proof Real-Time AnalyticsAhead of the Stream: How to Future-Proof Real-Time Analytics
Ahead of the Stream: How to Future-Proof Real-Time AnalyticsInside Analysis
 
All Together Now: Connected Analytics for the Internet of Everything
All Together Now: Connected Analytics for the Internet of EverythingAll Together Now: Connected Analytics for the Internet of Everything
All Together Now: Connected Analytics for the Internet of EverythingInside Analysis
 
Goodbye, Bottlenecks: How Scale-Out and In-Memory Solve ETL
Goodbye, Bottlenecks: How Scale-Out and In-Memory Solve ETLGoodbye, Bottlenecks: How Scale-Out and In-Memory Solve ETL
Goodbye, Bottlenecks: How Scale-Out and In-Memory Solve ETLInside Analysis
 
The Biggest Picture: Situational Awareness on a Global Level
The Biggest Picture: Situational Awareness on a Global LevelThe Biggest Picture: Situational Awareness on a Global Level
The Biggest Picture: Situational Awareness on a Global LevelInside Analysis
 
Structurally Sound: How to Tame Your Architecture
Structurally Sound: How to Tame Your ArchitectureStructurally Sound: How to Tame Your Architecture
Structurally Sound: How to Tame Your ArchitectureInside Analysis
 
SQL In Hadoop: Big Data Innovation Without the Risk
SQL In Hadoop: Big Data Innovation Without the RiskSQL In Hadoop: Big Data Innovation Without the Risk
SQL In Hadoop: Big Data Innovation Without the RiskInside Analysis
 
The Perfect Fit: Scalable Graph for Big Data
The Perfect Fit: Scalable Graph for Big DataThe Perfect Fit: Scalable Graph for Big Data
The Perfect Fit: Scalable Graph for Big DataInside Analysis
 
A Revolutionary Approach to Modernizing the Data Warehouse
A Revolutionary Approach to Modernizing the Data WarehouseA Revolutionary Approach to Modernizing the Data Warehouse
A Revolutionary Approach to Modernizing the Data WarehouseInside Analysis
 
The Maturity Model: Taking the Growing Pains Out of Hadoop
The Maturity Model: Taking the Growing Pains Out of HadoopThe Maturity Model: Taking the Growing Pains Out of Hadoop
The Maturity Model: Taking the Growing Pains Out of HadoopInside Analysis
 
Rethinking Data Availability and Governance in a Mobile World
Rethinking Data Availability and Governance in a Mobile WorldRethinking Data Availability and Governance in a Mobile World
Rethinking Data Availability and Governance in a Mobile WorldInside Analysis
 
DisrupTech - Dave Duggal
DisrupTech - Dave DuggalDisrupTech - Dave Duggal
DisrupTech - Dave DuggalInside Analysis
 

More from Inside Analysis (20)

An Ounce of Prevention: Forging Healthy BI
An Ounce of Prevention: Forging Healthy BIAn Ounce of Prevention: Forging Healthy BI
An Ounce of Prevention: Forging Healthy BI
 
Agile, Automated, Aware: How to Model for Success
Agile, Automated, Aware: How to Model for SuccessAgile, Automated, Aware: How to Model for Success
Agile, Automated, Aware: How to Model for Success
 
First in Class: Optimizing the Data Lake for Tighter Integration
First in Class: Optimizing the Data Lake for Tighter IntegrationFirst in Class: Optimizing the Data Lake for Tighter Integration
First in Class: Optimizing the Data Lake for Tighter Integration
 
Fit For Purpose: Preventing a Big Data Letdown
Fit For Purpose: Preventing a Big Data LetdownFit For Purpose: Preventing a Big Data Letdown
Fit For Purpose: Preventing a Big Data Letdown
 
To Serve and Protect: Making Sense of Hadoop Security
To Serve and Protect: Making Sense of Hadoop Security To Serve and Protect: Making Sense of Hadoop Security

How to Identify, Train or Become a Data Scientist

  • 1. The Briefing Room How to Identify, Train or Become a Data Scientist
  • 2. Twitter Tag: #briefr The Briefing Room Welcome Host: Eric Kavanagh eric.kavanagh@bloorgroup.com
  • 3. Mission: Reveal the essential characteristics of enterprise software, good and bad; Provide a forum for detailed analysis of today's innovative technologies; Give vendors a chance to explain their product to savvy analysts; Allow audience members to pose serious questions... and get answers!
  • 4. Topics. This Month: ANALYTICS; October: DATA PROCESSING; November: DATA DISCOVERY & VISUALIZATION
  • 5. Analytics
  • 6. Analyst: Neil Raden. Neil Raden is the founder and Principal Analyst at Hired Brains Research. He is the co-author, with James Taylor, of “Smart (Enough) Systems: How To Deliver Competitive Advantage by Automating Hidden Decisions.” With 30 years' experience, he is a widely published writer, well-known speaker, analyst and consultant, having personally designed and implemented dozens of large analytical applications in finance, marketing, distribution, logistics, actuarial, intelligence, scientific, statistical and consumer products. As an industry analyst, he has published over 40 white papers and hundreds of articles, blogs and research reports. He welcomes your comments and can be reached at nraden@hiredbrains.com.
  • 7. Actian: Actian is a database and software development company; Actian offers the ParAccel DataFlow Engine, a scalable parallel platform which provides visual access to complex data flows; The DataFlow Engine is designed to reduce cluster complexity, manage multi-petabytes of data, and scale with the size and dimensionality of the data
  • 8. Guest: John Santaferraro. John Santaferraro is the Vice President of Product Marketing at Actian. Prior to joining Actian, Santaferraro was an independent industry analyst in the business intelligence and analytics market. Before that he developed and executed a vertical market strategy for Hewlett Packard's BI group, focusing on energy, communications, retail, healthcare and financial services; he was also instrumental in helping establish HP’s new BI business group with a combination of solutions, products and consulting. In 2000, John founded a marketing and sales consulting company, Ferraro Consulting, providing business acceleration strategy for technology companies.
  • 9. Enabling the Business Scientist. John Santaferraro, Vice President of Marketing, ParAccel Platform Group. September 3, 2013
  • 10. What is a “business scientist”? Requirements of a “business scientist”; Tools of a “business scientist”; Creating a culture of “business science”. © 2013 Actian Corporation
  • 11. The “Moneyball” Effect: Analytics Go Mainstream. Major League Baseball: hire the best team. NSA and Big Data: ???????????????. Target and Pregnancy: predicting pregnancies.
  • 12. What is a Data Scientist?
  • 13. What is a Data Scientist? A data scientist “…incorporates varying elements and builds on techniques and theories from many fields, including mathematics, statistics, data engineering, pattern recognition and learning, advanced computing, visualization, uncertainty modeling, data warehousing, and high performance computing with the goal of extracting meaning from data and creating data products.” Figure: created by Calvin Andrus, depicts a mash-up of disciplines from which Data Science is derived, 13 July 2012. http://en.wikipedia.org/wiki/Data_science
  • 14. What is a Business Scientist? “A business scientist is an expert in the science of business, sitting between the business analyst and the data scientist, pulling together cross-functional expertise from data science, analytics, business applications, business processes, and business strategy.”
  • 15. Business Science Practice Areas: Sales; Marketing; Supply Chain; Logistics; Finance; Human Resource; Risk; Fraud
  • 16. Business Science Skillset: Understand How Analytics Work; Understand Emerging Data Types; Understand Business Operations & Strategy; Learn Quickly; Think Outside the Box; Tell Compelling Stories
  • 17. The Tools of the Business Scientist. Libraries of Analytic Functions Run at Extreme Speed: Transformational Analytics; Statistical Analytics; Machine Learning Analytics; Clustering Analytics; Discovery Analytics. Visual Framework for Data Discovery, Preparation and Analytics: Drag and Drop Interaction; Libraries of Data Preparation Operators; Libraries of Analytic Operators; High-Performance, Parallel Processing on Hadoop (or other file systems)
  • 18. ParAccel Platform – Unconstrained Analytics. Diagram labels: Business Intelligence and Reporting Tools; Advanced Analytics; Analytic Applications; Machine Data; Operational Data; 3rd Party Info Provider; Streaming Data; Logs; On-Demand Integration; On Demand Integration Services; Enterprise Data Warehouse; Hadoop; Big Data Apps; Embedded Analytics; In-Database Analytics
  • 19. Accelerate Time to Value with Libraries of Analytic Functions
      • Statistical: Standard Deviation; Correlation; Covariance, etc.
      • Corporate Finance: Present Value Analysis; Stock Valuation; Asset Valuation, etc.
      • Univariate: Gamma distribution; Maxwell distribution; Weibull, etc.
      • Options / Derivatives: Risk neutral valuation (with/without Black-Scholes); Greeks, etc.
      • Multivariate: Normal Copula; Hypothesis Testing; Gumbel Copula, etc.
      • Portfolio Management: Currency / Cross-currency derivatives; Merton Models, etc.
      • Data Mining: K-Means; Logistic Regression; Neural Networks, etc.
      • Fixed Income: Price and Yield; Duration; Convexity, etc.
      • Mathematical: Trigonometric; Permutation / Combination; Exponential / Logarithm, etc.
      • Time Series Analysis: ARMA / ARIMA models; ARCH/GARCH model; Regime Switch, etc.
      • 100+ pre-loaded SQL, window, and mathematical functions
      • 500+ advanced analytics available for purchase
  • 20. Business Analyst to Business Scientist. Unconstrained Analytics: Load and Go; Run Ad Hoc Queries; Query Any Time; Query Any Data; Query All Data; Run Any Analytics; Execute Sophisticated Analytics; Return Results Quickly; Iterate Quickly Through Discovery; Share Workloads With Any Platform; Support All Analysts; Run Many Applications; Create Analytic Services
  • 21. ParAccel Dataflow & Hadoop Analytics. A visual framework for high-performance data provisioning, ETL, and analytics on Hadoop (or other file systems) without any knowledge of MapReduce or parallel programming. Capabilities: On-demand integration; Data and Application Integration; In-flight preparation; In-Hadoop preparation; Dataflow optimizations; Hadoop optimizations; In-Hadoop analytics; Non-Hadoop analytics. Stages: Connect, Prepare, Analyze, Optimize. Diagram labels: Business Intelligence, Analytics, Enterprise, Social, New Data, Applications, DW, www, Mobile, Machine Data, High-Performance BI, High-Performance Analytics, DATA, VALUE
  • 22. ParAccel Dataflow – Designer: Single UI for Data Preparation and Advanced Analytics
  • 23. Dataflow Operator Libraries
  • 24. ParAccel Platform in Action. Diagram labels: HDFS; ParAccel Platform; Read, Write, Prepare, Analyze; Read, Write, Analyze; Read, Write
  • 25. Creating a Culture for Business Science: Create Educational Opportunities; Provide Incentives for Participants; Reorganize to Support Business Science; Deploy Infrastructure to Support Analytics
  • 26. Contact me at… john.santaferraro@actian.com, 408.373.7500. Visit Actian at… www.actian.com
  • 27. Perceptions & Questions. Analyst: Neil Raden
  • 28. Analytic Types and Roles. Neil Raden, Founder, Hired Brains Research. Twitter: NeilRaden. Blog: http://hiredbrains.wordpress.com. Website: http://www.hiredbrains.com. Mail: nraden@hiredbrains.com. LinkedIn: http://www.linkedin.com/in/neilraden. Copyright 2013 Neil Raden and Hired Brains Research LLC
  • 29. No More Managing from Scarcity
  • 30. Even Big Data Doesn’t Speak for Itself: Incomplete! Behaviors under-represented! Anonymizing disasters! Selection! ML still needs analyst! Not a crystal ball
  • 31. Anscombe’s Quartet. Mean of x = 9; Variance of x = 11; Mean of y = 7.50; Variance of y = 4.122; Correlation between x and y = 0.816; Linear regression line y = 3.00 + 0.500x
  • 32. Analytic Types: Types of Analysis (columns: Descriptive Title; Quantitative Sophistication/Numeracy; Sample Roles)
      • Type I: Quantitative R&D; PhD or equivalent; Creation of theory, development of algorithms. Academic/research. Work in business/government for very specialized roles
      • Type II: Data Scientist or Quantitative Analyst; Advanced Math/Stat, not necessarily PhD; Internal expert in statistical and mathematical modelling and development, with solid business domain knowledge
      • Type III: Operational Analytics; Good business domain, background in statistics optional; Running and managing analytical models. Strong skills in and/or project management of analytical systems implementation
      • Type IV: Business Intelligence/Discovery; Data and numbers oriented, but no special advanced statistical skills; Reporting, dashboard, OLAP and visualization, some design, posterior analysis of results from quantitative methods. Spreadsheets, “business discovery tools”
  • 33. Questions: How would you describe the difference between a data scientist and a business scientist? What tools are needed to support a business analyst? What’s the career path for a business analyst? Is big data suffering from hype?
  • 34. Questions: Why do you think people are afraid of math? Should universities prepare people for business science or should industry?
  • 35. Twitter Tag: #briefr The Briefing Room
  • 36. Upcoming Topics (www.insideanalysis.com). September: ANALYTICS; October: DATA PROCESSING; November: DATA DISCOVERY & VISUALIZATION
  • 37. Thank You for Your Attention
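
The summary statistics quoted on the Anscombe's Quartet slide (slide 31) can be checked with a short sketch. The slide transcript does not include the data points, so the values below are an assumption: they are the four datasets as commonly published from Anscombe's 1973 paper. The function name `summarize` and the dictionary layout are illustrative only. The sketch uses sample (n - 1) variance and ordinary least squares, and each dataset should come out close to the figures on the slide even though the four scatterplots look nothing alike.

```python
from math import sqrt

# Assumed data: Anscombe's quartet as commonly published (not taken from the slides).
X123 = [10, 8, 13, 9, 11, 14, 6, 4, 12, 7, 5]
X4 = [8, 8, 8, 8, 8, 8, 8, 19, 8, 8, 8]
QUARTET = {
    "I": (X123, [8.04, 6.95, 7.58, 8.81, 8.33, 9.96, 7.24, 4.26, 10.84, 4.82, 5.68]),
    "II": (X123, [9.14, 8.14, 8.74, 8.77, 9.26, 8.10, 6.13, 3.10, 9.13, 7.26, 4.74]),
    "III": (X123, [7.46, 6.77, 12.74, 7.11, 7.81, 8.84, 6.08, 5.39, 8.15, 6.42, 5.73]),
    "IV": (X4, [6.58, 5.76, 7.71, 8.84, 8.47, 7.04, 5.25, 12.50, 5.56, 7.91, 6.89]),
}


def summarize(x, y):
    """Return mean, sample variance, Pearson r and the least-squares line for x, y."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Sums of squared deviations and cross-deviations from the means.
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    slope = sxy / sxx
    return {
        "mean_x": mean_x,
        "var_x": sxx / (n - 1),   # sample variance
        "mean_y": mean_y,
        "var_y": syy / (n - 1),
        "r": sxy / sqrt(sxx * syy),
        "slope": slope,
        "intercept": mean_y - slope * mean_x,
    }


if __name__ == "__main__":
    for name, (x, y) in QUARTET.items():
        s = summarize(x, y)
        print(
            f"{name:>3}: mean_x={s['mean_x']:.2f} var_x={s['var_x']:.2f} "
            f"mean_y={s['mean_y']:.2f} var_y={s['var_y']:.3f} "
            f"r={s['r']:.3f} line: y = {s['intercept']:.2f} + {s['slope']:.3f}x"
        )
```

Plotting the four datasets is the natural next step, since the point of the slide is that identical summaries can hide very different shapes; plotting is left out here to keep the sketch dependency-free.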