Showcasing Data Science Lab functionality
Welcome from Kognitio
www.kognitio.com
Today’s Web Seminar - Presenters
Host
Michael Hiskey
Vice President, Marketing & Business Development
Format & Agenda
Keynote Presenters
Dr. Sharon Kirkham
Data Scientist, Kognitio Analytics Center of Excellence
• Big Data and Complexity – the need for Data Scientists
Question Break #1
• Data Manipulation – functional demonstration
Question Break #2
• Product forecasting with parallel R – practical demonstration
Question Break #3
Kognitio
Kognitio is focused on providing the premier high-performance analytical platform to power business insight around the world
• Kognitio invented the in-memory analytical platform, first taking it to market in 1989
• Privately held
• Labs in the UK - HQ in New York, NY
The Data Science Lab
Data Scientists & Staff
Mathematical Algorithms
MPP Computing
BIG DATA
What do business users want to do?
Find patterns
Track lifetime journeys
Predict behavior
Forecast scenarios
Allocate scarce resources
Model value
Characterize groups
Visualize discovery
Respond, trigger, manage, promote
I’m a data scientist! Are you?
Entry level skills and development - aspiration
Machine Learning
Graduates
I’m a data scientist! Are you?
Business Expertise
Machine Learning
Interpretation skills
= Insight
Graduates
Need guidance
Data Scientist
Supporting the data scientist
Typical process – traditionally…
Database
Supporting the data scientist
Typical process – direct data preparation
Database
SQL processing
Supporting the data scientist
Typical process – produces analytical data set
Database
SQL processing
Data Set
Supporting the data scientist
Typical process – run analytics from server
Database
SQL processing
Data Set
???
Supporting the data scientist
Typical process – data samples often used
Database
SQL processing
Data Set
???
Data Samples
Process run iteratively = slow
Supporting the data scientist
Typical process – modelling process is honed
Database
SQL processing
Data Set
???
Data Samples
Process run iteratively = slow
Supporting the data scientist
Typical process – model is complete
Database
Data Set
???

Supporting the data scientist
Typical process – score full data (Ouch!)
Database
Data Set
???
Full data to score
Supporting the data scientist
Push processes to DB – still produce analytical data set
Analytical Platform
SQL processing
Data Set
Supporting the data scientist
Push processes to DB – translate specific processes
Analytical Platform
SQL processing
Data Set
???
Translation
Supporting the data scientist
Push processes to DB – results passed back
Analytical Platform
SQL processing
Data Set
???
Translation
Result Data Set
Supporting the data scientist
Push processes to DB – modelling process is honed
Analytical Platform
SQL processing
Data Set
???
Translation
Result Data Set
Supporting the data scientist
Push processes to DB – model scoring done in DB
Analytical Platform
SQL processing
Data Set
???

Result Data Set
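The "translate specific processes" and "model scoring done in DB" steps can be illustrated with a minimal sketch. It is not Kognitio's mechanism: it assumes a fitted linear model, uses Python's built-in sqlite3 module as a stand-in for the analytical platform, and the table name `customers`, the column names and the coefficients are all hypothetical.

```python
import sqlite3

def linear_model_to_sql(intercept, coefficients, table, key):
    """Translate a fitted linear model into a SQL scoring query.

    coefficients maps column name -> weight. All names are hypothetical,
    and identifiers are assumed to be trusted (never raw user input).
    """
    terms = [f"({weight!r} * {column})" for column, weight in coefficients.items()]
    expression = " + ".join([repr(intercept)] + terms)
    return f"SELECT {key}, {expression} AS score FROM {table} ORDER BY {key}"

# Stand-in for the analytical platform: an in-memory SQLite database.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE customers (id INTEGER, visits REAL, spend REAL)")
conn.executemany(
    "INSERT INTO customers VALUES (?, ?, ?)",
    [(1, 2.0, 10.0), (2, 0.0, 4.0), (3, 5.0, 0.0)],
)

# Hypothetical coefficients, as if fitted earlier on a sample.
query = linear_model_to_sql(1.0, {"visits": 0.5, "spend": 0.25}, "customers", "id")

# Scoring runs inside the database; only the result set comes back.
scores = conn.execute(query).fetchall()
```

The point of the design is that the full data never leaves the database: only the generated query goes in and only the scored result set comes out.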
Supporting the data scientist
But we always want more! Complex data structure
Analytical Platform
Data Set
???

Result Data Set
SQL cannot handle data complexity.
How do I integrate into my model?
Supporting the data scientist
But we always want more! non-standard processes
Database
SQL processing
Data Set
???
Data Samples
Back where we started
Supporting the data scientist
Bring Analytics to data – still produce analytical data set
SQL processing
SQL processing
Supporting the data scientist
Bring Analytics to data – can use other code for data prep
SQL processing
Kognitio scripting
Code executed using MPP
Data held in memory. Fast access to CPUs
Supporting the data scientist
Bring Analytics to data – run analytics natively in Kognitio
SQL processing
Kognitio scripting
Code executed using MPP
Data held in memory. Fast access to CPUs
One platform: flexible working from data prep through analytical process
New! Kognitio version 8:
Enabling and extending the Analytical Platform
External Tables
External Functions
Not Only SQL
Hadoop Connector
Other Connectors
Kognitio Storage as an External table
General Availability: June 2013
External Scripting – Data Transformation
Examples where SQL is typically complex and extensive (e.g. using Perl):
• Assembly: converting structured data into XML format, e.g. furnishing personalised content
• Disassembly: converting XML into structured data
• Parsing: extracting complex information from URLs; pulling words from large text fields, e.g. for sentiment analysis
• Transposition: converting row-based information into columns for data mining, e.g. supporting classification or segmentation
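The four transformation types named on this slide can be sketched in a few lines of script code. The slide suggests Perl; the sketch below swaps in Python's standard library instead, and the record fields, the URL and the function names are all hypothetical illustrations rather than anything shown in the webinar.

```python
import re
import xml.etree.ElementTree as ET
from urllib.parse import urlparse, parse_qs

def assemble(record):
    """Assembly: turn one structured row (a dict) into an XML string."""
    root = ET.Element("customer")
    for field, value in record.items():
        ET.SubElement(root, field).text = str(value)
    return ET.tostring(root, encoding="unicode")

def disassemble(xml_text):
    """Disassembly: turn the XML back into a flat dict of strings."""
    return {child.tag: child.text for child in ET.fromstring(xml_text)}

def parse_url(url):
    """Parsing: pull the host, path and query parameters out of a URL."""
    parts = urlparse(url)
    params = {name: values[0] for name, values in parse_qs(parts.query).items()}
    return {"host": parts.netloc, "path": parts.path, "params": params}

def words(text):
    """Parsing: pull lower-cased words out of a free-text field."""
    return re.findall(r"[a-z']+", text.lower())

def transpose(rows):
    """Transposition: pivot (key, attribute, value) rows into one dict per key."""
    wide = {}
    for key, attribute, value in rows:
        wide.setdefault(key, {})[attribute] = value
    return wide

xml_text = assemble({"id": 7, "name": "Ann"})
record = disassemble(xml_text)
url_info = parse_url("http://example.com/shop/item?sku=A12&colour=red")
tokens = words("Great product, great price!")
wide = transpose([(1, "age", 34), (1, "region", "UK"), (2, "age", 51)])
```

Each function is a few lines of procedural code, whereas the equivalent in plain SQL typically needs long chains of string functions or self-joins.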
Data Manipulation
Small Demo
Product Forecasting – with parallel R
Forecasting Requirements
Forecast Inputs
R running in an MPP environment
Persistence Layer
Analytical Platform Layer
R running in an MPP environment
Persistence Layer
Analytical Platform Layer
Kognitio platform specification: 16 servers, 462GB Kognitio RAM, 128 cores
This is old kit
2.9 billion rows of EPOS
184-day time series for 12K products
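Collating transaction rows into one fixed-length daily series per product can be sketched as below. This is an illustration only: the row layout `(product, date, quantity)`, the start date and the sample data are assumptions, and the webinar itself performed this step on the platform rather than in Python.

```python
from collections import defaultdict
from datetime import date, timedelta

def collate_time_series(epos_rows, start, days):
    """Sum quantities per product per day; days with no sales stay at 0."""
    series = defaultdict(lambda: [0] * days)
    for product, sold_on, quantity in epos_rows:
        offset = (sold_on - start).days
        if 0 <= offset < days:  # ignore rows outside the window
            series[product][offset] += quantity
    return dict(series)

start = date(2013, 1, 1)  # hypothetical start of the window
rows = [
    ("apples", start, 3),
    ("apples", start, 2),
    ("apples", start + timedelta(days=2), 1),
    ("pears", start + timedelta(days=1), 4),
    ("pears", start + timedelta(days=9), 7),  # outside the 3-day window
]
series = collate_time_series(rows, start, days=3)
```

With `days=184` the same function would yield the 184-point series per product described on the slide.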
R running in an MPP environment
Persistence Layer
Analytical Platform Layer
R running in an MPP environment
Persistence Layer
Analytical Platform Layer
1 output table in RAM
128 parallel instances of R
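The pattern of many script instances each forecasting its own share of products, with results merged into one output table, can be sketched as follows. The webinar used R inside the platform; this sketch swaps in Python, and the round-robin partitioning, the function names and the moving-average "model" are placeholders chosen for illustration, not the method used in the demo.

```python
def partition(products, n_workers):
    """Assign products to workers round-robin, one bucket per worker."""
    buckets = [[] for _ in range(n_workers)]
    for index, product in enumerate(sorted(products)):
        buckets[index % n_workers].append(product)
    return buckets

def naive_forecast(history, window=7):
    """Placeholder model: mean of the last `window` observations."""
    recent = history[-window:]
    return sum(recent) / len(recent)

def forecast_bucket(bucket):
    """Work done by one worker: forecast every series it was given."""
    return [(product, naive_forecast(history)) for product, history in bucket]

def forecast_all(series, n_workers, mapper=map):
    """Fan the buckets out through `mapper`, then merge into one output table."""
    buckets = [
        [(product, series[product]) for product in bucket]
        for bucket in partition(series, n_workers)
    ]
    output = {}
    for rows in mapper(forecast_bucket, buckets):
        output.update(rows)
    return output

series = {"apples": [5, 0, 1, 6], "pears": [0, 4, 0, 4], "plums": [2, 2, 2, 2]}
forecasts = forecast_all(series, n_workers=2)  # sequential by default
```

As written the buckets run one after another. Passing the `map` method of a `concurrent.futures.ProcessPoolExecutor` as `mapper` would run them in separate processes, which is the closer analogue of one instance per core.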
R running in an MPP environment
Persistence Layer
Analytical Platform Layer
Application & Client Layer
Excel
All BI Tools
R running in an MPP environment
Persistence Layer
Analytical Platform Layer
Application & Client Layer
Excel
All BI Tools
13 views of different analytical output
R running in an MPP environment
Persistence Layer
Analytical Platform Layer
Application & Client Layer
Excel
All BI Tools
Result set contained # rows
12K forecasts and stats calculated in # seconds
2.9B EPOS items collated into time series in # seconds
Product Forecasting using parallel R Demo
Thank you for your participation today
• More information on today’s topic can be found at: 
• kognitio.com/mpp_r
• kognitio.com/product‐forecasting
• FREE TO USE – perpetual license
– www.kognitio.com/free
– Contact us for the pre‐release version 8
• Analyst White Papers
– EMA Comparative Analysis 
– In‐memory database platforms
– www.kognitio.com/emacompinmem
• Today’s slides (and more): www.slideshare.net/Kognitio
connect
www.kognitio.com
twitter.com/kognitio
linkedin.com/companies/kognitio
tinyurl.com/kognitio
youtube.com/kognitio
NA: +1 855 KOGNITIO
EMEA: +44 1344 300 770

Product forecasting webinar 20130417

  • 1. Showcasing Data Science Lab functionality Welcome from Kognitio www.kognitio.com
  • 2. Today’s Web Seminar - Presenters Host Michael Hiskey Vice President Marketing & Business Development Format & Agenda Keynote Presenters Dr. Sharon Kirkham Data Scientist Kognitio Analytics Center of Excellence • Big Data and Complexity – the need for Data Scientists Question Break #1 • Data Manipulation – functional demonstration Question Break #2 • Product forecasting with parallel R - practical demonstration Question Break #3
  • 4. The Data Science Lab Data Scientists & Staff Mathematic Algorithms MPP Computing BIG DATA 11
  • 5. What do business users want to do? Find patterns Track life time journeys Predict behavior Forecast scenarios Allocate scarce resources Model value Characterize groups Visualize discovery Respond, trigger, manage, promote
  • 6. I’m a data scientist! Are you? Entry level skills and development - aspiration Machine Learning Graduates
  • 7. I’m a data scientist! Are you? Business Expertise Machine Learning Interpretation skills = Insight Graduates Need guidance Data Scientist
  • 8. Supporting the data scientist Typical process – traditionally… Database
  • 9. Supporting the data scientist Typical process – direct data preparation Database SQL processing
  • 10. Supporting the data scientist Typical process – produces analytical data set Database SQL processing Data Set
  • 11. Supporting the data scientist Typical process – run analytics from server Database SQL processing Data Set ???
  • 12. Supporting the data scientist Typical process – data samples often used Database SQL processing Data Set ??? Data Samples Process run iteratively = slow
  • 13. Supporting the data scientist Typical process – modelling process is honed Database SQL processing Data Set ??? Data Samples Process run iteratively = slow
  • 14. Supporting the data scientist Typical process – model is complete Database Data Set ??? 
  • 15. Supporting the data scientist Typical process – score full data (Ouch!) Database Data Set ??? Full data to score
  • 16. Supporting the data scientist Push processes to DB – still produce analytical data set Analytical Platform SQL processing Data Set
  • 17. Supporting the data scientist Push processes to DB – translate specific processes Analytical Platform SQL processing Data Set ??? Translation
  • 18. Supporting the data scientist Push processes to DB – results passed back Analytical Platform SQL processing Data Set ??? Translation Result Data Set
  • 19. Supporting the data scientist Push processes to DB – modelling process is honed Analytical Platform SQL processing Data Set ??? Translation Result Data Set
  • 20. Supporting the data scientist Push processes to DB – model scoring done in DB Analytical Platform SQL processing Data Set ??? Result Data Set
  • 21. Supporting the data scientist But we always want more! Complex data structure Analytical Platform Data Set ??? Result Data Set SQL cannot handle data complexity. How do I integrate into my model?
  • 22. Supporting the data scientist But we always want more! Non-standard processes Database SQL processing Data Set ??? Data Samples Back where we started
  • 23. Supporting the data scientist Bring Analytics to data – still produce analytical data set SQL processing SQL processing
  • 24. Supporting the data scientist Bring Analytics to data – can use other code for data prep SQL processing Kognitio scripting Code executed Using MPP Data held in Memory. Fast access to CPUs
  • 25. Supporting the data scientist Bring Analytics to data – run analytics natively in Kognitio SQL processing Kognitio scripting Code executed Using MPP Data held in Memory. Fast access to CPUs One platform flexible working from data prep through analytical process
  • 26. New! Kognitio version 8: Enabling and extending the Analytical Platform External Tables External Functions Not Only SQL Hadoop Connector Other Connectors Kognitio Storage as an External table General Availability: June 2013
  • 27. External Scripting – Data Transformation: examples where SQL is typically complex and extensive (e.g. using Perl). Assembly: converting structured data into XML format, e.g. furnishing personalised content. Disassembly: converting XML into structured data. Parsing: extracting complex information from URLs; pulling words from large text fields, e.g. for sentiment analysis. Transposition: converting row-based information into columns for data mining, e.g. supporting classification or segmentation.
  • 29. Product Forecasting – with parallel R Forecasting Requirements Forecast Inputs
  • 30. R running in an MPP environment Persistence Layer Analytical Platform Layer
  • 31. R running in an MPP environment Persistence Layer Analytical Platform Layer Kognitio platform specification: 16 servers, 462GB Kognitio RAM, 128 cores (this is old kit); 2.9 billion rows of EPOS; 184-day time series for 12K products
  • 32. R running in an MPP environment Persistence Layer Analytical Platform Layer
  • 33. R running in an MPP environment Persistence Layer Analytical Platform Layer 1 output table in RAM 128 parallel instances of R
  • 34. R running in an MPP environment Persistence Layer Analytical Platform Layer Application & Client Layer Excel, All BI Tools
  • 35. R running in an MPP environment Persistence Layer Analytical Platform Layer Application & Client Layer Excel, All BI Tools 13 views of different analytical output
  • 36. R running in an MPP environment Persistence Layer Analytical Platform Layer Application & Client Layer Excel, All BI Tools Result set contained # rows; 12K forecasts and stats calculated in # seconds; 2.9B EPOS items collated into time series in # seconds
  • 38. Thank you for your participation today • More information on today’s topic can be found at: • kognitio.com/mpp_r • kognitio.com/product-forecasting • FREE TO USE – perpetual license – www.kognitio.com/free – Contact us for the pre-release version 8 • Analyst White Papers – EMA Comparative Analysis – In-memory database platforms – www.kognitio.com/emacompinmem • Today’s slides (and more): www.slideshare.net/Kognitio
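
The pattern behind slides 31 to 36 (EPOS rows collated into one time series per product, then each series forecast independently by a parallel worker) can be illustrated with a small sketch. This is not the Kognitio or R code from the webinar: it is written in Python, the function names (`collate_series`, `forecast_ses`, `forecast_all`) and the `(product_id, day, units)` row layout are hypothetical, and simple exponential smoothing stands in for whatever forecasting model the demonstration actually used. A thread pool stands in for the 128 parallel R instances; it shows the partition-by-product structure rather than real multi-core speed-up.

```python
from collections import defaultdict
from concurrent.futures import ThreadPoolExecutor


def collate_series(rows):
    """Group (product_id, day, units) rows into one day-ordered series per product."""
    by_product = defaultdict(dict)
    for product_id, day, units in rows:
        # Several sales rows may share a product and day, so sum them.
        by_product[product_id][day] = by_product[product_id].get(day, 0) + units
    return {p: [days[d] for d in sorted(days)] for p, days in by_product.items()}


def forecast_ses(series, alpha=0.5):
    """One-step-ahead forecast by simple exponential smoothing (a stand-in model)."""
    level = series[0]
    for value in series[1:]:
        level = alpha * value + (1 - alpha) * level
    return level


def forecast_all(rows, workers=4, alpha=0.5):
    """Forecast every product independently; each series is one unit of parallel work."""
    series_by_product = collate_series(rows)
    products = sorted(series_by_product)
    with ThreadPoolExecutor(max_workers=workers) as pool:
        forecasts = pool.map(
            lambda p: forecast_ses(series_by_product[p], alpha), products
        )
        return dict(zip(products, forecasts))


if __name__ == "__main__":
    # Hypothetical sample rows: (product_id, day, units sold).
    sample = [("A", 1, 10), ("A", 2, 20), ("B", 1, 5), ("A", 1, 2), ("B", 2, 5)]
    print(forecast_all(sample))
```

Because no product's forecast depends on another product's data, the work splits cleanly by product, which is what lets the same script run as many independent instances on an MPP platform.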