SlideShare a Scribd company logo
Turn Data into Actionable Insights!!
About me:
Vishnu Alavur Kannan
Analytics Technical Platforms Lead
• 15+ years in IT, software engineer @heart
!
• Lead engineering teams through out my career
!
• Platform is just vaporware without passionate people
!
• A players make all the difference in software engineering
✓ 50:1, 100 :1, rarely on any other profession
@Monsanto for two reasons:
!
• I strongly believe in our commitment to sustainable agriculture
!
• I am able to do top-flight Engineering R&D
✓ Complex engineering challenges keeps me going
✓ Freedom to operate:
o Use the right tool for the right job
o Solving problems using cutting-edge technologies
o Open-source friendly, Open environment to contribute back
• Bringing a broad range of solutions to help
nourish our growing world
• Collaborating to help tackle some of the
world’s biggest challenges
• >20,000 employees in 66 countries
• >50% employees based outside of 

the United States
• One of the 25 World’s Best Multinational
Workplaces by Great Place to Work Institute
Monsanto: 



A Sustainable Agriculture Company
Our systems approach integrates technology platforms to
maximize farmer effectiveness
Crop Protection
• Weed Control (Roundup ® 

Branded Agricultural Herbicides)
• Insect Control
• Disease Control
Breeding
• Stress Tolerance
• Disease Control
• Yield
• Vegetables, corn, cotton, soybeans, wheat, canolaBiologicals
• Weed Control
• Insect Control
• Virus Control
• Plant Health
Biotechnology
• Weed Control
• Insect Control
• Stress Tolerance
• Yield / Yield Protection

• NutrientsData Science
• Planting Script Creator
• Increased production
• Efficient land and water use
• Efficient nutrient use
https://www.youtube.com/watch?v=l5Tw0PGcyN0
Why do you do what you do?!
What’s the purpose?
How do you do what you do?
What the hell do you do?
THE GOLDEN CIRCLE
Simon Sinek
Identify the signals from the noise @SCALE
Volume
DATA AT SCALE
Variety
VARIOUS FORMS OF DATA
Velocity
STREAMING IOT
Veracity
DATA UNCERTAINITY
DIGITALMEDIA:280Exabytes
FB:300+Petabytesperday
* Information from multiple sources are adapted and incorporated
POINTS,POLYGONS,
RASTERS,VECTORS
CONNECTINGDATAACROSSSOURCES,
ISWHEREANALYSTSSPENDMOSTOFTHEIRTIME
SENSORSarede-factoto
gatherdata anddetect
anomaliesacrossdomains
Monsanto re-inventing Agriculture through Analytics
Other providers:
Cost
Qualit
y
Agility
• No hardware administration, less software
administration
• Eleven 9’s of data durability
• Harness state-of-the-art software services
!
• DevOps moving towards NoOps
• Provision Infrastructure in seconds:
infrastructure as code - automation
• Grow or shrink compute to match seasonal
workloads and pay smartly as we go
Scale: MON has ~1016+ bytes of data and growing rapidly
• Global Presence: Taking data driven
products & services closer to business
!
• Ability to accelerate feature
development, integrating analytics rapidly
into our workflows @scale
!
• Ingest, store & retrieve massive data sets,
by using the right data store to our
competitive advantage (NoSQL/SQL)
!
• Service diversity, Organizational maturity
IOT, Imagery, Geo-spatial, Genomics, Molecular Breeding…..
Vision
A year ago as we started…
Integrated
Extended
Enhanced
Scalable
Enable Analytics @SCALE for the Enterprise
Reliable
FieldDevices
Apps
Apps
Devices
DevicesApps
DevicesApps
Data
M
odels
M
odels
M
odels
M
odels
Business
Unit-
1
Business
Unit-
2
Business
Unit3
D
igital
Business
Open
Integrate Analytics with Product Platforms
Data Data Science@scale Analytical Models
Turn Data Into Actionable Insights
….
….
APIs
Data
Predictive Product Placement @scale
PFO
PFO
Topography
Site boundary
Zones
Experiment metadata
Planter A/B line
Automap
Elevation
Soil
Weather
Topography
Zones
Location Data Assets
Geo-spatial Catalog
Analytics as a Service
In Collaboration with IT & Business
Scale across teams internalizing a self-service model
Internalize the needs to stay ahead of the curve
Addressing analytics needs based on persona
!
Descriptive
What happened?
!
Diagnostic
Why did it happen?
!
Predictive
What will happen?
!
Prescriptive
What should I do?
!
Cognitive
What can be learnt?
Hindsight Insight Foresight
10’s K of users 1’s K 100’s
Science@Scale
Information Pro-Consumers
Information Consumers
Data ScientistsBusiness Users
Business Analysts Statisticians
Business Intelligence
Ad-hoc Analysis Statistical Analysis
!Data DiscoveryReports
Dashboards
Drill Down Machine Learning
Inferential
CausalExploratory
Machine
Power Users
10’s
Computational Biologists
Neural Networks
Outsight
Systems
Natural Language Processing
Discovery Analytics – Development Environments
Non-prime
Exploratory
Prime
R & D
Development Environments @SCALE
• Big-data Infra. & DevOps
• Data Provisioning @scale
• Model Deployments @scale
• Big-data workloads
• Computational pipelines
• Transformation pipelines
• Training pipelines
• Sizing & Auto-scaling
• Cloud Best practices
• 24/7 availability
• Monitoring
• Alerting
• ELK stack
• ….
Analytical models
@SCALE
• Co-engineering
• Involve us sooner
• Thinking scale ahead
accelerating Time to Market
• Model development &
refactoring
• R, Asreml, Python, OPL…
• Java, Scala, Clojure…
• Infrastructure as code
• AWS, GCP, AzureML
• Docker, Kubernetes
• Distributed computing
• Architecture
• Solutions Design
• Development
!• API integrations
• KAFKA integrations
• OAUTH2 Integrations
• Security/ISO collaborations
Build it once, deploy frameworks as needed for user groups: Bundled in a centralized eco-system
Non-prod to Prod
BLUE / GREEN
Discovery Analytics Development Environments
Data Scientists, Developers and Novice Users
From Discovery to Production
Culture, approach and adoption
Know
Your
Users
For Community
By Community
!
Tailor by Needs
Balance Freedom
with Governance
!
!
!
Drive
User
Adoption
Environments
iteratively served to
everyone @monsanto
Enable analytical capabilities @scale for the enterprise integrated with
Product Platforms
As of today, # of unique data scientists across groups utilizing our discovery analytics environments
Model maturity Global Scalability
Core teams : Train the trainee to share knowledge and best practices utilizing the environment
Business Capabilities
Make the platform robust, sharing a few use cases
Environmental Classification @scale
Engineered using Discovery Analytics - Development Environment
Data

Provisioning
APIs
Data Transformation QA/QC

Rules
Scala

Python

Scikit
API
API
!
• Collaborations with Data Science Teams: Co-engineering R based machine learning model to a
Scala based model training pipeline for scalability
!
• EMR (Amazon Hadoop) & DataProc (GCP) using Apache Spark Computation Engine @scale
• Iterative ON-DEMAND framework, auto-scaling up-to N number of nodes
!
• Training pipeline integration with APIs & co-engineering continuum
Molecular Breeding: Training Pipeline @Scale
Engineered using Discovery Analytics - Development Environment
Data
DATA LEARNER MODEL 1
Cognitive Analytics Pipeline
!
• Collaborations with Cognitive Analytics Data science
team to build:
• An integrated Predictive Product Pipeline from
inception to commercialization
!
Built on:
!
• Apache Airflow (incubating): DAG based model
chaining & workflow management platform
• Models written in Python, R
• Parallelism achieved via Celery workers
• Being customized now to utilize Spark
!
• Apache Parquet - Columnar Storage Format on a file
system; extremely parallelizable
!
• Facebook Presto query engine to query parquet’s via
SQLs through REST APIs – highly performant
!
• Cloud Analytics platform integration
• Co-engineering solutions @scale mining millions
of data points to derive actionable insights
Workflow
DAGs
Libraries
Engineered using Discovery Analytics - Development Environment
Deep learning @SCALE
Discovery Analytics Development Environments integrated with CloudML on GCP
Collect
Store Train
Predict
Evaluate
Training
Pipeline
Retrain
• First Ever Deep Learning platform for the
Enterprise
!
• Perform Deep Learning @scale on CloudML using
TensorFlow via Jupyter from Prime environment
!
• Integrated with data, Inputs, Outputs and
Metadata including Tensor Board to monitor your
model training runs
Discovery Analytics - Workflow
Production Deployment - Workflow
DATA INGESTION AND TRANSFORMATION VIA API’s AND STREAMS
Streaming
Business Intelligence
RUN ANALYTICS@SCALE IN THE CLOUD
Collaborative Data Science - DISCOVERY ANALYTICS
DATA DRIVEN PRODUCTS
KAFKA Streams Data Warehouse*Big-data
Model outputs via APIs & Streams
In-house/Third Party: Platforms
AWS, GCP, Cloudera, DataStax, IBM, Azure, Domino labs…
Prescriptive PredictiveCognitive Historical
Models - Deep Learning, Computational Pipelines, Classification & Simulation Engines
Turn Data into Actionable Insights
Our Journey of Transformation
We have just scratched our surface:
!
• Science@scale – Our Cloud Analytics Platform is only a year old
!
• Talent, Behavior and Platform as our 3 key pillars of focus
!
• Talent:
• Building big-data and cloud analytics engineering team
from the ground up – 150+ interviews, 15 people team now
• Targeting A players, nurture the team on new technologies, build leaders
!
• Behavior/Cultural Mind shift: Data Science & IT Engineering operating as ONE TEAM
• Two extreme spectrums
• Finding the sweet spot in the middle has been the cultural shift
• Data science teams have been very supportive, adapting to change
• Bringing in IT best practices: Agile methodologies, versioning, CI….
• Train the trainee approach to enable adoption across the enterprise
• Leverage the best of both worlds by co-engineering solutions
• Collaboration is our new competitive advantage
!
• Platform: We are at ground zero now, continuing to deliver Minimum Viable Products each sprint
• Continue to mature & stay cutting edge on technologies
• Build vs. Buy [Cost, Time, Quality]
• Miles to go before we sleep
https://www.youtube.com/watch?v=l5Tw0PGcyN0
Why do you do what you do?!
What’s the purpose?
How do you do what you do?
What the hell do you do?
THE GOLDEN CIRCLE
Simon Sinek
• Help identify the signals from the noise @scale
An Enterprise Cloud Analytics platform to serve:
• Analytics as a service enabling Discovery Analytics
environments for the data science community
• Predictive, prescriptive, streaming,
cognitive, IOT edge analytical capabilities @scale
• Big Data Cloud Analytics Engineering
• Internalize data science needs thinking scale ahead
Thank You 

Visit us at engineering.monsanto.com



We are looking for passionate big data
cloud analytics engineers to join our team.



https://www.linkedin.com/in/vishnukannan

More Related Content

What's hot

Implementing and running a secure datalake from the trenches
Implementing and running a secure datalake from the trenches Implementing and running a secure datalake from the trenches
Implementing and running a secure datalake from the trenches
DataWorks Summit
 
Big Data & Data Lakes Building Blocks
Big Data & Data Lakes Building BlocksBig Data & Data Lakes Building Blocks
Big Data & Data Lakes Building Blocks
Amazon Web Services
 
Solving Big Data Problems using Hortonworks
Solving Big Data Problems using Hortonworks Solving Big Data Problems using Hortonworks
Solving Big Data Problems using Hortonworks
DataWorks Summit/Hadoop Summit
 
Big Data Architecture and Deployment
Big Data Architecture and DeploymentBig Data Architecture and Deployment
Big Data Architecture and Deployment
Cisco Canada
 
Top Trends in Building Data Lakes for Machine Learning and AI
Top Trends in Building Data Lakes for Machine Learning and AI Top Trends in Building Data Lakes for Machine Learning and AI
Top Trends in Building Data Lakes for Machine Learning and AI
Holden Ackerman
 
Hadoop Powers Modern Enterprise Data Architectures
Hadoop Powers Modern Enterprise Data ArchitecturesHadoop Powers Modern Enterprise Data Architectures
Hadoop Powers Modern Enterprise Data ArchitecturesDataWorks Summit
 
Big Data on azure
Big Data on azureBig Data on azure
Big Data on azure
David Giard
 
Introduction to Azure HDInsight
Introduction to Azure HDInsightIntroduction to Azure HDInsight
Introduction to Azure HDInsight
Stéphane Fréchette
 
Unlock the value in your big data reservoir using oracle big data discovery a...
Unlock the value in your big data reservoir using oracle big data discovery a...Unlock the value in your big data reservoir using oracle big data discovery a...
Unlock the value in your big data reservoir using oracle big data discovery a...
Mark Rittman
 
High Performance Spatial-Temporal Trajectory Analysis with Spark
High Performance Spatial-Temporal Trajectory Analysis with Spark High Performance Spatial-Temporal Trajectory Analysis with Spark
High Performance Spatial-Temporal Trajectory Analysis with Spark
DataWorks Summit/Hadoop Summit
 
Hadoop vs. RDBMS for Advanced Analytics
Hadoop vs. RDBMS for Advanced AnalyticsHadoop vs. RDBMS for Advanced Analytics
Hadoop vs. RDBMS for Advanced Analyticsjoshwills
 
Hadoop for the Masses
Hadoop for the MassesHadoop for the Masses
Hadoop for the Masses
DataWorks Summit/Hadoop Summit
 
Solving Performance Problems on Hadoop
Solving Performance Problems on HadoopSolving Performance Problems on Hadoop
Solving Performance Problems on Hadoop
Tyler Mitchell
 
Hadoop Data Lake vs classical Data Warehouse: How to utilize best of both wor...
Hadoop Data Lake vs classical Data Warehouse: How to utilize best of both wor...Hadoop Data Lake vs classical Data Warehouse: How to utilize best of both wor...
Hadoop Data Lake vs classical Data Warehouse: How to utilize best of both wor...
Kolja Manuel Rödel
 
Moving to a data-centric architecture: Toronto Data Unconference 2015
Moving to a data-centric architecture: Toronto Data Unconference 2015Moving to a data-centric architecture: Toronto Data Unconference 2015
Moving to a data-centric architecture: Toronto Data Unconference 2015
Adam Muise
 
Using Oracle Big Data SQL 3.0 to add Hadoop & NoSQL to your Oracle Data Wareh...
Using Oracle Big Data SQL 3.0 to add Hadoop & NoSQL to your Oracle Data Wareh...Using Oracle Big Data SQL 3.0 to add Hadoop & NoSQL to your Oracle Data Wareh...
Using Oracle Big Data SQL 3.0 to add Hadoop & NoSQL to your Oracle Data Wareh...
Mark Rittman
 
The EDW Ecosystem
The EDW EcosystemThe EDW Ecosystem
Analysis of Major Trends in Big Data Analytics
Analysis of Major Trends in Big Data AnalyticsAnalysis of Major Trends in Big Data Analytics
Analysis of Major Trends in Big Data Analytics
DataWorks Summit/Hadoop Summit
 
Big Data for Managers: From hadoop to streaming and beyond
Big Data for Managers: From hadoop to streaming and beyondBig Data for Managers: From hadoop to streaming and beyond
Big Data for Managers: From hadoop to streaming and beyond
DataWorks Summit/Hadoop Summit
 
Big data on Azure for Architects
Big data on Azure for ArchitectsBig data on Azure for Architects
Big data on Azure for Architects
Tomasz Kopacz
 

What's hot (20)

Implementing and running a secure datalake from the trenches
Implementing and running a secure datalake from the trenches Implementing and running a secure datalake from the trenches
Implementing and running a secure datalake from the trenches
 
Big Data & Data Lakes Building Blocks
Big Data & Data Lakes Building BlocksBig Data & Data Lakes Building Blocks
Big Data & Data Lakes Building Blocks
 
Solving Big Data Problems using Hortonworks
Solving Big Data Problems using Hortonworks Solving Big Data Problems using Hortonworks
Solving Big Data Problems using Hortonworks
 
Big Data Architecture and Deployment
Big Data Architecture and DeploymentBig Data Architecture and Deployment
Big Data Architecture and Deployment
 
Top Trends in Building Data Lakes for Machine Learning and AI
Top Trends in Building Data Lakes for Machine Learning and AI Top Trends in Building Data Lakes for Machine Learning and AI
Top Trends in Building Data Lakes for Machine Learning and AI
 
Hadoop Powers Modern Enterprise Data Architectures
Hadoop Powers Modern Enterprise Data ArchitecturesHadoop Powers Modern Enterprise Data Architectures
Hadoop Powers Modern Enterprise Data Architectures
 
Big Data on azure
Big Data on azureBig Data on azure
Big Data on azure
 
Introduction to Azure HDInsight
Introduction to Azure HDInsightIntroduction to Azure HDInsight
Introduction to Azure HDInsight
 
Unlock the value in your big data reservoir using oracle big data discovery a...
Unlock the value in your big data reservoir using oracle big data discovery a...Unlock the value in your big data reservoir using oracle big data discovery a...
Unlock the value in your big data reservoir using oracle big data discovery a...
 
High Performance Spatial-Temporal Trajectory Analysis with Spark
High Performance Spatial-Temporal Trajectory Analysis with Spark High Performance Spatial-Temporal Trajectory Analysis with Spark
High Performance Spatial-Temporal Trajectory Analysis with Spark
 
Hadoop vs. RDBMS for Advanced Analytics
Hadoop vs. RDBMS for Advanced AnalyticsHadoop vs. RDBMS for Advanced Analytics
Hadoop vs. RDBMS for Advanced Analytics
 
Hadoop for the Masses
Hadoop for the MassesHadoop for the Masses
Hadoop for the Masses
 
Solving Performance Problems on Hadoop
Solving Performance Problems on HadoopSolving Performance Problems on Hadoop
Solving Performance Problems on Hadoop
 
Hadoop Data Lake vs classical Data Warehouse: How to utilize best of both wor...
Hadoop Data Lake vs classical Data Warehouse: How to utilize best of both wor...Hadoop Data Lake vs classical Data Warehouse: How to utilize best of both wor...
Hadoop Data Lake vs classical Data Warehouse: How to utilize best of both wor...
 
Moving to a data-centric architecture: Toronto Data Unconference 2015
Moving to a data-centric architecture: Toronto Data Unconference 2015Moving to a data-centric architecture: Toronto Data Unconference 2015
Moving to a data-centric architecture: Toronto Data Unconference 2015
 
Using Oracle Big Data SQL 3.0 to add Hadoop & NoSQL to your Oracle Data Wareh...
Using Oracle Big Data SQL 3.0 to add Hadoop & NoSQL to your Oracle Data Wareh...Using Oracle Big Data SQL 3.0 to add Hadoop & NoSQL to your Oracle Data Wareh...
Using Oracle Big Data SQL 3.0 to add Hadoop & NoSQL to your Oracle Data Wareh...
 
The EDW Ecosystem
The EDW EcosystemThe EDW Ecosystem
The EDW Ecosystem
 
Analysis of Major Trends in Big Data Analytics
Analysis of Major Trends in Big Data AnalyticsAnalysis of Major Trends in Big Data Analytics
Analysis of Major Trends in Big Data Analytics
 
Big Data for Managers: From hadoop to streaming and beyond
Big Data for Managers: From hadoop to streaming and beyondBig Data for Managers: From hadoop to streaming and beyond
Big Data for Managers: From hadoop to streaming and beyond
 
Big data on Azure for Architects
Big data on Azure for ArchitectsBig data on Azure for Architects
Big data on Azure for Architects
 

Viewers also liked

Fuel cell
Fuel cellFuel cell
Fuel cell
Ahmed M. Elkholy
 
Introduction to Data Modeling in Cassandra
Introduction to Data Modeling in CassandraIntroduction to Data Modeling in Cassandra
Introduction to Data Modeling in Cassandra
Jim Hatcher
 
What is dev ops?
What is dev ops?What is dev ops?
What is dev ops?
Mukta Aphale
 
Elk stack
Elk stackElk stack
Elk stack
datamantra
 
Expect the unexpected: Prepare for failures in microservices
Expect the unexpected: Prepare for failures in microservicesExpect the unexpected: Prepare for failures in microservices
Expect the unexpected: Prepare for failures in microservices
Bhakti Mehta
 
Exponentiële groei v2
Exponentiële groei v2Exponentiële groei v2
Exponentiële groei v2
guest6b41899
 
How Cisco Migrated from MapReduce Jobs to Spark Jobs - StampedeCon 2015
How Cisco Migrated from MapReduce Jobs to Spark Jobs - StampedeCon 2015How Cisco Migrated from MapReduce Jobs to Spark Jobs - StampedeCon 2015
How Cisco Migrated from MapReduce Jobs to Spark Jobs - StampedeCon 2015
StampedeCon
 
Docker in Production, Look No Hands! by Scott Coulton
Docker in Production, Look No Hands! by Scott CoultonDocker in Production, Look No Hands! by Scott Coulton
Docker in Production, Look No Hands! by Scott Coulton
Docker, Inc.
 
(ARC401) Cloud First: New Architecture for New Infrastructure
(ARC401) Cloud First: New Architecture for New Infrastructure(ARC401) Cloud First: New Architecture for New Infrastructure
(ARC401) Cloud First: New Architecture for New Infrastructure
Amazon Web Services
 
150430 regiosessie corv_almelo
150430 regiosessie corv_almelo150430 regiosessie corv_almelo
150430 regiosessie corv_almeloKING
 
IoT and Big Data
IoT and Big DataIoT and Big Data
IoT and Big Data
sabnees
 
Fluentd v1.0 in a nutshell
Fluentd v1.0 in a nutshellFluentd v1.0 in a nutshell
Fluentd v1.0 in a nutshell
N Masahiro
 
Performance Benchmarking of Clouds Evaluating OpenStack
Performance Benchmarking of Clouds                Evaluating OpenStackPerformance Benchmarking of Clouds                Evaluating OpenStack
Performance Benchmarking of Clouds Evaluating OpenStack
Pradeep Kumar
 
IBM Containers- Bluemix
IBM Containers- BluemixIBM Containers- Bluemix
IBM Containers- Bluemix
Virginia Fernandez
 
Cloud adoption patterns April 11 2016
Cloud adoption patterns April 11 2016Cloud adoption patterns April 11 2016
Cloud adoption patterns April 11 2016
Kyle Brown
 
Sprint 49 review
Sprint 49 reviewSprint 49 review
Sprint 49 review
ManageIQ
 
Raleigh DevDay 2017: Deep Dive on AWS Management Tools
Raleigh DevDay 2017: Deep Dive on AWS Management ToolsRaleigh DevDay 2017: Deep Dive on AWS Management Tools
Raleigh DevDay 2017: Deep Dive on AWS Management Tools
Amazon Web Services
 
Get complete visibility into containers based application environment
Get complete visibility into containers based application environmentGet complete visibility into containers based application environment
Get complete visibility into containers based application environment
AppDynamics
 
How to Scale Your Architecture and DevOps Practices for Big Data Applications
How to Scale Your Architecture and DevOps Practices for Big Data ApplicationsHow to Scale Your Architecture and DevOps Practices for Big Data Applications
How to Scale Your Architecture and DevOps Practices for Big Data Applications
Amazon Web Services
 

Viewers also liked (20)

Fuel cell
Fuel cellFuel cell
Fuel cell
 
Introduction to Data Modeling in Cassandra
Introduction to Data Modeling in CassandraIntroduction to Data Modeling in Cassandra
Introduction to Data Modeling in Cassandra
 
What is dev ops?
What is dev ops?What is dev ops?
What is dev ops?
 
Elk stack
Elk stackElk stack
Elk stack
 
Expect the unexpected: Prepare for failures in microservices
Expect the unexpected: Prepare for failures in microservicesExpect the unexpected: Prepare for failures in microservices
Expect the unexpected: Prepare for failures in microservices
 
Exponentiële groei v2
Exponentiële groei v2Exponentiële groei v2
Exponentiële groei v2
 
Analyze, Influence and Engage Your Customer - v1.7
Analyze, Influence and Engage Your Customer - v1.7Analyze, Influence and Engage Your Customer - v1.7
Analyze, Influence and Engage Your Customer - v1.7
 
How Cisco Migrated from MapReduce Jobs to Spark Jobs - StampedeCon 2015
How Cisco Migrated from MapReduce Jobs to Spark Jobs - StampedeCon 2015How Cisco Migrated from MapReduce Jobs to Spark Jobs - StampedeCon 2015
How Cisco Migrated from MapReduce Jobs to Spark Jobs - StampedeCon 2015
 
Docker in Production, Look No Hands! by Scott Coulton
Docker in Production, Look No Hands! by Scott CoultonDocker in Production, Look No Hands! by Scott Coulton
Docker in Production, Look No Hands! by Scott Coulton
 
(ARC401) Cloud First: New Architecture for New Infrastructure
(ARC401) Cloud First: New Architecture for New Infrastructure(ARC401) Cloud First: New Architecture for New Infrastructure
(ARC401) Cloud First: New Architecture for New Infrastructure
 
150430 regiosessie corv_almelo
150430 regiosessie corv_almelo150430 regiosessie corv_almelo
150430 regiosessie corv_almelo
 
IoT and Big Data
IoT and Big DataIoT and Big Data
IoT and Big Data
 
Fluentd v1.0 in a nutshell
Fluentd v1.0 in a nutshellFluentd v1.0 in a nutshell
Fluentd v1.0 in a nutshell
 
Performance Benchmarking of Clouds Evaluating OpenStack
Performance Benchmarking of Clouds                Evaluating OpenStackPerformance Benchmarking of Clouds                Evaluating OpenStack
Performance Benchmarking of Clouds Evaluating OpenStack
 
IBM Containers- Bluemix
IBM Containers- BluemixIBM Containers- Bluemix
IBM Containers- Bluemix
 
Cloud adoption patterns April 11 2016
Cloud adoption patterns April 11 2016Cloud adoption patterns April 11 2016
Cloud adoption patterns April 11 2016
 
Sprint 49 review
Sprint 49 reviewSprint 49 review
Sprint 49 review
 
Raleigh DevDay 2017: Deep Dive on AWS Management Tools
Raleigh DevDay 2017: Deep Dive on AWS Management ToolsRaleigh DevDay 2017: Deep Dive on AWS Management Tools
Raleigh DevDay 2017: Deep Dive on AWS Management Tools
 
Get complete visibility into containers based application environment
Get complete visibility into containers based application environmentGet complete visibility into containers based application environment
Get complete visibility into containers based application environment
 
How to Scale Your Architecture and DevOps Practices for Big Data Applications
How to Scale Your Architecture and DevOps Practices for Big Data ApplicationsHow to Scale Your Architecture and DevOps Practices for Big Data Applications
How to Scale Your Architecture and DevOps Practices for Big Data Applications
 

Similar to Turn Data Into Actionable Insights - StampedeCon 2016

Initiate Edinburgh 2019 - Big Data Meets AI
Initiate Edinburgh 2019 - Big Data Meets AIInitiate Edinburgh 2019 - Big Data Meets AI
Initiate Edinburgh 2019 - Big Data Meets AI
Amazon Web Services
 
Atlanta Data Science Meetup | Qubole slides
Atlanta Data Science Meetup | Qubole slidesAtlanta Data Science Meetup | Qubole slides
Atlanta Data Science Meetup | Qubole slides
Qubole
 
Analytical Innovation: How to Build the Next Generation Data Platform
Analytical Innovation: How to Build the Next Generation Data PlatformAnalytical Innovation: How to Build the Next Generation Data Platform
Analytical Innovation: How to Build the Next Generation Data Platform
VMware Tanzu
 
Big Data & Analytics - Innovating at the Speed of Light
Big Data & Analytics - Innovating at the Speed of LightBig Data & Analytics - Innovating at the Speed of Light
Big Data & Analytics - Innovating at the Speed of Light
Amazon Web Services LATAM
 
Sudhir Rawat, Sr Techonology Evangelist at Microsoft SQL Business Intelligenc...
Sudhir Rawat, Sr Techonology Evangelist at Microsoft SQL Business Intelligenc...Sudhir Rawat, Sr Techonology Evangelist at Microsoft SQL Business Intelligenc...
Sudhir Rawat, Sr Techonology Evangelist at Microsoft SQL Business Intelligenc...
Dataconomy Media
 
A Maturing Role of Workflows in the Presence of Heterogenous Computing Archit...
A Maturing Role of Workflows in the Presence of Heterogenous Computing Archit...A Maturing Role of Workflows in the Presence of Heterogenous Computing Archit...
A Maturing Role of Workflows in the Presence of Heterogenous Computing Archit...
Ilkay Altintas, Ph.D.
 
ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...
ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...
ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...
DATAVERSITY
 
2022 Trends in Enterprise Analytics
2022 Trends in Enterprise Analytics2022 Trends in Enterprise Analytics
2022 Trends in Enterprise Analytics
DATAVERSITY
 
Big Data Open Source Tools and Trends: Enable Real-Time Business Intelligence...
Big Data Open Source Tools and Trends: Enable Real-Time Business Intelligence...Big Data Open Source Tools and Trends: Enable Real-Time Business Intelligence...
Big Data Open Source Tools and Trends: Enable Real-Time Business Intelligence...
Perficient, Inc.
 
DevOps Spain 2019. Olivier Perard-Oracle
DevOps Spain 2019. Olivier Perard-OracleDevOps Spain 2019. Olivier Perard-Oracle
DevOps Spain 2019. Olivier Perard-Oracle
atSistemas
 
AWS Initiate Day Manchester 2019 – AWS Big Data Meets AI
AWS Initiate Day Manchester 2019 – AWS Big Data Meets AIAWS Initiate Day Manchester 2019 – AWS Big Data Meets AI
AWS Initiate Day Manchester 2019 – AWS Big Data Meets AI
Amazon Web Services
 
Ai & Data Analytics 2018 - Azure Databricks for data scientist
Ai & Data Analytics 2018 - Azure Databricks for data scientistAi & Data Analytics 2018 - Azure Databricks for data scientist
Ai & Data Analytics 2018 - Azure Databricks for data scientist
Alberto Diaz Martin
 
Options for Data Prep - A Survey of the Current Market
Options for Data Prep - A Survey of the Current MarketOptions for Data Prep - A Survey of the Current Market
Options for Data Prep - A Survey of the Current Market
Dremio Corporation
 
Scaling up with Cisco Big Data: Data + Science = Data Science
Scaling up with Cisco Big Data: Data + Science = Data ScienceScaling up with Cisco Big Data: Data + Science = Data Science
Scaling up with Cisco Big Data: Data + Science = Data Science
eRic Choo
 
Cortana Analytics Workshop: The "Big Data" of the Cortana Analytics Suite, Pa...
Cortana Analytics Workshop: The "Big Data" of the Cortana Analytics Suite, Pa...Cortana Analytics Workshop: The "Big Data" of the Cortana Analytics Suite, Pa...
Cortana Analytics Workshop: The "Big Data" of the Cortana Analytics Suite, Pa...
MSAdvAnalytics
 
Values & Vision - Cloud Sandboxes for BIG Earth Sciences
Values & Vision - Cloud Sandboxes for BIG Earth SciencesValues & Vision - Cloud Sandboxes for BIG Earth Sciences
Values & Vision - Cloud Sandboxes for BIG Earth Sciences
terradue
 
AWS Initiate Day Dublin 2019 – Big Data Meets AI
AWS Initiate Day Dublin 2019 – Big Data Meets AIAWS Initiate Day Dublin 2019 – Big Data Meets AI
AWS Initiate Day Dublin 2019 – Big Data Meets AI
Amazon Web Services
 
Alten calsoft labs analytics service offerings
Alten calsoft labs   analytics service offeringsAlten calsoft labs   analytics service offerings
Alten calsoft labs analytics service offerings
Sandeep Vyas
 
Application Modernization
Application ModernizationApplication Modernization
Application Modernization
Sulaiman64
 
From Insight to Action: Using Data Science to Transform Your Organization
From Insight to Action: Using Data Science to Transform Your OrganizationFrom Insight to Action: Using Data Science to Transform Your Organization
From Insight to Action: Using Data Science to Transform Your Organization
Cloudera, Inc.
 

Similar to Turn Data Into Actionable Insights - StampedeCon 2016 (20)

Initiate Edinburgh 2019 - Big Data Meets AI
Initiate Edinburgh 2019 - Big Data Meets AIInitiate Edinburgh 2019 - Big Data Meets AI
Initiate Edinburgh 2019 - Big Data Meets AI
 
Atlanta Data Science Meetup | Qubole slides
Atlanta Data Science Meetup | Qubole slidesAtlanta Data Science Meetup | Qubole slides
Atlanta Data Science Meetup | Qubole slides
 
Analytical Innovation: How to Build the Next Generation Data Platform
Analytical Innovation: How to Build the Next Generation Data PlatformAnalytical Innovation: How to Build the Next Generation Data Platform
Analytical Innovation: How to Build the Next Generation Data Platform
 
Big Data & Analytics - Innovating at the Speed of Light
Big Data & Analytics - Innovating at the Speed of LightBig Data & Analytics - Innovating at the Speed of Light
Big Data & Analytics - Innovating at the Speed of Light
 
Sudhir Rawat, Sr Techonology Evangelist at Microsoft SQL Business Intelligenc...
Sudhir Rawat, Sr Techonology Evangelist at Microsoft SQL Business Intelligenc...Sudhir Rawat, Sr Techonology Evangelist at Microsoft SQL Business Intelligenc...
Sudhir Rawat, Sr Techonology Evangelist at Microsoft SQL Business Intelligenc...
 
A Maturing Role of Workflows in the Presence of Heterogenous Computing Archit...
A Maturing Role of Workflows in the Presence of Heterogenous Computing Archit...A Maturing Role of Workflows in the Presence of Heterogenous Computing Archit...
A Maturing Role of Workflows in the Presence of Heterogenous Computing Archit...
 
ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...
ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...
ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...
 
2022 Trends in Enterprise Analytics
2022 Trends in Enterprise Analytics2022 Trends in Enterprise Analytics
2022 Trends in Enterprise Analytics
 
Big Data Open Source Tools and Trends: Enable Real-Time Business Intelligence...
Big Data Open Source Tools and Trends: Enable Real-Time Business Intelligence...Big Data Open Source Tools and Trends: Enable Real-Time Business Intelligence...
Big Data Open Source Tools and Trends: Enable Real-Time Business Intelligence...
 
DevOps Spain 2019. Olivier Perard-Oracle
DevOps Spain 2019. Olivier Perard-OracleDevOps Spain 2019. Olivier Perard-Oracle
DevOps Spain 2019. Olivier Perard-Oracle
 
AWS Initiate Day Manchester 2019 – AWS Big Data Meets AI
AWS Initiate Day Manchester 2019 – AWS Big Data Meets AIAWS Initiate Day Manchester 2019 – AWS Big Data Meets AI
AWS Initiate Day Manchester 2019 – AWS Big Data Meets AI
 
Ai & Data Analytics 2018 - Azure Databricks for data scientist
Ai & Data Analytics 2018 - Azure Databricks for data scientistAi & Data Analytics 2018 - Azure Databricks for data scientist
Ai & Data Analytics 2018 - Azure Databricks for data scientist
 
Options for Data Prep - A Survey of the Current Market
Options for Data Prep - A Survey of the Current MarketOptions for Data Prep - A Survey of the Current Market
Options for Data Prep - A Survey of the Current Market
 
Scaling up with Cisco Big Data: Data + Science = Data Science
Scaling up with Cisco Big Data: Data + Science = Data ScienceScaling up with Cisco Big Data: Data + Science = Data Science
Scaling up with Cisco Big Data: Data + Science = Data Science
 
Cortana Analytics Workshop: The "Big Data" of the Cortana Analytics Suite, Pa...
Cortana Analytics Workshop: The "Big Data" of the Cortana Analytics Suite, Pa...Cortana Analytics Workshop: The "Big Data" of the Cortana Analytics Suite, Pa...
Cortana Analytics Workshop: The "Big Data" of the Cortana Analytics Suite, Pa...
 
Values & Vision - Cloud Sandboxes for BIG Earth Sciences
Values & Vision - Cloud Sandboxes for BIG Earth SciencesValues & Vision - Cloud Sandboxes for BIG Earth Sciences
Values & Vision - Cloud Sandboxes for BIG Earth Sciences
 
AWS Initiate Day Dublin 2019 – Big Data Meets AI
AWS Initiate Day Dublin 2019 – Big Data Meets AIAWS Initiate Day Dublin 2019 – Big Data Meets AI
AWS Initiate Day Dublin 2019 – Big Data Meets AI
 
Alten calsoft labs analytics service offerings
Alten calsoft labs   analytics service offeringsAlten calsoft labs   analytics service offerings
Alten calsoft labs analytics service offerings
 
Application Modernization
Application ModernizationApplication Modernization
Application Modernization
 
From Insight to Action: Using Data Science to Transform Your Organization
From Insight to Action: Using Data Science to Transform Your OrganizationFrom Insight to Action: Using Data Science to Transform Your Organization
From Insight to Action: Using Data Science to Transform Your Organization
 

More from StampedeCon

Why Should We Trust You-Interpretability of Deep Neural Networks - StampedeCo...
Why Should We Trust You-Interpretability of Deep Neural Networks - StampedeCo...Why Should We Trust You-Interpretability of Deep Neural Networks - StampedeCo...
Why Should We Trust You-Interpretability of Deep Neural Networks - StampedeCo...
StampedeCon
 
The Search for a New Visual Search Beyond Language - StampedeCon AI Summit 2017
The Search for a New Visual Search Beyond Language - StampedeCon AI Summit 2017The Search for a New Visual Search Beyond Language - StampedeCon AI Summit 2017
The Search for a New Visual Search Beyond Language - StampedeCon AI Summit 2017
StampedeCon
 
Predicting Outcomes When Your Outcomes are Graphs - StampedeCon AI Summit 2017
Predicting Outcomes When Your Outcomes are Graphs - StampedeCon AI Summit 2017Predicting Outcomes When Your Outcomes are Graphs - StampedeCon AI Summit 2017
Predicting Outcomes When Your Outcomes are Graphs - StampedeCon AI Summit 2017
StampedeCon
 
Novel Semi-supervised Probabilistic ML Approach to SNP Variant Calling - Stam...
Novel Semi-supervised Probabilistic ML Approach to SNP Variant Calling - Stam...Novel Semi-supervised Probabilistic ML Approach to SNP Variant Calling - Stam...
Novel Semi-supervised Probabilistic ML Approach to SNP Variant Calling - Stam...
StampedeCon
 
How to Talk about AI to Non-analaysts - Stampedecon AI Summit 2017
How to Talk about AI to Non-analaysts - Stampedecon AI Summit 2017How to Talk about AI to Non-analaysts - Stampedecon AI Summit 2017
How to Talk about AI to Non-analaysts - Stampedecon AI Summit 2017
StampedeCon
 
Getting Started with Keras and TensorFlow - StampedeCon AI Summit 2017
Getting Started with Keras and TensorFlow - StampedeCon AI Summit 2017Getting Started with Keras and TensorFlow - StampedeCon AI Summit 2017
Getting Started with Keras and TensorFlow - StampedeCon AI Summit 2017
StampedeCon
 
Foundations of Machine Learning - StampedeCon AI Summit 2017
Foundations of Machine Learning - StampedeCon AI Summit 2017Foundations of Machine Learning - StampedeCon AI Summit 2017
Foundations of Machine Learning - StampedeCon AI Summit 2017
StampedeCon
 
Don't Start from Scratch: Transfer Learning for Novel Computer Vision Problem...
Don't Start from Scratch: Transfer Learning for Novel Computer Vision Problem...Don't Start from Scratch: Transfer Learning for Novel Computer Vision Problem...
Don't Start from Scratch: Transfer Learning for Novel Computer Vision Problem...
StampedeCon
 
Bringing the Whole Elephant Into View Can Cognitive Systems Bring Real Soluti...
Bringing the Whole Elephant Into View Can Cognitive Systems Bring Real Soluti...Bringing the Whole Elephant Into View Can Cognitive Systems Bring Real Soluti...
Bringing the Whole Elephant Into View Can Cognitive Systems Bring Real Soluti...
StampedeCon
 
Automated AI The Next Frontier in Analytics - StampedeCon AI Summit 2017
Automated AI The Next Frontier in Analytics - StampedeCon AI Summit 2017Automated AI The Next Frontier in Analytics - StampedeCon AI Summit 2017
Automated AI The Next Frontier in Analytics - StampedeCon AI Summit 2017
StampedeCon
 
AI in the Enterprise: Past, Present & Future - StampedeCon AI Summit 2017
AI in the Enterprise: Past,  Present &  Future - StampedeCon AI Summit 2017AI in the Enterprise: Past,  Present &  Future - StampedeCon AI Summit 2017
AI in the Enterprise: Past, Present & Future - StampedeCon AI Summit 2017
StampedeCon
 
A Different Data Science Approach - StampedeCon AI Summit 2017
A Different Data Science Approach - StampedeCon AI Summit 2017A Different Data Science Approach - StampedeCon AI Summit 2017
A Different Data Science Approach - StampedeCon AI Summit 2017
StampedeCon
 
Graph in Customer 360 - StampedeCon Big Data Conference 2017
Graph in Customer 360 - StampedeCon Big Data Conference 2017Graph in Customer 360 - StampedeCon Big Data Conference 2017
Graph in Customer 360 - StampedeCon Big Data Conference 2017
StampedeCon
 
End-to-end Big Data Projects with Python - StampedeCon Big Data Conference 2017
End-to-end Big Data Projects with Python - StampedeCon Big Data Conference 2017End-to-end Big Data Projects with Python - StampedeCon Big Data Conference 2017
End-to-end Big Data Projects with Python - StampedeCon Big Data Conference 2017
StampedeCon
 
Doing Big Data Using Amazon's Analogs - StampedeCon Big Data Conference 2017
Doing Big Data Using Amazon's Analogs - StampedeCon Big Data Conference 2017Doing Big Data Using Amazon's Analogs - StampedeCon Big Data Conference 2017
Doing Big Data Using Amazon's Analogs - StampedeCon Big Data Conference 2017
StampedeCon
 
Enabling New Business Capabilities with Cloud-based Streaming Data Architectu...
Enabling New Business Capabilities with Cloud-based Streaming Data Architectu...Enabling New Business Capabilities with Cloud-based Streaming Data Architectu...
Enabling New Business Capabilities with Cloud-based Streaming Data Architectu...
StampedeCon
 
Big Data Meets IoT: Lessons From the Cloud on Polling, Collecting, and Analyz...
Big Data Meets IoT: Lessons From the Cloud on Polling, Collecting, and Analyz...Big Data Meets IoT: Lessons From the Cloud on Polling, Collecting, and Analyz...
Big Data Meets IoT: Lessons From the Cloud on Polling, Collecting, and Analyz...
StampedeCon
 
Creating a Data Driven Organization - StampedeCon 2016
Creating a Data Driven Organization - StampedeCon 2016Creating a Data Driven Organization - StampedeCon 2016
Creating a Data Driven Organization - StampedeCon 2016
StampedeCon
 
Using The Internet of Things for Population Health Management - StampedeCon 2016
Using The Internet of Things for Population Health Management - StampedeCon 2016Using The Internet of Things for Population Health Management - StampedeCon 2016
Using The Internet of Things for Population Health Management - StampedeCon 2016
StampedeCon
 
Visualizing Big Data – The Fundamentals
Visualizing Big Data – The FundamentalsVisualizing Big Data – The Fundamentals
Visualizing Big Data – The Fundamentals
StampedeCon
 

More from StampedeCon (20)

Why Should We Trust You-Interpretability of Deep Neural Networks - StampedeCo...
Why Should We Trust You-Interpretability of Deep Neural Networks - StampedeCo...Why Should We Trust You-Interpretability of Deep Neural Networks - StampedeCo...
Why Should We Trust You-Interpretability of Deep Neural Networks - StampedeCo...
 
The Search for a New Visual Search Beyond Language - StampedeCon AI Summit 2017
The Search for a New Visual Search Beyond Language - StampedeCon AI Summit 2017The Search for a New Visual Search Beyond Language - StampedeCon AI Summit 2017
The Search for a New Visual Search Beyond Language - StampedeCon AI Summit 2017
 
Predicting Outcomes When Your Outcomes are Graphs - StampedeCon AI Summit 2017
Predicting Outcomes When Your Outcomes are Graphs - StampedeCon AI Summit 2017Predicting Outcomes When Your Outcomes are Graphs - StampedeCon AI Summit 2017
Predicting Outcomes When Your Outcomes are Graphs - StampedeCon AI Summit 2017
 
Novel Semi-supervised Probabilistic ML Approach to SNP Variant Calling - Stam...
Novel Semi-supervised Probabilistic ML Approach to SNP Variant Calling - Stam...Novel Semi-supervised Probabilistic ML Approach to SNP Variant Calling - Stam...
Novel Semi-supervised Probabilistic ML Approach to SNP Variant Calling - Stam...
 
How to Talk about AI to Non-analaysts - Stampedecon AI Summit 2017
How to Talk about AI to Non-analaysts - Stampedecon AI Summit 2017How to Talk about AI to Non-analaysts - Stampedecon AI Summit 2017
How to Talk about AI to Non-analaysts - Stampedecon AI Summit 2017
 
Getting Started with Keras and TensorFlow - StampedeCon AI Summit 2017
Getting Started with Keras and TensorFlow - StampedeCon AI Summit 2017Getting Started with Keras and TensorFlow - StampedeCon AI Summit 2017
Getting Started with Keras and TensorFlow - StampedeCon AI Summit 2017
 
Foundations of Machine Learning - StampedeCon AI Summit 2017
Foundations of Machine Learning - StampedeCon AI Summit 2017Foundations of Machine Learning - StampedeCon AI Summit 2017
Foundations of Machine Learning - StampedeCon AI Summit 2017
 
Don't Start from Scratch: Transfer Learning for Novel Computer Vision Problem...
Don't Start from Scratch: Transfer Learning for Novel Computer Vision Problem...Don't Start from Scratch: Transfer Learning for Novel Computer Vision Problem...
Don't Start from Scratch: Transfer Learning for Novel Computer Vision Problem...
 
Bringing the Whole Elephant Into View Can Cognitive Systems Bring Real Soluti...
Bringing the Whole Elephant Into View Can Cognitive Systems Bring Real Soluti...Bringing the Whole Elephant Into View Can Cognitive Systems Bring Real Soluti...
Bringing the Whole Elephant Into View Can Cognitive Systems Bring Real Soluti...
 
Automated AI The Next Frontier in Analytics - StampedeCon AI Summit 2017
Automated AI The Next Frontier in Analytics - StampedeCon AI Summit 2017Automated AI The Next Frontier in Analytics - StampedeCon AI Summit 2017
Automated AI The Next Frontier in Analytics - StampedeCon AI Summit 2017
 
AI in the Enterprise: Past, Present & Future - StampedeCon AI Summit 2017
AI in the Enterprise: Past,  Present &  Future - StampedeCon AI Summit 2017AI in the Enterprise: Past,  Present &  Future - StampedeCon AI Summit 2017
AI in the Enterprise: Past, Present & Future - StampedeCon AI Summit 2017
 
A Different Data Science Approach - StampedeCon AI Summit 2017
A Different Data Science Approach - StampedeCon AI Summit 2017A Different Data Science Approach - StampedeCon AI Summit 2017
A Different Data Science Approach - StampedeCon AI Summit 2017
 
Graph in Customer 360 - StampedeCon Big Data Conference 2017
Graph in Customer 360 - StampedeCon Big Data Conference 2017Graph in Customer 360 - StampedeCon Big Data Conference 2017
Graph in Customer 360 - StampedeCon Big Data Conference 2017
 
End-to-end Big Data Projects with Python - StampedeCon Big Data Conference 2017
End-to-end Big Data Projects with Python - StampedeCon Big Data Conference 2017End-to-end Big Data Projects with Python - StampedeCon Big Data Conference 2017
End-to-end Big Data Projects with Python - StampedeCon Big Data Conference 2017
 
Doing Big Data Using Amazon's Analogs - StampedeCon Big Data Conference 2017
Doing Big Data Using Amazon's Analogs - StampedeCon Big Data Conference 2017Doing Big Data Using Amazon's Analogs - StampedeCon Big Data Conference 2017
Doing Big Data Using Amazon's Analogs - StampedeCon Big Data Conference 2017
 
Enabling New Business Capabilities with Cloud-based Streaming Data Architectu...
Enabling New Business Capabilities with Cloud-based Streaming Data Architectu...Enabling New Business Capabilities with Cloud-based Streaming Data Architectu...
Enabling New Business Capabilities with Cloud-based Streaming Data Architectu...
 
Big Data Meets IoT: Lessons From the Cloud on Polling, Collecting, and Analyz...
Big Data Meets IoT: Lessons From the Cloud on Polling, Collecting, and Analyz...Big Data Meets IoT: Lessons From the Cloud on Polling, Collecting, and Analyz...
Big Data Meets IoT: Lessons From the Cloud on Polling, Collecting, and Analyz...
 
Creating a Data Driven Organization - StampedeCon 2016
Creating a Data Driven Organization - StampedeCon 2016Creating a Data Driven Organization - StampedeCon 2016
Creating a Data Driven Organization - StampedeCon 2016
 
Using The Internet of Things for Population Health Management - StampedeCon 2016
Using The Internet of Things for Population Health Management - StampedeCon 2016Using The Internet of Things for Population Health Management - StampedeCon 2016
Using The Internet of Things for Population Health Management - StampedeCon 2016
 
Visualizing Big Data – The Fundamentals
Visualizing Big Data – The FundamentalsVisualizing Big Data – The Fundamentals
Visualizing Big Data – The Fundamentals
 

Recently uploaded

Quantum Computing: Current Landscape and the Future Role of APIs
Quantum Computing: Current Landscape and the Future Role of APIsQuantum Computing: Current Landscape and the Future Role of APIs
Quantum Computing: Current Landscape and the Future Role of APIs
Vlad Stirbu
 
By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024
Pierluigi Pugliese
 
PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
ControlCase
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
Kari Kakkonen
 
Accelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish CachingAccelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish Caching
Thijs Feryn
 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
Kari Kakkonen
 
Elevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object CalisthenicsElevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object Calisthenics
Dorra BARTAGUIZ
 
GraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge GraphGraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge Graph
Guy Korland
 
UiPath Community Day Dubai: AI at Work..
UiPath Community Day Dubai: AI at Work..UiPath Community Day Dubai: AI at Work..
UiPath Community Day Dubai: AI at Work..
UiPathCommunity
 
Removing Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software FuzzingRemoving Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software Fuzzing
Aftab Hussain
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
Alan Dix
 
UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4
DianaGray10
 
PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)
Ralf Eggert
 
Assure Contact Center Experiences for Your Customers With ThousandEyes
Assure Contact Center Experiences for Your Customers With ThousandEyesAssure Contact Center Experiences for Your Customers With ThousandEyes
Assure Contact Center Experiences for Your Customers With ThousandEyes
ThousandEyes
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
Safe Software
 
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdfFIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance
 
Video Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the FutureVideo Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the Future
Alpen-Adria-Universität
 
Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?
Nexer Digital
 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
Laura Byrne
 

Recently uploaded (20)

Quantum Computing: Current Landscape and the Future Role of APIs
Quantum Computing: Current Landscape and the Future Role of APIsQuantum Computing: Current Landscape and the Future Role of APIs
Quantum Computing: Current Landscape and the Future Role of APIs
 
By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024
 
PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
 
Accelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish CachingAccelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish Caching
 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
 
Elevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object CalisthenicsElevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object Calisthenics
 
GraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge GraphGraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge Graph
 
UiPath Community Day Dubai: AI at Work..
UiPath Community Day Dubai: AI at Work..UiPath Community Day Dubai: AI at Work..
UiPath Community Day Dubai: AI at Work..
 
Removing Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software FuzzingRemoving Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software Fuzzing
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
 
UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4
 
PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)
 
Assure Contact Center Experiences for Your Customers With ThousandEyes
Assure Contact Center Experiences for Your Customers With ThousandEyesAssure Contact Center Experiences for Your Customers With ThousandEyes
Assure Contact Center Experiences for Your Customers With ThousandEyes
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
 
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdfFIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
 
Video Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the FutureVideo Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the Future
 
Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?
 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
 

Turn Data Into Actionable Insights - StampedeCon 2016

  • 1. Turn Data into Actionable Insights!!
  • 2. About me: Vishnu Alavur Kannan Analytics Technical Platforms Lead • 15+ years in IT, software engineer @heart ! • Lead engineering teams through out my career ! • Platform is just vaporware without passionate people ! • A players make all the difference in software engineering ✓ 50:1, 100 :1, rarely on any other profession @Monsanto for two reasons: ! • I strongly believe in our commitment to sustainable agriculture ! • I am able to do top-flight Engineering R&D ✓ Complex engineering challenges keeps me going ✓ Freedom to operate: o Use the right tool for the right job o Solving problems using cutting-edge technologies o Open-source friendly, Open environment to contribute back
  • 3. • Bringing a broad range of solutions to help nourish our growing world • Collaborating to help tackle some of the world’s biggest challenges • >20,000 employees in 66 countries • >50% employees based outside of 
 the United States • One of the 25 World’s Best Multinational Workplaces by Great Place to Work Institute Monsanto: 
 
 A Sustainable Agriculture Company
  • 4. Our systems approach integrates technology platforms to maximize farmer effectiveness Crop Protection • Weed Control (Roundup ® 
 Branded Agricultural Herbicides) • Insect Control • Disease Control Breeding • Stress Tolerance • Disease Control • Yield • Vegetables, corn, cotton, soybeans, wheat, canolaBiologicals • Weed Control • Insect Control • Virus Control • Plant Health Biotechnology • Weed Control • Insect Control • Stress Tolerance • Yield / Yield Protection
 • NutrientsData Science • Planting Script Creator • Increased production • Efficient land and water use • Efficient nutrient use
  • 5. https://www.youtube.com/watch?v=l5Tw0PGcyN0 Why do you do what you do?! What’s the purpose? How do you do what you do? What the hell do you do? THE GOLDEN CIRCLE Simon Sinek
  • 6. Identify the signals from the noise @SCALE Volume DATA AT SCALE Variety VARIOUS FORMS OF DATA Velocity STREAMING IOT Veracity DATA UNCERTAINITY DIGITALMEDIA:280Exabytes FB:300+Petabytesperday * Information from multiple sources are adapted and incorporated POINTS,POLYGONS, RASTERS,VECTORS CONNECTINGDATAACROSSSOURCES, ISWHEREANALYSTSSPENDMOSTOFTHEIRTIME SENSORSarede-factoto gatherdata anddetect anomaliesacrossdomains
  • 7. Monsanto re-inventing Agriculture through Analytics Other providers: Cost Qualit y Agility • No hardware administration, less software administration • Eleven 9’s of data durability • Harness state-of-the-art software services ! • DevOps moving towards NoOps • Provision Infrastructure in seconds: infrastructure as code - automation • Grow or shrink compute to match seasonal workloads and pay smartly as we go Scale: MON has ~1016+ bytes of data and growing rapidly • Global Presence: Taking data driven products & services closer to business ! • Ability to accelerate feature development, integrating analytics rapidly into our workflows @scale ! • Ingest, store & retrieve massive data sets, by using the right data store to our competitive advantage (NoSQL/SQL) ! • Service diversity, Organizational maturity IOT, Imagery, Geo-spatial, Genomics, Molecular Breeding…..
  • 8. Vision A year ago as we started…
  • 9. Integrated Extended Enhanced Scalable Enable Analytics @SCALE for the Enterprise Reliable FieldDevices Apps Apps Devices DevicesApps DevicesApps Data M odels M odels M odels M odels Business Unit- 1 Business Unit- 2 Business Unit3 D igital Business Open
  • 10. Integrate Analytics with Product Platforms Data Data Science@scale Analytical Models Turn Data Into Actionable Insights …. …. APIs Data
  • 11. Predictive Product Placement @scale PFO PFO Topography Site boundary Zones Experiment metadata Planter A/B line Automap Elevation Soil Weather Topography Zones Location Data Assets Geo-spatial Catalog
  • 12. Analytics as a Service In Collaboration with IT & Business Scale across teams internalizing a self-service model
  • 13. Internalize the needs to stay ahead of the curve Addressing analytics needs based on persona ! Descriptive What happened? ! Diagnostic Why did it happen? ! Predictive What will happen? ! Prescriptive What should I do? ! Cognitive What can be learnt? Hindsight Insight Foresight 10’s K of users 1’s K 100’s Science@Scale Information Pro-Consumers Information Consumers Data ScientistsBusiness Users Business Analysts Statisticians Business Intelligence Ad-hoc Analysis Statistical Analysis !Data DiscoveryReports Dashboards Drill Down Machine Learning Inferential CausalExploratory Machine Power Users 10’s Computational Biologists Neural Networks Outsight Systems Natural Language Processing
  • 14. Discovery Analytics – Development Environments Non-prime Exploratory Prime R & D Development Environments @SCALE • Big-data Infra. & DevOps • Data Provisioning @scale • Model Deployments @scale • Big-data workloads • Computational pipelines • Transformation pipelines • Training pipelines • Sizing & Auto-scaling • Cloud Best practices • 24/7 availability • Monitoring • Alerting • ELK stack • …. Analytical models @SCALE • Co-engineering • Involve us sooner • Thinking scale ahead accelerating Time to Market • Model development & refactoring • R, Asreml, Python, OPL… • Java, Scala, Clojure… • Infrastructure as code • AWS, GCP, AzureML • Docker, Kubernetes • Distributed computing • Architecture • Solutions Design • Development !• API integrations • KAFKA integrations • OAUTH2 Integrations • Security/ISO collaborations Build it once, deploy frameworks as needed for user groups: Bundled in a centralized eco-system Non-prod to Prod BLUE / GREEN
  • 15. Discovery Analytics Development Environments Data Scientists, Developers and Novice Users From Discovery to Production Culture, approach and adoption Know Your Users For Community By Community ! Tailor by Needs Balance Freedom with Governance ! ! ! Drive User Adoption Environments iteratively served to everyone @monsanto Enable analytical capabilities @scale for the enterprise integrated with Product Platforms As of today, # of unique data scientists across groups utilizing our discovery analytics environments Model maturity Global Scalability Core teams : Train the trainee to share knowledge and best practices utilizing the environment
  • 16. Business Capabilities Make the platform robust, sharing a few use cases
  • 17. Environmental Classification @scale Engineered using Discovery Analytics - Development Environment Data Provisioning APIs Data Transformation QA/QC Rules Scala Python Scikit API API
  • 18. ! • Collaborations with Data Science Teams: Co-engineering R based machine learning model to a Scala based model training pipeline for scalability ! • EMR (Amazon Hadoop) & DataProc (GCP) using Apache Spark Computation Engine @scale • Iterative ON-DEMAND framework, auto-scaling up-to N number of nodes ! • Training pipeline integration with APIs & co-engineering continuum Molecular Breeding: Training Pipeline @Scale Engineered using Discovery Analytics - Development Environment Data DATA LEARNER MODEL 1
  • 19. Cognitive Analytics Pipeline ! • Collaborations with Cognitive Analytics Data science team to build: • An integrated Predictive Product Pipeline from inception to commercialization ! Built on: ! • Apache Airflow (incubating): DAG based model chaining & workflow management platform • Models written in Python, R • Parallelism achieved via Celery workers • Being customized now to utilize Spark ! • Apache Parquet - Columnar Storage Format on a file system; extremely parallelizable ! • Facebook Presto query engine to query parquet’s via SQLs through REST APIs – highly performant ! • Cloud Analytics platform integration • Co-engineering solutions @scale mining millions of data points to derive actionable insights Workflow DAGs Libraries Engineered using Discovery Analytics - Development Environment
  • 20. Deep learning @SCALE Discovery Analytics Development Environments integrated with CloudML on GCP Collect Store Train Predict Evaluate Training Pipeline Retrain • First Ever Deep Learning platform for the Enterprise ! • Perform Deep Learning @scale on CloudML using TensorFlow via Jupyter from Prime environment ! • Integrated with data, Inputs, Outputs and Metadata including Tensor Board to monitor your model training runs Discovery Analytics - Workflow Production Deployment - Workflow
  • 21. DATA INGESTION AND TRANSFORMATION VIA API’s AND STREAMS Streaming Business Intelligence RUN ANALYTICS@SCALE IN THE CLOUD Collaborative Data Science - DISCOVERY ANALYTICS DATA DRIVEN PRODUCTS KAFKA Streams Data Warehouse*Big-data Model outputs via APIs & Streams In-house/Third Party: Platforms AWS, GCP, Cloudera, DataStax, IBM, Azure, Domino labs… Prescriptive PredictiveCognitive Historical Models - Deep Learning, Computational Pipelines, Classification & Simulation Engines Turn Data into Actionable Insights
  • 22. Our Journey of Transformation We have just scratched our surface: ! • Science@scale – Our Cloud Analytics Platform is only a year old ! • Talent, Behavior and Platform as our 3 key pillars of focus ! • Talent: • Building big-data and cloud analytics engineering team from the ground up – 150+ interviews, 15 people team now • Targeting A players, nurture the team on new technologies, build leaders ! • Behavior/Cultural Mind shift: Data Science & IT Engineering operating as ONE TEAM • Two extreme spectrums • Finding the sweet spot in the middle has been the cultural shift • Data science teams have been very supportive, adapting to change • Bringing in IT best practices: Agile methodologies, versioning, CI…. • Train the trainee approach to enable adoption across the enterprise • Leverage the best of both worlds by co-engineering solutions • Collaboration is our new competitive advantage ! • Platform: We are at ground zero now, continuing to deliver Minimum Viable Products each sprint • Continue to mature & stay cutting edge on technologies • Build vs. Buy [Cost, Time, Quality] • Miles to go before we sleep
  • 23. https://www.youtube.com/watch?v=l5Tw0PGcyN0 Why do you do what you do?! What’s the purpose? How do you do what you do? What the hell do you do? THE GOLDEN CIRCLE Simon Sinek • Help identify the signals from the noise @scale An Enterprise Cloud Analytics platform to serve: • Analytics as a service enabling Discovery Analytics environments for the data science community • Predictive, prescriptive, streaming, cognitive, IOT edge analytical capabilities @scale • Big Data Cloud Analytics Engineering • Internalize data science needs thinking scale ahead
  • 24. Thank You 
 Visit us at engineering.monsanto.com
 
 We are looking for passionate big data cloud analytics engineers to join our team.
 
 https://www.linkedin.com/in/vishnukannan