SlideShare a Scribd company logo
1
Analytics Life Cycle: Pangea is Panacea!
The accompanying material and any related oral or written discussion (the “Materials”) is governed by
the limitations detailed below:
Licensed Content and Ownership - HCL, PANGEA and other product names are or may be registered trademarks and/or trademarks in the U.S. and/or other
countries. Content distributed within a HCL client organization must display HCL copyright notices and attributions of authorship.
IP & Patent Liability - This Solution/ Proposition is covered by a pending patent. Any refactoring or subsequent re-use is an unlicensed use and therefore
constitutes patent infringement. If there is any further detailed information required, please contact ers.slus@hcl.com
Liability Disclaimer -The information herein is for informational purposes only and represents the current view of HCL Technologies Ltd as of the date of this
presentation. Because HCL must respond to changing market conditions, it should not be interpreted to be a commitment on the part of HCL, and HCL cannot
guarantee the accuracy of any information provided after the date of this presentation. HCL MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO
THE INFORMATION IN THIS PRESENTATION.
Terms of Use, IP and Liability Disclaimer
Terms of Use, IP and liability disclaimer
Analytics Lifecycle – Enterprise View
Data Ingestion
Algorithm Selection
Model Building and Tuning
Model Monitoring
Ingest data from diverse data sources
- It is a common ask
Lesser coding and more effort on model execution
& tuning accounts for productivity
ML models need regular monitoring and
Updation on need basis unlike traditional programs
Select appropriate algorithms suitable for the
Business requirement(s) – Automated recommendation
Is definite ask
Ease of deployment and (near) real time
Scoring are crucial for enterprise acceptance
& Success
Data Preparation
Ontologies for schema preparation
& Transformation should be defined
.
Data
Ingestion
Business
Problem
Description
Data
Preparation
Algorithm
Selection
Model
Building &
Tuning
Model
Deployment
Model
Monitoring
Business
Insights &
VisualizationsBusiness Problem Description
Extract problem definition from business
Owners – Tricky often times but important
Model Deployment
Business Insights
Business insight wrappers are crucial for the
Successful adoption of analytics/ML
ML/DL Model Lifecycle
.
Drift
Analysis Automatic
Output
Variance
Analysis
Manual/
Supervised
Analysis
Identify
impacted
parametersRevise
Model
Parameters
Update model
Deployed
model
Monitor
Inputs
Input Analysis
Output Analysis
Drift/Newness
Error/Variance/FP/FN
Numerical Data – Distribution Analysis
Categorical Data – Obsolete/New categories
Text data – Obsolete/New Keywords
Estimate data shift @ regular intervals
Check for new/deleted categories/words
Error/variance for time/state models
FP/FN for feedback based models
Boolean/Categorical/Labels (Clusters)
Analytics Model Monitoring – Heuristics to Watch
Burst or patch of data causes
abrupt transition
Production data causes the
model outcome to shift/change
incrementally
Yet times data influences
gradual change in the
outcomes over a period of time
Some data sets yield recurring
change states in output
Stray incidents occur when
occasional input results in
unexpected output
Types of Analytical Models - Recap
Preventive and proactive
alerts and life time estimates
Unsupervised Model that
groups similar data/objects into
k - clusters
OptimizationClustering
Heuristic and OR models for
optimization
Survival
Time series based forecast
models
Supervised models that label
datasets
Classification ForecastRegression
Linear, non-linear and logistic
regression models
Best Practices – Analytics Adoption
Data analysis
for duplicates,
missing values
etc
Model
building,
tuning,
deployment
Model
monitoring at
regular
intervals
Reduced Time to Value
Ontologies
and schema
preparation
* These views may not expressly or implied to the affiliated organization. They are entirely speaker’s opinions based on his experience and understanding
Data ingestion
with diverse
connectors
Business logic
wrappers and
insights +
visualizations
Pangea* - Overview
Pangea is a distributed analytics workbench that provides an end to end platform for building and operationalizing Analytics quicker
Delivers end to end analytics with an intuitive
drag and drop of data and models/algorithms
Reduces model deployment time from several
months to days
Data & Code distribution on virtual nodes ensures
scalability
Actionable Insights
customizable solution to fit the client needs
Zero Coding Approach Single Click Deployment
Distributed Analytics at ScaleModular and Flexible
Pangea brings in automation to achieve speed, scale, collaboration and enforces best practices implementation across analytics life cycle to reduce the total cost of ownership
 Drastic time-to-insight reduction
Data Ingestion from
divergent data sources
Modelling and tuning
without coding
Inbuilt & 3rd party UI for
reports and charts
Deployment through
clicks and configuration
* HCL Internal IP/Tool
• Data ingestion is key without too much emphasis on ‘outcome’ at that time
• Data preparation goes hand and glove with business problem descriptions
• Ontology and/or schema preparation invisible yet inevitable step in the
enterprise analytics life cycle
• Analytics/ML Modelling without ease of deployment and monitoring are
short-lived
• Analytical models without business wrappers are only serve as PoCs
9
Summary – Pangea Best Practices

More Related Content

What's hot

Bi and erp integration
Bi and erp integrationBi and erp integration
Bi and erp integrationJorge Garcia
 
Business Intelligence and Analytics Capability
Business Intelligence and Analytics CapabilityBusiness Intelligence and Analytics Capability
Business Intelligence and Analytics Capability
ALTEN Calsoft Labs
 
Insurance sales performance dashboards powered by PMSquare
Insurance sales performance dashboards powered by PMSquareInsurance sales performance dashboards powered by PMSquare
Insurance sales performance dashboards powered by PMSquare
PM square
 
Ceo Communication
Ceo CommunicationCeo Communication
Ceo Communicationprabasiva
 
SAP LoadRunner by HP Solution Brief
SAP LoadRunner by HP Solution Brief SAP LoadRunner by HP Solution Brief
SAP LoadRunner by HP Solution Brief
SAP Solution Extensions
 
eVerge - Business Analytics with OBIEE Foundation Suite (BIFS)
eVerge - Business Analytics with OBIEE Foundation Suite (BIFS)eVerge - Business Analytics with OBIEE Foundation Suite (BIFS)
eVerge - Business Analytics with OBIEE Foundation Suite (BIFS)
Steve Chamberlin
 
Application Portfolio Assessment - 101
Application Portfolio Assessment - 101Application Portfolio Assessment - 101
Application Portfolio Assessment - 101prabasiva
 
Delivering Real-Time Business Value for Aerospace and Defense
Delivering Real-Time Business Value for Aerospace and DefenseDelivering Real-Time Business Value for Aerospace and Defense
Delivering Real-Time Business Value for Aerospace and Defense
SAP Technology
 
Overview of Operations
Overview of OperationsOverview of Operations
Overview of Operations
Glenture
 
Crafting an End-to-End Pharma GRC Strategy
Crafting an End-to-End Pharma GRC StrategyCrafting an End-to-End Pharma GRC Strategy
Crafting an End-to-End Pharma GRC Strategy
Cognizant
 
Delivering Real-Time Business Value for Chemicals
Delivering Real-Time Business Value for ChemicalsDelivering Real-Time Business Value for Chemicals
Delivering Real-Time Business Value for Chemicals
SAP Technology
 
Presentation - Scope and Schedule Management of Business Analytics Project
Presentation - Scope and Schedule Management of Business Analytics ProjectPresentation - Scope and Schedule Management of Business Analytics Project
Presentation - Scope and Schedule Management of Business Analytics ProjectSharad Srivastava
 
Fresh Path Consulting, Inc Company Overview
Fresh Path Consulting, Inc Company OverviewFresh Path Consulting, Inc Company Overview
Fresh Path Consulting, Inc Company Overview
afurst
 
SAP S/4 HANA Information Sheet
SAP S/4 HANA Information Sheet SAP S/4 HANA Information Sheet
SAP S/4 HANA Information Sheet
SAP Solution Extensions
 
Kpi insurance industry
Kpi insurance industryKpi insurance industry
Kpi insurance industrybaluiabrows
 
Geospatial Analytics
Geospatial AnalyticsGeospatial Analytics
Geospatial Analytics
Dickinson + Associates
 
Delivering Real-Time Business Value for Defense and Security
Delivering Real-Time Business Value for Defense and SecurityDelivering Real-Time Business Value for Defense and Security
Delivering Real-Time Business Value for Defense and Security
SAP Technology
 
Sap implementation
Sap implementation  Sap implementation
Sap implementation
Amarendra Munipalle
 

What's hot (19)

Bi and erp integration
Bi and erp integrationBi and erp integration
Bi and erp integration
 
Business Intelligence and Analytics Capability
Business Intelligence and Analytics CapabilityBusiness Intelligence and Analytics Capability
Business Intelligence and Analytics Capability
 
Insurance sales performance dashboards powered by PMSquare
Insurance sales performance dashboards powered by PMSquareInsurance sales performance dashboards powered by PMSquare
Insurance sales performance dashboards powered by PMSquare
 
Ceo Communication
Ceo CommunicationCeo Communication
Ceo Communication
 
SAP LoadRunner by HP Solution Brief
SAP LoadRunner by HP Solution Brief SAP LoadRunner by HP Solution Brief
SAP LoadRunner by HP Solution Brief
 
eVerge - Business Analytics with OBIEE Foundation Suite (BIFS)
eVerge - Business Analytics with OBIEE Foundation Suite (BIFS)eVerge - Business Analytics with OBIEE Foundation Suite (BIFS)
eVerge - Business Analytics with OBIEE Foundation Suite (BIFS)
 
Application Portfolio Assessment - 101
Application Portfolio Assessment - 101Application Portfolio Assessment - 101
Application Portfolio Assessment - 101
 
Delivering Real-Time Business Value for Aerospace and Defense
Delivering Real-Time Business Value for Aerospace and DefenseDelivering Real-Time Business Value for Aerospace and Defense
Delivering Real-Time Business Value for Aerospace and Defense
 
Overview of Operations
Overview of OperationsOverview of Operations
Overview of Operations
 
Crafting an End-to-End Pharma GRC Strategy
Crafting an End-to-End Pharma GRC StrategyCrafting an End-to-End Pharma GRC Strategy
Crafting an End-to-End Pharma GRC Strategy
 
Delivering Real-Time Business Value for Chemicals
Delivering Real-Time Business Value for ChemicalsDelivering Real-Time Business Value for Chemicals
Delivering Real-Time Business Value for Chemicals
 
Presentation - Scope and Schedule Management of Business Analytics Project
Presentation - Scope and Schedule Management of Business Analytics ProjectPresentation - Scope and Schedule Management of Business Analytics Project
Presentation - Scope and Schedule Management of Business Analytics Project
 
Fresh Path Consulting, Inc Company Overview
Fresh Path Consulting, Inc Company OverviewFresh Path Consulting, Inc Company Overview
Fresh Path Consulting, Inc Company Overview
 
Sneha CV
Sneha CVSneha CV
Sneha CV
 
SAP S/4 HANA Information Sheet
SAP S/4 HANA Information Sheet SAP S/4 HANA Information Sheet
SAP S/4 HANA Information Sheet
 
Kpi insurance industry
Kpi insurance industryKpi insurance industry
Kpi insurance industry
 
Geospatial Analytics
Geospatial AnalyticsGeospatial Analytics
Geospatial Analytics
 
Delivering Real-Time Business Value for Defense and Security
Delivering Real-Time Business Value for Defense and SecurityDelivering Real-Time Business Value for Defense and Security
Delivering Real-Time Business Value for Defense and Security
 
Sap implementation
Sap implementation  Sap implementation
Sap implementation
 

Similar to Analytics Development Life Cycle: Pangea is Panacea

New Product Introduction with BPM
New Product Introduction with BPMNew Product Introduction with BPM
New Product Introduction with BPM
Abhinava Pratap Singh
 
Transform Data into Action
Transform Data into ActionTransform Data into Action
Transform Data into Action
Workday, Inc.
 
Greenplum User Case
Greenplum User Case Greenplum User Case
Greenplum User Case
VMware Tanzu Korea
 
Athira mp cv_latest - copy
Athira mp cv_latest - copyAthira mp cv_latest - copy
Athira mp cv_latest - copy
Athira MP
 
A comprehensive guide to Salesforce Org Strategy
A comprehensive guide to Salesforce Org StrategyA comprehensive guide to Salesforce Org Strategy
A comprehensive guide to Salesforce Org Strategy
Gaytri khandelwal
 
Npi with bpm webinar
Npi with bpm webinarNpi with bpm webinar
Npi with bpm webinarAisurya Puhan
 
Destination Digital: Tracking Progress to Continue First Class Performance
Destination Digital: Tracking Progress to Continue First Class PerformanceDestination Digital: Tracking Progress to Continue First Class Performance
Destination Digital: Tracking Progress to Continue First Class Performance
NGA Human Resources
 
Using the power of OpenAI with your own data: what's possible and how to start?
Using the power of OpenAI with your own data: what's possible and how to start?Using the power of OpenAI with your own data: what's possible and how to start?
Using the power of OpenAI with your own data: what's possible and how to start?
Maxim Salnikov
 
How to choose the right modern bi and analytics tool for your business_.pdf
How to choose the right modern bi and analytics tool for your business_.pdfHow to choose the right modern bi and analytics tool for your business_.pdf
How to choose the right modern bi and analytics tool for your business_.pdf
Anil
 
On demand or on premise
On demand or on premiseOn demand or on premise
On demand or on premise
Pankaj Pandey
 
Week8 Topic1 Translate Business Needs Into Technical Requirements
Week8 Topic1 Translate Business Needs Into Technical RequirementsWeek8 Topic1 Translate Business Needs Into Technical Requirements
Week8 Topic1 Translate Business Needs Into Technical Requirementshapy
 
MongoDB IoT City Tour STUTTGART: Analysing the Internet of Things. By, Pentaho
MongoDB IoT City Tour STUTTGART: Analysing the Internet of Things. By, PentahoMongoDB IoT City Tour STUTTGART: Analysing the Internet of Things. By, Pentaho
MongoDB IoT City Tour STUTTGART: Analysing the Internet of Things. By, Pentaho
MongoDB
 
Embedded BI Best Practices: Webinar slides
Embedded BI Best Practices: Webinar slidesEmbedded BI Best Practices: Webinar slides
Embedded BI Best Practices: Webinar slides
Yellowfin
 
Apache Hadoop Summit 2016: The Future of Apache Hadoop an Enterprise Architec...
Apache Hadoop Summit 2016: The Future of Apache Hadoop an Enterprise Architec...Apache Hadoop Summit 2016: The Future of Apache Hadoop an Enterprise Architec...
Apache Hadoop Summit 2016: The Future of Apache Hadoop an Enterprise Architec...
PwC
 
The Future of Apache Hadoop an Enterprise Architecture View
The Future of Apache Hadoop an Enterprise Architecture ViewThe Future of Apache Hadoop an Enterprise Architecture View
The Future of Apache Hadoop an Enterprise Architecture View
DataWorks Summit/Hadoop Summit
 
Business Capital Planning PowerPoint Presentation Slides
Business Capital Planning PowerPoint Presentation SlidesBusiness Capital Planning PowerPoint Presentation Slides
Business Capital Planning PowerPoint Presentation Slides
SlideTeam
 
Estuate EDM Checklist
Estuate EDM ChecklistEstuate EDM Checklist
Estuate EDM Checklist
Estuate, Inc.
 
Cdocumentsandsettingsjnorwooddesktopranzal2010presentationsatconferenceskalei...
Cdocumentsandsettingsjnorwooddesktopranzal2010presentationsatconferenceskalei...Cdocumentsandsettingsjnorwooddesktopranzal2010presentationsatconferenceskalei...
Cdocumentsandsettingsjnorwooddesktopranzal2010presentationsatconferenceskalei...Noura Alkahtani
 
Predictive Analytics: Extending asset management framework for multi-industry...
Predictive Analytics: Extending asset management framework for multi-industry...Predictive Analytics: Extending asset management framework for multi-industry...
Predictive Analytics: Extending asset management framework for multi-industry...
Capgemini
 

Similar to Analytics Development Life Cycle: Pangea is Panacea (20)

New Product Introduction with BPM
New Product Introduction with BPMNew Product Introduction with BPM
New Product Introduction with BPM
 
Transform Data into Action
Transform Data into ActionTransform Data into Action
Transform Data into Action
 
Greenplum User Case
Greenplum User Case Greenplum User Case
Greenplum User Case
 
Athira mp cv_latest - copy
Athira mp cv_latest - copyAthira mp cv_latest - copy
Athira mp cv_latest - copy
 
A comprehensive guide to Salesforce Org Strategy
A comprehensive guide to Salesforce Org StrategyA comprehensive guide to Salesforce Org Strategy
A comprehensive guide to Salesforce Org Strategy
 
Npi with bpm webinar
Npi with bpm webinarNpi with bpm webinar
Npi with bpm webinar
 
Destination Digital: Tracking Progress to Continue First Class Performance
Destination Digital: Tracking Progress to Continue First Class PerformanceDestination Digital: Tracking Progress to Continue First Class Performance
Destination Digital: Tracking Progress to Continue First Class Performance
 
Using the power of OpenAI with your own data: what's possible and how to start?
Using the power of OpenAI with your own data: what's possible and how to start?Using the power of OpenAI with your own data: what's possible and how to start?
Using the power of OpenAI with your own data: what's possible and how to start?
 
How to choose the right modern bi and analytics tool for your business_.pdf
How to choose the right modern bi and analytics tool for your business_.pdfHow to choose the right modern bi and analytics tool for your business_.pdf
How to choose the right modern bi and analytics tool for your business_.pdf
 
On demand or on premise
On demand or on premiseOn demand or on premise
On demand or on premise
 
Week8 Topic1 Translate Business Needs Into Technical Requirements
Week8 Topic1 Translate Business Needs Into Technical RequirementsWeek8 Topic1 Translate Business Needs Into Technical Requirements
Week8 Topic1 Translate Business Needs Into Technical Requirements
 
Clarity It Sourcing Diagnostic Presentation
Clarity It Sourcing Diagnostic PresentationClarity It Sourcing Diagnostic Presentation
Clarity It Sourcing Diagnostic Presentation
 
MongoDB IoT City Tour STUTTGART: Analysing the Internet of Things. By, Pentaho
MongoDB IoT City Tour STUTTGART: Analysing the Internet of Things. By, PentahoMongoDB IoT City Tour STUTTGART: Analysing the Internet of Things. By, Pentaho
MongoDB IoT City Tour STUTTGART: Analysing the Internet of Things. By, Pentaho
 
Embedded BI Best Practices: Webinar slides
Embedded BI Best Practices: Webinar slidesEmbedded BI Best Practices: Webinar slides
Embedded BI Best Practices: Webinar slides
 
Apache Hadoop Summit 2016: The Future of Apache Hadoop an Enterprise Architec...
Apache Hadoop Summit 2016: The Future of Apache Hadoop an Enterprise Architec...Apache Hadoop Summit 2016: The Future of Apache Hadoop an Enterprise Architec...
Apache Hadoop Summit 2016: The Future of Apache Hadoop an Enterprise Architec...
 
The Future of Apache Hadoop an Enterprise Architecture View
The Future of Apache Hadoop an Enterprise Architecture ViewThe Future of Apache Hadoop an Enterprise Architecture View
The Future of Apache Hadoop an Enterprise Architecture View
 
Business Capital Planning PowerPoint Presentation Slides
Business Capital Planning PowerPoint Presentation SlidesBusiness Capital Planning PowerPoint Presentation Slides
Business Capital Planning PowerPoint Presentation Slides
 
Estuate EDM Checklist
Estuate EDM ChecklistEstuate EDM Checklist
Estuate EDM Checklist
 
Cdocumentsandsettingsjnorwooddesktopranzal2010presentationsatconferenceskalei...
Cdocumentsandsettingsjnorwooddesktopranzal2010presentationsatconferenceskalei...Cdocumentsandsettingsjnorwooddesktopranzal2010presentationsatconferenceskalei...
Cdocumentsandsettingsjnorwooddesktopranzal2010presentationsatconferenceskalei...
 
Predictive Analytics: Extending asset management framework for multi-industry...
Predictive Analytics: Extending asset management framework for multi-industry...Predictive Analytics: Extending asset management framework for multi-industry...
Predictive Analytics: Extending asset management framework for multi-industry...
 

More from inside-BigData.com

Major Market Shifts in IT
Major Market Shifts in ITMajor Market Shifts in IT
Major Market Shifts in IT
inside-BigData.com
 
Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...
inside-BigData.com
 
Transforming Private 5G Networks
Transforming Private 5G NetworksTransforming Private 5G Networks
Transforming Private 5G Networks
inside-BigData.com
 
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
inside-BigData.com
 
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
inside-BigData.com
 
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
inside-BigData.com
 
HPC Impact: EDA Telemetry Neural Networks
HPC Impact: EDA Telemetry Neural NetworksHPC Impact: EDA Telemetry Neural Networks
HPC Impact: EDA Telemetry Neural Networks
inside-BigData.com
 
Biohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
Biohybrid Robotic Jellyfish for Future Applications in Ocean MonitoringBiohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
Biohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
inside-BigData.com
 
Machine Learning for Weather Forecasts
Machine Learning for Weather ForecastsMachine Learning for Weather Forecasts
Machine Learning for Weather Forecasts
inside-BigData.com
 
HPC AI Advisory Council Update
HPC AI Advisory Council UpdateHPC AI Advisory Council Update
HPC AI Advisory Council Update
inside-BigData.com
 
Fugaku Supercomputer joins fight against COVID-19
Fugaku Supercomputer joins fight against COVID-19Fugaku Supercomputer joins fight against COVID-19
Fugaku Supercomputer joins fight against COVID-19
inside-BigData.com
 
Energy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic TuningEnergy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic Tuning
inside-BigData.com
 
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPODHPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
inside-BigData.com
 
State of ARM-based HPC
State of ARM-based HPCState of ARM-based HPC
State of ARM-based HPC
inside-BigData.com
 
Versal Premium ACAP for Network and Cloud Acceleration
Versal Premium ACAP for Network and Cloud AccelerationVersal Premium ACAP for Network and Cloud Acceleration
Versal Premium ACAP for Network and Cloud Acceleration
inside-BigData.com
 
Zettar: Moving Massive Amounts of Data across Any Distance Efficiently
Zettar: Moving Massive Amounts of Data across Any Distance EfficientlyZettar: Moving Massive Amounts of Data across Any Distance Efficiently
Zettar: Moving Massive Amounts of Data across Any Distance Efficiently
inside-BigData.com
 
Scaling TCO in a Post Moore's Era
Scaling TCO in a Post Moore's EraScaling TCO in a Post Moore's Era
Scaling TCO in a Post Moore's Era
inside-BigData.com
 
CUDA-Python and RAPIDS for blazing fast scientific computing
CUDA-Python and RAPIDS for blazing fast scientific computingCUDA-Python and RAPIDS for blazing fast scientific computing
CUDA-Python and RAPIDS for blazing fast scientific computing
inside-BigData.com
 
Introducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi ClusterIntroducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi Cluster
inside-BigData.com
 
Overview of HPC Interconnects
Overview of HPC InterconnectsOverview of HPC Interconnects
Overview of HPC Interconnects
inside-BigData.com
 

More from inside-BigData.com (20)

Major Market Shifts in IT
Major Market Shifts in ITMajor Market Shifts in IT
Major Market Shifts in IT
 
Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...Preparing to program Aurora at Exascale - Early experiences and future direct...
Preparing to program Aurora at Exascale - Early experiences and future direct...
 
Transforming Private 5G Networks
Transforming Private 5G NetworksTransforming Private 5G Networks
Transforming Private 5G Networks
 
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
The Incorporation of Machine Learning into Scientific Simulations at Lawrence...
 
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
 
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
 
HPC Impact: EDA Telemetry Neural Networks
HPC Impact: EDA Telemetry Neural NetworksHPC Impact: EDA Telemetry Neural Networks
HPC Impact: EDA Telemetry Neural Networks
 
Biohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
Biohybrid Robotic Jellyfish for Future Applications in Ocean MonitoringBiohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
Biohybrid Robotic Jellyfish for Future Applications in Ocean Monitoring
 
Machine Learning for Weather Forecasts
Machine Learning for Weather ForecastsMachine Learning for Weather Forecasts
Machine Learning for Weather Forecasts
 
HPC AI Advisory Council Update
HPC AI Advisory Council UpdateHPC AI Advisory Council Update
HPC AI Advisory Council Update
 
Fugaku Supercomputer joins fight against COVID-19
Fugaku Supercomputer joins fight against COVID-19Fugaku Supercomputer joins fight against COVID-19
Fugaku Supercomputer joins fight against COVID-19
 
Energy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic TuningEnergy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic Tuning
 
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPODHPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
HPC at Scale Enabled by DDN A3i and NVIDIA SuperPOD
 
State of ARM-based HPC
State of ARM-based HPCState of ARM-based HPC
State of ARM-based HPC
 
Versal Premium ACAP for Network and Cloud Acceleration
Versal Premium ACAP for Network and Cloud AccelerationVersal Premium ACAP for Network and Cloud Acceleration
Versal Premium ACAP for Network and Cloud Acceleration
 
Zettar: Moving Massive Amounts of Data across Any Distance Efficiently
Zettar: Moving Massive Amounts of Data across Any Distance EfficientlyZettar: Moving Massive Amounts of Data across Any Distance Efficiently
Zettar: Moving Massive Amounts of Data across Any Distance Efficiently
 
Scaling TCO in a Post Moore's Era
Scaling TCO in a Post Moore's EraScaling TCO in a Post Moore's Era
Scaling TCO in a Post Moore's Era
 
CUDA-Python and RAPIDS for blazing fast scientific computing
CUDA-Python and RAPIDS for blazing fast scientific computingCUDA-Python and RAPIDS for blazing fast scientific computing
CUDA-Python and RAPIDS for blazing fast scientific computing
 
Introducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi ClusterIntroducing HPC with a Raspberry Pi Cluster
Introducing HPC with a Raspberry Pi Cluster
 
Overview of HPC Interconnects
Overview of HPC InterconnectsOverview of HPC Interconnects
Overview of HPC Interconnects
 

Recently uploaded

How to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptxHow to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptx
danishmna97
 
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex ProofszkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
Alex Pruden
 
UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
DianaGray10
 
Free Complete Python - A step towards Data Science
Free Complete Python - A step towards Data ScienceFree Complete Python - A step towards Data Science
Free Complete Python - A step towards Data Science
RinaMondal9
 
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
Neo4j
 
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionGenerative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Aggregage
 
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
SOFTTECHHUB
 
20240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 202420240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 2024
Matthew Sinclair
 
Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !
KatiaHIMEUR1
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
Alan Dix
 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
Kari Kakkonen
 
Introduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - CybersecurityIntroduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - Cybersecurity
mikeeftimakis1
 
Removing Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software FuzzingRemoving Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software Fuzzing
Aftab Hussain
 
Enchancing adoption of Open Source Libraries. A case study on Albumentations.AI
Enchancing adoption of Open Source Libraries. A case study on Albumentations.AIEnchancing adoption of Open Source Libraries. A case study on Albumentations.AI
Enchancing adoption of Open Source Libraries. A case study on Albumentations.AI
Vladimir Iglovikov, Ph.D.
 
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
Neo4j
 
GridMate - End to end testing is a critical piece to ensure quality and avoid...
GridMate - End to end testing is a critical piece to ensure quality and avoid...GridMate - End to end testing is a critical piece to ensure quality and avoid...
GridMate - End to end testing is a critical piece to ensure quality and avoid...
ThomasParaiso2
 
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Albert Hoitingh
 
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
Neo4j
 
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
James Anderson
 
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
Neo4j
 

Recently uploaded (20)

How to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptxHow to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptx
 
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex ProofszkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex Proofs
 
UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
 
Free Complete Python - A step towards Data Science
Free Complete Python - A step towards Data ScienceFree Complete Python - A step towards Data Science
Free Complete Python - A step towards Data Science
 
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
 
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionGenerative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to Production
 
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
 
20240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 202420240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 2024
 
Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
 
Introduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - CybersecurityIntroduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - Cybersecurity
 
Removing Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software FuzzingRemoving Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software Fuzzing
 
Enchancing adoption of Open Source Libraries. A case study on Albumentations.AI
Enchancing adoption of Open Source Libraries. A case study on Albumentations.AIEnchancing adoption of Open Source Libraries. A case study on Albumentations.AI
Enchancing adoption of Open Source Libraries. A case study on Albumentations.AI
 
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
 
GridMate - End to end testing is a critical piece to ensure quality and avoid...
GridMate - End to end testing is a critical piece to ensure quality and avoid...GridMate - End to end testing is a critical piece to ensure quality and avoid...
GridMate - End to end testing is a critical piece to ensure quality and avoid...
 
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
 
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
 
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
 
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
 

Analytics Development Life Cycle: Pangea is Panacea

  • 1. 1 Analytics Life Cycle: Pangea is Panacea!
  • 2. The accompanying material and any related oral or written discussion (the “Materials”) is governed by the limitations detailed below: Licensed Content and Ownership - HCL, PANGEA and other product names are or may be registered trademarks and/or trademarks in the U.S. and/or other countries. Content distributed within a HCL client organization must display HCL copyright notices and attributions of authorship. IP & Patent Liability - This Solution/ Proposition is covered by a pending patent. Any refactoring or subsequent re-use is an unlicensed use and therefore constitutes patent infringement. If there is any further detailed information required, please contact ers.slus@hcl.com Liability Disclaimer -The information herein is for informational purposes only and represents the current view of HCL Technologies Ltd as of the date of this presentation. Because HCL must respond to changing market conditions, it should not be interpreted to be a commitment on the part of HCL, and HCL cannot guarantee the accuracy of any information provided after the date of this presentation. HCL MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION. Terms of Use, IP and Liability Disclaimer Terms of Use, IP and liability disclaimer
  • 3. Analytics Lifecycle – Enterprise View Data Ingestion Algorithm Selection Model Building and Tuning Model Monitoring Ingest data from diverse data sources - It is a common ask Lesser coding and more effort on model execution & tuning accounts for productivity ML models need regular monitoring and Updation on need basis unlike traditional programs Select appropriate algorithms suitable for the Business requirement(s) – Automated recommendation Is definite ask Ease of deployment and (near) real time Scoring are crucial for enterprise acceptance & Success Data Preparation Ontologies for schema preparation & Transformation should be defined . Data Ingestion Business Problem Description Data Preparation Algorithm Selection Model Building & Tuning Model Deployment Model Monitoring Business Insights & VisualizationsBusiness Problem Description Extract problem definition from business Owners – Tricky often times but important Model Deployment Business Insights Business insight wrappers are crucial for the Successful adoption of analytics/ML
  • 4. ML/DL Model Lifecycle . Drift Analysis Automatic Output Variance Analysis Manual/ Supervised Analysis Identify impacted parametersRevise Model Parameters Update model Deployed model Monitor Inputs Input Analysis Output Analysis Drift/Newness Error/Variance/FP/FN Numerical Data – Distribution Analysis Categorical Data – Obsolete/New categories Text data – Obsolete/New Keywords Estimate data shift @ regular intervals Check for new/deleted categories/words Error/variance for time/state models FP/FN for feedback based models Boolean/Categorical/Labels (Clusters)
  • 5. Analytics Model Monitoring – Heuristics to Watch Burst or patch of data causes abrupt transition Production data causes the model outcome to shift/change incrementally Yet times data influences gradual change in the outcomes over a period of time Some data sets yield recurring change states in output Stray incidents occur when occasional input results in unexpected output
  • 6. Types of Analytical Models - Recap Preventive and proactive alerts and life time estimates Unsupervised Model that groups similar data/objects into k - clusters OptimizationClustering Heuristic and OR models for optimization Survival Time series based forecast models Supervised models that label datasets Classification ForecastRegression Linear, non-linear and logistic regression models
  • 7. Best Practices – Analytics Adoption Data analysis for duplicates, missing values etc Model building, tuning, deployment Model monitoring at regular intervals Reduced Time to Value Ontologies and schema preparation * These views may not expressly or implied to the affiliated organization. They are entirely speaker’s opinions based on his experience and understanding Data ingestion with diverse connectors Business logic wrappers and insights + visualizations
  • 8. Pangea* - Overview Pangea is a distributed analytics workbench that provides an end to end platform for building and operationalizing Analytics quicker Delivers end to end analytics with an intuitive drag and drop of data and models/algorithms Reduces model deployment time from several months to days Data & Code distribution on virtual nodes ensures scalability Actionable Insights customizable solution to fit the client needs Zero Coding Approach Single Click Deployment Distributed Analytics at ScaleModular and Flexible Pangea brings in automation to achieve speed, scale, collaboration and enforces best practices implementation across analytics life cycle to reduce the total cost of ownership  Drastic time-to-insight reduction Data Ingestion from divergent data sources Modelling and tuning without coding Inbuilt & 3rd party UI for reports and charts Deployment through clicks and configuration * HCL Internal IP/Tool
  • 9. • Data ingestion is key without too much emphasis on ‘outcome’ at that time • Data preparation goes hand and glove with business problem descriptions • Ontology and/or schema preparation invisible yet inevitable step in the enterprise analytics life cycle • Analytics/ML Modelling without ease of deployment and monitoring are short-lived • Analytical models without business wrappers are only serve as PoCs 9 Summary – Pangea Best Practices