Paris, Sophia Antipolis, London, San Jose USA
A Key Technology Provider and Actor
in the Cloud Migration
within all Big Compute verticals and at the heart of the AI/Machine Learning
revolution
Paris, Sophia Antipolis, London, San Jose USA, Montreal CA
1. Company / Team
400 man-years of R&D
2 patents
30 highly qualified engineers, 17 of whom hold PhDs
References in all Industries in the US and EMEA
Global Locations
Partnerships
Key information
Management
Denis Caromel, CEO
François Tournesac, CSO
Fabien Viale, CTO
Marco Castigliego, CAO
Company
ISV Founded in 2007 by Denis Caromel in Sophia-Antipolis, Spin-off of INRIA
Addressing $80 Billion Hybrid Cloud Market with 27% CAGR
Disruptive Patented Technology w/ Exceptional Business Outcomes
60% of revenue from international markets
Sophia-Antipolis (France)
Paris (France)
London (United Kingdom)
San-Jose (United States)
Montreal (Canada)
Fribourg (Switzerland)
Dakar (Senegal)
ProActive Solution
Job Scheduling, Workload Automation
Orchestration & Meta-Scheduling
On-premises and on all clouds
Open Source
2005 An R&D team of 45 persons headed by Denis Caromel develops a
core kernel for distributed, parallel & cloud computing at INRIA (the largest EU
computer science research institute, 6 000 persons).
2007 Foundation of ActiveEon
Co-development between INRIA Team & ActiveEon
IP Technology Transfer from INRIA to ActiveEon
2009 Scheduler added to the Core
2011 Resource Manager added
2013
2014
2016
2017
Orchestration with Powerful Workflows added
First very large customer references in Production
International Expansions in UK, USA, Africa
ActiveEon Story
R&D Investment
250 M/Y at INRIA + 150 M/Y at AE = 400 Man/Year
2018 Machine Learning Open Studio added to ProActive
International Expansions in Canada
Express Business Needs with Granular
Workflows
• Distributed & Parallel Computing
• On-premises & Cloud
Architectures
A 30+ PhD/Engineer team with focused fields of high expertise:
• Big Data
• AI, ML
• IoT
• Finance
• Gov.
• HPC
• ...
• Data Science, Machine Learning, AI, Matlab, R
• Big Compute and HPC
Visdom
ActiveEon Technology
Process Flow & Operation
Execute &
Monitor
Design
Schedule, Monitor &
Connect the
resources
IT department, Data
Scientist, Business Lines,
Activeeon Services IT department,
Business owner
Operational team
1
2
3
PROACTIVE STUDIO
PROACTIVE RESOURCES MANAGER
PROACTIVE SCHEDULER
PROACTIVE AUTOMATION PORTAL
Next Generation
Scheduler/Orchestration
Scheduler and Orchestration
Priority
& Planning
Parallel
Executions
Error
Management
Multi Users
Open REST API
Resource Management and Monitoring
Slurm
SGE
PBS
LSF
Multi-platform
Local
Machine
Network
Resource
Batch
Scheduler
Cloud
Processing and Automation Workflows
Any
language
Secured
Data
Transfers
Meta-scheduler
ETL, ERP,
ELT, …
Full
integration
Translator
Open Workflow Studio
Machine Learning Open Studio
https://www.youtube.com/watch?v=mbrQxCf4lqM
Automation Dashboard - Catalog
Workflows stored in buckets in the Catalog
RBAC support for each bucket / Users can share workflows and templates
Keep track of revisions with the integrated versioning feature
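The catalog ideas above (buckets, per-bucket access rights, revision history) can be sketched as a small data structure. This is only an illustrative sketch: the `Bucket` class, its fields and its methods are hypothetical names chosen for this example and are not the ProActive Catalog API.

```python
from dataclasses import dataclass, field


@dataclass
class Bucket:
    """Hypothetical catalog bucket: workflows plus per-bucket access rights."""
    name: str
    readers: set = field(default_factory=set)      # users allowed to read
    writers: set = field(default_factory=set)      # users allowed to commit
    revisions: dict = field(default_factory=dict)  # workflow name -> list of contents

    def commit(self, user, workflow, content):
        """Store a new revision of a workflow and return its 1-based number."""
        if user not in self.writers:
            raise PermissionError(f"{user} cannot write to bucket {self.name}")
        history = self.revisions.setdefault(workflow, [])
        history.append(content)
        return len(history)

    def read(self, user, workflow, revision=None):
        """Return the requested revision, or the latest one when revision is None."""
        if user not in self.readers and user not in self.writers:
            raise PermissionError(f"{user} cannot read bucket {self.name}")
        history = self.revisions[workflow]
        if revision is None:
            return history[-1]
        if not 1 <= revision <= len(history):
            raise IndexError(f"no revision {revision} for {workflow}")
        return history[revision - 1]


bucket = Bucket("finance", readers={"bob"}, writers={"alice"})
bucket.commit("alice", "daily-report", "<workflow v1/>")
bucket.commit("alice", "daily-report", "<workflow v2/>")
print(bucket.read("bob", "daily-report"))     # latest revision
print(bucket.read("bob", "daily-report", 1))  # first revision
```

Keeping every committed content in an append-only list is the simplest way to make older revisions retrievable; a real catalog would also record authors and timestamps.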
Job Planner
Define Calendars
Associate Workflows to Calendars
Visualize Execution Planning
Manage recurring Jobs
Forecast and check future
Executions
Control recurring jobs from one
endpoint
Schedule Exceptions through
Exclusion Calendars &
Inclusion Calendars
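A minimal sketch of how recurring jobs, exclusion calendars and inclusion calendars could combine into a forecast of future executions. The function name `planned_dates` and its parameters are hypothetical, and the rule that an inclusion date always adds a run is an assumption of this example, not documented Job Planner behaviour.

```python
from datetime import date, timedelta


def planned_dates(start, end, weekdays, exclusions=frozenset(), inclusions=frozenset()):
    """Forecast run dates in [start, end].

    weekdays   -- recurring days as integers (Monday is 0, Sunday is 6)
    exclusions -- dates on which the recurring job must not run
    inclusions -- extra dates on which the job runs in addition to the recurrence
    """
    runs = set()
    current = start
    while current <= end:
        if current.weekday() in weekdays and current not in exclusions:
            runs.add(current)
        current += timedelta(days=1)
    # Assumption: inclusion dates are added on top of the recurring schedule.
    runs |= {day for day in inclusions if start <= day <= end}
    return sorted(runs)


# Weekday job, skipping 1 January and adding a one-off Saturday run.
week = planned_dates(
    date(2024, 1, 1),
    date(2024, 1, 7),
    weekdays={0, 1, 2, 3, 4},
    exclusions={date(2024, 1, 1)},
    inclusions={date(2024, 1, 6)},
)
print([day.isoformat() for day in week])
```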
Cloud Automation: On-demand PaaS
On-Demand PaaS Services with full Life-Cycle Management
Scalable & Elastic resources
Incremental resource
deployment
100% resources
usage, no waste
Smart scale down
Provides cloud computing power according to your needs.
Minimize costs by deploying VMs only when needed (configurable
load factor). Never exceed your budget (min/max VMs threshold).
Smart and fully configurable elastic policy. Shut down unused
VMs whenever possible. Prevent time-consuming redeployments
by adjusting the idle nodes’ release delay (avoid scale
up/down cycles).
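The elastic policy described above can be illustrated with a short sketch. It assumes the load factor means "tasks per VM" and that a VM is released only after it has been idle for the configured delay; the class `ElasticPolicy` and its field names are hypothetical and do not reflect the actual ProActive configuration keys.

```python
import math
from dataclasses import dataclass


@dataclass
class ElasticPolicy:
    """Hypothetical elastic policy with a VM budget and an idle release delay."""
    min_vms: int
    max_vms: int
    tasks_per_vm: int          # assumed meaning of the "load factor"
    idle_release_delay: float  # seconds a VM must stay idle before release

    def target_vms(self, pending_tasks):
        """Number of VMs wanted for the current load, clamped to the budget."""
        wanted = math.ceil(pending_tasks / self.tasks_per_vm)
        return max(self.min_vms, min(self.max_vms, wanted))

    def vms_to_release(self, running_vms, idle_seconds, pending_tasks):
        """Pick idle VMs to shut down without dropping below the target.

        idle_seconds maps a VM name to how long it has been idle.
        """
        surplus = max(0, running_vms - self.target_vms(pending_tasks))
        candidates = [
            name for name, idle in idle_seconds.items()
            if idle >= self.idle_release_delay
        ]
        # Release the longest-idle VMs first.
        candidates.sort(key=lambda name: idle_seconds[name], reverse=True)
        return candidates[:surplus]


policy = ElasticPolicy(min_vms=1, max_vms=10, tasks_per_vm=4, idle_release_delay=300)
print(policy.target_vms(9))
print(policy.vms_to_release(5, {"vm-a": 600, "vm-b": 100, "vm-c": 400}, pending_tasks=8))
```

Waiting for the release delay before shutting a VM down is what avoids repeated scale up/down cycles when the load fluctuates.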
Some Supported Languages and Connectors
Infrastructure: AWS, Azure, GCP (Google Cloud Platform), Docker, Swarm, Kubernetes, OpenShift, Linux, Windows, Solaris, AIX, AS/400, VMware, OpenStack
HPC Schedulers: Slurm, PBS (PBS Works), LSF, SGE
Languages and Predefined Tasks: Cmd, Linux Bash, PowerShell, Java, Scala, Javascript, Groovy, Ruby, Jython, Python, Perl, PHP, R, C++/C#, Cron, LDAP
Data Connectors: FTP, URL, SFTP, MySQL, Oracle, PostgreSQL, Greenplum, SQL Server, MongoDB, Cassandra
Clouds: AWS-S3, Azure Storage, Azure Data Lake
Big Data: Kafka, Zookeeper, Spark, Storm, Hadoop, Hadoop HDFS, Azure Databricks, Elasticsearch, Logstash, Twitter, SAP, Clearwater
Machine Learning & Artificial Intelligence: CNTK, Keras, PyTorch, YOLO, H2O, TensorFlow, Caffe, Spark MLlib, Pandas, JupyterLab, Scikit-Learn, MXNet, DLib, BigDL, DeepLearning4J, Cognitive Services
Visualization: Visdom, Kibana, Grafana
Specialized: Cuda, OpenCL, FPGA
20 000 Cores Azure Benchmarks
With ActiveEon Workflows & Scheduler:
15 min to trigger and acquire 20 000 Azure cores and to schedule 20 000 Tasks
99% of requests with a response time under 90 ms
On 20K cores, with 19K running Tasks, only 5 sec. to detect a software failure and redeploy the Task
With 19K running Tasks, only 30.8 sec. to execute a Job of 10 Tasks of 30 sec. each: 97.4% efficiency.
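The 97.4% figure is consistent with dividing the ideal duration by the measured one, assuming the 10 tasks of 30 sec. run fully in parallel so that the ideal job time is 30 sec. That reading is an assumption of this sketch, and the function name is illustrative.

```python
def scheduling_efficiency(ideal_seconds, measured_seconds):
    """Ratio of the ideal job duration to the measured one."""
    return ideal_seconds / measured_seconds


ideal = 30.0      # 10 tasks of 30 sec. each, assumed to run in parallel
measured = 30.8   # end-to-end job time reported in the benchmark

efficiency = scheduling_efficiency(ideal, measured)
overhead = measured - ideal

print(f"efficiency: {efficiency:.1%}")         # efficiency: 97.4%
print(f"scheduling overhead: {overhead:.1f} s")  # scheduling overhead: 0.8 s
```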
2. Some Typical Customer Cases:
Capabilities & Portfolio Review
Large Worldwide International Companies
Early Adopters
Using ActiveEon for Critical Business Applications
Finance
IoT
Gov.
Manufacturing
Automotive
Aerospace
Nuclear
RedHat OpenShift
Some Major Customers
Telco & IT
Bio Tech & Health
Finance
Engineering
Aeronautics & Space
Energy
Some Partners:
Media
Distribution
Government
IoT
Cosmetics
L&G, a leading multinational finance and insurance company headquartered in London
Situation
Comply with new European regulations: Solvency II, Basel III, etc.
Transform legacy system and embrace cloud computing
Solution
Activeeon ProActive and migration to the Cloud have enabled
faster and more reliable execution:
• Cloud bursting
• Error management
• Prioritization
Benefits
From 18 hours to 2 hours for priority reports
Agile development with an objective of 4,000 cores
$1.2M / year committed spend on Cloud
Finance
Time
64 VMs, each with 16 vCPUs
The Home Office is the UK Ministry of the Interior. It uses ActiveEon for 2 critical
applications:
• Visa Delivery Process, and
• Big Data & Analytics platform for Crime Reduction (HODAC).
Situation
Need to integrate 25 different data sources in order to build a consolidated
Data Lake and analytics platform to be used for many homeland security
applications.
Solution
ActiveEon is used as the central Orchestrator to Schedule and Meta-Schedule all the
Big Data, ETL, Analytics and Machine Learning software appliances of the platform
(Hadoop, SAS, TIBCO Spotfire, Python, Anaconda, GreenPlum, ElasticSearch, …).
Benefits
• Central Orchestration Tool
• Workflow Expressiveness: universal & comprehensive
• Management of Security for highly sensitive environments
• Management of Resources for all appliances (SAS, TIBCO,… ).
« ActiveEon is the only solution capable to Schedule any Big Data Analytics, mono-threaded, multi-threaded, multi-core, parallel and distributed »
Cap Gemini Lead Engineer for Home Office
Gov.: UK Ministry of Interior
Komatsu is a Japanese multinational corporation.
It manufactures construction, mining, industrial and military equipment.
Situation
ActiveEon orchestrates cloud execution over hot and cold storage for streaming and batch analytics
> 1,200 tasks executed per hour
Solution
Activeeon ProActive has enabled control and scheduling of execution:
• Error Management – Notification, Automated Recovery
• Job Planner
• Distribution & Parallelization
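The error management item (notification plus automated recovery) can be pictured as a retry loop around a task. This is a generic sketch under that assumption, not the ProActive implementation; `run_with_recovery` and its parameters are hypothetical names.

```python
import time


def run_with_recovery(task, max_attempts=3, notify=print, delay_seconds=0.0):
    """Run task(); on failure, notify and retry up to max_attempts times."""
    for attempt in range(1, max_attempts + 1):
        try:
            return task()
        except Exception as error:
            # Notification: report every failed attempt.
            notify(f"attempt {attempt}/{max_attempts} failed: {error}")
            if attempt == max_attempts:
                raise  # recovery exhausted, surface the last error
            time.sleep(delay_seconds)


# Example: a task that fails twice before succeeding.
calls = {"count": 0}


def flaky_task():
    calls["count"] += 1
    if calls["count"] < 3:
        raise RuntimeError("transient failure")
    return "done"


print(run_with_recovery(flaky_task))
```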
Benefits
• Reliable execution to orchestrate multiple services and resources
• Provide consistent results and KPIs to end users and BI Tools
IoT
PEPS is the French platform that offers access to the products of the Sentinel satellites (S1A and S1B, S2A and S2B, S3A
and S3B) of Copernicus, the European Union programme for Earth observation and monitoring
Situation
Make Sentinel data available to the greatest number and
encourage the development of applications using them (agriculture, maritime field...)
1 petabyte (10^15 bytes) in 20 years and 7 petabytes in 2 years!
Solution
The ProActive solution provided by ActiveEon, executing on Azure in hybrid mode,
enhances PEPS data and makes them available to API providers:
• Multi-Cloud Ecosystem Platform
• Remove complexity for Data Scientists
• Provide Cloud performance
Benefits
• Faster execution, optimisation of on-prem resources & clouds
• Easier to use by end-users
Space & Image Processing
Platform for Cosmetic Formulation for 2 000 persons around the world and
for the Innovation Team (Statistics, Machine Learning, use of the R language)
2 000 persons
around the World
Innovation Team
(Statistics, ML, R)
Workflows Orchestration Monitoring
Data
Compute
Data
Mining
Private
Network
+
HTTPS
ProActive
Cloud Watch
Environment Environment
Machine Learning in IT Log Analysis for Error Detection & Prediction in Financial Market
Analysis &
Classification
• Machine Learning
• Artificial Intelligence
• Probabilistic Analysis
Resources /
Applications /
Services
Resources /
Applications /
Services
Resources /
Applications /
Services
Business Users
11 1
1
2
3
Collect data from
any sources
Update model
Update event
driven system
Events
Monitoring
Complex Event
Processing
• Rule based
• Actions triggering
3
Alert
Predictive
Incident
Request for incident
analysis
2
Automated
Preventive
Action
Incidents
Incidents
Finance Domain: Deep ML for IT Infrastructure
Main Benefits
Openness and diversity of ML
frameworks to be used (vs.
Splunk)
Both Batch and Streaming
Workflow Expressiveness:
universal & comprehensive
IT Users
Orchestration of RedHat OpenShift On-Prem & On Azure
Orchestrate & Manage all layers: IaaS, PaaS, SaaS.
Multi-Cloud, Hybrid, Scalable,
Digital transformation for manufacturing
BENEFITS
Reduce the distance between the virtual and the
manufacturing process
Take advantage of digitalization in the machine tool
field for intelligent manufacturing and more efficient
production
FEATURES
Cloud-based big data analytics during
machining
Optimization of machining parameters
using workflows
Process simulation and optimization tools
Physical measurements and monitoring
Virtual / real part model correction
Use of AI
TARGETED SECTORS
Manufacturing, automotive, aerospace
Cloud processing services in manufacturing
END USERS
Workflows for HPC multi-physics engineering
simulations in automotive and aerospace
BENEFITS
Thermal resistance for engine parts
FEATURES
Parallel evaluation of optimal mesh size for
the best tradeoff between execution time
and result accuracy
Complex workflow management: monitoring,
scheduling and orchestration
Infrastructure management: on-premises and
cloud HPC
Data collection and processing
END USERS
Pollution levels in a district
Workflow for exploration of tradeoff
between execution time and result accuracy
DOMAIN: COMPUTATIONAL FLUID DYNAMICS (CFD) AND POST-PROCESSING TOOLS
Acceleration and Automation of
Design Analysis and Optimizations
Deep Learning for Anomaly Detection in
Satellite Manufacturing
FEATURES
Detection of wire defects on a set of images
from the production line using Deep Learning
Deep Learning on images of wires: occlusion,
variation, noise, grayscale, semantic analysis
Detection of defects using a pre-defined wire
model and computing a distance measure
Workflows for model training and prediction for
parallel execution
BENEFITS
Automatic detection of defects in hybrid
circuits manufacturing
Higher precision of Machine Learning results
Faster results with parallel execution of
machine learning workflows
Workflows can be used for other applications
Faulty wires come out in red
Big Data Analysis for Automated Anomaly
Tracking in Satellite Communication
FEATURES
Data analysis: checking the number, order and type of service
telemetry packets
Incident evolution forecasts
Big data workflows for automation of Test Scenarios
Automatic detection of remote controls that didn’t
receive expected telemetries
Data visualization in browser
BENEFITS
Automatic and early detection of defects via trend
analysis of test results
Engineering process improvement: margin assessment,
robustness analysis, model elaboration based on actual
behaviors
Workflows that accelerate the processing of fast-growing
volumes of test data
Data fetching from many sources
ProActive workflow for service
telemetries verification
Visualisation of anomalies
Acceleration of Non-Destructive Evaluation (NDE)
for Nuclear Energy, Oil & Gas, Aerospace
FEATURES
NDE batch processing, parametric studies,
non-regression tests on multiple clusters
Transfer Input and Output data
Event programming to follow executions
Workflow process definition
Activeeon guidance and support
Cloud version: Execution on Microsoft Azure
with 50 VMs/day per CIVA user → 25K
nodes/year
A potential of $1M Azure spending per year
BENEFITS
Flexibility and enabler of interoperability
between heterogeneous infrastructures
Ability to run large POD (Probability of Detection)
computations, which were taking months on a
single computer
Large-scale simulations with Microsoft Azure cloud
Radiography – Pipes weld inspection
ABOUT CIVA NDE SOLUTION:
Multi-technique (Ultrasound, Eddy current,
Radiography) software platform developed
by the CEA LIST and its partners
The software is distributed by EXTENDE
and its distributors
Eddy current - Simulations
END USERS
Nuclear Energy, Oil & Gas, Aeronautics,
Resource Manager
Scheduler Calendar
Sync
200 to 300 jobs
planned per week
72 000 patient diagnostics
delivered to nurses
Main Benefits
Job Visualization within Calendar
Edit job planning from both
interfaces
Visualize parallel tasks
Visualize task information in one
view
Usage of customer’s external database:
Oracle 11g Database
using Red Hat Hibernate ORM
(Object – Relational – Mapping)
Formerly part of
Task-Centric View Used
Scheduler
Passive
Mediametrie:
TV Audience
Measurement
Scheduler
Active
EC2 Spot Instances
Low costs
EC2 Instances
Regular costs
IaaS
On-Prem
Main Benefits
Deployed On Premise (Capex) or
on a Hosting Service (Opex)
Auto-scaling on infrastructure to
match capacity and demand
Large cost savings by using only
the VMs needed and interruptible
low-cost instances (e.g. EC2 Spot
instances)
CHALLENGES
Process 500 terabytes per year
Flexibility and enabler of interoperability
between heterogeneous services
Job affinity with data location
Transfer sensitive data to the cloud for
processing
RESULTS
Efficient metagenomics pipeline
Granular compute management
User friendly system for maximum utilization
Secure transfers
Simple workflow process definition
Workflow model and data management
Compute migration from on-prem to the
cloud
MAIN DRIVER
REQUIREMENTS
Guidance and support to achieve high
performance
Fit in a hybrid, multi-platform architecture
Integration with R
FlexLM support (license manager)
Remote Visualization for interactive
tasks
COMPANY PROFILE
Industry: BioTech
Product: Metagenomics
Quantitative Metagenomics Platform
for gene profiling and statistical analysis
Domain-specific
Users
Windows
Cluster 1
192 cores
Linux
Cluster 2
366 cores
Scheduler
Web Portal
Total
DNA
QC/Library preparation
SoLiD/Illumina
Sequencing
1TB /
Sequence
Analysis
40TB
Parallel DataBase
Pre, Post Processing of Data Analysis
Flexibility, Speed of Analysis
Granular execution
Fast
Architecture Overview
Paris, Sophia Antipolis, London, San Jose USA @activeeon
contact@activeeon.com
+33 988 777 660
Automate Accelerate & Scale
10K Nodes, 20K Tasks, 1M Jobs

Activeeon - Scale Beyond Limits
