SlideShare a Scribd company logo
1 of 123
Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities
Conferencia Invitada
15 September 2023 at 12:00 CEST
University of Alicante (UA), Departement of Software and Computing Systems (DLSI)
in Sala Ada Lovelace, UA DLSI, Alicante, Spain
Hosted by Elena Lloret Pastor (Profesor/A Titular Universidad, UA DLSI)
150
200
250
300
350
400
450
500
550
16 32 48 64 80
Seconds
to
compute
a
workload
Number of tasks (equals 16 times the SLURM --nodes parameter)
Summarizer workload
1
2
3
4
5
16 32 48 64 80
Workload
scaling
(wall
time)
Number of tasks (equals 16 times the SLURM --nodes parameter)
Summarizer workload
Resources
EuroHPC AI in DAPHNE
and Text Summarization
15 September 2023
@ UA DLSI
Aleš Zamuda
<ales.zamuda@um.si>
Acknowledgement: this work is supported by ARRS programme P2-0041 and EU project no. 957407.
-0.05
0
0.05
0.1
0.15
0.2
0.25
0.3
0.35
0.4
1 10 100 1000 10000
ROUGE-1R
ROUGE-2R
ROUGE-LR
ROUGE-SU4R
Fitness (scaled)
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 1/123
Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities
Introduction & Outline: Aims of this Talk
1 (10 minutes) Part I: Background on DAPHNE Essentials
+ Project presentation: DAPHNE (Integrated Data Analysis
Pipelines for Large-Scale Data Management, HPC, and
Machine Learning)
2 (10 minutes) Part II: Background on EuroHPC Vega and AI
+ Selected recent developments and opportunities with
DAPHNE in EuroHPC deployments like Vega in Maribor
3 (20 minutes) Part III: HPC and AI Opportunities
+ Including container reuse for generative AI
— text summarization, autonomous machines, energy
scheduling, and ecosystems.
4 (10 minutes) Part IV: SORS together and Leadership
+ The leadership approach towards EuroHPC AI in DAPHNE,
+ presented and discussed for potential collaboration,
+ to connect and conclude with take-aways of the talk.
5 (10 minutes) Questions, Misc
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 2/123
Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities
Introduction: Overview (Focus, Use, Scope)
• This contribution focuses
on HPC and AI sors for
EuroHPC and DAPHNE
• and discusses their
collaboration
potential w/ BSC.
• Backgrounds on
HPC deployments’
container reuse
and integration
@EuroHPC scope
are discussed.
• Backgrounds: generative AI [1],
autonomous machines [4], energy
scheduling [3], and ecosystems [2]:
• the Summarizer [1] (language):
generating text summaries,
• UGPP [4] (machine):
generating deep ocean trajectories,
• load balancing of energy power plants
(PSADE@NPdynϵjDE [3]):
generating schedules, and
• EcoMod [2]:
generating procedural 3D models of
natural ecosystems with trees.
[1] A. Zamuda, E. Lloret, Optimizing Data-Driven Models for Summarization as Parallel Tasks. Journal of Computational
Science 42, 101101 (2020).
[2] A. Zamuda, J. Brest. Vectorized Procedural Models for Animated Trees Reconstruction using Differential Evolution.
Information Sciences 278, pp. 1-21, 2014. DOI 10.1016/j.ins.2014.04.037.
[3] A. Glotić, A. Zamuda. Short-term combined economic and emission hydrothermal optimization by surrogate differential
evolution. Applied Energy, 1 March 2015, vol. 141, pp. 42-56. DOI: 10.1016/j.apenergy.2014.12.020
[4] A. Zamuda, J. D. Hernández Sosa, Success history applied to expert system for underwater glider path planning using
differential evolution. Expert Systems with Applications 119, 155-170 (2019).
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 3/123
Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities
Conferencia Invitada
15 September 2023 at 12:00 CEST
University of Alicante (UA), Departement of Software and Computing Systems (DLSI)
in Sala Ada Lovelace, UA DLSI, Alicante, Spain
Hosted by Elena Lloret Pastor (Profesor/A Titular Universidad, UA DLSI)
EuroHPC AI in DAPHNE
and Text Summarization
—
Part I: DAPHNE
Background
—
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 4/123
Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities
DAPHNE Partners
https://daphne-eu.eu
Project Consortium
13 partner institutions
from 7 countries
• DM, ML, HPC
• Academia & industry
• Different application
domains
14
• Technical University Berlin
University of Maribor (UM): UM FERI research team I lead (DAPHNE), SLING connection (EuroHPC Vega).
https://feri.um.si/en/research/international-and-structural-funds-projects/
integrated-data-analysis-pipelines-for-large-scale-data-management-hpc-and-machine-learning/
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 5/123
Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities
DAPHNE: Overview
Overview: Generic Aspect of the Project
• Deployment Challenges
• Hardware Challenges
• DM+ML+HPC share compilation
and runtime techniques /
converging cluster hardware
• End of Dennard scaling:
P = α CFV2 (power density 1)
• End of Moore’s law
• Amdahl’s law: sp = 1/s
 Increasing Specialization
#1 Data
Representations
Sparsity Exploitation
from Algorithms to HW
dense
graph
sparse
compressed
#2 Data
Placement
Local vs distributed
CPUs/
NUMA
GPUs
FPGAs/
ASICs
#3 Data
(Value) Types
FP32, FP64, INT8,
INT32, INT64, UINT8,
BF16, TF32, FlexPoint
[NVIDIA
A100]
 DAPHNE Overall Objective:
Open and extensible system infrastructure
Different
Systems/
Libraries
Dev Teams Programming Models
Resource
Managers
Cluster
Under-
utilization
Data/File
Exchange
3 lessons learnt so far
choices made, methodology
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 6/123
Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities
DAPHNE: Functionalities
y
Functionality Introduction: from Language Abstractions to
Distributed Vectorized Execution and Use Cases
• Federated matrices/frames + distribution primitives
• Hierarchical vectorized pipelines and scheduling
• Coordinator
(spawns distributed fused pipeline)
• #1 Prepare Inputs
(N/A, repartition, broadcasts,
slices broadcasts as necessary)
• #2 Coarse-grained Tasks
(tasks run vectorized pipeline)
• #3 Combine Outputs
(N/A, all-reduce, rbind/cbind)
Node 1
X
[1:
100M]
Node 2
X
[100M:
200M]
colmu
colsd
y
y
(X)
XTX
XTy
dc = DaphneContext()
G = dc.from_numpy(npG)
G = (G != 0)
c = components(G, 100, True).compute()
Python API DaphneLib
def components(G, maxi, verbose) {
n = nrow(G); // get the number of vertexes
maxi = 100;
c = seq(1, n); // init vertex IDs
diff = inf; // init diff to +Infinity
iter = 1;
// iterative computation of connected components
while(diff>0 & iter<=maxi) {
u = max(rowMaxs(G * t(c)), c); // neighbor prop
diff = sum(u != c); // # of changed vertexes
c = u; // update assignment
iter = iter + 1;
}
}
Domain-specific Language DaphneDSL
Multiple dispatch of functions/kernels
Example Use Cases
• DLR Earth Observation
• ESA Sentinel-1/2 datasets  4PB/year
• Training of local climate zone classifiers on So2Sat LCZ42
(15 experts, 400K instances, 10 labels each, ~55GB HDF5)
• ML pipeline: preprocessing,
ResNet-20, climate models
• IFAT Semiconductor Ion Beam Tuning
• KAI Semiconductor Material Degradation
• AVL Vehicle Development Process (ejector geometries, KPIs)
• ML-assisted simulations, data cleaning, augmentation
• Cleaning during exploratory query processing
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 7/123
Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities
DAPHNE: Data Spaces
Contribution to Data Spaces
• How does your project contribute to the Data Spaces
(e.g. use cases combining data spaces and big data/extreme data
analytics, specific functionality / building blocks, contribution to
existing building blocks, specific requirements, etc.)
• focus on the open and extensible infrastructure we are building
• that would enable adaption to Data Spaces as anybody can
extend DAPHNE with custom readers and filters
• furthermore, as we try to bring IDA, ML and HPC communities together,
we can make a case for more efficient in memory processing
if the ingestion and processing happen within the same framework
• additionally, the DAPHNE infrastructure fosters development of
compiler passes that optimize the end to end analytics task
by facilitating operator fusion and reuse of intermediates
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 8/123
Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities
DAPHNE: Interoperability
Contribution to Interoperability
• DAPHNE is open-source software
• https://github.com/daphne-eu/daphne
• Apache v2 license
• Towards an inclusive dev community
 Potential for collaboration in 2023-2024
• Check out our website
• https://daphne-eu.eu
• Follow us on twitter
• @daphne_eu
Enable researchers to
experiment with new
prototypes and extensions
Long-term stability
development available through GitHub
data sets – reproducibility, use cases
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 9/123
Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities
DAPHNE: Data Spaces – Common Framework
Contribution to Data Spaces
• How would a common conceptual/functional framework for Data spaces and Big Data /
Extreme data analytics look like?
• At Big Data technologies and extreme-scale analytics Projects Workshop (Horizon ICT-51- 2020),
organized by Big Data Value Association (BDVA) in collaboration with project EUH4D (European
Federation of Data Driven Innovation Hubs), on September 27, 2022 (online),
the DAPHNE project has been presented by Aleš Zamuda (UM) and Eva Paulusberger (KNOW)
• with the aim to engage for future road-mapping and creating a community around the topic in extreme-scale data
analytics
• during the presentation, we had the opportunity to channel our work towards the activities of BDVA,
• especially in relation to Data Sharing Spaces and standardization.
• we have highlighted the specific technical and non-technical progress of the DAPHNE project, our Use Cases, the
main lessons learnt so far, and contributions to road-mapping activities in the field of extreme-scale data analytics
• the workshop has allowed us to reflect on the value and needs for collaborating with other ICT-51 projects (MORE,
SELMA, VesselAI, EVEREST, and MARVEL), and how to federate our data sets and services under EUH4D
• Framework: published at CIDR 2022
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 10/123
Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities
DAPHNE: Data Spaces – Integrated Architecture
System Architecture
LLVM
Python API w/ lazy evaluation MLIR Dialects,
Extension Catalog
(new data types,
kernels,
scheduling algs)
Sideways Entry,
DSL-level
constraints (data
formats & data/op
placement)
Contribution to Data Spaces
How would a common conceptual/functional framework for
Data spaces and Big Data / Extreme data analytics look like?
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 11/123
Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities
DAPHNE: Vision Paper, Integrating DM + ML + HPC
• Current Status
• System architecture and design
• Initial DSL and Python API
• Prototype of MLIR-based
compiler and runtime
• Vectorized execution
(fused pipelines, scheduling)
• GPU (and FPGA) integration,
BLAS/DNN libraries, I/O primitives
• Standalone distributed runtime w/
different distribution primitives
• Joint Paper on System
Architecture
• Published at CIDR 2022
Contribution Towards common conceptual/functional
framework for Data spaces and
Big Data / Extreme data analytics
 DAPHNE Overall Objective:
Open and extensible system infrastructure
DM + ML + HPC
Recently: reached out to other EU projects towards
interoperability and standardization
• synergies with EVEREST and eFlows4HPC
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 12/123
Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities
DAPHNE: Joint Framework on MLIR
Collaboration on technical aspects – Joint framework
• What are the aspects that interest you the most to work on. Prioritise those
ones of your interest, and address a summarised contribution to those that you
prioritised
• ML/DL systems and system support
• Architectures for gathering heterogeneous data
• System tools, e.g. language, intermediate representation
• Method for extreme-scale analytics, e.g. combination of ML models,
simulations and subsequent data analysis in different use cases
• Standardized interconnection methods, e.g. runtime integration, HPC libraries
• Data fusion and data integration technologies
• Others
Different
Systems/
Libraries
Dev Teams Programming Models
Resource
Managers
Cluster
Under-
utilization
Data/File
Exchange
https://mlir.llvm.org
Based on
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 13/123
Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities
HPC workloads: new DAPHNE Use Cases
Example Use Cases
• DLR Earth Observation
• ESA Sentinel-1/2 datasets  4PB/year
• Training of local climate zone classifiers on So2Sat LCZ42
(15 experts, 400K instances, 10 labels each, ~55GB HDF5)
• ML pipeline: preprocessing,
ResNet-20, climate models
• IFAT Semiconductor Ion Beam Tuning
• KAI Semiconductor Material Degradation
• AVL Vehicle Development Process (ejector geometries, KPIs)
• ML-assisted simulations, data cleaning, augmentation
• Cleaning during exploratory query processing
[So2Sat LC42: https://mediatum.ub.tum.de/1454690]
[Xiao Xiang Zhu et al: So2Sat LCZ42: A
Benchmark Dataset for the Classification of
Global Local Climate Zones. GRSM 8(3) 2020]
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 14/123
Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities
DAPHNE: ICT Rolling Plan for Data and AI Actions
Potential contribution to the Rolling plan
for ICT Standards 2022 (Data and AI actions)
Example identified opportunities:
• IEEE CIS Standards Committee: P2976 - Standard for XAI - eXplainable AI Working Group (CIS/SC/XAI WG),
https://sagroups.ieee.org/1855/
• IEEE CIS Standards Committee: P1849 – Working Group for eXtensible Event Stream (XES) for Achieving
Interoperability in Event Logs and Event Streams, https://standards.ieee.org/ieee/1849/10907/
• IEEE other: https://ethicsinaction.ieee.org/p7000/; other ML&AI:
• IEEE P2841, Framework and Process for Deep Learning Evaluation
• IEEE 3652.1-2020, IEEE Guide for Architectural Framework and Application of Federated Machine Learning,
https://standards.ieee.org/project/3652_1.html
• ISO/IEC JTC 1 SC 42 (AI) - published:
• ISO/IEC 20547-3:2020 Information technology — Big data reference architecture — Part 3: Reference architecture
• ISO/IEC TR 20547-5:2018 Information technology — Big data reference architecture — Part 5: Standards roadmap
• ISO/IEC TR 24029-1:2021 Artificial Intelligence (AI) — Assessment of the robustness of neural networks — Part 1: Overview
• ISO/IEC TR 24030:2021 Information technology — Artificial intelligence (AI) — Use cases
• ISO/IEC 24372 Information technology -- Artificial Intelligence (AI) -- Overview of computational approaches for AI systems
• ISO/IEC JTC 1 SC 42 (AI) – under development:
• ISO/IEC DIS 22989 Artificial Intelligence Concepts and Terminology
• ISO/IEC 23053 Framework for Artificial Intelligence Systems Using Machine Learning
• ISO/IEC 42001 Artificial Intelligence - Management System
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 15/123
Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities
DAPHNE: Standards Opportunities
• Gaps
• Federated matrices/frames + distribution primitives
• Hierarchical vectorized pipelines and scheduling
• Applied within the context of AI and BDV
• Priorities
• Data architectures for federated frames dispatch
Are the Roadmapping activities for extreme-scale data analytics
DAPHNE input to the Rolling plan for ICT
Standards 2023
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 16/123
Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities
DAPHNE: Open Topics
Interest and connection to future discussion topics
and future workshops (Q1 2023)
• Skills gap that should be filled in the future
• MLIR kernels programming for dedicated (pre-release) hardware; and,
design of algorithms in the daphne language with the
DAPHNE architecture in mind
• another gap that DAPHNE addresses is the seamless integration of pipelines on a HPC,
and there are plenty of skill gaps and how to do and fill a skills gap from this
• e.g. development of libraries and executables, but also training and teaching of new programmers in the community
• Trustworthy AI and link to the AI on Demand Platform
• DAPHNE will be able to provide full stack for pipelines integration on the level of a whole HPC,
• which is an important added value for creating an AI on Demand Platform;
• an open-source implementation of the DAPHNE system fosters reproducibility for Trustworthy AI
• Enlarging the group with
new related projects from Horizon Europe and EuroHPC JU
• DAPHNE is open-source software
• https://github.com/daphne-eu/daphne
• Apache v2 license
• DAPHNE is present at several top-tier events
• Latest consortium synergy: TUB projects
• The topic of Data Spaces will be recurrent
during all workshops during 2023
• We are planning, in accordance with the project plan,
to bring the bottom-up developed DAPHNE system closer to the top-down developed use cases
• Thus, we are planning a use case workshop for Q2/2023
• but also to stay in line with the research focus of the research and innovation programme
Federated matrices/frames + distribution primitives
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 17/123
Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities
Conferencia Invitada
15 September 2023 at 12:00 CEST
University of Alicante (UA), Departement of Software and Computing Systems (DLSI)
in Sala Ada Lovelace, UA DLSI, Alicante, Spain
Hosted by Elena Lloret Pastor (Profesor/A Titular Universidad, UA DLSI)
EuroHPC AI in DAPHNE
and Text Summarization
—
Part II: EuroHPC Vega
and AI
—
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 18/123
Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities
EuroHPC Vega & AI
(Vega supercomputer in TOP500)
— A Multimedia Tour —
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 19/123
Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities
TOP500: EuroHPC Vega (tour at ASHPC23)
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 20/123
Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities
TOP500: EuroHPC Vega (tour at ASHPC23)
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 21/123
Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities
TOP500: EuroHPC Vega (tour at ASHPC23)
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 22/123
Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities
TOP500: EuroHPC Vega (tour at ASHPC23)
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 23/123
Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities
AI Challenges Shortlist
(Part II: First subpart)
Faced 5 types of challenges, leading to the needs to apply HPC
architectures for benchmarking state-of-the-art topics in
1 text summarization,
2 forest ecosystem modeling, simulation, and
visualization,
3 underwater robotic mission planning,
4 energy production scheduling for hydro-thermal power
plants, and
5 understanding evolutionary algorithms.
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 24/123
Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities
Challenges 1: Text Summarization (Language)
For NLP (Natural Language Processing),
part of ”Big Data”.
Terms across sentences are determined
using a semantic analysis using both:
• coreference resolution (using WordNet) and
• a Concept Matrix (from Freeling).
INPUT
CORPUS
NATURAL
LANGUAGE
PROCESSING
ANALYSIS
CONCEPTS
DISTRIBUTION
PER
SENTENCES
MATRIX OF
CONCEPTS
PROCESSING
CORPUS
PREPROCESSING
PHASE
OPTIMIZATION
TASKS
EXECUTION
PHASE
ASSEMBLE
TASK
DESCRIPTION
SUBMIT
TASKS
TO PARALLEL
EXECUTION
OPTIMIZER
+
TASK DATA
ROUGE
EVALUATION
The detailed new method called
CaBiSDETS is developed in the HPC
approach comprising of:
• a version of evolutionary algorithm
(Differential Evolution, DE),
• self-adaptation, binarization,
constraint adjusting, and some more
pre-computation,
• optimizing the inputs to define the
summarization optimization model.
Aleš Zamuda, Elena Lloret. Optimizing Data-Driven
Models for Summarization as Parallel Tasks. Journal
of Computational Science, 2020, vol. 42, pp. 101101.
DOI: 10.1016/j.jocs.2020.101101
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 25/123
Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities
Challenges 2: Forest Ecosystem Modeling,
Simulation, and Visualization (Real World / Video)
• HPC need to process spatial data and add procedural
content, generating real-world items for producing a
video of 3D space.
Videos: https://www.youtube.com/watch?list=PL7pmTW8neV7tZf2qx1wV5zbD74sUyHL3B&v=V9YJgYO_sIA
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 26/123
Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities
Challenges 3: Underwater Robotic Mission Planning
• Computational Fluid Dynamics (CFD) spatio-temporal model of the
ocean currents for autonomous vehicle navigation path planning.
• Constrained Differential Evolution Optimization
for Underwater Glider Path Planning
in Sub-mesoscale Eddy Sampling.
• Corridor-constrained optimization: eddy
border region sampling — new challenge
for UGPP & DE.
• Feasible path area is constrained —
trajectory in corridor around the border of
an ocean eddy.
The objective of the glider here is to sample the
oceanographic variables more efficiently,
while keeping a bounded trajectory.
HPC: develop new methods and evaluate them.
Video: https://www.youtube.com/watch?v=4kCsXAehAmU
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 27/123
Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities
Challenges 4: Energy Production Scheduling for
Hydro-thermal Power Plants
A. Glotić, A. Zamuda. Short-term combined economic and emission
hydrothermal optimization by surrogate differential evolution.
Applied Energy, 1 March 2015, vol. 141, pp. 42-56.
DOI: 10.1016/j.apenergy.2014.12.020
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 28/123
Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities
Challenges 5: Understanding Evolutionary Algorithms
• Evolutionary algorithms benchmarking to
understand computational intelligence of
these algorithms (→ storage requirement!),
• aim: Machine Learning to design
an optimization algorithm
(learning to learn).
• Example CI Algorithm Mechanism Design:
Control Parameters Self-Adaptation (in DE).
Video: https://www.youtube.com/watch?v=R244LZpZSG0
Application stacks for real code:
inspired by previous computational
optimization competitions in
continuous settings that used
test functions for
optimization application domains:
• single-objective: CEC 2005,
2013, 2014, 2015
• constrained: CEC 2006, CEC 2007, CEC 2010
• multi-modal: CEC 2010, SWEVO 2016
• black-box (target value): BBOB 2009, COCO
2016
• noisy optimization: BBOB 2009
• large-scale: CEC 2008, CEC 2010
• dynamic: CEC 2009, CEC 2014
• real-world: CEC 2011
• computationally expensive: CEC 2013, CEC
2015
• learning-based: CEC 2015
• 100-digit (50% targets): 2019 joined CEC,
SEMCCO, GECCO
• multi-objective: CEC 2002, CEC 2007, CEC
2009, CEC 2014
• bi-objective: CEC 2008
• many objective: CEC 2018
Tuning/ranking/hyperheuristics use. → DEs as usual
winner algorithms.
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 29/123
Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities
Challenges 6: new DAPHNE Use Cases
Example Use Cases
• DLR Earth Observation
• ESA Sentinel-1/2 datasets  4PB/year
• Training of local climate zone classifiers on So2Sat LCZ42
(15 experts, 400K instances, 10 labels each, ~55GB HDF5)
• ML pipeline: preprocessing,
ResNet-20, climate models
• IFAT Semiconductor Ion Beam Tuning
• KAI Semiconductor Material Degradation
• AVL Vehicle Development Process (ejector geometries, KPIs)
• ML-assisted simulations, data cleaning, augmentation
• Cleaning during exploratory query processing
[So2Sat LC42: https://mediatum.ub.tum.de/1454690]
[Xiao Xiang Zhu et al: So2Sat LCZ42: A
Benchmark Dataset for the Classification of
Global Local Climate Zones. GRSM 8(3) 2020]
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 30/123
Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities
HPC Initiatives
(Part II: Second subpart)
Timeline (as member) of recent impactful HPC initiatives including Slovenia:
• SLING: Slovenian national supercomputing network, 2010-05-03–,
• SIHPC: Slovenian High-Performance Computing Centre, 2016-03-04–
• ImAppNIO: Improving Applicability of Nature-Inspired Optimisation
by Joining Theory and Practice, 2016-03-09–2020-10-31
• cHiPSet: High-Performance Modelling and Simulation for Big Data
Applications, 2015-04-08–2019-04-07,
• HPC RIVR: UPGRADING OF NATIONAL RESEARCH INFRASTRUCTURES,
Investment Program, 2018-03-01–2020-09-15,
• TFoB: IEEE CIS Task Force on Benchmarking, January 2020–,
• EuroCC: National Competence Centres in the framework of EuroHPC,
2020-09-01–(2022-08-31),
• DAPHNE: Integrated Data Analysis Pipelines for Large-Scale Data
Management, HPC, and Machine Learning, 2020-12-01–(2024-11-30).
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 31/123
Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities
Initiatives: SLING, SIHPC, HPC RIVR, EuroCC
• There is a federated and orchestrated aim
towards HPC infrastructure in Slovenia,
especially through:
• SLING: Slovenian national supercomputing network
→ has federated the initiative push towards
orchestration of HPC resources across the country.
• SIHPC: Slovenian High-Performance Computing Centre
→ has orchestrated the first EU funds application
towards HPC Teaming in the country
(and Participation of Slovenia in PRACE 2).
• HPC RIVR: UPGRADING OF NATIONAL RESEARCH
INFRASTRUCTURES, Investment Program
→ has provided an investment in experimental HPC
infrastructure.
• EuroCC: National Competence Centres in the framework of
EuroHPC
→ has secured a National Competence Centre (EuroHPC).
Vega supercomputer online
Consortium Slovenian High-Performance Computing Centre
Aškerčeva ulica 6
SI-1000 Ljubljana
Slovenia
Ljubljana, 22. 2. 2017
prof. dr. Anwar Osseyran
PRACE Council Chair
Subject: Participation of Slovenia in PRACE 2
Dear professor Osseyran,
In March 2016 a consortium Slovenski superračunalniški center (Slovenian High-Performance
Computing Centre - SIHPC) was established with a founding act (Attachment 1) where article 6
claims that the consortium will join PRACE and that University of Ljubljana, Faculty of
mechanical engineering (ULFME) will represent the consortium in the PRACE. The legal
representative of ULFME in the consortium and therefore, also in the PRACE (i.e., the delegate
in PRACE Council with full authorization) is prof. dr. Jožef Duhovnik, as follows from
appointment of the dean of ULFME (Attachment 2).
The consortium has also agreed that it joins PRACE 2 optional programme as contributing GP.
With best regards,
assist. prof. dr. Aleš Zamuda, vice-president of SIHPC
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 32/123
Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities
Initiatives: ImAppNIO, cHiPSet, TFoB, DAPHNE
Aim towards software to run HPC and improve capabilities:
• ImAppNIO: Improving Applicability of Nature-Inspired Optimisation
by Joining Theory and Practice,
→ improve capabilities through benchmarking (to understand (and to
learn to learn)) CI algorithms
• cHiPSet: High-Performance Modelling and Simulation for Big Data
Applications,
→ include HPC in Modelling and Simulation (of the process to be
learned)
• TFoB: IEEE CIS Task Force on Benchmarking,
→ includes CI benchmarking opportunities, where HPC would enable
new capabilities.
• DAPHNE: Integrated Data Analysis Pipelines for Large-Scale Data
Management, HPC, and Machine Learning.
→ to define and build an open and extensible system infrastructure
for integrated data analysis pipelines, including data management and
processing, high-performance computing (HPC), and machine learning
(ML) training and scoring https://daphne-eu.github.io/
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 33/123
Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities
EuroHPC Vega &
Deploying DAPHNE
(Part II: Third subpart)
MODA (Monitoring and Operational Data Analytics) tools for
• collecting, analyzing, and visualizing
• rich system and application data, and
• my opinion on how one can make sense of the data for
actionable insights.
• Explained through previous examples:
from a HPC User Perspective.
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 34/123
Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities
MODA Actionable Insights, Explained From a HPC
User Perspective, Through the Example of
Summarization
Most interesting findings of summarization on HPC example
are
• the fitness of the NLP model keeps increasing with prolonging
the dedicated HPC resources (see below) and that
• the fitness improvement correlates with ROUGE evaluation in
the benchmark, i.e. better summaries.
-0.05
0
0.05
0.1
0.15
0.2
0.25
0.3
0.35
0.4
1 10 100 1000 10000
ROUGE-1R
ROUGE-2R
ROUGE-LR
ROUGE-SU4R
Fitness (scaled)
Hence, the use of HPC
significantly
contributes to
capability of this NLP
challenge.
However, the MODA insight also provided the
useful task running times and resource
usage.
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 35/123
Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities
Running the Tasks on HPC: ARC Job Preparation
Parallel summarization tasks on grid through ARC.
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 36/123
Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities
Running the Tasks on HPC: ARC Job Submission,
Results Retrieval & Merging [JoCS2020]
Through an HPC
approach and by
parallelization of tasks,
a data-driven
summarization model
optimization yields
improved benchmark
metric results (drawn
using gnuplot merge).
MODA is needed
to run again and
improve upon, to
forecast how to
set required task
running time and
resources
(predicting system
response).
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 37/123
Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities
Monitoring and Operational Data Analytics
• Monitor used (jobs, CPU/wall time, etc.):
Smirnova, Oxana. The Grid Monitor. Usage manual,
Tech. Rep. NORDUGRID-MANUAL-5, The NorduGrid
Collaboration, 2003.
http://www.nordugrid.org/documents/
http://www.nordugrid.org/manuals.html
http://www.nordugrid.org/documents/monitor.pdf
• Deployed at:
www.nordugrid.org/monitor/
• NorduGrid Grid Monitor
Sampled: 2021-06-28 at 17-57-08
• Nation-wide in Slovenia:
https://www.sling.si/gridmonitor/loadmon.php
http://www.nordugrid.org/monitor/index.php?
display=vo=Slovenia
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 38/123
Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities
MODA Example From: ARC at Jost
Example experiments from DOI: 10.1016/j.swevo.2015.10.007 (SPSRDEMMS)
– job YYULDmGOXpmnmmR0Xox1SiGmABFKDmABFKDmrtMKDmABFKDm66faPo.
Sample ARC file gridlog/diag (2–3 day Wall Times).
runtimeenvironments=APPS/ARNES/MPI−1.6−R ;
CPUUsage=99%
MaxResidentMemory=5824kB
AverageUnsharedMemory=0kB
AverageUnsharedStack=0kB
AverageSharedMemory=0kB
PageSize=4096B
MajorPageFaults=4
MinorPageFaults=1213758
Swaps=0
ForcedSwitches=36371494
WaitSwitches=170435
Inputs=45608
Outputs=477168
SocketReceived=0
SocketSent=0
Signals =0
nodename=wn003 . arnes . s i
WallTime=148332s
Processors=16
UserTime=147921.14s
KernelTime =2.54 s
AverageTotalMemory=0kB
AverageResidentMemory=0kB
LRMSStartTime=20150906104626Z
LRMSEndTime=20150908035838Z
exitcode=0
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 39/123
Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities
EuroCC HPC: Vega (TOP500 #106, HPCG #56 — June 2021)
• Researchers apply to EuroHPC JU calls for access.
• Regular calls opened in 2021 fall (Benchmark & Development).
• https://prace-ri.eu/benchmark-and-development-access-information-for-applicants/
• 60% capacities for national share (70% OA, 20% commercial, 10% host (community, urgent
priority of national importance, maintenance)) + 35% EuroHPC JU share (approved applications)
• Has a SLURM dev partition for SSH login (SLURM partitions w/ CPUs: login[0001-0004]=128;
login[0005-0008]=64; cn[0001-384,0577-0960]=256; cn[0385-0576]=256; gn[01-60]=256).
Listing 1: Setting up at Vega — slurm dev partition access (login).
1 [ ales . zamuda@vglogin0007 ˜]$ s i n g u l a r i t y pull qmake . s i f docker : / / ak352 /qmake−opencv
2 [ ales . zamuda@vglogin0007 ˜]$ s i n g u l a r i t y run qmake . s i f bash
3 cd sum; qmake ; make clean ; make
4
5 [ ales . zamuda@vglogin0007 ˜]$ cat runme . sh
6 # ! / bin / bash
7 cd sum && time mpirun 
8 −
−mca btl openib warn no device params found 0 
9 . / summarizer 
10 −
−useBinaryDEMPI −
−i n p u t f i l e mRNA−1273−t x t 
11 −
−withoutStatementMarkersInput 
12 −
−printPreprocessProgress calcInverseTermFrequencyndTermWeights 
13 −
−printOptimizationBestInGeneration 
14 −
−summarylength 600 −
−NP 200 
15 −
−GMAX 400 
16 > summarizer . out . $SLURM PROCID 
17 2> summarizer . err . $SLURM PROCID
Text summarization/generation systems
are getting more and more useful
and accessible on deployed systems
(e.g. OpenAI’s ChatGPT, Microsoft’s Bing AI part,
NVIDIA’s (Fin)Megatron, BLOOM,
LaMDA, BERT, VALL-E, Point-E, etc.). -0.65
-0.6
-0.55
-0.5
-0.45
-0.4
-0.35
-0.3
-0.25
-0.2
-0.15
-0.1
1 10 100
Evaluation
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 40/123
Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities
MODA at First EuroCC HPC Vega Supercomputer
Listing 2: Runnig at Vega & MODA.
1 ===================================================================== GMAX=200 =====
2 [ ales . zamuda@vglogin0002 ˜]$ srun −
−cpu−bind=cores −
−nodes=1 −
−ntasks−per−node=101 
3 −
−cpus−per−task=2 −
−
mem=180G s i n g u l a r i t y run qmake . s i f bash
4 srun : job 4531374 queued and waiting for resources
5 srun : job 4531374 has been allocated resources
6 [ ”$SLURM PROCID” = 0 ] && . / runme . sh
7 real 5m22.475 s
8 user 484m42.262 s
9 sys 1m38.304 s
10 ===================================================================== NODES=51 =====
11 [ ales . zamuda@vglogin0002 ˜]$ srun −
−cpu−bind=cores −
−nodes=1 −
−ntasks−per−node=51 
12 −
−cpus−per−task=2 −
−
mem=180G s i n g u l a r i t y run qmake . s i f bash
13 srun : job 4531746 queued and waiting for resources
14 srun : job 4531746 has been allocated resources
15 [ ”$SLURM PROCID” = 0 ] && . / runme . sh
16 real 13m57.851 s
17 user 431m25.833 s
18 sys 0m29.272 s
19 ===================================================================== GMAX=400 =====
20 [ ales . zamuda@vglogin0002 ˜]$ srun −
−cpu−bind=cores −
−nodes=1 −
−ntasks−per−node=101 
21 −
−cpus−per−task=2 −
−
mem=180G s i n g u l a r i t y run qmake . s i f bash
22 srun : job 4532697 queued and waiting for resources
23 srun : job 4532697 has been allocated resources
24 [ ”$SLURM PROCID” = 0 ] && . / runme . sh
25 real 6m14.687 s
26 user 590m45.641 s
27 sys 1m40.930 s
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 41/123
Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities
More Output: SLURM accounting
Listing 3: Example accounting tool at Vega: sacct.
[ ales . zamuda@vglogin0002 ˜]$ sacct
4531374. ext+ extern vega−users 202 COMPLETED 0:0
4531746. ext+ extern vega−users 102 COMPLETED 0:0
4532697. ext+ extern vega−users 202 COMPLETED 0:0
[ ales . zamuda@vglogin0002 ˜]$ sacct −j 4531374 −j 4531746 −j 4532697 
−o MaxRSS , MaxVMSize , AvePages
MaxRSS MaxVMSize AvePages
−
−
−
−
−
−
−
−
−
− −
−
−
−
−
−
−
−
−
− −
−
−
−
−
−
−
−
−
−
0 217052K 0
26403828K 1264384K 22
0 217052K 0
13325268K 1264380K 0
0 217052K 0
26404356K 1264384K 30
Future MODA testings:
• testing the web interface for job analysis (as available from HPC RIVR);
• profiling MPI inter-node communication;
• use profilers and monitoring tools available
— in the context of heterogeneous setups, like e.g.
• TAU Performance System — http://www.cs.uoregon.edu/research/tau/home.php,
• LIKWID Performance Tools — https://hpc.fau.de/research/tools/likwid/.
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 42/123
Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities
Deploying DAPHNE on Vega
Main documentation file:
Deploy.md
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 43/123
Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities
SLURM
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 44/123
Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 45/123
Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 46/123
Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities
More HPC User Perspective Nation-wide in Slovenia
More: at University of Maribor, Bologna study courses for
teaching (training) of Computer Science at cycles — click URL:
• level 1 (BSc)
• year 1: Programming I – e.g. C++ syntax
• year 2: Computer Architectures – e.g. assembly, microcode, ILP
• year 3: Parallell and Distributed Computing – e.g. OpenMP, MPI, CUDA
• level 2 (MSc)
• year 1: Cloud Computing Deployment and Management – e.g. arc, slurm,
Hadoop, containers (docker, singularity) through
virtualization
• level 3 (PhD)
• EU and other national projects research:
HPC RIVR, EuroCC, DAPHNE, ... – e.g. scaling new systems
of CI & Operational Research of ... over HPC
• IEEE Computational Intelligence Task Force on Benchmarking
• Scientific Journals (e.g. SWEVO, TEVC, JoCS, ASOC, INS)
These contribute towards Sustainable Development of HPC.
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 47/123
Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities
Conferencia Invitada
15 September 2023 at 12:00 CEST
University of Alicante (UA), Departement of Software and Computing Systems (DLSI)
in Sala Ada Lovelace, UA DLSI, Alicante, Spain
Hosted by Elena Lloret Pastor (Profesor/A Titular Universidad, UA DLSI)
EuroHPC AI in DAPHNE
and Text Summarization
—
Part III: HPC and AI
Generative Models
—
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 48/123
Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities
Conferencia Invitada
15 September 2023 at 12:00 CEST
University of Alicante (UA), Departement of Software and Computing Systems (DLSI)
in Sala Ada Lovelace, UA DLSI, Alicante, Spain
Hosted by Elena Lloret Pastor (Profesor/A Titular Universidad, UA DLSI)
EuroHPC AI in DAPHNE
and Text Summarization
—
Part I: Generative AI
—
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 49/123
Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities
Generative AI — Modalities & Access (HPC, H100)
• Generative AI (GenAI) is being
used for modalities such as
• text generation using
Transformers (like
ChatGPT),
• image generation using
Stable Diffusion (like
Midjouney and DALL-E),
• and video speech
generation (like Synthesia)
• GenAI provided recent interesting applications served by
HPC deployments (supported by e.g. NVIDIA H100).
• Therefore, two of my models for Generative AI,
• from Summarizer and TPP-PSADE@NPdynϵjDE,
• extended to support HPC deployment using MPI,
• are described in following & some results are presented.
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 50/123
Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities
Generative AI — Some Background
• Early Learning to Learn, Google DeepMind after AlphaZero, deep RL algorithms
• https://gecco-2019.sigevo.org/index.html/Keynotes [@aleszamuda/status/1150672932588462081: ”Learning to learn ...”]
• Recent: with Reinforcement Learning (RL) trained Large Language Models (LLMs)
using Deep Neural Networks (DNNs) — Transformers (replacing RNN LSTMs; by Google —
2017, Attention Is All You Need: https://arxiv.org/abs/1706.03762, Submitted on 12 Jun 2017 (v1) — for NIPS’17 in December
(Jakob proposed replacing RNNs with self-attention and startedthe effort to evaluate this idea))
• A deployed LLM (Free Research
Preview of ChatGPT May 24
Version, 2023.) GPT-4 Technical Report:
https://arxiv.org/pdf/2303.08774.pdf
• Sample LLM code (Transformers by Hugging
Face), using Python3, AutoTokenizer, and
google/flan-t5-base
Transformers
architecture
Wikipedia (CC BY-SA
3.0), File:The-
Transformer-model-
architecture.png
• My GenAI backgrounds come from (evolutionary) generation of 3D scenery sequences (animation, AL — Artificial Life)
• In my 2020 journal article published with University of Alicante (w/ Elena Lloret), we
demonstrated HPC importance in NLP performance impact (Summarizer — developed on SLING)
• cites e.g. Salesforce Research’s NN paper on A Deep Reinforced Model for Abstractive Summarization, Submitted on 11 May
2017 (v1), https://arxiv.org/abs/1705.04304
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 51/123
Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities
Conferencia Invitada
15 September 2023 at 12:00 CEST
University of Alicante (UA), Departement of Software and Computing Systems (DLSI)
in Sala Ada Lovelace, UA DLSI, Alicante, Spain
Hosted by Elena Lloret Pastor (Profesor/A Titular Universidad, UA DLSI)
EuroHPC AI in DAPHNE
and Text Summarization
—
Part III: Language (1)
—
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 52/123
Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities
HPC Application 1:
Text Summarization
• NLP and computational linguistics for Text Summarization:
• Multi-Document Text Summarization is a hard CI challenge.
• Basically, an evolutionary algorithm is applied for
summarization,
• it is a state-of-the-art topic of text summarization for NLP (part of
”Big Data”) and presented as a collaboration [JoCS2020],
acknowledging several efforts.
• we add: self-adaptation of optimization control parameters;
analysis through benchmarking using HPC, and
apply additional NLP tools.
• How it works: for the abstract, sentences from original text are
selected for full inclusion (extraction).
• To extract a combination of sentences:
• can be computationally demanding,
• we use heuristic optimization,
• the time to run optimization can be limited.
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 53/123
Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities
1 – Preprocessing (environment sensing, knowledge
representation) (1/2)
1) The files of documents are each taken through the following
process using NLP (Natural Language Processing) tools:
INPUT
CORPUS
NATURAL
LANGUAGE
PROCESSING
ANALYSIS
CONCEPTS
DISTRIBUTION
PER
SENTENCES
MATRIX OF
CONCEPTS
PROCESSING
CORPUS
PREPROCESSING
PHASE
OPTIMIZATION
TASKS
EXECUTION
PHASE
ASSEMBLE
TASK
DESCRIPTION
SUBMIT
TASKS
TO PARALLEL
EXECUTION
OPTIMIZER
+
TASK DATA
ROUGE
EVALUATION
2) For each document is D, sentences are indexed using NLP
tools.
• Terms across sentences are determined using a semantic
analysis using both:
• coreference resolution (using WordNet) and
• a Concept Matrix (from Freeling).
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 54/123
Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities
1 – Preprocessing (environment sensing, knowledge
representation) (2/2)
• 3) For each i-th term (wi), during indexing
• number of occurences in the text is gathered, and
• number of occurences (nk) of a term in some k-th
statement,
• 4) For each term wi in the document, inverse frequency:
isfw i = log(
n
nk
),
• where n denotes number of statements in the document,
and
• nk number of statements including a term wi.
• 5) To conclude preprocessing, for each term in the
document, a weight is calculated:
wi,k = tfi,kisfk,
where tfik is number of occurences (term frequency) of a
term wk in a statement si.
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 55/123
Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities
2 – Summary Optimization (1/3)
• Sentence combination (X) is optimized using jDE
algorithm:
• as a 0/1 knapsack problem, we want to include optimal
selection of statements in the final output
• an i-th sentence si is selected (xi = 1) or unselected
(xi = 0).
• a) Price of a knapsack (its fitness) should be maximized,
• the fitness represents a ratio between content coverage,
V(X), and redundancy, R(X):
f(X) =
V(X)
R(X)
,
• considering a constraint: the summary length is L ± ϵ
words.
• Constraint handling with solutions:
• each feasable solution is better than unfeasable,
• unfeasable compared by constraint value (lower better),
• feasable compared by fitness (higher better).
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 56/123
Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities
2 – Summary Optimization (2/3)
• b) Content coverage V(X) is computed as a double sum of
similarities (defined at d)):
V(X) =
n−1
X
i=1
n
X
j=i+1
(sim(si, O) + sim(sj, O))xi,j,
• where xi,j denotes inclusion of both statement, si, and sj,
• xi,j is only 1 if xi = xj = 1, otherwise 0,
• and O is a vector of average term weights wi,k:
O = (o1, o2, ..., om) for all i = {1..m} different text terms:
oi =
Pn
j=1 wi,j
n
.
• c) Redundance R(X) is also measured as double similarity
(defined at d)) sum for all statements:
R(X) =
n−1
X
i=1
n
X
j=i+1
sim(si, sj)xi,j,
• where xi,j denotes inclusion of both statement, si, and sj,
• again, xi,j is only 1 if xi = xj = 1, otherwise 0.
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 57/123
Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities
2 – Summary Optimization (3/3)
• d) Similarity between statements si = [wi,1, wi,2, ..., wi,m]
and sj = [wj,1, wj,2, ..., wj,m] is computed:
sim(si, sj) =
m
X
k=1
wi,kwj,k
qPm
k=1 wi,kwi,k
Pm
k=1 wj,kwj,k
,
where wi,k is term weight (defined in 5)) and m number of
all terms in text.
• e) When concluded:
• the selected statements from the best assessed
combination are printed,
• in order as they appear in the text, and
• the summary is stored.
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 58/123
Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities
Summary Optimization — Algorithm Pseudocode
The detailed new method called
CaBiSDETS is developed in the
HPC approach comprising of:
• a version of evolutionary
algorithm (Differential
Evolution, DE),
• self-adaptation, binarization,
constraint adjusting, and
some more pre-computation,
• optimizing the inputs to
define the summarization
optimization model.
Aleš Zamuda, Elena Lloret. Optimizing
Data-Driven Models for Summarization
as Parallel Tasks. Journal of
Computational Science, 2020, vol. 42, pp.
101101. DOI 10.1016/j.jocs.2020.101101.
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 59/123
Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities
Running the Tasks on HPC: ARC Job Preparation &
Submission, Results Retrieval & Merging [JoCS2020]
Through an HPC approach and by parallelization of tasks,
a data-driven summarization model optimization yields
improved benchmark metric results (drawn using gnuplot merge).
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 60/123
Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities
Results Published in Journal of Computational Science
The most interesting finding of the HPC study though is that
• the fitness of the NLP model keeps increasing with
prolonging the dedicated HPC resources (see below) and
that
• the fitness improvement correlates with ROUGE
evaluation in the benchmark, i.e. better summaries.
-0.05
0
0.05
0.1
0.15
0.2
0.25
0.3
0.35
0.4
1 10 100 1000 10000
ROUGE-1R
ROUGE-2R
ROUGE-LR
ROUGE-SU4R
Fitness (scaled)
Hence, the use of HPC significantly contributes to
capability of this NLP challenge.
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 61/123
Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities
Summarization Implementation — Parallel
• In text summarization,
once the text model has been built,
• different lengths of summaries can be created with
Summarizer using CaBiSDETS algorithm in parallel.
• Generates large summaries
(more output tokens than single prebuilt LLMs)
& accepts long task inputs (no pre-clustering).
• Results are explainable, tracable (no hallucinated
content/citations), and automatic (no manual RL scoring).
• But moreover from HPC & Big-Data perspective,
• the input text can be preprocessed in parallel
• by computing the cosine sentence similarity pairs
in parallel using MPI in an integrated pipeline,
• before the generation of summaries commences and
• just before a complete parameterization of the summary
generation process.
• During summary optimization,
the fitness function evaluations can be run in parallel.
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 62/123
Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities
Summarization Task Configuration & Execution
• An example task for Summarizer computational architecture
• based on a recent lecturing material,
PDF −→ text (language), on:
• Optimization Algorithms and Autonomous Systems
• Las Palmas de Gran Canaria, March 2023.
• https://www.slideshare.net/AlesZamuda/ulpgc2023erasmuslecturesaleszamuda
systemstheoryintelligentautonomoussystemseswamllsgohpcdeugpppdf
• for Summarizer runs, the parameters were:
--GMAX 1000
--NP 319
--summarylength 500
--epsilonLengthSummary 20
• For the job execution, the configuration for SLURM was:
srun --ntasks-per-node=16
--mpi=pmix
./summarizer.sif
./summarizer
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 63/123
Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities
Summarization Job Speedup Results
• Speedup comparison: SLURM --nodes parameter = 1, 2, 3, 4, or 5
• obtained timings on this scaling as seen on the graphs below.
150
200
250
300
350
400
450
500
550
16 32 48 64 80
Seconds
to
compute
a
workload
Number of tasks (equals 16 times the SLURM --nodes parameter)
Summarizer workload
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
16 32 48 64 80
Seconds
to
compute
a
workload
Number of tasks (equals 16 times the SLURM --nodes parameter)
Summarizer workload
1
2
3
4
5
16 32 48 64 80
Workload
scaling
(wall
time)
Number of tasks (equals 16 times the SLURM --nodes parameter)
Summarizer workload
Resources
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 64/123
Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities
Conferencia Invitada
15 September 2023 at 12:00 CEST
University of Alicante (UA), Departement of Software and Computing Systems (DLSI)
in Sala Ada Lovelace, UA DLSI, Alicante, Spain
Hosted by Elena Lloret Pastor (Profesor/A Titular Universidad, UA DLSI)
EuroHPC AI in DAPHNE
and Text Summarization
—
Part III: Video (2)
—
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 65/123
Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities
Introduction and Main Goals
• Plant animation in emergent ecosystems
• Plant morphology reconstruction
• from real photography (through evolutionary optimization)
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 66/123
Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities
Ecosystem Animation and Simulation:
Thousands of Trees
1: algorithm ecosystem simulation
Require: v - plant species list;
r - plant list for each species;
f - living condition factors on terrain;
Ensure: ecosystem afforestation simulation
2: loop
3: add new plants to species(v, r);
4: grow all plants(r, f);
5: remove dead plants(r);
6: end loop
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 67/123
Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities
Ecosystem Afforestation: Terrain Models
• Tree models are put to terrain based on ecosystem
growth
pi,k =

xi yi,k zk
T
, i, k ∈ [0, 99].
• Power: fitness (height, slope, moisture, sunniness, windiness), age,
growth.
as,p = vs,phs,par;p, ar;p =
tf;s
max
s
{tf;s}
, vs,p =
ky;s,p + ms,p + ls,p + ws,p + ss,p
5
.
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 68/123
Introduction DAPHNE Background EuroHPC Vega  AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC  GenAI Language Video Machine Power Opportunities
Another Ecosystem Scenario: Interactive Breeding
Motivation: interactive ecosystems breeding (von Mammen, 2009)
Vir: von Mammen (2009)
The approach – 6 modeled operators in EcoMod.
Operators of selection, crossover, and mutation.
Operators over terrain and environmental conditions.
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 69/123
Introduction DAPHNE Background EuroHPC Vega  AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC  GenAI Language Video Machine Power Opportunities
Spatial Morphological Tree Model Reconstruction
• New approach for construction of trees
• three-dimensional spatial models,
• in computer graphics and animation,
• the user had to sketch basic branches.
• Our tree reconstruction includes:
• evolutionary algorithms and
• procedural modelling of trees.
Source:
→ CEC 2009
• An L-systems approach used
procedural models in a 2D plane, we
extend it:
• on 3D procedural models and
• more complex trees.
• Our approach combines open-sources:
• ecosystem framework EcoMod and
• algorithm MOjDE (DEMOwSA + jDE).
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 70/123
Introduction DAPHNE Background EuroHPC Vega  AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC  GenAI Language Video Machine Power Opportunities
Woody Plants Procedural Model
• 3D tree models are compactly represented using a procedure
• our EcoMod framework uses a numerically coded procedural
model with fixed dimensionality
• suitable for parameter estimation using DE/MOjDE.
• Parameterized procedural model builds a 3D structure of a tree
and all its building parts:
• by recursively executing a fixed procedure,
• over a set of numerically coded input
parameters,
• e.g. branch thickness, relative length, and
branching.
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 71/123
Introduction DAPHNE Background EuroHPC Vega  AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC  GenAI Language Video Machine Power Opportunities
Trees Representable by EcoMod Framework
• Foliage or coniferous trees with very different branching
structures,
• each branch and each leaf can be animated in real time to show
the growth of a tree or its sway in the wind.
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 72/123
Introduction DAPHNE Background EuroHPC Vega  AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC  GenAI Language Video Machine Power Opportunities
1: procedure branchsegment(g, w, S0, L0, l0, M0, M−1
m;0)
Require: g, w - Gravelius and Weibull index of base branch; S0 - number of strands in base branch; L0, l0 - base
branch relative and actual length; M0 - base branch coordinate system; M−1
m;0
- inverse matrix of rotations
for gravimorphism in coordinate system for base branch; global (i.e. part of breeder) kd, kc, ltype, k
g,w
s ,
Mg,w
, mg,w
, k
g,w
l
, α
g,w
m , αg,w
, t, kf, ws, wg
Ensure: rendered tree image
2: d := kd
p
S0; {thickness calculation from Mandelbrot}
3: render base branch(M0, l0, d);
4: if S0 = 1 then
5: render leaves(ltype); return;
6: end if
7: S1 :=
l
1 + k
g,w
s (S0 − 2)
m
, S2 = S0 − S1; {strands}
8: r1 := max

min
r
S1
S0
, Mg,w

, mg,w

; {branch length}
9: r2 := max

min
r
S2
S0
, Mg,w

, mg,w

;
10: L1 := r1L0, L2 := r2L0; {relative lengths of subbranches}
11: l1 := k
g,w
l
L1, l2 := k
g,w
l
L2; {active subbranch lengths}
12: α1 := kc
r
S2
S0
αg,w
, α2 := αg,w
− α1; {branching angles}
13: M1 := Rz(α1)Ry(αp)Ry×ym (α
g,w
m )Ty(l0)M0; {transform}
14: M2 := Rz(α2)Ry(αp)Ry×ym (α
g,w
m )Ty(l0)M0;
15: M−1
m;1
:= Ry×ym (−α
g,w
m )Ry(−αp)Rx(−αx(t))Rz(−α1 − αz(t))M−1
m;0
; {refreshing inverse matrix}
16: M−1
m;2
:= Ry×ym (−α
g,w
m )Ry(−αp)Rx(−αx(t))Rz(−α2 − αz(t))M−1
m;0
;
17: branchsegment(g + 1, w + 1, S2, L2, l2, M2, M−1
m;2
); {minor branch development}
18: branchsegment(g, w + 1, S1, L1, l1, M1, M−1
m;1
); {major branch development}
19: return; {from recursive procedure call}
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 73/123
Introduction DAPHNE Background EuroHPC Vega  AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC  GenAI Language Video Machine Power Opportunities
Ecosystem Afforestation Geometry Video
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 74/123
Introduction DAPHNE Background EuroHPC Vega  AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC  GenAI Language Video Machine Power Opportunities
Image-based Approaches to Automatic Tree Modeling
• Image-based approaches have the best potential to
produce realistically looking plants
• they rely on images of real plants.
• Little work has been done to design trees with the use of a
general reconstruction from images without user
interaction
• use of sketch based guide techniques or
• the procedural models reconstructed were
two-dimensional.
• We now extended this recognition to the domain of 3D
procedural models
• suitable to model woody plants without user interaction.
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 75/123
Introduction DAPHNE Background EuroHPC Vega  AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC  GenAI Language Video Machine Power Opportunities
Tree Model Reconstruction Innovization Using
Multi-objective Differential Evolution
• Based on an optimization procedure with three main
parts:
• Part I: genotype encoding,
• Part II: genotype-phenotype mapping, and
• Part III: fitness evaluation:
• phenotype and reference image comparison.
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 76/123
Introduction DAPHNE Background EuroHPC Vega  AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC  GenAI Language Video Machine Power Opportunities
Basic Genotype Encoding
• An individual genotype vector x of a DE population
represents a set of procedural model parameters,
• by computing recursive procedure using a set of
parameters, EcoMod renders a tree (woody plant),
• dimensionality of the genotype x is D = 4509,
• where g ∈ {0, G = 15}, w ∈ {0, W = 50}, and
• each local G × W = 750 real-coded parameter encodes:
one matrix of a Gravelius and Weibull ordered parameter
for recursive calculations, and
• all xi,j ∈ [0, 1], i ∈ {1, 2, ..., NP} and j ∈ {1, 2, ..., D} are
linearly normalized by scaling in the [0,1] interval.
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 77/123
Introduction DAPHNE Background EuroHPC Vega  AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC  GenAI Language Video Machine Power Opportunities
Bounds and Scaling of Genotype-encoded Parameters
xi,j
Parameter Formula Interval
Number of strands in a tree (tree com-
plexity)
S = 400xi,0 + 10 S ∈ [10, 410]
Height of base trunk l
0,0
0
= 10xi,1 l
0,0
0
∈ [0 m, 10 m]
Coefficient of branch thickness kd = 0.05xi,2 kd ∈ [0, 0.05]
Phyllotaxis angle αp = 360xi,3 αp ∈ [0◦
, 360◦
]
Branching ratio of subbranch strands
distribution
k
g,w
s = 0.5xi,j + 0.5, ∀j ∈ [4, 753] k
g,w
s ∈ [ 1
2
, 1]
Branching angle between dividing sub-
branches
αg,w
= 180xi,j ∀j ∈ [754, 1503] αg,w
∈ 0◦
, 180◦
Maximum relative sub-branch to base
branch length
Mg,w
= 20xi,j ∀j ∈ [1504, 2253] Mg,w
∈ [0, 20]
Minimum relative sub-branch to base
branch length
mg,w
= 20xi,j ∀j ∈ [2254, 3003] mg,w
∈ [0, 20]
Branch length scaling factor k
g,w
l
= 20xi,j, ∀j ∈ [3004, 3753] k
g,w
l
∈ [0, 20]
Gravicentralism impact kc = xi,3754 kc ∈ [0, 1]
Gravimorphism impact (i.e. gravitational
bending of branches)
α
g,w
m = 360xi,j − 180, ∀j ∈ [3755, 4504] α
g,w
m ∈ [−180◦
, 180◦
]
Enabling leaves display on a tree Bl = xi,4505  0.5?0 : 1 Bl ∈ {0, 1}
Size of leaves ll = 0.3xi,4506 ll ∈ [0, 0.3]
Density of leaves ρl = 30xi,4507 ρl ∈ [0, 30]
Leaf distribution type ltype = 5xi,4508 Spiral, Stacked, Stagg-
ered, Bunched, or Conif-
erous
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 78/123
Introduction DAPHNE Background EuroHPC Vega  AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC  GenAI Language Video Machine Power Opportunities
Advanced Genotype Encoding:
Auxiliary Local Parameters get Vectorized
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 79/123
Introduction DAPHNE Background EuroHPC Vega  AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC  GenAI Language Video Machine Power Opportunities
Genotype-phenotype Mapping
• Reconstruction method is based on reconstruction of
two-dimensional images of woody plants z∗ (photo),
• to compare the three-dimensional tree evolved with the
use of genotype x to the reference image z∗, genotype x
must be transformed to its phenotype first,
• phenotype is a rendered two-dimensional image z,
• images z∗
and z are all of dimensionality X × Y pixels,
• the reference image is scaled to the given resolution,
if necessary.
• both images are converted to black and white, where
white (0) pixels mark background and black (1) pixels mark
material, e.g. wood,
• An evolved procedural model is rendered for comparison
twice
• to favor three-dimensional procedural models generation,
• projections differ by β = 90◦
camera view angle along the
trunk base (i.e. z axis for OpenGL).
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 80/123
Introduction DAPHNE Background EuroHPC Vega  AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC  GenAI Language Video Machine Power Opportunities
Phenotype and Reference Image Comparison
• The recognition success is measured by similarity of
• the reference original images (2D) and
• the rendered image (2D) projections of evolved
parametrized procedural models.
• Images are compared pixel-wise by e.g. two criteria:
1 in the evolved image, for each pixel rendered as material
(1):
• the Manhattan distance to the nearest material pixel in the
reference image is computed
• and vice-versa, for each material (1) pixel of an evolved
model image,
2 count of differing pixels (0/1) among comparing images.
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 81/123
Introduction DAPHNE Background EuroHPC Vega  AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC  GenAI Language Video Machine Power Opportunities
1: procedure MO reconstruction(z∗)
Require: S0 - maximum number of strands in base branch; also, other default parameters for MOjDE and
EcoMod
Ensure: Pareto set of reconstructed parameterized procedural 3D woody plant models
2: uniform randomly generate DE initial population xi,0 ∈ [0, 1] for i = 1..NP;
3: for DE generation loop g (while FEs  10000) do
4: for DE iteration loop i (for all individuals xi,g of a population) do
5: DE individual xi,g creation (adaptation, mutation, crossover):
6: Fi,G+1 =
(
Fl + rand1 × Fu if rand2  τ1,
Fi,G otherwise
; CRi,G+1 =
(
rand3 if rand4  τ2,
CRi,G otherwise
;
8: vi,G+1 = xr1,G + Fi,G+1(xr2,G − xr3,G);
9: ui,j,G+1 =
(
vi,j,G+1 if rand(0, 1) ≤ CRi,G+1 or j = jrand
xi,j,G otherwise
;
10: DE fitness evaluation (genotype-phenotype mapping, rendering, and comparison):
11: z1 = g(ui,g, β1), z2 = g(ui,g, β2) {Execute Algorithm branchsegment twice}
12: h1(z1) =
P
x,y m1(z1
x,y, z∗
x,y) +
P
x,y m1(z∗
x,y, z1
x,y); {First difference metric, at 0◦
}
13: h1(z2) =
P
x,y m1(z2
x,y, z∗
x,y) +
P
x,y m1(z∗
x,y, z2
x,y); {First difference metric, at 90◦
}
14: f1(x) = f(g(x, β1), g(x, β2)) = h1(z1) + h1(z2); {Fitness evaluation, 1st criterion}
15: h2(z1) =
P
x,y w(z1
x,y, z∗
x,y) +
P
x,y w(z∗
x,y, z1
x,y); {Second difference metric, 0◦
}
16: h2(z2) =
P
x,y w(z2
x,y, z∗
x,y) +
P
x,y w(z∗
x,y, z2
x,y); {Second difference metric, 90◦
}
17: f2(x) = f(g(x, β1), g(x, β2)) = h2(z1) + h2(z2); {Fitness evaluation, 2nd criterion}
18: f(x) = {f1(x), f2(x)}; {Fitness evaluation, all criteria combined done}
19: DE selection:
20: xi,G+1 =
(
ui,G+1 if f(ui,G+1) ⪯ f(xi,G)
xi,G otherwise
; {Multi-objective comparison operator}
21: if not (ui,G+1 ⪯ xi,G or xi,G ⪯ ui,G+1 ) then add ui,G+1 to population archive;
22: end for
23: Truncate DE population archive to a size of NP using SPEA2 mechanism.
24: end for
25: return the best individuals obtained;
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 82/123
Introduction DAPHNE Background EuroHPC Vega  AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC  GenAI Language Video Machine Power Opportunities
Advanced Approach (INS2014): Overview
→
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 83/123
Introduction DAPHNE Background EuroHPC Vega  AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC  GenAI Language Video Machine Power Opportunities
More: See Published Articles
INS2014 A. Zamuda and J. Brest. Vectorized Procedural Models for Animated
Trees Reconstruction using Differential Evolution. Information
Sciences 2014, vol. 278, pp. 1-21. DOI: 10.1016/j.ins.2014.04.037.
IF2014 = 4.038 (Q1).
INS2013 A. Zamuda and J. Brest. Environmental Framework to Visualize
Emergent Artificial Forest Ecosystems. Information Sciences
220:522–540. 2013. DOI: 10.1016/j.ins.2012.07.031. IF2013 = 3.893
(Q1).
ASC2011 A. Zamuda, J. Brest, B. Bošković and V. Žumer. Differential Evolution
for Parameterized Procedural Woody Plant Models
Reconstruction. Applied Soft Computing 11(8):4904–4912. 2011.
DOI: 10.1016/j.asoc.2011.06.009. IF2011 = 2.612 (Q1).
CEC2012 A. Zamuda, J. Brest. Tree Model Reconstruction Innovization Using
Multi-objective Differential Evolution. 2012 IEEE World Congress on
Computational Intelligence (IEEE WCCI 2012), Brisbane, Australia,
2012, pp. 575-582.
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 84/123
Introduction DAPHNE Background EuroHPC Vega  AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC  GenAI Language Video Machine Power Opportunities
Tree Reconstruction: Summary Conclusions
• An approach to design woody plant geometrical models
[ASOC2011],
• rendered images are compared to the reference source
images, for reconstruction, to guide the optimization
process,
• sampled randomly to reconstruct geometrical models,
• procedural models are rendered using EcoMod
framework [INS2013],
• renderings of sample evolved models,
• parameters of the procedural model are iteratively
evolved using multi-objective differential evolution MOjDE
algorithm [CEC2012]
• fitness is evaluated by two criteria, which are not
pre-weighted,
• multi-objective optimization obtains multiple criteria
trade-offs shown using:
• attainment surfaces (trade-offs distribution), and
• rendered final approximation set models.
• The advanced approach [INS2014] adds
• real images preprocessing (tree trunk and crown
extraction),
• post-effects on reconstructed geometry (growth, wind,
leafs).
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 85/123
Introduction DAPHNE Background EuroHPC Vega  AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC  GenAI Language Video Machine Power Opportunities
Conferencia Invitada
15 September 2023 at 12:00 CEST
University of Alicante (UA), Departement of Software and Computing Systems (DLSI)
in Sala Ada Lovelace, UA DLSI, Alicante, Spain
Hosted by Elena Lloret Pastor (Profesor/A Titular Universidad, UA DLSI)
EuroHPC AI in DAPHNE
and Text Summarization
—
Part III: Machine (3)
—
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 86/123
Introduction DAPHNE Background EuroHPC Vega  AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC  GenAI Language Video Machine Power Opportunities
HPC Application 2:
Underwater Glider: Autonomous, Unmanned, Robotic
• underwater glider – navigating sea oceans,
• Autonomous Underwater Vehicle (AUV)
̸=
Unmanned Aerial Vehicle (UAV)
• AUV Slocum model (expertise in domain of ULPGC, work
with J. D. Hernández Sosa)
Images:
”Photo: Richard Watt/MOD” (License: OGL v1.0)
Slocum-Glider-Auvpicture 5.jpg (License: Public Domain)
MiniU.jpg (License: CC-BY-SA 3.0)
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 87/123
Introduction DAPHNE Background EuroHPC Vega  AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC  GenAI Language Video Machine Power Opportunities
The Buoyancy Drive and Submarine Probes
Usefulness
• Driving ”yoyo” uses little energy, most only on descent
and rise (pump); also for maintaining direction little
power is consumed.
+ Use: improving ocean models with real data,
+ the real data at the point of capture,
+ sampling flow of oil discharges,
+ monitoring cable lines, and
+ real-time monitoring of different
sensor data.
1
http://spectrum.ieee.org/image/1523708
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 88/123
Introduction DAPHNE Background EuroHPC Vega  AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC  GenAI Language Video Machine Power Opportunities
Preparations – Simulation Scenarios
https://www.google.si/maps/@28.059806,-15.998355,650054m/data=!3m1!1e3
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 89/123
Introduction DAPHNE Background EuroHPC Vega  AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC  GenAI Language Video Machine Power Opportunities
Trajectory Optimization: P201,ESTOC2013 3
+ BigData, MyOcean IBI,
satelite link, GPS location
The real trajectory and collected data is available in a Google Earth KML file at the EGO network:
http://www.ego-network.org/dokuwiki/doku.php?glider=P201,ESTOC2013_3
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 90/123
Introduction DAPHNE Background EuroHPC Vega  AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC  GenAI Language Video Machine Power Opportunities
Constrained Differential Evolution Optimization
for Underwater Glider Path Planning
in Sub-mesoscale Eddy Sampling
• Corridor-constrained optimization:
eddy border region sampling
• new challenge for UGPP  DE
• Feasible path area is constrained
• trajectory in corridor around
the border of an ocean eddy
The objective of the glider here is to
sample the oceanographic variables
more efficiently,
while keeping a bounded trajectory
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 91/123
Introduction DAPHNE Background EuroHPC Vega  AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC  GenAI Language Video Machine Power Opportunities
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 92/123
Introduction DAPHNE Background EuroHPC Vega  AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC  GenAI Language Video Machine Power Opportunities
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 93/123
Introduction DAPHNE Background EuroHPC Vega  AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC  GenAI Language Video Machine Power Opportunities
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 94/123
Introduction DAPHNE Background EuroHPC Vega  AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC  GenAI Language Video Machine Power Opportunities
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 95/123
Introduction DAPHNE Background EuroHPC Vega  AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC  GenAI Language Video Machine Power Opportunities
HoP — New Trajectories:
Success history applied to expert system for
underwater glider path planning using differential
evolution
• Improved underwater glider path
planning mission scenarios:
optimization with L-SHADE.
• Several configured algorithms
are also compared to,
analysed, and further
improved.
• Outranked all other previous
results from literature and
ranked first in comparison.
• New algorithm yielded
practically stable and
competitive output
trajectories.
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 96/123
Introduction DAPHNE Background EuroHPC Vega  AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC  GenAI Language Video Machine Power Opportunities
Ranking UGPP —
Benchmarking
Aggregation
• Statistically,
all results
from previous paper
were outperformed.
• Main reasons:
tuning (NP),
parameter control
(L-SHADE).
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 97/123
Introduction DAPHNE Background EuroHPC Vega  AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC  GenAI Language Video Machine Power Opportunities
Conferencia Invitada
15 September 2023 at 12:00 CEST
University of Alicante (UA), Departement of Software and Computing Systems (DLSI)
in Sala Ada Lovelace, UA DLSI, Alicante, Spain
Hosted by Elena Lloret Pastor (Profesor/A Titular Universidad, UA DLSI)
EuroHPC AI in DAPHNE
and Text Summarization
—
Part III: Power (4)
—
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 98/123
Introduction DAPHNE Background EuroHPC Vega  AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC  GenAI Language Video Machine Power Opportunities
HTS: Hydro Power Plants (HPPs) and Thermal Power
Plants (TPPs) Scheduling – Introduction
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 99/123
Introduction DAPHNE Background EuroHPC Vega  AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC  GenAI Language Video Machine Power Opportunities
Hydro and Thermal Power Plants Systems Model:
Nomenclature
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 100/123
Introduction DAPHNE Background EuroHPC Vega  AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC  GenAI Language Video Machine Power Opportunities
Hydro and Thermal Power Plants Systems Model:
Definitions
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 101/123
Introduction DAPHNE Background EuroHPC Vega  AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC  GenAI Language Video Machine Power Opportunities
Hydro and Thermal Power Plants Systems Model:
Models
• The application
contributes to algorithms
development,
with the aims of:
• improving performance
of the electrical energy
production and
• emissions and carbon
footprint reduction
(thermal units),
• while simultaneously
satisfying a 24-hour
system demands in
scheduling energy
demand and all other
operational
requirements.
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 102/123
Introduction DAPHNE Background EuroHPC Vega  AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC  GenAI Language Video Machine Power Opportunities
Hydro and Thermal Power Plants Systems Model:
Nomenclature and Definitions
• Equality constraints for energy production (scheduling).
• Constraints handling during optimization: ϵ-comparison.
• Algorithm output: 24-hour settings plan for TPPs and
HPPs.
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 103/123
Introduction DAPHNE Background EuroHPC Vega  AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC  GenAI Language Video Machine Power Opportunities
Additional DE Mechanisms and Parallelization of HTS
• Population size (NP) adjustment, multilevel parallelization,
• sub-population gathering and BmW offspring strategy:
• A. Glotić, A. Glotić, P. Kitak, J. Pihler, I. Tičar. Parallel self-adaptive differential evolution
algorithm for solving short-term hydro scheduling problem. IEEE Transactions on Power Systems
2014, 29 (5), pp. 2347–2358.
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 104/123
Introduction DAPHNE Background EuroHPC Vega  AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC  GenAI Language Video Machine Power Opportunities
HTS Parallelization and Pre-processing (1/3)
• New approach in optimization for scheduling energy
production among units of HPPs (hydro) and TPPs
(thermo).
• The approach allows a faster computation than before:
• 1) The DE for HTS is parallelized.
• 2) The TPPs optimization part features a novel
architecture, including a pre-computed surrogate model,
• this model is same during optimization of the whole HTS
optimized model for hydro and thermo units
(pre-processing), and
• obtained parameter values (x) of the TPPs surrogate model
on practical accuracy are stored for re-use in the global HTS
optimization.
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 105/123
Introduction DAPHNE Background EuroHPC Vega  AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC  GenAI Language Video Machine Power Opportunities
HTS Parallelization and Pre-processing (2/3)
• Two algorithms are built for this approach:
• the first algorithm (NPdynϵjDE) addresses a specialized
treatment of constraints handling and optimizes
scheduling for TPPs – thermo units parameters are
pre-computed up to practical accuracy,
• the second algorithm (PSADEs) utilizes the output of the
first algorithm, in order to optimize combined
production of hydro units, where hydro units settings are
probed and thermo units settings are looked-up in the
surrogate model matrix output from the NPdynϵjDE;
• here, both algorithms use
practical accuracy of the parameters for time schedule
of the thermo units load.
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 106/123
Introduction DAPHNE Background EuroHPC Vega  AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC  GenAI Language Video Machine Power Opportunities
HTS Parallelization and Pre-processing (3/3)
• The results of testing this approach on established HTS
benchmarks from literature show:
a larger performance improvement on all scenarios
under all criteria, compared to the approaches known
before. (Still.)
• Literature article with detailed results coverage follows:
A. Glotić, A. Zamuda. Short-term combined economic and
emission hydrothermal optimization by surrogate
differential evolution. Applied Energy, 1 March 2015, vol.
141, pp. 42-56. DOI 10.1016/j.apenergy.2014.12.020.
IF=5.613
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 107/123
Introduction DAPHNE Background EuroHPC Vega  AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC  GenAI Language Video Machine Power Opportunities
HTS Optimization: New DE Algorithms Architecture
Surrogate Matrix – Input and Output
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 108/123
Introduction DAPHNE Background EuroHPC Vega  AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC  GenAI Language Video Machine Power Opportunities
HTS Optimization: New DE Algorithms Architecture
Fitness Function and Practical Accuracy (discretized)
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 109/123
Introduction DAPHNE Background EuroHPC Vega  AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC  GenAI Language Video Machine Power Opportunities
HTS Optimization: New DE Algorithms Architecture
Fitness Function and Practical Accuracy: the Algorithm
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 110/123
Introduction DAPHNE Background EuroHPC Vega  AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC  GenAI Language Video Machine Power Opportunities
Results and Comparisons (1/2)
ELS
@NPdynϵjDE
EES
@NPdynϵjDE
CEES
@NPdynϵjDE
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 111/123
Introduction DAPHNE Background EuroHPC Vega  AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC  GenAI Language Video Machine Power Opportunities
Results and Comparisons (2/2)
• Results on different types of scheduling, compared to best
works from literature: the proposed approach
outperforms all by far, for all models: ELS, EES, and CEES,
• This approach exhibits as well the lowest time latency (due
to PSADEs parallelization and NPdynϵjDE pre-processing).
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 112/123
Introduction DAPHNE Background EuroHPC Vega  AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC  GenAI Language Video Machine Power Opportunities
A. Glotić, A. Zamuda. Short-term combined economic and emission
hydrothermal optimization by surrogate differential evolution.
Applied Energy, 1 March 2015, vol. 141, pp. 42-56.
DOI 10.1016/j.apenergy.2014.12.020. IF=5.613
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 113/123
Introduction DAPHNE Background EuroHPC Vega  AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC  GenAI Language Video Machine Power Opportunities
HTS — HPC-deployable Results
• When generating energy plans,
• like load balancing schedules for energy power plants,
• data analytics and machine learning algorithm
can be used (PSADE@NPdynϵjDE [1]).
• Here, a surrogated [1] part of computation for the whole
problem is split non-trivially [1]
• and offloaded to an offline pre-computation phase.
• In this initial phase, an HPC can be used to generate a
lookup matrix, included as a surrogate matrix
during the next, online optimization phase.
• Experiment: an example set of 86500 long runs
(each 1 million fitness evaluations)
• to generate a surrogate matrix
with 0.01 MW granularity step
• for power demands
from 110 MW to 975 MW and 0.01 MW apart
• from 106
simulations for each of
86500 power demand scenarios.
0 1 2 3 4 5 6 7 8 9
Combined power from 110 MW to 975 MW, step 0.01 MW#104
0
100
200
300
400
500
600
700
Individual
output
(power
[MW]
or
unit
total
cost
[$])
Cost, TC / 3
Powerplant P1 power
Powerplant P2 power
Powerplant P3 power
[1] A. Glotić, A. Zamuda. Short-term combined economic and emission hydrothermal optimization by surrogate
differential evolution. Applied Energy, 1 March 2015, vol. 141, pp. 42-56. DOI 10.1016/j.apenergy.2014.12.020.
Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 114/123
EuroHPC AI in DAPHNE and Text Summarization
EuroHPC AI in DAPHNE and Text Summarization
EuroHPC AI in DAPHNE and Text Summarization
EuroHPC AI in DAPHNE and Text Summarization
EuroHPC AI in DAPHNE and Text Summarization
EuroHPC AI in DAPHNE and Text Summarization
EuroHPC AI in DAPHNE and Text Summarization
EuroHPC AI in DAPHNE and Text Summarization
EuroHPC AI in DAPHNE and Text Summarization

More Related Content

Similar to EuroHPC AI in DAPHNE and Text Summarization

Communication Frameworks for HPC and Big Data
Communication Frameworks for HPC and Big DataCommunication Frameworks for HPC and Big Data
Communication Frameworks for HPC and Big Datainside-BigData.com
 
Sysml 2019 demo_paper
Sysml 2019 demo_paperSysml 2019 demo_paper
Sysml 2019 demo_paperstrange_loop
 
Designing HPC & Deep Learning Middleware for Exascale Systems
Designing HPC & Deep Learning Middleware for Exascale SystemsDesigning HPC & Deep Learning Middleware for Exascale Systems
Designing HPC & Deep Learning Middleware for Exascale Systemsinside-BigData.com
 
How to Design Scalable HPC, Deep Learning, and Cloud Middleware for Exascale ...
How to Design Scalable HPC, Deep Learning, and Cloud Middleware for Exascale ...How to Design Scalable HPC, Deep Learning, and Cloud Middleware for Exascale ...
How to Design Scalable HPC, Deep Learning, and Cloud Middleware for Exascale ...inside-BigData.com
 
Designing Software Libraries and Middleware for Exascale Systems: Opportuniti...
Designing Software Libraries and Middleware for Exascale Systems: Opportuniti...Designing Software Libraries and Middleware for Exascale Systems: Opportuniti...
Designing Software Libraries and Middleware for Exascale Systems: Opportuniti...inside-BigData.com
 
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...inside-BigData.com
 
ExtremeEarth: Hopsworks, a data-intensive AI platform for Deep Learning with ...
ExtremeEarth: Hopsworks, a data-intensive AI platform for Deep Learning with ...ExtremeEarth: Hopsworks, a data-intensive AI platform for Deep Learning with ...
ExtremeEarth: Hopsworks, a data-intensive AI platform for Deep Learning with ...Big Data Value Association
 
Hopsworks - ExtremeEarth Open Workshop
Hopsworks - ExtremeEarth Open WorkshopHopsworks - ExtremeEarth Open Workshop
Hopsworks - ExtremeEarth Open WorkshopExtremeEarth
 
AI OpenPOWER Academia Discussion Group
AI OpenPOWER Academia Discussion Group AI OpenPOWER Academia Discussion Group
AI OpenPOWER Academia Discussion Group Ganesan Narayanasamy
 
Designing HPC, Deep Learning, and Cloud Middleware for Exascale Systems
Designing HPC, Deep Learning, and Cloud Middleware for Exascale SystemsDesigning HPC, Deep Learning, and Cloud Middleware for Exascale Systems
Designing HPC, Deep Learning, and Cloud Middleware for Exascale Systemsinside-BigData.com
 
MVAPICH2 and MVAPICH2-X Projects: Latest Developments and Future Plans
MVAPICH2 and MVAPICH2-X Projects: Latest Developments and Future PlansMVAPICH2 and MVAPICH2-X Projects: Latest Developments and Future Plans
MVAPICH2 and MVAPICH2-X Projects: Latest Developments and Future Plansinside-BigData.com
 
Present and future of unified, portable and efficient data processing with Ap...
Present and future of unified, portable and efficient data processing with Ap...Present and future of unified, portable and efficient data processing with Ap...
Present and future of unified, portable and efficient data processing with Ap...DataWorks Summit
 
HPC Deployment / Use Cases (EVEREST + DAPHNE: Workshop on Design and Programm...
HPC Deployment / Use Cases (EVEREST + DAPHNE: Workshop on Design and Programm...HPC Deployment / Use Cases (EVEREST + DAPHNE: Workshop on Design and Programm...
HPC Deployment / Use Cases (EVEREST + DAPHNE: Workshop on Design and Programm...University of Maribor
 
Working together with SURF Raymond Oonk Annette Langedijk SURF
Working together with SURF Raymond Oonk Annette Langedijk SURFWorking together with SURF Raymond Oonk Annette Langedijk SURF
Working together with SURF Raymond Oonk Annette Langedijk SURFCommunicatieSURF
 
Realizing the promise of portable data processing with Apache Beam
Realizing the promise of portable data processing with Apache BeamRealizing the promise of portable data processing with Apache Beam
Realizing the promise of portable data processing with Apache BeamDataWorks Summit
 
CloudLightning and the OPM-based Use Case
CloudLightning and the OPM-based Use CaseCloudLightning and the OPM-based Use Case
CloudLightning and the OPM-based Use CaseCloudLightning
 
OpenACC and Open Hackathons Monthly Highlights: July 2022.pptx
OpenACC and Open Hackathons Monthly Highlights: July 2022.pptxOpenACC and Open Hackathons Monthly Highlights: July 2022.pptx
OpenACC and Open Hackathons Monthly Highlights: July 2022.pptxOpenACC
 
AI LAB using IBM Power 9 Processor
AI LAB using IBM Power 9 ProcessorAI LAB using IBM Power 9 Processor
AI LAB using IBM Power 9 ProcessorGanesan Narayanasamy
 
Petapath HP Cast 12 - Programming for High Performance Accelerated Systems
Petapath HP Cast 12 - Programming for High Performance Accelerated SystemsPetapath HP Cast 12 - Programming for High Performance Accelerated Systems
Petapath HP Cast 12 - Programming for High Performance Accelerated Systemsdairsie
 
Panda scalable hpc_bestpractices_tue100418
Panda scalable hpc_bestpractices_tue100418Panda scalable hpc_bestpractices_tue100418
Panda scalable hpc_bestpractices_tue100418inside-BigData.com
 

Similar to EuroHPC AI in DAPHNE and Text Summarization (20)

Communication Frameworks for HPC and Big Data
Communication Frameworks for HPC and Big DataCommunication Frameworks for HPC and Big Data
Communication Frameworks for HPC and Big Data
 
Sysml 2019 demo_paper
Sysml 2019 demo_paperSysml 2019 demo_paper
Sysml 2019 demo_paper
 
Designing HPC & Deep Learning Middleware for Exascale Systems
Designing HPC & Deep Learning Middleware for Exascale SystemsDesigning HPC & Deep Learning Middleware for Exascale Systems
Designing HPC & Deep Learning Middleware for Exascale Systems
 
How to Design Scalable HPC, Deep Learning, and Cloud Middleware for Exascale ...
How to Design Scalable HPC, Deep Learning, and Cloud Middleware for Exascale ...How to Design Scalable HPC, Deep Learning, and Cloud Middleware for Exascale ...
How to Design Scalable HPC, Deep Learning, and Cloud Middleware for Exascale ...
 
Designing Software Libraries and Middleware for Exascale Systems: Opportuniti...
Designing Software Libraries and Middleware for Exascale Systems: Opportuniti...Designing Software Libraries and Middleware for Exascale Systems: Opportuniti...
Designing Software Libraries and Middleware for Exascale Systems: Opportuniti...
 
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
How to Achieve High-Performance, Scalable and Distributed DNN Training on Mod...
 
ExtremeEarth: Hopsworks, a data-intensive AI platform for Deep Learning with ...
ExtremeEarth: Hopsworks, a data-intensive AI platform for Deep Learning with ...ExtremeEarth: Hopsworks, a data-intensive AI platform for Deep Learning with ...
ExtremeEarth: Hopsworks, a data-intensive AI platform for Deep Learning with ...
 
Hopsworks - ExtremeEarth Open Workshop
Hopsworks - ExtremeEarth Open WorkshopHopsworks - ExtremeEarth Open Workshop
Hopsworks - ExtremeEarth Open Workshop
 
AI OpenPOWER Academia Discussion Group
AI OpenPOWER Academia Discussion Group AI OpenPOWER Academia Discussion Group
AI OpenPOWER Academia Discussion Group
 
Designing HPC, Deep Learning, and Cloud Middleware for Exascale Systems
Designing HPC, Deep Learning, and Cloud Middleware for Exascale SystemsDesigning HPC, Deep Learning, and Cloud Middleware for Exascale Systems
Designing HPC, Deep Learning, and Cloud Middleware for Exascale Systems
 
MVAPICH2 and MVAPICH2-X Projects: Latest Developments and Future Plans
MVAPICH2 and MVAPICH2-X Projects: Latest Developments and Future PlansMVAPICH2 and MVAPICH2-X Projects: Latest Developments and Future Plans
MVAPICH2 and MVAPICH2-X Projects: Latest Developments and Future Plans
 
Present and future of unified, portable and efficient data processing with Ap...
Present and future of unified, portable and efficient data processing with Ap...Present and future of unified, portable and efficient data processing with Ap...
Present and future of unified, portable and efficient data processing with Ap...
 
HPC Deployment / Use Cases (EVEREST + DAPHNE: Workshop on Design and Programm...
HPC Deployment / Use Cases (EVEREST + DAPHNE: Workshop on Design and Programm...HPC Deployment / Use Cases (EVEREST + DAPHNE: Workshop on Design and Programm...
HPC Deployment / Use Cases (EVEREST + DAPHNE: Workshop on Design and Programm...
 
Working together with SURF Raymond Oonk Annette Langedijk SURF
Working together with SURF Raymond Oonk Annette Langedijk SURFWorking together with SURF Raymond Oonk Annette Langedijk SURF
Working together with SURF Raymond Oonk Annette Langedijk SURF
 
Realizing the promise of portable data processing with Apache Beam
Realizing the promise of portable data processing with Apache BeamRealizing the promise of portable data processing with Apache Beam
Realizing the promise of portable data processing with Apache Beam
 
CloudLightning and the OPM-based Use Case
CloudLightning and the OPM-based Use CaseCloudLightning and the OPM-based Use Case
CloudLightning and the OPM-based Use Case
 
OpenACC and Open Hackathons Monthly Highlights: July 2022.pptx
OpenACC and Open Hackathons Monthly Highlights: July 2022.pptxOpenACC and Open Hackathons Monthly Highlights: July 2022.pptx
OpenACC and Open Hackathons Monthly Highlights: July 2022.pptx
 
AI LAB using IBM Power 9 Processor
AI LAB using IBM Power 9 ProcessorAI LAB using IBM Power 9 Processor
AI LAB using IBM Power 9 Processor
 
Petapath HP Cast 12 - Programming for High Performance Accelerated Systems
Petapath HP Cast 12 - Programming for High Performance Accelerated SystemsPetapath HP Cast 12 - Programming for High Performance Accelerated Systems
Petapath HP Cast 12 - Programming for High Performance Accelerated Systems
 
Panda scalable hpc_bestpractices_tue100418
Panda scalable hpc_bestpractices_tue100418Panda scalable hpc_bestpractices_tue100418
Panda scalable hpc_bestpractices_tue100418
 

More from University of Maribor

Poročilo odbora CIS (CH08873) za leto 2023 na letni skupščini IEEE Slovenija ...
Poročilo odbora CIS (CH08873) za leto 2023 na letni skupščini IEEE Slovenija ...Poročilo odbora CIS (CH08873) za leto 2023 na letni skupščini IEEE Slovenija ...
Poročilo odbora CIS (CH08873) za leto 2023 na letni skupščini IEEE Slovenija ...University of Maribor
 
Deployment of Randomised Optimisation Algorithms Benchmarking in DAPHNE
Deployment of Randomised Optimisation Algorithms Benchmarking in DAPHNEDeployment of Randomised Optimisation Algorithms Benchmarking in DAPHNE
Deployment of Randomised Optimisation Algorithms Benchmarking in DAPHNEUniversity of Maribor
 
Evolutionary Optimization Algorithms & Large-Scale Machine Learning
Evolutionary Optimization Algorithms & Large-Scale Machine LearningEvolutionary Optimization Algorithms & Large-Scale Machine Learning
Evolutionary Optimization Algorithms & Large-Scale Machine LearningUniversity of Maribor
 
Solving 100-Digit Challenge with Score 100 by Extended Running Time and Paral...
Solving 100-Digit Challenge with Score 100 by Extended Running Time and Paral...Solving 100-Digit Challenge with Score 100 by Extended Running Time and Paral...
Solving 100-Digit Challenge with Score 100 by Extended Running Time and Paral...University of Maribor
 
Load balancing energy power plants with high-performance data analytics (HPDA...
Load balancing energy power plants with high-performance data analytics (HPDA...Load balancing energy power plants with high-performance data analytics (HPDA...
Load balancing energy power plants with high-performance data analytics (HPDA...University of Maribor
 
ULPGC2023-Erasmus+lectures_Ales_Zamuda_SystemsTheory+IntelligentAutonomousSys...
ULPGC2023-Erasmus+lectures_Ales_Zamuda_SystemsTheory+IntelligentAutonomousSys...ULPGC2023-Erasmus+lectures_Ales_Zamuda_SystemsTheory+IntelligentAutonomousSys...
ULPGC2023-Erasmus+lectures_Ales_Zamuda_SystemsTheory+IntelligentAutonomousSys...University of Maribor
 
Poročilo ODBORA CIS (CH08873) za leto 2020
Poročilo ODBORA CIS (CH08873) za leto 2020Poročilo ODBORA CIS (CH08873) za leto 2020
Poročilo ODBORA CIS (CH08873) za leto 2020University of Maribor
 
Poročilo ODBORA CIS (CH08873) za leto 2019
Poročilo ODBORA CIS (CH08873) za leto 2019 Poročilo ODBORA CIS (CH08873) za leto 2019
Poročilo ODBORA CIS (CH08873) za leto 2019 University of Maribor
 
Poročilo ODBORA CIS (CH08873) za leto 2022
Poročilo ODBORA CIS (CH08873) za leto 2022Poročilo ODBORA CIS (CH08873) za leto 2022
Poročilo ODBORA CIS (CH08873) za leto 2022University of Maribor
 
Delo vodje odbora IEEE: naloge, dobre prakse, poročanje
Delo vodje odbora IEEE: naloge, dobre prakse, poročanjeDelo vodje odbora IEEE: naloge, dobre prakse, poročanje
Delo vodje odbora IEEE: naloge, dobre prakse, poročanjeUniversity of Maribor
 
Speeding Up Vectorized Benchmarking of Optimization Algorithms
Speeding Up Vectorized Benchmarking of Optimization AlgorithmsSpeeding Up Vectorized Benchmarking of Optimization Algorithms
Speeding Up Vectorized Benchmarking of Optimization AlgorithmsUniversity of Maribor
 
IEEE Slovenia: Introduction (in Slovene), with details in English
IEEE Slovenia: Introduction (in Slovene), with details in EnglishIEEE Slovenia: Introduction (in Slovene), with details in English
IEEE Slovenia: Introduction (in Slovene), with details in EnglishUniversity of Maribor
 
IEEE Slovenia CIS in 2021 (a report in Slovene language)
IEEE Slovenia CIS in 2021 (a report in Slovene language)IEEE Slovenia CIS in 2021 (a report in Slovene language)
IEEE Slovenia CIS in 2021 (a report in Slovene language)University of Maribor
 
Superračunalništvo v Mariboru (2021, CIS11, ZID)
Superračunalništvo v Mariboru (2021, CIS11, ZID)Superračunalništvo v Mariboru (2021, CIS11, ZID)
Superračunalništvo v Mariboru (2021, CIS11, ZID)University of Maribor
 
Parallelization of Benchmarking using HPC: Text Summarization in Natural Lang...
Parallelization of Benchmarking using HPC: Text Summarization in Natural Lang...Parallelization of Benchmarking using HPC: Text Summarization in Natural Lang...
Parallelization of Benchmarking using HPC: Text Summarization in Natural Lang...University of Maribor
 
Examples Implementing Black-Box Discrete Optimization Benchmarking Survey for...
Examples Implementing Black-Box Discrete Optimization Benchmarking Survey for...Examples Implementing Black-Box Discrete Optimization Benchmarking Survey for...
Examples Implementing Black-Box Discrete Optimization Benchmarking Survey for...University of Maribor
 
Adaptive Constraint Handling and Success History Differential Evolution for C...
Adaptive Constraint Handling and Success History Differential Evolution for C...Adaptive Constraint Handling and Success History Differential Evolution for C...
Adaptive Constraint Handling and Success History Differential Evolution for C...University of Maribor
 

More from University of Maribor (17)

Poročilo odbora CIS (CH08873) za leto 2023 na letni skupščini IEEE Slovenija ...
Poročilo odbora CIS (CH08873) za leto 2023 na letni skupščini IEEE Slovenija ...Poročilo odbora CIS (CH08873) za leto 2023 na letni skupščini IEEE Slovenija ...
Poročilo odbora CIS (CH08873) za leto 2023 na letni skupščini IEEE Slovenija ...
 
Deployment of Randomised Optimisation Algorithms Benchmarking in DAPHNE
Deployment of Randomised Optimisation Algorithms Benchmarking in DAPHNEDeployment of Randomised Optimisation Algorithms Benchmarking in DAPHNE
Deployment of Randomised Optimisation Algorithms Benchmarking in DAPHNE
 
Evolutionary Optimization Algorithms & Large-Scale Machine Learning
Evolutionary Optimization Algorithms & Large-Scale Machine LearningEvolutionary Optimization Algorithms & Large-Scale Machine Learning
Evolutionary Optimization Algorithms & Large-Scale Machine Learning
 
Solving 100-Digit Challenge with Score 100 by Extended Running Time and Paral...
Solving 100-Digit Challenge with Score 100 by Extended Running Time and Paral...Solving 100-Digit Challenge with Score 100 by Extended Running Time and Paral...
Solving 100-Digit Challenge with Score 100 by Extended Running Time and Paral...
 
Load balancing energy power plants with high-performance data analytics (HPDA...
Load balancing energy power plants with high-performance data analytics (HPDA...Load balancing energy power plants with high-performance data analytics (HPDA...
Load balancing energy power plants with high-performance data analytics (HPDA...
 
ULPGC2023-Erasmus+lectures_Ales_Zamuda_SystemsTheory+IntelligentAutonomousSys...
ULPGC2023-Erasmus+lectures_Ales_Zamuda_SystemsTheory+IntelligentAutonomousSys...ULPGC2023-Erasmus+lectures_Ales_Zamuda_SystemsTheory+IntelligentAutonomousSys...
ULPGC2023-Erasmus+lectures_Ales_Zamuda_SystemsTheory+IntelligentAutonomousSys...
 
Poročilo ODBORA CIS (CH08873) za leto 2020
Poročilo ODBORA CIS (CH08873) za leto 2020Poročilo ODBORA CIS (CH08873) za leto 2020
Poročilo ODBORA CIS (CH08873) za leto 2020
 
Poročilo ODBORA CIS (CH08873) za leto 2019
Poročilo ODBORA CIS (CH08873) za leto 2019 Poročilo ODBORA CIS (CH08873) za leto 2019
Poročilo ODBORA CIS (CH08873) za leto 2019
 
Poročilo ODBORA CIS (CH08873) za leto 2022
Poročilo ODBORA CIS (CH08873) za leto 2022Poročilo ODBORA CIS (CH08873) za leto 2022
Poročilo ODBORA CIS (CH08873) za leto 2022
 
Delo vodje odbora IEEE: naloge, dobre prakse, poročanje
Delo vodje odbora IEEE: naloge, dobre prakse, poročanjeDelo vodje odbora IEEE: naloge, dobre prakse, poročanje
Delo vodje odbora IEEE: naloge, dobre prakse, poročanje
 
Speeding Up Vectorized Benchmarking of Optimization Algorithms
Speeding Up Vectorized Benchmarking of Optimization AlgorithmsSpeeding Up Vectorized Benchmarking of Optimization Algorithms
Speeding Up Vectorized Benchmarking of Optimization Algorithms
 
IEEE Slovenia: Introduction (in Slovene), with details in English
IEEE Slovenia: Introduction (in Slovene), with details in EnglishIEEE Slovenia: Introduction (in Slovene), with details in English
IEEE Slovenia: Introduction (in Slovene), with details in English
 
IEEE Slovenia CIS in 2021 (a report in Slovene language)
IEEE Slovenia CIS in 2021 (a report in Slovene language)IEEE Slovenia CIS in 2021 (a report in Slovene language)
IEEE Slovenia CIS in 2021 (a report in Slovene language)
 
Superračunalništvo v Mariboru (2021, CIS11, ZID)
Superračunalništvo v Mariboru (2021, CIS11, ZID)Superračunalništvo v Mariboru (2021, CIS11, ZID)
Superračunalništvo v Mariboru (2021, CIS11, ZID)
 
Parallelization of Benchmarking using HPC: Text Summarization in Natural Lang...
Parallelization of Benchmarking using HPC: Text Summarization in Natural Lang...Parallelization of Benchmarking using HPC: Text Summarization in Natural Lang...
Parallelization of Benchmarking using HPC: Text Summarization in Natural Lang...
 
Examples Implementing Black-Box Discrete Optimization Benchmarking Survey for...
Examples Implementing Black-Box Discrete Optimization Benchmarking Survey for...Examples Implementing Black-Box Discrete Optimization Benchmarking Survey for...
Examples Implementing Black-Box Discrete Optimization Benchmarking Survey for...
 
Adaptive Constraint Handling and Success History Differential Evolution for C...
Adaptive Constraint Handling and Success History Differential Evolution for C...Adaptive Constraint Handling and Success History Differential Evolution for C...
Adaptive Constraint Handling and Success History Differential Evolution for C...
 

Recently uploaded

Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTSérgio Sacani
 
G9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.pptG9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.pptMAESTRELLAMesa2
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxUmerFayaz5
 
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSarthak Sekhar Mondal
 
VIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PVIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PPRINCE C P
 
GFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxGFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxAleenaTreesaSaji
 
Luciferase in rDNA technology (biotechnology).pptx
Luciferase in rDNA technology (biotechnology).pptxLuciferase in rDNA technology (biotechnology).pptx
Luciferase in rDNA technology (biotechnology).pptxAleenaTreesaSaji
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)PraveenaKalaiselvan1
 
Grafana in space: Monitoring Japan's SLIM moon lander in real time
Grafana in space: Monitoring Japan's SLIM moon lander  in real timeGrafana in space: Monitoring Japan's SLIM moon lander  in real time
Grafana in space: Monitoring Japan's SLIM moon lander in real timeSatoshi NAKAHIRA
 
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...jana861314
 
Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Nistarini College, Purulia (W.B) India
 
zoogeography of pakistan.pptx fauna of Pakistan
zoogeography of pakistan.pptx fauna of Pakistanzoogeography of pakistan.pptx fauna of Pakistan
zoogeography of pakistan.pptx fauna of Pakistanzohaibmir069
 
Artificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PArtificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PPRINCE C P
 
Boyles law module in the grade 10 science
Boyles law module in the grade 10 scienceBoyles law module in the grade 10 science
Boyles law module in the grade 10 sciencefloriejanemacaya1
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsSérgio Sacani
 
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRDelhi Call girls
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​kaibalyasahoo82800
 

Recently uploaded (20)

Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOST
 
G9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.pptG9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.ppt
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptx
 
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
 
VIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PVIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C P
 
GFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxGFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptx
 
Luciferase in rDNA technology (biotechnology).pptx
Luciferase in rDNA technology (biotechnology).pptxLuciferase in rDNA technology (biotechnology).pptx
Luciferase in rDNA technology (biotechnology).pptx
 
The Philosophy of Science
The Philosophy of ScienceThe Philosophy of Science
The Philosophy of Science
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)
 
Grafana in space: Monitoring Japan's SLIM moon lander in real time
Grafana in space: Monitoring Japan's SLIM moon lander  in real timeGrafana in space: Monitoring Japan's SLIM moon lander  in real time
Grafana in space: Monitoring Japan's SLIM moon lander in real time
 
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
 
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
 
Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...
 
zoogeography of pakistan.pptx fauna of Pakistan
zoogeography of pakistan.pptx fauna of Pakistanzoogeography of pakistan.pptx fauna of Pakistan
zoogeography of pakistan.pptx fauna of Pakistan
 
Artificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PArtificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C P
 
Boyles law module in the grade 10 science
Boyles law module in the grade 10 scienceBoyles law module in the grade 10 science
Boyles law module in the grade 10 science
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
 
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
 
Engler and Prantl system of classification in plant taxonomy
Engler and Prantl system of classification in plant taxonomyEngler and Prantl system of classification in plant taxonomy
Engler and Prantl system of classification in plant taxonomy
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​
 

EuroHPC AI in DAPHNE and Text Summarization

  • 1. Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities Conferencia Invitada 15 September 2023 at 12:00 CEST University of Alicante (UA), Departement of Software and Computing Systems (DLSI) in Sala Ada Lovelace, UA DLSI, Alicante, Spain Hosted by Elena Lloret Pastor (Profesor/A Titular Universidad, UA DLSI) 150 200 250 300 350 400 450 500 550 16 32 48 64 80 Seconds to compute a workload Number of tasks (equals 16 times the SLURM --nodes parameter) Summarizer workload 1 2 3 4 5 16 32 48 64 80 Workload scaling (wall time) Number of tasks (equals 16 times the SLURM --nodes parameter) Summarizer workload Resources EuroHPC AI in DAPHNE and Text Summarization 15 September 2023 @ UA DLSI Aleš Zamuda <ales.zamuda@um.si> Acknowledgement: this work is supported by ARRS programme P2-0041 and EU project no. 957407. -0.05 0 0.05 0.1 0.15 0.2 0.25 0.3 0.35 0.4 1 10 100 1000 10000 ROUGE-1R ROUGE-2R ROUGE-LR ROUGE-SU4R Fitness (scaled) Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 1/123
  • 2. Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities Introduction & Outline: Aims of this Talk 1 (10 minutes) Part I: Background on DAPHNE Essentials + Project presentation: DAPHNE (Integrated Data Analysis Pipelines for Large-Scale Data Management, HPC, and Machine Learning) 2 (10 minutes) Part II: Background on EuroHPC Vega and AI + Selected recent developments and opportunities with DAPHNE in EuroHPC deployments like Vega in Maribor 3 (20 minutes) Part III: HPC and AI Opportunities + Including container reuse for generative AI — text summarization, autonomous machines, energy scheduling, and ecosystems. 4 (10 minutes) Part IV: SORS together and Leadership + The leadership approach towards EuroHPC AI in DAPHNE, + presented and discussed for potential collaboration, + to connect and conclude with take-aways of the talk. 5 (10 minutes) Questions, Misc Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 2/123
  • 3. Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities Introduction: Overview (Focus, Use, Scope) • This contribution focuses on HPC and AI sors for EuroHPC and DAPHNE • and discusses their collaboration potential w/ BSC. • Backgrounds on HPC deployments’ container reuse and integration @EuroHPC scope are discussed. • Backgrounds: generative AI [1], autonomous machines [4], energy scheduling [3], and ecosystems [2]: • the Summarizer [1] (language): generating text summaries, • UGPP [4] (machine): generating deep ocean trajectories, • load balancing of energy power plants (PSADE@NPdynϵjDE [3]): generating schedules, and • EcoMod [2]: generating procedural 3D models of natural ecosystems with trees. [1] A. Zamuda, E. Lloret, Optimizing Data-Driven Models for Summarization as Parallel Tasks. Journal of Computational Science 42, 101101 (2020). [2] A. Zamuda, J. Brest. Vectorized Procedural Models for Animated Trees Reconstruction using Differential Evolution. Information Sciences 278, pp. 1-21, 2014. DOI 10.1016/j.ins.2014.04.037. [3] A. Glotić, A. Zamuda. Short-term combined economic and emission hydrothermal optimization by surrogate differential evolution. Applied Energy, 1 March 2015, vol. 141, pp. 42-56. DOI: 10.1016/j.apenergy.2014.12.020 [4] A. Zamuda, J. D. Hernández Sosa, Success history applied to expert system for underwater glider path planning using differential evolution. Expert Systems with Applications 119, 155-170 (2019). Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 3/123
  • 4. Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities Conferencia Invitada 15 September 2023 at 12:00 CEST University of Alicante (UA), Departement of Software and Computing Systems (DLSI) in Sala Ada Lovelace, UA DLSI, Alicante, Spain Hosted by Elena Lloret Pastor (Profesor/A Titular Universidad, UA DLSI) EuroHPC AI in DAPHNE and Text Summarization — Part I: DAPHNE Background — Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 4/123
  • 5. Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities DAPHNE Partners https://daphne-eu.eu Project Consortium 13 partner institutions from 7 countries • DM, ML, HPC • Academia & industry • Different application domains 14 • Technical University Berlin University of Maribor (UM): UM FERI research team I lead (DAPHNE), SLING connection (EuroHPC Vega). https://feri.um.si/en/research/international-and-structural-funds-projects/ integrated-data-analysis-pipelines-for-large-scale-data-management-hpc-and-machine-learning/ Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 5/123
  • 6. Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities DAPHNE: Overview Overview: Generic Aspect of the Project • Deployment Challenges • Hardware Challenges • DM+ML+HPC share compilation and runtime techniques / converging cluster hardware • End of Dennard scaling: P = α CFV2 (power density 1) • End of Moore’s law • Amdahl’s law: sp = 1/s  Increasing Specialization #1 Data Representations Sparsity Exploitation from Algorithms to HW dense graph sparse compressed #2 Data Placement Local vs distributed CPUs/ NUMA GPUs FPGAs/ ASICs #3 Data (Value) Types FP32, FP64, INT8, INT32, INT64, UINT8, BF16, TF32, FlexPoint [NVIDIA A100]  DAPHNE Overall Objective: Open and extensible system infrastructure Different Systems/ Libraries Dev Teams Programming Models Resource Managers Cluster Under- utilization Data/File Exchange 3 lessons learnt so far choices made, methodology Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 6/123
  • 7. Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities DAPHNE: Functionalities y Functionality Introduction: from Language Abstractions to Distributed Vectorized Execution and Use Cases • Federated matrices/frames + distribution primitives • Hierarchical vectorized pipelines and scheduling • Coordinator (spawns distributed fused pipeline) • #1 Prepare Inputs (N/A, repartition, broadcasts, slices broadcasts as necessary) • #2 Coarse-grained Tasks (tasks run vectorized pipeline) • #3 Combine Outputs (N/A, all-reduce, rbind/cbind) Node 1 X [1: 100M] Node 2 X [100M: 200M] colmu colsd y y (X) XTX XTy dc = DaphneContext() G = dc.from_numpy(npG) G = (G != 0) c = components(G, 100, True).compute() Python API DaphneLib def components(G, maxi, verbose) { n = nrow(G); // get the number of vertexes maxi = 100; c = seq(1, n); // init vertex IDs diff = inf; // init diff to +Infinity iter = 1; // iterative computation of connected components while(diff>0 & iter<=maxi) { u = max(rowMaxs(G * t(c)), c); // neighbor prop diff = sum(u != c); // # of changed vertexes c = u; // update assignment iter = iter + 1; } } Domain-specific Language DaphneDSL Multiple dispatch of functions/kernels Example Use Cases • DLR Earth Observation • ESA Sentinel-1/2 datasets  4PB/year • Training of local climate zone classifiers on So2Sat LCZ42 (15 experts, 400K instances, 10 labels each, ~55GB HDF5) • ML pipeline: preprocessing, ResNet-20, climate models • IFAT Semiconductor Ion Beam Tuning • KAI Semiconductor Material Degradation • AVL Vehicle Development Process (ejector geometries, KPIs) • ML-assisted simulations, data cleaning, augmentation • Cleaning during exploratory query processing Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 7/123
  • 8. Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities DAPHNE: Data Spaces Contribution to Data Spaces • How does your project contribute to the Data Spaces (e.g. use cases combining data spaces and big data/extreme data analytics, specific functionality / building blocks, contribution to existing building blocks, specific requirements, etc.) • focus on the open and extensible infrastructure we are building • that would enable adaption to Data Spaces as anybody can extend DAPHNE with custom readers and filters • furthermore, as we try to bring IDA, ML and HPC communities together, we can make a case for more efficient in memory processing if the ingestion and processing happen within the same framework • additionally, the DAPHNE infrastructure fosters development of compiler passes that optimize the end to end analytics task by facilitating operator fusion and reuse of intermediates Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 8/123
  • 9. Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities DAPHNE: Interoperability Contribution to Interoperability • DAPHNE is open-source software • https://github.com/daphne-eu/daphne • Apache v2 license • Towards an inclusive dev community  Potential for collaboration in 2023-2024 • Check out our website • https://daphne-eu.eu • Follow us on twitter • @daphne_eu Enable researchers to experiment with new prototypes and extensions Long-term stability development available through GitHub data sets – reproducibility, use cases Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 9/123
  • 10. Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities DAPHNE: Data Spaces – Common Framework Contribution to Data Spaces • How would a common conceptual/functional framework for Data spaces and Big Data / Extreme data analytics look like? • At Big Data technologies and extreme-scale analytics Projects Workshop (Horizon ICT-51- 2020), organized by Big Data Value Association (BDVA) in collaboration with project EUH4D (European Federation of Data Driven Innovation Hubs), on September 27, 2022 (online), the DAPHNE project has been presented by Aleš Zamuda (UM) and Eva Paulusberger (KNOW) • with the aim to engage for future road-mapping and creating a community around the topic in extreme-scale data analytics • during the presentation, we had the opportunity to channel our work towards the activities of BDVA, • especially in relation to Data Sharing Spaces and standardization. • we have highlighted the specific technical and non-technical progress of the DAPHNE project, our Use Cases, the main lessons learnt so far, and contributions to road-mapping activities in the field of extreme-scale data analytics • the workshop has allowed us to reflect on the value and needs for collaborating with other ICT-51 projects (MORE, SELMA, VesselAI, EVEREST, and MARVEL), and how to federate our data sets and services under EUH4D • Framework: published at CIDR 2022 Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 10/123
  • 11. Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities DAPHNE: Data Spaces – Integrated Architecture System Architecture LLVM Python API w/ lazy evaluation MLIR Dialects, Extension Catalog (new data types, kernels, scheduling algs) Sideways Entry, DSL-level constraints (data formats & data/op placement) Contribution to Data Spaces How would a common conceptual/functional framework for Data spaces and Big Data / Extreme data analytics look like? Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 11/123
  • 12. Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities DAPHNE: Vision Paper, Integrating DM + ML + HPC • Current Status • System architecture and design • Initial DSL and Python API • Prototype of MLIR-based compiler and runtime • Vectorized execution (fused pipelines, scheduling) • GPU (and FPGA) integration, BLAS/DNN libraries, I/O primitives • Standalone distributed runtime w/ different distribution primitives • Joint Paper on System Architecture • Published at CIDR 2022 Contribution Towards common conceptual/functional framework for Data spaces and Big Data / Extreme data analytics  DAPHNE Overall Objective: Open and extensible system infrastructure DM + ML + HPC Recently: reached out to other EU projects towards interoperability and standardization • synergies with EVEREST and eFlows4HPC Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 12/123
  • 13. Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities DAPHNE: Joint Framework on MLIR Collaboration on technical aspects – Joint framework • What are the aspects that interest you the most to work on. Prioritise those ones of your interest, and address a summarised contribution to those that you prioritised • ML/DL systems and system support • Architectures for gathering heterogeneous data • System tools, e.g. language, intermediate representation • Method for extreme-scale analytics, e.g. combination of ML models, simulations and subsequent data analysis in different use cases • Standardized interconnection methods, e.g. runtime integration, HPC libraries • Data fusion and data integration technologies • Others Different Systems/ Libraries Dev Teams Programming Models Resource Managers Cluster Under- utilization Data/File Exchange https://mlir.llvm.org Based on Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 13/123
  • 14. Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities HPC workloads: new DAPHNE Use Cases Example Use Cases • DLR Earth Observation • ESA Sentinel-1/2 datasets  4PB/year • Training of local climate zone classifiers on So2Sat LCZ42 (15 experts, 400K instances, 10 labels each, ~55GB HDF5) • ML pipeline: preprocessing, ResNet-20, climate models • IFAT Semiconductor Ion Beam Tuning • KAI Semiconductor Material Degradation • AVL Vehicle Development Process (ejector geometries, KPIs) • ML-assisted simulations, data cleaning, augmentation • Cleaning during exploratory query processing [So2Sat LC42: https://mediatum.ub.tum.de/1454690] [Xiao Xiang Zhu et al: So2Sat LCZ42: A Benchmark Dataset for the Classification of Global Local Climate Zones. GRSM 8(3) 2020] Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 14/123
  • 15. Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities DAPHNE: ICT Rolling Plan for Data and AI Actions Potential contribution to the Rolling plan for ICT Standards 2022 (Data and AI actions) Example identified opportunities: • IEEE CIS Standards Committee: P2976 - Standard for XAI - eXplainable AI Working Group (CIS/SC/XAI WG), https://sagroups.ieee.org/1855/ • IEEE CIS Standards Committee: P1849 – Working Group for eXtensible Event Stream (XES) for Achieving Interoperability in Event Logs and Event Streams, https://standards.ieee.org/ieee/1849/10907/ • IEEE other: https://ethicsinaction.ieee.org/p7000/; other ML&AI: • IEEE P2841, Framework and Process for Deep Learning Evaluation • IEEE 3652.1-2020, IEEE Guide for Architectural Framework and Application of Federated Machine Learning, https://standards.ieee.org/project/3652_1.html • ISO/IEC JTC 1 SC 42 (AI) - published: • ISO/IEC 20547-3:2020 Information technology — Big data reference architecture — Part 3: Reference architecture • ISO/IEC TR 20547-5:2018 Information technology — Big data reference architecture — Part 5: Standards roadmap • ISO/IEC TR 24029-1:2021 Artificial Intelligence (AI) — Assessment of the robustness of neural networks — Part 1: Overview • ISO/IEC TR 24030:2021 Information technology — Artificial intelligence (AI) — Use cases • ISO/IEC 24372 Information technology -- Artificial Intelligence (AI) -- Overview of computational approaches for AI systems • ISO/IEC JTC 1 SC 42 (AI) – under development: • ISO/IEC DIS 22989 Artificial Intelligence Concepts and Terminology • ISO/IEC 23053 Framework for Artificial Intelligence Systems Using Machine Learning • ISO/IEC 42001 Artificial Intelligence - Management System Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 15/123
  • 16. Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities DAPHNE: Standards Opportunities • Gaps • Federated matrices/frames + distribution primitives • Hierarchical vectorized pipelines and scheduling • Applied within the context of AI and BDV • Priorities • Data architectures for federated frames dispatch Are the Roadmapping activities for extreme-scale data analytics DAPHNE input to the Rolling plan for ICT Standards 2023 Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 16/123
  • 17. Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities DAPHNE: Open Topics Interest and connection to future discussion topics and future workshops (Q1 2023) • Skills gap that should be filled in the future • MLIR kernels programming for dedicated (pre-release) hardware; and, design of algorithms in the daphne language with the DAPHNE architecture in mind • another gap that DAPHNE addresses is the seamless integration of pipelines on a HPC, and there are plenty of skill gaps and how to do and fill a skills gap from this • e.g. development of libraries and executables, but also training and teaching of new programmers in the community • Trustworthy AI and link to the AI on Demand Platform • DAPHNE will be able to provide full stack for pipelines integration on the level of a whole HPC, • which is an important added value for creating an AI on Demand Platform; • an open-source implementation of the DAPHNE system fosters reproducibility for Trustworthy AI • Enlarging the group with new related projects from Horizon Europe and EuroHPC JU • DAPHNE is open-source software • https://github.com/daphne-eu/daphne • Apache v2 license • DAPHNE is present at several top-tier events • Latest consortium synergy: TUB projects • The topic of Data Spaces will be recurrent during all workshops during 2023 • We are planning, in accordance with the project plan, to bring the bottom-up developed DAPHNE system closer to the top-down developed use cases • Thus, we are planning a use case workshop for Q2/2023 • but also to stay in line with the research focus of the research and innovation programme Federated matrices/frames + distribution primitives Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 17/123
  • 18. Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities Conferencia Invitada 15 September 2023 at 12:00 CEST University of Alicante (UA), Departement of Software and Computing Systems (DLSI) in Sala Ada Lovelace, UA DLSI, Alicante, Spain Hosted by Elena Lloret Pastor (Profesor/A Titular Universidad, UA DLSI) EuroHPC AI in DAPHNE and Text Summarization — Part II: EuroHPC Vega and AI — Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 18/123
  • 19. Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities EuroHPC Vega & AI (Vega supercomputer in TOP500) — A Multimedia Tour — Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 19/123
  • 20. Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities TOP500: EuroHPC Vega (tour at ASHPC23) Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 20/123
  • 21. Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities TOP500: EuroHPC Vega (tour at ASHPC23) Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 21/123
  • 22. Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities TOP500: EuroHPC Vega (tour at ASHPC23) Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 22/123
  • 23. Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities TOP500: EuroHPC Vega (tour at ASHPC23) Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 23/123
  • 24. Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities AI Challenges Shortlist (Part II: First subpart) Faced 5 types of challenges, leading to the needs to apply HPC architectures for benchmarking state-of-the-art topics in 1 text summarization, 2 forest ecosystem modeling, simulation, and visualization, 3 underwater robotic mission planning, 4 energy production scheduling for hydro-thermal power plants, and 5 understanding evolutionary algorithms. Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 24/123
  • 25. Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities Challenges 1: Text Summarization (Language) For NLP (Natural Language Processing), part of ”Big Data”. Terms across sentences are determined using a semantic analysis using both: • coreference resolution (using WordNet) and • a Concept Matrix (from Freeling). INPUT CORPUS NATURAL LANGUAGE PROCESSING ANALYSIS CONCEPTS DISTRIBUTION PER SENTENCES MATRIX OF CONCEPTS PROCESSING CORPUS PREPROCESSING PHASE OPTIMIZATION TASKS EXECUTION PHASE ASSEMBLE TASK DESCRIPTION SUBMIT TASKS TO PARALLEL EXECUTION OPTIMIZER + TASK DATA ROUGE EVALUATION The detailed new method called CaBiSDETS is developed in the HPC approach comprising of: • a version of evolutionary algorithm (Differential Evolution, DE), • self-adaptation, binarization, constraint adjusting, and some more pre-computation, • optimizing the inputs to define the summarization optimization model. Aleš Zamuda, Elena Lloret. Optimizing Data-Driven Models for Summarization as Parallel Tasks. Journal of Computational Science, 2020, vol. 42, pp. 101101. DOI: 10.1016/j.jocs.2020.101101 Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 25/123
  • 26. Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities Challenges 2: Forest Ecosystem Modeling, Simulation, and Visualization (Real World / Video) • HPC need to process spatial data and add procedural content, generating real-world items for producing a video of 3D space. Videos: https://www.youtube.com/watch?list=PL7pmTW8neV7tZf2qx1wV5zbD74sUyHL3B&v=V9YJgYO_sIA Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 26/123
  • 27. Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities Challenges 3: Underwater Robotic Mission Planning • Computational Fluid Dynamics (CFD) spatio-temporal model of the ocean currents for autonomous vehicle navigation path planning. • Constrained Differential Evolution Optimization for Underwater Glider Path Planning in Sub-mesoscale Eddy Sampling. • Corridor-constrained optimization: eddy border region sampling — new challenge for UGPP & DE. • Feasible path area is constrained — trajectory in corridor around the border of an ocean eddy. The objective of the glider here is to sample the oceanographic variables more efficiently, while keeping a bounded trajectory. HPC: develop new methods and evaluate them. Video: https://www.youtube.com/watch?v=4kCsXAehAmU Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 27/123
  • 28. Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities Challenges 4: Energy Production Scheduling for Hydro-thermal Power Plants A. Glotić, A. Zamuda. Short-term combined economic and emission hydrothermal optimization by surrogate differential evolution. Applied Energy, 1 March 2015, vol. 141, pp. 42-56. DOI: 10.1016/j.apenergy.2014.12.020 Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 28/123
  • 29. Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities Challenges 5: Understanding Evolutionary Algorithms • Evolutionary algorithms benchmarking to understand computational intelligence of these algorithms (→ storage requirement!), • aim: Machine Learning to design an optimization algorithm (learning to learn). • Example CI Algorithm Mechanism Design: Control Parameters Self-Adaptation (in DE). Video: https://www.youtube.com/watch?v=R244LZpZSG0 Application stacks for real code: inspired by previous computational optimization competitions in continuous settings that used test functions for optimization application domains: • single-objective: CEC 2005, 2013, 2014, 2015 • constrained: CEC 2006, CEC 2007, CEC 2010 • multi-modal: CEC 2010, SWEVO 2016 • black-box (target value): BBOB 2009, COCO 2016 • noisy optimization: BBOB 2009 • large-scale: CEC 2008, CEC 2010 • dynamic: CEC 2009, CEC 2014 • real-world: CEC 2011 • computationally expensive: CEC 2013, CEC 2015 • learning-based: CEC 2015 • 100-digit (50% targets): 2019 joined CEC, SEMCCO, GECCO • multi-objective: CEC 2002, CEC 2007, CEC 2009, CEC 2014 • bi-objective: CEC 2008 • many objective: CEC 2018 Tuning/ranking/hyperheuristics use. → DEs as usual winner algorithms. Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 29/123
  • 30. Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities Challenges 6: new DAPHNE Use Cases Example Use Cases • DLR Earth Observation • ESA Sentinel-1/2 datasets  4PB/year • Training of local climate zone classifiers on So2Sat LCZ42 (15 experts, 400K instances, 10 labels each, ~55GB HDF5) • ML pipeline: preprocessing, ResNet-20, climate models • IFAT Semiconductor Ion Beam Tuning • KAI Semiconductor Material Degradation • AVL Vehicle Development Process (ejector geometries, KPIs) • ML-assisted simulations, data cleaning, augmentation • Cleaning during exploratory query processing [So2Sat LC42: https://mediatum.ub.tum.de/1454690] [Xiao Xiang Zhu et al: So2Sat LCZ42: A Benchmark Dataset for the Classification of Global Local Climate Zones. GRSM 8(3) 2020] Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 30/123
  • 31. Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities HPC Initiatives (Part II: Second subpart) Timeline (as member) of recent impactful HPC initiatives including Slovenia: • SLING: Slovenian national supercomputing network, 2010-05-03–, • SIHPC: Slovenian High-Performance Computing Centre, 2016-03-04– • ImAppNIO: Improving Applicability of Nature-Inspired Optimisation by Joining Theory and Practice, 2016-03-09–2020-10-31 • cHiPSet: High-Performance Modelling and Simulation for Big Data Applications, 2015-04-08–2019-04-07, • HPC RIVR: UPGRADING OF NATIONAL RESEARCH INFRASTRUCTURES, Investment Program, 2018-03-01–2020-09-15, • TFoB: IEEE CIS Task Force on Benchmarking, January 2020–, • EuroCC: National Competence Centres in the framework of EuroHPC, 2020-09-01–(2022-08-31), • DAPHNE: Integrated Data Analysis Pipelines for Large-Scale Data Management, HPC, and Machine Learning, 2020-12-01–(2024-11-30). Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 31/123
  • 32. Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities Initiatives: SLING, SIHPC, HPC RIVR, EuroCC • There is a federated and orchestrated aim towards HPC infrastructure in Slovenia, especially through: • SLING: Slovenian national supercomputing network → has federated the initiative push towards orchestration of HPC resources across the country. • SIHPC: Slovenian High-Performance Computing Centre → has orchestrated the first EU funds application towards HPC Teaming in the country (and Participation of Slovenia in PRACE 2). • HPC RIVR: UPGRADING OF NATIONAL RESEARCH INFRASTRUCTURES, Investment Program → has provided an investment in experimental HPC infrastructure. • EuroCC: National Competence Centres in the framework of EuroHPC → has secured a National Competence Centre (EuroHPC). Vega supercomputer online Consortium Slovenian High-Performance Computing Centre Aškerčeva ulica 6 SI-1000 Ljubljana Slovenia Ljubljana, 22. 2. 2017 prof. dr. Anwar Osseyran PRACE Council Chair Subject: Participation of Slovenia in PRACE 2 Dear professor Osseyran, In March 2016 a consortium Slovenski superračunalniški center (Slovenian High-Performance Computing Centre - SIHPC) was established with a founding act (Attachment 1) where article 6 claims that the consortium will join PRACE and that University of Ljubljana, Faculty of mechanical engineering (ULFME) will represent the consortium in the PRACE. The legal representative of ULFME in the consortium and therefore, also in the PRACE (i.e., the delegate in PRACE Council with full authorization) is prof. dr. Jožef Duhovnik, as follows from appointment of the dean of ULFME (Attachment 2). The consortium has also agreed that it joins PRACE 2 optional programme as contributing GP. With best regards, assist. prof. dr. Aleš Zamuda, vice-president of SIHPC Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 32/123
  • 33. Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities Initiatives: ImAppNIO, cHiPSet, TFoB, DAPHNE Aim towards software to run HPC and improve capabilities: • ImAppNIO: Improving Applicability of Nature-Inspired Optimisation by Joining Theory and Practice, → improve capabilities through benchmarking (to understand (and to learn to learn)) CI algorithms • cHiPSet: High-Performance Modelling and Simulation for Big Data Applications, → include HPC in Modelling and Simulation (of the process to be learned) • TFoB: IEEE CIS Task Force on Benchmarking, → includes CI benchmarking opportunities, where HPC would enable new capabilities. • DAPHNE: Integrated Data Analysis Pipelines for Large-Scale Data Management, HPC, and Machine Learning. → to define and build an open and extensible system infrastructure for integrated data analysis pipelines, including data management and processing, high-performance computing (HPC), and machine learning (ML) training and scoring https://daphne-eu.github.io/ Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 33/123
  • 34. Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities EuroHPC Vega & Deploying DAPHNE (Part II: Third subpart) MODA (Monitoring and Operational Data Analytics) tools for • collecting, analyzing, and visualizing • rich system and application data, and • my opinion on how one can make sense of the data for actionable insights. • Explained through previous examples: from a HPC User Perspective. Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 34/123
  • 35. Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities MODA Actionable Insights, Explained From a HPC User Perspective, Through the Example of Summarization Most interesting findings of summarization on HPC example are • the fitness of the NLP model keeps increasing with prolonging the dedicated HPC resources (see below) and that • the fitness improvement correlates with ROUGE evaluation in the benchmark, i.e. better summaries. -0.05 0 0.05 0.1 0.15 0.2 0.25 0.3 0.35 0.4 1 10 100 1000 10000 ROUGE-1R ROUGE-2R ROUGE-LR ROUGE-SU4R Fitness (scaled) Hence, the use of HPC significantly contributes to capability of this NLP challenge. However, the MODA insight also provided the useful task running times and resource usage. Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 35/123
  • 36. Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities Running the Tasks on HPC: ARC Job Preparation Parallel summarization tasks on grid through ARC. Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 36/123
  • 37. Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities Running the Tasks on HPC: ARC Job Submission, Results Retrieval & Merging [JoCS2020] Through an HPC approach and by parallelization of tasks, a data-driven summarization model optimization yields improved benchmark metric results (drawn using gnuplot merge). MODA is needed to run again and improve upon, to forecast how to set required task running time and resources (predicting system response). Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 37/123
  • 38. Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities Monitoring and Operational Data Analytics • Monitor used (jobs, CPU/wall time, etc.): Smirnova, Oxana. The Grid Monitor. Usage manual, Tech. Rep. NORDUGRID-MANUAL-5, The NorduGrid Collaboration, 2003. http://www.nordugrid.org/documents/ http://www.nordugrid.org/manuals.html http://www.nordugrid.org/documents/monitor.pdf • Deployed at: www.nordugrid.org/monitor/ • NorduGrid Grid Monitor Sampled: 2021-06-28 at 17-57-08 • Nation-wide in Slovenia: https://www.sling.si/gridmonitor/loadmon.php http://www.nordugrid.org/monitor/index.php? display=vo=Slovenia Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 38/123
  • 39. Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities MODA Example From: ARC at Jost Example experiments from DOI: 10.1016/j.swevo.2015.10.007 (SPSRDEMMS) – job YYULDmGOXpmnmmR0Xox1SiGmABFKDmABFKDmrtMKDmABFKDm66faPo. Sample ARC file gridlog/diag (2–3 day Wall Times). runtimeenvironments=APPS/ARNES/MPI−1.6−R ; CPUUsage=99% MaxResidentMemory=5824kB AverageUnsharedMemory=0kB AverageUnsharedStack=0kB AverageSharedMemory=0kB PageSize=4096B MajorPageFaults=4 MinorPageFaults=1213758 Swaps=0 ForcedSwitches=36371494 WaitSwitches=170435 Inputs=45608 Outputs=477168 SocketReceived=0 SocketSent=0 Signals =0 nodename=wn003 . arnes . s i WallTime=148332s Processors=16 UserTime=147921.14s KernelTime =2.54 s AverageTotalMemory=0kB AverageResidentMemory=0kB LRMSStartTime=20150906104626Z LRMSEndTime=20150908035838Z exitcode=0 Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 39/123
  • 40. Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities EuroCC HPC: Vega (TOP500 #106, HPCG #56 — June 2021) • Researchers apply to EuroHPC JU calls for access. • Regular calls opened in 2021 fall (Benchmark & Development). • https://prace-ri.eu/benchmark-and-development-access-information-for-applicants/ • 60% capacities for national share (70% OA, 20% commercial, 10% host (community, urgent priority of national importance, maintenance)) + 35% EuroHPC JU share (approved applications) • Has a SLURM dev partition for SSH login (SLURM partitions w/ CPUs: login[0001-0004]=128; login[0005-0008]=64; cn[0001-384,0577-0960]=256; cn[0385-0576]=256; gn[01-60]=256). Listing 1: Setting up at Vega — slurm dev partition access (login). 1 [ ales . zamuda@vglogin0007 ˜]$ s i n g u l a r i t y pull qmake . s i f docker : / / ak352 /qmake−opencv 2 [ ales . zamuda@vglogin0007 ˜]$ s i n g u l a r i t y run qmake . s i f bash 3 cd sum; qmake ; make clean ; make 4 5 [ ales . zamuda@vglogin0007 ˜]$ cat runme . sh 6 # ! / bin / bash 7 cd sum && time mpirun 8 − −mca btl openib warn no device params found 0 9 . / summarizer 10 − −useBinaryDEMPI − −i n p u t f i l e mRNA−1273−t x t 11 − −withoutStatementMarkersInput 12 − −printPreprocessProgress calcInverseTermFrequencyndTermWeights 13 − −printOptimizationBestInGeneration 14 − −summarylength 600 − −NP 200 15 − −GMAX 400 16 > summarizer . out . $SLURM PROCID 17 2> summarizer . err . $SLURM PROCID Text summarization/generation systems are getting more and more useful and accessible on deployed systems (e.g. OpenAI’s ChatGPT, Microsoft’s Bing AI part, NVIDIA’s (Fin)Megatron, BLOOM, LaMDA, BERT, VALL-E, Point-E, etc.). -0.65 -0.6 -0.55 -0.5 -0.45 -0.4 -0.35 -0.3 -0.25 -0.2 -0.15 -0.1 1 10 100 Evaluation Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 40/123
  • 41. Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities MODA at First EuroCC HPC Vega Supercomputer Listing 2: Runnig at Vega & MODA. 1 ===================================================================== GMAX=200 ===== 2 [ ales . zamuda@vglogin0002 ˜]$ srun − −cpu−bind=cores − −nodes=1 − −ntasks−per−node=101 3 − −cpus−per−task=2 − − mem=180G s i n g u l a r i t y run qmake . s i f bash 4 srun : job 4531374 queued and waiting for resources 5 srun : job 4531374 has been allocated resources 6 [ ”$SLURM PROCID” = 0 ] && . / runme . sh 7 real 5m22.475 s 8 user 484m42.262 s 9 sys 1m38.304 s 10 ===================================================================== NODES=51 ===== 11 [ ales . zamuda@vglogin0002 ˜]$ srun − −cpu−bind=cores − −nodes=1 − −ntasks−per−node=51 12 − −cpus−per−task=2 − − mem=180G s i n g u l a r i t y run qmake . s i f bash 13 srun : job 4531746 queued and waiting for resources 14 srun : job 4531746 has been allocated resources 15 [ ”$SLURM PROCID” = 0 ] && . / runme . sh 16 real 13m57.851 s 17 user 431m25.833 s 18 sys 0m29.272 s 19 ===================================================================== GMAX=400 ===== 20 [ ales . zamuda@vglogin0002 ˜]$ srun − −cpu−bind=cores − −nodes=1 − −ntasks−per−node=101 21 − −cpus−per−task=2 − − mem=180G s i n g u l a r i t y run qmake . s i f bash 22 srun : job 4532697 queued and waiting for resources 23 srun : job 4532697 has been allocated resources 24 [ ”$SLURM PROCID” = 0 ] && . / runme . sh 25 real 6m14.687 s 26 user 590m45.641 s 27 sys 1m40.930 s Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 41/123
  • 42. Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities More Output: SLURM accounting Listing 3: Example accounting tool at Vega: sacct. [ ales . zamuda@vglogin0002 ˜]$ sacct 4531374. ext+ extern vega−users 202 COMPLETED 0:0 4531746. ext+ extern vega−users 102 COMPLETED 0:0 4532697. ext+ extern vega−users 202 COMPLETED 0:0 [ ales . zamuda@vglogin0002 ˜]$ sacct −j 4531374 −j 4531746 −j 4532697 −o MaxRSS , MaxVMSize , AvePages MaxRSS MaxVMSize AvePages − − − − − − − − − − − − − − − − − − − − − − − − − − − − − − 0 217052K 0 26403828K 1264384K 22 0 217052K 0 13325268K 1264380K 0 0 217052K 0 26404356K 1264384K 30 Future MODA testings: • testing the web interface for job analysis (as available from HPC RIVR); • profiling MPI inter-node communication; • use profilers and monitoring tools available — in the context of heterogeneous setups, like e.g. • TAU Performance System — http://www.cs.uoregon.edu/research/tau/home.php, • LIKWID Performance Tools — https://hpc.fau.de/research/tools/likwid/. Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 42/123
  • 43. Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities Deploying DAPHNE on Vega Main documentation file: Deploy.md Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 43/123
  • 44. Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities SLURM Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 44/123
  • 45. Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 45/123
  • 46. Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 46/123
  • 47. Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities More HPC User Perspective Nation-wide in Slovenia More: at University of Maribor, Bologna study courses for teaching (training) of Computer Science at cycles — click URL: • level 1 (BSc) • year 1: Programming I – e.g. C++ syntax • year 2: Computer Architectures – e.g. assembly, microcode, ILP • year 3: Parallell and Distributed Computing – e.g. OpenMP, MPI, CUDA • level 2 (MSc) • year 1: Cloud Computing Deployment and Management – e.g. arc, slurm, Hadoop, containers (docker, singularity) through virtualization • level 3 (PhD) • EU and other national projects research: HPC RIVR, EuroCC, DAPHNE, ... – e.g. scaling new systems of CI & Operational Research of ... over HPC • IEEE Computational Intelligence Task Force on Benchmarking • Scientific Journals (e.g. SWEVO, TEVC, JoCS, ASOC, INS) These contribute towards Sustainable Development of HPC. Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 47/123
  • 48. Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities Conferencia Invitada 15 September 2023 at 12:00 CEST University of Alicante (UA), Departement of Software and Computing Systems (DLSI) in Sala Ada Lovelace, UA DLSI, Alicante, Spain Hosted by Elena Lloret Pastor (Profesor/A Titular Universidad, UA DLSI) EuroHPC AI in DAPHNE and Text Summarization — Part III: HPC and AI Generative Models — Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 48/123
  • 49. Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities Conferencia Invitada 15 September 2023 at 12:00 CEST University of Alicante (UA), Departement of Software and Computing Systems (DLSI) in Sala Ada Lovelace, UA DLSI, Alicante, Spain Hosted by Elena Lloret Pastor (Profesor/A Titular Universidad, UA DLSI) EuroHPC AI in DAPHNE and Text Summarization — Part I: Generative AI — Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 49/123
  • 50. Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities Generative AI — Modalities & Access (HPC, H100) • Generative AI (GenAI) is being used for modalities such as • text generation using Transformers (like ChatGPT), • image generation using Stable Diffusion (like Midjouney and DALL-E), • and video speech generation (like Synthesia) • GenAI provided recent interesting applications served by HPC deployments (supported by e.g. NVIDIA H100). • Therefore, two of my models for Generative AI, • from Summarizer and TPP-PSADE@NPdynϵjDE, • extended to support HPC deployment using MPI, • are described in following & some results are presented. Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 50/123
  • 51. Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities Generative AI — Some Background • Early Learning to Learn, Google DeepMind after AlphaZero, deep RL algorithms • https://gecco-2019.sigevo.org/index.html/Keynotes [@aleszamuda/status/1150672932588462081: ”Learning to learn ...”] • Recent: with Reinforcement Learning (RL) trained Large Language Models (LLMs) using Deep Neural Networks (DNNs) — Transformers (replacing RNN LSTMs; by Google — 2017, Attention Is All You Need: https://arxiv.org/abs/1706.03762, Submitted on 12 Jun 2017 (v1) — for NIPS’17 in December (Jakob proposed replacing RNNs with self-attention and startedthe effort to evaluate this idea)) • A deployed LLM (Free Research Preview of ChatGPT May 24 Version, 2023.) GPT-4 Technical Report: https://arxiv.org/pdf/2303.08774.pdf • Sample LLM code (Transformers by Hugging Face), using Python3, AutoTokenizer, and google/flan-t5-base Transformers architecture Wikipedia (CC BY-SA 3.0), File:The- Transformer-model- architecture.png • My GenAI backgrounds come from (evolutionary) generation of 3D scenery sequences (animation, AL — Artificial Life) • In my 2020 journal article published with University of Alicante (w/ Elena Lloret), we demonstrated HPC importance in NLP performance impact (Summarizer — developed on SLING) • cites e.g. Salesforce Research’s NN paper on A Deep Reinforced Model for Abstractive Summarization, Submitted on 11 May 2017 (v1), https://arxiv.org/abs/1705.04304 Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 51/123
  • 52. Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities Conferencia Invitada 15 September 2023 at 12:00 CEST University of Alicante (UA), Departement of Software and Computing Systems (DLSI) in Sala Ada Lovelace, UA DLSI, Alicante, Spain Hosted by Elena Lloret Pastor (Profesor/A Titular Universidad, UA DLSI) EuroHPC AI in DAPHNE and Text Summarization — Part III: Language (1) — Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 52/123
  • 53. Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities HPC Application 1: Text Summarization • NLP and computational linguistics for Text Summarization: • Multi-Document Text Summarization is a hard CI challenge. • Basically, an evolutionary algorithm is applied for summarization, • it is a state-of-the-art topic of text summarization for NLP (part of ”Big Data”) and presented as a collaboration [JoCS2020], acknowledging several efforts. • we add: self-adaptation of optimization control parameters; analysis through benchmarking using HPC, and apply additional NLP tools. • How it works: for the abstract, sentences from original text are selected for full inclusion (extraction). • To extract a combination of sentences: • can be computationally demanding, • we use heuristic optimization, • the time to run optimization can be limited. Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 53/123
  • 54. Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities 1 – Preprocessing (environment sensing, knowledge representation) (1/2) 1) The files of documents are each taken through the following process using NLP (Natural Language Processing) tools: INPUT CORPUS NATURAL LANGUAGE PROCESSING ANALYSIS CONCEPTS DISTRIBUTION PER SENTENCES MATRIX OF CONCEPTS PROCESSING CORPUS PREPROCESSING PHASE OPTIMIZATION TASKS EXECUTION PHASE ASSEMBLE TASK DESCRIPTION SUBMIT TASKS TO PARALLEL EXECUTION OPTIMIZER + TASK DATA ROUGE EVALUATION 2) For each document is D, sentences are indexed using NLP tools. • Terms across sentences are determined using a semantic analysis using both: • coreference resolution (using WordNet) and • a Concept Matrix (from Freeling). Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 54/123
  • 55. Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities 1 – Preprocessing (environment sensing, knowledge representation) (2/2) • 3) For each i-th term (wi), during indexing • number of occurences in the text is gathered, and • number of occurences (nk) of a term in some k-th statement, • 4) For each term wi in the document, inverse frequency: isfw i = log( n nk ), • where n denotes number of statements in the document, and • nk number of statements including a term wi. • 5) To conclude preprocessing, for each term in the document, a weight is calculated: wi,k = tfi,kisfk, where tfik is number of occurences (term frequency) of a term wk in a statement si. Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 55/123
  • 56. Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities 2 – Summary Optimization (1/3) • Sentence combination (X) is optimized using jDE algorithm: • as a 0/1 knapsack problem, we want to include optimal selection of statements in the final output • an i-th sentence si is selected (xi = 1) or unselected (xi = 0). • a) Price of a knapsack (its fitness) should be maximized, • the fitness represents a ratio between content coverage, V(X), and redundancy, R(X): f(X) = V(X) R(X) , • considering a constraint: the summary length is L ± ϵ words. • Constraint handling with solutions: • each feasable solution is better than unfeasable, • unfeasable compared by constraint value (lower better), • feasable compared by fitness (higher better). Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 56/123
  • 57. Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities 2 – Summary Optimization (2/3) • b) Content coverage V(X) is computed as a double sum of similarities (defined at d)): V(X) = n−1 X i=1 n X j=i+1 (sim(si, O) + sim(sj, O))xi,j, • where xi,j denotes inclusion of both statement, si, and sj, • xi,j is only 1 if xi = xj = 1, otherwise 0, • and O is a vector of average term weights wi,k: O = (o1, o2, ..., om) for all i = {1..m} different text terms: oi = Pn j=1 wi,j n . • c) Redundance R(X) is also measured as double similarity (defined at d)) sum for all statements: R(X) = n−1 X i=1 n X j=i+1 sim(si, sj)xi,j, • where xi,j denotes inclusion of both statement, si, and sj, • again, xi,j is only 1 if xi = xj = 1, otherwise 0. Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 57/123
  • 58. Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities 2 – Summary Optimization (3/3) • d) Similarity between statements si = [wi,1, wi,2, ..., wi,m] and sj = [wj,1, wj,2, ..., wj,m] is computed: sim(si, sj) = m X k=1 wi,kwj,k qPm k=1 wi,kwi,k Pm k=1 wj,kwj,k , where wi,k is term weight (defined in 5)) and m number of all terms in text. • e) When concluded: • the selected statements from the best assessed combination are printed, • in order as they appear in the text, and • the summary is stored. Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 58/123
  • 59. Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities Summary Optimization — Algorithm Pseudocode The detailed new method called CaBiSDETS is developed in the HPC approach comprising of: • a version of evolutionary algorithm (Differential Evolution, DE), • self-adaptation, binarization, constraint adjusting, and some more pre-computation, • optimizing the inputs to define the summarization optimization model. Aleš Zamuda, Elena Lloret. Optimizing Data-Driven Models for Summarization as Parallel Tasks. Journal of Computational Science, 2020, vol. 42, pp. 101101. DOI 10.1016/j.jocs.2020.101101. Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 59/123
  • 60. Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities Running the Tasks on HPC: ARC Job Preparation & Submission, Results Retrieval & Merging [JoCS2020] Through an HPC approach and by parallelization of tasks, a data-driven summarization model optimization yields improved benchmark metric results (drawn using gnuplot merge). Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 60/123
  • 61. Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities Results Published in Journal of Computational Science The most interesting finding of the HPC study though is that • the fitness of the NLP model keeps increasing with prolonging the dedicated HPC resources (see below) and that • the fitness improvement correlates with ROUGE evaluation in the benchmark, i.e. better summaries. -0.05 0 0.05 0.1 0.15 0.2 0.25 0.3 0.35 0.4 1 10 100 1000 10000 ROUGE-1R ROUGE-2R ROUGE-LR ROUGE-SU4R Fitness (scaled) Hence, the use of HPC significantly contributes to capability of this NLP challenge. Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 61/123
  • 62. Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities Summarization Implementation — Parallel • In text summarization, once the text model has been built, • different lengths of summaries can be created with Summarizer using CaBiSDETS algorithm in parallel. • Generates large summaries (more output tokens than single prebuilt LLMs) & accepts long task inputs (no pre-clustering). • Results are explainable, tracable (no hallucinated content/citations), and automatic (no manual RL scoring). • But moreover from HPC & Big-Data perspective, • the input text can be preprocessed in parallel • by computing the cosine sentence similarity pairs in parallel using MPI in an integrated pipeline, • before the generation of summaries commences and • just before a complete parameterization of the summary generation process. • During summary optimization, the fitness function evaluations can be run in parallel. Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 62/123
  • 63. Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities Summarization Task Configuration & Execution • An example task for Summarizer computational architecture • based on a recent lecturing material, PDF −→ text (language), on: • Optimization Algorithms and Autonomous Systems • Las Palmas de Gran Canaria, March 2023. • https://www.slideshare.net/AlesZamuda/ulpgc2023erasmuslecturesaleszamuda systemstheoryintelligentautonomoussystemseswamllsgohpcdeugpppdf • for Summarizer runs, the parameters were: --GMAX 1000 --NP 319 --summarylength 500 --epsilonLengthSummary 20 • For the job execution, the configuration for SLURM was: srun --ntasks-per-node=16 --mpi=pmix ./summarizer.sif ./summarizer Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 63/123
  • 64. Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities Summarization Job Speedup Results • Speedup comparison: SLURM --nodes parameter = 1, 2, 3, 4, or 5 • obtained timings on this scaling as seen on the graphs below. 150 200 250 300 350 400 450 500 550 16 32 48 64 80 Seconds to compute a workload Number of tasks (equals 16 times the SLURM --nodes parameter) Summarizer workload 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 16 32 48 64 80 Seconds to compute a workload Number of tasks (equals 16 times the SLURM --nodes parameter) Summarizer workload 1 2 3 4 5 16 32 48 64 80 Workload scaling (wall time) Number of tasks (equals 16 times the SLURM --nodes parameter) Summarizer workload Resources Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 64/123
  • 65. Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities Conferencia Invitada 15 September 2023 at 12:00 CEST University of Alicante (UA), Departement of Software and Computing Systems (DLSI) in Sala Ada Lovelace, UA DLSI, Alicante, Spain Hosted by Elena Lloret Pastor (Profesor/A Titular Universidad, UA DLSI) EuroHPC AI in DAPHNE and Text Summarization — Part III: Video (2) — Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 65/123
  • 66. Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities Introduction and Main Goals • Plant animation in emergent ecosystems • Plant morphology reconstruction • from real photography (through evolutionary optimization) Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 66/123
  • 67. Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities Ecosystem Animation and Simulation: Thousands of Trees 1: algorithm ecosystem simulation Require: v - plant species list; r - plant list for each species; f - living condition factors on terrain; Ensure: ecosystem afforestation simulation 2: loop 3: add new plants to species(v, r); 4: grow all plants(r, f); 5: remove dead plants(r); 6: end loop Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 67/123
  • 68. Introduction DAPHNE Background EuroHPC Vega & AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega &,Deploying DAPHNE HPC & GenAI Language Video Machine Power Opportunities Ecosystem Afforestation: Terrain Models • Tree models are put to terrain based on ecosystem growth pi,k = xi yi,k zk T , i, k ∈ [0, 99]. • Power: fitness (height, slope, moisture, sunniness, windiness), age, growth. as,p = vs,phs,par;p, ar;p = tf;s max s {tf;s} , vs,p = ky;s,p + ms,p + ls,p + ws,p + ss,p 5 . Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 68/123
  • 69. Introduction DAPHNE Background EuroHPC Vega AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC GenAI Language Video Machine Power Opportunities Another Ecosystem Scenario: Interactive Breeding Motivation: interactive ecosystems breeding (von Mammen, 2009) Vir: von Mammen (2009) The approach – 6 modeled operators in EcoMod. Operators of selection, crossover, and mutation. Operators over terrain and environmental conditions. Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 69/123
  • 70. Introduction DAPHNE Background EuroHPC Vega AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC GenAI Language Video Machine Power Opportunities Spatial Morphological Tree Model Reconstruction • New approach for construction of trees • three-dimensional spatial models, • in computer graphics and animation, • the user had to sketch basic branches. • Our tree reconstruction includes: • evolutionary algorithms and • procedural modelling of trees. Source: → CEC 2009 • An L-systems approach used procedural models in a 2D plane, we extend it: • on 3D procedural models and • more complex trees. • Our approach combines open-sources: • ecosystem framework EcoMod and • algorithm MOjDE (DEMOwSA + jDE). Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 70/123
  • 71. Introduction DAPHNE Background EuroHPC Vega AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC GenAI Language Video Machine Power Opportunities Woody Plants Procedural Model • 3D tree models are compactly represented using a procedure • our EcoMod framework uses a numerically coded procedural model with fixed dimensionality • suitable for parameter estimation using DE/MOjDE. • Parameterized procedural model builds a 3D structure of a tree and all its building parts: • by recursively executing a fixed procedure, • over a set of numerically coded input parameters, • e.g. branch thickness, relative length, and branching. Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 71/123
  • 72. Introduction DAPHNE Background EuroHPC Vega AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC GenAI Language Video Machine Power Opportunities Trees Representable by EcoMod Framework • Foliage or coniferous trees with very different branching structures, • each branch and each leaf can be animated in real time to show the growth of a tree or its sway in the wind. Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 72/123
  • 73. Introduction DAPHNE Background EuroHPC Vega AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC GenAI Language Video Machine Power Opportunities 1: procedure branchsegment(g, w, S0, L0, l0, M0, M−1 m;0) Require: g, w - Gravelius and Weibull index of base branch; S0 - number of strands in base branch; L0, l0 - base branch relative and actual length; M0 - base branch coordinate system; M−1 m;0 - inverse matrix of rotations for gravimorphism in coordinate system for base branch; global (i.e. part of breeder) kd, kc, ltype, k g,w s , Mg,w , mg,w , k g,w l , α g,w m , αg,w , t, kf, ws, wg Ensure: rendered tree image 2: d := kd p S0; {thickness calculation from Mandelbrot} 3: render base branch(M0, l0, d); 4: if S0 = 1 then 5: render leaves(ltype); return; 6: end if 7: S1 := l 1 + k g,w s (S0 − 2) m , S2 = S0 − S1; {strands} 8: r1 := max min r S1 S0 , Mg,w , mg,w ; {branch length} 9: r2 := max min r S2 S0 , Mg,w , mg,w ; 10: L1 := r1L0, L2 := r2L0; {relative lengths of subbranches} 11: l1 := k g,w l L1, l2 := k g,w l L2; {active subbranch lengths} 12: α1 := kc r S2 S0 αg,w , α2 := αg,w − α1; {branching angles} 13: M1 := Rz(α1)Ry(αp)Ry×ym (α g,w m )Ty(l0)M0; {transform} 14: M2 := Rz(α2)Ry(αp)Ry×ym (α g,w m )Ty(l0)M0; 15: M−1 m;1 := Ry×ym (−α g,w m )Ry(−αp)Rx(−αx(t))Rz(−α1 − αz(t))M−1 m;0 ; {refreshing inverse matrix} 16: M−1 m;2 := Ry×ym (−α g,w m )Ry(−αp)Rx(−αx(t))Rz(−α2 − αz(t))M−1 m;0 ; 17: branchsegment(g + 1, w + 1, S2, L2, l2, M2, M−1 m;2 ); {minor branch development} 18: branchsegment(g, w + 1, S1, L1, l1, M1, M−1 m;1 ); {major branch development} 19: return; {from recursive procedure call} Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 73/123
  • 74. Introduction DAPHNE Background EuroHPC Vega AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC GenAI Language Video Machine Power Opportunities Ecosystem Afforestation Geometry Video Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 74/123
  • 75. Introduction DAPHNE Background EuroHPC Vega AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC GenAI Language Video Machine Power Opportunities Image-based Approaches to Automatic Tree Modeling • Image-based approaches have the best potential to produce realistically looking plants • they rely on images of real plants. • Little work has been done to design trees with the use of a general reconstruction from images without user interaction • use of sketch based guide techniques or • the procedural models reconstructed were two-dimensional. • We now extended this recognition to the domain of 3D procedural models • suitable to model woody plants without user interaction. Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 75/123
  • 76. Introduction DAPHNE Background EuroHPC Vega AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC GenAI Language Video Machine Power Opportunities Tree Model Reconstruction Innovization Using Multi-objective Differential Evolution • Based on an optimization procedure with three main parts: • Part I: genotype encoding, • Part II: genotype-phenotype mapping, and • Part III: fitness evaluation: • phenotype and reference image comparison. Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 76/123
  • 77. Introduction DAPHNE Background EuroHPC Vega AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC GenAI Language Video Machine Power Opportunities Basic Genotype Encoding • An individual genotype vector x of a DE population represents a set of procedural model parameters, • by computing recursive procedure using a set of parameters, EcoMod renders a tree (woody plant), • dimensionality of the genotype x is D = 4509, • where g ∈ {0, G = 15}, w ∈ {0, W = 50}, and • each local G × W = 750 real-coded parameter encodes: one matrix of a Gravelius and Weibull ordered parameter for recursive calculations, and • all xi,j ∈ [0, 1], i ∈ {1, 2, ..., NP} and j ∈ {1, 2, ..., D} are linearly normalized by scaling in the [0,1] interval. Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 77/123
  • 78. Introduction DAPHNE Background EuroHPC Vega AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC GenAI Language Video Machine Power Opportunities Bounds and Scaling of Genotype-encoded Parameters xi,j Parameter Formula Interval Number of strands in a tree (tree com- plexity) S = 400xi,0 + 10 S ∈ [10, 410] Height of base trunk l 0,0 0 = 10xi,1 l 0,0 0 ∈ [0 m, 10 m] Coefficient of branch thickness kd = 0.05xi,2 kd ∈ [0, 0.05] Phyllotaxis angle αp = 360xi,3 αp ∈ [0◦ , 360◦ ] Branching ratio of subbranch strands distribution k g,w s = 0.5xi,j + 0.5, ∀j ∈ [4, 753] k g,w s ∈ [ 1 2 , 1] Branching angle between dividing sub- branches αg,w = 180xi,j ∀j ∈ [754, 1503] αg,w ∈ 0◦ , 180◦ Maximum relative sub-branch to base branch length Mg,w = 20xi,j ∀j ∈ [1504, 2253] Mg,w ∈ [0, 20] Minimum relative sub-branch to base branch length mg,w = 20xi,j ∀j ∈ [2254, 3003] mg,w ∈ [0, 20] Branch length scaling factor k g,w l = 20xi,j, ∀j ∈ [3004, 3753] k g,w l ∈ [0, 20] Gravicentralism impact kc = xi,3754 kc ∈ [0, 1] Gravimorphism impact (i.e. gravitational bending of branches) α g,w m = 360xi,j − 180, ∀j ∈ [3755, 4504] α g,w m ∈ [−180◦ , 180◦ ] Enabling leaves display on a tree Bl = xi,4505 0.5?0 : 1 Bl ∈ {0, 1} Size of leaves ll = 0.3xi,4506 ll ∈ [0, 0.3] Density of leaves ρl = 30xi,4507 ρl ∈ [0, 30] Leaf distribution type ltype = 5xi,4508 Spiral, Stacked, Stagg- ered, Bunched, or Conif- erous Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 78/123
  • 79. Introduction DAPHNE Background EuroHPC Vega AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC GenAI Language Video Machine Power Opportunities Advanced Genotype Encoding: Auxiliary Local Parameters get Vectorized Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 79/123
  • 80. Introduction DAPHNE Background EuroHPC Vega AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC GenAI Language Video Machine Power Opportunities Genotype-phenotype Mapping • Reconstruction method is based on reconstruction of two-dimensional images of woody plants z∗ (photo), • to compare the three-dimensional tree evolved with the use of genotype x to the reference image z∗, genotype x must be transformed to its phenotype first, • phenotype is a rendered two-dimensional image z, • images z∗ and z are all of dimensionality X × Y pixels, • the reference image is scaled to the given resolution, if necessary. • both images are converted to black and white, where white (0) pixels mark background and black (1) pixels mark material, e.g. wood, • An evolved procedural model is rendered for comparison twice • to favor three-dimensional procedural models generation, • projections differ by β = 90◦ camera view angle along the trunk base (i.e. z axis for OpenGL). Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 80/123
  • 81. Introduction DAPHNE Background EuroHPC Vega AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC GenAI Language Video Machine Power Opportunities Phenotype and Reference Image Comparison • The recognition success is measured by similarity of • the reference original images (2D) and • the rendered image (2D) projections of evolved parametrized procedural models. • Images are compared pixel-wise by e.g. two criteria: 1 in the evolved image, for each pixel rendered as material (1): • the Manhattan distance to the nearest material pixel in the reference image is computed • and vice-versa, for each material (1) pixel of an evolved model image, 2 count of differing pixels (0/1) among comparing images. Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 81/123
  • 82. Introduction DAPHNE Background EuroHPC Vega AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC GenAI Language Video Machine Power Opportunities 1: procedure MO reconstruction(z∗) Require: S0 - maximum number of strands in base branch; also, other default parameters for MOjDE and EcoMod Ensure: Pareto set of reconstructed parameterized procedural 3D woody plant models 2: uniform randomly generate DE initial population xi,0 ∈ [0, 1] for i = 1..NP; 3: for DE generation loop g (while FEs 10000) do 4: for DE iteration loop i (for all individuals xi,g of a population) do 5: DE individual xi,g creation (adaptation, mutation, crossover): 6: Fi,G+1 = ( Fl + rand1 × Fu if rand2 τ1, Fi,G otherwise ; CRi,G+1 = ( rand3 if rand4 τ2, CRi,G otherwise ; 8: vi,G+1 = xr1,G + Fi,G+1(xr2,G − xr3,G); 9: ui,j,G+1 = ( vi,j,G+1 if rand(0, 1) ≤ CRi,G+1 or j = jrand xi,j,G otherwise ; 10: DE fitness evaluation (genotype-phenotype mapping, rendering, and comparison): 11: z1 = g(ui,g, β1), z2 = g(ui,g, β2) {Execute Algorithm branchsegment twice} 12: h1(z1) = P x,y m1(z1 x,y, z∗ x,y) + P x,y m1(z∗ x,y, z1 x,y); {First difference metric, at 0◦ } 13: h1(z2) = P x,y m1(z2 x,y, z∗ x,y) + P x,y m1(z∗ x,y, z2 x,y); {First difference metric, at 90◦ } 14: f1(x) = f(g(x, β1), g(x, β2)) = h1(z1) + h1(z2); {Fitness evaluation, 1st criterion} 15: h2(z1) = P x,y w(z1 x,y, z∗ x,y) + P x,y w(z∗ x,y, z1 x,y); {Second difference metric, 0◦ } 16: h2(z2) = P x,y w(z2 x,y, z∗ x,y) + P x,y w(z∗ x,y, z2 x,y); {Second difference metric, 90◦ } 17: f2(x) = f(g(x, β1), g(x, β2)) = h2(z1) + h2(z2); {Fitness evaluation, 2nd criterion} 18: f(x) = {f1(x), f2(x)}; {Fitness evaluation, all criteria combined done} 19: DE selection: 20: xi,G+1 = ( ui,G+1 if f(ui,G+1) ⪯ f(xi,G) xi,G otherwise ; {Multi-objective comparison operator} 21: if not (ui,G+1 ⪯ xi,G or xi,G ⪯ ui,G+1 ) then add ui,G+1 to population archive; 22: end for 23: Truncate DE population archive to a size of NP using SPEA2 mechanism. 24: end for 25: return the best individuals obtained; Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 82/123
  • 83. Introduction DAPHNE Background EuroHPC Vega AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC GenAI Language Video Machine Power Opportunities Advanced Approach (INS2014): Overview → Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 83/123
  • 84. Introduction DAPHNE Background EuroHPC Vega AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC GenAI Language Video Machine Power Opportunities More: See Published Articles INS2014 A. Zamuda and J. Brest. Vectorized Procedural Models for Animated Trees Reconstruction using Differential Evolution. Information Sciences 2014, vol. 278, pp. 1-21. DOI: 10.1016/j.ins.2014.04.037. IF2014 = 4.038 (Q1). INS2013 A. Zamuda and J. Brest. Environmental Framework to Visualize Emergent Artificial Forest Ecosystems. Information Sciences 220:522–540. 2013. DOI: 10.1016/j.ins.2012.07.031. IF2013 = 3.893 (Q1). ASC2011 A. Zamuda, J. Brest, B. Bošković and V. Žumer. Differential Evolution for Parameterized Procedural Woody Plant Models Reconstruction. Applied Soft Computing 11(8):4904–4912. 2011. DOI: 10.1016/j.asoc.2011.06.009. IF2011 = 2.612 (Q1). CEC2012 A. Zamuda, J. Brest. Tree Model Reconstruction Innovization Using Multi-objective Differential Evolution. 2012 IEEE World Congress on Computational Intelligence (IEEE WCCI 2012), Brisbane, Australia, 2012, pp. 575-582. Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 84/123
  • 85. Introduction DAPHNE Background EuroHPC Vega AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC GenAI Language Video Machine Power Opportunities Tree Reconstruction: Summary Conclusions • An approach to design woody plant geometrical models [ASOC2011], • rendered images are compared to the reference source images, for reconstruction, to guide the optimization process, • sampled randomly to reconstruct geometrical models, • procedural models are rendered using EcoMod framework [INS2013], • renderings of sample evolved models, • parameters of the procedural model are iteratively evolved using multi-objective differential evolution MOjDE algorithm [CEC2012] • fitness is evaluated by two criteria, which are not pre-weighted, • multi-objective optimization obtains multiple criteria trade-offs shown using: • attainment surfaces (trade-offs distribution), and • rendered final approximation set models. • The advanced approach [INS2014] adds • real images preprocessing (tree trunk and crown extraction), • post-effects on reconstructed geometry (growth, wind, leafs). Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 85/123
  • 86. Introduction DAPHNE Background EuroHPC Vega AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC GenAI Language Video Machine Power Opportunities Conferencia Invitada 15 September 2023 at 12:00 CEST University of Alicante (UA), Departement of Software and Computing Systems (DLSI) in Sala Ada Lovelace, UA DLSI, Alicante, Spain Hosted by Elena Lloret Pastor (Profesor/A Titular Universidad, UA DLSI) EuroHPC AI in DAPHNE and Text Summarization — Part III: Machine (3) — Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 86/123
  • 87. Introduction DAPHNE Background EuroHPC Vega AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC GenAI Language Video Machine Power Opportunities HPC Application 2: Underwater Glider: Autonomous, Unmanned, Robotic • underwater glider – navigating sea oceans, • Autonomous Underwater Vehicle (AUV) ̸= Unmanned Aerial Vehicle (UAV) • AUV Slocum model (expertise in domain of ULPGC, work with J. D. Hernández Sosa) Images: ”Photo: Richard Watt/MOD” (License: OGL v1.0) Slocum-Glider-Auvpicture 5.jpg (License: Public Domain) MiniU.jpg (License: CC-BY-SA 3.0) Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 87/123
  • 88. Introduction DAPHNE Background EuroHPC Vega AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC GenAI Language Video Machine Power Opportunities The Buoyancy Drive and Submarine Probes Usefulness • Driving ”yoyo” uses little energy, most only on descent and rise (pump); also for maintaining direction little power is consumed. + Use: improving ocean models with real data, + the real data at the point of capture, + sampling flow of oil discharges, + monitoring cable lines, and + real-time monitoring of different sensor data. 1 http://spectrum.ieee.org/image/1523708 Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 88/123
  • 89. Introduction DAPHNE Background EuroHPC Vega AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC GenAI Language Video Machine Power Opportunities Preparations – Simulation Scenarios https://www.google.si/maps/@28.059806,-15.998355,650054m/data=!3m1!1e3 Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 89/123
  • 90. Introduction DAPHNE Background EuroHPC Vega AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC GenAI Language Video Machine Power Opportunities Trajectory Optimization: P201,ESTOC2013 3 + BigData, MyOcean IBI, satelite link, GPS location The real trajectory and collected data is available in a Google Earth KML file at the EGO network: http://www.ego-network.org/dokuwiki/doku.php?glider=P201,ESTOC2013_3 Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 90/123
  • 91. Introduction DAPHNE Background EuroHPC Vega AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC GenAI Language Video Machine Power Opportunities Constrained Differential Evolution Optimization for Underwater Glider Path Planning in Sub-mesoscale Eddy Sampling • Corridor-constrained optimization: eddy border region sampling • new challenge for UGPP DE • Feasible path area is constrained • trajectory in corridor around the border of an ocean eddy The objective of the glider here is to sample the oceanographic variables more efficiently, while keeping a bounded trajectory Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 91/123
  • 92. Introduction DAPHNE Background EuroHPC Vega AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC GenAI Language Video Machine Power Opportunities Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 92/123
  • 93. Introduction DAPHNE Background EuroHPC Vega AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC GenAI Language Video Machine Power Opportunities Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 93/123
  • 94. Introduction DAPHNE Background EuroHPC Vega AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC GenAI Language Video Machine Power Opportunities Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 94/123
  • 95. Introduction DAPHNE Background EuroHPC Vega AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC GenAI Language Video Machine Power Opportunities Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 95/123
  • 96. Introduction DAPHNE Background EuroHPC Vega AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC GenAI Language Video Machine Power Opportunities HoP — New Trajectories: Success history applied to expert system for underwater glider path planning using differential evolution • Improved underwater glider path planning mission scenarios: optimization with L-SHADE. • Several configured algorithms are also compared to, analysed, and further improved. • Outranked all other previous results from literature and ranked first in comparison. • New algorithm yielded practically stable and competitive output trajectories. Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 96/123
  • 97. Introduction DAPHNE Background EuroHPC Vega AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC GenAI Language Video Machine Power Opportunities Ranking UGPP — Benchmarking Aggregation • Statistically, all results from previous paper were outperformed. • Main reasons: tuning (NP), parameter control (L-SHADE). Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 97/123
  • 98. Introduction DAPHNE Background EuroHPC Vega AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC GenAI Language Video Machine Power Opportunities Conferencia Invitada 15 September 2023 at 12:00 CEST University of Alicante (UA), Departement of Software and Computing Systems (DLSI) in Sala Ada Lovelace, UA DLSI, Alicante, Spain Hosted by Elena Lloret Pastor (Profesor/A Titular Universidad, UA DLSI) EuroHPC AI in DAPHNE and Text Summarization — Part III: Power (4) — Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 98/123
  • 99. Introduction DAPHNE Background EuroHPC Vega AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC GenAI Language Video Machine Power Opportunities HTS: Hydro Power Plants (HPPs) and Thermal Power Plants (TPPs) Scheduling – Introduction Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 99/123
  • 100. Introduction DAPHNE Background EuroHPC Vega AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC GenAI Language Video Machine Power Opportunities Hydro and Thermal Power Plants Systems Model: Nomenclature Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 100/123
  • 101. Introduction DAPHNE Background EuroHPC Vega AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC GenAI Language Video Machine Power Opportunities Hydro and Thermal Power Plants Systems Model: Definitions Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 101/123
  • 102. Introduction DAPHNE Background EuroHPC Vega AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC GenAI Language Video Machine Power Opportunities Hydro and Thermal Power Plants Systems Model: Models • The application contributes to algorithms development, with the aims of: • improving performance of the electrical energy production and • emissions and carbon footprint reduction (thermal units), • while simultaneously satisfying a 24-hour system demands in scheduling energy demand and all other operational requirements. Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 102/123
  • 103. Introduction DAPHNE Background EuroHPC Vega AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC GenAI Language Video Machine Power Opportunities Hydro and Thermal Power Plants Systems Model: Nomenclature and Definitions • Equality constraints for energy production (scheduling). • Constraints handling during optimization: ϵ-comparison. • Algorithm output: 24-hour settings plan for TPPs and HPPs. Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 103/123
  • 104. Introduction DAPHNE Background EuroHPC Vega AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC GenAI Language Video Machine Power Opportunities Additional DE Mechanisms and Parallelization of HTS • Population size (NP) adjustment, multilevel parallelization, • sub-population gathering and BmW offspring strategy: • A. Glotić, A. Glotić, P. Kitak, J. Pihler, I. Tičar. Parallel self-adaptive differential evolution algorithm for solving short-term hydro scheduling problem. IEEE Transactions on Power Systems 2014, 29 (5), pp. 2347–2358. Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 104/123
  • 105. Introduction DAPHNE Background EuroHPC Vega AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC GenAI Language Video Machine Power Opportunities HTS Parallelization and Pre-processing (1/3) • New approach in optimization for scheduling energy production among units of HPPs (hydro) and TPPs (thermo). • The approach allows a faster computation than before: • 1) The DE for HTS is parallelized. • 2) The TPPs optimization part features a novel architecture, including a pre-computed surrogate model, • this model is same during optimization of the whole HTS optimized model for hydro and thermo units (pre-processing), and • obtained parameter values (x) of the TPPs surrogate model on practical accuracy are stored for re-use in the global HTS optimization. Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 105/123
  • 106. Introduction DAPHNE Background EuroHPC Vega AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC GenAI Language Video Machine Power Opportunities HTS Parallelization and Pre-processing (2/3) • Two algorithms are built for this approach: • the first algorithm (NPdynϵjDE) addresses a specialized treatment of constraints handling and optimizes scheduling for TPPs – thermo units parameters are pre-computed up to practical accuracy, • the second algorithm (PSADEs) utilizes the output of the first algorithm, in order to optimize combined production of hydro units, where hydro units settings are probed and thermo units settings are looked-up in the surrogate model matrix output from the NPdynϵjDE; • here, both algorithms use practical accuracy of the parameters for time schedule of the thermo units load. Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 106/123
  • 107. Introduction DAPHNE Background EuroHPC Vega AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC GenAI Language Video Machine Power Opportunities HTS Parallelization and Pre-processing (3/3) • The results of testing this approach on established HTS benchmarks from literature show: a larger performance improvement on all scenarios under all criteria, compared to the approaches known before. (Still.) • Literature article with detailed results coverage follows: A. Glotić, A. Zamuda. Short-term combined economic and emission hydrothermal optimization by surrogate differential evolution. Applied Energy, 1 March 2015, vol. 141, pp. 42-56. DOI 10.1016/j.apenergy.2014.12.020. IF=5.613 Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 107/123
  • 108. Introduction DAPHNE Background EuroHPC Vega AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC GenAI Language Video Machine Power Opportunities HTS Optimization: New DE Algorithms Architecture Surrogate Matrix – Input and Output Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 108/123
  • 109. Introduction DAPHNE Background EuroHPC Vega AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC GenAI Language Video Machine Power Opportunities HTS Optimization: New DE Algorithms Architecture Fitness Function and Practical Accuracy (discretized) Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 109/123
  • 110. Introduction DAPHNE Background EuroHPC Vega AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC GenAI Language Video Machine Power Opportunities HTS Optimization: New DE Algorithms Architecture Fitness Function and Practical Accuracy: the Algorithm Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 110/123
  • 111. Introduction DAPHNE Background EuroHPC Vega AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC GenAI Language Video Machine Power Opportunities Results and Comparisons (1/2) ELS @NPdynϵjDE EES @NPdynϵjDE CEES @NPdynϵjDE Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 111/123
  • 112. Introduction DAPHNE Background EuroHPC Vega AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC GenAI Language Video Machine Power Opportunities Results and Comparisons (2/2) • Results on different types of scheduling, compared to best works from literature: the proposed approach outperforms all by far, for all models: ELS, EES, and CEES, • This approach exhibits as well the lowest time latency (due to PSADEs parallelization and NPdynϵjDE pre-processing). Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 112/123
  • 113. Introduction DAPHNE Background EuroHPC Vega AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC GenAI Language Video Machine Power Opportunities A. Glotić, A. Zamuda. Short-term combined economic and emission hydrothermal optimization by surrogate differential evolution. Applied Energy, 1 March 2015, vol. 141, pp. 42-56. DOI 10.1016/j.apenergy.2014.12.020. IF=5.613 Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 113/123
  • 114. Introduction DAPHNE Background EuroHPC Vega AI AI Challenges Shortlist HPC Initiatives EuroHPC Vega ,Deploying DAPHNE HPC GenAI Language Video Machine Power Opportunities HTS — HPC-deployable Results • When generating energy plans, • like load balancing schedules for energy power plants, • data analytics and machine learning algorithm can be used (PSADE@NPdynϵjDE [1]). • Here, a surrogated [1] part of computation for the whole problem is split non-trivially [1] • and offloaded to an offline pre-computation phase. • In this initial phase, an HPC can be used to generate a lookup matrix, included as a surrogate matrix during the next, online optimization phase. • Experiment: an example set of 86500 long runs (each 1 million fitness evaluations) • to generate a surrogate matrix with 0.01 MW granularity step • for power demands from 110 MW to 975 MW and 0.01 MW apart • from 106 simulations for each of 86500 power demand scenarios. 0 1 2 3 4 5 6 7 8 9 Combined power from 110 MW to 975 MW, step 0.01 MW#104 0 100 200 300 400 500 600 700 Individual output (power [MW] or unit total cost [$]) Cost, TC / 3 Powerplant P1 power Powerplant P2 power Powerplant P3 power [1] A. Glotić, A. Zamuda. Short-term combined economic and emission hydrothermal optimization by surrogate differential evolution. Applied Energy, 1 March 2015, vol. 141, pp. 42-56. DOI 10.1016/j.apenergy.2014.12.020. Aleš Zamuda 7@aleszamuda EuroHPC AI in DAPHNE and Text Summarization @ UA DLSI, 15 September 2023 114/123