AIXMOOC 2.5 lucabenini
Generative Artificial Intelligence
[Diagram: input tokens Tk[0..k] → model with weights W → P(Tk[k+1]), the probability of the next token Tk[k+1]]
Large Language Models @[2022…today]
Weights & FLOPs
Weights → 0.12, 0.04, …0.81
FLOPs → (+,*)
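A minimal sketch of how weights and FLOPs fit together, assuming a toy one-layer model: the weights W are plain numbers, and turning a context into next-token probabilities costs one multiply and one add per weight. The function name `next_token_probs`, the 2x3 weight matrix, and the context vector are hypothetical, chosen only for illustration; real LLMs stack many such layers.

```python
import math

def next_token_probs(weights, context_vec):
    """Toy one-layer model: returns (probabilities, flop_count)."""
    flops = 0
    logits = []
    for row in weights:
        acc = 0.0
        for w, x in zip(row, context_vec):
            acc += w * x   # one multiply (*) and one add (+)
            flops += 2
        logits.append(acc)
    # Softmax turns the raw scores into probabilities that sum to 1.
    # (Its own operations are not counted in flops, to keep the sketch simple.)
    top = max(logits)
    exps = [math.exp(score - top) for score in logits]
    total = sum(exps)
    return [e / total for e in exps], flops

# Hypothetical weights: 2 candidate next tokens, 3 context features.
W = [[0.12, 0.04, 0.81],
     [0.50, 0.33, 0.07]]
x = [1.0, 0.0, 2.0]   # hypothetical encoding of Tk[0..k]

probs, flops = next_token_probs(W, x)
print(probs, flops)   # flops is 12: 6 weights, 2 operations each
```

The FLOP count grows linearly with the number of weights, which is why the two quantities are discussed together.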
Training
~54k GPUs (1st Top500)
~14k GPUs
~100k GPUs
#Weights ∝ Billions
#FLOPs ∝ Millions of Billions
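To put the two orders of magnitude side by side, here is a rough worked example. It assumes the common rule of thumb that a dense model spends about one multiply and one add per weight for each token it processes; that rule, the 70-billion-weight example size, and the helper name `forward_flops_per_token` are assumptions for illustration, not figures from the slide.

```python
BILLION = 10**9
MILLION_OF_BILLIONS = 10**6 * BILLION   # 10**15

def forward_flops_per_token(num_weights):
    # Assumption: each weight takes part in one multiply and one add.
    return 2 * num_weights

num_weights = 70 * BILLION              # hypothetical model size
flops = forward_flops_per_token(num_weights)
tokens_per_budget = MILLION_OF_BILLIONS // flops

print(f"{flops:.1e} FLOPs per token")
print(tokens_per_budget, "tokens per 10^15 FLOPs")
```

Under these assumptions, a budget of a million billion operations covers only a few thousand tokens for a model of this size.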
Training and Inference
Training: ~54k GPUs (1st Top500), ~14k GPUs, ~100k GPUs
Inference: ~4-8 GPUs (70B Llama 3-family model)
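A back-of-the-envelope sketch of why a 70B model needs several GPUs just to run: the weights alone must fit in GPU memory. The 2 bytes per weight (16-bit storage) and 80 GB per GPU are assumptions, and the helper `min_gpus_for_weights` is hypothetical. The estimate ignores activations and caches, so real deployments need more headroom than this lower bound.

```python
import math

def min_gpus_for_weights(num_weights, bytes_per_weight=2, gpu_memory_gb=80):
    """Lower bound on GPUs needed to hold the weights alone."""
    weight_gb = num_weights * bytes_per_weight / 1e9
    return weight_gb, math.ceil(weight_gb / gpu_memory_gb)

# 70 billion weights, assumed 16-bit storage, assumed 80 GB per GPU.
weight_gb, gpus = min_gpus_for_weights(70e9)
print(f"{weight_gb:.0f} GB of weights, at least {gpus} GPUs")
```

The lower bound sits below the ~4-8 GPUs quoted above, which is plausible once working memory and throughput requirements are added.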
FLOPs == (+,*)
NVIDIA H100 GPU
• ~2 million billion (+,*) per second
• 700 W (~1/2 of a hair dryer)
• ~30k€ (about a new VW Golf)
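The per-GPU figures above can be scaled to cluster size with simple arithmetic. This sketch uses only the slide's approximate numbers (2 million billion operations per second, 700 W, 30k€) and the ~100k-GPU cluster size mentioned for training. The helper `cluster_totals` is hypothetical, and the totals ignore networking, cooling, and real-world utilization.

```python
OPS_PER_SECOND = 2e15    # ~2 million billion (+,*) per second per GPU
POWER_WATTS = 700        # per GPU
PRICE_EUR = 30_000       # per GPU, approximate

def cluster_totals(num_gpus):
    """Scale the per-GPU figures linearly to a cluster (idealized)."""
    return {
        "ops_per_second": num_gpus * OPS_PER_SECOND,
        "power_megawatts": num_gpus * POWER_WATTS / 1e6,
        "price_million_eur": num_gpus * PRICE_EUR / 1e6,
    }

joules_per_op = POWER_WATTS / OPS_PER_SECOND   # energy per single (+,*)
totals = cluster_totals(100_000)

print(f"{joules_per_op:.1e} J per operation")
print(totals)
```

With these inputs, a 100k-GPU cluster draws about 70 MW for the GPUs alone and costs on the order of 3 billion euros in GPUs.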