SlideShare a Scribd company logo
1 of 58
Download to read offline
Hyper realistic future landscape horizon
Computing large
Foundational models
PROPRIETARY & CONFIDENTIAL 2
Generative AI’s impact
and capabilities are just
getting started
PROPRIETARY & CONFIDENTIAL
Internet 1997 - 2021 AI 2021 - 2030 (estimate)
Line drawing of the Guggenheim
PROPRIETARY
&
CONFIDENTIAL
3
Cathie Wood: AI market
could be $87 Trillion
$90T
Enterprise value ($T)
AI
applications
Foundation AI model-
as-a-service APIs
AI hardware
AI software
PROPRIETARY & CONFIDENTIAL
AI age
Computer age
4
Internet age
PROPRIETARY & CONFIDENTIAL 5
The cost & time to produce any
content is rapidly approaching
zero
New classes of AI models are increasingly cheaper and better across
all modalities
Classic western movie
30.0s | 2¢
To generate a high quality
image with text
0.5s | 0.2¢
8.0s | 2¢
To generate 750 words
of human level text
Price and times quoted are industry benchmarks and not meant to be specific to Stability
5
6.0s | 0.2¢
To generate 750 words
of human level text
To generate a high quality
image with text
2022 2023
PROPRIETARY & CONFIDENTIAL
Creative agencies face a multi-
billion dollar opportunity to
embrace AI or get left behind
6
Create more surreal experiences
Generative AI enables them to…
Serve more clients
Expand margins
AI supercharges creativity and output for every member
of the team so they can do more of their best work
Rapidly prototype and blend new creative ideas
across all content formats
By being able to…
Generate 100s of variants
in seconds and
maximize inspiration
Write proposals 3x faster
Enter novel creative realms
Create campaigns
personalized to each
client customer
Futuristic city with towering skyscrapers
PROPRIETARY & CONFIDENTIAL
Companies across the media landscape are
already racing to be the first to adopt
7
Next-generation AI
gains steam as Jasper
gets $1.5B valuation
PITCHBOOK | AI
How BBDO is Supercharging
the Creative Process With
Generative AI
ADDWEEK | ARTIFICIAL
INTELLIGENCE
Reinventing search with a new AI-
powered Microsoft Bing and
Edge, your copilot for the web
OFFICIAL MICROSOFT BLOG
It feels natural for us to dive head first into the potential
of AI … we are building on their legacy by exploring
how humans and machines work in harmony.
DDB EXECUTIVE GLEN LOMAS
Generative AI is an evolution of people working with
machines to create content. For us, the magic occurs
when you combine human insight – and cultural insight
– with this ability to generate content with machines …
This is WPP’s role. We apply these technologies,
combine them with insight, and help our clients grow.
STEPHAN PRETORIUS
PROPRIETARY & CONFIDENTIAL 8
Stability AI is the leader
in open-source,
enterprise, generative
AI for media companies
PROPRIETARY & CONFIDENTIAL
Built and supported by some
of the best names in AI
David Ha
HEAD OF STRATEGY
& RESEARCH
Robin Rombach
CO-INVENTOR
OF STABLE DIFFUSION
Patrick Hebron
VP OF PRODUCT R&D
Stanislav Fort
SENIOR RESEARCH SCIENTIST
Emad Mostaque
CEO
Tom Mason
CTO
Former CTO at
Chorus Intelligence
Ren Ito
COO
Former CEO of Mercari Europe,
Japan’s 1st Unicorn $7B IPO
Peter O’Donoghue
CFO
Former UK Head of Technology
and Audit Partner at Deloitte
Our investors:
9
PROPRIETARY & CONFIDENTIAL
10
Infrastructure:
Ezra-1
Unlike most startups, we have a critical
strategic asset. One of the fastest
supercomputers globally is the Ezra-1
UltraCluster at a steep discount to
market value.
The combined UltraCluster has 48,000
Cores, 576 Tb RAM, 4 Pb NVMe SSD,
4,000 A100s. We have 10 Pb of high
speed FSx Lustre SSD storage that we
can scale well above 100 Pb.
PROPRIETARY & CONFIDENTIAL
Our 3 technology pillars
11
Platform API
Engineering
HPC
Data
Serving an optimised infrastructure and application
stack for our customers to use our models.
Technology pillars in a grassy field
RLHF
Applied ML
Fine-tuned Models
Custom Pipelines
Modality Teams
Research
Labelling
Foundation Models
PROPRIETARY & CONFIDENTIAL
Stable Diffusion is
redefining creativity
The silly monsters parade, 8k, hyper
details, rich colors, photograph
140K+
Stable Diffusion
community
Stable Foundation Discord
members include Artists, Beta
Testers & Developers
3B+
Images generated
since launch
125K
Reddit
subscribers
52K
Stable Diffusion
Github stars
341
Hugging Face Stable
Diffusion models
Japanese spaceship in the style
of a woodblock print
12
PROPRIETARY & CONFIDENTIAL
13
Stable Diffusion got over 50,000
GitHub stars in 150 days
360 days 1,080 2,160
# of stars on GitHub since repository was started
3,240 days
2,880
720 1,440 1,800
Transformers
Stable Diffusion
Cockroach
Ethereum
Bitcoin
Vercel
2,520
50K
PROPRIETARY
&
CONFIDENTIAL
Open source =
platform value
Stable Diffusion is the
foundation layer
40M users
2M downloads
5M monthly
traffic
2M users/
month
15M images
generated
4M users/
month
41M downloads
Brutalist architecture on top of K2
PROPRIETARY
&
CONFIDENTIAL
14
PROPRIETARY & CONFIDENTIAL
Our new content model,
SDXL, was made for
professional media use
● More expressive: 2.4B parameters (3x more
than before)
● Easier to use: less complex prompting to get
beautiful outputs
● Enhanced image composition: greater
capability to produce and position legible text
● Wider breadth & depth of available styles:
better incorporation of photorealistic and other
applicable styles
15
PROPRIETARY & CONFIDENTIAL
Stability offers the full-suite of AI
models tailored to enterprises
16
State-of-the-art models across
all media modalities
Stable Diffusion
DeepFloyd “IF” (Q1)
StableChat (Q2)
StableMusic (Q2)
Text-to-3D (Q4)
Text-to-Video (Q3)
Tailored to enterprise who care
about IP security & compliance
No saving or usage of
proprietary IP in training
Fully auditable model architecture
and dataset construction
Enterprise hosted SLAs &
support or on-premise open
source support
Interior of a spaceship
Timelines are estimates and subject to change
PROPRIETARY & CONFIDENTIAL
We’re building the default
models for every domain
Stable 3D
COMING Q4
Stable Music
COMING Q2
Stable Video
COMING Q3
PROPRIETARY
&
CONFIDENTIAL
1
7
PROPRIETARY & CONFIDENTIAL
Stability Animation Alpha-testers are already
demonstrating the model’s capabilities
18
PROPRIETARY & CONFIDENTIAL
Our newest models have
state-of-the-art
performance
DeepFloyd “IF”
COMING Q2
FID Scores
Stable Diffusion = 12.6
DALLE = 10.3
Google Imagen = 7.2
DeepFloyd IF = 6.6
(Lower is better)
19
PROPRIETARY & CONFIDENTIAL
Realistic singing or spoken
voice conversion
20
Convert between
spoken voices…
The Interior of a spaceship
Original Converted
… and singing voices.
PROPRIETARY & CONFIDENTIAL 21
We have quickly become the standard across the
media ecosystem
Photoshop
plugin
Semantic
mixing
Seamless
3D textures
All integrated with
Stable Diffusion
Enterprises can leverage Stability’s
models via native integrations with the
largest cloud providers
Short-form creative
content
https://aws.amazon.com/blogs/machine-
learning/stability-ai-builds-foundation-
models-on-amazon-sagemaker/
https://stability.ai/blog/stability-ai-makes-
its-stable-diffusion-models-available-on-
amazons-new-bedrock-service
PROPRIETARY & CONFIDENTIAL 22
We have quickly become the standard across the
media ecosystem
Photoshop
plugin
Semantic
mixing
Seamless
3D textures
All integrated with
Stable Diffusion
Stability to be the default AI on every chip
Short-form creative
content
PROPRIETARY & CONFIDENTIAL 23
Deployment: Easily fine-tune Stable Diffusion on your data
using AWS Bedrock
23
Data on the
Virtual Private
Cloud (VPC)
Bedrock
Fine-tuned
Model
Encrypted data that
does not leave the
VPC and which will
not be used to train
the original base
model
Fine-tuning our
models for the desired
task without having to
annotate large
volumes of data
Soon, AWS Bedrock will allow for an easy fine-tuning process for various use cases without worrying about data’s privacy.
Our full suite of models will also be available for training on this API in the future.
PROPRIETARY & CONFIDENTIAL 24
Customers get the benefit of AWS Sagemaker &
Stability’s models tightly integrated
Easily deploy, manage and fine-tune
Stability models at scale with optimized
infrastructure
Enterprise level SLAs with 99.95%
uptime and downtime redundancy
Dedicated expertise and proof of
concept support from Stability trained
Sagemaker specialists
PROPRIETARY & CONFIDENTIAL
Platform API Docs Site
25
platform.stability.ai
Stability SDK
● Packaged / PyPI
● T2I, I2I, Inpainting
● Variants (models / upscalers)
Typescript Client
● Helper functions
● Node.js support
Interfaces
● gRPC
● REST
PROPRIETARY & CONFIDENTIAL
Platform Interfaces / SDK
26
REST API
● Generations
● Upscaler
Python / Typescript SDK
● Examples
Discord Bots
● Demo new models (XL launched in Discord)
● Gather human feedback
Notebooks
● Gradio / Colab
● Support developers
History / Asset Service
● Asset storage S3/R2
● Persisted history
● User assets (e.g. fine-tuning)
PROPRIETARY & CONFIDENTIAL 27
Get better ideas faster with the
Stability Platform API
New concepts can be created by simply sketching a design and
pairing it with a text prompt using Stable Diffusion controllable networks
Input image
Output images
PROPRIETARY
&
CONFIDENTIAL
28
Stable
Diffusion
2.0
Upscaler
4x
PROPRIETARY
&
CONFIDENTIAL
Presets
29
● New style presets
● Available via API using preset tag
● Use a combination of additional
positive prompts and negative
prompts
● Available in DreamStudio
PROPRIETARY & CONFIDENTIAL
Fine-Tuning API
30
- Evaluation > Dreambooth LoRA
- 130 seconds pre-proc + training time!
- Support for objects (e.g animals) + styles
- Ingestion Pipeline (CLIPSeg)
- Deployment in SageMaker (training)
- Integration of API middleware
- Jobs requested through gRPC/REST
- Routed via queue
- Using SM training routines
- CloudWatch dashboard
PROPRIETARY & CONFIDENTIAL
31
PROPRIETARY & CONFIDENTIAL
Explore personalisation by leveraging your
universe of past photos to inspire the future
Collect a set of input images and use our fine-tuning API to
learn a “style” to then output similar creative concepts
Input style
Output images
32
PROPRIETARY & CONFIDENTIAL
Animation API
33
● Static image animation, video output
● Methods including 2D, 3D, 3D Warp, Video Init
● Frame interpolation
● Project storage in asset service
● Gradio Notebook
Prompt:
“A cyberpunk futuristic colourful crowded luxurious
pedestrian street avenue hi-tech at morning time,
blue neon lights, ray tracing, hdr, realistic shaded,
extremely detailed, sharp focus, soft lighting, sunny"
PROPRIETARY & CONFIDENTIAL 34
Soon, fine-tuning our animation models on specific characters and scenes will allow for
a quick animation production process, in addition to other visual effect capabilities
Our text-to-video and animation
technologies are rapidly evolving
PROPRIETARY & CONFIDENTIAL
Animation /
35
emoji 1.0 photo stickers personalized
AniMoji
PROPRIETARY & CONFIDENTIAL
Our recent competition with Peter Gabriel
#diffusetogether
36
PROPRIETARY & CONFIDENTIAL
Optimisation
37
OneFlow (Static Compilation)
● It is reasonably fast:
○ 56.98 iterations per second on A100 GPUs, over 2x faster than
Transformers.
○ Not the fastest compared to TensorRT (62.2it/s) and Paddle
(68.2it/s), but:
● Deployment-friendly, enough to justify the overhead:
○ Nice multi-resolution support through multiple graphs (see
https://github.com/Oneflow-Inc/diffusers/wiki/Optimization-for-
Multi-Resolution-Picture for details). Fewer warm-up instances
needed for multi-resolution and less middleware complexity
○ Fast compilation: a couple of seconds vs. 10+ mins.
○ Can dynamically load weights, nice for custom models.
○ Easy to modify network; sth like ControlNet or loading Lora
weights are not huge hassles (unlike AIT and TensorRT,
basically impossible without re-compilation).
○ Mocking torch environment, low maintenance once deployed.
○ Rapid ongoing development.
Upcoming
Frameworks
Multiple targets / control planes
PROPRIETARY & CONFIDENTIAL
Integrations
38
PROPRIETARY & CONFIDENTIAL
DreamStudio
39
● Framework
○ Migration to React
○ New UI/UX
● Generation interface
● Editor
● Presets
● Back-End
○ History
○ Asset Service
PROPRIETARY & CONFIDENTIAL
Multiple Generations Styles
40
PROPRIETARY & CONFIDENTIAL
Variations
Infinite History
41
Stable Diffusion 2.0 Depth to Image
TRANSFORM IMAGES DYNAMICALLY
Depth to Image
TRANSFORM IMAGES DYNAMICALLY
Stable Diffusion Inpainting
CHANGE IMAGE DETAILS
Stable Diffusion v2.1: “Professional photograph of Game of Thrones as a Japanese drama…”
Stable Diffusion v2.1: “Professional photograph of Game of Thrones as a Japanese drama…”
PROPRIETARY & CONFIDENTIAL
Stability’s content models & surrounding infrastructure will
soon enable full control of creative outputs
47
Content models with enough flexibility to get exact outputs while still
providing creative exploration
Create custom styles &
personalized content models
based on private assets with
Stability’s fine-tuning
infrastructure
Leverage SDXL variants
optimized for specific use-cases
& stylistic outputs e.g. cinematic,
Decide which parts of images to
keep constant and which to
change along a variety of
dimensions (depth, pose,
boundary, etc.) through custom
SDXL adapters
Allow users to modify images
using natural language with
Stability Instruct models
PROPRIETARY & CONFIDENTIAL 48
Tap into global markets automatically with
soon to be released fully controllable models
Output images
International
cultural themes
Today
Use a base design and
pass into Stable Diffusion
img2img for quick
concepting and then
finalize in software of
choice
Q2
Use custom T2I adapters
with our latest model
(Stable Diffusion XL) to
make fine-grained
adjustments with multiple
control types
Input image
PROPRIETARY & CONFIDENTIAL 49
Sample POCs that can be executed in 2023
Film & TV Creative
Concepting
Get to better ideas faster by concepting thousands of creative ideas
in minutes and leveraging your past universe of creative work
H1 2023
Advertising Collateral
& Thumbnails
Save time & money on marketing by leveraging production
content to automatically generate first-pass marketing
collateral & thumbnails
H1 2023
Script & Storyline
Assistance
Get feedback and ideas on scripts with an evision
specific large language model
H2 2023
Post Video Production
Augmentation
Remove the need for extra shoots by automating post-
production touch-ups
H2 2023
Instant International
Dubbing
Reach new audience segments by allowing any content to be
instantly dubbed in another language
H2 2023
Timelines are estimates and subject to change
PROPRIETARY & CONFIDENTIAL 50
Minimize re-screens with post-production scene
augmentation
Empower editors to experiment across all
content dimensions
Scenes & Props (SDXL & Controllable Networks)
Dialogue, Sounds & Interactions (StableVoice & StableMusic)
1. Fine-tune a custom Stable Diffusion model on a chosen
repository of production assets
2. Enable members of the post production team to issue
natural language commands on a subset of frames to
test changes such as scene and prop alterations
3. Utilize these changes directly in production or to more
efficiently target re-screens
Streamline post production edits with natural-
language based editing
Lush jungle background
Urban town scene
PROPRIETARY & CONFIDENTIAL 51
The next generation of film, tv, animation & music,
will be redefined by Generative AI
Dynamic & Interactive Content
Personalized to Consumers
● Real-time adaptation of movies & shows with characters, scenes
and whole storylines generated on-the-fly.
● Seamless dubbing and content translation to facilitate global
accessibility and engagement.
● Hyper-personalized music & voice mixing to create the perfect
composition.
Massive Leverage to Creators who have
the Best Visions
● Large-scale ideation enabled across all building blocks of movie
/ show development.
● Democratized access to easy-to-use tooling to go from concept
to high-quality content.
● Novel music and voices generated via a combination of text and
existing tooling.
51
“Create a Jumanji themed world for my 5 years old son”
PROPRIETARY & CONFIDENTIAL 52
52
The content production & development pipeline in the
Generative AI Era will be rapid and efficient
Content production Character development Scene creation Audio effects and dubbing
Fine-tune StableLM on relevant
stories and characters for daily
content production
Fine-tune Stable Diffusion on your
characters for further
development
Create stunning videos and
animations centered on chosen
characters and scenes
Fast and low-cost dubbing using
our audio models
PROPRIETARY & CONFIDENTIAL 53
53
Create the perfect character using multiple variations
A Princess
An Arabian Princess
Add a desert
background
The chosen character
PROPRIETARY & CONFIDENTIAL
Direct the model to elaborate and develop the
content
Deploy the customized model to help create
characters and plots based on given themes
and genres
StableLM will be used for script writing, content creation,
and character development
54
As a child, Princess Farah’s father, High-King of the Rub’ Al
Khali’s seven Emirates, died under mysterious circumstances.
The king had previously banned his brother Oday from ever being
a rightful heir due to the latter’s crimes and corruption in the
kingdom. However, Oday took the chance and declared himself
king, banishing Princess Farah to the deep valleys of a distant
land.
Help me create a story of a female Arabian
princess. Main themes should be family and
unity
Give me a general outline of how the plot will
look like
An old sage learns of Farah’s identity from the King’s Mark on her
right palm. He then tells her the prophecy of a princess who
retrieves the lost Staff of Unity to reunite a kingdom in times of
deep turmoil. However, in the process, the princess sacrifices the
most precious gift of love to save the kingdom. Farah then makes
it her life’s goal to fulfill her destiny.
Fine-tune StableLM on various characters
and plots
55
Instant access to interactive, human-like
characters is becoming widely accessible
Large language models provide on-demand
conversation & action simulation
● Accessible: Easily interactive as if talking to
another person
● Personalized: Provide context-dependant
answers & reasoning
● Knowledgeable: Understand storylines, world-
building & fundamental subjects
● Customizable: Able to leverage external tools
and knowledge bases to take on new functions
& personalities
55
“Come on, John. Let’s get out of here”
Follow her Go back
PROPRIETARY & CONFIDENTIAL 56
Supercharge creative abilities and keep
IP safe with multiple ways to partner
Be among the first to know and
integrate new models
• Early access to new foundation
models and model updates.
• Dedicated 1x1 “Ask me
anything” each month.
• Shared Stability Slack channel
for asynchronous assistance.
Access enterprise-grade, hosted
APIs of Stability AI’s models.
• Model and fine-tuning APIs.
• Enterprise support & SLAs
via Amazon Sagemaker.
• Usage-based, tiered
pricing.
• No research usage of
proprietary data.
Leverage Stability’s AI engineers
to build custom models based on
Meitu’s unique assets
• Meitu specific styles and
models created based on past
assets. Both design and
language.
• Pipelines setup to enable
continual updates and
modifications.
• Hosted on prem or via
Sagemaker.
Stability Hosted
API
Preferred Access
Program
Custom
Models
Rocket blasting into space
The future of creativity
is already here
PROPRIETARY & CONFIDENTIAL 58
Integrate generative AI correctly, efficiently & safely with
Stability’s three-pronged approach
58
Create the appropriate POC creation &
evaluation pipeline for integration
• Dataset aggregation and labeling
pipeline is created for any custom
model work needed. All models hosted
on-prem or via AWS Sagemaker.
• POC sandbox for evaluation is setup
for frequent Stability & evision review
sessions on POC progress and needed
adjustments.
Integrate POC throughout relevant orgs
in a phased approach while restarting the
process for the next POC
• evision and Stability work together to
conduct a secure, phased roll-out with
the appropriate quality assurance &
reporting procedures.
• Scoping starts on the next set of
workflows & experiences to tackle.
• Frequent cadence of research and
engineering previews to keep you up to
date on what’s coming.
POC Creation & Evaluation Integration & Deployment
Find the right use-case & implementation
strategy for today while planning for
tomorrow
Scoping
• Audit evision’s internal processes and
internal data to determine feasible &
quick high ROI tasks to triage.
• POCs mutually agreed upon based on
a combination of ROI, speed of testing,
ease of integration / deployment,
dataset availability and technological
maturity.
1 2 3

More Related Content

What's hot

Generative AI con Amazon Bedrock.pdf
Generative AI con Amazon Bedrock.pdfGenerative AI con Amazon Bedrock.pdf
Generative AI con Amazon Bedrock.pdfGuido Maria Nebiolo
 
AI Toolkit for Educators
AI Toolkit for EducatorsAI Toolkit for Educators
AI Toolkit for EducatorsInge de Waard
 
Google Cloud GenAI Overview_071223.pptx
Google Cloud GenAI Overview_071223.pptxGoogle Cloud GenAI Overview_071223.pptx
Google Cloud GenAI Overview_071223.pptxVishPothapu
 
Prompt Engineering.pptx
Prompt Engineering.pptxPrompt Engineering.pptx
Prompt Engineering.pptxahmedmishfaq
 
An Introduction to Generative AI - May 18, 2023
An Introduction  to Generative AI - May 18, 2023An Introduction  to Generative AI - May 18, 2023
An Introduction to Generative AI - May 18, 2023CoriFaklaris1
 
Large Language Models Bootcamp
Large Language Models BootcampLarge Language Models Bootcamp
Large Language Models BootcampData Science Dojo
 
Generative AI at the edge.pdf
Generative AI at the edge.pdfGenerative AI at the edge.pdf
Generative AI at the edge.pdfQualcomm Research
 
MLOps and Reproducible ML on AWS with Kubeflow and SageMaker
MLOps and Reproducible ML on AWS with Kubeflow and SageMakerMLOps and Reproducible ML on AWS with Kubeflow and SageMaker
MLOps and Reproducible ML on AWS with Kubeflow and SageMakerProvectus
 
leewayhertz.com-The architecture of Generative AI for enterprises.pdf
leewayhertz.com-The architecture of Generative AI for enterprises.pdfleewayhertz.com-The architecture of Generative AI for enterprises.pdf
leewayhertz.com-The architecture of Generative AI for enterprises.pdfKristiLBurns
 
AI in Finance: Moving forward!
AI in Finance: Moving forward!AI in Finance: Moving forward!
AI in Finance: Moving forward!Adrian Hornsby
 
MLOps and Data Quality: Deploying Reliable ML Models in Production
MLOps and Data Quality: Deploying Reliable ML Models in ProductionMLOps and Data Quality: Deploying Reliable ML Models in Production
MLOps and Data Quality: Deploying Reliable ML Models in ProductionProvectus
 
Being Well-Architected in the Cloud
Being Well-Architected in the CloudBeing Well-Architected in the Cloud
Being Well-Architected in the CloudAmazon Web Services
 
Prompt Engineering by Dr. Naveed.pdf
Prompt Engineering by Dr. Naveed.pdfPrompt Engineering by Dr. Naveed.pdf
Prompt Engineering by Dr. Naveed.pdfNaveed Ahmed Siddiqui
 
Embracing AI for student and staff productivity.pptx
Embracing AI for student and staff productivity.pptxEmbracing AI for student and staff productivity.pptx
Embracing AI for student and staff productivity.pptxCharles Darwin University
 
The Future of AI is Generative not Discriminative 5/26/2021
The Future of AI is Generative not Discriminative 5/26/2021The Future of AI is Generative not Discriminative 5/26/2021
The Future of AI is Generative not Discriminative 5/26/2021Steve Omohundro
 
[DSC DACH 23] ChatGPT and Beyond: How generative AI is Changing the way peopl...
[DSC DACH 23] ChatGPT and Beyond: How generative AI is Changing the way peopl...[DSC DACH 23] ChatGPT and Beyond: How generative AI is Changing the way peopl...
[DSC DACH 23] ChatGPT and Beyond: How generative AI is Changing the way peopl...DataScienceConferenc1
 

What's hot (20)

Generative AI con Amazon Bedrock.pdf
Generative AI con Amazon Bedrock.pdfGenerative AI con Amazon Bedrock.pdf
Generative AI con Amazon Bedrock.pdf
 
AI Toolkit for Educators
AI Toolkit for EducatorsAI Toolkit for Educators
AI Toolkit for Educators
 
introduction Azure OpenAI by Usama wahab khan
introduction  Azure OpenAI by Usama wahab khanintroduction  Azure OpenAI by Usama wahab khan
introduction Azure OpenAI by Usama wahab khan
 
Google Cloud GenAI Overview_071223.pptx
Google Cloud GenAI Overview_071223.pptxGoogle Cloud GenAI Overview_071223.pptx
Google Cloud GenAI Overview_071223.pptx
 
Prompt Engineering.pptx
Prompt Engineering.pptxPrompt Engineering.pptx
Prompt Engineering.pptx
 
An Introduction to Generative AI - May 18, 2023
An Introduction  to Generative AI - May 18, 2023An Introduction  to Generative AI - May 18, 2023
An Introduction to Generative AI - May 18, 2023
 
Large Language Models Bootcamp
Large Language Models BootcampLarge Language Models Bootcamp
Large Language Models Bootcamp
 
Generative AI at the edge.pdf
Generative AI at the edge.pdfGenerative AI at the edge.pdf
Generative AI at the edge.pdf
 
MLOps and Reproducible ML on AWS with Kubeflow and SageMaker
MLOps and Reproducible ML on AWS with Kubeflow and SageMakerMLOps and Reproducible ML on AWS with Kubeflow and SageMaker
MLOps and Reproducible ML on AWS with Kubeflow and SageMaker
 
leewayhertz.com-The architecture of Generative AI for enterprises.pdf
leewayhertz.com-The architecture of Generative AI for enterprises.pdfleewayhertz.com-The architecture of Generative AI for enterprises.pdf
leewayhertz.com-The architecture of Generative AI for enterprises.pdf
 
AI in Finance: Moving forward!
AI in Finance: Moving forward!AI in Finance: Moving forward!
AI in Finance: Moving forward!
 
MLOps and Data Quality: Deploying Reliable ML Models in Production
MLOps and Data Quality: Deploying Reliable ML Models in ProductionMLOps and Data Quality: Deploying Reliable ML Models in Production
MLOps and Data Quality: Deploying Reliable ML Models in Production
 
Being Well-Architected in the Cloud
Being Well-Architected in the CloudBeing Well-Architected in the Cloud
Being Well-Architected in the Cloud
 
MLOps in action
MLOps in actionMLOps in action
MLOps in action
 
Prompt Engineering by Dr. Naveed.pdf
Prompt Engineering by Dr. Naveed.pdfPrompt Engineering by Dr. Naveed.pdf
Prompt Engineering by Dr. Naveed.pdf
 
Journey of Generative AI
Journey of Generative AIJourney of Generative AI
Journey of Generative AI
 
Embracing AI for student and staff productivity.pptx
Embracing AI for student and staff productivity.pptxEmbracing AI for student and staff productivity.pptx
Embracing AI for student and staff productivity.pptx
 
The Future of AI is Generative not Discriminative 5/26/2021
The Future of AI is Generative not Discriminative 5/26/2021The Future of AI is Generative not Discriminative 5/26/2021
The Future of AI is Generative not Discriminative 5/26/2021
 
Athena & Glue
Athena & GlueAthena & Glue
Athena & Glue
 
[DSC DACH 23] ChatGPT and Beyond: How generative AI is Changing the way peopl...
[DSC DACH 23] ChatGPT and Beyond: How generative AI is Changing the way peopl...[DSC DACH 23] ChatGPT and Beyond: How generative AI is Changing the way peopl...
[DSC DACH 23] ChatGPT and Beyond: How generative AI is Changing the way peopl...
 

Similar to Tom Mason (Stability AI) - Computing Large Foundational Models Unlisted

The Modern Tech Stack: Microservices - The Dark Side
The Modern Tech Stack: Microservices - The Dark SideThe Modern Tech Stack: Microservices - The Dark Side
The Modern Tech Stack: Microservices - The Dark SideAggregage
 
MongoDB World 2019: Gaining ML Insight with Google Vision API and MongoDB
MongoDB World 2019: Gaining ML Insight with Google Vision API and MongoDBMongoDB World 2019: Gaining ML Insight with Google Vision API and MongoDB
MongoDB World 2019: Gaining ML Insight with Google Vision API and MongoDBMongoDB
 
Seattle Cassandra Users: An OSS Java Abstraction Layer for Cassandra
Seattle Cassandra Users: An OSS Java Abstraction Layer for CassandraSeattle Cassandra Users: An OSS Java Abstraction Layer for Cassandra
Seattle Cassandra Users: An OSS Java Abstraction Layer for CassandraJosh Turner
 
Bahrain ch9 introduction to docker 5th birthday
Bahrain ch9 introduction to docker 5th birthday Bahrain ch9 introduction to docker 5th birthday
Bahrain ch9 introduction to docker 5th birthday Walid Shaari
 
Welcome to Hybrid Cloud Innovation Tour 2016
Welcome to Hybrid Cloud Innovation Tour 2016Welcome to Hybrid Cloud Innovation Tour 2016
Welcome to Hybrid Cloud Innovation Tour 2016LaurenWendler
 
AI and Innovations on AWS
AI and Innovations on AWSAI and Innovations on AWS
AI and Innovations on AWSAdrian Hornsby
 
BUILD with Microsoft - Radu Stefan
 BUILD with Microsoft - Radu Stefan BUILD with Microsoft - Radu Stefan
BUILD with Microsoft - Radu StefanITCamp
 
Synctree Capabilties Deck
Synctree Capabilties DeckSynctree Capabilties Deck
Synctree Capabilties DeckPhoebe B. Scott
 
re:cap Generative AI journey with Bedrock
re:cap Generative AI journey  with Bedrockre:cap Generative AI journey  with Bedrock
re:cap Generative AI journey with BedrockPhilipBasford
 
AI Solutions with Macnica.ai - AI Expo 2018 Tokyo Japan
AI Solutions with Macnica.ai - AI Expo 2018 Tokyo JapanAI Solutions with Macnica.ai - AI Expo 2018 Tokyo Japan
AI Solutions with Macnica.ai - AI Expo 2018 Tokyo JapanAvkash Chauhan
 
Gen AI Cognizant & AWS event presentation_12 Oct.pdf
Gen AI Cognizant & AWS event presentation_12 Oct.pdfGen AI Cognizant & AWS event presentation_12 Oct.pdf
Gen AI Cognizant & AWS event presentation_12 Oct.pdfPhilipBasford
 
Docker Orchestration: Welcome to the Jungle! JavaOne 2015
Docker Orchestration: Welcome to the Jungle! JavaOne 2015Docker Orchestration: Welcome to the Jungle! JavaOne 2015
Docker Orchestration: Welcome to the Jungle! JavaOne 2015Patrick Chanezon
 
Designing Your Best Architectural Diagrams
Designing Your Best Architectural DiagramsDesigning Your Best Architectural Diagrams
Designing Your Best Architectural DiagramsEric D. Schabell
 
Google Cloud: Data Analysis and Machine Learningn Technologies
Google Cloud: Data Analysis and Machine Learningn Technologies Google Cloud: Data Analysis and Machine Learningn Technologies
Google Cloud: Data Analysis and Machine Learningn Technologies Andrés Leonardo Martinez Ortiz
 
End-to-End Big Data AI with Analytics Zoo
End-to-End Big Data AI with Analytics ZooEnd-to-End Big Data AI with Analytics Zoo
End-to-End Big Data AI with Analytics ZooJason Dai
 
201705 neoteric software development intro
201705 neoteric software development intro201705 neoteric software development intro
201705 neoteric software development introMatt Kurleto
 
Google cloud Study Jam 2023.pptx
Google cloud Study Jam 2023.pptxGoogle cloud Study Jam 2023.pptx
Google cloud Study Jam 2023.pptxGDSCNiT
 
IBC 2010 press conference
IBC 2010 press conferenceIBC 2010 press conference
IBC 2010 press conferenceQuantel
 
Critical Breakthroughs and Challenges in Big Data and Analytics
Critical Breakthroughs and Challenges in Big Data and AnalyticsCritical Breakthroughs and Challenges in Big Data and Analytics
Critical Breakthroughs and Challenges in Big Data and AnalyticsData Driven Innovation
 

Similar to Tom Mason (Stability AI) - Computing Large Foundational Models Unlisted (20)

The Modern Tech Stack: Microservices - The Dark Side
The Modern Tech Stack: Microservices - The Dark SideThe Modern Tech Stack: Microservices - The Dark Side
The Modern Tech Stack: Microservices - The Dark Side
 
MongoDB World 2019: Gaining ML Insight with Google Vision API and MongoDB
MongoDB World 2019: Gaining ML Insight with Google Vision API and MongoDBMongoDB World 2019: Gaining ML Insight with Google Vision API and MongoDB
MongoDB World 2019: Gaining ML Insight with Google Vision API and MongoDB
 
Seattle Cassandra Users: An OSS Java Abstraction Layer for Cassandra
Seattle Cassandra Users: An OSS Java Abstraction Layer for CassandraSeattle Cassandra Users: An OSS Java Abstraction Layer for Cassandra
Seattle Cassandra Users: An OSS Java Abstraction Layer for Cassandra
 
Bahrain ch9 introduction to docker 5th birthday
Bahrain ch9 introduction to docker 5th birthday Bahrain ch9 introduction to docker 5th birthday
Bahrain ch9 introduction to docker 5th birthday
 
Welcome to Hybrid Cloud Innovation Tour 2016
Welcome to Hybrid Cloud Innovation Tour 2016Welcome to Hybrid Cloud Innovation Tour 2016
Welcome to Hybrid Cloud Innovation Tour 2016
 
AI and Innovations on AWS
AI and Innovations on AWSAI and Innovations on AWS
AI and Innovations on AWS
 
BUILD with Microsoft - Radu Stefan
 BUILD with Microsoft - Radu Stefan BUILD with Microsoft - Radu Stefan
BUILD with Microsoft - Radu Stefan
 
Synctree Capabilties Deck
Synctree Capabilties DeckSynctree Capabilties Deck
Synctree Capabilties Deck
 
re:cap Generative AI journey with Bedrock
re:cap Generative AI journey  with Bedrockre:cap Generative AI journey  with Bedrock
re:cap Generative AI journey with Bedrock
 
AI Solutions with Macnica.ai - AI Expo 2018 Tokyo Japan
AI Solutions with Macnica.ai - AI Expo 2018 Tokyo JapanAI Solutions with Macnica.ai - AI Expo 2018 Tokyo Japan
AI Solutions with Macnica.ai - AI Expo 2018 Tokyo Japan
 
Gen AI Cognizant & AWS event presentation_12 Oct.pdf
Gen AI Cognizant & AWS event presentation_12 Oct.pdfGen AI Cognizant & AWS event presentation_12 Oct.pdf
Gen AI Cognizant & AWS event presentation_12 Oct.pdf
 
Docker Orchestration: Welcome to the Jungle! JavaOne 2015
Docker Orchestration: Welcome to the Jungle! JavaOne 2015Docker Orchestration: Welcome to the Jungle! JavaOne 2015
Docker Orchestration: Welcome to the Jungle! JavaOne 2015
 
Designing Your Best Architectural Diagrams
Designing Your Best Architectural DiagramsDesigning Your Best Architectural Diagrams
Designing Your Best Architectural Diagrams
 
AI as a service
AI as a serviceAI as a service
AI as a service
 
Google Cloud: Data Analysis and Machine Learningn Technologies
Google Cloud: Data Analysis and Machine Learningn Technologies Google Cloud: Data Analysis and Machine Learningn Technologies
Google Cloud: Data Analysis and Machine Learningn Technologies
 
End-to-End Big Data AI with Analytics Zoo
End-to-End Big Data AI with Analytics ZooEnd-to-End Big Data AI with Analytics Zoo
End-to-End Big Data AI with Analytics Zoo
 
201705 neoteric software development intro
201705 neoteric software development intro201705 neoteric software development intro
201705 neoteric software development intro
 
Google cloud Study Jam 2023.pptx
Google cloud Study Jam 2023.pptxGoogle cloud Study Jam 2023.pptx
Google cloud Study Jam 2023.pptx
 
IBC 2010 press conference
IBC 2010 press conferenceIBC 2010 press conference
IBC 2010 press conference
 
Critical Breakthroughs and Challenges in Big Data and Analytics
Critical Breakthroughs and Challenges in Big Data and AnalyticsCritical Breakthroughs and Challenges in Big Data and Analytics
Critical Breakthroughs and Challenges in Big Data and Analytics
 

More from Techsylvania

Sergiu Biris (MultiversX) - Blurring the Lines Between Web 2.0 and Web 3.0
Sergiu Biris (MultiversX) - Blurring the Lines Between Web 2.0 and Web 3.0Sergiu Biris (MultiversX) - Blurring the Lines Between Web 2.0 and Web 3.0
Sergiu Biris (MultiversX) - Blurring the Lines Between Web 2.0 and Web 3.0Techsylvania
 
Conversation w/ Tijana Kovacevic (Happening) - Keeping Your Startup Heart Whi...
Conversation w/ Tijana Kovacevic (Happening) - Keeping Your Startup Heart Whi...Conversation w/ Tijana Kovacevic (Happening) - Keeping Your Startup Heart Whi...
Conversation w/ Tijana Kovacevic (Happening) - Keeping Your Startup Heart Whi...Techsylvania
 
Aarik Mudgal (METRO.Digital) - How to Implement DDoS Protection in GCP
Aarik Mudgal (METRO.Digital) - How to Implement DDoS Protection in GCPAarik Mudgal (METRO.Digital) - How to Implement DDoS Protection in GCP
Aarik Mudgal (METRO.Digital) - How to Implement DDoS Protection in GCPTechsylvania
 
Tudor Mafteianu (Blu Capital Partners) - What Drives the Value of Your Business?
Tudor Mafteianu (Blu Capital Partners) - What Drives the Value of Your Business?Tudor Mafteianu (Blu Capital Partners) - What Drives the Value of Your Business?
Tudor Mafteianu (Blu Capital Partners) - What Drives the Value of Your Business?Techsylvania
 
Andrew O’Neal (Clearbit) - Scaling a Product Vision Through World-Class Team ...
Andrew O’Neal (Clearbit) - Scaling a Product Vision Through World-Class Team ...Andrew O’Neal (Clearbit) - Scaling a Product Vision Through World-Class Team ...
Andrew O’Neal (Clearbit) - Scaling a Product Vision Through World-Class Team ...Techsylvania
 
Andrew Davies (Paddle) - From Zero to $350m Revenue: Finding and Scaling Your...
Andrew Davies (Paddle) - From Zero to $350m Revenue: Finding and Scaling Your...Andrew Davies (Paddle) - From Zero to $350m Revenue: Finding and Scaling Your...
Andrew Davies (Paddle) - From Zero to $350m Revenue: Finding and Scaling Your...Techsylvania
 
Jonathan Oakes (Google) - Powering Health and Fitness Products
Jonathan Oakes (Google) - Powering Health and Fitness ProductsJonathan Oakes (Google) - Powering Health and Fitness Products
Jonathan Oakes (Google) - Powering Health and Fitness ProductsTechsylvania
 
Yossi Matias (Google) - Driving Societal Change Through AI Innovation
Yossi Matias (Google) - Driving Societal Change Through AI InnovationYossi Matias (Google) - Driving Societal Change Through AI Innovation
Yossi Matias (Google) - Driving Societal Change Through AI InnovationTechsylvania
 
Angus Keck (AgUnity) - Mastering Determination, Adaptability, and Storytellin...
Angus Keck (AgUnity) - Mastering Determination, Adaptability, and Storytellin...Angus Keck (AgUnity) - Mastering Determination, Adaptability, and Storytellin...
Angus Keck (AgUnity) - Mastering Determination, Adaptability, and Storytellin...Techsylvania
 
Efi Dahan (PayPal) - From Local to Global: Tips and Trends to Scale Your Busi...
Efi Dahan (PayPal) - From Local to Global: Tips and Trends to Scale Your Busi...Efi Dahan (PayPal) - From Local to Global: Tips and Trends to Scale Your Busi...
Efi Dahan (PayPal) - From Local to Global: Tips and Trends to Scale Your Busi...Techsylvania
 
Amy Varney (Systemiq Capital) - Has Climate Tech Graduated?
Amy Varney (Systemiq Capital) - Has Climate Tech Graduated?Amy Varney (Systemiq Capital) - Has Climate Tech Graduated?
Amy Varney (Systemiq Capital) - Has Climate Tech Graduated?Techsylvania
 
Nima Banai - Vision to Product: Product Design, Development, and Manufacturin...
Nima Banai - Vision to Product: Product Design, Development, and Manufacturin...Nima Banai - Vision to Product: Product Design, Development, and Manufacturin...
Nima Banai - Vision to Product: Product Design, Development, and Manufacturin...Techsylvania
 
Chris Leacock aka Jillionaire - Embracing Diversity Through Interdisciplinary...
Chris Leacock aka Jillionaire - Embracing Diversity Through Interdisciplinary...Chris Leacock aka Jillionaire - Embracing Diversity Through Interdisciplinary...
Chris Leacock aka Jillionaire - Embracing Diversity Through Interdisciplinary...Techsylvania
 
Emil Boc (Mayor of Cluj-Napoca) - Opening Remarks Day 1
Emil Boc (Mayor of Cluj-Napoca) - Opening Remarks Day 1Emil Boc (Mayor of Cluj-Napoca) - Opening Remarks Day 1
Emil Boc (Mayor of Cluj-Napoca) - Opening Remarks Day 1Techsylvania
 
Patrick Poels (Snyk) - The 3 Key Rules of Building Globally Distributed Teams
Patrick Poels (Snyk) - The 3 Key Rules of Building Globally Distributed TeamsPatrick Poels (Snyk) - The 3 Key Rules of Building Globally Distributed Teams
Patrick Poels (Snyk) - The 3 Key Rules of Building Globally Distributed TeamsTechsylvania
 
Eduard Varvara (Barings, MassMutual) - How Barings is Shaping a Culture of In...
Eduard Varvara (Barings, MassMutual) - How Barings is Shaping a Culture of In...Eduard Varvara (Barings, MassMutual) - How Barings is Shaping a Culture of In...
Eduard Varvara (Barings, MassMutual) - How Barings is Shaping a Culture of In...Techsylvania
 
Cristina Morariu (MassMutual Romania) -How Is Technology & Innovation Shaping...
Cristina Morariu (MassMutual Romania) -How Is Technology & Innovation Shaping...Cristina Morariu (MassMutual Romania) -How Is Technology & Innovation Shaping...
Cristina Morariu (MassMutual Romania) -How Is Technology & Innovation Shaping...Techsylvania
 
Marie Astrid Molina (Scaleway), How to Design for a Product You Understand No...
Marie Astrid Molina (Scaleway), How to Design for a Product You Understand No...Marie Astrid Molina (Scaleway), How to Design for a Product You Understand No...
Marie Astrid Molina (Scaleway), How to Design for a Product You Understand No...Techsylvania
 
Julie Xu (Carta) - Designing a product experience vision at scale
Julie Xu (Carta) - Designing a product experience vision at scaleJulie Xu (Carta) - Designing a product experience vision at scale
Julie Xu (Carta) - Designing a product experience vision at scaleTechsylvania
 
Pavlo Pedenko (Wise) - Product Mindset in Fundraising for Charities_ $5M in 3...
Pavlo Pedenko (Wise) - Product Mindset in Fundraising for Charities_ $5M in 3...Pavlo Pedenko (Wise) - Product Mindset in Fundraising for Charities_ $5M in 3...
Pavlo Pedenko (Wise) - Product Mindset in Fundraising for Charities_ $5M in 3...Techsylvania
 

More from Techsylvania (20)

Sergiu Biris (MultiversX) - Blurring the Lines Between Web 2.0 and Web 3.0
Sergiu Biris (MultiversX) - Blurring the Lines Between Web 2.0 and Web 3.0Sergiu Biris (MultiversX) - Blurring the Lines Between Web 2.0 and Web 3.0
Sergiu Biris (MultiversX) - Blurring the Lines Between Web 2.0 and Web 3.0
 
Conversation w/ Tijana Kovacevic (Happening) - Keeping Your Startup Heart Whi...
Conversation w/ Tijana Kovacevic (Happening) - Keeping Your Startup Heart Whi...Conversation w/ Tijana Kovacevic (Happening) - Keeping Your Startup Heart Whi...
Conversation w/ Tijana Kovacevic (Happening) - Keeping Your Startup Heart Whi...
 
Aarik Mudgal (METRO.Digital) - How to Implement DDoS Protection in GCP
Aarik Mudgal (METRO.Digital) - How to Implement DDoS Protection in GCPAarik Mudgal (METRO.Digital) - How to Implement DDoS Protection in GCP
Aarik Mudgal (METRO.Digital) - How to Implement DDoS Protection in GCP
 
Tudor Mafteianu (Blu Capital Partners) - What Drives the Value of Your Business?
Tudor Mafteianu (Blu Capital Partners) - What Drives the Value of Your Business?Tudor Mafteianu (Blu Capital Partners) - What Drives the Value of Your Business?
Tudor Mafteianu (Blu Capital Partners) - What Drives the Value of Your Business?
 
Andrew O’Neal (Clearbit) - Scaling a Product Vision Through World-Class Team ...
Andrew O’Neal (Clearbit) - Scaling a Product Vision Through World-Class Team ...Andrew O’Neal (Clearbit) - Scaling a Product Vision Through World-Class Team ...
Andrew O’Neal (Clearbit) - Scaling a Product Vision Through World-Class Team ...
 
Andrew Davies (Paddle) - From Zero to $350m Revenue: Finding and Scaling Your...
Andrew Davies (Paddle) - From Zero to $350m Revenue: Finding and Scaling Your...Andrew Davies (Paddle) - From Zero to $350m Revenue: Finding and Scaling Your...
Andrew Davies (Paddle) - From Zero to $350m Revenue: Finding and Scaling Your...
 
Jonathan Oakes (Google) - Powering Health and Fitness Products
Jonathan Oakes (Google) - Powering Health and Fitness ProductsJonathan Oakes (Google) - Powering Health and Fitness Products
Jonathan Oakes (Google) - Powering Health and Fitness Products
 
Yossi Matias (Google) - Driving Societal Change Through AI Innovation
Yossi Matias (Google) - Driving Societal Change Through AI InnovationYossi Matias (Google) - Driving Societal Change Through AI Innovation
Yossi Matias (Google) - Driving Societal Change Through AI Innovation
 
Angus Keck (AgUnity) - Mastering Determination, Adaptability, and Storytellin...
Angus Keck (AgUnity) - Mastering Determination, Adaptability, and Storytellin...Angus Keck (AgUnity) - Mastering Determination, Adaptability, and Storytellin...
Angus Keck (AgUnity) - Mastering Determination, Adaptability, and Storytellin...
 
Efi Dahan (PayPal) - From Local to Global: Tips and Trends to Scale Your Busi...
Efi Dahan (PayPal) - From Local to Global: Tips and Trends to Scale Your Busi...Efi Dahan (PayPal) - From Local to Global: Tips and Trends to Scale Your Busi...
Efi Dahan (PayPal) - From Local to Global: Tips and Trends to Scale Your Busi...
 
Amy Varney (Systemiq Capital) - Has Climate Tech Graduated?
Amy Varney (Systemiq Capital) - Has Climate Tech Graduated?Amy Varney (Systemiq Capital) - Has Climate Tech Graduated?
Amy Varney (Systemiq Capital) - Has Climate Tech Graduated?
 
Nima Banai - Vision to Product: Product Design, Development, and Manufacturin...
Nima Banai - Vision to Product: Product Design, Development, and Manufacturin...Nima Banai - Vision to Product: Product Design, Development, and Manufacturin...
Nima Banai - Vision to Product: Product Design, Development, and Manufacturin...
 
Chris Leacock aka Jillionaire - Embracing Diversity Through Interdisciplinary...
Chris Leacock aka Jillionaire - Embracing Diversity Through Interdisciplinary...Chris Leacock aka Jillionaire - Embracing Diversity Through Interdisciplinary...
Chris Leacock aka Jillionaire - Embracing Diversity Through Interdisciplinary...
 
Emil Boc (Mayor of Cluj-Napoca) - Opening Remarks Day 1
Emil Boc (Mayor of Cluj-Napoca) - Opening Remarks Day 1Emil Boc (Mayor of Cluj-Napoca) - Opening Remarks Day 1
Emil Boc (Mayor of Cluj-Napoca) - Opening Remarks Day 1
 
Patrick Poels (Snyk) - The 3 Key Rules of Building Globally Distributed Teams
Patrick Poels (Snyk) - The 3 Key Rules of Building Globally Distributed TeamsPatrick Poels (Snyk) - The 3 Key Rules of Building Globally Distributed Teams
Patrick Poels (Snyk) - The 3 Key Rules of Building Globally Distributed Teams
 
Eduard Varvara (Barings, MassMutual) - How Barings is Shaping a Culture of In...
Eduard Varvara (Barings, MassMutual) - How Barings is Shaping a Culture of In...Eduard Varvara (Barings, MassMutual) - How Barings is Shaping a Culture of In...
Eduard Varvara (Barings, MassMutual) - How Barings is Shaping a Culture of In...
 
Cristina Morariu (MassMutual Romania) -How Is Technology & Innovation Shaping...
Cristina Morariu (MassMutual Romania) -How Is Technology & Innovation Shaping...Cristina Morariu (MassMutual Romania) -How Is Technology & Innovation Shaping...
Cristina Morariu (MassMutual Romania) -How Is Technology & Innovation Shaping...
 
Marie Astrid Molina (Scaleway), How to Design for a Product You Understand No...
Marie Astrid Molina (Scaleway), How to Design for a Product You Understand No...Marie Astrid Molina (Scaleway), How to Design for a Product You Understand No...
Marie Astrid Molina (Scaleway), How to Design for a Product You Understand No...
 
Julie Xu (Carta) - Designing a product experience vision at scale
Julie Xu (Carta) - Designing a product experience vision at scaleJulie Xu (Carta) - Designing a product experience vision at scale
Julie Xu (Carta) - Designing a product experience vision at scale
 
Pavlo Pedenko (Wise) - Product Mindset in Fundraising for Charities_ $5M in 3...
Pavlo Pedenko (Wise) - Product Mindset in Fundraising for Charities_ $5M in 3...Pavlo Pedenko (Wise) - Product Mindset in Fundraising for Charities_ $5M in 3...
Pavlo Pedenko (Wise) - Product Mindset in Fundraising for Charities_ $5M in 3...
 

Recently uploaded

Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDGMarianaLemus7
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024The Digital Insurer
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piececharlottematthew16
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 

Recently uploaded (20)

Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDG
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 

Tom Mason (Stability AI) - Computing Large Foundational Models Unlisted

  • 1. Hyper realistic future landscape horizon Computing large Foundational models
  • 2. PROPRIETARY & CONFIDENTIAL 2 Generative AI’s impact and capabilities are just getting started
  • 3. PROPRIETARY & CONFIDENTIAL Internet 1997 - 2021 AI 2021 - 2030 (estimate) Line drawing of the Guggenheim PROPRIETARY & CONFIDENTIAL 3 Cathie Wood: AI market could be $87 Trillion $90T Enterprise value ($T) AI applications Foundation AI model- as-a-service APIs AI hardware AI software
  • 4. PROPRIETARY & CONFIDENTIAL AI age Computer age 4 Internet age
  • 5. PROPRIETARY & CONFIDENTIAL 5 The cost & time to produce any content is rapidly approaching zero New classes of AI models are increasingly cheaper and better across all modalities Classic western movie 30.0s | 2¢ To generate a high quality image with text 0.5s | 0.2¢ 8.0s | 2¢ To generate 750 words of human level text Price and times quoted are industry benchmarks and not meant to be specific to Stability 5 6.0s | 0.2¢ To generate 750 words of human level text To generate a high quality image with text 2022 2023
  • 6. PROPRIETARY & CONFIDENTIAL Creative agencies face a multi- billion dollar opportunity to embrace AI or get left behind 6 Create more surreal experiences Generative AI enables them to… Serve more clients Expand margins AI supercharges creativity and output for every member of the team so they can do more of their best work Rapidly prototype and blend new creative ideas across all content formats By being able to… Generate 100s of variants in seconds and maximize inspiration Write proposals 3x faster Enter novel creative realms Create campaigns personalized to each client customer Futuristic city with towering skyscrapers
  • 7. PROPRIETARY & CONFIDENTIAL Companies across the media landscape are already racing to be the first to adopt 7 Next-generation AI gains steam as Jasper gets $1.5B valuation PITCHBOOK | AI How BBDO is Supercharging the Creative Process With Generative AI ADDWEEK | ARTIFICIAL INTELLIGENCE Reinventing search with a new AI- powered Microsoft Bing and Edge, your copilot for the web OFFICIAL MICROSOFT BLOG It feels natural for us to dive head first into the potential of AI … we are building on their legacy by exploring how humans and machines work in harmony. DDB EXECUTIVE GLEN LOMAS Generative AI is an evolution of people working with machines to create content. For us, the magic occurs when you combine human insight – and cultural insight – with this ability to generate content with machines … This is WPP’s role. We apply these technologies, combine them with insight, and help our clients grow. STEPHAN PRETORIUS
  • 8. PROPRIETARY & CONFIDENTIAL 8 Stability AI is the leader in open-source, enterprise, generative AI for media companies
  • 9. PROPRIETARY & CONFIDENTIAL Built and supported by some of the best names in AI David Ha HEAD OF STRATEGY & RESEARCH Robin Rombach CO-INVENTOR OF STABLE DIFFUSION Patrick Hebron VP OF PRODUCT R&D Stanislav Fort SENIOR RESEARCH SCIENTIST Emad Mostaque CEO Tom Mason CTO Former CTO at Chorus Intelligence Ren Ito COO Former CEO of Mercari Europe, Japan’s 1st Unicorn $7B IPO Peter O’Donoghue CFO Former UK Head of Technology and Audit Partner at Deloitte Our investors: 9
  • 10. PROPRIETARY & CONFIDENTIAL 10 Infrastructure: Ezra-1 Unlike most startups, we have a critical strategic asset. One of the fastest supercomputers globally is the Ezra-1 UltraCluster at a steep discount to market value. The combined UltraCluster has 48,000 Cores, 576 Tb RAM, 4 Pb NVMe SSD, 4,000 A100s. We have 10 Pb of high speed FSx Lustre SSD storage that we can scale well above 100 Pb.
  • 11. PROPRIETARY & CONFIDENTIAL Our 3 technology pillars 11 Platform API Engineering HPC Data Serving an optimised infrastructure and application stack for our customers to use our models. Technology pillars in a grassy field RLHF Applied ML Fine-tuned Models Custom Pipelines Modality Teams Research Labelling Foundation Models
  • 12. PROPRIETARY & CONFIDENTIAL Stable Diffusion is redefining creativity The silly monsters parade, 8k, hyper details, rich colors, photograph 140K+ Stable Diffusion community Stable Foundation Discord members include Artists, Beta Testers & Developers 3B+ Images generated since launch 125K Reddit subscribers 52K Stable Diffusion Github stars 341 Hugging Face Stable Diffusion models Japanese spaceship in the style of a woodblock print 12
  • 13. PROPRIETARY & CONFIDENTIAL 13 Stable Diffusion got over 50,000 GitHub stars in 150 days 360 days 1,080 2,160 # of stars on GitHub since repository was started 3,240 days 2,880 720 1,440 1,800 Transformers Stable Diffusion Cockroach Ethereum Bitcoin Vercel 2,520 50K
  • 14. PROPRIETARY & CONFIDENTIAL Open source = platform value Stable Diffusion is the foundation layer 40M users 2M downloads 5M monthly traffic 2M users/ month 15M images generated 4M users/ month 41M downloads Brutalist architecture on top of K2 PROPRIETARY & CONFIDENTIAL 14
  • 15. PROPRIETARY & CONFIDENTIAL Our new content model, SDXL, was made for professional media use ● More expressive: 2.4B parameters (3x more than before) ● Easier to use: less complex prompting to get beautiful outputs ● Enhanced image composition: greater capability to produce and position legible text ● Wider breadth & depth of available styles: better incorporation of photorealistic and other applicable styles 15
  • 16. PROPRIETARY & CONFIDENTIAL Stability offers the full-suite of AI models tailored to enterprises 16 State-of-the-art models across all media modalities Stable Diffusion DeepFloyd “IF” (Q1) StableChat (Q2) StableMusic (Q2) Text-to-3D (Q4) Text-to-Video (Q3) Tailored to enterprise who care about IP security & compliance No saving or usage of proprietary IP in training Fully auditable model architecture and dataset construction Enterprise hosted SLAs & support or on-premise open source support Interior of a spaceship Timelines are estimates and subject to change
  • 17. PROPRIETARY & CONFIDENTIAL We’re building the default models for every domain Stable 3D COMING Q4 Stable Music COMING Q2 Stable Video COMING Q3 PROPRIETARY & CONFIDENTIAL 1 7
  • 18. PROPRIETARY & CONFIDENTIAL Stability Animation Alpha-testers are already demonstrating the model’s capabilities 18
  • 19. PROPRIETARY & CONFIDENTIAL Our newest models have state-of-the-art performance DeepFloyd “IF” COMING Q2 FID Scores Stable Diffusion = 12.6 DALLE = 10.3 Google Imagen = 7.2 DeepFloyd IF = 6.6 (Lower is better) 19
  • 20. PROPRIETARY & CONFIDENTIAL Realistic singing or spoken voice conversion 20 Convert between spoken voices… The Interior of a spaceship Original Converted … and singing voices.
  • 21. PROPRIETARY & CONFIDENTIAL 21 We have quickly become the standard across the media ecosystem Photoshop plugin Semantic mixing Seamless 3D textures All integrated with Stable Diffusion Enterprises can leverage Stability’s models via native integrations with the largest cloud providers Short-form creative content https://aws.amazon.com/blogs/machine- learning/stability-ai-builds-foundation- models-on-amazon-sagemaker/ https://stability.ai/blog/stability-ai-makes- its-stable-diffusion-models-available-on- amazons-new-bedrock-service
  • 22. PROPRIETARY & CONFIDENTIAL 22 We have quickly become the standard across the media ecosystem Photoshop plugin Semantic mixing Seamless 3D textures All integrated with Stable Diffusion Stability to be the default AI on every chip Short-form creative content
  • 23. PROPRIETARY & CONFIDENTIAL 23 Deployment: Easily fine-tune Stable Diffusion on your data using AWS Bedrock 23 Data on the Virtual Private Cloud (VPC) Bedrock Fine-tuned Model Encrypted data that does not leave the VPC and which will not be used to train the original base model Fine-tuning our models for the desired task without having to annotate large volumes of data Soon, AWS Bedrock will allow for an easy fine-tuning process for various use cases without worrying about data’s privacy. Our full suite of models will also be available for training on this API in the future.
  • 24. PROPRIETARY & CONFIDENTIAL 24 Customers get the benefit of AWS Sagemaker & Stability’s models tightly integrated Easily deploy, manage and fine-tune Stability models at scale with optimized infrastructure Enterprise level SLAs with 99.95% uptime and downtime redundancy Dedicated expertise and proof of concept support from Stability trained Sagemaker specialists
  • 25. PROPRIETARY & CONFIDENTIAL Platform API Docs Site 25 platform.stability.ai Stability SDK ● Packaged / PyPI ● T2I, I2I, Inpainting ● Variants (models / upscalers) Typescript Client ● Helper functions ● Node.js support Interfaces ● gRPC ● REST
  • 26. PROPRIETARY & CONFIDENTIAL Platform Interfaces / SDK 26 REST API ● Generations ● Upscaler Python / Typescript SDK ● Examples Discord Bots ● Demo new models (XL launched in Discord) ● Gather human feedback Notebooks ● Gradio / Colab ● Support developers History / Asset Service ● Asset storage S3/R2 ● Persisted history ● User assets (e.g. fine-tuning)
  • 27. PROPRIETARY & CONFIDENTIAL 27 Get better ideas faster with the Stability Platform API New concepts can be created by simply sketching a design and pairing it with a text prompt using Stable Diffusion controllable networks Input image Output images
  • 29. PROPRIETARY & CONFIDENTIAL Presets 29 ● New style presets ● Available via API using preset tag ● Use a combination of additional positive prompts and negative prompts ● Available in DreamStudio
  • 30. PROPRIETARY & CONFIDENTIAL Fine-Tuning API 30 - Evaluation > Dreambooth LoRA - 130 seconds pre-proc + training time! - Support for objects (e.g animals) + styles - Ingestion Pipeline (CLIPSeg) - Deployment in SageMaker (training) - Integration of API middleware - Jobs requested through gRPC/REST - Routed via queue - Using SM training routines - CloudWatch dashboard
  • 32. PROPRIETARY & CONFIDENTIAL Explore personalisation by leveraging your universe of past photos to inspire the future Collect a set of input images and use our fine-tuning API to learn a “style” to then output similar creative concepts Input style Output images 32
  • 33. PROPRIETARY & CONFIDENTIAL Animation API 33 ● Static image animation, video output ● Methods including 2D, 3D, 3D Warp, Video Init ● Frame interpolation ● Project storage in asset service ● Gradio Notebook Prompt: “A cyberpunk futuristic colourful crowded luxurious pedestrian street avenue hi-tech at morning time, blue neon lights, ray tracing, hdr, realistic shaded, extremely detailed, sharp focus, soft lighting, sunny"
  • 34. PROPRIETARY & CONFIDENTIAL 34 Soon, fine-tuning our animation models on specific characters and scenes will allow for a quick animation production process, in addition to other visual effect capabilities Our text-to-video and animation technologies are rapidly evolving
  • 35. PROPRIETARY & CONFIDENTIAL Animation / 35 emoji 1.0 photo stickers personalized AniMoji
  • 36. PROPRIETARY & CONFIDENTIAL Our recent competition with Peter Gabriel #diffusetogether 36
  • 37. PROPRIETARY & CONFIDENTIAL Optimisation 37 OneFlow (Static Compilation) ● It is reasonably fast: ○ 56.98 iterations per second on A100 GPUs, over 2x faster than Transformers. ○ Not the fastest compared to TensorRT (62.2it/s) and Paddle (68.2it/s), but: ● Deployment-friendly, enough to justify the overhead: ○ Nice multi-resolution support through multiple graphs (see https://github.com/Oneflow-Inc/diffusers/wiki/Optimization-for- Multi-Resolution-Picture for details). Fewer warm-up instances needed for multi-resolution and less middleware complexity ○ Fast compilation: a couple of seconds vs. 10+ mins. ○ Can dynamically load weights, nice for custom models. ○ Easy to modify network; sth like ControlNet or loading Lora weights are not huge hassles (unlike AIT and TensorRT, basically impossible without re-compilation). ○ Mocking torch environment, low maintenance once deployed. ○ Rapid ongoing development. Upcoming Frameworks Multiple targets / control planes
  • 39. PROPRIETARY & CONFIDENTIAL DreamStudio 39 ● Framework ○ Migration to React ○ New UI/UX ● Generation interface ● Editor ● Presets ● Back-End ○ History ○ Asset Service
  • 40. PROPRIETARY & CONFIDENTIAL Multiple Generations Styles 40
  • 42. Stable Diffusion 2.0 Depth to Image TRANSFORM IMAGES DYNAMICALLY
  • 43. Depth to Image TRANSFORM IMAGES DYNAMICALLY
  • 45. Stable Diffusion v2.1: “Professional photograph of Game of Thrones as a Japanese drama…”
  • 46. Stable Diffusion v2.1: “Professional photograph of Game of Thrones as a Japanese drama…”
  • 47. PROPRIETARY & CONFIDENTIAL Stability’s content models & surrounding infrastructure will soon enable full control of creative outputs 47 Content models with enough flexibility to get exact outputs while still providing creative exploration Create custom styles & personalized content models based on private assets with Stability’s fine-tuning infrastructure Leverage SDXL variants optimized for specific use-cases & stylistic outputs e.g. cinematic, Decide which parts of images to keep constant and which to change along a variety of dimensions (depth, pose, boundary, etc.) through custom SDXL adapters Allow users to modify images using natural language with Stability Instruct models
  • 48. PROPRIETARY & CONFIDENTIAL 48 Tap into global markets automatically with soon to be released fully controllable models Output images International cultural themes Today Use a base design and pass into Stable Diffusion img2img for quick concepting and then finalize in software of choice Q2 Use custom T2I adapters with our latest model (Stable Diffusion XL) to make fine-grained adjustments with multiple control types Input image
  • 49. PROPRIETARY & CONFIDENTIAL 49 Sample POCs that can be executed in 2023 Film & TV Creative Concepting Get to better ideas faster by concepting thousands of creative ideas in minutes and leveraging your past universe of creative work H1 2023 Advertising Collateral & Thumbnails Save time & money on marketing by leveraging production content to automatically generate first-pass marketing collateral & thumbnails H1 2023 Script & Storyline Assistance Get feedback and ideas on scripts with an evision specific large language model H2 2023 Post Video Production Augmentation Remove the need for extra shoots by automating post- production touch-ups H2 2023 Instant International Dubbing Reach new audience segments by allowing any content to be instantly dubbed in another language H2 2023 Timelines are estimates and subject to change
  • 50. PROPRIETARY & CONFIDENTIAL 50 Minimize re-screens with post-production scene augmentation Empower editors to experiment across all content dimensions Scenes & Props (SDXL & Controllable Networks) Dialogue, Sounds & Interactions (StableVoice & StableMusic) 1. Fine-tune a custom Stable Diffusion model on a chosen repository of production assets 2. Enable members of the post production team to issue natural language commands on a subset of frames to test changes such as scene and prop alterations 3. Utilize these changes directly in production or to more efficiently target re-screens Streamline post production edits with natural- language based editing Lush jungle background Urban town scene
  • 51. PROPRIETARY & CONFIDENTIAL 51 The next generation of film, tv, animation & music, will be redefined by Generative AI Dynamic & Interactive Content Personalized to Consumers ● Real-time adaptation of movies & shows with characters, scenes and whole storylines generated on-the-fly. ● Seamless dubbing and content translation to facilitate global accessibility and engagement. ● Hyper-personalized music & voice mixing to create the perfect composition. Massive Leverage to Creators who have the Best Visions ● Large-scale ideation enabled across all building blocks of movie / show development. ● Democratized access to easy-to-use tooling to go from concept to high-quality content. ● Novel music and voices generated via a combination of text and existing tooling. 51 “Create a Jumanji themed world for my 5 years old son”
  • 52. PROPRIETARY & CONFIDENTIAL 52 52 The content production & development pipeline in the Generative AI Era will be rapid and efficient Content production Character development Scene creation Audio effects and dubbing Fine-tune StableLM on relevant stories and characters for daily content production Fine-tune Stable Diffusion on your characters for further development Create stunning videos and animations centered on chosen characters and scenes Fast and low-cost dubbing using our audio models
  • 53. PROPRIETARY & CONFIDENTIAL 53 53 Create the perfect character using multiple variations A Princess An Arabian Princess Add a desert background The chosen character
  • 54. PROPRIETARY & CONFIDENTIAL Direct the model to elaborate and develop the content Deploy the customized model to help create characters and plots based on given themes and genres StableLM will be used for script writing, content creation, and character development 54 As a child, Princess Farah’s father, High-King of the Rub’ Al Khali’s seven Emirates, died under mysterious circumstances. The king had previously banned his brother Oday from ever being a rightful heir due to the latter’s crimes and corruption in the kingdom. However, Oday took the chance and declared himself king, banishing Princess Farah to the deep valleys of a distant land. Help me create a story of a female Arabian princess. Main themes should be family and unity Give me a general outline of how the plot will look like An old sage learns of Farah’s identity from the King’s Mark on her right palm. He then tells her the prophecy of a princess who retrieves the lost Staff of Unity to reunite a kingdom in times of deep turmoil. However, in the process, the princess sacrifices the most precious gift of love to save the kingdom. Farah then makes it her life’s goal to fulfill her destiny. Fine-tune StableLM on various characters and plots
  • 55. 55 Instant access to interactive, human-like characters is becoming widely accessible Large language models provide on-demand conversation & action simulation ● Accessible: Easily interactive as if talking to another person ● Personalized: Provide context-dependant answers & reasoning ● Knowledgeable: Understand storylines, world- building & fundamental subjects ● Customizable: Able to leverage external tools and knowledge bases to take on new functions & personalities 55 “Come on, John. Let’s get out of here” Follow her Go back
  • 56. PROPRIETARY & CONFIDENTIAL 56 Supercharge creative abilities and keep IP safe with multiple ways to partner Be among the first to know and integrate new models • Early access to new foundation models and model updates. • Dedicated 1x1 “Ask me anything” each month. • Shared Stability Slack channel for asynchronous assistance. Access enterprise-grade, hosted APIs of Stability AI’s models. • Model and fine-tuning APIs. • Enterprise support & SLAs via Amazon Sagemaker. • Usage-based, tiered pricing. • No research usage of proprietary data. Leverage Stability’s AI engineers to build custom models based on Meitu’s unique assets • Meitu specific styles and models created based on past assets. Both design and language. • Pipelines setup to enable continual updates and modifications. • Hosted on prem or via Sagemaker. Stability Hosted API Preferred Access Program Custom Models
  • 57. Rocket blasting into space The future of creativity is already here
  • 58. PROPRIETARY & CONFIDENTIAL 58 Integrate generative AI correctly, efficiently & safely with Stability’s three-pronged approach 58 Create the appropriate POC creation & evaluation pipeline for integration • Dataset aggregation and labeling pipeline is created for any custom model work needed. All models hosted on-prem or via AWS Sagemaker. • POC sandbox for evaluation is setup for frequent Stability & evision review sessions on POC progress and needed adjustments. Integrate POC throughout relevant orgs in a phased approach while restarting the process for the next POC • evision and Stability work together to conduct a secure, phased roll-out with the appropriate quality assurance & reporting procedures. • Scoping starts on the next set of workflows & experiences to tackle. • Frequent cadence of research and engineering previews to keep you up to date on what’s coming. POC Creation & Evaluation Integration & Deployment Find the right use-case & implementation strategy for today while planning for tomorrow Scoping • Audit evision’s internal processes and internal data to determine feasible & quick high ROI tasks to triage. • POCs mutually agreed upon based on a combination of ROI, speed of testing, ease of integration / deployment, dataset availability and technological maturity. 1 2 3