SlideShare a Scribd company logo
DEEP LEARNING
applications
Julia Rabetti Giannella
Research assistant at VISGRAF Lab
PhD in Design and Technology (PPDESDI-UERJ)
juliagiannella@gmail.com
APPLICATIONS
• Colorization of Black and White Images
• Adding Sounds To Silent Movies
• Object Classification in Photographs
• Automatic Handwriting Generation
• Character Text Generation.
• Image Caption Generation.
• Automatic Game Playing
• Artistic style transfer
Source: http://machinelearningmastery.com/inspirational-applications-deep-learning/
1) Colorization of Black and White Images
• problem of adding color to black and white photographs
• traditionally, this was done by hand with human effort
• CV task attacked by different approaches
• topic of relative importance in SIGGRAPH and EUROGRAPH
• DL approach involves the use of very large CNN 

and supervised layers that recreate the image with 

the addition of color
Paper Colorful Image Colorization (ECCV, 2016)
Source: http://richzhang.github.io/colorization/
Network architecture
Source: https://arxiv.org/pdf/1603.08511.pdf
Semantic interpretability of results
Source: http://richzhang.github.io/colorization/
[Algorithmia] Demo
Source: http://demos.algorithmia.com/colorize-photos/
Dana Keller - designer and photo colorizer
Source: https://www.youtube.com/watch?v=bYHnWhZkAIc
Source: http://www.danarkeller.com/about/
Comparing
Keller Algorithmia
Comparing
Keller Algorithmia
Comparing
Keller Algorithmia
Comparing
Keller Algorithmia
Comparing
Keller Algorithmia
Comparing
Keller Algorithmia
2) Object Classification in Photographs
• task requires the classification of objects within a
photograph as one of a set of previously known objects
• State-of-the-art results have been achieved on benchmark
examples of this problem using very large CNN
• derives from image classification task
• breakthrough: ImageNet Classification with Deep
Convolutional Neural Networks (Krizhevsky et al., 2012)
• AlexNet won ILSVRC-2012 challenge
Source: http://www.cs.toronto.edu/~fritz/absps/imagenet.pdf
Classification with localization
• more complex variation
of this task involves
specifically identifying
one or more objects
within the scene of the
photograph and drawing
a box around them
• GoogLeNet won
ILSVRC-2014 challenge
in this task
Source: https://research.googleblog.com/2014/09/building-deeper-understanding-of-images.html
2.1) DL and RIO2016
• VISGRAF project (out 2016)
• task: automatically classify and cluster images by subject
features related to the Olympic Games, Olympic Torch
• CNN model and supervised learning
• TensorFlow (open source software library)
• Inception-v3 (Going Deeper with Convolutions, 2015)
• transfer learning (manually labeled 100 examples)
Source: http://lvelho.impa.br/dl_rio2016/index.html
Source: https://arxiv.org/abs/1409.4842
Confidence score
Source: http://lvelho.impa.br/dl_rio2016/metodologia.html
A subset of 12
from 2091
images with
confidence
score over 83%
for the Olympic
torch category
Torch Mosaic
Source: http://lvelho.impa.br/dl_rio2016/mosaico.html
Torch Mosaic
Source: http://lvelho.impa.br/dl_rio2016/mosaico.html
2.2) Twitter Facial Analysis Reveals Demographics
of Presidential Campaign Followers
• (Mit Technology Review, march 2016)
• IN: Conference on Web and Social Media
• understand follower demographics of Trump and Clinton by
crossing Twitter metadata and facial features
• a CNN model on followers’ profile images extracts
information on gender, race and age
Source: https://www.technologyreview.com/s/601074/twitter-facial-analysis-reveals-demographics-of-presidential-
campaign-followers/?utm_campaign=add_this&utm_source=email&utm_medium=post
Source: https://arxiv.org/abs/1603.03097
A Comparison of the Trumpists and Clintonists
Source: https://arxiv.org/abs/1603.03097
C"lintonists"

in the Twitter
Sphere
2.3) NVIDIA DRIVENet Demo - Visualizing 

a Self-Driving Car
Source: https://www.youtube.com/watch?v=HJ58dbd5g8g
3) Artistic style transfer
• task: separate and recombine content and style of arbitrary
images, providing a neural algorithm for the creation of
artistic images
• A Neural Algorithm of Artistic Style (Gatys et al., 2015)
Source: https://arxiv.org/abs/1508.06576
Convolutional Neural Network (CNN)
Source: https://arxiv.org/abs/1508.06576
An example
Source: https://research.googleblog.com/2016/02/exploring-intersection-of-art-and.html
The style
transfer
algorithm
crosses a photo
with a painting
style; for
example Neil
deGrasse Tyson
in the style of
Kadinsky’s Jane
Rouge Bleu.
Photo by
Guillaume Piolle,
used with
permission.
3.1) DeepDream
Source: http://deepdreamgenerator.com/
Source: https://en.wikipedia.org/wiki/DeepDream
• computer vision program created by Google
• given an input image returns a version with h"allucinogenic"
appearance
• originates in a CNN codenamed Inception after the film of
the same name developed for the ILSVRC-2014
• CNN can also be run in reverse, to do synthesis
• enhance faces and certain animals -> pareidolia results
3.1) DeepDream
Source: http://deepdreamgenerator.com/
Source: https://en.wikipedia.org/wiki/DeepDream
3.2) Prisma App
Source: http://prisma-ai.com/
Source: https://en.wikipedia.org/wiki/Prisma_(app)
• photo-editing application that utilizes a neural network and
to transform the image into an artistic effect
• became popular on July 2016
• created by Alexey Moiseenkov
• reference A Neural Algorithm of Artistic Style (2016)
3.2) Prisma App
3.2) Prisma App
3.3) Artistic style transfer (video)
Source: https://arxiv.org/abs/1604.08610
Source: https://www.youtube.com/watch?v=Khuj4ASldmU
• Artistic style transfer for videos (Ruder et al.,2016)
3.4) Supercharging Style Transfer for video
Source: https://arxiv.org/abs/1610.07629
Source: https://research.googleblog.com/2016/10/supercharging-style-transfer.html
• A Learned Representation For Artistic Style (Dumoulin et al.,
2016)
• CNN that learns multiple styles at the same time
• method enables style interpolation
3.4) Supercharging Style Transfer for video
Source: https://www.youtube.com/watch?v=6ZHiARZmiUI

More Related Content

Similar to Deep Learning applications

Music and Art with Machine Learning | GDG DevFest Bangkok 2017 (Oct 7th, ...
Music and Art with Machine Learning   |  GDG DevFest Bangkok 2017  (Oct 7th, ...Music and Art with Machine Learning   |  GDG DevFest Bangkok 2017  (Oct 7th, ...
Music and Art with Machine Learning | GDG DevFest Bangkok 2017 (Oct 7th, ...
Virot "Ta" Chiraphadhanakul
 
CERTH @ MediaEval 2014 Social Event Detection Task
CERTH @ MediaEval 2014 Social Event Detection TaskCERTH @ MediaEval 2014 Social Event Detection Task
CERTH @ MediaEval 2014 Social Event Detection Task
multimediaeval
 
Deep Learning behind Prisma
Deep Learning behind PrismaDeep Learning behind Prisma
Deep Learning behind Prisma
lostleaves
 
DH101 2013/2014 course 4 - Digitization techniques 2D and 3D
DH101 2013/2014 course 4 - Digitization techniques 2D and 3D DH101 2013/2014 course 4 - Digitization techniques 2D and 3D
DH101 2013/2014 course 4 - Digitization techniques 2D and 3D Frederic Kaplan
 
Promises of Deep Learning
Promises of Deep LearningPromises of Deep Learning
Promises of Deep Learning
David Khosid
 
Can AI say from our eyes when we read relevant information?
Can AI say from our eyes when we read relevant information?Can AI say from our eyes when we read relevant information?
Can AI say from our eyes when we read relevant information?
Nilavra Bhattacharya
 
Lecture 29 Convolutional Neural Networks - Computer Vision Spring2015
Lecture 29 Convolutional Neural Networks -  Computer Vision Spring2015Lecture 29 Convolutional Neural Networks -  Computer Vision Spring2015
Lecture 29 Convolutional Neural Networks - Computer Vision Spring2015
Jia-Bin Huang
 
A Web-based Service for Image Tampering Detection
A Web-based Service for Image Tampering DetectionA Web-based Service for Image Tampering Detection
A Web-based Service for Image Tampering Detection
Symeon Papadopoulos
 
Deep Learning and Reinforcement Learning
Deep Learning and Reinforcement LearningDeep Learning and Reinforcement Learning
Deep Learning and Reinforcement Learning
Renārs Liepiņš
 
Imagine camp, Developing Image Processing app for windows phone platform
Imagine camp, Developing Image Processing app for windows phone platformImagine camp, Developing Image Processing app for windows phone platform
Imagine camp, Developing Image Processing app for windows phone platform
Rahat Yasir
 
Generative adversarial network and its applications to speech signal and natu...
Generative adversarial network and its applications to speech signal and natu...Generative adversarial network and its applications to speech signal and natu...
Generative adversarial network and its applications to speech signal and natu...
宏毅 李
 
Generative Adversarial Network and its Applications to Speech Processing an...
Generative Adversarial Network and its Applications to Speech Processing an...Generative Adversarial Network and its Applications to Speech Processing an...
Generative Adversarial Network and its Applications to Speech Processing an...
宏毅 李
 
Automated_attendance_system_project.pptx
Automated_attendance_system_project.pptxAutomated_attendance_system_project.pptx
Automated_attendance_system_project.pptx
Naveensai51
 
[CVPR 2018] Visual Search (Image Retrieval) and Metric Learning
[CVPR 2018] Visual Search (Image Retrieval) and Metric Learning[CVPR 2018] Visual Search (Image Retrieval) and Metric Learning
[CVPR 2018] Visual Search (Image Retrieval) and Metric Learning
NAVER Engineering
 
Image Search: Then and Now
Image Search: Then and NowImage Search: Then and Now
Image Search: Then and Now
Si Krishan
 
Searching Images: Recent research at Southampton
Searching Images: Recent research at SouthamptonSearching Images: Recent research at Southampton
Searching Images: Recent research at Southampton
Jonathon Hare
 
Generative models
Generative modelsGenerative models
Generative models
Birger Moell
 
7-200404101602.pdf
7-200404101602.pdf7-200404101602.pdf
7-200404101602.pdf
ssuser07e9f2
 
Image caption generation L18_CNN_RNN_2.pptx
Image caption generation L18_CNN_RNN_2.pptxImage caption generation L18_CNN_RNN_2.pptx
Image caption generation L18_CNN_RNN_2.pptx
erharshkumarroy
 
Large Scale Image Forensics using Tika and Tensorflow [ICMR MFSec 2017]
Large Scale Image Forensics using Tika and Tensorflow [ICMR MFSec 2017]Large Scale Image Forensics using Tika and Tensorflow [ICMR MFSec 2017]
Large Scale Image Forensics using Tika and Tensorflow [ICMR MFSec 2017]
Thamme Gowda
 

Similar to Deep Learning applications (20)

Music and Art with Machine Learning | GDG DevFest Bangkok 2017 (Oct 7th, ...
Music and Art with Machine Learning   |  GDG DevFest Bangkok 2017  (Oct 7th, ...Music and Art with Machine Learning   |  GDG DevFest Bangkok 2017  (Oct 7th, ...
Music and Art with Machine Learning | GDG DevFest Bangkok 2017 (Oct 7th, ...
 
CERTH @ MediaEval 2014 Social Event Detection Task
CERTH @ MediaEval 2014 Social Event Detection TaskCERTH @ MediaEval 2014 Social Event Detection Task
CERTH @ MediaEval 2014 Social Event Detection Task
 
Deep Learning behind Prisma
Deep Learning behind PrismaDeep Learning behind Prisma
Deep Learning behind Prisma
 
DH101 2013/2014 course 4 - Digitization techniques 2D and 3D
DH101 2013/2014 course 4 - Digitization techniques 2D and 3D DH101 2013/2014 course 4 - Digitization techniques 2D and 3D
DH101 2013/2014 course 4 - Digitization techniques 2D and 3D
 
Promises of Deep Learning
Promises of Deep LearningPromises of Deep Learning
Promises of Deep Learning
 
Can AI say from our eyes when we read relevant information?
Can AI say from our eyes when we read relevant information?Can AI say from our eyes when we read relevant information?
Can AI say from our eyes when we read relevant information?
 
Lecture 29 Convolutional Neural Networks - Computer Vision Spring2015
Lecture 29 Convolutional Neural Networks -  Computer Vision Spring2015Lecture 29 Convolutional Neural Networks -  Computer Vision Spring2015
Lecture 29 Convolutional Neural Networks - Computer Vision Spring2015
 
A Web-based Service for Image Tampering Detection
A Web-based Service for Image Tampering DetectionA Web-based Service for Image Tampering Detection
A Web-based Service for Image Tampering Detection
 
Deep Learning and Reinforcement Learning
Deep Learning and Reinforcement LearningDeep Learning and Reinforcement Learning
Deep Learning and Reinforcement Learning
 
Imagine camp, Developing Image Processing app for windows phone platform
Imagine camp, Developing Image Processing app for windows phone platformImagine camp, Developing Image Processing app for windows phone platform
Imagine camp, Developing Image Processing app for windows phone platform
 
Generative adversarial network and its applications to speech signal and natu...
Generative adversarial network and its applications to speech signal and natu...Generative adversarial network and its applications to speech signal and natu...
Generative adversarial network and its applications to speech signal and natu...
 
Generative Adversarial Network and its Applications to Speech Processing an...
Generative Adversarial Network and its Applications to Speech Processing an...Generative Adversarial Network and its Applications to Speech Processing an...
Generative Adversarial Network and its Applications to Speech Processing an...
 
Automated_attendance_system_project.pptx
Automated_attendance_system_project.pptxAutomated_attendance_system_project.pptx
Automated_attendance_system_project.pptx
 
[CVPR 2018] Visual Search (Image Retrieval) and Metric Learning
[CVPR 2018] Visual Search (Image Retrieval) and Metric Learning[CVPR 2018] Visual Search (Image Retrieval) and Metric Learning
[CVPR 2018] Visual Search (Image Retrieval) and Metric Learning
 
Image Search: Then and Now
Image Search: Then and NowImage Search: Then and Now
Image Search: Then and Now
 
Searching Images: Recent research at Southampton
Searching Images: Recent research at SouthamptonSearching Images: Recent research at Southampton
Searching Images: Recent research at Southampton
 
Generative models
Generative modelsGenerative models
Generative models
 
7-200404101602.pdf
7-200404101602.pdf7-200404101602.pdf
7-200404101602.pdf
 
Image caption generation L18_CNN_RNN_2.pptx
Image caption generation L18_CNN_RNN_2.pptxImage caption generation L18_CNN_RNN_2.pptx
Image caption generation L18_CNN_RNN_2.pptx
 
Large Scale Image Forensics using Tika and Tensorflow [ICMR MFSec 2017]
Large Scale Image Forensics using Tika and Tensorflow [ICMR MFSec 2017]Large Scale Image Forensics using Tika and Tensorflow [ICMR MFSec 2017]
Large Scale Image Forensics using Tika and Tensorflow [ICMR MFSec 2017]
 

More from Júlia Rabetti Giannella

Interface gráfica do usuário para exploração e visualização de grandes coleçõ...
Interface gráfica do usuário para exploração e visualização de grandes coleçõ...Interface gráfica do usuário para exploração e visualização de grandes coleçõ...
Interface gráfica do usuário para exploração e visualização de grandes coleçõ...
Júlia Rabetti Giannella
 
Hands-on activity on Data Visualization
Hands-on activity on Data VisualizationHands-on activity on Data Visualization
Hands-on activity on Data Visualization
Júlia Rabetti Giannella
 
Perspectivas para integração do Design nas Humanidades Digitais frente ao des...
Perspectivas para integração do Design nas Humanidades Digitais frente ao des...Perspectivas para integração do Design nas Humanidades Digitais frente ao des...
Perspectivas para integração do Design nas Humanidades Digitais frente ao des...
Júlia Rabetti Giannella
 
ObsevartóR!O2016 - interseções entre arte e técnicas de Deep Learning
ObsevartóR!O2016 - interseções entre arte e técnicas de Deep LearningObsevartóR!O2016 - interseções entre arte e técnicas de Deep Learning
ObsevartóR!O2016 - interseções entre arte e técnicas de Deep Learning
Júlia Rabetti Giannella
 
Dataviz, Twitter e #Rio2016: percepção dos dados olímpicos
Dataviz, Twitter e #Rio2016: percepção dos dados olímpicosDataviz, Twitter e #Rio2016: percepção dos dados olímpicos
Dataviz, Twitter e #Rio2016: percepção dos dados olímpicos
Júlia Rabetti Giannella
 
Rio 2016 Contestation Images: a critical perception from content collected on...
Rio 2016 Contestation Images: a critical perception from content collected on...Rio 2016 Contestation Images: a critical perception from content collected on...
Rio 2016 Contestation Images: a critical perception from content collected on...
Júlia Rabetti Giannella
 
Campus Party Brasil 2017: OBSERVATÓR!O2016: perceptions of olympics through d...
Campus Party Brasil 2017: OBSERVATÓR!O2016: perceptions of olympics through d...Campus Party Brasil 2017: OBSERVATÓR!O2016: perceptions of olympics through d...
Campus Party Brasil 2017: OBSERVATÓR!O2016: perceptions of olympics through d...
Júlia Rabetti Giannella
 
Data visualization: advances by brazilian researches
Data visualization: advances by brazilian researches Data visualization: advances by brazilian researches
Data visualization: advances by brazilian researches
Júlia Rabetti Giannella
 
Design e interfaces cartográficas: avanços para pesquisa e atuação profissional
Design e interfaces cartográficas: avanços para pesquisa e atuação profissionalDesign e interfaces cartográficas: avanços para pesquisa e atuação profissional
Design e interfaces cartográficas: avanços para pesquisa e atuação profissional
Júlia Rabetti Giannella
 
Dispositivo infovis: interfaces entre visualização da informação, infografia ...
Dispositivo infovis: interfaces entre visualização da informação, infografia ...Dispositivo infovis: interfaces entre visualização da informação, infografia ...
Dispositivo infovis: interfaces entre visualização da informação, infografia ...
Júlia Rabetti Giannella
 

More from Júlia Rabetti Giannella (10)

Interface gráfica do usuário para exploração e visualização de grandes coleçõ...
Interface gráfica do usuário para exploração e visualização de grandes coleçõ...Interface gráfica do usuário para exploração e visualização de grandes coleçõ...
Interface gráfica do usuário para exploração e visualização de grandes coleçõ...
 
Hands-on activity on Data Visualization
Hands-on activity on Data VisualizationHands-on activity on Data Visualization
Hands-on activity on Data Visualization
 
Perspectivas para integração do Design nas Humanidades Digitais frente ao des...
Perspectivas para integração do Design nas Humanidades Digitais frente ao des...Perspectivas para integração do Design nas Humanidades Digitais frente ao des...
Perspectivas para integração do Design nas Humanidades Digitais frente ao des...
 
ObsevartóR!O2016 - interseções entre arte e técnicas de Deep Learning
ObsevartóR!O2016 - interseções entre arte e técnicas de Deep LearningObsevartóR!O2016 - interseções entre arte e técnicas de Deep Learning
ObsevartóR!O2016 - interseções entre arte e técnicas de Deep Learning
 
Dataviz, Twitter e #Rio2016: percepção dos dados olímpicos
Dataviz, Twitter e #Rio2016: percepção dos dados olímpicosDataviz, Twitter e #Rio2016: percepção dos dados olímpicos
Dataviz, Twitter e #Rio2016: percepção dos dados olímpicos
 
Rio 2016 Contestation Images: a critical perception from content collected on...
Rio 2016 Contestation Images: a critical perception from content collected on...Rio 2016 Contestation Images: a critical perception from content collected on...
Rio 2016 Contestation Images: a critical perception from content collected on...
 
Campus Party Brasil 2017: OBSERVATÓR!O2016: perceptions of olympics through d...
Campus Party Brasil 2017: OBSERVATÓR!O2016: perceptions of olympics through d...Campus Party Brasil 2017: OBSERVATÓR!O2016: perceptions of olympics through d...
Campus Party Brasil 2017: OBSERVATÓR!O2016: perceptions of olympics through d...
 
Data visualization: advances by brazilian researches
Data visualization: advances by brazilian researches Data visualization: advances by brazilian researches
Data visualization: advances by brazilian researches
 
Design e interfaces cartográficas: avanços para pesquisa e atuação profissional
Design e interfaces cartográficas: avanços para pesquisa e atuação profissionalDesign e interfaces cartográficas: avanços para pesquisa e atuação profissional
Design e interfaces cartográficas: avanços para pesquisa e atuação profissional
 
Dispositivo infovis: interfaces entre visualização da informação, infografia ...
Dispositivo infovis: interfaces entre visualização da informação, infografia ...Dispositivo infovis: interfaces entre visualização da informação, infografia ...
Dispositivo infovis: interfaces entre visualização da informação, infografia ...
 

Recently uploaded

Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Ramesh Iyer
 
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Product School
 
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
Product School
 
When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...
Elena Simperl
 
Knowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and backKnowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and back
Elena Simperl
 
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualitySoftware Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Inflectra
 
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdfSmart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
91mobiles
 
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdfFIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance
 
Accelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish CachingAccelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish Caching
Thijs Feryn
 
Connector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a buttonConnector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a button
DianaGray10
 
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
BookNet Canada
 
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
Product School
 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
Laura Byrne
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance
 
Generating a custom Ruby SDK for your web service or Rails API using Smithy
Generating a custom Ruby SDK for your web service or Rails API using SmithyGenerating a custom Ruby SDK for your web service or Rails API using Smithy
Generating a custom Ruby SDK for your web service or Rails API using Smithy
g2nightmarescribd
 
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Product School
 
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdfFIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance
 
The Future of Platform Engineering
The Future of Platform EngineeringThe Future of Platform Engineering
The Future of Platform Engineering
Jemma Hussein Allen
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
Ana-Maria Mihalceanu
 
Assuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyesAssuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyes
ThousandEyes
 

Recently uploaded (20)

Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
 
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...
 
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
 
When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...
 
Knowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and backKnowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and back
 
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualitySoftware Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
 
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdfSmart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
 
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdfFIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdf
 
Accelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish CachingAccelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish Caching
 
Connector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a buttonConnector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a button
 
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
 
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
 
Generating a custom Ruby SDK for your web service or Rails API using Smithy
Generating a custom Ruby SDK for your web service or Rails API using SmithyGenerating a custom Ruby SDK for your web service or Rails API using Smithy
Generating a custom Ruby SDK for your web service or Rails API using Smithy
 
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
 
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdfFIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
 
The Future of Platform Engineering
The Future of Platform EngineeringThe Future of Platform Engineering
The Future of Platform Engineering
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
 
Assuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyesAssuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyes
 

Deep Learning applications

  • 1. DEEP LEARNING applications Julia Rabetti Giannella Research assistant at VISGRAF Lab PhD in Design and Technology (PPDESDI-UERJ) juliagiannella@gmail.com
  • 2. APPLICATIONS • Colorization of Black and White Images • Adding Sounds To Silent Movies • Object Classification in Photographs • Automatic Handwriting Generation • Character Text Generation. • Image Caption Generation. • Automatic Game Playing • Artistic style transfer Source: http://machinelearningmastery.com/inspirational-applications-deep-learning/
  • 3. 1) Colorization of Black and White Images • problem of adding color to black and white photographs • traditionally, this was done by hand with human effort • CV task attacked by different approaches • topic of relative importance in SIGGRAPH and EUROGRAPH • DL approach involves the use of very large CNN 
 and supervised layers that recreate the image with 
 the addition of color
  • 4. Paper Colorful Image Colorization (ECCV, 2016) Source: http://richzhang.github.io/colorization/
  • 6. Semantic interpretability of results Source: http://richzhang.github.io/colorization/
  • 8. Dana Keller - designer and photo colorizer Source: https://www.youtube.com/watch?v=bYHnWhZkAIc Source: http://www.danarkeller.com/about/
  • 15. 2) Object Classification in Photographs • task requires the classification of objects within a photograph as one of a set of previously known objects • State-of-the-art results have been achieved on benchmark examples of this problem using very large CNN • derives from image classification task • breakthrough: ImageNet Classification with Deep Convolutional Neural Networks (Krizhevsky et al., 2012) • AlexNet won ILSVRC-2012 challenge Source: http://www.cs.toronto.edu/~fritz/absps/imagenet.pdf
  • 16. Classification with localization • more complex variation of this task involves specifically identifying one or more objects within the scene of the photograph and drawing a box around them • GoogLeNet won ILSVRC-2014 challenge in this task Source: https://research.googleblog.com/2014/09/building-deeper-understanding-of-images.html
  • 17. 2.1) DL and RIO2016 • VISGRAF project (out 2016) • task: automatically classify and cluster images by subject features related to the Olympic Games, Olympic Torch • CNN model and supervised learning • TensorFlow (open source software library) • Inception-v3 (Going Deeper with Convolutions, 2015) • transfer learning (manually labeled 100 examples) Source: http://lvelho.impa.br/dl_rio2016/index.html Source: https://arxiv.org/abs/1409.4842
  • 18. Confidence score Source: http://lvelho.impa.br/dl_rio2016/metodologia.html A subset of 12 from 2091 images with confidence score over 83% for the Olympic torch category
  • 21. 2.2) Twitter Facial Analysis Reveals Demographics of Presidential Campaign Followers • (Mit Technology Review, march 2016) • IN: Conference on Web and Social Media • understand follower demographics of Trump and Clinton by crossing Twitter metadata and facial features • a CNN model on followers’ profile images extracts information on gender, race and age Source: https://www.technologyreview.com/s/601074/twitter-facial-analysis-reveals-demographics-of-presidential- campaign-followers/?utm_campaign=add_this&utm_source=email&utm_medium=post Source: https://arxiv.org/abs/1603.03097
  • 22. A Comparison of the Trumpists and Clintonists Source: https://arxiv.org/abs/1603.03097 C"lintonists"
 in the Twitter Sphere
  • 23. 2.3) NVIDIA DRIVENet Demo - Visualizing 
 a Self-Driving Car Source: https://www.youtube.com/watch?v=HJ58dbd5g8g
  • 24. 3) Artistic style transfer • task: separate and recombine content and style of arbitrary images, providing a neural algorithm for the creation of artistic images • A Neural Algorithm of Artistic Style (Gatys et al., 2015) Source: https://arxiv.org/abs/1508.06576
  • 25. Convolutional Neural Network (CNN) Source: https://arxiv.org/abs/1508.06576
  • 26. An example Source: https://research.googleblog.com/2016/02/exploring-intersection-of-art-and.html The style transfer algorithm crosses a photo with a painting style; for example Neil deGrasse Tyson in the style of Kadinsky’s Jane Rouge Bleu. Photo by Guillaume Piolle, used with permission.
  • 27. 3.1) DeepDream Source: http://deepdreamgenerator.com/ Source: https://en.wikipedia.org/wiki/DeepDream • computer vision program created by Google • given an input image returns a version with h"allucinogenic" appearance • originates in a CNN codenamed Inception after the film of the same name developed for the ILSVRC-2014 • CNN can also be run in reverse, to do synthesis • enhance faces and certain animals -> pareidolia results
  • 28. 3.1) DeepDream Source: http://deepdreamgenerator.com/ Source: https://en.wikipedia.org/wiki/DeepDream
  • 29. 3.2) Prisma App Source: http://prisma-ai.com/ Source: https://en.wikipedia.org/wiki/Prisma_(app) • photo-editing application that utilizes a neural network and to transform the image into an artistic effect • became popular on July 2016 • created by Alexey Moiseenkov • reference A Neural Algorithm of Artistic Style (2016)
  • 32. 3.3) Artistic style transfer (video) Source: https://arxiv.org/abs/1604.08610 Source: https://www.youtube.com/watch?v=Khuj4ASldmU • Artistic style transfer for videos (Ruder et al.,2016)
  • 33. 3.4) Supercharging Style Transfer for video Source: https://arxiv.org/abs/1610.07629 Source: https://research.googleblog.com/2016/10/supercharging-style-transfer.html • A Learned Representation For Artistic Style (Dumoulin et al., 2016) • CNN that learns multiple styles at the same time • method enables style interpolation
  • 34. 3.4) Supercharging Style Transfer for video Source: https://www.youtube.com/watch?v=6ZHiARZmiUI