SlideShare a Scribd company logo
STORY VISUALIZATION USING
DEEP LEARNING
Members:
Hassan Latif 2017152
Adil Jamal 2017228
Rehana Saboor 2017387
Saad Iqbal 2017395
Supervisor:
Mr. Badre Munir
Co-supervisor:
Dr. Muhammad Hanif
Motivation
 Creating images and videos from text is a challenging task even for modern-day AI.
 Usually the images and videos are created pixel by pixel and this approach lacks clarity in
images/videos and also the duration of videos is too short.
 To improve upon the generation of videos from text, we decided to use artificial intelligence and
augmented reality to create 3D scenes of story books.
 Existing Systems:
There are a number of storytelling apps developed using augmented reality and virtual reality:
 Quiver Augmented Reality
 Website: https://quivervision.com/
 Wonderscope
 Website: https://wonderscope.com/
Introduction
 Our project implements an Android application that uses Artificial Intelligence and
Augmented Reality to visualize a story.
 We visualize some paragraphs of descriptive comprehensions, primarily stories in 3D
geometric comprehensions.
 Each scene contains background information and, characters and their actions.
Description
 An image is captured, containing a few paragraphs of a storybook.
The text is extracted from the image which is then fed to BERT
models.
 BERT models are natural language processing models that extract
scene-specific information i.e. the background, the characters,
and the actions of these characters present in the scene.
 This information is then sent to Unity, which generates 3D
backgrounds and characters.
 All of this information is further passed on to Vuforia for
visualization in Augmented Reality.
3
Run inference and create scene
1
Combine background, character
and actions.
2
Database
Take a picture
Project Flow
Background
Character
Action
Technologies
 Our project is an Android application so we have utilized Android
development tools.
 TensorFlow is used to train BERT models.
 Characters and animations were downloaded from mixamo.com.
 Unity is used to generate 3D scenes.
 Vuforia is used to visualize the scenes in AR.
 Flask and Firebase are used to implement the backend.
Functional Requirements
 User Sign Up.
 User Log In.
 Select an image from stored images in file manager or capture a new image.
 Save the scene (the text of the scene).
 Delete saved scene.
 Choose different voices to narrate the text of the scene.
 Visualize the scene on a surface.
 Change scale of scene.
 Change characters in the scene (if any).
 Change background of the scene.
 Delete saved scene (text description of the scene).
 Logout from the application.

More Related Content

Similar to FYP Presentation

3D in Android
3D in Android3D in Android
3D in Android
Wen Jiun Yap
 
Lessons Learned: Designer/Developer Productivity in Windows Presentation Foun...
Lessons Learned: Designer/Developer Productivity in Windows Presentation Foun...Lessons Learned: Designer/Developer Productivity in Windows Presentation Foun...
Lessons Learned: Designer/Developer Productivity in Windows Presentation Foun...
goodfriday
 
10 best animation software for windows
10 best animation software for windows10 best animation software for windows
10 best animation software for windows
Javubar Sathick
 
Devraj_Nataraj_CV_PDF
Devraj_Nataraj_CV_PDFDevraj_Nataraj_CV_PDF
Devraj_Nataraj_CV_PDF
Devraj Nataraj
 
Sea dragon technology
Sea dragon technologySea dragon technology
Sea dragon technology
Sagar Patel
 
Fake Video Creation and Detection: A Review
Fake Video Creation and Detection: A ReviewFake Video Creation and Detection: A Review
Fake Video Creation and Detection: A Review
IRJET Journal
 
Build an AR app v2.0
Build an AR app v2.0Build an AR app v2.0
Build an AR app v2.0
Kumar Ahir
 
Development of a 3d animated short film using 3d animation technique
Development of a 3d animated short film using 3d animation techniqueDevelopment of a 3d animated short film using 3d animation technique
Development of a 3d animated short film using 3d animation technique
Afsarah Jahin
 
Adobe after effects creative cloud cc 2017
Adobe after effects creative cloud cc 2017Adobe after effects creative cloud cc 2017
Adobe after effects creative cloud cc 2017
DAVID RAUDALES
 
ParisfxLab - 4DViews
ParisfxLab - 4DViewsParisfxLab - 4DViews
ParisfxLab - 4DViews
François Hanat
 
Motion Capturing Using Computer Vision.pdf
Motion Capturing Using Computer Vision.pdfMotion Capturing Using Computer Vision.pdf
Motion Capturing Using Computer Vision.pdf
RaghavGupta397411
 
Magic Leap Developers Presentation by Cliff Baldridge 2018
Magic Leap Developers Presentation by Cliff Baldridge 2018Magic Leap Developers Presentation by Cliff Baldridge 2018
Magic Leap Developers Presentation by Cliff Baldridge 2018
Cliff Baldridge
 
VFX
VFXVFX
What is 3D Animation
What is 3D AnimationWhat is 3D Animation
What is 3D Animation
FrankLeo16
 
Exploring Generating AI with Diffusion Models
Exploring Generating AI with Diffusion ModelsExploring Generating AI with Diffusion Models
Exploring Generating AI with Diffusion Models
KonfHubTechConferenc
 
Automated Image Captioning – Model Based on CNN – GRU Architecture
Automated Image Captioning – Model Based on CNN – GRU ArchitectureAutomated Image Captioning – Model Based on CNN – GRU Architecture
Automated Image Captioning – Model Based on CNN – GRU Architecture
IRJET Journal
 
"Exploring the Potential of Apple Vision Pro: A New Era in Human-Computer Int...
"Exploring the Potential of Apple Vision Pro: A New Era in Human-Computer Int..."Exploring the Potential of Apple Vision Pro: A New Era in Human-Computer Int...
"Exploring the Potential of Apple Vision Pro: A New Era in Human-Computer Int...
AslamAslu19
 
HTML5DevConf - Unleash the power of 3D with babylon.js
HTML5DevConf - Unleash the power of 3D with babylon.jsHTML5DevConf - Unleash the power of 3D with babylon.js
HTML5DevConf - Unleash the power of 3D with babylon.js
David Catuhe
 
Develop Store Apps with Kinect for Windows v2
Develop Store Apps with Kinect for Windows v2Develop Store Apps with Kinect for Windows v2
Develop Store Apps with Kinect for Windows v2
Clemente Giorio
 
Develop store apps with kinect for windows v2
Develop store apps with kinect for windows v2Develop store apps with kinect for windows v2
Develop store apps with kinect for windows v2
Matteo Valoriani
 

Similar to FYP Presentation (20)

3D in Android
3D in Android3D in Android
3D in Android
 
Lessons Learned: Designer/Developer Productivity in Windows Presentation Foun...
Lessons Learned: Designer/Developer Productivity in Windows Presentation Foun...Lessons Learned: Designer/Developer Productivity in Windows Presentation Foun...
Lessons Learned: Designer/Developer Productivity in Windows Presentation Foun...
 
10 best animation software for windows
10 best animation software for windows10 best animation software for windows
10 best animation software for windows
 
Devraj_Nataraj_CV_PDF
Devraj_Nataraj_CV_PDFDevraj_Nataraj_CV_PDF
Devraj_Nataraj_CV_PDF
 
Sea dragon technology
Sea dragon technologySea dragon technology
Sea dragon technology
 
Fake Video Creation and Detection: A Review
Fake Video Creation and Detection: A ReviewFake Video Creation and Detection: A Review
Fake Video Creation and Detection: A Review
 
Build an AR app v2.0
Build an AR app v2.0Build an AR app v2.0
Build an AR app v2.0
 
Development of a 3d animated short film using 3d animation technique
Development of a 3d animated short film using 3d animation techniqueDevelopment of a 3d animated short film using 3d animation technique
Development of a 3d animated short film using 3d animation technique
 
Adobe after effects creative cloud cc 2017
Adobe after effects creative cloud cc 2017Adobe after effects creative cloud cc 2017
Adobe after effects creative cloud cc 2017
 
ParisfxLab - 4DViews
ParisfxLab - 4DViewsParisfxLab - 4DViews
ParisfxLab - 4DViews
 
Motion Capturing Using Computer Vision.pdf
Motion Capturing Using Computer Vision.pdfMotion Capturing Using Computer Vision.pdf
Motion Capturing Using Computer Vision.pdf
 
Magic Leap Developers Presentation by Cliff Baldridge 2018
Magic Leap Developers Presentation by Cliff Baldridge 2018Magic Leap Developers Presentation by Cliff Baldridge 2018
Magic Leap Developers Presentation by Cliff Baldridge 2018
 
VFX
VFXVFX
VFX
 
What is 3D Animation
What is 3D AnimationWhat is 3D Animation
What is 3D Animation
 
Exploring Generating AI with Diffusion Models
Exploring Generating AI with Diffusion ModelsExploring Generating AI with Diffusion Models
Exploring Generating AI with Diffusion Models
 
Automated Image Captioning – Model Based on CNN – GRU Architecture
Automated Image Captioning – Model Based on CNN – GRU ArchitectureAutomated Image Captioning – Model Based on CNN – GRU Architecture
Automated Image Captioning – Model Based on CNN – GRU Architecture
 
"Exploring the Potential of Apple Vision Pro: A New Era in Human-Computer Int...
"Exploring the Potential of Apple Vision Pro: A New Era in Human-Computer Int..."Exploring the Potential of Apple Vision Pro: A New Era in Human-Computer Int...
"Exploring the Potential of Apple Vision Pro: A New Era in Human-Computer Int...
 
HTML5DevConf - Unleash the power of 3D with babylon.js
HTML5DevConf - Unleash the power of 3D with babylon.jsHTML5DevConf - Unleash the power of 3D with babylon.js
HTML5DevConf - Unleash the power of 3D with babylon.js
 
Develop Store Apps with Kinect for Windows v2
Develop Store Apps with Kinect for Windows v2Develop Store Apps with Kinect for Windows v2
Develop Store Apps with Kinect for Windows v2
 
Develop store apps with kinect for windows v2
Develop store apps with kinect for windows v2Develop store apps with kinect for windows v2
Develop store apps with kinect for windows v2
 

Recently uploaded

Main Java[All of the Base Concepts}.docx
Main Java[All of the Base Concepts}.docxMain Java[All of the Base Concepts}.docx
Main Java[All of the Base Concepts}.docx
adhitya5119
 
ZK on Polkadot zero knowledge proofs - sub0.pptx
ZK on Polkadot zero knowledge proofs - sub0.pptxZK on Polkadot zero knowledge proofs - sub0.pptx
ZK on Polkadot zero knowledge proofs - sub0.pptx
dot55audits
 
math operations ued in python and all used
math operations ued in python and all usedmath operations ued in python and all used
math operations ued in python and all used
ssuser13ffe4
 
คำศัพท์ คำพื้นฐานการอ่าน ภาษาอังกฤษ ระดับชั้น ม.1
คำศัพท์ คำพื้นฐานการอ่าน ภาษาอังกฤษ ระดับชั้น ม.1คำศัพท์ คำพื้นฐานการอ่าน ภาษาอังกฤษ ระดับชั้น ม.1
คำศัพท์ คำพื้นฐานการอ่าน ภาษาอังกฤษ ระดับชั้น ม.1
สมใจ จันสุกสี
 
Your Skill Boost Masterclass: Strategies for Effective Upskilling
Your Skill Boost Masterclass: Strategies for Effective UpskillingYour Skill Boost Masterclass: Strategies for Effective Upskilling
Your Skill Boost Masterclass: Strategies for Effective Upskilling
Excellence Foundation for South Sudan
 
NEWSPAPERS - QUESTION 1 - REVISION POWERPOINT.pptx
NEWSPAPERS - QUESTION 1 - REVISION POWERPOINT.pptxNEWSPAPERS - QUESTION 1 - REVISION POWERPOINT.pptx
NEWSPAPERS - QUESTION 1 - REVISION POWERPOINT.pptx
iammrhaywood
 
writing about opinions about Australia the movie
writing about opinions about Australia the moviewriting about opinions about Australia the movie
writing about opinions about Australia the movie
Nicholas Montgomery
 
Traditional Musical Instruments of Arunachal Pradesh and Uttar Pradesh - RAYH...
Traditional Musical Instruments of Arunachal Pradesh and Uttar Pradesh - RAYH...Traditional Musical Instruments of Arunachal Pradesh and Uttar Pradesh - RAYH...
Traditional Musical Instruments of Arunachal Pradesh and Uttar Pradesh - RAYH...
imrankhan141184
 
Présentationvvvvvvvvvvvvvvvvvvvvvvvvvvvv2.pptx
Présentationvvvvvvvvvvvvvvvvvvvvvvvvvvvv2.pptxPrésentationvvvvvvvvvvvvvvvvvvvvvvvvvvvv2.pptx
Présentationvvvvvvvvvvvvvvvvvvvvvvvvvvvv2.pptx
siemaillard
 
Constructing Your Course Container for Effective Communication
Constructing Your Course Container for Effective CommunicationConstructing Your Course Container for Effective Communication
Constructing Your Course Container for Effective Communication
Chevonnese Chevers Whyte, MBA, B.Sc.
 
spot a liar (Haiqa 146).pptx Technical writhing and presentation skills
spot a liar (Haiqa 146).pptx Technical writhing and presentation skillsspot a liar (Haiqa 146).pptx Technical writhing and presentation skills
spot a liar (Haiqa 146).pptx Technical writhing and presentation skills
haiqairshad
 
Advanced Java[Extra Concepts, Not Difficult].docx
Advanced Java[Extra Concepts, Not Difficult].docxAdvanced Java[Extra Concepts, Not Difficult].docx
Advanced Java[Extra Concepts, Not Difficult].docx
adhitya5119
 
BBR 2024 Summer Sessions Interview Training
BBR  2024 Summer Sessions Interview TrainingBBR  2024 Summer Sessions Interview Training
BBR 2024 Summer Sessions Interview Training
Katrina Pritchard
 
The History of Stoke Newington Street Names
The History of Stoke Newington Street NamesThe History of Stoke Newington Street Names
The History of Stoke Newington Street Names
History of Stoke Newington
 
How to deliver Powerpoint Presentations.pptx
How to deliver Powerpoint  Presentations.pptxHow to deliver Powerpoint  Presentations.pptx
How to deliver Powerpoint Presentations.pptx
HajraNaeem15
 
Temple of Asclepius in Thrace. Excavation results
Temple of Asclepius in Thrace. Excavation resultsTemple of Asclepius in Thrace. Excavation results
Temple of Asclepius in Thrace. Excavation results
Krassimira Luka
 
Leveraging Generative AI to Drive Nonprofit Innovation
Leveraging Generative AI to Drive Nonprofit InnovationLeveraging Generative AI to Drive Nonprofit Innovation
Leveraging Generative AI to Drive Nonprofit Innovation
TechSoup
 
Walmart Business+ and Spark Good for Nonprofits.pdf
Walmart Business+ and Spark Good for Nonprofits.pdfWalmart Business+ and Spark Good for Nonprofits.pdf
Walmart Business+ and Spark Good for Nonprofits.pdf
TechSoup
 
Mule event processing models | MuleSoft Mysore Meetup #47
Mule event processing models | MuleSoft Mysore Meetup #47Mule event processing models | MuleSoft Mysore Meetup #47
Mule event processing models | MuleSoft Mysore Meetup #47
MysoreMuleSoftMeetup
 
Bed Making ( Introduction, Purpose, Types, Articles, Scientific principles, N...
Bed Making ( Introduction, Purpose, Types, Articles, Scientific principles, N...Bed Making ( Introduction, Purpose, Types, Articles, Scientific principles, N...
Bed Making ( Introduction, Purpose, Types, Articles, Scientific principles, N...
Leena Ghag-Sakpal
 

Recently uploaded (20)

Main Java[All of the Base Concepts}.docx
Main Java[All of the Base Concepts}.docxMain Java[All of the Base Concepts}.docx
Main Java[All of the Base Concepts}.docx
 
ZK on Polkadot zero knowledge proofs - sub0.pptx
ZK on Polkadot zero knowledge proofs - sub0.pptxZK on Polkadot zero knowledge proofs - sub0.pptx
ZK on Polkadot zero knowledge proofs - sub0.pptx
 
math operations ued in python and all used
math operations ued in python and all usedmath operations ued in python and all used
math operations ued in python and all used
 
คำศัพท์ คำพื้นฐานการอ่าน ภาษาอังกฤษ ระดับชั้น ม.1
คำศัพท์ คำพื้นฐานการอ่าน ภาษาอังกฤษ ระดับชั้น ม.1คำศัพท์ คำพื้นฐานการอ่าน ภาษาอังกฤษ ระดับชั้น ม.1
คำศัพท์ คำพื้นฐานการอ่าน ภาษาอังกฤษ ระดับชั้น ม.1
 
Your Skill Boost Masterclass: Strategies for Effective Upskilling
Your Skill Boost Masterclass: Strategies for Effective UpskillingYour Skill Boost Masterclass: Strategies for Effective Upskilling
Your Skill Boost Masterclass: Strategies for Effective Upskilling
 
NEWSPAPERS - QUESTION 1 - REVISION POWERPOINT.pptx
NEWSPAPERS - QUESTION 1 - REVISION POWERPOINT.pptxNEWSPAPERS - QUESTION 1 - REVISION POWERPOINT.pptx
NEWSPAPERS - QUESTION 1 - REVISION POWERPOINT.pptx
 
writing about opinions about Australia the movie
writing about opinions about Australia the moviewriting about opinions about Australia the movie
writing about opinions about Australia the movie
 
Traditional Musical Instruments of Arunachal Pradesh and Uttar Pradesh - RAYH...
Traditional Musical Instruments of Arunachal Pradesh and Uttar Pradesh - RAYH...Traditional Musical Instruments of Arunachal Pradesh and Uttar Pradesh - RAYH...
Traditional Musical Instruments of Arunachal Pradesh and Uttar Pradesh - RAYH...
 
Présentationvvvvvvvvvvvvvvvvvvvvvvvvvvvv2.pptx
Présentationvvvvvvvvvvvvvvvvvvvvvvvvvvvv2.pptxPrésentationvvvvvvvvvvvvvvvvvvvvvvvvvvvv2.pptx
Présentationvvvvvvvvvvvvvvvvvvvvvvvvvvvv2.pptx
 
Constructing Your Course Container for Effective Communication
Constructing Your Course Container for Effective CommunicationConstructing Your Course Container for Effective Communication
Constructing Your Course Container for Effective Communication
 
spot a liar (Haiqa 146).pptx Technical writhing and presentation skills
spot a liar (Haiqa 146).pptx Technical writhing and presentation skillsspot a liar (Haiqa 146).pptx Technical writhing and presentation skills
spot a liar (Haiqa 146).pptx Technical writhing and presentation skills
 
Advanced Java[Extra Concepts, Not Difficult].docx
Advanced Java[Extra Concepts, Not Difficult].docxAdvanced Java[Extra Concepts, Not Difficult].docx
Advanced Java[Extra Concepts, Not Difficult].docx
 
BBR 2024 Summer Sessions Interview Training
BBR  2024 Summer Sessions Interview TrainingBBR  2024 Summer Sessions Interview Training
BBR 2024 Summer Sessions Interview Training
 
The History of Stoke Newington Street Names
The History of Stoke Newington Street NamesThe History of Stoke Newington Street Names
The History of Stoke Newington Street Names
 
How to deliver Powerpoint Presentations.pptx
How to deliver Powerpoint  Presentations.pptxHow to deliver Powerpoint  Presentations.pptx
How to deliver Powerpoint Presentations.pptx
 
Temple of Asclepius in Thrace. Excavation results
Temple of Asclepius in Thrace. Excavation resultsTemple of Asclepius in Thrace. Excavation results
Temple of Asclepius in Thrace. Excavation results
 
Leveraging Generative AI to Drive Nonprofit Innovation
Leveraging Generative AI to Drive Nonprofit InnovationLeveraging Generative AI to Drive Nonprofit Innovation
Leveraging Generative AI to Drive Nonprofit Innovation
 
Walmart Business+ and Spark Good for Nonprofits.pdf
Walmart Business+ and Spark Good for Nonprofits.pdfWalmart Business+ and Spark Good for Nonprofits.pdf
Walmart Business+ and Spark Good for Nonprofits.pdf
 
Mule event processing models | MuleSoft Mysore Meetup #47
Mule event processing models | MuleSoft Mysore Meetup #47Mule event processing models | MuleSoft Mysore Meetup #47
Mule event processing models | MuleSoft Mysore Meetup #47
 
Bed Making ( Introduction, Purpose, Types, Articles, Scientific principles, N...
Bed Making ( Introduction, Purpose, Types, Articles, Scientific principles, N...Bed Making ( Introduction, Purpose, Types, Articles, Scientific principles, N...
Bed Making ( Introduction, Purpose, Types, Articles, Scientific principles, N...
 

FYP Presentation

  • 1. STORY VISUALIZATION USING DEEP LEARNING Members: Hassan Latif 2017152 Adil Jamal 2017228 Rehana Saboor 2017387 Saad Iqbal 2017395 Supervisor: Mr. Badre Munir Co-supervisor: Dr. Muhammad Hanif
  • 2. Motivation  Creating images and videos from text is a challenging task even for modern-day AI.  Usually the images and videos are created pixel by pixel and this approach lacks clarity in images/videos and also the duration of videos is too short.  To improve upon the generation of videos from text, we decided to use artificial intelligence and augmented reality to create 3D scenes of story books.  Existing Systems: There are a number of storytelling apps developed using augmented reality and virtual reality:  Quiver Augmented Reality  Website: https://quivervision.com/  Wonderscope  Website: https://wonderscope.com/
  • 3. Introduction  Our project implements an Android application that uses Artificial Intelligence and Augmented Reality to visualize a story.  We visualize some paragraphs of descriptive comprehensions, primarily stories in 3D geometric comprehensions.  Each scene contains background information and, characters and their actions.
  • 4. Description  An image is captured, containing a few paragraphs of a storybook. The text is extracted from the image which is then fed to BERT models.  BERT models are natural language processing models that extract scene-specific information i.e. the background, the characters, and the actions of these characters present in the scene.  This information is then sent to Unity, which generates 3D backgrounds and characters.  All of this information is further passed on to Vuforia for visualization in Augmented Reality.
  • 5. 3 Run inference and create scene 1 Combine background, character and actions. 2 Database Take a picture Project Flow Background Character Action
  • 6. Technologies  Our project is an Android application so we have utilized Android development tools.  TensorFlow is used to train BERT models.  Characters and animations were downloaded from mixamo.com.  Unity is used to generate 3D scenes.  Vuforia is used to visualize the scenes in AR.  Flask and Firebase are used to implement the backend.
  • 7. Functional Requirements  User Sign Up.  User Log In.  Select an image from stored images in file manager or capture a new image.  Save the scene (the text of the scene).  Delete saved scene.  Choose different voices to narrate the text of the scene.  Visualize the scene on a surface.  Change scale of scene.  Change characters in the scene (if any).  Change background of the scene.  Delete saved scene (text description of the scene).  Logout from the application.