SlideShare a Scribd company logo
1 of 1
Download to read offline
South Australian Machine Learning Seminar Series
Abstract
The fields of natural language processing (NLP) and computer
vision (CV) have seen great advances in their respective goals
of analysing and generating text, and of understanding images
and videos. While both fields share a similar set of methods
rooted in artificial intelligence and machine learning, they have
historically developed separately. Recent years, however, have
seen an upsurge of interest in problems that require
combination of linguistic and visual information. For example,
Image Captioning and Visual Question Answering (VQA) are
two important research topics in this area. Image captioning
requires the machine to describe the image using human
readable sentences while the VQA asks a machine to answer
language-based questions based on the visual information. In
this talk we outline some of the most recent progresses,
present some theories and techniques for these two Vision-to-
Language tasks, and show a live demo of the image
captioning.
About the Speaker
Dr Qi Wu obtained a BSc in Information and Computing
Science from the China Jiliang University (China), and an MSc
in Global Computing and Media Technology, a PhD in
Computer Science from the University of Bath (United
Kingdom). He is currently a Senior Research Associate in the
Australia Centre for Visual Technology (ACVT) in the University
of Adelaide, Australia.
Dr Qi Wu joined the ACVT in 2015 and started to work on the
Vision-to-Language problems. He is especially interested in
the problem of Image Captioning and Visual Question
Answering. He has two papers accepted in the CVPR 2016, all
about these two topics. His image captioning model produced
the best result in the Microsoft COCO Image Captioning
Challenges in the last year and his VQA model is the current
state-of-the-art in the area.
Speaker:
Dr Qi Wu
Senior Research Associate-
Australia Centre for Visual
Technology (ACVT), University
of Adelaide, Australia
Date:
25 August 2016
Time:
12:10 to 13:00
Followed by pizza!
Location:
Uni. Adelaide
Lower Napier Bldg
LG28 Lecture Theatre
Seating is limited:
RSVP on eventbrite
https://visualquestionanswering.even
tbrite.com.au
More info:
mark.mcdonnell@unisa.edu.au
sebastien.wong@dsto.defence.gov.au
The IEEE Computer Society &
IEEE Signal Processing Society present

More Related Content

What's hot

Human and Technological Dimensions of Making in FabLab
Human and Technological Dimensions of Making in FabLabHuman and Technological Dimensions of Making in FabLab
Human and Technological Dimensions of Making in FabLabIván Sánchez Milara
 
Globagile 2011: Global Software Engineering for Agile Teams
Globagile 2011: Global Software Engineering for Agile TeamsGlobagile 2011: Global Software Engineering for Agile Teams
Globagile 2011: Global Software Engineering for Agile TeamsPUCRS University
 
Btsn dt-techpresentation2012
Btsn dt-techpresentation2012Btsn dt-techpresentation2012
Btsn dt-techpresentation2012Julie Lemley
 
Btsn DT Presentation
Btsn DT PresentationBtsn DT Presentation
Btsn DT PresentationJulie Lemley
 
BTSN Presentation - MYP Technology
BTSN Presentation - MYP TechnologyBTSN Presentation - MYP Technology
BTSN Presentation - MYP TechnologyJulie Lemley
 

What's hot (7)

Human and Technological Dimensions of Making in FabLab
Human and Technological Dimensions of Making in FabLabHuman and Technological Dimensions of Making in FabLab
Human and Technological Dimensions of Making in FabLab
 
Globagile 2011: Global Software Engineering for Agile Teams
Globagile 2011: Global Software Engineering for Agile TeamsGlobagile 2011: Global Software Engineering for Agile Teams
Globagile 2011: Global Software Engineering for Agile Teams
 
Btsn dt-techpresentation2012
Btsn dt-techpresentation2012Btsn dt-techpresentation2012
Btsn dt-techpresentation2012
 
Btsn DT Presentation
Btsn DT PresentationBtsn DT Presentation
Btsn DT Presentation
 
BTSN Presentation - MYP Technology
BTSN Presentation - MYP TechnologyBTSN Presentation - MYP Technology
BTSN Presentation - MYP Technology
 
0133971988 pp1
0133971988 pp10133971988 pp1
0133971988 pp1
 
Debugging 2013- Klaus kolle
Debugging 2013- Klaus kolleDebugging 2013- Klaus kolle
Debugging 2013- Klaus kolle
 

Similar to South australian machine learning seminar series talk 4 25 august 2016

Jia-Bin Huang's Curriculum Vitae
Jia-Bin Huang's Curriculum VitaeJia-Bin Huang's Curriculum Vitae
Jia-Bin Huang's Curriculum VitaeJia-Bin Huang
 
The power of animation and video in transforming student learning
The power of animation and video in transforming student learningThe power of animation and video in transforming student learning
The power of animation and video in transforming student learningJennifer Keenahan
 
CVLinkedIn
CVLinkedInCVLinkedIn
CVLinkedInJun Ma
 
KenNewman_CV2015
KenNewman_CV2015KenNewman_CV2015
KenNewman_CV2015Ken Newman
 
EXPLORING AUDIO-TACTILE DESIGN APPROACHES IN CREATING A HOME-AWAY-FROM-HOME F...
EXPLORING AUDIO-TACTILE DESIGN APPROACHES IN CREATING A HOME-AWAY-FROM-HOME F...EXPLORING AUDIO-TACTILE DESIGN APPROACHES IN CREATING A HOME-AWAY-FROM-HOME F...
EXPLORING AUDIO-TACTILE DESIGN APPROACHES IN CREATING A HOME-AWAY-FROM-HOME F...DamilareOG
 
Teaching & Reserach Profile
Teaching & Reserach ProfileTeaching & Reserach Profile
Teaching & Reserach ProfileMitch Goodwin
 
Kadir A_20160804_res_tea
Kadir A_20160804_res_teaKadir A_20160804_res_tea
Kadir A_20160804_res_teaKadir A Peker
 
Maino & Goodfellow: Educator, Preceptor, Researcher, Author and ...Techno-Geek
Maino & Goodfellow: Educator, Preceptor,  Researcher, Author and ...Techno-GeekMaino & Goodfellow: Educator, Preceptor,  Researcher, Author and ...Techno-Geek
Maino & Goodfellow: Educator, Preceptor, Researcher, Author and ...Techno-GeekDominick Maino
 
Developing critique and academic argument in a blended-learning data visual...
Developing critique and academic argument in a blended-learning data visual...Developing critique and academic argument in a blended-learning data visual...
Developing critique and academic argument in a blended-learning data visual...Cape Peninsula University of Technology
 
Seminar on KM for Business
Seminar on KM for BusinessSeminar on KM for Business
Seminar on KM for Business2016
 
VIDEO OBJECTS DESCRIPTION IN HINDI TEXT LANGUAGE
VIDEO OBJECTS DESCRIPTION IN HINDI TEXT LANGUAGE VIDEO OBJECTS DESCRIPTION IN HINDI TEXT LANGUAGE
VIDEO OBJECTS DESCRIPTION IN HINDI TEXT LANGUAGE ijmpict
 
Autobiography
AutobiographyAutobiography
AutobiographyRick Hsu
 
2022_Adrian_Adascalitei25EN.ppsx
2022_Adrian_Adascalitei25EN.ppsx2022_Adrian_Adascalitei25EN.ppsx
2022_Adrian_Adascalitei25EN.ppsxAdrian Adascalitei
 

Similar to South australian machine learning seminar series talk 4 25 august 2016 (20)

Jia-Bin Huang's Curriculum Vitae
Jia-Bin Huang's Curriculum VitaeJia-Bin Huang's Curriculum Vitae
Jia-Bin Huang's Curriculum Vitae
 
The power of animation and video in transforming student learning
The power of animation and video in transforming student learningThe power of animation and video in transforming student learning
The power of animation and video in transforming student learning
 
CVLinkedIn
CVLinkedInCVLinkedIn
CVLinkedIn
 
KenNewman_CV2015
KenNewman_CV2015KenNewman_CV2015
KenNewman_CV2015
 
EXPLORING AUDIO-TACTILE DESIGN APPROACHES IN CREATING A HOME-AWAY-FROM-HOME F...
EXPLORING AUDIO-TACTILE DESIGN APPROACHES IN CREATING A HOME-AWAY-FROM-HOME F...EXPLORING AUDIO-TACTILE DESIGN APPROACHES IN CREATING A HOME-AWAY-FROM-HOME F...
EXPLORING AUDIO-TACTILE DESIGN APPROACHES IN CREATING A HOME-AWAY-FROM-HOME F...
 
Teaching & Reserach Profile
Teaching & Reserach ProfileTeaching & Reserach Profile
Teaching & Reserach Profile
 
PS-Science
PS-SciencePS-Science
PS-Science
 
PS-Science
PS-SciencePS-Science
PS-Science
 
Program_Canvas_Final
Program_Canvas_FinalProgram_Canvas_Final
Program_Canvas_Final
 
cv_10
cv_10cv_10
cv_10
 
Short intro: Chung-Ching Huang
Short intro: Chung-Ching HuangShort intro: Chung-Ching Huang
Short intro: Chung-Ching Huang
 
Kadir A_20160804_res_tea
Kadir A_20160804_res_teaKadir A_20160804_res_tea
Kadir A_20160804_res_tea
 
Muath resume
Muath resumeMuath resume
Muath resume
 
Maino & Goodfellow: Educator, Preceptor, Researcher, Author and ...Techno-Geek
Maino & Goodfellow: Educator, Preceptor,  Researcher, Author and ...Techno-GeekMaino & Goodfellow: Educator, Preceptor,  Researcher, Author and ...Techno-Geek
Maino & Goodfellow: Educator, Preceptor, Researcher, Author and ...Techno-Geek
 
Developing critique and academic argument in a blended-learning data visual...
Developing critique and academic argument in a blended-learning data visual...Developing critique and academic argument in a blended-learning data visual...
Developing critique and academic argument in a blended-learning data visual...
 
Seminar on KM for Business
Seminar on KM for BusinessSeminar on KM for Business
Seminar on KM for Business
 
VIDEO OBJECTS DESCRIPTION IN HINDI TEXT LANGUAGE
VIDEO OBJECTS DESCRIPTION IN HINDI TEXT LANGUAGE VIDEO OBJECTS DESCRIPTION IN HINDI TEXT LANGUAGE
VIDEO OBJECTS DESCRIPTION IN HINDI TEXT LANGUAGE
 
Autobiography
AutobiographyAutobiography
Autobiography
 
2022_Adrian_Adascalitei25EN.ppsx
2022_Adrian_Adascalitei25EN.ppsx2022_Adrian_Adascalitei25EN.ppsx
2022_Adrian_Adascalitei25EN.ppsx
 
KendiMuchungiJan2016
KendiMuchungiJan2016KendiMuchungiJan2016
KendiMuchungiJan2016
 

Recently uploaded

[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAndikSusilo4
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?XfilesPro
 

Recently uploaded (20)

[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & Application
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?
 

South australian machine learning seminar series talk 4 25 august 2016

  • 1. South Australian Machine Learning Seminar Series Abstract The fields of natural language processing (NLP) and computer vision (CV) have seen great advances in their respective goals of analysing and generating text, and of understanding images and videos. While both fields share a similar set of methods rooted in artificial intelligence and machine learning, they have historically developed separately. Recent years, however, have seen an upsurge of interest in problems that require combination of linguistic and visual information. For example, Image Captioning and Visual Question Answering (VQA) are two important research topics in this area. Image captioning requires the machine to describe the image using human readable sentences while the VQA asks a machine to answer language-based questions based on the visual information. In this talk we outline some of the most recent progresses, present some theories and techniques for these two Vision-to- Language tasks, and show a live demo of the image captioning. About the Speaker Dr Qi Wu obtained a BSc in Information and Computing Science from the China Jiliang University (China), and an MSc in Global Computing and Media Technology, a PhD in Computer Science from the University of Bath (United Kingdom). He is currently a Senior Research Associate in the Australia Centre for Visual Technology (ACVT) in the University of Adelaide, Australia. Dr Qi Wu joined the ACVT in 2015 and started to work on the Vision-to-Language problems. He is especially interested in the problem of Image Captioning and Visual Question Answering. He has two papers accepted in the CVPR 2016, all about these two topics. His image captioning model produced the best result in the Microsoft COCO Image Captioning Challenges in the last year and his VQA model is the current state-of-the-art in the area. Speaker: Dr Qi Wu Senior Research Associate- Australia Centre for Visual Technology (ACVT), University of Adelaide, Australia Date: 25 August 2016 Time: 12:10 to 13:00 Followed by pizza! Location: Uni. Adelaide Lower Napier Bldg LG28 Lecture Theatre Seating is limited: RSVP on eventbrite https://visualquestionanswering.even tbrite.com.au More info: mark.mcdonnell@unisa.edu.au sebastien.wong@dsto.defence.gov.au The IEEE Computer Society & IEEE Signal Processing Society present