How to Remove Document Management Hurdles with X-Docs?
South australian machine learning seminar series talk 4 25 august 2016
1. South Australian Machine Learning Seminar Series
Abstract
The fields of natural language processing (NLP) and computer
vision (CV) have seen great advances in their respective goals
of analysing and generating text, and of understanding images
and videos. While both fields share a similar set of methods
rooted in artificial intelligence and machine learning, they have
historically developed separately. Recent years, however, have
seen an upsurge of interest in problems that require
combination of linguistic and visual information. For example,
Image Captioning and Visual Question Answering (VQA) are
two important research topics in this area. Image captioning
requires the machine to describe the image using human
readable sentences while the VQA asks a machine to answer
language-based questions based on the visual information. In
this talk we outline some of the most recent progresses,
present some theories and techniques for these two Vision-to-
Language tasks, and show a live demo of the image
captioning.
About the Speaker
Dr Qi Wu obtained a BSc in Information and Computing
Science from the China Jiliang University (China), and an MSc
in Global Computing and Media Technology, a PhD in
Computer Science from the University of Bath (United
Kingdom). He is currently a Senior Research Associate in the
Australia Centre for Visual Technology (ACVT) in the University
of Adelaide, Australia.
Dr Qi Wu joined the ACVT in 2015 and started to work on the
Vision-to-Language problems. He is especially interested in
the problem of Image Captioning and Visual Question
Answering. He has two papers accepted in the CVPR 2016, all
about these two topics. His image captioning model produced
the best result in the Microsoft COCO Image Captioning
Challenges in the last year and his VQA model is the current
state-of-the-art in the area.
Speaker:
Dr Qi Wu
Senior Research Associate-
Australia Centre for Visual
Technology (ACVT), University
of Adelaide, Australia
Date:
25 August 2016
Time:
12:10 to 13:00
Followed by pizza!
Location:
Uni. Adelaide
Lower Napier Bldg
LG28 Lecture Theatre
Seating is limited:
RSVP on eventbrite
https://visualquestionanswering.even
tbrite.com.au
More info:
mark.mcdonnell@unisa.edu.au
sebastien.wong@dsto.defence.gov.au
The IEEE Computer Society &
IEEE Signal Processing Society present