Personal Information
Organization / Workplace
Barcelona Area, Spain Spain
Industry
Technology / Software / Internet
About
Xavier Giro-i-Nieto is an assistant professor at the Universitat Politecnica de Catalunya (UPC). He graduated in Electrical Engineering studies at ETSETB (UPC) in 2000, after completing his master thesis on image compression at the Vrije Universiteit in Brussels (VUB) under the direction of Professor Peter Schelkens. In 2001 he worked in the digital television group of Sony Brussels, before returning to Barcelona and joining the Image Processing Group at the UPC. In 2003, he started teaching courses in Electrical Engineering degress at the EET and ETSETB schools from UPC. He obtained his Phd on image retrieval in 2012, under the supervision by Professor Ferran Marques from UPC and Profess...
Tags
deep learning
computer vision
recurrent neural networks
convolutional neural networks
visual saliency
video processing
object detection
generative adversarial networks
unsupervised learning
natural language processing
neural networks
video retrieval
video
generative models
medical imaging
retrieval
multimedia
audio
multimodal deep learning
attention models
visual question answering
artificial intelligence
image classification
self-supervised learning
reinforcement learning
imagenet
machine learning
semantic segmentation
video summarization
image processing
neural machine translation
perceptron
gan
instance segmentation
instance retrieval
visualization
image segmentation
object tracking
speech recognition
speech synthesis
backpropagation
adversarial training
transfer learning
lifelogging
clustering
eeg
figure-ground segmentation
sign language
interpretability
optimization
speech
dqn
deep belief network
architectures
affective computing
object candidates
upc
video object segmentation
image retrieval
cross-modal learning
policy
image captioning
lifelong learning
vision
object segmentation
eye fixation
wavenet
nlp
training
restricted boltzmann machine
variational autoencoder
face recognition
domain adaptation
tensorflow
egocentric vision
video analysis
spatial transformer
wearable cameras
user interaction
broadcasters archive
search
mediaeval
saliency
crowdsourcing
human computing
deep neural networks
gradient descent
moderation
memes
hate speech
ai hype
annotations
3d reconstruction
gnn
graph neural networks
visual scanpath
pixelcnn
vae
loss function
skip rnn
multilayer perceptron
incremental learning
lstm
teaching
dataset
optical flow
nmt
speaker identification
inception
resnet
word embeddings
autoencoder
rbm
barcelonatech
person retrieval
face detection
hpc
keras
backward propagation
activity locatization
ranking
3d convolution
cnn
wearables
diversity
event recognition
sentiment prediction
search engine
classification
video indexing
video annotation
barcelona
reranking
google web toolkit
indexing
social
event detection
hierarchical partitions
surf
computer
brain
segmentation
nist
trecvid
visual grounding
minecraft
xai
explainable ai
representation learning
seq2seq
q-learning
panoptic segmentation
autonomous driving
moco
perpcetron
ai for social good
mild cognitive impairment
dementia
visual dialog
davis
rvos
mattnet
referring expressions
chain rule
computational graph
social networks
resnext
automl
vgg
neural architectures
normalizing flows
catastrophic forgetting
incrmental learning
policy gradients
value function
value
markov decision proccess
cloud computing
sgd
adam
ealy stopping
mini-batch
batch normalization
cross-entropy
mlp
point clouds
3d analysis
set learning
biometrics
video segmentation
visual localization
geometry
local
iclr2018
dynamic computation
self-learning
lipreading
captioning
motion estimation
action detection
action classification
action recognition
rework
adaptive computation time
pixelrnn
dbn
methodology
error function
supercomputing
gru
lip reading
softmax
logistic regression
linear regression
active learning
interestingness
higher education
rgbd
multiview
3d images
depth
joint embeddings
software
t-sne
epoch
batch
visual reasoning
astronomy
space
ethics
language
cbir
deep
remote sensing
activity recognition
soundnet
sonorization
language model
googlenet
skip connections
deep q-network
network in network
densenet
nin
skip thought
word2vec
data partition
vanishing gradient
relu
catalunya
catalonia
cgan
colorization
location retrieval
coclustering
search engine optimization
data augmentation
computing
theano
software development
caption
natural language
memorability
eye tracker
attributes
college
outreach
aprenentatge automàtic
inteligència artificial
robots
convnet
open source
alzheimer
narrative
object
endoscopy
rapid serial visual presentation
electroencephalography
brain-computer interfaces
ccma
relevance feedback
lifeblogging
python
social event photo clustering
instance search
visual descriptors
pattern recognition
algorithm
email
image
nearest neighbor
bundling interest points
digital images
images
web
coding
3d
streaming media
iphone
http
mysql
linux
ios
crowdmm
acmmm
etsetb
telecom
mobilitat
erasmus
javascript
web toolkit
wt
c++
html
web service
web interface
semantic shots
image edge detection
image representation
professional documentalists
video signal processing
semimanual solution
automatic keyframe selection
companies
broadcasting
single representative keyframe
algorithm design and analysis
multimedia communication
mutual reinforcement algorithm
mediavela
workshop
pixable
time series
columbia
regions
phd thesis
hyperlinking
bag of features
signal
bci
interface
twitter
television
interactive
microblogging
tv
labeling game
See more
Presentations
(297)
See all
Likes
(2)
Visual Information Retrieval: Advances, Challenges and Opportunities
Oge Marques
•
7 years ago
Presentations
(297)
See all
Likes
(2)
Visual Information Retrieval: Advances, Challenges and Opportunities
Oge Marques
•
7 years ago
Personal Information
Organization / Workplace
Barcelona Area, Spain Spain
Industry
Technology / Software / Internet
About
Xavier Giro-i-Nieto is an assistant professor at the Universitat Politecnica de Catalunya (UPC). He graduated in Electrical Engineering studies at ETSETB (UPC) in 2000, after completing his master thesis on image compression at the Vrije Universiteit in Brussels (VUB) under the direction of Professor Peter Schelkens. In 2001 he worked in the digital television group of Sony Brussels, before returning to Barcelona and joining the Image Processing Group at the UPC. In 2003, he started teaching courses in Electrical Engineering degress at the EET and ETSETB schools from UPC. He obtained his Phd on image retrieval in 2012, under the supervision by Professor Ferran Marques from UPC and Profess...
Tags
deep learning
computer vision
recurrent neural networks
convolutional neural networks
visual saliency
video processing
object detection
generative adversarial networks
unsupervised learning
natural language processing
neural networks
video retrieval
video
generative models
medical imaging
retrieval
multimedia
audio
multimodal deep learning
attention models
visual question answering
artificial intelligence
image classification
self-supervised learning
reinforcement learning
imagenet
machine learning
semantic segmentation
video summarization
image processing
neural machine translation
perceptron
gan
instance segmentation
instance retrieval
visualization
image segmentation
object tracking
speech recognition
speech synthesis
backpropagation
adversarial training
transfer learning
lifelogging
clustering
eeg
figure-ground segmentation
sign language
interpretability
optimization
speech
dqn
deep belief network
architectures
affective computing
object candidates
upc
video object segmentation
image retrieval
cross-modal learning
policy
image captioning
lifelong learning
vision
object segmentation
eye fixation
wavenet
nlp
training
restricted boltzmann machine
variational autoencoder
face recognition
domain adaptation
tensorflow
egocentric vision
video analysis
spatial transformer
wearable cameras
user interaction
broadcasters archive
search
mediaeval
saliency
crowdsourcing
human computing
deep neural networks
gradient descent
moderation
memes
hate speech
ai hype
annotations
3d reconstruction
gnn
graph neural networks
visual scanpath
pixelcnn
vae
loss function
skip rnn
multilayer perceptron
incremental learning
lstm
teaching
dataset
optical flow
nmt
speaker identification
inception
resnet
word embeddings
autoencoder
rbm
barcelonatech
person retrieval
face detection
hpc
keras
backward propagation
activity locatization
ranking
3d convolution
cnn
wearables
diversity
event recognition
sentiment prediction
search engine
classification
video indexing
video annotation
barcelona
reranking
google web toolkit
indexing
social
event detection
hierarchical partitions
surf
computer
brain
segmentation
nist
trecvid
visual grounding
minecraft
xai
explainable ai
representation learning
seq2seq
q-learning
panoptic segmentation
autonomous driving
moco
perpcetron
ai for social good
mild cognitive impairment
dementia
visual dialog
davis
rvos
mattnet
referring expressions
chain rule
computational graph
social networks
resnext
automl
vgg
neural architectures
normalizing flows
catastrophic forgetting
incrmental learning
policy gradients
value function
value
markov decision proccess
cloud computing
sgd
adam
ealy stopping
mini-batch
batch normalization
cross-entropy
mlp
point clouds
3d analysis
set learning
biometrics
video segmentation
visual localization
geometry
local
iclr2018
dynamic computation
self-learning
lipreading
captioning
motion estimation
action detection
action classification
action recognition
rework
adaptive computation time
pixelrnn
dbn
methodology
error function
supercomputing
gru
lip reading
softmax
logistic regression
linear regression
active learning
interestingness
higher education
rgbd
multiview
3d images
depth
joint embeddings
software
t-sne
epoch
batch
visual reasoning
astronomy
space
ethics
language
cbir
deep
remote sensing
activity recognition
soundnet
sonorization
language model
googlenet
skip connections
deep q-network
network in network
densenet
nin
skip thought
word2vec
data partition
vanishing gradient
relu
catalunya
catalonia
cgan
colorization
location retrieval
coclustering
search engine optimization
data augmentation
computing
theano
software development
caption
natural language
memorability
eye tracker
attributes
college
outreach
aprenentatge automàtic
inteligència artificial
robots
convnet
open source
alzheimer
narrative
object
endoscopy
rapid serial visual presentation
electroencephalography
brain-computer interfaces
ccma
relevance feedback
lifeblogging
python
social event photo clustering
instance search
visual descriptors
pattern recognition
algorithm
email
image
nearest neighbor
bundling interest points
digital images
images
web
coding
3d
streaming media
iphone
http
mysql
linux
ios
crowdmm
acmmm
etsetb
telecom
mobilitat
erasmus
javascript
web toolkit
wt
c++
html
web service
web interface
semantic shots
image edge detection
image representation
professional documentalists
video signal processing
semimanual solution
automatic keyframe selection
companies
broadcasting
single representative keyframe
algorithm design and analysis
multimedia communication
mutual reinforcement algorithm
mediavela
workshop
pixable
time series
columbia
regions
phd thesis
hyperlinking
bag of features
signal
bci
interface
twitter
television
interactive
microblogging
tv
labeling game
See more