cvpr2009: class specific hough forest for object detection

•

2 likes•1,101 views

This document presents class-specific Hough forests, a method for object detection that combines spatial information from object parts with class information during learning. The method trains random forests to learn the relationship between image patches and their spatial position relative to the object center. At detection time, the forests cast class-specific votes in 3D or 4D space (x,y,scale (or x,y,scale,ratio)) that are accumulated to detect objects. The method achieves state-of-the-art results on several datasets and offers advantages over related Hough-based methods such as combining spatial and class information during learning.

Education

Class-Specific Hough Forests
for Object Detection
Juergen Gall1 and Victor Lempitsky2

1BIWI,ETH Zurich
1Max-Planck-Institute for Informatics
2Microsoft Research Cambridge

Motivation

Parts of an object provide useful
spatial information
Classification of object parts
(foreground/background)
Combine spatial information and
class information during learning

Related Work

Explicit model of object: Detect parts → Assemble parts
together (e.g. Pictorial Structures)
Implicit model of object: Learn relation of parts
Codebook based on appearance (e.g. Leibe et al. IJCV’08)
Codebook based on appearance and spatial information
(Opelt et al. IJCV’08; Shotton et al. PAMI’08)
Grid-based classifier for object parts (Winn and Shotton
CVPR’06)
Class-specific Hough forest: Generalized Hough transform
within Random forest framework (Breiman ML’01)

Random Forest

Image patch:

Binary tests:

Binary tests are selected during
training from a random subset of
all binary tests

Training

Training set:

Class information: ci (class label)
Spatial information: di (relative position to object center)

Binary Tests Selection

Test with optimal split:

Class-label uncertainty:

Offset uncertainty:

Interleaved: Type of uncertainty is randomly selected for
each node

Leaves

{Pi ∈ L : ci = 1}{Pi ∈ A : ci = 0}
Class probability: C L =
{Pi ∈ L : ci = 0}{Pi ∈ A : ci = 1} + {Pi ∈ L : ci = 1}{Pi ∈ A : ci = 0}

Spatial probability

For location x and given image patch I(y) and tree T

Over all trees:

Accumulation over all image patches:

Multi-Scale and Multi-Ratio

Multi Scale: 3D Votes (x, y, scale)

Multi-Ratio: 4D Votes (x, y, scale, ratio)

UIUC Cars - Multi Scale

Wrong (EER)

Correct

Recenter

Object’s center ≠ Centre of bounding box
Split training data → Estimate centers iteratively

Summary

Superior to previous methods using related techniques
State-of-the-art for several datasets
Advantages over related Hough-based methods:
Combine spatial information and class information
No sparse features like SIFT
GPU → real-time performance is feasible
Large and high-dimensional datasets
Bounding box-annotated training data is sufficient
Focus: Get strong signal → Improve Detection
2-class problem → Multi-class problem

Thank you for your attention.

The major part of the research project was undertaken when Juergen Gall was
an intern with Microsoft Research Cambridge. The advice from Toby Sharp,
Jamie Shotton, and other members of the Computer Vision Group at MSRC is
gratefully acknowledged. We would also like to thank all the researchers, who
have collected and published the datasets we used for the evaluation.

Slide for study session given by Dr. Daisuke Sato at Arithmer inc. It is a summary of methods for semantic segmentation for 3D pointcloud using 2D weakly-supervised learning. Arithmer株式会社は東京大学大学院数理科学研究科発の数学の会社です。私達は現代数学を応用して、様々な分野のソリューションに、新しい高度AIシステムを導入しています。AIをいかに上手に使って仕事を効率化するか、そして人々の役に立つ結果を生み出すのか、それを考えるのが私たちの仕事です。 Arithmer began at the University of Tokyo Graduate School of Mathematical Sciences. Today, our research of modern mathematics and AI systems has the capability of providing solutions when dealing with tough complex issues. At Arithmer we believe it is our job to realize the functions of AI through improving work efficiency and producing more useful results for society.

Summary of survey papers on deep learning method to 3D data

Arithmer Inc.

Slide for study session given by Dr. Takashi Nakano (Arithmer inc.) at Arithmer inc. It is a summary of recent survey papers on deep learning method to 3D data. Arithmer株式会社は東京大学大学院数理科学研究科発の数学の会社です。私達は現代数学を応用して、様々な分野のソリューションに、新しい高度AIシステムを導入しています。AIをいかに上手に使って仕事を効率化するか、そして人々の役に立つ結果を生み出すのか、それを考えるのが私たちの仕事です。 Arithmer began at the University of Tokyo Graduate School of Mathematical Sciences. Today, our research of modern mathematics and AI systems has the capability of providing solutions when dealing with tough complex issues. At Arithmer we believe it is our job to realize the functions of AI through improving work efficiency and producing more useful results for society.

Centernet

Arithmer Inc.

Slide for study session given by Christian Saravia at Arithmer inc. It is a summary of recent method for object detection, centernet. Arithmer株式会社は東京大学大学院数理科学研究科発の数学の会社です。私達は現代数学を応用して、様々な分野のソリューションに、新しい高度AIシステムを導入しています。AIをいかに上手に使って仕事を効率化するか、そして人々の役に立つ結果を生み出すのか、それを考えるのが私たちの仕事です。 Arithmer began at the University of Tokyo Graduate School of Mathematical Sciences. Today, our research of modern mathematics and AI systems has the capability of providing solutions when dealing with tough complex issues. At Arithmer we believe it is our job to realize the functions of AI through improving work efficiency and producing more useful results for society.

Shai Avidan's Support vector tracking and ensemble tracking

wolf

This document summarizes two object tracking algorithms: Support Vector Tracking (SVT) and Ensemble Tracking. SVT uses support vector machines to classify pixels as object or background and finds the maximum scoring bounding rectangle. Ensemble Tracking trains an ensemble of weak classifiers over time to distinguish the object from background and outputs a confidence map, then uses mean shift to locate the object. Both algorithms use multiple resolutions and can handle challenges like occlusion and camera motion.

Introduction to object detection

Brodmann17

YOLACT

Arithmer Inc.

Slide for study session given by Dr. Enrico Rinaldi at Arithmer inc. It is a summary of recent methods for real-time instance segmentation "YOLACT", which is especially useful in robotics. Arithmer株式会社は東京大学大学院数理科学研究科発の数学の会社です。私達は現代数学を応用して、様々な分野のソリューションに、新しい高度AIシステムを導入しています。AIをいかに上手に使って仕事を効率化するか、そして人々の役に立つ結果を生み出すのか、それを考えるのが私たちの仕事です。 Arithmer began at the University of Tokyo Graduate School of Mathematical Sciences. Today, our research of modern mathematics and AI systems has the capability of providing solutions when dealing with tough complex issues. At Arithmer we believe it is our job to realize the functions of AI through improving work efficiency and producing more useful results for society.

Deep learning based object detection basics

Brodmann17

The document discusses different approaches to object detection in images using deep learning. It begins with describing detection as classification, where an image is classified into categories for what objects are present. It then discusses approaches that involve separating detection into a classification head and localization head. The document also covers improvements like R-CNN which uses region proposals to first generate candidate object regions before running classification and bounding box regression on those regions using CNN features. This helps address issues with previous approaches like being too slow when running the CNN over the entire image at multiple locations and scales.

Object detection - RCNNs vs Retinanet

Rishabh Indoria

Support vector machines learn hyperplanes that maximize the margin between two classes of data points. They introduce slack variables to handle non-linearly separable data, trying to maximize margins while minimizing errors. Popular SVMs use kernel functions to map data into higher dimensions, finding good linear separators in this space. SVMs find optimal hyperplanes by solving a convex optimization problem, but can be slow for large datasets. SVMs generally achieve high accuracy compared to other methods.

Fast Non-Uniform Filtering with Symmetric Weighted Integral Images

davidmarimon

Oral presentation at IEEE International Conference on Image Processing (ICIP), Hong Kong, September 2010. Abstract: Non-uniform filters are frequently used in many image processing applications to describe regions or to detect specific features. However, non-uniform filtering is a computationally complex task. This paper presents a method to perform fast non-uniform filtering using a reduced number of memory accesses. The idea is based on integral images which are commonly used for box or Haar wavelet filtering. The disadvantage of those filters for several applications is their uniform shape. We describe a method to build Symmetric Weighted Integral Images that are tailored for a variety of kernels and the process to perform fast filtering with them. We show a relevant speedup when compared to Kernel Integral Images and large when compared to conventional non-uniform filtering by reducing the computational complexity.

Deep image retrieval - learning global representations for image search - ub ...

Universitat de Barcelona

This document summarizes a research paper on deep image retrieval using global image representations. It presents three key ideas: 1) A siamese network trained with a triplet loss to learn image representations optimized for retrieval. 2) Replacing rigid region grids with a region proposal network to localize regions of interest. 3) Experiments showing their method outperforms classification features and achieves state-of-the-art results on standard retrieval datasets. Their work demonstrates an effective and scalable approach to image retrieval based on learning compact global image signatures.

Deformable DETR Review [CDM]

Dongmin Choi

Convolutional Patch Representations for Image Retrieval An unsupervised approach

Universitat de Barcelona

1. The document presents an unsupervised approach using convolutional neural networks to generate patch-level descriptors for image retrieval. 2. It trains a convolutional kernel network on unlabeled image patches to learn feature representations in a kernel space without requiring manual labels. 3. Experiments show the convolutional kernel descriptors achieve similar or better performance than supervised convolutional neural networks on standard patch and image retrieval datasets while requiring less training time.

Deep Learning for Computer Vision: Unsupervised Learning (UPC 2016)

Universitat Politècnica de Catalunya

http://imatge-upc.github.io/telecombcn-2016-dlcv/ Deep learning technologies are at the core of the current revolution in artificial intelligence for multimedia data analysis. The convergence of big annotated data and affordable GPU hardware has allowed the training of neural networks for data analysis tasks which had been addressed until now with hand-crafted features. Architectures such as convolutional neural networks, recurrent neural networks and Q-nets for reinforcement learning have shaped a brand new scenario in signal processing. This course will cover the basic principles and applications of deep learning to computer vision problems, such as image classification, object detection or text captioning.

optimal subsampling

Tian Tian

This document proposes a new optimal subsampling strategy for logistic regression models based on D-optimal designs. The algorithm iteratively takes subsamples of increasing size based on the current parameter estimates. It selects data points that maximize the determinant of the information matrix to better preserve information from the full dataset. Simulation results show the new algorithm outperforms random sampling and existing subsampling methods, achieving lower mean squared errors for parameter estimates, especially in small sample size scenarios. Ongoing work looks to incorporate additional modeling improvements and applications.

You only look once (YOLO) : unified real time object detection

Entrepreneur / Startup

YOLO (You Only Look Once) is a real-time object detection system that frames object detection as a regression problem. It uses a single neural network that predicts bounding boxes and class probabilities directly from full images in one evaluation. This approach allows YOLO to process images and perform object detection over 45 frames per second while maintaining high accuracy compared to previous systems. YOLO was trained on natural images from PASCAL VOC and can generalize to new domains like artwork without significant degradation in performance, unlike other methods that struggle with domain shift.

Yolo

Sourav Garai

This document discusses the real-time object detection method YOLO (You Only Look Once). YOLO divides an image into grids and predicts bounding boxes and class probabilities for each grid cell. It sees the full image at once rather than using a sliding window approach. This allows it to detect objects in one pass of the neural network, making it very fast compared to other methods. YOLO is also accurate, achieving a high mean average precision. However, it can struggle to precisely localize small objects and objects that appear in dense groups.

Deep Learning for Computer Vision: Attention Models (UPC 2016)

Universitat Politècnica de Catalunya

Review : Prototype Mixture Models for Few-shot Semantic Segmentation

Dongmin Choi

Deep Learning for Computer Vision: Object Detection (UPC 2016)

Universitat Politècnica de Catalunya

Tutorial of topological data analysis part 3(Mapper algorithm)

Ha Phuong

The document provides an overview of the Mapper algorithm, a technique from topological data analysis. It begins by introducing basic concepts from topology like Reeb graphs and Morse theory. It then describes the key steps of the Mapper algorithm: (1) defining a filter function on the data, (2) clustering inverse images of the filter, and (3) connecting clusters to form a graph. The document discusses practical considerations like choosing filter functions and parameters. It also provides examples of applying Mapper for tasks like clustering, feature selection, and data exploration.

Integrating Practical2009

ISSGC Summer School

The document describes an integrating practical simulation involving searching for pillars on a surface to find words of wisdom. Participants are expected to write programs to interface tools to search across the surface for pillars and plaques, read words on the plaques, recognize patterns, and make use of capabilities. Instructions are provided for using scanner tools to search areas and submitting findings, as well as technology-specific instructions. Participants will report results and insights gained from searching strategies and technology evaluations.

Data Applied: Clustering

DataminingTools Inc

This document provides an introduction to clustering techniques and the BIRCH algorithm. It defines clustering as dividing data instances into natural groups rather than predicting classes. The BIRCH algorithm incrementally clusters multi-dimensional data to produce high quality clusters using minimal resources. It can handle large datasets by performing clustering in one data scan and allows for outliers. The algorithm builds a CF tree using clustering features to summarize cluster information during the incremental clustering process.

Review : Multi-Domain Image Completion for Random Missing Input Data [cdm]

Dongmin Choi

Unsupervised Learning (D2L6 2017 UPC Deep Learning for Computer Vision)

Universitat Politècnica de Catalunya

https://telecombcn-dl.github.io/2017-dlcv/ Deep learning technologies are at the core of the current revolution in artificial intelligence for multimedia data analysis. The convergence of large-scale annotated datasets and affordable GPU hardware has allowed the training of neural networks for data analysis tasks which were previously addressed with hand-crafted features. Architectures such as convolutional neural networks, recurrent neural networks and Q-nets for reinforcement learning have shaped a brand new scenario in signal processing. This course will cover the basic principles and applications of deep learning to computer vision problems, such as image classification, object detection or image captioning.

Review: You Only Look One-level Feature

Dongmin Choi

3D Scene Analysis via Sequenced Predictions over Points and Regions

Flavia Grosan

Deep image retrieval learning global representations for image search

Universitat Politècnica de Catalunya

Slides by Albert Jimenez about the following paper: Gordo, Albert, Jon Almazan, Jerome Revaud, and Diane Larlus. "Deep Image Retrieval: Learning global representations for image search." arXiv preprint arXiv:1604.01325 (2016). We propose a novel approach for instance-level image retrieval. It produces a global and compact fixed-length representation for each image by aggregating many region-wise descriptors. In contrast to previous works employing pre-trained deep networks as a black box to produce features, our method leverages a deep architecture trained for the specific task of image retrieval. Our contribution is twofold: (i) we introduce a ranking framework to learn convolution and projection weights that are used to build the region features; and (ii) we employ a region proposal network to learn which regions should be pooled to form the final global descriptor. We show that using clean training data is key to the success of our approach. To that aim, we leverage a large scale but noisy landmark dataset and develop an automatic cleaning approach. The proposed architecture produces a global image representation in a single forward pass. Our approach significantly outperforms previous approaches based on global descriptors on standard datasets. It even surpasses most prior works based on costly local descriptor indexing and spatial verification. We intend to release our pre-trained model.

PhD defence - Steven Vanonckelen

Steven Vanonckelen

Atmospheric and topographic corrections improve Landsat satellite imagery for analysis of forest cover dynamics in mountainous areas. Corrections increase pixel homogeneity and reduce dependency of reflectance values on terrain illumination. Image preprocessing, including topographic correction and compositing, leads to more accurate land cover classification and change mapping over large areas. Factors controlling forest cover dynamics in the Romanian Carpathians from 1985-2010 include accessibility, demographic changes, land use policies, slope, elevation, and soil type.

Techniques for effective and efficient fire detection from social media images

Universidade de São Paulo

Social media provides information, in the form of images, that is valuable to a vast set of human activities, including salvage and rescue in the case of crisis situations (such as accidents, explosions, and fire). However, these services produce images in a rate that is impossible for human beings to absorb and analyze; thus, it is a requirement to have methods for automatic analysis. However, despite the multiple works on image analysis, there are no studies on the specific topic of fire detection over social media. To fill this gap, this work describes the use and the evaluation of an ample set of content-based image retrieval and classification techniques in the task of fire detection. In our intent, we (1) built a ground-truth set of annotated images regarding fire occurrence; (2) engineered the Fast-Fire Detection and Retrieval ($\FFDnR$) architecture to combine configurations of feature extractors and distance functions to work with instance-based learning; and (3) evaluated 36 image descriptors in the task of fire detection. Our results demonstrated that, for fire detection, the best image descriptors concerning efficacy (F-measure, Precision-Recall, and ROC) and processing efficiency (wall-clock time) are achieved with MPEG-7 feature extractors Color Structure and Scalable Color, and with distance functions City-Block and Euclidean. Our work shall provide basis for further developments regarding monitoring of images from social media.

What's hot

Linear Discrimination Centering on Support Vector Machines

butest

Fast Non-Uniform Filtering with Symmetric Weighted Integral Images

davidmarimon

Deep image retrieval - learning global representations for image search - ub ...

Universitat de Barcelona

Deformable DETR Review [CDM]

Dongmin Choi

Convolutional Patch Representations for Image Retrieval An unsupervised approach

Universitat de Barcelona

Deep Learning for Computer Vision: Unsupervised Learning (UPC 2016)

Universitat Politècnica de Catalunya

optimal subsampling

Tian Tian

You only look once (YOLO) : unified real time object detection

Entrepreneur / Startup

Yolo

Sourav Garai

Deep Learning for Computer Vision: Attention Models (UPC 2016)

Universitat Politècnica de Catalunya

Review : Prototype Mixture Models for Few-shot Semantic Segmentation

Dongmin Choi

Deep Learning for Computer Vision: Object Detection (UPC 2016)

Universitat Politècnica de Catalunya

Tutorial of topological data analysis part 3(Mapper algorithm)

Ha Phuong

Integrating Practical2009

ISSGC Summer School

Data Applied: Clustering

DataminingTools Inc

Review : Multi-Domain Image Completion for Random Missing Input Data [cdm]

Dongmin Choi

Unsupervised Learning (D2L6 2017 UPC Deep Learning for Computer Vision)

Universitat Politècnica de Catalunya

Review: You Only Look One-level Feature

Dongmin Choi

3D Scene Analysis via Sequenced Predictions over Points and Regions

Flavia Grosan

Deep image retrieval learning global representations for image search

Universitat Politècnica de Catalunya

What's hot (20)

Linear Discrimination Centering on Support Vector Machines

Fast Non-Uniform Filtering with Symmetric Weighted Integral Images

Deep image retrieval - learning global representations for image search - ub ...

Deformable DETR Review [CDM]

Convolutional Patch Representations for Image Retrieval An unsupervised approach

Deep Learning for Computer Vision: Unsupervised Learning (UPC 2016)

optimal subsampling

You only look once (YOLO) : unified real time object detection

Yolo

Deep Learning for Computer Vision: Attention Models (UPC 2016)

Review : Prototype Mixture Models for Few-shot Semantic Segmentation

Deep Learning for Computer Vision: Object Detection (UPC 2016)

Tutorial of topological data analysis part 3(Mapper algorithm)

Integrating Practical2009

Data Applied: Clustering

Review : Multi-Domain Image Completion for Random Missing Input Data [cdm]

Unsupervised Learning (D2L6 2017 UPC Deep Learning for Computer Vision)

Review: You Only Look One-level Feature

3D Scene Analysis via Sequenced Predictions over Points and Regions

Deep image retrieval learning global representations for image search

Viewers also liked

PhD defence - Steven Vanonckelen

Steven Vanonckelen

Techniques for effective and efficient fire detection from social media images

Universidade de São Paulo

Color and color models

Safwan Hashmi

Color is a sensation produced by the human visual system. The two most common color models are RGB, used for computer displays, and CMYK, used for printing. RGB is an additive model that uses combinations of red, green, and blue light to produce colors. CMYK is a subtractive model that uses combinations of cyan, magenta, yellow, and black inks to produce colors. Both models represent colors using three numeric values corresponding to the intensities of the primary colors.

Color Models Computer Graphics

dhruv141293

A color model specifies a color space and visible subset of colors within it. There are four main hardware-oriented color models: RGB, CMY, CMYK, and YIQ. However, these are not intuitive for describing color in terms of hue, saturation and brightness. Therefore, models like HSV, HLS, and HVC were developed which relate more directly to human perception of color. The RGB and CMY models represent colors as combinations of red, green, blue and cyan, magenta, yellow primary colors respectively and are used in monitors and printing.

Fire detection & alarm system

Politeknik Sultan Haji Ahmad Shah

This document discusses fire detection and alarm systems. It covers the design requirements based on building standards, planning the system based on building type and size, selecting the type of coverage needed, configuring zones within the premises, guidelines for zone configuration, types of alarm detection systems including conventional and addressable, and addressing techniques for detectors. The overall purpose is to provide early warning of fires and allow firefighting actions before situations get out of control.

Fire alarm system (sistem penggera kebakaran)

Nurul Husna

fire detection and alarm system

singh1515

The document discusses fire detection and alarm systems. It provides details on: 1) The purposes of fire detection systems which are to detect fires, notify occupants, summon assistance and initiate suppression systems. 2) The basic components of systems including input devices like manual pull stations and detectors, and output devices like alarms and controls. 3) Different types of detectors like heat, smoke and gas detectors and their functions. 4) Factors to consider for detector placement like area size and layout. 5) Conventional and addressable microprocessor-based systems and their advantages. 6) Approvals and standards required for fire detection systems.

Color models

Haitham Ahmed

This document discusses various color models used in computer graphics including RGB, HSV, HSL, CMY, and CMYK. It explains the key components of each model such as hue, saturation, value, and how colors are represented. Common applications of different color models are also summarized such as RGB for computer displays and CMYK for printing. In addition, the concepts of dithering and half-toning techniques used to reproduce colors on devices are introduced.

Color Models

Mustafa Salam

This document discusses different color models used in computer graphics and printing. It explains that color models are systems for creating a range of colors from a small set of primary colors. The two main types are additive models which use light, like RGB, and subtractive models which use inks, like CMYK. RGB uses red, green and blue light and is for computer displays. CMYK uses cyan, magenta, yellow and black inks and is the standard for color printing. It provides details on how each model mixes colors and describes other models like HSV which represents color in terms of hue, saturation and value.

Fire Detection and Alarm Systems

J.T.A.JONES

Fire detection and alarm systems are installed to notify occupants of a fire, summon assistance to fight fires, and initiate automatic suppression systems. There are different types of automatic alarm initiating devices like heat, smoke, and flame detectors that sense fire. Indicating devices like audible alarms and visible strobes alert people of a fire. Automatic alarm systems transmit alarm signals off-site to notify emergency responders. These systems are supervised to ensure proper operation and may include auxiliary functions to support firefighting and safety.

Viewers also liked (10)

PhD defence - Steven Vanonckelen

Techniques for effective and efficient fire detection from social media images

Color and color models

Color Models Computer Graphics

Fire detection & alarm system

Fire alarm system (sistem penggera kebakaran)

fire detection and alarm system

Color models

Color Models

Fire Detection and Alarm Systems

Similar to cvpr2009: class specific hough forest for object detection

Presentation on unsupervised learning

ANKUSH PAL

Spatially Coherent Latent Topic Model For Concurrent Object Segmentation and ...

Shao-Chuan Wang

The document summarizes a research paper on spatially coherent latent topic modeling for concurrent object segmentation and classification from images. The proposed model represents images as a collection of regions, each associated with a latent topic. It incorporates spatial relationships between regions by encouraging neighboring regions to take on similar topics. The model is trained using variational message passing to maximize the log likelihood of image data. Experimental results show the model can segment objects even under occlusion and achieve good performance on supervised classification tasks using natural scene images.

Introduction to conventional machine learning techniques

Xavier Rafael Palou

This document provides an overview of machine learning techniques for classification and anomaly detection. It begins with an introduction to machine learning and common tasks like classification, clustering, and anomaly detection. Basic classification techniques are then discussed, including probabilistic classifiers like Naive Bayes, decision trees, instance-based learning like k-nearest neighbors, and linear classifiers like logistic regression. The document provides examples and comparisons of these different methods. It concludes by discussing anomaly detection and how it differs from classification problems, noting challenges like having few positive examples of anomalies.

nnml.ppt

yang947066

This document provides an overview of machine learning and neural network techniques. It defines machine learning as the field that focuses on algorithms that can learn. The document discusses several key components of a machine learning model, including what is being learned (the domain) and from what information the learner is learning. It then summarizes several common machine learning algorithms like k-NN, Naive Bayes classifiers, decision trees, reinforcement learning, and the Rocchio algorithm for relevance feedback in information retrieval. For each technique, it provides a brief definition and examples of applications.

Pattern learning and recognition on statistical manifolds: An information-geo...

Frank Nielsen

This document provides an overview of Frank Nielsen's talk on pattern learning and recognition using information geometry and statistical manifolds. The talk focuses on departing from vector space representations and dealing with (dis)similarities that do not have Euclidean or metric properties. This poses new theoretical and computational challenges for pattern recognition. The talk describes using exponential family mixture models defined on dually flat statistical manifolds induced by convex functions. On these manifolds, dual coordinate systems and dual affine geodesics allow for computing-friendly representations of divergences and similarities between probabilistic patterns. The techniques aim to achieve statistical invariance and enable algorithmic approaches to problems like Gaussian mixture modeling, shape retrieval, and diffusion tensor imaging analysis.

Conventional Neural Networks and compute

YobuDJob1

Fcv learn yu

cvpr2009: class specific hough forest for object detection

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Viewers also liked

Viewers also liked (10)

Similar to cvpr2009: class specific hough forest for object detection

Similar to cvpr2009: class specific hough forest for object detection (20)

More from zukun

More from zukun (20)

Recently uploaded

Recently uploaded (20)

cvpr2009: class specific hough forest for object detection