SlideShare a Scribd company logo
1 of 1
Download to read offline
iBrutus-Computer Vision Module
                             iBrutus-Computer Vision Module
                                          Rajagopal Vasudevan, Hareendra Manuru, Ashok Sasidharan
                                                           http://iss.osu.edu/iBrutus/

                          Introduction                                                                    Person Tracking


Overview
The computer vision system is responsible for adding/removing/tracking
people in the scene. For every frame it updates the location and the
segment of the source. This is necessary to enable the dialog system to
interact with multiple parties. Its primary purpose is to know where
everyone is in the scene so Brutus could look at them when talking or                          Figure 3. Person Tracking using Kinect
listening to them.
                                                                              Person tracking was done using a variant of Depth Forest algorithm used
Giving Brutus Eyes                                                            by Microsoft Kinect.
                                                                              The algorithm takes care of partial occlusion and also provides the skeleton
  640 x 480 Microsoft Kinect color camera with 5 7 º horizontal and 43º
                                                                              for two people(active) closest to the Kinect. The range of tracking is 0.8 - 4m.
  vertical FOVs with a tilt of 27 º up or down.
 Depth map provide by kinect is used to limit Brutus' region of interest      This algorithm can track upto 6 people.
 up-to a certain distance.
                                                                                                  Person Interest Determination




                  Figure 1. Microsoft Kinect
                                                                                              Figure 4. Person Interest Determination
                                                                              Face(facial features) detection was performed using Viola J ones Classifiers.
                                                                              Person interest made binary depending on the features detected.

                                                                                                     Sound Source Localization
                                                                                                      Sound Source Localization
                                                                                                         Gaze vector estimation


                    Figure 2. Kinect images


                      Objectives and Goals


                                          Identifies a person
                            PERSON        entering the scene                                         Figure 4. Highlighted segments
                           TRACKING
                                          Tracks his/her location             The beam and source angles are provided by the SDK.
                                          through all the frames
                                                                              The segment in which the person talks is highlighted.
                                                                              Can focus on a particular segment which helps in speech recognition.

       CV                     PERSON INTEREST
                              DETERMINATION
                                                Detects the face and its
                                                features
                                                Determines start/continuity
                                                of conversation
                                                                                                             Future Scope

                                                                              Kalman filtering to manage full occlusion.
                                                                              Face recognition.
                                         Microphone array used to find        Gesture recognition.
                        SOUND SOURCE     the source segment
                         LOCALIZATION
                                         Helps in directing the avatar' s                                 Acknowledgements
                                         gaze
                                                                              This project is a collaboration between CETI and ISS in association
                                                                              with iShoe and Ohio State Athletics.
                                                                              A special thanks to Dr. Rajiv Ramnath and Dr. Lee Potter for their
                                                                              continued support.

More Related Content

What's hot

Who is the Customer? What is Experience? Indispensable Insights to empower yo...
Who is the Customer? What is Experience? Indispensable Insights to empower yo...Who is the Customer? What is Experience? Indispensable Insights to empower yo...
Who is the Customer? What is Experience? Indispensable Insights to empower yo...CHI Poland
 
Richer Interactions And Experiences Over Mobile Phone
Richer Interactions And Experiences Over Mobile PhoneRicher Interactions And Experiences Over Mobile Phone
Richer Interactions And Experiences Over Mobile Phoneguest96ff156
 
Mollenbach Single Gaze Gestures
Mollenbach Single Gaze GesturesMollenbach Single Gaze Gestures
Mollenbach Single Gaze GesturesKalle
 
1 facial expression 9 (1)
1 facial expression 9 (1)1 facial expression 9 (1)
1 facial expression 9 (1)prjpublications
 
Open source print quality software
Open source print quality softwareOpen source print quality software
Open source print quality softwareChristophe Mercier
 
Fingerprint recognition using correlation
Fingerprint recognition using correlationFingerprint recognition using correlation
Fingerprint recognition using correlationWABCO
 

What's hot (11)

Who is the Customer? What is Experience? Indispensable Insights to empower yo...
Who is the Customer? What is Experience? Indispensable Insights to empower yo...Who is the Customer? What is Experience? Indispensable Insights to empower yo...
Who is the Customer? What is Experience? Indispensable Insights to empower yo...
 
Dohyoung lee icassp2012_poster
Dohyoung lee icassp2012_posterDohyoung lee icassp2012_poster
Dohyoung lee icassp2012_poster
 
42 128-1-pb
42 128-1-pb42 128-1-pb
42 128-1-pb
 
Richer Interactions And Experiences Over Mobile Phone
Richer Interactions And Experiences Over Mobile PhoneRicher Interactions And Experiences Over Mobile Phone
Richer Interactions And Experiences Over Mobile Phone
 
Mollenbach Single Gaze Gestures
Mollenbach Single Gaze GesturesMollenbach Single Gaze Gestures
Mollenbach Single Gaze Gestures
 
Nayana
Nayana Nayana
Nayana
 
Eye-based head gestures
Eye-based head gesturesEye-based head gestures
Eye-based head gestures
 
1 facial expression 9 (1)
1 facial expression 9 (1)1 facial expression 9 (1)
1 facial expression 9 (1)
 
Open source print quality software
Open source print quality softwareOpen source print quality software
Open source print quality software
 
Improving access to mobile technologies using tactile feedback
Improving access to mobile technologies using tactile feedback Improving access to mobile technologies using tactile feedback
Improving access to mobile technologies using tactile feedback
 
Fingerprint recognition using correlation
Fingerprint recognition using correlationFingerprint recognition using correlation
Fingerprint recognition using correlation
 

Viewers also liked

7 puntos a_considerar_cuando_creas_tu_marca_empresarial
7 puntos a_considerar_cuando_creas_tu_marca_empresarial7 puntos a_considerar_cuando_creas_tu_marca_empresarial
7 puntos a_considerar_cuando_creas_tu_marca_empresarialVicente Sánchez Vargas
 
Year 8 scientist challenge
Year 8 scientist challengeYear 8 scientist challenge
Year 8 scientist challengeKHSscience
 
Property taxes
Property taxesProperty taxes
Property taxesjlcrane
 
Fruitandvegetablecutandpaste
FruitandvegetablecutandpasteFruitandvegetablecutandpaste
Fruitandvegetablecutandpastemalopo
 
Immer gut gekleidet1(fri)
Immer gut gekleidet1(fri)Immer gut gekleidet1(fri)
Immer gut gekleidet1(fri)Petefull
 
It's all right with me di pietro mazzone -eng
 It's all right with me   di pietro mazzone -eng It's all right with me   di pietro mazzone -eng
It's all right with me di pietro mazzone -engLucio Ferrara
 
Uvodna radionica februar
Uvodna radionica februarUvodna radionica februar
Uvodna radionica februarAtria Group SEE
 
El chihuaha monstruoso 5ºa
El chihuaha monstruoso 5ºaEl chihuaha monstruoso 5ºa
El chihuaha monstruoso 5ºamjluquino
 
Lectura 2014
Lectura 2014Lectura 2014
Lectura 2014marianria
 
Biografia pana larry joe
Biografia pana larry joeBiografia pana larry joe
Biografia pana larry joePana Larry Joe
 
Richiestaelezioni
RichiestaelezioniRichiestaelezioni
RichiestaelezioniFaisa Cisal
 
Pré produção
Pré produçãoPré produção
Pré produçãoal31899
 
Work and travel 2012 renan (1)
Work and travel 2012 renan (1)Work and travel 2012 renan (1)
Work and travel 2012 renan (1)judithtem
 

Viewers also liked (18)

7 puntos a_considerar_cuando_creas_tu_marca_empresarial
7 puntos a_considerar_cuando_creas_tu_marca_empresarial7 puntos a_considerar_cuando_creas_tu_marca_empresarial
7 puntos a_considerar_cuando_creas_tu_marca_empresarial
 
Year 8 scientist challenge
Year 8 scientist challengeYear 8 scientist challenge
Year 8 scientist challenge
 
02.06 podia ser pior
02.06   podia ser pior02.06   podia ser pior
02.06 podia ser pior
 
Property taxes
Property taxesProperty taxes
Property taxes
 
Fruitandvegetablecutandpaste
FruitandvegetablecutandpasteFruitandvegetablecutandpaste
Fruitandvegetablecutandpaste
 
Immer gut gekleidet1(fri)
Immer gut gekleidet1(fri)Immer gut gekleidet1(fri)
Immer gut gekleidet1(fri)
 
It's all right with me di pietro mazzone -eng
 It's all right with me   di pietro mazzone -eng It's all right with me   di pietro mazzone -eng
It's all right with me di pietro mazzone -eng
 
Hak Akses SI Aktifitas Kuliah
Hak Akses SI Aktifitas KuliahHak Akses SI Aktifitas Kuliah
Hak Akses SI Aktifitas Kuliah
 
Uvodna radionica februar
Uvodna radionica februarUvodna radionica februar
Uvodna radionica februar
 
Insurance(Pt A4)
Insurance(Pt A4)Insurance(Pt A4)
Insurance(Pt A4)
 
El chihuaha monstruoso 5ºa
El chihuaha monstruoso 5ºaEl chihuaha monstruoso 5ºa
El chihuaha monstruoso 5ºa
 
Lectura 2014
Lectura 2014Lectura 2014
Lectura 2014
 
Biografia pana larry joe
Biografia pana larry joeBiografia pana larry joe
Biografia pana larry joe
 
Richiestaelezioni
RichiestaelezioniRichiestaelezioni
Richiestaelezioni
 
Pré produção
Pré produçãoPré produção
Pré produção
 
Geometria
GeometriaGeometria
Geometria
 
Silvia
SilviaSilvia
Silvia
 
Work and travel 2012 renan (1)
Work and travel 2012 renan (1)Work and travel 2012 renan (1)
Work and travel 2012 renan (1)
 

Similar to Ib2

426 Lecture 9: Research Directions in AR
426 Lecture 9: Research Directions in AR426 Lecture 9: Research Directions in AR
426 Lecture 9: Research Directions in ARMark Billinghurst
 
A NOVAL ARTECHTURE FOR 3D MODEL IN VIRTUAL COMMUNITIES FROM FACE DETECTION
A NOVAL ARTECHTURE FOR 3D MODEL IN VIRTUAL COMMUNITIES FROM FACE DETECTIONA NOVAL ARTECHTURE FOR 3D MODEL IN VIRTUAL COMMUNITIES FROM FACE DETECTION
A NOVAL ARTECHTURE FOR 3D MODEL IN VIRTUAL COMMUNITIES FROM FACE DETECTIONIJASCSE
 
Depth-Image-based Facial Analysis between Age Groups and Recognition of 3D Faces
Depth-Image-based Facial Analysis between Age Groups and Recognition of 3D FacesDepth-Image-based Facial Analysis between Age Groups and Recognition of 3D Faces
Depth-Image-based Facial Analysis between Age Groups and Recognition of 3D FacesIDES Editor
 
Presence @ Winterschool 2008
Presence @ Winterschool 2008Presence @ Winterschool 2008
Presence @ Winterschool 2008scottw
 
AI Therapist – Emotion Detection using Facial Detection and Recognition and S...
AI Therapist – Emotion Detection using Facial Detection and Recognition and S...AI Therapist – Emotion Detection using Facial Detection and Recognition and S...
AI Therapist – Emotion Detection using Facial Detection and Recognition and S...ijtsrd
 
Deep Learning in Robotics Enhancing Perception and Decision-Making.pptx
Deep Learning in Robotics Enhancing Perception and Decision-Making.pptxDeep Learning in Robotics Enhancing Perception and Decision-Making.pptx
Deep Learning in Robotics Enhancing Perception and Decision-Making.pptxMicrosoft azure
 
Icetet 2010 id 94 fkp segmentation
Icetet 2010   id 94 fkp segmentationIcetet 2010   id 94 fkp segmentation
Icetet 2010 id 94 fkp segmentationDr. Vinayak Bharadi
 
Facial expression recognition on real world face images (OPTIK)
Facial expression recognition on real world face images (OPTIK)Facial expression recognition on real world face images (OPTIK)
Facial expression recognition on real world face images (OPTIK)Sohail Ahmed
 
A Textural Approach to Palmprint Identification
A Textural Approach to Palmprint IdentificationA Textural Approach to Palmprint Identification
A Textural Approach to Palmprint IdentificationIJASCSE
 
Fingerprint, seminar at IASRI, New Delhi
Fingerprint, seminar at IASRI, New DelhiFingerprint, seminar at IASRI, New Delhi
Fingerprint, seminar at IASRI, New DelhiNishikant Taksande
 
Image semantic coding using OTB
Image semantic coding using OTBImage semantic coding using OTB
Image semantic coding using OTBmelaneum
 
Synops emotion recognize
Synops emotion recognizeSynops emotion recognize
Synops emotion recognizeAvdhesh Gupta
 
Mit6870 template matching and histograms
Mit6870 template matching and histogramsMit6870 template matching and histograms
Mit6870 template matching and histogramszukun
 
Implementation of video tagging identifying characters in video
Implementation of video tagging identifying characters in videoImplementation of video tagging identifying characters in video
Implementation of video tagging identifying characters in videoeSAT Publishing House
 
10.1.1.105.2133
10.1.1.105.213310.1.1.105.2133
10.1.1.105.2133liji3
 
Unlocking Any Door In The 21st Century. Immersion In Biometric Security.
Unlocking Any Door In The 21st Century. Immersion In Biometric Security.Unlocking Any Door In The 21st Century. Immersion In Biometric Security.
Unlocking Any Door In The 21st Century. Immersion In Biometric Security.Payment Village
 
MIT 6.870 - Template Matching and Histograms (Nicolas Pinto, MIT)
MIT 6.870 - Template Matching and Histograms (Nicolas Pinto, MIT)MIT 6.870 - Template Matching and Histograms (Nicolas Pinto, MIT)
MIT 6.870 - Template Matching and Histograms (Nicolas Pinto, MIT)npinto
 

Similar to Ib2 (20)

426 Lecture 9: Research Directions in AR
426 Lecture 9: Research Directions in AR426 Lecture 9: Research Directions in AR
426 Lecture 9: Research Directions in AR
 
A NOVAL ARTECHTURE FOR 3D MODEL IN VIRTUAL COMMUNITIES FROM FACE DETECTION
A NOVAL ARTECHTURE FOR 3D MODEL IN VIRTUAL COMMUNITIES FROM FACE DETECTIONA NOVAL ARTECHTURE FOR 3D MODEL IN VIRTUAL COMMUNITIES FROM FACE DETECTION
A NOVAL ARTECHTURE FOR 3D MODEL IN VIRTUAL COMMUNITIES FROM FACE DETECTION
 
Depth-Image-based Facial Analysis between Age Groups and Recognition of 3D Faces
Depth-Image-based Facial Analysis between Age Groups and Recognition of 3D FacesDepth-Image-based Facial Analysis between Age Groups and Recognition of 3D Faces
Depth-Image-based Facial Analysis between Age Groups and Recognition of 3D Faces
 
Presence @ Winterschool 2008
Presence @ Winterschool 2008Presence @ Winterschool 2008
Presence @ Winterschool 2008
 
AI Therapist – Emotion Detection using Facial Detection and Recognition and S...
AI Therapist – Emotion Detection using Facial Detection and Recognition and S...AI Therapist – Emotion Detection using Facial Detection and Recognition and S...
AI Therapist – Emotion Detection using Facial Detection and Recognition and S...
 
Deep Learning in Robotics Enhancing Perception and Decision-Making.pptx
Deep Learning in Robotics Enhancing Perception and Decision-Making.pptxDeep Learning in Robotics Enhancing Perception and Decision-Making.pptx
Deep Learning in Robotics Enhancing Perception and Decision-Making.pptx
 
Tagsense ppt
Tagsense pptTagsense ppt
Tagsense ppt
 
Icetet 2010 id 94 fkp segmentation
Icetet 2010   id 94 fkp segmentationIcetet 2010   id 94 fkp segmentation
Icetet 2010 id 94 fkp segmentation
 
Facial expression recognition on real world face images (OPTIK)
Facial expression recognition on real world face images (OPTIK)Facial expression recognition on real world face images (OPTIK)
Facial expression recognition on real world face images (OPTIK)
 
A Textural Approach to Palmprint Identification
A Textural Approach to Palmprint IdentificationA Textural Approach to Palmprint Identification
A Textural Approach to Palmprint Identification
 
Fingerprint, seminar at IASRI, New Delhi
Fingerprint, seminar at IASRI, New DelhiFingerprint, seminar at IASRI, New Delhi
Fingerprint, seminar at IASRI, New Delhi
 
human face counting
human face countinghuman face counting
human face counting
 
Image semantic coding using OTB
Image semantic coding using OTBImage semantic coding using OTB
Image semantic coding using OTB
 
Synops emotion recognize
Synops emotion recognizeSynops emotion recognize
Synops emotion recognize
 
Pedro Nakazato Andrade | Portfolio
Pedro Nakazato Andrade | PortfolioPedro Nakazato Andrade | Portfolio
Pedro Nakazato Andrade | Portfolio
 
Mit6870 template matching and histograms
Mit6870 template matching and histogramsMit6870 template matching and histograms
Mit6870 template matching and histograms
 
Implementation of video tagging identifying characters in video
Implementation of video tagging identifying characters in videoImplementation of video tagging identifying characters in video
Implementation of video tagging identifying characters in video
 
10.1.1.105.2133
10.1.1.105.213310.1.1.105.2133
10.1.1.105.2133
 
Unlocking Any Door In The 21st Century. Immersion In Biometric Security.
Unlocking Any Door In The 21st Century. Immersion In Biometric Security.Unlocking Any Door In The 21st Century. Immersion In Biometric Security.
Unlocking Any Door In The 21st Century. Immersion In Biometric Security.
 
MIT 6.870 - Template Matching and Histograms (Nicolas Pinto, MIT)
MIT 6.870 - Template Matching and Histograms (Nicolas Pinto, MIT)MIT 6.870 - Template Matching and Histograms (Nicolas Pinto, MIT)
MIT 6.870 - Template Matching and Histograms (Nicolas Pinto, MIT)
 

Ib2

  • 1. iBrutus-Computer Vision Module iBrutus-Computer Vision Module Rajagopal Vasudevan, Hareendra Manuru, Ashok Sasidharan http://iss.osu.edu/iBrutus/ Introduction Person Tracking Overview The computer vision system is responsible for adding/removing/tracking people in the scene. For every frame it updates the location and the segment of the source. This is necessary to enable the dialog system to interact with multiple parties. Its primary purpose is to know where everyone is in the scene so Brutus could look at them when talking or Figure 3. Person Tracking using Kinect listening to them. Person tracking was done using a variant of Depth Forest algorithm used Giving Brutus Eyes by Microsoft Kinect. The algorithm takes care of partial occlusion and also provides the skeleton 640 x 480 Microsoft Kinect color camera with 5 7 º horizontal and 43º for two people(active) closest to the Kinect. The range of tracking is 0.8 - 4m. vertical FOVs with a tilt of 27 º up or down. Depth map provide by kinect is used to limit Brutus' region of interest This algorithm can track upto 6 people. up-to a certain distance. Person Interest Determination Figure 1. Microsoft Kinect Figure 4. Person Interest Determination Face(facial features) detection was performed using Viola J ones Classifiers. Person interest made binary depending on the features detected. Sound Source Localization Sound Source Localization Gaze vector estimation Figure 2. Kinect images Objectives and Goals Identifies a person PERSON entering the scene Figure 4. Highlighted segments TRACKING Tracks his/her location The beam and source angles are provided by the SDK. through all the frames The segment in which the person talks is highlighted. Can focus on a particular segment which helps in speech recognition. CV PERSON INTEREST DETERMINATION Detects the face and its features Determines start/continuity of conversation Future Scope Kalman filtering to manage full occlusion. Face recognition. Microphone array used to find Gesture recognition. SOUND SOURCE the source segment LOCALIZATION Helps in directing the avatar' s Acknowledgements gaze This project is a collaboration between CETI and ISS in association with iShoe and Ohio State Athletics. A special thanks to Dr. Rajiv Ramnath and Dr. Lee Potter for their continued support.