SlideShare a Scribd company logo
1 of 1
Download to read offline
Abstract
Like Intelligent IPCameras, Autonomous driving cars rely on built-in intelligence using video analytics. Unlike in IPCameras, a bad
decision from video analytics software in an automobiles could cause loss of life. Hence cars must overcome accuracy limitations to
improve detection accuracy for safety and avoid false alarms to maintain the driver experience. With the recent success in research
around several depth perception technologies, and their cost effective availability, surveillance cameras are starting to adopt these
highly accurate technologies. The paper sheds light into some of these technologies and how they improve overall analytics accuracy.
Problems with Monocular Vision
Binocular vision helps humans perceive depth. Still, humans are prone to illusions. Most security cameras are worse. They are
monocular and cannot sense depth, so security cameras cannot easily distinguish between a person and his/her shadow or distinguish
between two people standing behind each other (problem of occlusion). Monocular vision imposes stringent limits on the accuracy of
any video analytics.
The “turning lady illusion” is a good example to illustrate this problem. The mid-section of the lady in the video has no associated depth
information. In observing the illusion, humans subconsciously apply various availability and representative heuristics to make an
assumption that the mid-section of the lady is either part of the raised leg or part of the leg on the floor. Viewers who pursue a decision
path that associates the mid-section to the lady’s raised leg concludes that the lady is rotating clockwise while the rest would judge that
the lady is rotating counterclockwise.
Automotive Vision Technologies Improve Accuracy of Intelligent IPCameras
Jacob Jose,Product Manager, Embedded Processors, Texas Instruments
Depth sensing solutions for IPCameras
Processor solutions for intelligent IPCameras that use depth sensing need to incorporate power efficient signal processing as well as
peripheral interfaces that connect to various sensing devices. The processors need an optimal signal processing capability that
balances cost, processing power and heat dissipation. Integrated processor solutions hence need a combination of ARM and DSP
processors, vector engines and dedicated ASICs.
Texas Instruments DaVinci DM8127 integrates an inbuilt DSP engine and a vision co-processor for depth sensing and fusion, in
addition to an ARM processor and several ASIC engine blocks. DSPs are ideal for raw vision processing and also comes with the
programming flexibility that is usually associated with a general purpose processor. Compared to an ARM processor, a DSP consumes
much less power and dissipate less heat for the same MIPS consumed.
Structured Lighting using Digital Light Processing (DLP) technology
DLP technology relies on structured lighting to illuminate a grid on the scene and observe the distortions in the grid to perceive depth.
DLP offers significantly higher resolution than TOF technology, but has limited range.
Effective use of temporal information could increase the accuracy of
analytics in certain scenarios through background modelling. Video
Analytics algorithms offered with Texas Instruments DaVinci DMVA and
DM81xx processors offer accuracies that approach these theoretical
limits.
Use of Depth in Intelligent IPCameras
Use of depth perception to augment video analytics
algorithms will reduce false detections in Intelligent
IPCameras by 5-10x. Figure 2 below shows an example
where stereo sensors are used for depth perception. The
hand of the subject is clearly marked out by the analytics
engine despite the hand being part of the bigger subject
and within a crowd of details. It may also be noted that
fingers are further marked with a different color that
represents a higher proximity, augmenting the accuracy of
information.
Techniques like auto-calibration, noise cleanup and fusion
with redundant depth information from alternate sensors
are now highly refined in automotive applications.
Increased application of these technologies in video
surveillance will usher in a new generation of highly
intelligent IPCameras.
Depth sensing by self­driving cars
Smart and autonomous driving cars often have more than 10 cameras in a single automobile. However, as shown in the Table 1 below,
a combination of various non-visual sensors are often employed perceive depth accurately in different environments
For example, based on Table 1, a combination of stereo vision systems and a redundant mid-range radar can effectively detect
pedestrians, while long range radar is required for adaptive cruise control. Different technologies vary in accuracy, range and cost.
Hence a combination of diverse technologies are needed for different applications.
Figure 4 – Designing an optimal vision processing pipeline
Typical radar systems use either a CSI-2 or
JESD interface to transmit depth data to host
processor. The DM8127 processor also
integrates a MIPI interface. Some of the other
DSP processors from TI for the market
integrate a JESD interface, enabling
seamless connection to radar devices without
using additional glue logic.
Various depth sensing solutions, especially those that are commonly used in the automotive market, offer unique advantages to
different IPCamera applications. As with smart cars, a combination of technologies are required to solve the diverse product needs in
the intelligent camera market. For instance, Structured Lighting or Time of Flight sensors offer good accuracy in dimly lit indoor
environments, but may not be effective outdoors where solar radiation is abundant. On the other hand, stereo sensors that may
struggle indoors are highly effective outdoors due to the more beneficial lighting conditions.
Among the prominent depth sensing technologies, two technologies are explored below:
Time of Flight (TOF) Technology
Time of flight technology relies on flooding a scene with modulated pulses, often infrared or laser. The reflections of each of the pulses
are then timed to accurately calculate the distance from the object that the pulses have been reflected from. TOF technology can be
employed very cost effectively for near-field perception. Using laser or higher levels of illumination (more LEDs) will increase the depth
of field or resolution of the sensor, at a cost of increased power consumption.
Originally used for gesture recognition and accuracy assistance in automotive surround view systems, TOF technology is very effective
to perceive depth in dark environments, where stereo can be ineffective due to lack of illumination. TOF resolution is inversely
proportional to the distance from the sensor, making it an ideal technology for use in small rooms. On the other hand, TOF may not be
appropriate for outdoor cameras since solar radiation will negatively impact accuracy. Large warehouse rooms or parking lots that
require a large depth of field are also likely to need one of the alternate technologies.
Figure 5 – TOF Principle of Operation Figure 6 – Depth Resolution vs Distance
Texas Instruments offers DS325, a depth sense tool kit with an LED driver, an analog front end, a Time of Flight controller, sensor and
associated power components. A typical block diagram is in figure 7. More information can be found here:
http://www.ti.com/ww/en/analog/3dtof/index.shtml
Figure 8 – Principle of operation of DLP
structured lighting and application results
Texas Instruments offers DLP4500NIR Digital Micro-Mirror
Device (DMD) and controllers, DLPC200 and DLPC300 for this
market. An added advantage with DLP technology is that
investment in DLP technology can often be leveraged in
security, retail and machine vision markets, benefiting firms
with interests in more than one of these markets. More
information on DLP technology can be found here:
http://www.ti.com/solution/3d_machine_vision
A table comparing four of the depth sensing technologies is
outlined below. Common applications other than Intelligent
Cameras are also listed:
Intelligent IPCamera Reference Designs
Texas Instruments, in association with its partners, offers a
combination of reference design and turnkey solutions to help
TI’s Intelligent Camera customers get to market quickly using
some of the latest available technology.
Table 2 – Comparison of various depth sensing technologies
TI’s Intelligent IPCamera reference designs bundle 5 commonly used video analytics algorithms
within in a scalable software framework. These 5 algorithms are People counting, Trip zone,
Streaming metadata, Tamper detection and Motion detection. A set of performance optimized vision
API libraries - VLIB for DSPs and VCOPLIB for vision co-processors - offers a strong foundation for
vision software. The 5 bundled algorithms take advantage of the optimized libraries. TI's customers
further differentiate through designing their own analytics functions using APIs like discrepancy
mapping (for depth sensing) included in these libraries. More information on these reference
designs are available at www.ti.com/ipcamera.
Texas Instruments (TI) is a leader in embedded processor and analog technology for autonomous
driving cars. As the automotive technologies made available by TI become mainstream in the
differentiated intelligent camera market, early adopters stand to gain significantly.
Figure 9 ­
Figure 10 ­ Five high level video analytics algorithms bundled with TI DMVA processors Figure 11 ­ DMVA Analytics Software Framework

More Related Content

What's hot

INTELLIGENT URBAN TRAFFIC CONTROL SYSTEM / VIDEO SENSOR / WAEL SAAD HAMEEDI /...
INTELLIGENT URBAN TRAFFIC CONTROL SYSTEM / VIDEO SENSOR / WAEL SAAD HAMEEDI /...INTELLIGENT URBAN TRAFFIC CONTROL SYSTEM / VIDEO SENSOR / WAEL SAAD HAMEEDI /...
INTELLIGENT URBAN TRAFFIC CONTROL SYSTEM / VIDEO SENSOR / WAEL SAAD HAMEEDI /...
Wael Alawsey
 
Gesture recognition 2
Gesture recognition 2Gesture recognition 2
Gesture recognition 2
karimkabbani
 

What's hot (20)

final project ppt
final project pptfinal project ppt
final project ppt
 
Gesture Technology
Gesture TechnologyGesture Technology
Gesture Technology
 
Gesture recognition adi
Gesture recognition adiGesture recognition adi
Gesture recognition adi
 
Robotic design: Frontiers in visual and tactile sensing
Robotic design: Frontiers in visual and tactile sensingRobotic design: Frontiers in visual and tactile sensing
Robotic design: Frontiers in visual and tactile sensing
 
GESTURE RECOGNITION TECHNOLOGY
GESTURE RECOGNITION TECHNOLOGYGESTURE RECOGNITION TECHNOLOGY
GESTURE RECOGNITION TECHNOLOGY
 
Gesture Recognition
Gesture RecognitionGesture Recognition
Gesture Recognition
 
Gesture recognition techniques
Gesture  recognition techniques Gesture  recognition techniques
Gesture recognition techniques
 
Gesture recognition technology
Gesture recognition technology Gesture recognition technology
Gesture recognition technology
 
final project
final projectfinal project
final project
 
gesture recognition!
gesture recognition!gesture recognition!
gesture recognition!
 
Gesture recognition systems
Gesture recognition systemsGesture recognition systems
Gesture recognition systems
 
INTELLIGENT URBAN TRAFFIC CONTROL SYSTEM / VIDEO SENSOR / WAEL SAAD HAMEEDI /...
INTELLIGENT URBAN TRAFFIC CONTROL SYSTEM / VIDEO SENSOR / WAEL SAAD HAMEEDI /...INTELLIGENT URBAN TRAFFIC CONTROL SYSTEM / VIDEO SENSOR / WAEL SAAD HAMEEDI /...
INTELLIGENT URBAN TRAFFIC CONTROL SYSTEM / VIDEO SENSOR / WAEL SAAD HAMEEDI /...
 
Part 2 - Gesture Recognition Technology
Part   2 - Gesture Recognition TechnologyPart   2 - Gesture Recognition Technology
Part 2 - Gesture Recognition Technology
 
Gesture recognition 2
Gesture recognition 2Gesture recognition 2
Gesture recognition 2
 
Hand Gesture Recognition Using OpenCV Python
Hand Gesture Recognition Using OpenCV Python Hand Gesture Recognition Using OpenCV Python
Hand Gesture Recognition Using OpenCV Python
 
Real Time Head & Hand Tracking Using 2.5D Data
Real Time Head & Hand Tracking Using 2.5D Data Real Time Head & Hand Tracking Using 2.5D Data
Real Time Head & Hand Tracking Using 2.5D Data
 
GESTURE prestation
GESTURE prestation GESTURE prestation
GESTURE prestation
 
Gesture recognition
Gesture recognitionGesture recognition
Gesture recognition
 
Gesture Recognition Technology
Gesture Recognition TechnologyGesture Recognition Technology
Gesture Recognition Technology
 
screen less display
screen less displayscreen less display
screen less display
 

Similar to ASIS Poster - Final

F3D Corporate Snapshot V3.1 - Generic GP
F3D Corporate Snapshot V3.1 - Generic GPF3D Corporate Snapshot V3.1 - Generic GP
F3D Corporate Snapshot V3.1 - Generic GP
Sonia Willcox
 
Density based traffic light controlling (2)
Density based traffic light controlling (2)Density based traffic light controlling (2)
Density based traffic light controlling (2)
hardik1240
 

Similar to ASIS Poster - Final (20)

Considerations When Choosing A Smart Camera
Considerations When Choosing A Smart CameraConsiderations When Choosing A Smart Camera
Considerations When Choosing A Smart Camera
 
Considerations When Choosing A Smart Camera
Considerations When Choosing A Smart CameraConsiderations When Choosing A Smart Camera
Considerations When Choosing A Smart Camera
 
Wireless Multimedia Sensor Network: A Survey on Multimedia Sensors
Wireless Multimedia Sensor Network: A Survey on Multimedia SensorsWireless Multimedia Sensor Network: A Survey on Multimedia Sensors
Wireless Multimedia Sensor Network: A Survey on Multimedia Sensors
 
41640 smart cams
41640 smart cams41640 smart cams
41640 smart cams
 
DISTRIBUTED SYSTEM FOR 3D REMOTE MONITORING USING KINECT DEPTH CAMERAS
DISTRIBUTED SYSTEM FOR 3D REMOTE MONITORING USING KINECT DEPTH CAMERASDISTRIBUTED SYSTEM FOR 3D REMOTE MONITORING USING KINECT DEPTH CAMERAS
DISTRIBUTED SYSTEM FOR 3D REMOTE MONITORING USING KINECT DEPTH CAMERAS
 
Top 16 Applications of Computer Vision in Video Surveillance and Security.pdf
Top 16 Applications of Computer Vision in Video Surveillance and Security.pdfTop 16 Applications of Computer Vision in Video Surveillance and Security.pdf
Top 16 Applications of Computer Vision in Video Surveillance and Security.pdf
 
F3D Corporate Snapshot V3.1 - Generic GP
F3D Corporate Snapshot V3.1 - Generic GPF3D Corporate Snapshot V3.1 - Generic GP
F3D Corporate Snapshot V3.1 - Generic GP
 
FACE COUNTING USING OPEN CV & PYTHON FOR ANALYZING UNUSUAL EVENTS IN CROWDS
FACE COUNTING USING OPEN CV & PYTHON FOR ANALYZING UNUSUAL EVENTS IN CROWDSFACE COUNTING USING OPEN CV & PYTHON FOR ANALYZING UNUSUAL EVENTS IN CROWDS
FACE COUNTING USING OPEN CV & PYTHON FOR ANALYZING UNUSUAL EVENTS IN CROWDS
 
Density based traffic light controlling (2)
Density based traffic light controlling (2)Density based traffic light controlling (2)
Density based traffic light controlling (2)
 
Whitepaper networkvideo safecity
Whitepaper networkvideo safecityWhitepaper networkvideo safecity
Whitepaper networkvideo safecity
 
Whitepaper networkvideo safecity
Whitepaper networkvideo safecityWhitepaper networkvideo safecity
Whitepaper networkvideo safecity
 
Dell NVIDIA AI Roadshow - South Western Ontario
Dell NVIDIA AI Roadshow - South Western OntarioDell NVIDIA AI Roadshow - South Western Ontario
Dell NVIDIA AI Roadshow - South Western Ontario
 
Real Time Video Processing in FPGA
Real Time Video Processing in FPGA Real Time Video Processing in FPGA
Real Time Video Processing in FPGA
 
Remote HD and 3D image processing challenges in Embedded Systems
Remote HD and 3D image processing challenges in Embedded SystemsRemote HD and 3D image processing challenges in Embedded Systems
Remote HD and 3D image processing challenges in Embedded Systems
 
Design and Implementation of Smart Bell Notification System using IoT
Design and Implementation of Smart Bell Notification System using IoTDesign and Implementation of Smart Bell Notification System using IoT
Design and Implementation of Smart Bell Notification System using IoT
 
Design and Implementation of Smart Bell Notification System using IoT
Design and Implementation of Smart Bell Notification System using IoT	Design and Implementation of Smart Bell Notification System using IoT
Design and Implementation of Smart Bell Notification System using IoT
 
IRJET - Automatic Toll E-Tickting System for Transportation and Finding o...
IRJET -  	  Automatic Toll E-Tickting System for Transportation and Finding o...IRJET -  	  Automatic Toll E-Tickting System for Transportation and Finding o...
IRJET - Automatic Toll E-Tickting System for Transportation and Finding o...
 
IRJET- Smart Surveillance Cam using Face Recongition Alogrithm
IRJET-  	  Smart Surveillance Cam using Face Recongition AlogrithmIRJET-  	  Smart Surveillance Cam using Face Recongition Alogrithm
IRJET- Smart Surveillance Cam using Face Recongition Alogrithm
 
Applications of Fuzzy Logic in Image Processing – A Brief Study
Applications of Fuzzy Logic in Image Processing – A Brief StudyApplications of Fuzzy Logic in Image Processing – A Brief Study
Applications of Fuzzy Logic in Image Processing – A Brief Study
 
Mitchell Reifel (pmdtechnologies ag): pmd Time-of-Flight – the Swiss Army Kni...
Mitchell Reifel (pmdtechnologies ag): pmd Time-of-Flight – the Swiss Army Kni...Mitchell Reifel (pmdtechnologies ag): pmd Time-of-Flight – the Swiss Army Kni...
Mitchell Reifel (pmdtechnologies ag): pmd Time-of-Flight – the Swiss Army Kni...
 

ASIS Poster - Final

  • 1. Abstract Like Intelligent IPCameras, Autonomous driving cars rely on built-in intelligence using video analytics. Unlike in IPCameras, a bad decision from video analytics software in an automobiles could cause loss of life. Hence cars must overcome accuracy limitations to improve detection accuracy for safety and avoid false alarms to maintain the driver experience. With the recent success in research around several depth perception technologies, and their cost effective availability, surveillance cameras are starting to adopt these highly accurate technologies. The paper sheds light into some of these technologies and how they improve overall analytics accuracy. Problems with Monocular Vision Binocular vision helps humans perceive depth. Still, humans are prone to illusions. Most security cameras are worse. They are monocular and cannot sense depth, so security cameras cannot easily distinguish between a person and his/her shadow or distinguish between two people standing behind each other (problem of occlusion). Monocular vision imposes stringent limits on the accuracy of any video analytics. The “turning lady illusion” is a good example to illustrate this problem. The mid-section of the lady in the video has no associated depth information. In observing the illusion, humans subconsciously apply various availability and representative heuristics to make an assumption that the mid-section of the lady is either part of the raised leg or part of the leg on the floor. Viewers who pursue a decision path that associates the mid-section to the lady’s raised leg concludes that the lady is rotating clockwise while the rest would judge that the lady is rotating counterclockwise. Automotive Vision Technologies Improve Accuracy of Intelligent IPCameras Jacob Jose,Product Manager, Embedded Processors, Texas Instruments Depth sensing solutions for IPCameras Processor solutions for intelligent IPCameras that use depth sensing need to incorporate power efficient signal processing as well as peripheral interfaces that connect to various sensing devices. The processors need an optimal signal processing capability that balances cost, processing power and heat dissipation. Integrated processor solutions hence need a combination of ARM and DSP processors, vector engines and dedicated ASICs. Texas Instruments DaVinci DM8127 integrates an inbuilt DSP engine and a vision co-processor for depth sensing and fusion, in addition to an ARM processor and several ASIC engine blocks. DSPs are ideal for raw vision processing and also comes with the programming flexibility that is usually associated with a general purpose processor. Compared to an ARM processor, a DSP consumes much less power and dissipate less heat for the same MIPS consumed. Structured Lighting using Digital Light Processing (DLP) technology DLP technology relies on structured lighting to illuminate a grid on the scene and observe the distortions in the grid to perceive depth. DLP offers significantly higher resolution than TOF technology, but has limited range. Effective use of temporal information could increase the accuracy of analytics in certain scenarios through background modelling. Video Analytics algorithms offered with Texas Instruments DaVinci DMVA and DM81xx processors offer accuracies that approach these theoretical limits. Use of Depth in Intelligent IPCameras Use of depth perception to augment video analytics algorithms will reduce false detections in Intelligent IPCameras by 5-10x. Figure 2 below shows an example where stereo sensors are used for depth perception. The hand of the subject is clearly marked out by the analytics engine despite the hand being part of the bigger subject and within a crowd of details. It may also be noted that fingers are further marked with a different color that represents a higher proximity, augmenting the accuracy of information. Techniques like auto-calibration, noise cleanup and fusion with redundant depth information from alternate sensors are now highly refined in automotive applications. Increased application of these technologies in video surveillance will usher in a new generation of highly intelligent IPCameras. Depth sensing by self­driving cars Smart and autonomous driving cars often have more than 10 cameras in a single automobile. However, as shown in the Table 1 below, a combination of various non-visual sensors are often employed perceive depth accurately in different environments For example, based on Table 1, a combination of stereo vision systems and a redundant mid-range radar can effectively detect pedestrians, while long range radar is required for adaptive cruise control. Different technologies vary in accuracy, range and cost. Hence a combination of diverse technologies are needed for different applications. Figure 4 – Designing an optimal vision processing pipeline Typical radar systems use either a CSI-2 or JESD interface to transmit depth data to host processor. The DM8127 processor also integrates a MIPI interface. Some of the other DSP processors from TI for the market integrate a JESD interface, enabling seamless connection to radar devices without using additional glue logic. Various depth sensing solutions, especially those that are commonly used in the automotive market, offer unique advantages to different IPCamera applications. As with smart cars, a combination of technologies are required to solve the diverse product needs in the intelligent camera market. For instance, Structured Lighting or Time of Flight sensors offer good accuracy in dimly lit indoor environments, but may not be effective outdoors where solar radiation is abundant. On the other hand, stereo sensors that may struggle indoors are highly effective outdoors due to the more beneficial lighting conditions. Among the prominent depth sensing technologies, two technologies are explored below: Time of Flight (TOF) Technology Time of flight technology relies on flooding a scene with modulated pulses, often infrared or laser. The reflections of each of the pulses are then timed to accurately calculate the distance from the object that the pulses have been reflected from. TOF technology can be employed very cost effectively for near-field perception. Using laser or higher levels of illumination (more LEDs) will increase the depth of field or resolution of the sensor, at a cost of increased power consumption. Originally used for gesture recognition and accuracy assistance in automotive surround view systems, TOF technology is very effective to perceive depth in dark environments, where stereo can be ineffective due to lack of illumination. TOF resolution is inversely proportional to the distance from the sensor, making it an ideal technology for use in small rooms. On the other hand, TOF may not be appropriate for outdoor cameras since solar radiation will negatively impact accuracy. Large warehouse rooms or parking lots that require a large depth of field are also likely to need one of the alternate technologies. Figure 5 – TOF Principle of Operation Figure 6 – Depth Resolution vs Distance Texas Instruments offers DS325, a depth sense tool kit with an LED driver, an analog front end, a Time of Flight controller, sensor and associated power components. A typical block diagram is in figure 7. More information can be found here: http://www.ti.com/ww/en/analog/3dtof/index.shtml Figure 8 – Principle of operation of DLP structured lighting and application results Texas Instruments offers DLP4500NIR Digital Micro-Mirror Device (DMD) and controllers, DLPC200 and DLPC300 for this market. An added advantage with DLP technology is that investment in DLP technology can often be leveraged in security, retail and machine vision markets, benefiting firms with interests in more than one of these markets. More information on DLP technology can be found here: http://www.ti.com/solution/3d_machine_vision A table comparing four of the depth sensing technologies is outlined below. Common applications other than Intelligent Cameras are also listed: Intelligent IPCamera Reference Designs Texas Instruments, in association with its partners, offers a combination of reference design and turnkey solutions to help TI’s Intelligent Camera customers get to market quickly using some of the latest available technology. Table 2 – Comparison of various depth sensing technologies TI’s Intelligent IPCamera reference designs bundle 5 commonly used video analytics algorithms within in a scalable software framework. These 5 algorithms are People counting, Trip zone, Streaming metadata, Tamper detection and Motion detection. A set of performance optimized vision API libraries - VLIB for DSPs and VCOPLIB for vision co-processors - offers a strong foundation for vision software. The 5 bundled algorithms take advantage of the optimized libraries. TI's customers further differentiate through designing their own analytics functions using APIs like discrepancy mapping (for depth sensing) included in these libraries. More information on these reference designs are available at www.ti.com/ipcamera. Texas Instruments (TI) is a leader in embedded processor and analog technology for autonomous driving cars. As the automotive technologies made available by TI become mainstream in the differentiated intelligent camera market, early adopters stand to gain significantly. Figure 9 ­ Figure 10 ­ Five high level video analytics algorithms bundled with TI DMVA processors Figure 11 ­ DMVA Analytics Software Framework