Buildings Recognition and Camera Localization Using Image Texture Description
Upcoming SlideShare
Loading in...5
×
 

Buildings Recognition and Camera Localization Using Image Texture Description

on

  • 495 views

3D GIS model/2D image registration called much attention in the recent years because of its vast range of potential applications in real and virtual navigation. However, automatic registration remains ...

3D GIS model/2D image registration called much attention in the recent years because of its vast range of potential applications in real and virtual navigation. However, automatic registration remains until now a challenge. This paper presents a methodology for enhancing and complementing a GIS database of buildings with a descriptor of their texture by using information extracted from a reference images. This descriptor is used to locate any other image by searching similar texture in the image. Then the absolute position and orientation of the camera of the new image can be computed if the camera parameters (like focal length) are known. The paper proposes a technique that can be used for achieving the identification of the facade in the photograph, calibrated camera geolocation and discusses the quality of the results.

Statistics

Views

Total Views
495
Views on SlideShare
495
Embed Views
0

Actions

Likes
0
Downloads
6
Comments
0

0 Embeds 0

No embeds

Accessibility

Categories

Upload Details

Uploaded via as Microsoft PowerPoint

Usage Rights

© All Rights Reserved

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Processing…
Post Comment
Edit your comment
  • The results are not always good, because of the error in the 3D model, the complexity of urban environment and the collusion Sourimant et al., 2009. GPS, GIS AND VIDEO REGISTRATION FOR BUILDING RECONSTRUCTION
  • The results are not always good, because of the error in the 3D model, the complexity of urban environment and the collusion Sourimant et al., 2009. GPS, GIS AND VIDEO REGISTRATION FOR BUILDING RECONSTRUCTION
  • The results are not always good, because of the error in the 3D model, the complexity of urban environment and the collusion Sourimant et al., 2009. GPS, GIS AND VIDEO REGISTRATION FOR BUILDING RECONSTRUCTION
  • SURF is faster than SIFT descriptor. But our tests show that the SIFT descriptor gives better results when the angle of view vary between two images SIFT code of VLFEAT and SURF code from OPENSURF. http://www.vlfeat.org/~vedaldi/code/sift.html http://www.mathworks.com/matlabcentral/fileexchange/28300
  • SURF is faster than SIFT descriptor. But our tests show that the SIFT descriptor gives better results when the angle of view vary between two images SIFT code of VLFEAT and SURF code from OPENSURF. http://www.vlfeat.org/~vedaldi/code/sift.html http://www.mathworks.com/matlabcentral/fileexchange/28300
  • SURF is faster than SIFT descriptor. But our tests show that the SIFT descriptor gives better results when the angle of view vary between two images SIFT code of VLFEAT and SURF code from OPENSURF. http://www.vlfeat.org/~vedaldi/code/sift.html http://www.mathworks.com/matlabcentral/fileexchange/28300
  • The results are not always good, because of the error in the 3D model, the complexity of urban environment and the collusion Sourimant et al., 2009. GPS, GIS AND VIDEO REGISTRATION FOR BUILDING RECONSTRUCTION
  • The level of the recognition based on the number of the interest points and their discriminative descriptor
  • The level of the recognition based on the number of the interest points and their discriminative descriptor
  • The level of the recognition based on the number of the interest points and their discriminative descriptor
  • The results are not always good, because of the error in the 3D model, the complexity of urban environment and the collusion Sourimant et al., 2009. GPS, GIS AND VIDEO REGISTRATION FOR BUILDING RECONSTRUCTION
  • With four points (three non collinear scene points or non collinear image points). (Yang et al. 2009) is applied. It is based on the homography matrix between the front wall in the query image and the 3D model. The best result is obtained when the camera is located just in front of the building at a distance less than 100 meters. The quality falls down steeply also if the distance between the camera and the building exceeds 70 m (low resolution), and the view angle is above 30 degrees (non affine descriptor).
  • With four points (three non collinear scene points or non collinear image points). (Yang et al. 2009) is applied. It is based on the homography matrix between the front wall in the query image and the 3D model. The best result is obtained when the camera is located just in front of the building at a distance less than 100 meters. The quality falls down steeply also if the distance between the camera and the building exceeds 70 m (low resolution), and the view angle is above 30 degrees (non affine descriptor).
  • We need at least 4 non coplanar points. The SOFTPOSIT algorithm (David et al. 2004) is used which has a high efficiency If one of the discovered facades shows an angle above 40 degrees, this facade will cause more error in the computation and contribute to the drop down of accuracy in the position of the camera
  • The results are not always good, because of the error in the 3D model, the complexity of urban environment and the collusion Sourimant et al., 2009. GPS, GIS AND VIDEO REGISTRATION FOR BUILDING RECONSTRUCTION
  • Approximate position and orientation of the camera are the first step of an automatic registration process Urban environment with low GPS precision (Li et al. 2006) (Bioret et al. 2009). Affine descriptor (Mikolajczyk & Schmid 2004) for the facade texture. This could greatly ameliorate the result for multiple facades detection If the method should be applied on Smartphones with continue video data, the SURF descriptor could perform better than SIFT because it is faster ( http://www.kooaba.com/ )
  • Approximate position and orientation of the camera are the first step of an automatic registration process Urban environment with low GPS precision (Li et al. 2006) (Bioret et al. 2009). Affine descriptor (Mikolajczyk & Schmid 2004) for the facade texture. This could greatly ameliorate the result for multiple facades detection If the method should be applied on Smartphones with continue video data, the SURF descriptor could perform better than SIFT because it is faster ( http://www.kooaba.com/ )
  • Approximate position and orientation of the camera are the first step of an automatic registration process Urban environment with low GPS precision (Li et al. 2006) (Bioret et al. 2009). Affine descriptor (Mikolajczyk & Schmid 2004) for the facade texture. This could greatly ameliorate the result for multiple facades detection If the method should be applied on Smartphones with continue video data, the SURF descriptor could perform better than SIFT because it is faster ( http://www.kooaba.com/ )
  • The results are not always good, because of the error in the 3D model, the complexity of urban environment and the collusion Sourimant et al., 2009. GPS, GIS AND VIDEO REGISTRATION FOR BUILDING RECONSTRUCTION

Buildings Recognition and Camera Localization Using Image Texture Description Buildings Recognition and Camera Localization Using Image Texture Description Presentation Transcript

  • Buildings Recognition and Camera Localization Using Image Texture Description SULEIMAN Wassim 1 , JOLIVEAU Thierry 1 , FAVIER Eric 2 1 ISTHME-ISIG CNRS/UMR EVS, Université Jean Monnet - Saint-Etienne. 2 DIPI EA 3719 École Nationale d'Ingénieurs de Saint-Etienne [email_address] [email_address] [email_address] 25th International Cartographic Conference (Sageo) – 8 july 2011 – Palais de congrès Paris
  • Objective Find a building in an image SIG 3D 3D GIS Locate the camera that took the image by using the location of the building
  • Methodology
    • Enhancing the GIS database with information which can describe the building
    • unique information
    • quantifiable information
      • The texture signature
    Texture signature
  • How to isolate the building facade in the image?
    • Manual method (long)
    Sourimant 2009
    • Automatic method : (3D SIG model/2D image) registration (simple building)
  • Work plan
    • GIS database enhancement with building texture information
    • Facade recognition
    • Camera geolocation
    • Possible applications
    • Limits
  • Work plan
    • Enhancing GIS databases with building texture information
    • Facade recognition
    • Camera Geolocation
    • Possible Applications
    • Limits
  • Enhancing GIS databases with building texture information Texture analyses (SIFT) Finding the interest points with their local descriptor
  • Enhancing GIS databases with building texture information Finding the (x,y,z) of the interest points Homography constraints 3D GIS model
  • Enhancing GIS databases with building texture information The texture descriptor : list of interest points with their local descriptor and their 3D position
  • Work plan
    • Enhancing GIS databases with building texture information
    • Facade recognition
    • Camera Geolocation
    • Possible Applications
    • Limits
  • Facade recognition False matching because of the locality of the descriptor
  • Facade recognition
    • Eliminate the false matching using the homography constraints
    • Select the best matching score between the current image and the stored descriptor in the databases
  • Facade recognition The facade in the 3D GIS
  • Work plan
    • Enhancing the GIS database with building texture information
    • Facade recognition
    • Camera geolocation
    • Possible applications
    • Limits
  • Camera Geolocation Association of the interest points with the 3D position of the matched points in the GIS databases
  • Camera geolocation 4 points non-collinear (Yang & al. 2009) Real position Measured position
  • Camera Geolocation
    • Error for distance (20-100)m and angle (0-30°) camera direction and facade normal :
      • Position : 1 - 3 m
      • Orientation : 5 - 10°
    • Error because the texture description is not an affine function
    4 points non-coplanaires SOFTPOSIT (David et al. 2004) Camera Geolocation Real position Measured position
  • Work plan
    • Enhancing the GIS database with building texture information
    • Facade recognition
    • Camera geolocation
    • Possible applications
    • Limits
  • Possible Applications
    • Management of photos taken in urban areas
    Link GIS
  • Possible Applications
    • Navigation systems support in an urban environment
      • Satellites visibility
      • Multipath
  • Possible applications
    • Initial phase in the (2D/3D) registration
    Sourimant 2009
  • Work plan
    • Enhancing the GIS database with building texture information
    • Facade recognition
    • Camera geolocation
    • Possible applications
    • Limits
  • Limits
    • Angle between camera direction and facade normal has to be less than 30°
  • Limits
    • Distance between camera and facade has to be less than 200 m
  • Limits
    • Identical facades :
  • Limits
    • Glass facade which reflects the sky and other buildings
  • Thank you For your attention Suleiman wassim [email_address]