WO2011109856A1 - Sensor data processing - Google Patents

Sensor data processing Download PDF

Info

Publication number
WO2011109856A1
WO2011109856A1 PCT/AU2011/000205 AU2011000205W WO2011109856A1 WO 2011109856 A1 WO2011109856 A1 WO 2011109856A1 AU 2011000205 W AU2011000205 W AU 2011000205W WO 2011109856 A1 WO2011109856 A1 WO 2011109856A1
Authority
WO
WIPO (PCT)
Prior art keywords
image
point
scene
value
laser
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/AU2011/000205
Other languages
French (fr)
Inventor
Thierry Peynot
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
University of Sydney
Original Assignee
University of Sydney
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by University of Sydney filed Critical University of Sydney
Priority to EP11752738.2A priority Critical patent/EP2545707A4/en
Priority to AU2011226732A priority patent/AU2011226732A1/en
Priority to US13/583,456 priority patent/US20130058527A1/en
Publication of WO2011109856A1 publication Critical patent/WO2011109856A1/en
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G01MEASURING; TESTING
    • G01SRADIO DIRECTION-FINDING; RADIO NAVIGATION; DETERMINING DISTANCE OR VELOCITY BY USE OF RADIO WAVES; LOCATING OR PRESENCE-DETECTING BY USE OF THE REFLECTION OR RERADIATION OF RADIO WAVES; ANALOGOUS ARRANGEMENTS USING OTHER WAVES
    • G01S17/00Systems using the reflection or reradiation of electromagnetic waves other than radio waves, e.g. lidar systems
    • G01S17/88Lidar systems specially adapted for specific applications
    • G01S17/89Lidar systems specially adapted for specific applications for mapping or imaging
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01SRADIO DIRECTION-FINDING; RADIO NAVIGATION; DETERMINING DISTANCE OR VELOCITY BY USE OF RADIO WAVES; LOCATING OR PRESENCE-DETECTING BY USE OF THE REFLECTION OR RERADIATION OF RADIO WAVES; ANALOGOUS ARRANGEMENTS USING OTHER WAVES
    • G01S17/00Systems using the reflection or reradiation of electromagnetic waves other than radio waves, e.g. lidar systems
    • G01S17/86Combinations of lidar systems with systems other than lidar, radar or sonar, e.g. with direction finders
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01SRADIO DIRECTION-FINDING; RADIO NAVIGATION; DETERMINING DISTANCE OR VELOCITY BY USE OF RADIO WAVES; LOCATING OR PRESENCE-DETECTING BY USE OF THE REFLECTION OR RERADIATION OF RADIO WAVES; ANALOGOUS ARRANGEMENTS USING OTHER WAVES
    • G01S7/00Details of systems according to groups G01S13/00, G01S15/00, G01S17/00
    • G01S7/48Details of systems according to groups G01S13/00, G01S15/00, G01S17/00 of systems according to group G01S17/00
    • G01S7/497Means for monitoring or calibrating
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/70Determining position or orientation of objects or cameras
    • G06T7/73Determining position or orientation of objects or cameras using feature-based methods
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01SRADIO DIRECTION-FINDING; RADIO NAVIGATION; DETERMINING DISTANCE OR VELOCITY BY USE OF RADIO WAVES; LOCATING OR PRESENCE-DETECTING BY USE OF THE REFLECTION OR RERADIATION OF RADIO WAVES; ANALOGOUS ARRANGEMENTS USING OTHER WAVES
    • G01S17/00Systems using the reflection or reradiation of electromagnetic waves other than radio waves, e.g. lidar systems
    • G01S17/88Lidar systems specially adapted for specific applications
    • G01S17/93Lidar systems specially adapted for specific applications for anti-collision purposes
    • G01S17/931Lidar systems specially adapted for specific applications for anti-collision purposes of land vehicles
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/10Image acquisition modality
    • G06T2207/10004Still image; Photographic image
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/10Image acquisition modality
    • G06T2207/10028Range image; Depth image; 3D point clouds
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20112Image segmentation details
    • G06T2207/20164Salient point detection; Corner detection
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20212Image combination
    • G06T2207/20221Image fusion; Image merging
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/30Subject of image; Context of image processing
    • G06T2207/30248Vehicle exterior or interior
    • G06T2207/30252Vehicle exterior; Vicinity of vehicle
    • G06T2207/30261Obstacle

Definitions

  • the present invention relates to processing of sensor data.
  • the present invention relates to the processing of data corresponding to respective images of a scene generated using two respective sensors.
  • the term "perception” relates to an autonomous vehicle obtaining information about its environment and current state through the use of various sensors.
  • Conventional perception systems tend to fail in a number of situations.
  • conventional systems tend to fail in challenging environmental conditions, for example in environments where smoke or airborne dust is present.
  • a typical problem that arises in such cases is that of a laser range finder tending to detect a dust cloud as much as it detects an obstacle.
  • This results in conventional perception systems tending to identify the dust or smoke as an actual obstacle.
  • the ability of an autonomous vehicle may be adversely affected because obstacles that are not present have been identified by the vehicle's perception system.
  • the present invention provides a method of processing sensor data, the method comprising measuring a value of a first parameter of a scene using a first sensor to produce a first image of the scene, measuring a value of a second parameter of the scene using a second sensor to produce a second image of the scene, identifying a first point, the first point being a point of the first image that corresponds to a class of features of the scene, identifying a second point, the second point being a point of the second image that corresponds to the class of features, projecting the second point onto the first image, determining a similarity value between the first point and the projection of the second point on to the first image, and comparing the determined similarity value to a predetermined threshold value.
  • the similarity value may be a value related to a distance in the first image between the first point and the projection of the second point on to the first image.
  • the method may further comprise defining a neighbourhood in the second image around the second point, and projecting the neighbourhood onto the first image, wherein the step of identifying the first point comprises identifying the first point such that the first point lies within the projection of the neighbourhood onto the first image .
  • the step of determining a value related to a distance may comprise defining a probability distribution mask over the projection of the neighbourhood in the first image, the probability distribution mask being centred on the projection of the second point on the first image, and determining a value of the probability distribution mask at the first point.
  • the first parameter may be different to the second parameter .
  • the first sensor may be a different type of sensor to the second sensor.
  • the first parameter may be light intensity
  • the first sensor type may be a camera
  • the second parameter may be range
  • the second sensor type may be a laser scanner .
  • the method may further comprise calibrating the second image of the scene with respect to the first image of the scene.
  • the step of calibrating the second image of the scene with respect to the first image of the scene may comprise determining a transformation to project points in the second image to corresponding points in the first image.
  • a step of projecting may be performed using the determined transformation.
  • the similarity value may be a value of a probability that the second image corresponds to the first image.
  • the probability may be calculated using the following formula :
  • A is the event that the second image corresponds to the first image
  • the present invention provides apparatus for processing sensor data, the apparatus comprising a first sensor for measuring a value of a first parameter of a scene to produce a first image of the scene, a second sensor for measuring a value of a second parameter of the scene to produce a second image of the scene, and one or more processors arranged to: identify a first point, the first point being a point of the first image that corresponds to a class of features of the scene, identify a second point, the second point being a point of the second image that corresponds to the class of features, project the second point onto the first image, determine a similarity value between the first point and the projection of the second point on to the first image, and compare the determined similarity value to a predetermined threshold value.
  • the similarity value may be a value related to a distance in the first image between the first point and the projection of the second point on to the first image.
  • the present invention provides an autonomous vehicle comprising the apparatus of the above aspect.
  • the present invention provides a computer program or plurality of computer programs arranged such that when executed by a computer system it/they cause the computer system to operate in accordance with the method of any of the above aspects.
  • the present invention provides a machine readable storage medium storing a computer program or at least one of the plurality of computer programs according to the above aspect.
  • Figure 1 is a schematic illustration (not to scale) of an example scenario in which an embodiment of a process for improving perception integrity is implemented.
  • Figure 2 is a process flow chart showing certain steps of an embodiment of the process for improving perception integrity.
  • Figure 1 is a schematic illustration (not to scale) of an example scenario 1 in which an embodiment of a process for improving perception integrity is implemented.
  • perception is used herein to refer to a process by which a vehicle's sensors are used to perform measurements of the vehicle's surroundings and process these measurements in order to enable the vehicle to successfully navigate through the surroundings.
  • the process for improving perception integrity is described in more detail later below with reference to Figure 2.
  • a vehicle 2 comprises a camera 4, a laser scanner 6 and a processor 8.
  • the camera 4 and the laser scanner 6 are each coupled to the processor 8.
  • the vehicle 2 is a land-based vehicle.
  • the vehicle 2 performs autonomous navigation within its surroundings.
  • the vehicle's surroundings comprise a plurality of obstacles, which are represented in Figure 1 by a single box and indicated by the reference numeral 10.
  • the autonomous navigation of the vehicle 2 is facilitated by measurements made by the vehicle 2 of the obstacles 10. These measurements are made using the camera 4 and the laser scanner 6.
  • the camera 4 takes light intensity measurements of the obstacles 10 from the vehicle.
  • This intensity data (hereinafter referred to as “camera data”) is sent from the camera 4 to the processor 8.
  • the camera data is, in effect, a visual image of the obstacles 10 and is hereinafter referred to as the "camera image”.
  • the camera 4 is a conventional camera.
  • the laser scanner 6 takes range or bearing measurements of the obstacles 10 from the vehicle 2. This range data (hereinafter referred to as “laser data”) is sent from the laser scanner 6 to the processor 8.
  • the laser data is, in effect, an image of the obstacles 10 and/or the dust cloud 12. This image is hereinafter referred to as the "laser scan”.
  • the laser scanner 6 is a conventional laser scanner.
  • the camera image and the laser scan are continuously acquired and time-stamped.
  • images and scans may be acquired on intermittent bases.
  • time- stamping need not be employed, and instead any other suitable form of time-alignment or image/scan association may be used.
  • the camera image and the laser scan are processed by the processor 8 to enable the vehicle 2 to navigate within its surroundings.
  • the processor 8 compares a laser scan to a camera image, the laser scan being the closest laser scan in time (e.g. based on the time-stamping) to the camera image, as described in more detail below.
  • the images of the obstacles 10 generated using the camera 4 and the laser scanner 6 are made through a dust cloud 12.
  • the dust cloud 12 at least partially obscures the obstacles 10 from the camera 4 and/or laser scanner 6 on the vehicle.
  • the presence of the dust cloud 12 affects the measurements taken by the camera 4 and the laser scanner 6 to different degrees.
  • the laser scanner 6 detects the dust cloud 12 the same as it would detect an obstacle, whereas the dust cloud 12 does not significantly affect the measurements of the obstacles 10 taken by the camera 4.
  • Figure 2 is a process flow chart showing certain steps of an embodiment of the process for improving perception integrity.
  • a calibration process is performed on the camera image and the laser scan to determine an estimate of the transformation between the laser frame and the camera frame.
  • This transformation is hereinafter referred to as the "laser-camera transformation".
  • this calibration process is a conventional process.
  • the camera may be calibrated using the camera calibration toolbox for Matlab(TM) developed by Bouget et al .
  • the estimate of the laser-camera transformation may be determined using a conventional technique .
  • This step provides that every laser point whose projection under the laser-camera transformation belongs to the camera image may be projected onto the camera image .
  • Steps s4 to s20 define a process by which a value that is indicative of how well the laser scan and the camera image correspond to each other (after the performance of the laser-camera transformation) is determined .
  • the camera image is converted in to a gray-scale image.
  • this conversion is performed in a conventional manner.
  • an edge detection process is performed on the gray-scale camera image.
  • edge detection in the camera image is performed in a conventional manner using a Sobel filter.
  • the laser scanner 6 is arranged to scan the obstacles 10 in a plane that is substantially horizontal to a ground surface, i.e. scanning is performed in a plane substantially perpendicular to the image plane of the camera image.
  • a filter for detecting vertical edges is used.
  • a different type of filter for detecting different edges may be used, for example a filter designed for detecting horizontal edges may be used in embodiments where the laser scanner is arranged to scan vertically.
  • the filtered image is, in effect, the image that results from the convolution of the grey- scale image with the following mask:
  • an edge in the camera image is indicated by a sudden variation in the intensity of the grey-scale camera image.
  • step s8 gradient values for the range parameter measured by the laser scanner 6 are determined.
  • the gradient values are obtained in a conventional way from the convolution of the laser scan with the following mask:
  • points in the laser scan that correspond to "corners” are identified.
  • the terminology "corner” is used herein to indicate a point at which there is a sudden variation of range along the scan. This may, for example, be caused by the laser scanner 6 measuring the range from the vehicle 2 to an obstacle 10. As the laser scanner 6 scans beyond the edge or corner of the obstacle 10, there is a sudden and possibly significant change in the range values measured by the laser scanner 6.
  • points in the laser scan that correspond to a corner are those points that have a gradient value (determined at step sl4) that has an absolute value that is greater than a first threshold value.
  • This first threshold value is set to be above a noise floor.
  • two successive points of the laser scan correspond to corners, i.e. one laser poi nt either side of the laser scan discontinuity. These form pairs of corners.
  • the laser scan is segmented, i.e. a plurality of segments is defined over the laser scan.
  • a segment of the laser scan comprises all the points between two successive laser corner points.
  • step sl4 one of each of the two laser points (corners) of the pairs identified at step slO is selected.
  • the laser point that corresponds to a shorter range is selected.
  • This selected laser point is the most likely point of the pair to correspond to an edge in the camera image, after proj ection .
  • the selected laser corner points are projected onto the camera image (using the laser-camera transformation) . Also, respective pixel neighbourhoods of uncertainty corresponding to each of the respective projected points are computed.
  • these neighbourhoods of uncertainty are determined in a conventional manner as follows .
  • Points in the laser scan are related to corresponding points in the camera image as follows:
  • P c is a point in the camera image corresponding to the point P l ;
  • is a translation offset
  • is a rotation matrix with Euler angles
  • the laser-camera calibration optimisation (described above at step s2) returns values for ⁇ and ⁇ by minimising the sum of the squares of the normal errors.
  • the normal errors are simply the Euclidean distance of the laser points from the calibration plane in the camera image frame of reference.
  • Jackknife samples are taken from the dataset.
  • the ith Jackknife sample Xi is simply all the data points
  • a different technique of computing the neighbourhoods of uncertainty may be used. For example, a common ' 'calibration object' could be identified in the camera image and laser scan. An edge of this calibration object may then be used to generate a maximum error value, which can be used to define a neighbourhood of uncertainty. The computed neighbourhoods of uncertainty corresponding to each of the respective projected points are also projected onto the camera image (using the laser- camera transformation) .
  • a selected laser corner point, and a respective neighbourhood of uncertainty that surrounds that laser point each have a projected image on the camera image under the laser-camera transformation.
  • the projection of a laser corner point is, a priori, a best estimate of the pixel of the camera image that corresponds to that laser point.
  • the projection of a neighbourhood surrounds the projection of the corresponding laser corner point.
  • step sl8 for each laser corner point projected on to the camera image at step sl6, it is determined whether there is a matching edge in the camera image within the projection of the neighbourhood of uncertainty of that laser point.
  • matching edge in the camera image refers to at least two points (pixels) in the camera image, in two different consecutive lines and connected columns of the relevant neighbourhood of uncertainty, having a Sobel intensity greater than a predefined second threshold value.
  • the matching process of step sl8 comprises identifying a camera image edge within a neighbourhood of a projection of a laser corner point.
  • the matching process comprises identifying points in the camera image within a projection of a neighbourhood, the points having an intensity value greater than the second threshold value.
  • a probability that the laser information corresponds to the information in the camera image acquired at the same time is estimated.
  • the probability of correspondence between the laser and camera image, for a certain projected laser point corresponding to a selected corner is determined using the following formula:
  • A is the event that the laser and camera information correspond
  • is a normalisation factor
  • P(A ⁇ B,C) is the probability that, for a given laser corner point, the laser and camera information correspond given the projection of that laser corner point and given that an edge was found in the projection of the neighbourhood of that projected laser corner point;
  • P(C ⁇ A,B) is the probability of the certain laser data projection on the camera image, given that the laser and camera data correspond, and given that a visual edge was found in the projection of the neighbourhood of the certain laser point projection.
  • This term is directly related to the uncertainty of the projection of the laser point on the image.
  • the value of this term is computed using a Gaussian mask over the neighbourhood of the certain projected laser point. This Gaussian represents the distribution of probability for the position of the laser projected point.
  • P(B I A) is the probability that a visual edge is found in the neighbourhood, given that the laser and camera information do correspond. This term describes the likelihood of the assumption that if the laser and camera information do correspond, then any laser corner should correspond to a visual edge in the camera image. In this embodiment, the value of this term is fixed and close to 1, i.e. knowing the laser and camera data do correspond, a visual edge usually exists.
  • P(A) is the a priori probability that the laser data and camera data correspond.
  • the value of this term is set to a fixed uncertain value. This represents the fact that, in this embodiment, there is no a priori knowledge on that event;
  • the value of the term P(B ⁇ A) is the probability of finding a visual edge anywhere in the camera image (using the process described at step s6 above) ;
  • P(C ⁇ B) P(C ⁇ B,A)P(A)+ P(C ⁇ B,A)P(A) , where only the term remains to be described is P(C ⁇ B,A).
  • This term corresponds to the confidence in the calibration (i.e. quality of the projection) . In this embodiment it is taken as the best chance for the projection, i.e. the probability read at the centre of the neighbourhood of uncertainty (in other words, the maximum probability in the neighbourhood) .
  • the determined values of the probability that the laser information corresponds to the information in the camera image are values related to a distance in the camera image between a camera edge and the projection of a corresponding laser corner on to the camera image.
  • other similarity values i.e. values encapsulating the similarity between the camera edge and the projection of a corresponding laser corner on to the camera image, may be used.
  • a validation process is performed to validate the laser data relative to the camera data.
  • the validation process comprises making a decision about whether each of the laser scan segments corresponds to the camera image data.
  • the corners belonging to this segment have a matching edge in the camera image, i.e. the probabilities for those corners, determined at step s20, are greater than a predefined threshold (hereinafter referred to as the "third threshold value") , then the laser data of that segment is considered to correspond to the camera data, i.e. the laser data is validated and can be combined (or associated) with the camera image data.
  • a predefined threshold hereinafter referred to as the "third threshold value”
  • the laser data of the certain segment is considered to not correspond to the camera data.
  • the data from both types of sensors is treated differently.
  • the laser data is considered as inconsistent with the camera data (i.e. it has been corrupted by the presence of the dust cloud 12) . Therefore, fusion of laser data and camera data is not permitted for the purposes of navigation of the vehicle 2.
  • different validation processes may be used.
  • the laser and camera data can be fused.
  • the fused data is integrated with any other sensing data in a perception system of the vehicle 2.
  • the laser and camera image do not correspond, only the most reliable of the data (in this embodiment the data corresponding to the camera image) is integrated with any other sensing data in a perception system of the vehicle 2. This advantageously avoids the utilising of non-robust data for the purposes of perception.
  • the above described method advantageously tends to provide better perception capabilities of the vehicle 2. In other words, the above described method advantageously tends to increase the integrity of a perception system of the vehicle 2.
  • An advantage of the above described method is that it tends to increase the integrity of a vehicle's perception capabilities in challenging environmental conditions, such as the presence of smoke or airborne dust.
  • the increasing of the integrity of the vehicle's perception capabilities tends to enable the vehicle to navigate better within an environment .
  • the present invention advantageously compares data from laser scans and camera images to detect inconsistencies or discrepancies.
  • these discrepancies arise when the laser scanner 6 detects dust from the dust cloud 12.
  • the effect of this dust tends to be less significant on the visual camera (or infrared) image, at least as long as the density of the dust cloud remains "reasonable”.
  • the method is capable of advantageously identifying that there is a discrepancy between the laser data and the camera data, so that only the relatively unaffected camera data is used for the purposes of navigating the vehicle.
  • a further advantage of the present invention is that a process of comparing laser data (comprising range/bearing information) to camera image data (comprising measurements of intensity, or colour, distributed in space on the camera plane) tends to be provided.
  • laser data comprising range/bearing information
  • camera image data comprising measurements of intensity, or colour, distributed in space on the camera plane
  • common characteristics in the data in particular geometrical characteristics, are compared to provide this advantage.
  • the present invention advantageously tends to exploit redundancies in the observations made by the laser scanner and the camera in order to identify features that correspond to each other in the laser scan and the camera image.
  • an estimate of a likelihood that the sensing data provided by the laser does corresponds to the data in the image is advantageously provided. This allows a decision upon the veracity of the laser data compared to the camera data to be made.
  • a further advantage of the above embodiments is that the detection of discrepancies between laser data and camera data tends to be possible. Moreover, this tends to allow for the detection of misalignment errors, typically when the discrepancies/inconsistencies concern the whole laser scan.
  • the vehicle is a land-based vehicle.
  • the vehicle is a different type of vehicle, for example an aircraft.
  • the vehicle performs autonomous navigation.
  • navigation of the vehicle is not performed autonomously.
  • an embodiment of a method of improving the integrity of perception is used to support/advise a human navigator of a vehicle (e.g. a driver or a pilot) who may be on or remote from the vehicle .
  • the vehicle comprises a laser scanner and a camera.
  • the vehicle comprises any two different heterogeneous sensors, the data from which may be processed according to the method of improving perception integrity as described above.
  • one of the sensors is an infrared camera.
  • An advantage provided by an infrared camera is that resulting images tend not to be significantly affected by the presence of smoke clouds.
  • the laser scan of the vehicles surroundings is affected by the presence of the dust cloud (i.e. the laser scanner measures range values from the vehicle to the dust cloud as opposed to range values from the vehicle to the obstacles) .
  • the laser scan is affected by a different entity, for example smoke, cloud, or fog.
  • the process may also be used advantageously in situations in which there are no dust clouds etc.
  • the likelihood of correspondence of laser and camera data is determined by identifying laser corner points and matching edges in the camera image.
  • different features of the respective images may be used.
  • other points of a laser segment i.e. points not corresponding to corners
  • an inference process may need to be used in addition to the above described method steps in order to accurately check the consistency of the laser/camera images.
  • a probability value is determined to indicate the probability that a certain laser corner point corresponds to a matched edge in the camera image.
  • a different appropriate metric indicative of the extent to which a certain laser corner point corresponds to a matched edge in the camera image is used.
  • a decision about whether or not the laser scan and the camera image correspond to one another is dependent on probability values that certain laser corner points correspond to respective matched edges in the camera image. However, in other embodiments this decision is based upon different appropriate criteria.
  • Apparatus including the processor, for performing the method steps described above, may be provided by an apparatus having components on the vehicle, external to the vehicle, or by an apparatus having some components on the vehicle and others remote from the vehicle. Also, the apparatus may be provided by configuring or adapting any suitable apparatus, for example one or more computers or other processing apparatus or processors, and/or providing additional modules.
  • the apparatus may comprise a computer, a network of computers, or one or more processors, for implementing instructions and using data, including instructions and data in the form of a computer program or plurality of computer programs stored in or on a machine readable storage medium such as computer memory, a computer disk, ROM, PROM etc., or any combination of these or other storage media.

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Radar, Positioning & Navigation (AREA)
  • Remote Sensing (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Electromagnetism (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Theoretical Computer Science (AREA)
  • Traffic Control Systems (AREA)
  • Image Analysis (AREA)

Abstract

A method and apparatus for processing sensor data comprising measuring a value of a first parameter of a scene (10) using a first sensor (4) (e.g. a camera) to produce a first image of the scene (10), measuring a value of a second parameter of the scene (10) using a second sensor (6) (e.g. a laser scanner) to produce a second image, identifying a first point of the first image that corresponds to a class of features of the scene (10), identifying a second point of the second image that corresponds to the class of features, projecting the second point onto the first image, determining a similarity value between the first point and the projection of the second point on to the first image, and comparing the determined similarity value to a predetermined threshold value. The method or apparatus may be used on an autonomous vehicle (2).

Description

SENSOR DATA PROCESSING
FIELD OF THE INVENTION
The present invention relates to processing of sensor data. In particular, the present invention relates to the processing of data corresponding to respective images of a scene generated using two respective sensors.
BACKGROUND
In the field of autonomous vehicles, the term "perception" relates to an autonomous vehicle obtaining information about its environment and current state through the use of various sensors.
Conventional perception systems tend to fail in a number of situations. In particular, conventional systems tend to fail in challenging environmental conditions, for example in environments where smoke or airborne dust is present. A typical problem that arises in such cases is that of a laser range finder tending to detect a dust cloud as much as it detects an obstacle. This results in conventional perception systems tending to identify the dust or smoke as an actual obstacle. Thus, the ability of an autonomous vehicle may be adversely affected because obstacles that are not present have been identified by the vehicle's perception system.
SUMMARY OF THE INVENTION
In a first aspect, the present invention provides a method of processing sensor data, the method comprising measuring a value of a first parameter of a scene using a first sensor to produce a first image of the scene, measuring a value of a second parameter of the scene using a second sensor to produce a second image of the scene, identifying a first point, the first point being a point of the first image that corresponds to a class of features of the scene, identifying a second point, the second point being a point of the second image that corresponds to the class of features, projecting the second point onto the first image, determining a similarity value between the first point and the projection of the second point on to the first image, and comparing the determined similarity value to a predetermined threshold value.
The similarity value may be a value related to a distance in the first image between the first point and the projection of the second point on to the first image.
The method may further comprise defining a neighbourhood in the second image around the second point, and projecting the neighbourhood onto the first image, wherein the step of identifying the first point comprises identifying the first point such that the first point lies within the projection of the neighbourhood onto the first image .
The step of determining a value related to a distance may comprise defining a probability distribution mask over the projection of the neighbourhood in the first image, the probability distribution mask being centred on the projection of the second point on the first image, and determining a value of the probability distribution mask at the first point.
The first parameter may be different to the second parameter .
The first sensor may be a different type of sensor to the second sensor. The first parameter may be light intensity, the first sensor type may be a camera, the second parameter may be range, and/or the second sensor type may be a laser scanner .
The method may further comprise calibrating the second image of the scene with respect to the first image of the scene.
The step of calibrating the second image of the scene with respect to the first image of the scene may comprise determining a transformation to project points in the second image to corresponding points in the first image.
A step of projecting may be performed using the determined transformation.
The similarity value may be a value of a probability that the second image corresponds to the first image.
The probability may be calculated using the following formula :
Figure imgf000004_0001
where: A is the event that the second image corresponds to the first image;
B is the event that the first point lies within the projection of the neighbourhood onto the first image;
C is the projection of the second point onto the first image; and
η is a normalisation factor. In a further aspect, the present invention provides apparatus for processing sensor data, the apparatus comprising a first sensor for measuring a value of a first parameter of a scene to produce a first image of the scene, a second sensor for measuring a value of a second parameter of the scene to produce a second image of the scene, and one or more processors arranged to: identify a first point, the first point being a point of the first image that corresponds to a class of features of the scene, identify a second point, the second point being a point of the second image that corresponds to the class of features, project the second point onto the first image, determine a similarity value between the first point and the projection of the second point on to the first image, and compare the determined similarity value to a predetermined threshold value.
The similarity value may be a value related to a distance in the first image between the first point and the projection of the second point on to the first image.
In a further aspect, the present invention provides an autonomous vehicle comprising the apparatus of the above aspect.
In a further aspect, the present invention provides a computer program or plurality of computer programs arranged such that when executed by a computer system it/they cause the computer system to operate in accordance with the method of any of the above aspects.
In a further aspect, the present invention provides a machine readable storage medium storing a computer program or at least one of the plurality of computer programs according to the above aspect. BRIEF DESCRIPTION OF THE DRAWINGS
Figure 1 is a schematic illustration (not to scale) of an example scenario in which an embodiment of a process for improving perception integrity is implemented; and
Figure 2 is a process flow chart showing certain steps of an embodiment of the process for improving perception integrity.
DETAILED DESCRIPTION
Figure 1 is a schematic illustration (not to scale) of an example scenario 1 in which an embodiment of a process for improving perception integrity is implemented.
The terminology "perception" is used herein to refer to a process by which a vehicle's sensors are used to perform measurements of the vehicle's surroundings and process these measurements in order to enable the vehicle to successfully navigate through the surroundings. The process for improving perception integrity is described in more detail later below with reference to Figure 2.
In the scenario 1, a vehicle 2 comprises a camera 4, a laser scanner 6 and a processor 8. The camera 4 and the laser scanner 6 are each coupled to the processor 8. The vehicle 2 is a land-based vehicle.
In the scenario 1, the vehicle 2 performs autonomous navigation within its surroundings. The vehicle's surroundings comprise a plurality of obstacles, which are represented in Figure 1 by a single box and indicated by the reference numeral 10.
The autonomous navigation of the vehicle 2 is facilitated by measurements made by the vehicle 2 of the obstacles 10. These measurements are made using the camera 4 and the laser scanner 6.
The camera 4 takes light intensity measurements of the obstacles 10 from the vehicle. This intensity data (hereinafter referred to as "camera data") is sent from the camera 4 to the processor 8. The camera data is, in effect, a visual image of the obstacles 10 and is hereinafter referred to as the "camera image". The camera 4 is a conventional camera.
The laser scanner 6 takes range or bearing measurements of the obstacles 10 from the vehicle 2. This range data (hereinafter referred to as "laser data") is sent from the laser scanner 6 to the processor 8. The laser data is, in effect, an image of the obstacles 10 and/or the dust cloud 12. This image is hereinafter referred to as the "laser scan". The laser scanner 6 is a conventional laser scanner.
In this embodiment, the camera image and the laser scan are continuously acquired and time-stamped. However, in other embodiments, images and scans may be acquired on intermittent bases. Also, in other embodiments, time- stamping need not be employed, and instead any other suitable form of time-alignment or image/scan association may be used.
The use of the camera 4 and the laser scanner 6 to make measurements of the obstacles 10 is represented in Figure 1 by sets of dotted lines between the camera 4 and the obstacles 10 and between the laser scanner 6 and the obstacles 10 respectively.
The camera image and the laser scan are processed by the processor 8 to enable the vehicle 2 to navigate within its surroundings. The processor 8 compares a laser scan to a camera image, the laser scan being the closest laser scan in time (e.g. based on the time-stamping) to the camera image, as described in more detail below.
In the scenario 1, the images of the obstacles 10 generated using the camera 4 and the laser scanner 6 are made through a dust cloud 12. In other words, the dust cloud 12 at least partially obscures the obstacles 10 from the camera 4 and/or laser scanner 6 on the vehicle. For the purpose of illustrating the present invention, it is assumed that the presence of the dust cloud 12 affects the measurements taken by the camera 4 and the laser scanner 6 to different degrees. In particular, the laser scanner 6 detects the dust cloud 12 the same as it would detect an obstacle, whereas the dust cloud 12 does not significantly affect the measurements of the obstacles 10 taken by the camera 4.
An embodiment of a process for improving perception integrity, i.e. improving the vehicle's perception of its surroundings (in particular, the obstacles 10) in the presence of the dust cloud 12, will now be described.
Figure 2 is a process flow chart showing certain steps of an embodiment of the process for improving perception integrity.
It should be noted that certain of the process steps depicted in the flowchart of Figure 2 and described above may be omitted or such process steps may be performed in differing order to that presented above and shown in Figure 2. Furthermore, although all the process steps have, for convenience and ease of understanding, been depicted as discrete temporally-sequential steps, nevertheless some of the process steps may in fact be performed simultaneously or at least overlapping to some extent temporally.
At step s2, a calibration process is performed on the camera image and the laser scan to determine an estimate of the transformation between the laser frame and the camera frame. This transformation is hereinafter referred to as the "laser-camera transformation". In this embodiment, this calibration process is a conventional process. For example, the camera may be calibrated using the camera calibration toolbox for Matlab(TM) developed by Bouget et al . , and the estimate of the laser-camera transformation may be determined using a conventional technique .
This step provides that every laser point whose projection under the laser-camera transformation belongs to the camera image may be projected onto the camera image .
Steps s4 to s20, described below, define a process by which a value that is indicative of how well the laser scan and the camera image correspond to each other (after the performance of the laser-camera transformation) is determined .
At step s4, the camera image is converted in to a gray-scale image. In this embodiment, this conversion is performed in a conventional manner.
At step s6, an edge detection process is performed on the gray-scale camera image.
In this embodiment, edge detection in the camera image is performed in a conventional manner using a Sobel filter. In this embodiment, the laser scanner 6 is arranged to scan the obstacles 10 in a plane that is substantially horizontal to a ground surface, i.e. scanning is performed in a plane substantially perpendicular to the image plane of the camera image. Thus, in this embodiment a filter for detecting vertical edges is used. In other embodiments, a different type of filter for detecting different edges may be used, for example a filter designed for detecting horizontal edges may be used in embodiments where the laser scanner is arranged to scan vertically.
In this embodiment, the filtered image is, in effect, the image that results from the convolution of the grey- scale image with the following mask:
1 0 -1
2 0 -2
1 0 -2
In this embodiment, an edge in the camera image is indicated by a sudden variation in the intensity of the grey-scale camera image.
At step s8, gradient values for the range parameter measured by the laser scanner 6 are determined.
In this embodiment, the gradient values are obtained in a conventional way from the convolution of the laser scan with the following mask:
[-1/2 0 1/2]
At step slO, points in the laser scan that correspond to "corners" are identified. The terminology "corner" is used herein to indicate a point at which there is a sudden variation of range along the scan. This may, for example, be caused by the laser scanner 6 measuring the range from the vehicle 2 to an obstacle 10. As the laser scanner 6 scans beyond the edge or corner of the obstacle 10, there is a sudden and possibly significant change in the range values measured by the laser scanner 6.
In this embodiment, points in the laser scan that correspond to a corner are those points that have a gradient value (determined at step sl4) that has an absolute value that is greater than a first threshold value. This first threshold value is set to be above a noise floor.
In this embodiment, in many cases, two successive points of the laser scan correspond to corners, i.e. one laser poi nt either side of the laser scan discontinuity. These form pairs of corners.
At step sl2, the laser scan is segmented, i.e. a plurality of segments is defined over the laser scan. In this embodiment, a segment of the laser scan comprises all the points between two successive laser corner points.
At step sl4, one of each of the two laser points (corners) of the pairs identified at step slO is selected.
In this embodiment, of each of the pairs of laser points corresponding to successive corners, the laser point that corresponds to a shorter range is selected. This selected laser point is the most likely point of the pair to correspond to an edge in the camera image, after proj ection . At step sl6, the selected laser corner points are projected onto the camera image (using the laser-camera transformation) . Also, respective pixel neighbourhoods of uncertainty corresponding to each of the respective projected points are computed.
In this embodiment, these neighbourhoods of uncertainty are determined in a conventional manner as follows .
Points in the laser scan are related to corresponding points in the camera image as follows:
/>=<D(PC-A) where: Pl is a point in the laser scan;
Pc is a point in the camera image corresponding to the point Pl ;
Δ is a translation offset; and
Figure imgf000012_0001
Φ is a rotation matrix with Euler angles
Figure imgf000012_0002
In this embodiment, the laser-camera calibration optimisation (described above at step s2) returns values for Δ and Φ by minimising the sum of the squares of the normal errors. The normal errors are simply the Euclidean distance of the laser points from the calibration plane in the camera image frame of reference.
In this embodiment, the uncertainty of the parameters Δ and Φ is found by implementing a method as described in Approximate Tests of Correlation in Time-Series", Quenouille M.H., 1949, Journal of the Royal Statistical
Society, Vol. 11, which is incorporated herein by reference. What will now be described is certain aspects relating to ways in which this method may be applied to the above mentioned laser-camera calibration optimisation.
Let Xi represent the set of data points of the ith
laser scan of the dataset.
So called "Jackknife" samples are taken from the dataset. The ith Jackknife sample Xi is simply all the data points
except those of the ith laser scan,
Xi {Xi, X2r --r Xi-1 r Xi+1 r ··· r %n }
Thus, n different Jackknife samples are produced. Let Pi = [ δχ δγ δζ φχ φγ φζ] ί be the parameter vector
obtained from running the optimisation on the ith Jackknife
sample. The mean of the parameter vectors is given by
P . The standard error of the parameters is therefore given by:
Figure imgf000014_0001
An uncertainty propagation method used in this embodiment employed is described in "Reliable and Safe Autonomy for Ground Vehicles in Unstructured Environments", Underwood, J. P., Mechanical and Mechatronic Engineering, 2008, School of Aerospace, The University of Sydney, Sydney, which is incorporated herein by reference.
In other embodiments, a different technique of computing the neighbourhoods of uncertainty may be used. For example, a common ''calibration object' could be identified in the camera image and laser scan. An edge of this calibration object may then be used to generate a maximum error value, which can be used to define a neighbourhood of uncertainty. The computed neighbourhoods of uncertainty corresponding to each of the respective projected points are also projected onto the camera image (using the laser- camera transformation) .
A selected laser corner point, and a respective neighbourhood of uncertainty that surrounds that laser point, each have a projected image on the camera image under the laser-camera transformation. The projection of a laser corner point is, a priori, a best estimate of the pixel of the camera image that corresponds to that laser point. The projection of a neighbourhood surrounds the projection of the corresponding laser corner point.
At step sl8, for each laser corner point projected on to the camera image at step sl6, it is determined whether there is a matching edge in the camera image within the projection of the neighbourhood of uncertainty of that laser point. In this embodiment, the terminology
"matching edge in the camera image" refers to at least two points (pixels) in the camera image, in two different consecutive lines and connected columns of the relevant neighbourhood of uncertainty, having a Sobel intensity greater than a predefined second threshold value.
The matching process of step sl8 comprises identifying a camera image edge within a neighbourhood of a projection of a laser corner point. In other words, the matching process comprises identifying points in the camera image within a projection of a neighbourhood, the points having an intensity value greater than the second threshold value.
At step s20, for each projected laser point, a probability that the laser information corresponds to the information in the camera image acquired at the same time is estimated.
In this embodiment, the probability of correspondence between the laser and camera image, for a certain projected laser point corresponding to a selected corner, is determined using the following formula:
P(C\A,B)P(B\A)P(A)
P(C I B) P(B)
where: A is the event that the laser and camera information correspond;
B is the event that an edge is found in the projection of the neighbourhood of the certain projected laser point;
C is the projection on to the camera image of this certain projected laser point; and
η is a normalisation factor.
Thus, the terms in the above equation have the following definitions:
P(A\B,C) is the probability that, for a given laser corner point, the laser and camera information correspond given the projection of that laser corner point and given that an edge was found in the projection of the neighbourhood of that projected laser corner point;
P(C\A,B) is the probability of the certain laser data projection on the camera image, given that the laser and camera data correspond, and given that a visual edge was found in the projection of the neighbourhood of the certain laser point projection. This term is directly related to the uncertainty of the projection of the laser point on the image. In this embodiment, the value of this term is computed using a Gaussian mask over the neighbourhood of the certain projected laser point. This Gaussian represents the distribution of probability for the position of the laser projected point. In this embodiment, if a visual edge was found in the projection of the neighbourhood, then the value of the term P(C I A,B = 1) is the value of the Gaussian mask at the closest pixel belonging to the visual edge. Also, if no visual edge was found in the projection of the neighbourhood, then the value of the term P(C\A,B = 0) is the probability that the projection of the laser point is outside of the projection of the neighbourhood.
P(B I A) is the probability that a visual edge is found in the neighbourhood, given that the laser and camera information do correspond. This term describes the likelihood of the assumption that if the laser and camera information do correspond, then any laser corner should correspond to a visual edge in the camera image. In this embodiment, the value of this term is fixed and close to 1, i.e. knowing the laser and camera data do correspond, a visual edge usually exists.
P(A) is the a priori probability that the laser data and camera data correspond. In this embodiment, the value of this term is set to a fixed uncertain value. This represents the fact that, in this embodiment, there is no a priori knowledge on that event;
P(B) is the a priori probability of finding a visual edge. It is expressed as: P(B) =P(B \ A)P(A)+P(B \A)P(A) with P{A) = P{A). In this embodiment, the value of the term P(B\A) is the probability of finding a visual edge anywhere in the camera image (using the process described at step s6 above) ; and
P(C\B) = P(C\B,A)P(A)+ P(C\B,A)P(A) , where only the term remains to be described is P(C\B,A). This term corresponds to the confidence on the projection of the laser point on to the image, knowing that the laser and camera data do not correspond (not A) and that an edge was found or not (B) . If the data do not correspond, then the even B does not provide more information for C, thus we can say that P(C I B,A) = P(C I A) . This term corresponds to the confidence in the calibration (i.e. quality of the projection) . In this embodiment it is taken as the best chance for the projection, i.e. the probability read at the centre of the neighbourhood of uncertainty (in other words, the maximum probability in the neighbourhood) .
Thus, in this embodiment the determined values of the probability that the laser information corresponds to the information in the camera image (for a certain projected laser point corresponding to a selected corner) are values related to a distance in the camera image between a camera edge and the projection of a corresponding laser corner on to the camera image. In other embodiments other similarity values, i.e. values encapsulating the similarity between the camera edge and the projection of a corresponding laser corner on to the camera image, may be used.
At step s22, a validation process is performed to validate the laser data relative to the camera data.
In this embodiment, the validation process comprises making a decision about whether each of the laser scan segments corresponds to the camera image data. In this embodiment, for a given laser scan segment, if the corners belonging to this segment have a matching edge in the camera image, i.e. the probabilities for those corners, determined at step s20, are greater than a predefined threshold (hereinafter referred to as the "third threshold value") , then the laser data of that segment is considered to correspond to the camera data, i.e. the laser data is validated and can be combined (or associated) with the camera image data. However, if the corners belonging to this segment have no matching edge in the image (i.e. the probabilities for those points are lower than the third threshold) , then the laser data of the certain segment is considered to not correspond to the camera data. In this case, the data from both types of sensors is treated differently. In this embodiment, the laser data is considered as inconsistent with the camera data (i.e. it has been corrupted by the presence of the dust cloud 12) . Therefore, fusion of laser data and camera data is not permitted for the purposes of navigation of the vehicle 2. In other embodiments, different validation processes may be used.
If it is determined that the laser scan and camera image correspond, the laser and camera data can be fused. The fused data is integrated with any other sensing data in a perception system of the vehicle 2. However, if it is determined that the laser and camera image do not correspond, only the most reliable of the data (in this embodiment the data corresponding to the camera image) is integrated with any other sensing data in a perception system of the vehicle 2. This advantageously avoids the utilising of non-robust data for the purposes of perception. Also, the above described method advantageously tends to provide better perception capabilities of the vehicle 2. In other words, the above described method advantageously tends to increase the integrity of a perception system of the vehicle 2.
An advantage of the above described method is that it tends to increase the integrity of a vehicle's perception capabilities in challenging environmental conditions, such as the presence of smoke or airborne dust. The increasing of the integrity of the vehicle's perception capabilities tends to enable the vehicle to navigate better within an environment .
The present invention advantageously compares data from laser scans and camera images to detect inconsistencies or discrepancies. In this embodiment, these discrepancies arise when the laser scanner 6 detects dust from the dust cloud 12. The effect of this dust tends to be less significant on the visual camera (or infrared) image, at least as long as the density of the dust cloud remains "reasonable". The method is capable of advantageously identifying that there is a discrepancy between the laser data and the camera data, so that only the relatively unaffected camera data is used for the purposes of navigating the vehicle.
A further advantage of the present invention is that a process of comparing laser data (comprising range/bearing information) to camera image data (comprising measurements of intensity, or colour, distributed in space on the camera plane) tends to be provided. In this embodiment, common characteristics in the data, in particular geometrical characteristics, are compared to provide this advantage. The present invention advantageously tends to exploit redundancies in the observations made by the laser scanner and the camera in order to identify features that correspond to each other in the laser scan and the camera image. Also, for each laser point that can be projected onto the camera image, an estimate of a likelihood that the sensing data provided by the laser does corresponds to the data in the image is advantageously provided. This allows a decision upon the veracity of the laser data compared to the camera data to be made.
A further advantage of the above embodiments is that the detection of discrepancies between laser data and camera data tends to be possible. Moreover, this tends to allow for the detection of misalignment errors, typically when the discrepancies/inconsistencies concern the whole laser scan.
In the above embodiments the vehicle is a land-based vehicle. However, in other embodiments the vehicle is a different type of vehicle, for example an aircraft.
In the above embodiments the vehicle performs autonomous navigation. However, in other embodiments navigation of the vehicle is not performed autonomously. For example, in other embodiments an embodiment of a method of improving the integrity of perception is used to support/advise a human navigator of a vehicle (e.g. a driver or a pilot) who may be on or remote from the vehicle .
In the above embodiments the vehicle comprises a laser scanner and a camera. However, in other embodiments the vehicle comprises any two different heterogeneous sensors, the data from which may be processed according to the method of improving perception integrity as described above. For example, in other embodiments one of the sensors is an infrared camera. An advantage provided by an infrared camera is that resulting images tend not to be significantly affected by the presence of smoke clouds.
In the above embodiments there are two heterogeneous sensors (the laser scanner and the camera) . However, in other embodiments there are more than two sensors, including at least two heterogeneous sensors.
In the above embodiments, the laser scan of the vehicles surroundings (determined from data from the laser scanner) is affected by the presence of the dust cloud (i.e. the laser scanner measures range values from the vehicle to the dust cloud as opposed to range values from the vehicle to the obstacles) . However, in other embodiments the laser scan is affected by a different entity, for example smoke, cloud, or fog. Furthermore, the process may also be used advantageously in situations in which there are no dust clouds etc.
In the above embodiments, the likelihood of correspondence of laser and camera data is determined by identifying laser corner points and matching edges in the camera image. However, in other embodiments different features of the respective images may be used. For example, in other embodiments other points of a laser segment (i.e. points not corresponding to corners) are used. In such embodiments, an inference process may need to be used in addition to the above described method steps in order to accurately check the consistency of the laser/camera images.
In the above embodiments, a probability value is determined to indicate the probability that a certain laser corner point corresponds to a matched edge in the camera image. However, in other embodiments a different appropriate metric indicative of the extent to which a certain laser corner point corresponds to a matched edge in the camera image is used.
In the above embodiments, a decision about whether or not the laser scan and the camera image correspond to one another is dependent on probability values that certain laser corner points correspond to respective matched edges in the camera image. However, in other embodiments this decision is based upon different appropriate criteria.
Apparatus, including the processor, for performing the method steps described above, may be provided by an apparatus having components on the vehicle, external to the vehicle, or by an apparatus having some components on the vehicle and others remote from the vehicle. Also, the apparatus may be provided by configuring or adapting any suitable apparatus, for example one or more computers or other processing apparatus or processors, and/or providing additional modules. The apparatus may comprise a computer, a network of computers, or one or more processors, for implementing instructions and using data, including instructions and data in the form of a computer program or plurality of computer programs stored in or on a machine readable storage medium such as computer memory, a computer disk, ROM, PROM etc., or any combination of these or other storage media.
In the claims which follow and in the preceding description of the invention, except where the context requires otherwise due to express language or necessary implication, the word "comprise" or variations such as "comprises" or "comprising" is used in an inclusive sense, i.e. to specify the presence of the stated features but not to preclude the presence or addition of further features in various embodiments of the invention.
It is to be understood that, if any prior art publication is referred to herein, such reference does not constitute an admission that the publication forms a part of the common general knowledge in the art, in Australia or any other country.

Claims

THE CLAIMS DEFINING THE INVENTION ARE AS FOLLOWS:
1. A method of processing sensor data, the method comprising :
measuring a value of a first parameter of a scene (10) using a first sensor (4) to produce a first image of the scene (10) ;
measuring a value of a second parameter of the scene (10) using a second sensor (6) to produce a second image of the scene (10) ;
identifying a first point, the first point being a point of the first image that corresponds to a class of features of the scene (10);
identifying a second point, the second point being a point of the second image that corresponds to the class of features ;
projecting the second point onto the first image;
determining a similarity value between the first point and the projection of the second point on to the first image; and
comparing the determined similarity value to a predetermined threshold value.
2. A method according to claim 1, wherein the similarity value is a value related to a distance in the first image between the first point and the projection of the second point on to the first image.
3. A method according to claim 1 or claim 2, the method further comprising:
defining a neighbourhood in the second image around the second point; and projecting the neighbourhood onto the first image; wherein
the step of identifying the first point comprises identifying the first point such that the first point lies within the projection of the neighbourhood onto the first image .
4. A method according to claim 3, wherein the step of determining a value related to a distance comprises:
defining a probability distribution mask over the projection of the neighbourhood in the first image, the probability distribution mask being centred on the projection of the second point on the first image; and
determining a value of the probability distribution mask at the first point.
5. A method according to any of claims 1 to 4, wherein the first parameter is different to the second parameter.
6. A method according to any of claims 1 to 5, wherein the first sensor (4) is a different type of sensor to the second sensor (6) .
7. A method according to claim 6, wherein the first parameter is light intensity, the first sensor type is a camera, the second parameter is range, and the second sensor type is a laser scanner.
8. A method according to any of claims 1 to 7, the method further comprising calibrating the second image of the scene with respect to the first image of the scene.
9. A method according to claim 8, wherein the step of calibrating the second image of the scene with respect to the first image of the scene comprises determining a transformation to project points in the second image to corresponding points in the first image.
10. A method according to claim 9, wherein a step of projecting is performed using the determined transformation .
11. A method according to any of claims 1 to 10, wherein the similarity value is a value of a probability that the second image corresponds to the first image.
12. A method according to claim 11 when dependent on claim 3, where the probability is calculated using the following formula:
Figure imgf000027_0001
where: A is the event that the second image corresponds to the first image;
B is the event that the first point lies within the projection of the neighbourhood onto the first image;
C is the projection of the second point onto the first image; and
η is a normalisation factor.
13. Apparatus for processing sensor data, the apparatus comprising :
a first sensor (4) for measuring a value of a first parameter of a scene (10) to produce a first image of the scene (10) ; a second sensor (6) for measuring a value of a second parameter of the scene (10) to produce a second image of the scene (10); and
one or more processors (8) arranged to:
identify a first point, the first point being a point of the first image that corresponds to a class of features of the scene (10) ;
identify a second point, the second point being a point of the second image that corresponds to the class of features ;
project the second point onto the first image;
determine a similarity value between the first point and the projection of the second point on to the first image; and
compare the determined similarity value to a predetermined threshold value.
14. An apparatus according to claim 13, wherein the similarity value is a value related to a distance in the first image between the first point and the projection of the second point on to the first image.
15. An autonomous vehicle (2) comprising the apparatus of claim 13 or claim 14.
16. A computer program or plurality of computer programs arranged such that when executed by a computer system it/they cause the computer system to operate in accordance with the method of any of claims 1 to 12.
17. A machine readable storage medium storing a computer program or at least one of the plurality of computer programs according to claim 16.
PCT/AU2011/000205 2010-03-09 2011-02-25 Sensor data processing Ceased WO2011109856A1 (en)

Priority Applications (3)

Application Number Priority Date Filing Date Title
EP11752738.2A EP2545707A4 (en) 2010-03-09 2011-02-25 PROCESSING SENSOR DATA
AU2011226732A AU2011226732A1 (en) 2010-03-09 2011-02-25 Sensor data processing
US13/583,456 US20130058527A1 (en) 2010-03-09 2011-02-25 Sensor data processing

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
AU2010200875A AU2010200875A1 (en) 2010-03-09 2010-03-09 Sensor data processing
AU2010200875 2010-03-09

Publications (1)

Publication Number Publication Date
WO2011109856A1 true WO2011109856A1 (en) 2011-09-15

Family

ID=44562731

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/AU2011/000205 Ceased WO2011109856A1 (en) 2010-03-09 2011-02-25 Sensor data processing

Country Status (4)

Country Link
US (1) US20130058527A1 (en)
EP (1) EP2545707A4 (en)
AU (2) AU2010200875A1 (en)
WO (1) WO2011109856A1 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2884305A1 (en) * 2013-12-13 2015-06-17 Sikorsky Aircraft Corporation Semantics based safe landing area detection for an unmanned vehicle
US9164511B1 (en) 2013-04-17 2015-10-20 Google Inc. Use of detected objects for image processing

Families Citing this family (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9221396B1 (en) 2012-09-27 2015-12-29 Google Inc. Cross-validating sensors of an autonomous vehicle
US9062979B1 (en) 2013-07-08 2015-06-23 Google Inc. Pose estimation using long range features
DE102015207375A1 (en) * 2015-04-22 2016-10-27 Robert Bosch Gmbh Method and device for monitoring an area in front of a vehicle
CN106323288A (en) * 2016-08-01 2017-01-11 杰发科技(合肥)有限公司 Transportation-tool positioning and searching method, positioning device and mobile terminal
US10678260B2 (en) * 2017-07-06 2020-06-09 GM Global Technology Operations LLC Calibration methods for autonomous vehicle operations
US11227409B1 (en) 2018-08-20 2022-01-18 Waymo Llc Camera assessment techniques for autonomous vehicles
US11699207B2 (en) 2018-08-20 2023-07-11 Waymo Llc Camera assessment techniques for autonomous vehicles
US10928819B2 (en) * 2018-10-29 2021-02-23 Here Global B.V. Method and apparatus for comparing relevant information between sensor measurements
CN114942638A (en) * 2019-04-02 2022-08-26 北京石头创新科技有限公司 Robot working area map construction method and device
CN110084992A (en) * 2019-05-16 2019-08-02 武汉科技大学 Ancient buildings fire alarm method, device and storage medium based on unmanned plane
EP4071516A4 (en) * 2019-12-03 2022-12-14 Konica Minolta, Inc. Image processing device, monitoring system, and image processing method
US11567197B2 (en) * 2020-02-20 2023-01-31 SafeAI, Inc. Automated object detection in a dusty environment

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5170352A (en) * 1990-05-07 1992-12-08 Fmc Corporation Multi-purpose autonomous vehicle with path plotting
US20070019181A1 (en) * 2003-04-17 2007-01-25 Sinclair Kenneth H Object detection system

Family Cites Families (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6822563B2 (en) * 1997-09-22 2004-11-23 Donnelly Corporation Vehicle imaging system with accessory control
JP3156817B2 (en) * 1994-03-14 2001-04-16 矢崎総業株式会社 Vehicle periphery monitoring device
JP3417377B2 (en) * 1999-04-30 2003-06-16 日本電気株式会社 Three-dimensional shape measuring method and apparatus, and recording medium
US6952488B2 (en) * 2001-08-27 2005-10-04 Carnegie Mellon University System and method for object localization
JP3868876B2 (en) * 2002-09-25 2007-01-17 株式会社東芝 Obstacle detection apparatus and method
JP4406381B2 (en) * 2004-07-13 2010-01-27 株式会社東芝 Obstacle detection apparatus and method
DE102004041115A1 (en) * 2004-08-24 2006-03-09 Tbs Holding Ag Airline passenger`s finger-or face characteristics recording method, involves recording objects by two sensors, where all points of surface to be displayed are represented in two different directions in digital two-dimensional pictures
US7738687B2 (en) * 2005-04-07 2010-06-15 L-3 Communications Security And Detection Systems, Inc. Method of registration in a contraband detection system
US7786898B2 (en) * 2006-05-31 2010-08-31 Mobileye Technologies Ltd. Fusion of far infrared and visible images in enhanced obstacle detection in automotive applications
US20080080748A1 (en) * 2006-09-28 2008-04-03 Kabushiki Kaisha Toshiba Person recognition apparatus and person recognition method
JP4852006B2 (en) * 2007-07-27 2012-01-11 株式会社パスコ Spatial information database generation device and spatial information database generation program
US20100085371A1 (en) * 2008-10-02 2010-04-08 Microsoft Corporation Optimal 2d texturing from multiple images

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5170352A (en) * 1990-05-07 1992-12-08 Fmc Corporation Multi-purpose autonomous vehicle with path plotting
US20070019181A1 (en) * 2003-04-17 2007-01-25 Sinclair Kenneth H Object detection system

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
PEYNOT, T. ET AL.: "Towards reliable perception for Unmanned Ground Vehicles in challenging conditions", IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, 10 October 2009 (2009-10-10), ST. LOUIS, MO, pages 1170 - 1176, XP031580842 *
See also references of EP2545707A4 *

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9164511B1 (en) 2013-04-17 2015-10-20 Google Inc. Use of detected objects for image processing
US9804597B1 (en) 2013-04-17 2017-10-31 Waymo Llc Use of detected objects for image processing
US10509402B1 (en) 2013-04-17 2019-12-17 Waymo Llc Use of detected objects for image processing
US11181914B2 (en) 2013-04-17 2021-11-23 Waymo Llc Use of detected objects for image processing
US12019443B2 (en) 2013-04-17 2024-06-25 Waymo Llc Use of detected objects for image processing
EP2884305A1 (en) * 2013-12-13 2015-06-17 Sikorsky Aircraft Corporation Semantics based safe landing area detection for an unmanned vehicle
US9177481B2 (en) 2013-12-13 2015-11-03 Sikorsky Aircraft Corporation Semantics based safe landing area detection for an unmanned vehicle

Also Published As

Publication number Publication date
EP2545707A4 (en) 2013-10-02
AU2011226732A1 (en) 2012-09-27
US20130058527A1 (en) 2013-03-07
EP2545707A1 (en) 2013-01-16
AU2010200875A1 (en) 2011-09-22

Similar Documents

Publication Publication Date Title
EP2545707A1 (en) Sensor data processing
US11860315B2 (en) Methods and systems for processing LIDAR sensor data
CN109961468B (en) Volume measurement method and device based on binocular vision and storage medium
EP3875905B1 (en) Method, device and medium for detecting environmental change
US11860281B2 (en) Methods and systems for filtering data points when merging LIDAR sensor datasets
CN104035071B (en) Merge radar/video camera object data and the method and apparatus of LiDAR scanning element
KR101775114B1 (en) System and method for simultaneous localization and mapping of mobile robot
US20160104047A1 (en) Image recognition system for a vehicle and corresponding method
CN112964276A (en) Online calibration method based on laser and vision fusion
US9135510B2 (en) Method of processing sensor data for navigating a vehicle
KR102490521B1 (en) Automatic calibration through vector matching of the LiDAR coordinate system and the camera coordinate system
CN107016705A (en) Ground level estimation in computer vision system
WO2021212319A1 (en) Infrared image processing method, apparatus and system, and mobile platform
KR102114558B1 (en) Ground and non ground detection apparatus and method utilizing lidar
US11567501B2 (en) Method and system for fusing occupancy maps
CN118262336B (en) Indoor parking lot positioning method and system based on visual SLAM
Kırcalı et al. Ground plane detection using an RGB-D sensor
KR102316012B1 (en) Apparatus and method for determining possibility of collision with flying object in front of drone using camera image provided in drone
US12535322B2 (en) System and methods of vehicle pose estimation
Jaimez et al. Robust planar odometry based on symmetric range flow and multiscan alignment
CN121170642A (en) Unmanned aerial vehicle dynamic environment sensing method based on visual large model, electronic equipment and storage medium
EP3819665B1 (en) Method and computer device for calibrating lidar system
Strand et al. Modeling inter-vehicle occlusion scenarios in multi-camera traffic surveillance systems
Vaida et al. Automatic extrinsic calibration of LiDAR and monocular camera images
CN118149796A (en) Map construction method, device, electronic device and storage medium in degraded environment

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 11752738

Country of ref document: EP

Kind code of ref document: A1

WWE Wipo information: entry into national phase

Ref document number: 2011226732

Country of ref document: AU

WWE Wipo information: entry into national phase

Ref document number: 7830/DELNP/2012

Country of ref document: IN

NENP Non-entry into the national phase

Ref country code: DE

ENP Entry into the national phase

Ref document number: 2011226732

Country of ref document: AU

Date of ref document: 20110225

Kind code of ref document: A

WWE Wipo information: entry into national phase

Ref document number: 2011752738

Country of ref document: EP

WWE Wipo information: entry into national phase

Ref document number: 13583456

Country of ref document: US