WO2024093372A1 - 测距方法和装置 - Google Patents
测距方法和装置 Download PDFInfo
- Publication number
- WO2024093372A1 WO2024093372A1 PCT/CN2023/108397 CN2023108397W WO2024093372A1 WO 2024093372 A1 WO2024093372 A1 WO 2024093372A1 CN 2023108397 W CN2023108397 W CN 2023108397W WO 2024093372 A1 WO2024093372 A1 WO 2024093372A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- image
- camera
- feature
- vehicle
- map
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Classifications
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01S—RADIO DIRECTION-FINDING; RADIO NAVIGATION; DETERMINING DISTANCE OR VELOCITY BY USE OF RADIO WAVES; LOCATING OR PRESENCE-DETECTING BY USE OF THE REFLECTION OR RERADIATION OF RADIO WAVES; ANALOGOUS ARRANGEMENTS USING OTHER WAVES
- G01S13/00—Systems using the reflection or reradiation of radio waves, e.g. radar systems; Analogous systems using reflection or reradiation of waves whose nature or wavelength is irrelevant or unspecified
- G01S13/86—Combinations of radar systems with non-radar systems, e.g. sonar, direction finder
- G01S13/867—Combination of radar systems with cameras
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/50—Depth or shape recovery
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01S—RADIO DIRECTION-FINDING; RADIO NAVIGATION; DETERMINING DISTANCE OR VELOCITY BY USE OF RADIO WAVES; LOCATING OR PRESENCE-DETECTING BY USE OF THE REFLECTION OR RERADIATION OF RADIO WAVES; ANALOGOUS ARRANGEMENTS USING OTHER WAVES
- G01S13/00—Systems using the reflection or reradiation of radio waves, e.g. radar systems; Analogous systems using reflection or reradiation of waves whose nature or wavelength is irrelevant or unspecified
- G01S13/02—Systems using reflection of radio waves, e.g. primary radar systems; Analogous systems
- G01S13/06—Systems determining position data of a target
- G01S13/08—Systems for measuring distance only
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01S—RADIO DIRECTION-FINDING; RADIO NAVIGATION; DETERMINING DISTANCE OR VELOCITY BY USE OF RADIO WAVES; LOCATING OR PRESENCE-DETECTING BY USE OF THE REFLECTION OR RERADIATION OF RADIO WAVES; ANALOGOUS ARRANGEMENTS USING OTHER WAVES
- G01S13/00—Systems using the reflection or reradiation of radio waves, e.g. radar systems; Analogous systems using reflection or reradiation of waves whose nature or wavelength is irrelevant or unspecified
- G01S13/88—Radar or analogous systems specially adapted for specific applications
- G01S13/93—Radar or analogous systems specially adapted for specific applications for anti-collision purposes
- G01S13/931—Radar or analogous systems specially adapted for specific applications for anti-collision purposes of land vehicles
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01S—RADIO DIRECTION-FINDING; RADIO NAVIGATION; DETERMINING DISTANCE OR VELOCITY BY USE OF RADIO WAVES; LOCATING OR PRESENCE-DETECTING BY USE OF THE REFLECTION OR RERADIATION OF RADIO WAVES; ANALOGOUS ARRANGEMENTS USING OTHER WAVES
- G01S15/00—Systems using the reflection or reradiation of acoustic waves, e.g. sonar systems
- G01S15/02—Systems using the reflection or reradiation of acoustic waves, e.g. sonar systems using reflection of acoustic waves
- G01S15/06—Systems determining the position data of a target
- G01S15/08—Systems for measuring distance only
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01S—RADIO DIRECTION-FINDING; RADIO NAVIGATION; DETERMINING DISTANCE OR VELOCITY BY USE OF RADIO WAVES; LOCATING OR PRESENCE-DETECTING BY USE OF THE REFLECTION OR RERADIATION OF RADIO WAVES; ANALOGOUS ARRANGEMENTS USING OTHER WAVES
- G01S15/00—Systems using the reflection or reradiation of acoustic waves, e.g. sonar systems
- G01S15/86—Combinations of sonar systems with lidar systems; Combinations of sonar systems with systems not using wave reflection
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01S—RADIO DIRECTION-FINDING; RADIO NAVIGATION; DETERMINING DISTANCE OR VELOCITY BY USE OF RADIO WAVES; LOCATING OR PRESENCE-DETECTING BY USE OF THE REFLECTION OR RERADIATION OF RADIO WAVES; ANALOGOUS ARRANGEMENTS USING OTHER WAVES
- G01S15/00—Systems using the reflection or reradiation of acoustic waves, e.g. sonar systems
- G01S15/88—Sonar systems specially adapted for specific applications
- G01S15/93—Sonar systems specially adapted for specific applications for anti-collision purposes
- G01S15/931—Sonar systems specially adapted for specific applications for anti-collision purposes of land vehicles
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01S—RADIO DIRECTION-FINDING; RADIO NAVIGATION; DETERMINING DISTANCE OR VELOCITY BY USE OF RADIO WAVES; LOCATING OR PRESENCE-DETECTING BY USE OF THE REFLECTION OR RERADIATION OF RADIO WAVES; ANALOGOUS ARRANGEMENTS USING OTHER WAVES
- G01S17/00—Systems using the reflection or reradiation of electromagnetic waves other than radio waves, e.g. lidar systems
- G01S17/02—Systems using the reflection of electromagnetic waves other than radio waves
- G01S17/06—Systems determining position data of a target
- G01S17/08—Systems determining position data of a target for measuring distance only
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01S—RADIO DIRECTION-FINDING; RADIO NAVIGATION; DETERMINING DISTANCE OR VELOCITY BY USE OF RADIO WAVES; LOCATING OR PRESENCE-DETECTING BY USE OF THE REFLECTION OR RERADIATION OF RADIO WAVES; ANALOGOUS ARRANGEMENTS USING OTHER WAVES
- G01S17/00—Systems using the reflection or reradiation of electromagnetic waves other than radio waves, e.g. lidar systems
- G01S17/86—Combinations of lidar systems with systems other than lidar, radar or sonar, e.g. with direction finders
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01S—RADIO DIRECTION-FINDING; RADIO NAVIGATION; DETERMINING DISTANCE OR VELOCITY BY USE OF RADIO WAVES; LOCATING OR PRESENCE-DETECTING BY USE OF THE REFLECTION OR RERADIATION OF RADIO WAVES; ANALOGOUS ARRANGEMENTS USING OTHER WAVES
- G01S17/00—Systems using the reflection or reradiation of electromagnetic waves other than radio waves, e.g. lidar systems
- G01S17/88—Lidar systems specially adapted for specific applications
- G01S17/93—Lidar systems specially adapted for specific applications for anti-collision purposes
- G01S17/931—Lidar systems specially adapted for specific applications for anti-collision purposes of land vehicles
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/10—Segmentation; Edge detection
- G06T7/13—Edge detection
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/10—Segmentation; Edge detection
- G06T7/136—Segmentation; Edge detection involving thresholding
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/50—Depth or shape recovery
- G06T7/55—Depth or shape recovery from multiple images
- G06T7/593—Depth or shape recovery from multiple images from stereo images
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/80—Analysis of captured images to determine intrinsic or extrinsic camera parameters, i.e. camera calibration
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/90—Determination of colour characteristics
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01S—RADIO DIRECTION-FINDING; RADIO NAVIGATION; DETERMINING DISTANCE OR VELOCITY BY USE OF RADIO WAVES; LOCATING OR PRESENCE-DETECTING BY USE OF THE REFLECTION OR RERADIATION OF RADIO WAVES; ANALOGOUS ARRANGEMENTS USING OTHER WAVES
- G01S13/00—Systems using the reflection or reradiation of radio waves, e.g. radar systems; Analogous systems using reflection or reradiation of waves whose nature or wavelength is irrelevant or unspecified
- G01S13/88—Radar or analogous systems specially adapted for specific applications
- G01S13/93—Radar or analogous systems specially adapted for specific applications for anti-collision purposes
- G01S13/931—Radar or analogous systems specially adapted for specific applications for anti-collision purposes of land vehicles
- G01S2013/9323—Alternative operation using light waves
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01S—RADIO DIRECTION-FINDING; RADIO NAVIGATION; DETERMINING DISTANCE OR VELOCITY BY USE OF RADIO WAVES; LOCATING OR PRESENCE-DETECTING BY USE OF THE REFLECTION OR RERADIATION OF RADIO WAVES; ANALOGOUS ARRANGEMENTS USING OTHER WAVES
- G01S13/00—Systems using the reflection or reradiation of radio waves, e.g. radar systems; Analogous systems using reflection or reradiation of waves whose nature or wavelength is irrelevant or unspecified
- G01S13/88—Radar or analogous systems specially adapted for specific applications
- G01S13/93—Radar or analogous systems specially adapted for specific applications for anti-collision purposes
- G01S13/931—Radar or analogous systems specially adapted for specific applications for anti-collision purposes of land vehicles
- G01S2013/9324—Alternative operation using ultrasonic waves
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10004—Still image; Photographic image
- G06T2207/10012—Stereo images
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20084—Artificial neural networks [ANN]
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/30—Subject of image; Context of image processing
- G06T2207/30248—Vehicle exterior or interior
- G06T2207/30252—Vehicle exterior; Vicinity of vehicle
- G06T2207/30261—Obstacle
Definitions
- the embodiments of the present application relate to the field of autonomous driving, and in particular to distance measurement methods and devices.
- the distance to obstacles around the vehicle is mainly sensed by distance measuring sensors (such as lidar, millimeter wave radar and ultrasonic radar) installed around the vehicle.
- distance measuring sensors such as lidar, millimeter wave radar and ultrasonic radar
- the distance measuring sensors installed around the vehicle have detection blind spots.
- the embodiment of the present application provides a distance measurement method and device for a vehicle, which enables the vehicle to measure the distance to obstacles in the detection blind spot. To achieve the above purpose, the embodiment of the present application adopts the following technical solutions:
- an embodiment of the present application provides a distance measurement method, which is applied to a vehicle, wherein the vehicle includes a first camera and a second camera, and the method includes: first acquiring a first image and a second image. Then acquiring a first depth map and a second depth map. Thereafter, determining the distance between an object in the first image and/or the second image and the vehicle based on the first depth map and the second depth map. Determining the distance between an object in the first image and/or the second image and the vehicle based on the first depth map and the second depth map.
- the first image is an image captured by the first camera
- the second image is an image captured by the second camera
- the first camera and the second camera have a common viewing area
- the first camera is a fisheye camera
- the second camera is a pinhole camera
- the first depth map is a depth map corresponding to the first image
- the second depth map is a depth map corresponding to the second image.
- the distance measurement method provided in the embodiment of the present application obtains a first depth map and a second depth map through a first image captured by a fisheye camera with a large field of view and a second image captured by a pinhole camera in a common viewing area with the camera, and then determines the distance between the object in the first image and/or the second image and the vehicle based on the first depth map and the second depth map.
- the fisheye camera with a large field of view can make up for the blind spots inherent in the layout of the distance measurement sensor, so that the vehicle can measure the distance to obstacles (such as suspended obstacles) in the detection blind spots.
- the first image and the second image may be input into a target network to obtain a first depth map and a second depth map.
- the first depth map and the second depth map are obtained, and then the distance between the object in the first image and the second image and the vehicle is determined according to the first depth map and the second depth map.
- the fisheye camera with a large field of view can make up for the blind spots inherent in the layout of the ranging sensor, so that the vehicle can measure the distance to obstacles (such as suspended obstacles) in the detection blind spots.
- the first camera may be a fisheye camera with a field of view greater than a preset angle.
- the preset angle is 180 degrees or 192 degrees.
- some objects may not be in the common viewing area of the first camera and the second camera, but exist alone in the field of view of the first camera or the second camera, that is, exist alone in the first image or the second image.
- the distance between these objects and the vehicle can be determined by the first depth map and the second depth map.
- a first feature map and a second feature map may be obtained. Then, a third feature map may be obtained based on a first feature point of the first feature map and a plurality of target feature points corresponding to the first feature point. Then, a fourth feature map may be obtained based on a second feature point of the second feature map and a plurality of target feature points corresponding to the second feature point. Then, the first depth map and the second depth map may be obtained based on the third feature map and the fourth feature map.
- the first feature map is a feature map corresponding to the first image
- the second feature map is a feature map corresponding to the second image.
- the first feature point is any feature point in the first feature map.
- the multiple target feature points corresponding to the first feature point are feature points in the second feature map that meet the epipolar constraint with the first feature point.
- the second feature point is any feature point in the second feature map, and the multiple target feature points corresponding to the second feature point are feature points in the first feature map that meet the epipolar constraint with the second feature point.
- the epipolar constraint refers to the constraint formed by the image point and the camera optical center under the projection model when describing the projection of the same point onto two images with different perspectives.
- the epipolar line is not necessarily a straight line, but may also be a curve.
- e1 is the intersection of the line connecting the optical centers O1O2 of the two images and the plane of image one
- e2 is the intersection of the line connecting the optical centers O1O2 of the two images and the plane of image two.
- feature matching of feature points through target feature points that meet epipolar constraints in the image of the common view area corresponding to the feature points can reduce the computational complexity of the feature matching process on the one hand; on the other hand, since the feature points that meet epipolar constraints in the image of the common view area corresponding to the feature points have a high similarity with the feature points, feature matching of feature points through target feature points of feature points can make the matched feature points fuse the features of the target feature points, increase the recognition of the feature points, and enable the target network to obtain the corresponding depth map more accurately according to the feature map after feature fusion, providing high ranging accuracy.
- the first feature map corresponding to the first image is flattened into a one-dimensional feature representation as [a0, a1, ..., aH1xW1], with a length of H1xW1
- the second feature map corresponding to the second image is flattened into a one-dimensional feature representation as [b0, b1, ... bH2xW2], with a length of H2xW2.
- the one-dimensional feature C is then mapped into three features, Q, K, and V, using a network, whose dimensions remain the same as C.
- the feature bi with index position i in the second feature map has n feature index positions corresponding to the depth range (dmin, dmax) in the first feature map after calculation through the epipolar constraint, which are ⁇ ad0, ad1, ..., adn ⁇ respectively.
- the above operation is performed on each feature point to obtain a one-dimensional feature C', and then it is split and converted into a third feature map corresponding to the first feature map and a fourth feature map corresponding to the second feature map according to the splicing order of C.
- the target feature point may also be a feature point that meets the epipolar constraint in an image where an image corresponding to the feature point exists in a common view area, and feature points around the feature point that meets the epipolar constraint.
- the first feature map corresponding to the first image is flattened into a one-dimensional feature representation as [a0, a1, ..., aH1xW1], with a length of H1xW1
- the second feature map corresponding to the second image is flattened into a one-dimensional feature representation as [b0, b1, ... bH2xW2], with a length of H2xW2.
- the feature bi at index position i in the second feature map has n feature index positions corresponding to the depth range (dmin, dmax) in the first feature map after calculation through the epipolar constraint, they are ⁇ ad0, ad1, ..., adn ⁇ , and there are m candidate points after dilation processing, which are generally represented as ⁇ ad0, ad1, ..., adn, adn+1, ..., adn+m ⁇ .
- the element qii in Q corresponding to the pinhole image feature index position i does not need to be dot-multiplied with all elements in K of length H1xW1+H2xW2, but only needs to be dot-multiplied with the n+m elements corresponding to ⁇ ad0, ad1, ..., adn, adn+1, ..., adn+m ⁇ to obtain each element of qii and the n+m elements.
- the distance between the object and the vehicle in the first image and/or the second image can be determined based on the first depth map, the second depth map, the first structural semantics, and the second structural semantics, wherein the first structural semantics is used to indicate the edges and planes of the objects in the first image, and the second structural semantics is used to indicate the edges and planes of the objects in the second image.
- the first image and the second image have a common viewing area, and the same object may exist in the first image and the second image. Since the depth of the pixel in the depth map is relative to the camera coordinate system corresponding to the image where the pixel is located, the depth of the same pixel in different camera coordinate systems may have deviations when converted to the unified coordinate system established by the vehicle. This deviation may affect the accuracy of the distance between the edge point and the vehicle. For this reason, the same pixel of different cameras can be aligned in the unified coordinate system by characterizing the structural semantics of the edge and plane structure of each object in the image to eliminate the deviation and improve the ranging accuracy.
- the first image, the first feature map, or the third feature map may be input into the target network to obtain the first structural semantics.
- the second image, the second feature map, or the fourth feature map may be input into the target network to obtain the second structural semantics.
- the third feature map is used as input, the first structural semantics obtained may be more accurate because the third feature map is a feature map fused based on the epipolar constraint; similarly, when the fourth feature map is used as input, the second structural semantics obtained may also be more accurate.
- the target network can also output the edges and planes of objects in the feature map corresponding to the image based on the image or feature map. Since the first image and the second image have a common view area, the same object may exist in the first image and the second image. Since the depth of the pixel in the depth map is relative to the camera coordinate system corresponding to the picture where the pixel is located, the depth of the same pixel in different camera coordinate systems may have deviations when converted to the unified coordinate system established with the vehicle. This deviation may affect the accuracy of the distance between the edge point and the vehicle. For this reason, the same pixel of different cameras can be aligned in the unified coordinate system by characterizing the structural semantics of the edge and plane structure of each object in the image to eliminate the deviation and improve the ranging accuracy.
- the first image is an image captured by the first camera at a first moment
- the second image is an image captured by the second camera at the first moment
- the distance between the object and the vehicle in the first image and/or the second image may be determined according to the first depth map, the second depth map, the first instance segmentation result, the second instance segmentation result, the first distance information, and the second distance information.
- the first instance segmentation result is used to indicate the background and movable objects in the first image
- the second instance segmentation result is used to indicate the background and movable objects in the second image
- the first distance information is used to indicate the distance between the object and the vehicle in the third image
- the third image is the image captured by the first camera at the second moment
- the second distance information is used to indicate the distance between the object and the vehicle in the fourth image
- the fourth image is the image captured by the second camera at the second moment.
- the first image and the second image have a common viewing area, and the same object may exist in the first image and the second image. Since the depth of the pixel in the depth map is relative to the camera coordinate system corresponding to the image where the pixel is located, the depth of the same pixel in different camera coordinate systems may have deviations when converted to the unified coordinate system established by the vehicle, and the deviation may affect the accuracy of the distance between the edge point and the vehicle.
- the distance between the object and the vehicle in the third image captured by the first camera at the second moment in the image and the instance segmentation result of the first image can be represented, and the distance between each object and the vehicle in the first image can be corrected with reference to the fixed background in the two images.
- the distance between the object and the vehicle in the fourth image captured by the second camera at the second moment in the image and the instance segmentation result of the second image can be represented, and the distance between each object and the vehicle in the first two images can be corrected with reference to the fixed background in the two images, so that the same pixel of different cameras can be aligned in the unified coordinate system to eliminate the deviation, thereby improving the ranging accuracy.
- the first image, the first feature map, or the third feature map may be input into the target network to obtain the first instance segmentation.
- the second image, the second feature map, or the fourth feature map may be input into the target network to obtain the second instance segmentation result.
- the third feature map is used as input, the first instance segmentation obtained may be more accurate because the third feature map is a feature map fused based on the epipolar constraint; similarly, when the fourth feature map is used as input, the second instance segmentation obtained may also be more accurate.
- the target network can also output the instance segmentation result of the image corresponding to the feature map based on the feature map. Since the first image and the second image have a common view area, the first image and the second image may contain the same object. Since the depth of the pixel in the depth map is relative to the camera coordinate system corresponding to the image where the pixel is located, the depth of the same pixel in different camera coordinate systems is converted to the camera coordinate system. There may be deviations in the unified coordinate system established by the vehicle, and the deviations may affect the accuracy of the distance between the edge point and the vehicle.
- the distance between the object and the vehicle in the third image acquired by the first camera at the second moment in the image and the instance segmentation result of the first image can be represented, and the distance between each object and the vehicle in the first image can be corrected with reference to the fixed background in the two images.
- the distance between the object and the vehicle in the fourth image acquired by the second camera at the second moment in the image and the instance segmentation result of the second image can be represented, and the distance between each object and the vehicle in the first two images can be corrected with reference to the fixed background in the two images, so that the same pixel points of different cameras are aligned in the unified coordinate system to eliminate the deviation, thereby improving the ranging accuracy.
- the first image may be calibrated according to the internal parameters of the first camera and the preset internal parameters of the fisheye camera, and then a corresponding depth map is obtained based on the calibrated first image to determine the distance between the object and the vehicle.
- the first image captured by the first camera can be calibrated by presetting the internal parameters of the fisheye camera to eliminate the deviations and further improve the ranging accuracy.
- the second image may be calibrated according to the internal parameters of the second camera and the preset pinhole camera internal parameters, and then a corresponding depth map is obtained based on the calibrated second image to determine the distance between the object and the vehicle.
- the second image captured by the second camera can be calibrated by presetting the internal parameters of the pinhole camera to eliminate the deviations and further improve the ranging accuracy.
- the objects in the first image and the second image may be three-dimensionally reconstructed according to the distances between the objects in the first image and the second image and the vehicle, and the three-dimensionally reconstructed objects may be displayed.
- performing three-dimensional reconstruction of the objects in the first image and/or the second image based on the distance between the objects in the first image and/or the second image and the vehicle and displaying the three-dimensionally reconstructed objects can help users to more intuitively understand the positional relationship between the objects in the first image and/or the second image and the vehicle.
- prompt information may be displayed according to the distance between the object in the first image and the second image and the vehicle.
- a collision warning prompt message is displayed to remind the user that the vehicle may collide with the object in the first image.
- distance prompt information is displayed to remind the user that the distance between the vehicle and the object in the second image is relatively close.
- the distance between the object in the first image and/or the second image and the vehicle is the distance between the object in the first image and/or the second image and the isometric contour of the vehicle.
- the isometric contour of the vehicle is an isometric contour set according to the outer contour of the vehicle.
- the isometric contour may be an isometric line extending outward from the outer contour line of the vehicle body in a two-dimensional (2D) top view, or may be an isometric surface extending outward from the three-dimensional (3D) outer contour of the vehicle body.
- the vehicle isometric contour may be adjusted according to the distance between the object and the vehicle in the first image and the second image.
- the color of the equidistant contour is adjusted to yellow.
- the color of the equidistant contour is adjusted to red.
- the first camera may be a rear-view fisheye camera
- the second camera may be a rear-view pinhole camera
- the first camera may be a forward-looking fisheye camera
- the second camera may be a forward-looking pinhole camera
- the first camera may be a left-looking fisheye camera
- the second camera may be a left-looking pinhole camera
- the first camera may be a right-looking fisheye camera
- the second camera may be a right-looking pinhole camera
- an embodiment of the present application provides a distance measuring device, which is applied to a vehicle including a first camera and a second camera, and the distance measuring device includes: an acquisition unit, a network unit and a determination unit.
- the acquisition unit is used to acquire a first image and a second image, the first image is an image captured by the first camera, the second image is an image captured by the second camera, the first camera and the second camera have a common viewing area, the first camera is a fisheye camera, and the second camera is a pinhole camera.
- the network unit is used to acquire a first depth map and a second depth map, the first depth map is a depth map corresponding to the first image, and the second depth map is a depth map corresponding to the second image.
- the determination unit is used to determine the distance between the object in the first image and/or the second image and the vehicle based on the first depth map and the second depth map.
- the network unit is specifically configured to: obtain a first feature map and a second feature map, wherein the first feature map The feature map is a feature map corresponding to the first image, and the second feature map is a feature map corresponding to the second image.
- a third feature map is obtained based on a first feature point of the first feature map and a plurality of target feature points corresponding to the first feature point, wherein the first feature point is an arbitrary feature point in the first feature map, and the plurality of target feature points corresponding to the first feature point are feature points in the second feature map that meet the epipolar constraint with the first feature point.
- a fourth feature map is obtained based on a second feature point of the second feature map and a plurality of target feature points corresponding to the second feature point, wherein the second feature point is an arbitrary feature point in the second feature map, and the plurality of target feature points corresponding to the second feature point are feature points in the first feature map that meet the epipolar constraint with the second feature point.
- the first depth map and the second depth map are obtained based on the third feature map and the fourth feature map.
- the determination unit is specifically used to determine the distance between the object and the vehicle in the first image and/or the second image based on the first depth map, the second depth map, the first structural semantics, and the second structural semantics, wherein the first structural semantics is used to indicate the edges and planes of the objects in the first image, and the second structural semantics is used to indicate the edges and planes of the objects in the second image.
- the first image is an image captured by the first camera at a first moment.
- the determination unit is specifically used to determine the distance between the object and the vehicle in the first image and/or the second image according to the first depth map, the second depth map, the first instance segmentation result, the second instance segmentation result, the first distance information and the second distance information, the first instance segmentation result is used to indicate the background and movable objects in the first image, the second instance segmentation result is used to indicate the background and movable objects in the second image, the first distance information is used to indicate the distance between the object and the vehicle in a third image, the third image is an image captured by the first camera at the second moment, the second distance information is used to indicate the distance between the object and the vehicle in a fourth image, and the fourth image is an image captured by the second camera at the second moment.
- the acquisition unit is further configured to: calibrate the first image according to an intrinsic parameter of the first camera and a preset fisheye camera intrinsic parameter.
- the acquisition unit is further used to: calibrate the second image according to an intrinsic parameter of the second camera and a preset pinhole camera intrinsic parameter.
- the determination unit is further used to: perform three-dimensional reconstruction of the objects in the first image and the second image according to the distance between the objects in the first image and the second image and the vehicle; and display the three-dimensionally reconstructed objects.
- the determining unit is further configured to: display prompt information according to the distance between the object and the vehicle in the first image and the second image.
- an embodiment of the present application further provides a ranging device, which includes: one or more processors, when the one or more processors execute program codes or instructions, implement the method described in the above first aspect or any possible implementation method thereof.
- the distance measuring device may further include one or more memories, and the one or more memories are used to store the program code or instruction.
- an embodiment of the present application further provides a chip, comprising: an input interface, an output interface, and one or more processors.
- the chip also includes a memory.
- the one or more processors are used to execute the code in the memory, and when the one or more processors execute the code, the chip implements the method described in the first aspect or any possible implementation thereof.
- the above chip may also be an integrated circuit.
- an embodiment of the present application further provides a computer-readable storage medium for storing a computer program, wherein the computer program includes methods for implementing the method described in the above-mentioned first aspect or any possible implementation thereof.
- an embodiment of the present application further provides a computer program product comprising instructions, which, when executed on a computer, enables the computer to implement the method described in the first aspect or any possible implementation thereof.
- the embodiment of the present application also provides a distance measuring device, including: an acquisition unit, a network unit and a determination unit.
- the acquisition unit is used to acquire a first image and a second image, the first image is an image captured by a first camera, the second image is an image captured by a second camera, the first camera and the second camera have a common viewing area, the first camera is a fisheye camera, and the second camera is a pinhole camera;
- the network unit is used to acquire a first depth map and a second depth map, the first depth map is a depth map corresponding to the first image, and the second depth map is a depth map corresponding to the second image;
- the determination unit is used to determine the distance between the object in the first image and/or the second image and the vehicle according to the first depth map and the second depth map.
- the above-mentioned distance measuring device is also used to implement the method described in the above-mentioned first aspect or any possible implementation thereof.
- an embodiment of the present application provides a ranging system, including one or more first cameras, one or more second cameras, and a computing device, wherein the one or more first cameras are used to acquire a first image, and the one or more second cameras are used to acquire a second image.
- the computing device is used to perform ranging based on the first image and the second image using the method described in the first aspect or any possible implementation thereof.
- an embodiment of the present application provides a vehicle, which includes one or more fisheye cameras, one or more pinhole cameras and one or more processors, and the one or more processors implement the method described in the first aspect or any possible implementation thereof.
- the vehicle also includes a display screen for displaying information such as road conditions, distance prompt information, a two-dimensional/three-dimensional model of the vehicle or a two-dimensional/three-dimensional model of an obstacle.
- the vehicle also includes a speaker for playing voice prompt information, and the voice prompt information may include information such as danger prompts and/or the distance between the vehicle and the obstacle. For example, when the distance between the vehicle and the obstacle is less than a preset threshold, the voice prompts the driver to pay attention to the existence of the obstacle.
- the vehicle can use only the display screen to display prompt information to remind the driver, or only the voice prompt information to remind the driver, or combine the display screen display and the voice prompt to remind the driver. For example, when the distance between the vehicle and the obstacle is lower than the first threshold, the prompt information is only displayed on the display screen. When the distance between the vehicle and the obstacle is lower than the second threshold (the second threshold is lower than the first threshold), the driver is prompted to pay attention to the obstacle while displaying the prompt information, thereby attracting the driver's attention.
- the distance measuring device, computer storage medium, computer program product and chip provided in this embodiment are all used to execute the method provided above. Therefore, the beneficial effects that can be achieved can refer to the beneficial effects in the method provided above and will not be repeated here.
- FIG1 is a schematic diagram of an image provided in an embodiment of the present application.
- FIG2 is a schematic diagram of the structure of a distance measurement system provided in an embodiment of the present application.
- FIG3 is a schematic diagram of the structure of an image acquisition system provided in an embodiment of the present application.
- FIG4 is a schematic diagram of a flow chart of a distance measurement method provided in an embodiment of the present application.
- FIG5 is a schematic diagram of the structure of a target network provided in an embodiment of the present application.
- FIG6 is a schematic diagram of extracting image features provided by an embodiment of the present application.
- FIG7 is a schematic diagram of aligning pixels provided in an embodiment of the present application.
- FIG8 is a schematic diagram of a ranging scenario provided in an embodiment of the present application.
- FIG9 is a schematic diagram of a display interface provided in an embodiment of the present application.
- FIG10 is a schematic diagram of another display interface provided in an embodiment of the present application.
- FIG11 is a schematic diagram of another ranging scenario provided in an embodiment of the present application.
- FIG12 is a schematic diagram of another display interface provided in an embodiment of the present application.
- FIG13 is a schematic diagram of another display interface provided in an embodiment of the present application.
- FIG14 is a schematic diagram of another display interface provided in an embodiment of the present application.
- FIG15 is a schematic diagram of another display interface provided in an embodiment of the present application.
- FIG16 is a schematic diagram of another display interface provided in an embodiment of the present application.
- FIG17 is a schematic diagram of another display interface provided in an embodiment of the present application.
- FIG18 is a schematic diagram of another display interface provided in an embodiment of the present application.
- FIG19 is a schematic diagram of the structure of a distance measuring device provided in an embodiment of the present application.
- FIG20 is a schematic diagram of the structure of a chip provided in an embodiment of the present application.
- FIG. 21 is a schematic diagram of the structure of an electronic device provided in an embodiment of the present application.
- a and/or B in this article is merely a description of the association relationship of associated objects, indicating that three relationships may exist.
- a and/or B can mean: A exists alone, A and B exist at the same time, and B exists alone.
- first and second in the description and drawings of the embodiments of the present application are used to distinguish different objects, or to It is used to distinguish different treatments of the same object, rather than to describe a specific order of objects.
- Image distortion is caused by the deviation of lens manufacturing precision and assembly process, which will introduce distortion and cause the original image to be distorted.
- general cameras must be dedistorted when in use, especially fisheye cameras. If dedistortion is not performed, the target size distribution in the original image of the camera will be uneven, which will cause great interference to the perception algorithm. Therefore, the original image must be dedistorted. However, information will be lost after the original image is dedistorted. The loss of information is very fatal in unmanned driving and has the potential risk of causing traffic accidents.
- Epipolar constraint describes the constraints formed by the image point and the camera optical center under the projection model when the same point is projected onto two images with different perspectives.
- the spatial point P or P' on the image point P1, its image point P2 in image 2 must be on the epipolar line e2P2, which is expressed as an epipolar constraint.
- the epipolar line is not necessarily a straight line, but may also be a curve.
- e1 is the intersection of the line connecting the optical centers O1O2 of the two corresponding cameras of the two images and the plane of image 1
- e2 is the intersection of the line connecting the optical centers O1O2 of the two corresponding cameras of the two images and the plane of image 2.
- Common visual area refers to the area of visual field that has intersection or overlap.
- Feature fusion Take the fusion of feature maps 2 and 3 corresponding to two images as an example.
- Feature map 2 is flattened into a one-dimensional feature representation as [a0, a1, ..., aH1xW1], with a length of H1xW1
- feature map 3 is flattened into a one-dimensional feature representation as [b0, b1, ...bH2xW2], with a length of H2xW2.
- the MLP network is then used to map feature C into three features, namely Q, K, and V, whose dimensions remain the same as C.
- the three mapped features are then input into the Transformer network.
- the fused feature C' is obtained, and then the fused feature C' is split into fused feature map 2 corresponding to feature map 2 and fused feature map 3 corresponding to feature map 3.
- QKT represents vector dot product.
- the depth estimation of multiple images with overlapping areas is mainly performed through the disparity estimation method.
- the disparity estimation method needs to determine the pixel point corresponding to each pixel point in the overlapping area of the image on the other image and calculate the disparity between the pixel point and the corresponding pixel point, and then calculate the depth of the pixel point through the disparity between the pixel point and the corresponding pixel point.
- the disparity estimation method can calculate the depth of all pixels in the overlapping area through disparity, but since there are no corresponding pixels in other images for pixels outside the overlapping area in the image, it is impossible to obtain the disparity of the pixels outside the overlapping area in the image, nor to calculate the depth of the pixels outside the overlapping area in the image.
- an embodiment of the present application provides a distance measurement method that can perform depth estimation on multiple images with overlapping areas.
- the method is applicable to a distance measurement system, and FIG2 shows a possible existence form of the distance measurement system.
- the distance measurement system includes an image acquisition system and a computer device.
- the image acquisition system and the computer device can communicate in a wired or wireless manner.
- An image acquisition system is used to acquire a first image and a second image having a common viewing area.
- a computer device is used to determine the distance between the object in the first image and the second image and the vehicle based on the first image and the second image having a common viewing area acquired by the image acquisition system.
- the image acquisition system may be composed of a plurality of cameras having a common viewing area.
- the image acquisition system includes a first camera, which is a camera with a field of view greater than a preset angle.
- the preset angle may be 180 degrees or 192 degrees.
- the multiple cameras may be cameras of the same specification.
- the image acquisition system can be composed of multiple fisheye cameras.
- the multiple cameras may be cameras of different specifications.
- the image acquisition system may be composed of one or more fisheye cameras and one or more pinhole cameras.
- the image acquisition system may be arranged on a vehicle.
- the above-mentioned vehicle may be a land vehicle or a non-land vehicle.
- the above-mentioned land vehicles may include a compact car, a full-size sport utility vehicle (SUV), a van, a truck, a van, a bus, a motorcycle, a bicycle, a scooter, a train, a snowmobile, a wheeled vehicle, a tracked vehicle or a rail-mounted vehicle.
- SUV sport utility vehicle
- Such non-land vehicles may include drones, airplanes, hovercraft, spacecraft, ships, and sailboats.
- the image acquisition system can be composed of four fisheye cameras (front view, rear view, left view and right view) and six pinhole cameras (front view, rear view, left and right front side view, left and right rear side view) arranged around the vehicle body, and there is a common viewing area between the fisheye cameras and the pinhole cameras.
- the image acquisition system may be composed of a front-view pinhole camera, a front-view fisheye camera, a rear-view pinhole camera, and a rear-view fisheye camera arranged around the vehicle body.
- the field of view of the front-view pinhole camera and the field of view of the front-view fisheye camera have a common viewing area
- the field of view of the rear-view pinhole camera and the field of view of the rear-view fisheye camera have a common viewing area.
- the image acquisition system may be composed of a rear-view fisheye camera and a rear-view pinhole camera, and the two have a common viewing area.
- the image acquisition system may be composed of a forward-looking fisheye camera and a forward-looking pinhole camera, and the two have a common viewing area.
- the image acquisition system may be composed of a left-viewing fisheye camera and a left-viewing pinhole camera, and the two cameras have a common viewing area.
- the image acquisition system may be composed of a right-viewing fisheye camera and a right-viewing pinhole camera, and the two cameras have a common viewing area.
- the computer device may be a terminal or a server.
- the terminal may be a vehicle-mounted terminal, a smart phone, a tablet computer, a laptop computer, a desktop computer, a smart speaker, a smart watch, a smart TV, etc., but is not limited thereto.
- the server may be an independent physical server, or a server cluster or distributed system composed of multiple physical servers, or a cloud server providing cloud computing services.
- the image acquisition system and the computer device can communicate via wired or wireless means.
- the above wireless method can achieve communication through a communication network, which can be a local area network, or a wide area network transferred through a relay device, or a local area network and a wide area network.
- a communication network can be a wifi hotspot network, a wifi P2P network, a Bluetooth network, a zigbee network, a near field communication (NFC) network, or a possible general short-range communication network in the future, a dedicated short-range communication (DSRC) network, etc.
- the above-mentioned communication network can be a third-generation mobile communication technology (3rd-generation wireless telephone technology, 3G) network, a fourth-generation mobile communication technology (the 4th-generation mobile communication technology, 4G) network, a fifth-generation mobile communication technology (5th-generation mobile communication technology, 5G) network, a public land mobile network (Public Land Mobile Network, PLMN) or the Internet, etc., and the embodiments of the present application are not limited to this.
- 3G third-generation mobile communication technology
- 4G fourth-generation mobile communication technology
- 5th-generation mobile communication technology 5th-generation mobile communication technology
- PLMN Public Land Mobile Network
- the ranging method provided in the embodiment of the present application is introduced below in conjunction with the ranging system shown in FIG. 2 .
- FIG4 shows a distance measurement method provided by an embodiment of the present application, which is applied to a vehicle, wherein the vehicle includes a first camera and a second camera.
- the method can be executed by a computer device in the distance measurement system. As shown in FIG4 , the method includes:
- a computer device acquires a first image and a second image.
- the first image is an image captured by a first camera
- the second image is an image captured by a second camera
- the first camera is a fisheye camera
- the second camera is a pinhole camera.
- the common viewing area refers to the area where the visual field has an intersecting or overlapping range.
- the first camera may be a camera with a field of view greater than a preset angle.
- the preset angle may be 180 degrees or 192 degrees.
- the computer device acquires the first image and the second image taken by a camera from the image acquisition system.
- the computer device may acquire a first image and a second image captured by a forward-looking fisheye camera and a forward-looking pinhole camera arranged on the vehicle body, wherein the forward-looking fisheye camera and the forward-looking pinhole camera have a common viewing area.
- the computer device may also acquire multiple groups of images, each group of images includes a first image and a second image, and cameras that capture the first image and the second image of the same group have a common viewing area.
- the computer device obtains 4 groups of images taken by 8 cameras from an image acquisition system consisting of 4 fisheye cameras (front view, rear view, left view and right view) and 4 pinhole cameras (front view, rear view, left view and right view) arranged around the vehicle body.
- the first group of images includes the first image 1 taken by the front fisheye camera and the second image 1 taken by the front pinhole camera, and there is a common viewing area between the front fisheye camera and the front pinhole camera.
- the second group of images includes the first image 2 taken by the rear fisheye camera and the second image 2 taken by the rear pinhole camera, and there is a common viewing area between the rear fisheye camera and the rear pinhole camera.
- the third group of images includes the first image 3 taken by the left fisheye camera and the second image 3 taken by the left pinhole camera, and there is a common viewing area between the left fisheye camera and the left pinhole camera.
- the fourth group of images includes the first image 4 taken by the right fisheye camera and the second image 4 taken by the right pinhole camera, and there is a common viewing area between the right fisheye camera and the right pinhole camera.
- the computer device may acquire multiple groups of images captured by the image acquisition system within a period of time, wherein the acquisition time of multiple first images in the same group is the same, and the acquisition time of first images in different groups is different.
- the computer device can acquire 5 groups of images, the acquisition time of the first image and the second image in the first group of images is both 10:00:00, the acquisition time of the first image and the second image in the second group of images is both 10:00:01, the acquisition time of the first image and the second image in the third group of images is both 10:00:02, the acquisition time of the first image and the second image in the fourth group of images is both 10:00:03, and the acquisition time of the first image and the second image in the fifth group of images is both 10:00:04.
- the computer device may also calibrate the coordinates of the pixel points in each of the above images according to the intrinsic parameters of the camera corresponding to each image and the preset camera intrinsic parameters.
- the first image is calibrated according to the intrinsic parameters of the first camera and the preset fisheye camera intrinsic parameters.
- the second image is calibrated according to the intrinsic parameters of the second camera and the preset pinhole camera intrinsic parameters.
- the camera intrinsic parameters of a fisheye camera include focal length (fx, fy), imaging center position (cx, cy) and distortion parameters (k1, k2, k3, k4)
- the camera internal parameters of the pinhole camera include focal length (fx, fy), imaging center position (cx, cy) and corresponding distortion parameters.
- the distortion parameters include radial distortion coefficients (k1, k2, k3) and tangential distortion coefficients (p1, p2).
- the camera external parameters are relative to the preset coordinate system, and the parameters are the three-dimensional position offset (x, y, z) and the angle between the camera optical axis and the coordinate axis (yaw, pitch, roll).
- the preset coordinate system can be the body coordinate system established relative to the vehicle.
- the inverse projection solution obtains the distorted point coordinates (xdistorted, ydistorted) on the unit depth plane.
- the coordinates (x, y) of the undistorted point on the unit depth plane are then brought into the imaging process under the intrinsic parameters of the corresponding camera of the template camera system (after distortion and projection transformation), and then the coordinates (u', v') of the calibrated pixel points are obtained based on the coordinates of the undistorted point and the preset camera intrinsic parameters.
- the correspondence between the coordinates of the calibrated pixel points and the coordinates of the calibrated pixel points is established, and each pixel point in the first image and the second image is converted according to the correspondence, so as to calibrate the camera images (the first image and the second image) into the images of the template camera.
- an interpolation algorithm may be used to smooth the calibration image.
- the image can be calibrated into the image of the template camera system.
- the pixels of the camera image are back-projected onto the unit depth plane to simulate the real light entry path, and then projected onto the template camera.
- the computer device obtains a first depth map and a second depth map.
- the first depth map is the depth map corresponding to the first image
- the second depth map is the depth map corresponding to the second image.
- Degree graph
- a computer device may obtain a first feature map and a second feature map. Then, a third feature map is obtained based on the first feature point of the first feature map and the multiple target feature points corresponding to the first feature point. Then, a fourth feature map is obtained based on the second feature point of the second feature map and the multiple target feature points corresponding to the second feature point. Then, the first depth map and the second depth map are obtained based on the third feature map and the fourth feature map.
- the first feature map is a feature map corresponding to the first image
- the second feature map is a feature map corresponding to the second image.
- the first feature point is any feature point in the first feature map
- the multiple target feature points corresponding to the first feature point are feature points in the second feature map that meet the epipolar constraint with the first feature point.
- the second feature point is any feature point in the second feature map, and the multiple target feature points corresponding to the second feature point are feature points in the first feature map that meet the epipolar constraint with the second feature point.
- feature matching of feature points through target feature points that meet the epipolar constraint in the image in the common view area corresponding to the feature points can, on the one hand, reduce the amount of calculation in the feature matching process; on the other hand, since the feature points that meet the epipolar constraint in the image in the common view area corresponding to the feature points have a high similarity with the feature points, feature matching of feature points through the target feature points of the feature points can make the matched feature points fuse the features of the target feature points, thereby increasing the recognition of the feature points and enabling the corresponding depth map to be obtained more accurately based on the feature map after feature fusion, thereby providing high ranging accuracy.
- the computer device may input the first image and the second image into the target network to obtain the first depth map and the second depth map.
- the target network may include a first subnetwork, and the first subnetwork is used to output a feature map of an image based on an input image.
- a first image with a size of HxW taken by a fisheye camera can be input into the first subnetwork to obtain a first feature map with a size of H1xW1 for characterizing features of the first image
- a second image with a size of H’xW’ taken by a pinhole camera can be input into the first subnetwork to obtain a second feature map with a size of H2xW2 for characterizing features of the second image.
- the same first subnetwork may be used to extract features of images captured by different cameras to obtain feature maps corresponding to the images.
- the resent50 feature extraction network is used to extract the features of the images captured by the pinhole camera and the fisheye camera to obtain the feature maps corresponding to the images.
- different feature extraction networks may be used for different images to extract features of the images to obtain feature maps corresponding to the images.
- the resent50 feature extraction network is used to extract the features of the image to obtain the feature map corresponding to the image.
- the resent50 feature extraction network with deformable convolution is used to extract the features of the image to obtain the feature map corresponding to the image.
- the first feature map and the second feature map may be size-aligned.
- the first feature map is a feature map of a first image captured by a fisheye camera
- the second feature map is a feature map of a second image captured by a pinhole camera.
- the focal length of the fisheye camera is N
- the focal length of the pinhole camera is 4N, that is, the focal length of the pinhole camera is 4 times that of the fisheye camera.
- the first feature map can be enlarged by 4 times to align the sizes of the first feature map and the second feature map.
- the target network may include a second subnetwork, and the second subnetwork is used to output a corresponding fused feature map according to the input feature map.
- the second subnetwork can obtain a third feature map (i.e., a fused feature map of the first feature map) based on the first feature point of the first feature map and the multiple target feature points corresponding to the first feature point.
- a fourth feature map i.e., a fused feature map of the second feature map is obtained based on the second feature point of the second feature map and the multiple target feature points corresponding to the second feature point.
- the first feature map corresponding to the first image is flattened into a one-dimensional feature representation as [a0, a1, ..., aH1xW1], with a length of H1xW1
- the second feature map corresponding to the second image is flattened into a one-dimensional feature representation as [b0, b1, ...bH2xW2], with a length of H2xW2.
- the one-dimensional feature C is then mapped into three features, Q, K, and V, using a network.
- the above operation is performed on each feature point to obtain a one-dimensional feature C', and then it is split and converted into a third feature map corresponding to the first feature map and a fourth feature map corresponding to the second feature map according to the splicing order of C.
- the target feature point may also be a feature point that meets the epipolar constraint in an image where an image corresponding to the feature point exists in a common view area, and feature points around the feature point that meets the epipolar constraint.
- the first feature map corresponding to the first image is flattened into a one-dimensional feature representation as [a0, a1, ..., aH1xW1], with a length of H1xW1
- the second feature map corresponding to the second image is flattened into a one-dimensional feature representation as [b0, b1, ... bH2xW2], with a length of H2xW2.
- the network is then used to map the one-dimensional feature C into three features, namely Q, K, and V. Assuming that the feature bi at index position i in the second feature map has n feature index positions corresponding to the depth range (dmin, dmax) in the first feature map after calculation through the epipolar constraint, they are ⁇ ad0, ad1,..., adn ⁇ .
- the target network may include a third subnetwork, and the third subnetwork is used to output a corresponding depth map according to the input feature map or fused feature map.
- the third feature map and the fourth feature map may be input into a third subnetwork of the target network to obtain a first depth map and a second depth map.
- the first feature map and the second feature map may be input into a third subnetwork of the target network to obtain a first depth map and a second depth map.
- the third sub-network is trained using a first training data sample set, and the first training data sample set includes a plurality of images and depth maps corresponding to the plurality of images.
- a real-value car with 360-degree laser scanning can be used to obtain synchronous frame data of point cloud and image. Then, the depth map corresponding to the image is obtained through the point cloud, and the third sub-network is supervised and trained using the image and the depth map corresponding to the image. At the same time, self-supervised training and consistency between temporal frames can be used to assist in training the third sub-network.
- S303 The computer device determines the distance between the object in the first image and/or the second image and the vehicle according to the first depth map and the second depth map.
- the three-dimensional coordinates of each pixel in the first image in the first camera coordinate system can be obtained according to the first depth map (or the third depth map), and the three-dimensional coordinates of each pixel in the second image in the second camera coordinate system can be obtained according to the second depth map (or the fourth depth map), and then the three-dimensional coordinates of the pixel in the first camera coordinate system and the three-dimensional coordinates of the pixel in the second camera coordinate system are converted into the coordinates of the pixel in the vehicle coordinate system, and then the distance between the object and the vehicle in the first image and/or the second image is determined by the three-dimensional coordinates of the pixel in the vehicle coordinate system.
- the first camera coordinate system is a coordinate system established with the optical center of the first camera as the coordinate origin
- the second coordinate system is a coordinate system established with the optical center of the second camera as the coordinate origin
- the vehicle coordinate system is a coordinate system established with the vehicle body reference point (such as the center of the rear axle of the vehicle) as the coordinate origin.
- some objects may not be in the common viewing area of the first camera and the second camera, but may be in the single camera.
- the objects exist solely within the field of view of the first camera or the second camera, that is, exist solely in the first image or the second image.
- the distances between these objects and the vehicle can be determined by the first depth map and the second depth map.
- the distance between the object and the vehicle in the first image and/or the second image can be determined based on the first depth map (or the third depth map), the second depth map (or the fourth depth map), the first structural semantics and the second structural semantics.
- the three-dimensional coordinates of each pixel in the first image in the first camera coordinate system can be obtained according to the first depth map (or the third depth map), and the three-dimensional coordinates of each pixel in the second image in the second camera coordinate system can be obtained according to the second depth map (or the fourth depth map), and then the three-dimensional coordinates of the pixel in the first camera coordinate system and the three-dimensional coordinates of the pixel in the second camera coordinate system are converted into the coordinates of the pixel in the vehicle coordinate system, and then the pixel points corresponding to the edge points of the target object are aligned according to the first structural semantics and the second structural semantics, and then the distance between the object in the first image and/or the second image and the vehicle is determined by the three-dimensional coordinates of the pixel in the vehicle coordinate system.
- the target object is an object that exists in both the first image and the second image.
- an edge point of an object in a given space may appear simultaneously in a first image taken by a fisheye camera and a second image taken by a pinhole camera, but since the depth map corresponding to the image corresponds to each camera, the pixel coordinates provided by the fisheye camera and the pixel coordinates provided by the pinhole camera may still produce deviations in a unified coordinate system. Therefore, the distance measurement method provided in the embodiment of the present application uses structural semantics to align the pixel points corresponding to the edge points of the target object according to the above-mentioned first structural semantics and the above-mentioned second structural semantics, which can reduce the above-mentioned deviation.
- the first image and the second image in FIG. 7 show the edge of object 1.
- the pixel points corresponding to the edge of object 1 in the first image taken by the fisheye camera are converted to a string of points [q1, q2, ..., qm] in the vehicle coordinate system, and the pixel points corresponding to the edge of object 1 in the second image taken by the pinhole camera are converted to another string of points [p1, p2, ..., pn] in the vehicle coordinate system.
- the pixel points corresponding to the edge of object 1 in the first image are rotated and translated by RT operation, they can be aligned with the pixel points corresponding to the edge of object 1 in the second image, that is, the sum of the Euclidean geometric distances between the two most adjacent points is the smallest.
- the RT matrix can be calculated by the gradient solution algorithm to align the edges of object 1 in the first image and/or the second image. Similarly, the same optimization can be performed on the pixel points of other identical objects in the first image and/or the second image.
- the distance between the object and the vehicle in the first image and/or the second image can be determined based on the first depth map (or third depth map), the second depth map (or fourth depth map), the first instance segmentation result, the second instance segmentation result, the first distance information, and the second distance information.
- the above-mentioned first instance segmentation result is used to indicate the background and movable objects in the above-mentioned first image
- the above-mentioned second instance segmentation result is used to indicate the background and movable objects in the above-mentioned second image
- the above-mentioned first distance information is used to indicate the distance between the object and the vehicle in the third image
- the above-mentioned third image is the image captured by the above-mentioned first camera at the second moment
- the above-mentioned second distance information is used to indicate the distance between the object and the vehicle in the fourth image
- the above-mentioned fourth image is the image captured by the above-mentioned second camera at the above-mentioned second moment.
- the three-dimensional coordinates of each pixel in the first image in the first camera coordinate system can be obtained according to the first depth map (or the third depth map), and the three-dimensional coordinates of each pixel in the second image in the second camera coordinate system can be obtained according to the second depth map (or the fourth depth map), and then the three-dimensional coordinates of the pixel in the first camera coordinate system and the three-dimensional coordinates of the pixel in the second camera coordinate system are converted into the coordinates of the pixel in the vehicle coordinate system, and then the distance between the object and the vehicle in the first image and/or the second image is determined by the three-dimensional coordinates of the pixel in the vehicle coordinate system. Then, the distance between the object and the vehicle in the first image and/or the second image is corrected by the above-mentioned first instance segmentation result, the above-mentioned second instance segmentation result, the first distance information and the second distance information.
- the position of the background is fixed, and the distance between the vehicle and the background at one of the two moments is determined by the position relationship of the vehicle between the two moments and the distance between the vehicle and the background at the other moment.
- the vehicle is 5 meters away from the wall at moment 1
- the vehicle travels 0.5 meters away from the wall between moment 1 and moment 2
- this relationship is used to correct the distance between the object and the vehicle in the first image and/or the second image obtained by the first instance segmentation result, the second instance segmentation result, the first distance information, and the second distance information.
- the distance measurement method obtains the first depth map and the second depth map corresponding to the above images by inputting the first image captured by a camera with a field of view greater than a preset angle and the second image captured by a camera having a common viewing area with the camera into the target network, and then determines the distance between the object in the first image and the above second image and the vehicle according to the first depth map and the second depth map.
- the camera with a field of view greater than the preset angle can make up for the blind spots inherent in the layout of the distance measurement sensor, so that the vehicle can measure the distance to obstacles in the detection blind spots.
- the related art uses ultrasonic sensors for distance measurement. Due to the blind spots in the sensor layout, the ultrasonic sensor cannot detect the suspended obstacles behind the vehicle and give the user a reminder during the vehicle reversing process, which can easily cause accidents.
- the distance measurement method provided in the embodiment of the present application uses the images collected by the first camera (such as a fisheye camera) and the second camera (such as a pinhole camera) with a common viewing area to jointly measure the distance, thereby compensating for the blind spots inherent in the distance measurement sensor layout, so that the vehicle can detect the suspended obstacles in the detection blind spots of the ultrasonic sensor and give the user a reminder, so that the user can promptly discover the suspended obstacles behind the vehicle, thereby reducing the probability of the vehicle colliding with the suspended obstacles.
- the first camera such as a fisheye camera
- the second camera such as a pinhole camera
- the ranging method provided in the embodiment of the present application may further include:
- the target network may include a fourth subnetwork, and the fourth subnetwork is used to output corresponding structural semantics according to the input image or feature map, wherein the structural semantics is used to indicate the edges and planes of objects in the image.
- the first image, the first feature map or the third feature map may be input into the fourth subnetwork of the target network to obtain the first structural semantics.
- the second image, the second feature map or the fourth feature map may be input into the fourth subnetwork of the target network to obtain the second structural semantics.
- the first structural semantics is used to indicate the edges and planes of objects in the first image
- the second structure is used to indicate the edges and planes of objects in the second image.
- the object edge of the object may be represented by a heat map of the edge, and the plane structure of the object may be represented by a three-dimensional normal vector map.
- instance segmentation and annotation can be performed on multiple images to obtain the edges of objects in the multiple images, and then the plane normal vectors of each area of the image can be calculated based on the geometric information of the point cloud and the semantic information of the instance segmentation annotation to obtain the planar structure of the objects in the multiple images. Then, the second sub-network is trained through the supervision of the object edges in the multiple images and the planar structure of the objects in the multiple images.
- the target network may include a fifth subnetwork, which is used to output corresponding structural semantics according to the input image or feature map.
- the object attribute includes the instance segmentation result of the image.
- the instance segmentation of the image is used to indicate the background and movable objects in the image.
- the instance segmentation of the image is used to indicate movable objects such as vehicles and pedestrians in the image and backgrounds such as the ground and walls in the image.
- the first image, the first feature map or the third feature map may be input into the fifth subnetwork of the target network to obtain a first instance segmentation result.
- the second image, the second feature map or the fourth feature map may be input into the fifth subnetwork of the target network to obtain a second instance segmentation result.
- the first instance segmentation result is used to indicate the instance segmentation result of the first image
- the second instance segmentation result is used to indicate the instance segmentation result of the second image.
- instance segmentation annotation may be performed on multiple images to obtain instance segmentation structures of the multiple images, and then the fifth sub-network may be trained using the multiple images and the instance segmentation structures of the multiple images.
- S306 The computer device performs three-dimensional reconstruction of the objects in the first image and the second image according to the distances between the objects in the first image and the second image and the vehicle.
- the computer device performs three-dimensional reconstruction of the objects in the first image and the second image according to the distance between the objects and the vehicle and the color information in the first image and the second image, wherein the color information is used to indicate the color of each pixel in the first image and the second image.
- the computer device displays the three-dimensionally reconstructed object.
- the terminal may display the three-dimensionally reconstructed object through a display panel.
- the server may send a display instruction to the terminal, and the terminal may display the three-dimensional image according to the display instruction after receiving the display instruction.
- the constructed object may be any suitable object that can be used to display the three-dimensional image according to the display instruction after receiving the display instruction.
- S308 The computer device displays prompt information according to the distance between the object in the first image and the second image and the vehicle.
- a collision warning prompt message is displayed to remind the user that the vehicle may collide with the object in the first image.
- distance prompt information is displayed to remind the user that the distance between the vehicle and the object in the second image is relatively close.
- the above prompt information may be text information, sound information or image information.
- the distance between the object in the first image and/or the second image and the vehicle is the distance between the object in the first image and/or the second image and the isometric contour of the vehicle.
- the isometric contour of the vehicle is an isometric contour set according to the outer contour of the vehicle.
- the isometric contour may be an isometric line extending outward from the outer contour line of the vehicle body in a two-dimensional (2D) top view, or may be an isometric surface extending outward from the three-dimensional (3D) outer contour of the vehicle body.
- the vehicle isometric profile may be adjusted according to the distance between the object and the vehicle in the first image and the second image.
- the color of the equidistant contour is adjusted to yellow.
- the color of the equidistant contour is adjusted to red.
- the method may further include: displaying a first interface.
- the first interface may include a display function control and a setting function control.
- the display function is used to display objects around the vehicle.
- the user can click the display function control to display the second interface, through which the sensor used in the display function operation process can be selected.
- the user has selected to turn on the front fisheye camera, rear fisheye camera, front pinhole camera, rear pinhole camera, laser radar and ultrasonic detector.
- object 1, object 2, and object 3 are located in the field of view of the forward-looking fisheye camera
- object 2 and object 4 are located in the field of view of the forward-looking pinhole camera
- Object 2 is located in the common field of view of the forward-looking fisheye camera and the forward-looking pinhole camera.
- the user only chooses to turn on the forward-looking fisheye camera and turn off the forward-looking pinhole camera in the second interface, then returns to the first interface and clicks on the display function control. Then the third interface shown in Figure 13 is displayed. It can be seen from the third interface shown in Figure 13 that there are objects 1, 2, and 3 in front of the vehicle. Since the forward-looking pinhole camera is turned off, the third interface shown in Figure 13 lacks object 4 within the field of view of the forward-looking pinhole camera compared to the actual scene shown in Figure 11. Therefore, there is a detection blind spot when only using the forward-looking fisheye camera for object detection, and object 4 in front of the vehicle cannot be detected.
- the user only chooses to turn on the front-view pinhole camera and turn off the front-view fisheye camera in the second interface, then returns to the first interface and clicks the display function control. Then the third interface shown in Figure 15 is displayed. It can be seen from the third interface shown in Figure 15 that there are objects 2 and 4 in front of the vehicle. Since the front-view fisheye camera is turned off, the third interface shown in Figure 15 lacks object 4 in the field of view of the front-view fisheye camera compared to the actual scene shown in Figure 11. Therefore, there is a detection blind spot when only using the front-view pinhole camera for object detection, and objects 1 and 3 in front of the vehicle cannot be found.
- the user only chooses to turn on the front pinhole camera and the front fisheye camera in the second interface, then returns to the first interface and clicks the display function control. Then the third interface shown in FIG17 is displayed. It can be seen from the third interface shown in FIG17 that there are objects 1, 2, 3 and 4 in front of the vehicle, which is consistent with the actual scene shown in FIG11.
- the embodiment of the present application can compensate for the blind spots inherent in the single sensor layout by selecting multiple sensors to measure distance together, so that the vehicle can measure the distance to obstacles in the detection blind spot of a single sensor.
- the third interface may also display prompt information. For example, when the distance between the vehicle and the object is less than 0.5 meters, the third interface displays prompt information.
- the embodiment of the present application can remind the user when the distance between an object and the vehicle is close, so that the user can deal with it in time, thereby avoiding a collision between the vehicle and the object.
- the ranging method provided in the embodiment of the present application can be integrated into a public cloud and released externally as a service.
- the distance measurement method can also protect the data uploaded by users. For example, for images, users can be required to upload images that have been encrypted in advance.
- the distance measurement method provided in the embodiment of the present application can also be integrated into a private cloud and used internally as a service.
- the distance measurement method can be determined whether to protect the user uploaded data according to actual needs.
- the ranging method provided in the embodiment of the present application may also be integrated into a hybrid cloud, wherein a hybrid cloud refers to an architecture including one or more public clouds and one or more private clouds.
- the service may provide an application programming interface (API) and/or a user interface (also referred to as a user interface).
- the user interface may be a graphical user interface (GUI) or a command user interface (CUI).
- GUI graphical user interface
- CUI command user interface
- a business system such as an operating system or a software system may directly call the API provided by the service for distance measurement, or the service may receive an image input by the user through the GUI or CUI and perform distance measurement based on the image.
- the distance measurement method provided in the embodiment of the present application can be packaged into a software package for sale, and the user can install and use it in the user's operating environment after purchasing the software package.
- the above software package can also be pre-installed in various devices (for example, desktop computers, laptops, tablet computers, smart phones, etc.), and the user purchases a device with a pre-installed software package and uses the device to measure distance based on the image.
- the distance measuring device for executing the above distance measuring method will be introduced below in conjunction with FIG. 19 .
- the ranging device includes hardware and/or software modules corresponding to the execution of each function.
- the embodiments of the present application can be implemented in the form of hardware or a combination of hardware and computer software. Whether a function is executed in the form of hardware or computer software driving hardware depends on the specific application and design constraints of the technical solution. Those skilled in the art can use different methods to implement the described functions for each specific application in combination with the embodiments, but such implementation should not be considered to exceed the scope of the embodiments of the present application.
- the embodiment of the present application can divide the functional modules of the ranging device according to the above method example.
- each functional module can be divided according to each function, or two or more functions can be integrated into one processing module.
- the above integrated module can be implemented in the form of hardware. It should be noted that the division of modules in this embodiment is schematic and is only a logical function division. There may be other division methods in actual implementation.
- Figure 19 shows a possible composition diagram of the ranging device involved in the above embodiment.
- the ranging device 1800 may include: an acquisition unit 1801, a network unit 1802 and a determination unit 1803.
- the acquisition unit 1801 is used to acquire a first image and a second image, where the first image is an image captured by a first camera, and the second image is an image captured by a second camera.
- the first camera is a fisheye camera
- the second camera is a pinhole camera.
- the network unit 1802 is configured to obtain a first depth map and a second depth map, where the first depth map is a depth map corresponding to the first image, and the second depth map is a depth map corresponding to the second image.
- the determining unit 1803 is configured to determine the distance between the object in the first image and/or the second image and the vehicle according to the first depth map and the second depth map.
- the network unit is specifically used to: perform feature extraction on the first image and the second image to obtain a first feature map and a second feature map, wherein the first feature map is a feature map corresponding to the first image, and the second feature map is a feature map corresponding to the second image.
- the network unit is specifically used to: input the third feature map into the target network to obtain the first depth map and the first structural semantics, and the first structural semantics is used to indicate the edges and planes of objects in the first image.
- the determination unit is specifically configured to determine the distance between the object and the vehicle in the first image and the second image according to the first depth map, the second depth map, the first structural semantics, and the second structural semantics.
- the network unit is specifically used to: input the third feature map into the target network to obtain the first depth map and the first instance segmentation result, and the first instance segmentation result is used to indicate the background and movable objects in the first image.
- the first image is an image captured by the first camera at a first moment
- the second image is an image captured by the second camera at the first moment
- the determination unit is specifically used to determine the distance between the object and the vehicle in the first image and the second image according to the first depth map, the second depth map, the first instance segmentation result, the second instance segmentation result, the first distance information, and the second distance information, wherein the first distance information is used to indicate the distance between the object and the vehicle in the third image, the third image is the image captured by the first camera at the second moment, and the second distance information is used to indicate the distance between the object and the vehicle in the fourth image, and the fourth image is the image captured by the second camera at the second moment.
- the acquisition unit is further used to: calibrate the first image according to an intrinsic parameter of the first camera and a preset camera intrinsic parameter.
- the second image is an image captured by a second camera
- the acquisition unit is further used to calibrate the second image according to an intrinsic parameter of the second camera and a preset camera intrinsic parameter.
- the determination unit is further configured to: perform three-dimensional reconstruction of the objects in the first image and the second image according to the distance between the objects in the first image and the second image and the vehicle, and display the three-dimensionally reconstructed objects.
- the determination unit is further configured to: display prompt information according to the distance between the object in the first image and the second image and the vehicle.
- the first camera is a fisheye camera
- the second camera is a pinhole camera
- the embodiment of the present application also provides a chip.
- the chip can be a system on chip (SOC) or other chips.
- the chip 1900 includes one or more processors 1901 and an interface circuit 1902. Optionally, the chip 1900 may also include a bus 1903.
- the processor 1901 may be an integrated circuit chip with signal processing capability. In the implementation process, each step of the above distance measurement method may be completed by an integrated logic circuit of hardware in the processor 1901 or by instructions in the form of software.
- the processor 1901 can be a general-purpose processor, a digital signal processing (DSP), an integrated circuit (ASIC), a field-programmable gate array (FPGA) or other programmable logic devices, discrete gates or transistor logic devices, discrete hardware components.
- DSP digital signal processing
- ASIC integrated circuit
- FPGA field-programmable gate array
- the methods and steps disclosed in the embodiments of the present application can be implemented or executed.
- the general-purpose processor can be a microprocessor or the processor can also be any conventional processor, etc.
- the interface circuit 1902 can be used to send or receive data, instructions or information.
- the processor 1901 can use the data, instructions or other information received by the interface circuit 1902 to process, and can send the processing completion information through the interface circuit 1902.
- the chip also includes a memory, which may include a read-only memory and a random access memory, and provides operation instructions and data to the processor.
- a portion of the memory may also include a non-volatile random access memory (NVRAM).
- NVRAM non-volatile random access memory
- the memory stores executable software modules or data structures
- the processor can perform corresponding operations by calling operation instructions stored in the memory (the operation instructions can be stored in the operating system).
- the chip can be used in the distance measuring device involved in the embodiment of the present application.
- the interface circuit 1902 can be used to output the execution result of the processor 1901.
- the distance measuring method provided by one or more embodiments of the embodiment of the present application can refer to the above embodiments, which will not be repeated here.
- processor 1901 and the interface circuit 1902 can be implemented through hardware design, software design, or a combination of hardware and software, and there is no limitation here.
- the electronic device 100 may be a mobile phone, a tablet computer, a wearable device, a vehicle-mounted device, an augmented reality (AR)/virtual reality (VR) device, a laptop computer, an ultra-mobile personal computer (UMPC), a netbook, a personal digital assistant (PDA), a model processing device, or a chip or functional module in a model processing device.
- AR augmented reality
- VR virtual reality
- UMPC ultra-mobile personal computer
- PDA personal digital assistant
- model processing device or a chip or functional module in a model processing device.
- FIG21 is a schematic diagram of the structure of an electronic device 100 provided in an embodiment of the present application.
- the electronic device 100 may include a processor 110, an external memory interface 120, an internal memory 121, a universal serial bus (USB) interface 130, charging management module 140, power management module 141, battery 142, antenna 1, antenna 2, mobile communication module 150, wireless communication module 160, audio module 170, speaker 170A, receiver 170B, microphone 170C, headphone jack 170D, sensor module 180, button 190, motor 191, indicator 192, camera 193, display screen 194, and subscriber identification module (SIM) card interface 195, etc.
- SIM subscriber identification module
- the sensor module 180 may include a pressure sensor 180A, a gyroscope sensor 180B, an air pressure sensor 180C, a magnetic sensor 180D, an acceleration sensor 180E, a distance sensor 180F, a proximity light sensor 180G, a fingerprint sensor 180H, a temperature sensor 180J, a touch sensor 180K, an ambient light sensor 180L, a bone conduction sensor 180M, etc.
- the structure illustrated in the embodiment of the present application does not constitute a specific limitation on the electronic device 100.
- the electronic device 100 may include more or fewer components than shown in the figure, or combine some components, or split some components, or arrange the components differently.
- the components shown in the figure may be implemented in hardware, software, or a combination of software and hardware.
- the processor 110 may include one or more processing units, for example, the processor 110 may include an application processor (AP), a modem processor, a graphics processor (GPU), an image signal processor (ISP), a controller, a memory, a video codec, a digital signal processor (DSP), a baseband processor, and/or a neural-network processing unit (NPU), etc.
- AP application processor
- GPU graphics processor
- ISP image signal processor
- controller a memory
- video codec a digital signal processor
- DSP digital signal processor
- NPU neural-network processing unit
- Different processing units may be independent devices or integrated in one or more processors.
- the controller may be the nerve center and command center of the electronic device 100.
- the controller may generate an operation control signal according to the instruction operation code and the timing signal to complete the control of fetching and executing instructions.
- a memory may also be provided in the processor 110 for storing instructions and data.
- the processor 110 may include one or more interfaces.
- the interface may include an inter-integrated circuit (I2C) interface, an inter-integrated circuit sound (I2S) interface, a pulse code modulation (PCM) interface, a universal asynchronous receiver/transmitter (UART) interface, a mobile industry processor interface (MIPI), a general-purpose input/output (GPIO) interface, a subscriber identity module (SIM) interface, and/or a universal serial bus (USB) interface, etc.
- I2C inter-integrated circuit
- I2S inter-integrated circuit sound
- PCM pulse code modulation
- UART universal asynchronous receiver/transmitter
- MIPI mobile industry processor interface
- GPIO general-purpose input/output
- SIM subscriber identity module
- USB universal serial bus
- the I2C interface is a bidirectional synchronous serial bus.
- the processor 110 can couple the touch sensor 180K through the I2C interface, so that the processor 110 and the touch sensor 180K communicate through the I2C bus interface to realize the touch function of the electronic device 100.
- the MIPI interface can be used to connect the processor 110 with peripheral devices such as the display screen 194 and the camera 193.
- the MIPI interface includes a camera serial interface (CSI), a display serial interface (DSI), etc.
- the processor 110 and the camera 193 communicate through the CSI interface to realize the shooting function of the electronic device 100.
- the processor 110 and the display screen 194 communicate through the DSI interface to realize the display function of the electronic device 100.
- the interface connection relationship between the modules illustrated in the embodiment of the present application is only a schematic illustration and does not constitute a structural limitation on the electronic device 100.
- the electronic device 100 may also adopt different interface connection methods in the above embodiments, or a combination of multiple interface connection methods.
- the charging management module 140 is used to receive charging input from a charger.
- the charger can be a wireless charger or a wired charger.
- the power management module 141 is used to connect the battery 142, the charging management module 140 and the processor 110.
- the power management module 141 receives input from the battery 142 and/or the charging management module 140, and provides power to the processor 110, the internal memory 121, the external memory, the display screen 194, the camera 193, and the wireless communication module 160.
- the electronic device 100 implements the display function through a GPU, a display screen 194, and an application processor.
- the GPU is a microprocessor for image processing, which connects the display screen 194 and the application processor.
- the GPU is used to perform mathematical and geometric calculations for graphics rendering.
- the processor 110 may include one or more GPUs that execute program instructions to generate or change display information.
- the display screen 194 is used to display images, videos, etc.
- the display screen 194 includes a display panel.
- the display panel can be a liquid crystal display (LCD), an organic light-emitting diode (OLED), an active-matrix organic light-emitting diode or an active-matrix organic light-emitting diode (AMOLED), a flexible light-emitting diode (FLED), Miniled, MicroLed, Micro-oLed, quantum dot light-emitting diodes (QLED), etc.
- the electronic device 100 may include 1 or N display screens 194, where N is a positive integer greater than 1.
- the electronic device 100 can realize the shooting function through ISP, camera 193, touch sensor, video codec, GPU, display screen 194 and application processor.
- the ISP is used to process the data fed back by the camera 193. For example, when taking a photo, the shutter is opened and light is transmitted to the camera through the lens. On the camera photosensitive element, the light signal is converted into an electrical signal, and the camera photosensitive element transmits the electrical signal to the ISP for processing and converts it into an image visible to the naked eye.
- the ISP can also perform algorithm optimization on the noise, brightness, and skin color of the image.
- the ISP can also optimize the exposure, color temperature and other parameters of the shooting scene.
- the ISP can be set in the camera 193.
- the camera 193 is used to capture still images or videos.
- an optical image is generated through a lens and projected onto a photosensitive element.
- the photosensitive element may be a charge coupled device (CCD) or a complementary metal oxide semiconductor (CMOS) phototransistor.
- CMOS complementary metal oxide semiconductor
- the photosensitive element converts the optical signal into an electrical signal, and then transmits the electrical signal to the ISP for conversion into a digital image signal.
- the ISP outputs the digital image signal to the DSP for processing.
- the DSP converts the digital image signal into an image signal in a standard RGB, YUV or other format. It should be understood that in the description of the embodiments of the present application, an image in RGB format is used as an example for introduction, and the embodiments of the present application do not limit the image format.
- the electronic device 100 may include 1 or N cameras 193, where N is a positive integer greater than 1.
- the digital signal processor is used to process digital signals, and can process not only digital image signals but also other digital signals. For example, when the electronic device 100 is selecting a frequency point, the digital signal processor is used to perform Fourier transform on the frequency point energy.
- Video codecs are used to compress or decompress digital videos.
- the electronic device 100 may support one or more video codecs. In this way, the electronic device 100 may play or record videos in a variety of coding formats, such as Moving Picture Experts Group (MPEG) 1, MPEG2, MPEG3, MPEG4, etc.
- MPEG Moving Picture Experts Group
- MPEG2 MPEG2, MPEG3, MPEG4, etc.
- the external memory interface 120 can be used to connect an external memory card, such as a Micro SD card, to expand the storage capacity of the electronic device 100.
- the internal memory 121 can be used to store computer executable program codes, which include instructions.
- the processor 110 executes various functional applications and data processing of the electronic device 100 by running the instructions stored in the internal memory 121.
- the internal memory 121 may include a program storage area and a data storage area.
- the electronic device 100 can implement audio functions such as music playing and recording through the audio module 170, the speaker 170A, the receiver 170B, the microphone 170C, the headphone jack 170D, and the application processor.
- the button 190 includes a power button, a volume button, etc.
- the button 190 can be a mechanical button. It can also be a touch button.
- the electronic device 100 can receive button input and generate key signal input related to the user settings and function control of the electronic device 100.
- the motor 191 can generate a vibration prompt.
- the motor 191 can be used for incoming call vibration prompts, and can also be used for touch vibration feedback. For example, touch operations acting on different applications (such as taking pictures, audio playback, etc.) can correspond to different vibration feedback effects. For touch operations acting on different areas of the display screen 194, the motor 191 can also correspond to different vibration feedback effects.
- the indicator 192 can be an indicator light, which can be used to indicate the charging status, power changes, and can also be used to indicate messages, missed calls, notifications, etc.
- the SIM card interface 195 is used to connect a SIM card.
- the electronic device 100 can be a chip system or a device with a similar structure as shown in Figure 21.
- the chip system can be composed of chips, or it can include chips and other discrete devices.
- the actions, terms, etc. involved in the various embodiments of the present application can refer to each other without limitation.
- the message name or parameter name in the message exchanged between the various devices in the embodiments of the present application is only an example, and other names can also be used in the specific implementation without limitation.
- the component structure shown in Figure 21 does not constitute a limitation on the electronic device 100.
- the electronic device 100 may include more or fewer components than those shown in Figure 21, or combine certain components, or arrange the components differently.
- the processor and transceiver described in the present application can be implemented in an integrated circuit (IC), an analog IC, a radio frequency integrated circuit, a mixed signal IC, an application specific integrated circuit (ASIC), a printed circuit board (PCB), an electronic device, etc.
- the processor and transceiver can also be manufactured using various IC process technologies, such as complementary metal oxide semiconductor (CMOS), N-type metal oxide semiconductor (NMOS), P-type metal oxide semiconductor (positive channel metal oxide semiconductor, PMOS), bipolar junction transistor (BJT), bipolar CMOS (BiCMOS), silicon germanium (SiGe), gallium arsenide (GaAs), etc.
- CMOS complementary metal oxide semiconductor
- NMOS N-type metal oxide semiconductor
- PMOS P-type metal oxide semiconductor
- BJT bipolar junction transistor
- BiCMOS bipolar CMOS
- SiGe silicon germanium
- GaAs gallium arsenide
- An embodiment of the present application further provides a distance measurement device, which includes: one or more processors.
- a distance measurement device which includes: one or more processors.
- the one or more processors execute program codes or instructions, the above-mentioned related method steps are implemented to implement the distance measurement method in the above-mentioned embodiment.
- the apparatus may further include one or more memories for storing the program code or instructions.
- An embodiment of the present application also provides a vehicle, which includes one or more fisheye cameras, one or more pinhole cameras and one or more processors, and the processors can be used in the distance measurement method in the above embodiment.
- the one or more processors can be implemented in the form of the above distance measurement device.
- the vehicle also includes a display screen for displaying information such as road conditions, distance prompt information, a two-dimensional/three-dimensional model of the vehicle or a two-dimensional/three-dimensional model of an obstacle.
- the vehicle also includes a speaker for playing voice prompt information, and the voice prompt information may include danger prompts and/or information such as the distance between the vehicle and the obstacle.
- a voice prompt is used to remind the driver to pay attention to the existence of obstacles, etc.
- the vehicle can use only the display screen to display prompt information to remind the driver, or only the voice prompt information to remind the driver, or the display screen display and voice prompt are combined to remind the driver. For example, when the distance between the vehicle and the obstacle is lower than the first threshold, only the prompt information is displayed on the display screen.
- the present application does not limit the values of the second threshold and the first threshold, it only needs to satisfy that the second threshold is lower than the first threshold, for example, the second threshold is 2 meters and the first threshold is 1 meter), the driver is prompted to pay attention to the obstacle in combination with the voice prompt while displaying, thereby attracting the driver's attention.
- the embodiment of the present application further provides a computer storage medium, in which computer instructions are stored.
- the distance measuring device executes the above-mentioned related method steps to implement the distance measuring method in the above-mentioned embodiment.
- the embodiment of the present application also provides a computer program product.
- the computer program product When the computer program product is run on a computer, the computer is enabled to execute the above-mentioned related steps to implement the ranging method in the above-mentioned embodiment.
- the embodiment of the present application also provides a distance measuring device, which can be a chip, an integrated circuit, a component or a module.
- the device may include a connected processor and a memory for storing instructions, or the device includes one or more processors for obtaining instructions from an external memory.
- the processor can execute instructions so that the chip executes the distance measuring method in the above-mentioned method embodiments.
- the size of the serial numbers of the above-mentioned processes does not mean the order of execution.
- the execution order of each process should be determined by its function and internal logic, and should not constitute any limitation on the implementation process of the embodiments of the present application.
- the disclosed systems, devices and methods can be implemented in other ways.
- the device embodiments described above are only schematic.
- the division of the above units is only a logical function division. There may be other division methods in actual implementation, such as multiple units or components can be combined or integrated into another system, or some features can be ignored or not executed.
- Another point is that the mutual coupling or direct coupling or communication connection shown or discussed can be through some interfaces, indirect coupling or communication connection of devices or units, which can be electrical, mechanical or other forms.
- the units described above as separate components may or may not be physically separated, and the components shown as units may or may not be physical units, that is, they may be located in one place or distributed on multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of this embodiment.
- each functional unit in each embodiment of the present application may be integrated into one processing unit, or each unit may exist physically separately, or two or more units may be integrated into one unit.
- the above functions are implemented in the form of software functional units and sold or used as independent products, they can be stored in a computer-readable storage medium.
- the technical solution of the present application or the part that contributes to the prior art or the part of the technical solution, can be embodied in the form of a software product.
- the computer software product is stored in a storage medium, including several instructions to enable a computer device (which can be a personal computer, server, or network device, etc.) to execute all or part of the steps of the above methods in each embodiment of the present application.
- the aforementioned storage media include: U disk, mobile hard disk, read-only memory (ROM), random access memory (RAM), disk or optical disk, and other media that can store program codes.
Landscapes
- Engineering & Computer Science (AREA)
- Radar, Positioning & Navigation (AREA)
- Remote Sensing (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Computer Networks & Wireless Communication (AREA)
- Electromagnetism (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Theoretical Computer Science (AREA)
- Acoustics & Sound (AREA)
- Image Analysis (AREA)
- Traffic Control Systems (AREA)
Abstract
Description
Claims (20)
- 一种测距方法,应用于车辆,所述车辆包括第一相机和第二相机,其特征在于,包括:获取第一图像和第二图像,所述第一图像为所述第一相机采集的图像,所述第二图像为所述第二相机采集的图像,所述第一相机和所述第二相机存在共视区域,所述第一相机为鱼眼相机,所述第二相机为针孔相机;获取第一深度图和第二深度图,所述第一深度图为所述第一图像对应的深度图,所述第二深度图为所述第二图像对应的深度图;根据所述第一深度图和所述第二深度图确定所述第一图像和/或所述第二图像中的物体与所述车辆之间的距离。
- 根据权利要求1所述的方法,其特征在于,所述获取第一深度图和第二深度图,包括:获取第一特征图和第二特征图,所述第一特征图为所述第一图像对应的特征图,所述第二特征图为所述第二图像对应的特征图;根据所述第一特征图的第一特征点及所述第一特征点对应的多个目标特征点,得到第三特征图,所述第一特征点为所述第一特征图中的任意特征点,所述第一特征点对应的多个目标特征点为所述第二特征图中与所述第一特征点符合极线约束的特征点;根据所述第二特征图的第二特征点及所述第二特征点对应的多个目标特征点,得到第四特征图,所述第二特征点为所述第二特征图中的任意特征点,所述第二特征点对应的多个目标特征点为所述第一特征图中与所述第二特征点符合极线约束的特征点;根据所述第三特征图和所述第四特征图,得到所述第一深度图和所述第二深度图。
- 根据权利要求1或2所述的方法,其特征在于,所述根据所述第一深度图和所述第二深度图确定所述第一图像和/或所述第二图像中的物体与车辆之间的距离,包括:根据所述第一深度图、所述第二深度图、第一结构语义和第二结构语义,确定所述第一图像和/或所述第二图像中的物体与车辆之间的距离,其中,所述第一结构语义用于指示所述第一图像中的物体边缘和平面,所述第二结构语义用于指示所述第二图像中的物体边缘和平面。
- 根据权利要求1或2所述的方法,其特征在于,所述第一图像为所述第一相机在第一时刻采集的图像,所述第二图像为所述第二相机在所述第一时刻采集的图像,所述根据所述第一深度图和所述第二深度图确定所述第一图像和/或所述第二图像中的物体与车辆之间的距离,包括:根据所述第一深度图、所述第二深度图、第一实例分割结果、第二实例分割结果、第一距离信息和第二距离信息,确定所述第一图像和/或所述第二图像中的物体与车辆之间的距离,其中,所述第一实例分割结果用于指示所述第一图像中的背景和可移动物体,所述第二实例分割结果用于指示所述第二图像中的背景和可移动物体,所述第一距离信息用于指示第三图像中物体与车辆之间的距离,所述第三图像为所述第一相机在第二时刻采集的图像,所述第二距离信息用于指示第四图像中物体与车辆之间的距离,所述第四图像为所述第二相机在所述第二时刻采集的图像。
- 根据权利要求1至4中任一项所述的方法,其特征在于,所述方法还包括:根据所述第一相机的内参和预设鱼眼相机内参对所述第一图像进行校准。
- 根据权利要求1至5中任一项所述的方法,其特征在于,所述方法还包括:根据所述第二相机的内参和预设针孔相机内参对所述第二图像进行校准。
- 根据权利要求1至6中任一项所述的方法,其特征在于,所述方法还包括:根据所述第一图像和所述第二图像中的物体与车辆之间的距离对所述第一图像和所述第二图像中的物体进行三维重建;显示三维重建后的物体。
- 根据权利要求1至7中任一项所述的方法,其特征在于,所述方法还包括:根据所述第一图像和所述第二图像中的物体与车辆之间的距离显示提示信息。
- 一种测距装置,应用于包括第一相机和第二相机的车辆,其特征在于,包括:获取单元、网络单元和确定单元;所述获取单元,用于获取第一图像和第二图像,所述第一图像为所述第一相机采集的图像,所述第二图像为所述第二相机采集的图像,所述第一相机和所述第二相机存在共视区域,所述第一相机为 鱼眼相机,所述第二相机为针孔相机;所述网络单元,用于获取第一深度图和第二深度图,所述第一深度图为所述第一图像对应的深度图,所述第二深度图为所述第二图像对应的深度图;所述确定单元,用于根据所述第一深度图和所述第二深度图确定所述第一图像和/或所述第二图像中的物体与所述车辆之间的距离。
- 根据权利要求9所述的装置,其特征在于,所述网络单元具体用于:获取第一特征图和第二特征图,所述第一特征图为所述第一图像对应的特征图,所述第二特征图为所述第二图像对应的特征图;根据所述第一特征图的第一特征点及所述第一特征点对应的多个目标特征点,得到第三特征图,所述第一特征点为所述第一特征图中的任意特征点,所述第一特征点对应的多个目标特征点为所述第二特征图中与所述第一特征点符合极线约束的特征点;根据所述第二特征图的第二特征点及所述第二特征点对应的多个目标特征点,得到第四特征图,所述第二特征点为所述第二特征图中的任意特征点,所述第二特征点对应的多个目标特征点为所述第一特征图中与所述第二特征点符合极线约束的特征点;根据所述第三特征图和所述第四特征图,得到所述第一深度图和所述第二深度图。
- 根据权利要求9或10所述的装置,其特征在于,所述确定单元具体用于:根据所述第一深度图、所述第二深度图、第一结构语义和第二结构语义,确定所述第一图像和/或所述第二图像中的物体与车辆之间的距离,其中,所述第一结构语义用于指示所述第一图像中的物体边缘和平面,所述第二结构语义用于指示所述第二图像中的物体边缘和平面。
- 根据权利要求9至11中任一项所述的装置,其特征在于,所述第一图像为所述第一相机在第一时刻采集的图像,所述第二图像为第二相机在所述第一时刻采集的图像,所述确定单元具体用于:根据所述第一深度图、所述第二深度图、第一实例分割结果、第二实例分割结果、第一距离信息和第二距离信息,确定所述第一图像和/或所述第二图像中的物体与车辆之间的距离,所述第一实例分割结果用于指示所述第一图像中的背景和可移动物体,所述第二实例分割结果用于指示所述第二图像中的背景和可移动物体,所述第一距离信息用于指示第三图像中物体与车辆之间的距离,所述第三图像为所述第一相机在第二时刻采集的图像,所述第二距离信息用于指示第四图像中物体与车辆之间的距离,所述第四图像为所述第二相机在所述第二时刻采集的图像。
- 根据权利要求9至12中任一项所述的装置,其特征在于,所述获取单元还用于:根据所述第一相机的内参和预设鱼眼相机内参对所述第一图像进行校准。
- 根据权利要求9至13中任一项所述的装置,其特征在于,所述获取单元还用于:根据所述第二相机的内参和预设针孔相机内参对所述第二图像进行校准。
- 根据权利要求9至14中任一项所述的装置,其特征在于,所述确定单元还用于:根据所述第一图像和所述第二图像中的物体与车辆之间的距离对所述第一图像和所述第二图像中的物体进行三维重建;显示三维重建后的物体。
- 根据权利要求9至15中任一项所述的装置,其特征在于,所述确定单元还用于:根据所述第一图像和所述第二图像中的物体与车辆之间的距离显示提示信息。
- 一种测距装置,包括一个或多个处理器和存储器,其特征在于,所述一个或多个处理器执行存储在存储器中的程序或指令,以使得所述测距装置实现上述权利要求1至8中任一项所述的方法。
- 一种车辆,其特征在于,所述车辆包括一个或多个鱼眼相机、一个或多个针孔相机和一个或多个处理器,所述处理器用于执行计算机指令以实现如上述权利要求1至8中任一项所述的方法。
- 一种计算机可读存储介质,用于存储计算机程序,其特征在于,当所述计算机程序在计算机或处理器运行时,使得所述计算机或所述处理器实现上述权利要求1至8中任一项所述的方法。
- 一种计算机程序产品,所述计算机程序产品中包含指令,其特征在于,当所述指令在计算机或处理器上运行时,使得所述计算机或所述处理器实现上述权利要求1至8中任一项所述的方法。
Priority Applications (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| EP23884311.4A EP4575579A4 (en) | 2022-10-31 | 2023-07-20 | TELEMETRY METHOD AND DEVICE |
| US19/193,543 US20250259319A1 (en) | 2022-10-31 | 2025-04-29 | Ranging method and apparatus |
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202211349288.4A CN118011428A (zh) | 2022-10-31 | 2022-10-31 | 测距方法和装置 |
| CN202211349288.4 | 2022-10-31 |
Related Child Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US19/193,543 Continuation US20250259319A1 (en) | 2022-10-31 | 2025-04-29 | Ranging method and apparatus |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2024093372A1 true WO2024093372A1 (zh) | 2024-05-10 |
Family
ID=90929602
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/CN2023/108397 Ceased WO2024093372A1 (zh) | 2022-10-31 | 2023-07-20 | 测距方法和装置 |
Country Status (4)
| Country | Link |
|---|---|
| US (1) | US20250259319A1 (zh) |
| EP (1) | EP4575579A4 (zh) |
| CN (1) | CN118011428A (zh) |
| WO (1) | WO2024093372A1 (zh) |
Cited By (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| EP4485355A1 (en) * | 2023-06-27 | 2025-01-01 | Lite-On Technology Corporation | Ranging system and ranging method thereof |
Families Citing this family (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN119205869B (zh) * | 2024-08-09 | 2025-11-25 | 南方科技大学 | 一种基于多模态融合的图像深度估计方法、装置及设备 |
| CN120783316A (zh) * | 2025-06-26 | 2025-10-14 | 上海保隆汽车科技股份有限公司 | 一种基于双相机组合的道路障碍检测方法、检测设备及计算机可读存储介质 |
Citations (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN109708655A (zh) * | 2018-12-29 | 2019-05-03 | 百度在线网络技术(北京)有限公司 | 导航方法、装置、车辆及计算机可读存储介质 |
| DE102019100303A1 (de) * | 2019-01-08 | 2020-07-09 | HELLA GmbH & Co. KGaA | Verfahren und Vorrichtung zum Ermitteln einer Krümmung einer Fahrbahn |
| CN114782911A (zh) * | 2022-06-20 | 2022-07-22 | 小米汽车科技有限公司 | 图像处理的方法、装置、设备、介质、芯片及车辆 |
Family Cites Families (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN112639864B (zh) * | 2020-05-19 | 2022-05-27 | 华为技术有限公司 | 用于测距的方法和装置 |
-
2022
- 2022-10-31 CN CN202211349288.4A patent/CN118011428A/zh active Pending
-
2023
- 2023-07-20 WO PCT/CN2023/108397 patent/WO2024093372A1/zh not_active Ceased
- 2023-07-20 EP EP23884311.4A patent/EP4575579A4/en active Pending
-
2025
- 2025-04-29 US US19/193,543 patent/US20250259319A1/en active Pending
Patent Citations (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN109708655A (zh) * | 2018-12-29 | 2019-05-03 | 百度在线网络技术(北京)有限公司 | 导航方法、装置、车辆及计算机可读存储介质 |
| DE102019100303A1 (de) * | 2019-01-08 | 2020-07-09 | HELLA GmbH & Co. KGaA | Verfahren und Vorrichtung zum Ermitteln einer Krümmung einer Fahrbahn |
| CN114782911A (zh) * | 2022-06-20 | 2022-07-22 | 小米汽车科技有限公司 | 图像处理的方法、装置、设备、介质、芯片及车辆 |
Non-Patent Citations (1)
| Title |
|---|
| See also references of EP4575579A4 |
Cited By (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| EP4485355A1 (en) * | 2023-06-27 | 2025-01-01 | Lite-On Technology Corporation | Ranging system and ranging method thereof |
| US12470834B2 (en) | 2023-06-27 | 2025-11-11 | Lite-On Technology Corporation | Ranging system and ranging method thereof |
Also Published As
| Publication number | Publication date |
|---|---|
| US20250259319A1 (en) | 2025-08-14 |
| EP4575579A4 (en) | 2025-11-26 |
| EP4575579A1 (en) | 2025-06-25 |
| CN118011428A (zh) | 2024-05-10 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US11989350B2 (en) | Hand key point recognition model training method, hand key point recognition method and device | |
| WO2024093372A1 (zh) | 测距方法和装置 | |
| US10726579B1 (en) | LiDAR-camera calibration | |
| CN118160007A (zh) | 图像修改技术 | |
| US12592086B2 (en) | Pose determining method and related device | |
| TWI755762B (zh) | 目標跟蹤方法、智慧移動設備和儲存介質 | |
| WO2021203868A1 (zh) | 数据处理的方法和装置 | |
| CN114881863B (zh) | 一种图像拼接方法、电子设备及计算机可读存储介质 | |
| CN111950426A (zh) | 目标检测方法、装置及运载工具 | |
| CN114616586A (zh) | 图像标注方法、装置、电子设备及计算机可读存储介质 | |
| CN114339185A (zh) | 用于车辆相机图像的图像彩色化 | |
| CN116052461A (zh) | 虚拟车位确定方法、显示方法、装置、设备、介质及程序 | |
| CN111538009A (zh) | 雷达点的标记方法和装置 | |
| CN119339355A (zh) | 一种基于多视角的多模态信息融合目标检测方法 | |
| WO2021232222A1 (zh) | 用于测距的方法和装置 | |
| US20240048843A1 (en) | Local compute camera calibration | |
| CN117441190A (zh) | 一种部位定位方法及装置 | |
| US11615544B2 (en) | Systems and methods for end-to-end map building from a video sequence using neural camera models | |
| WO2021259287A1 (zh) | 深度图生成方法、设备及存储介质 | |
| US12567239B2 (en) | Hybrid system for feature detection and descriptor generation | |
| CN115358937B (zh) | 图像去反光方法、介质及电子设备 | |
| US12309342B2 (en) | Photographing method, image processing method, and electronic device | |
| KR102534449B1 (ko) | 이미지 처리 방법, 장치, 전자 장치 및 컴퓨터 판독 가능 저장 매체 | |
| CN114581602A (zh) | 一种3d成像方法、装置、电子设备及存储介质 | |
| CN115393237A (zh) | 图像去反光方法、介质及电子设备 |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 23884311 Country of ref document: EP Kind code of ref document: A1 |
|
| WWE | Wipo information: entry into national phase |
Ref document number: 2023884311 Country of ref document: EP |
|
| ENP | Entry into the national phase |
Ref document number: 2023884311 Country of ref document: EP Effective date: 20250320 |
|
| NENP | Non-entry into the national phase |
Ref country code: DE |
|
| WWP | Wipo information: published in national office |
Ref document number: 2023884311 Country of ref document: EP |