WO2019214201A1 - 活体检测方法及装置、系统、电子设备、存储介质 - Google Patents
活体检测方法及装置、系统、电子设备、存储介质 Download PDFInfo
- Publication number
- WO2019214201A1 WO2019214201A1 PCT/CN2018/115499 CN2018115499W WO2019214201A1 WO 2019214201 A1 WO2019214201 A1 WO 2019214201A1 CN 2018115499 W CN2018115499 W CN 2018115499W WO 2019214201 A1 WO2019214201 A1 WO 2019214201A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- target object
- information
- sensor
- feature information
- key point
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V40/00—Recognition of biometric, human-related or animal-related patterns in image or video data
- G06V40/10—Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
- G06V40/16—Human faces, e.g. facial parts, sketches or expressions
- G06V40/161—Detection; Localisation; Normalisation
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/60—Type of objects
- G06V20/64—Three-dimensional [3D] objects
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V40/00—Recognition of biometric, human-related or animal-related patterns in image or video data
- G06V40/10—Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
- G06V40/16—Human faces, e.g. facial parts, sketches or expressions
- G06V40/161—Detection; Localisation; Normalisation
- G06V40/166—Detection; Localisation; Normalisation using acquisition arrangements
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/25—Fusion techniques
- G06F18/253—Fusion techniques of extracted features
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F21/00—Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
- G06F21/30—Authentication, i.e. establishing the identity or authorisation of security principals
- G06F21/31—User authentication
- G06F21/32—User authentication using biometric data, e.g. fingerprints, iris scans or voiceprints
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/50—Depth or shape recovery
- G06T7/55—Depth or shape recovery from multiple images
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/10—Image acquisition
- G06V10/12—Details of acquisition arrangements; Constructional details thereof
- G06V10/14—Optical characteristics of the device performing the acquisition or on the illumination arrangements
- G06V10/143—Sensing or illuminating at different wavelengths
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/77—Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
- G06V10/80—Fusion, i.e. combining data from various sources at the sensor level, preprocessing level, feature extraction level or classification level
- G06V10/806—Fusion, i.e. combining data from various sources at the sensor level, preprocessing level, feature extraction level or classification level of extracted features
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/82—Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V40/00—Recognition of biometric, human-related or animal-related patterns in image or video data
- G06V40/10—Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V40/00—Recognition of biometric, human-related or animal-related patterns in image or video data
- G06V40/10—Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
- G06V40/16—Human faces, e.g. facial parts, sketches or expressions
- G06V40/168—Feature extraction; Face representation
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V40/00—Recognition of biometric, human-related or animal-related patterns in image or video data
- G06V40/10—Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
- G06V40/16—Human faces, e.g. facial parts, sketches or expressions
- G06V40/168—Feature extraction; Face representation
- G06V40/171—Local features and components; Facial parts ; Occluding parts, e.g. glasses; Geometrical relationships
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V40/00—Recognition of biometric, human-related or animal-related patterns in image or video data
- G06V40/10—Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
- G06V40/16—Human faces, e.g. facial parts, sketches or expressions
- G06V40/172—Classification, e.g. identification
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V40/00—Recognition of biometric, human-related or animal-related patterns in image or video data
- G06V40/40—Spoof detection, e.g. liveness detection
- G06V40/45—Detection of the body part being alive
Definitions
- the present disclosure relates to the field of computer vision technology, and in particular, to a living body detection method and device, system, electronic device, and storage medium.
- face recognition technology has been widely used in scenes such as face unlocking, face payment, unmanned supermarkets and video surveillance.
- the face recognition technology has a risk of being easily attacked by a prosthetic face in the form of a physical photograph of a face, an electronic photograph of a face, or a video containing a face. Therefore, living body detection is an indispensable part of face recognition.
- the embodiment of the present disclosure proposes a living body detecting method and apparatus.
- a living body detecting method includes: acquiring depth information of a target object sensed by a first sensor and a target image sensed by a second sensor; performing key on the target image Point detection, obtaining key point information of the target object; and obtaining a living body detection result of the target object based on depth information of the target object and key point information of the target object.
- the target object is a human face.
- the second sensor is an image sensor, for example, the second sensor is an RGB sensor or a near infrared sensor.
- the first sensor is a depth sensor, eg, the first sensor is a time of flight TOF sensor or a structured light sensor.
- the first sensor and the second sensor are integrated in the same device, such as integrated in a 3D camera.
- the method before performing key point detection on the target image, the method further includes:
- obtaining a living body detection result of the target object including:
- the first feature information is obtained based on the depth information of the target object and the key point information of the target object, including: inputting depth information of the target object and key point information of the target object Processing by the first neural network to obtain first feature information;
- And obtaining the second feature information based on the key point information of the target object including: inputting the target image and the key point information of the target object into the second neural network for processing, to obtain second feature information.
- the first neural network and the second neural network have the same network structure.
- the first feature information is obtained based on the depth information of the target object and the key point information of the target object, including: performing depth information of the target object and key point information of the target object Convolution processing, obtaining a first convolution result; performing downsampling processing on the first convolution result to obtain a first down sampling result; and obtaining first feature information based on the first down sampling result.
- the second feature information is obtained based on the key point information of the target object, including:
- determining the living body detection result of the target object based on the first feature information and the second feature information comprises: performing fusion processing on the first feature information and the second feature information Obtaining third feature information; and determining a living body detection result of the target object according to the third feature information.
- determining the living body detection result according to the third feature information includes:
- the living body detection result of the target object is determined according to the probability that the target object is a living body.
- a living body detecting apparatus including:
- An acquiring module configured to acquire depth information of the target object sensed by the first sensor and a target image sensed by the second sensor;
- a detection module configured to perform key point detection on the target image to obtain key point information of the target object
- a determining module configured to obtain a living body detection result of the target object based on the depth information of the target object and the key point information of the target object.
- the target object is a human face.
- the second sensor is an image sensor, for example, the second sensor is an RGB sensor or a near infrared sensor.
- the first sensor is a depth sensor, eg, the first sensor is a time of flight TOF sensor or a structured light sensor.
- the first sensor and the second sensor are integrated in the same device, such as integrated in a 3D camera.
- the apparatus further includes an alignment module configured to align the depth information of the target object and the target image based on parameters of the first sensor and parameters of the second sensor.
- the determining module includes: a first determining submodule configured to obtain first feature information based on depth information of the target object and key point information of the target object; and a second determining submodule, And configured to obtain second feature information based on the key point information of the target object; and the third determining submodule is configured to determine a living body detection result of the target object based on the first feature information and the second feature information.
- the first determining submodule is configured to: input depth information of the target object and key point information of the target object into the first neural network for processing, to obtain first feature information;
- the second determining sub-module is configured to: input the target image and the key point information of the target object into the second neural network for processing, to obtain second feature information.
- the first neural network and the second neural network have the same network structure.
- the first determining submodule includes: a first convolution unit configured to perform convolution processing on depth information of the target object and key point information of the target object to obtain a first convolution a first sampling unit configured to perform a downsampling process on the first convolution result to obtain a first down sampling result.
- the first determining unit is configured to obtain the first feature based on the first down sampling result. information.
- the second determining submodule includes: a second convolution unit configured to perform convolution processing on the target image and the key point information of the target object to obtain a second convolution result;
- the second downsampling unit is configured to perform a downsampling process on the second convolution result to obtain a second down sampling result.
- the second determining unit is configured to obtain second feature information based on the second down sampling result.
- the third determining submodule includes: a fully connected unit configured to perform fusion processing on the first feature information and the second feature information to obtain third feature information; and a third determining unit, And configured to determine a living body detection result of the target object according to the third feature information.
- the third determining unit includes: a first determining subunit configured to obtain, according to the third feature information, a probability that the target object is a living body; and a second determining subunit configured to be configured according to the The probability that the target object is a living body determines the living body detection result of the target object.
- the living body detecting apparatus provided by the embodiment of the present disclosure is for performing the living body detecting method in any of the above embodiments, and includes modules and units for performing the steps and/or processes of any of the possible living body detecting methods described above.
- a living body detecting apparatus comprising: a processor; a memory configured to store processor-executable instructions; wherein the processor is configured to perform the above method.
- a non-transitory computer readable storage medium having stored thereon computer program instructions, wherein the computer program instructions are executed by a processor to implement the above method.
- a living body detecting system including the above-described living body detecting device, the first sensor, and the second sensor is provided.
- a living body detecting system including the above-described nonvolatile computer readable storage medium, the first sensor, and the second sensor is provided.
- an electronic device including:
- a first sensor configured to detect depth information of the target object
- a second sensor configured to acquire a target image including the target object
- a processor configured to perform key point detection on the target object collected by the second sensor, obtain key point information of the target object, and based on the depth information and the target object detected by the first sensor The key point information of the target object is obtained, and the living body detection result of the target object is obtained.
- the second sensor is an RGB sensor or a near infrared sensor.
- the first sensor is a time of flight TOF sensor or a structured light sensor.
- the processor is further configured to: align the depth information of the target object and the target image according to parameters of the first sensor and parameters of the second sensor.
- the processor is configured to: obtain first feature information based on depth information of the target object and key point information of the target object; and obtain a second based on key point information of the target object Feature information; determining a living body detection result of the target object based on the first feature information and the second feature information.
- the processor is configured to: input depth information of the target object and key point information of the target object into the first neural network for processing, to obtain first feature information;
- And obtaining the second feature information based on the key point information of the target object including: inputting the target image and the key point information of the target object into the second neural network for processing, to obtain second feature information.
- the processor is configured to perform convolution processing on depth information of the target object and key point information of the target object to obtain a first convolution result; and the first convolution result Performing a downsampling process to obtain a first down sampling result; and based on the first down sampling result, obtaining first feature information.
- the processor is configured to perform convolution processing on the target image and key point information of the target object to obtain a second convolution result; and downsample the second convolution result Processing, obtaining a second down sampling result; and obtaining second feature information based on the second down sampling result.
- the processor is configured to: perform fusion processing on the first feature information and the second feature information to obtain third feature information; and determine the target object according to the third feature information. The result of the in vivo test.
- the processor is configured to: obtain a probability that the target object is a living body based on the third feature information; and determine a living body detection result of the target object according to a probability that the target object is a living body .
- the living body detection method of each aspect of the present disclosure can perform living body detection by combining the depth information of the target object and the target image, thereby enabling the living body detection using the depth information of the target object and the key point information of the target object in the target image, thereby improving The accuracy of in vivo detection.
- FIG. 1 illustrates a flow chart of a living body detecting method according to an embodiment of the present disclosure.
- FIG. 2 illustrates an exemplary flowchart of a living body detecting method according to an embodiment of the present disclosure.
- FIG. 3 illustrates an exemplary flowchart of the step S13 of the living body detecting method according to an embodiment of the present disclosure.
- FIG. 4A illustrates a block diagram of a living body detecting device applied to a human face in accordance with an embodiment of the present disclosure.
- FIG. 4B shows a block diagram of the data pre-processing module of FIG. 4A, in accordance with an embodiment of the present disclosure.
- FIG. 4C shows a block diagram of the deep neural network module of FIG. 4A, in accordance with an embodiment of the present disclosure.
- FIG. 5 illustrates an exemplary flowchart of the living body detecting method step S131 according to an embodiment of the present disclosure.
- FIG. 6 illustrates an exemplary flowchart of the living body detecting method step S132 according to an embodiment of the present disclosure.
- FIG. 7 illustrates an exemplary flowchart of the living body detecting method step S133 according to an embodiment of the present disclosure.
- FIG. 8 illustrates an exemplary flowchart of the living body detecting method step S1332 according to an embodiment of the present disclosure.
- FIG. 9 illustrates a block diagram of a living body detecting apparatus according to an embodiment of the present disclosure.
- FIG. 10 illustrates an exemplary block diagram of a living body detecting apparatus according to an embodiment of the present disclosure.
- FIG. 11 is a block diagram of a living body detecting apparatus 800, according to an exemplary embodiment.
- FIG. 1 illustrates a flow chart of a living body detecting method according to an embodiment of the present disclosure.
- the method can be applied to a terminal device having a face recognition function such as a mobile phone, a tablet computer, a digital camera or an access control device.
- the method can be applied to scenes such as face unlocking, face payment, unmanned supermarket and video surveillance. As shown in FIG. 1, the method includes steps S11 to S13.
- step S11 the depth information of the target object sensed by the first sensor and the target image sensed by the second sensor are acquired.
- the target object is a human face.
- the first sensor is a three-dimensional sensor.
- the first sensor can be a ToF (Time of Flight) sensor, a structured light sensor, a binocular sensor, or other type of depth sensor.
- ToF Time of Flight
- the embodiment of the present disclosure performs the living body detection by using the depth information including the target object, and can fully excavate the depth information of the target object, thereby improving the accuracy of the living body detection.
- the embodiment of the present disclosure performs the living body detection by using the depth information including the face, and can fully exploit the depth information of the face data, thereby improving the accuracy of the living face detection.
- the first sensor is described above with a ToF sensor, a structured light sensor, and a binocular sensor, those skilled in the art can understand that the embodiments of the present disclosure are not limited thereto.
- a person skilled in the art can flexibly select the type of the first sensor according to actual application scenario requirements and/or personal preferences, as long as the depth information of the target object can be sensed by the first sensor.
- the depth information of the target object may be any information that can reflect the depth of the target object.
- the specific implementation of the depth information of the target object is not limited in the embodiment of the present disclosure.
- the depth information of the target object may be a depth image of the target object.
- the depth information of the target object may be a point cloud of the target object.
- the point cloud of the target object can record the three-dimensional coordinates of each point of the target object.
- the depth information of the target object may be a table or other type of file that records the depth of various points of the target object.
- the second sensor can be an RGB (Red, Red; Green, Green; Blue, Blue) sensor or a near infrared sensor. If the second sensor is an RGB sensor or other type of image sensor, the target image sensed by the second sensor is an RGB image. If the second sensor is a near-infrared sensor, the target image sensed by the second sensor is a near-infrared image.
- the near-infrared image may be a near-infrared image with a spot, or a near-infrared image without a spot, and the like. It should be noted that although the second sensor is described above with the RGB sensor and the near-infrared sensor, those skilled in the art can understand that the embodiment of the present disclosure is not limited thereto. A person skilled in the art can flexibly select the type of the second sensor according to the actual application scenario requirement and/or personal preference, as long as the key point information of the target object can be acquired by the target image sensed by the second sensor.
- the depth map and the target image are acquired by a 3D camera, wherein the 3D camera includes an image sensor for acquiring an image and a depth sensor for acquiring depth information.
- the terminal device collects three-dimensional information of the target object through a 3D camera set by itself.
- the depth information and the target image are acquired from other devices, for example, a living body detection request sent by the terminal device, the living body detection request carrying the depth information and the target image.
- step S12 key point detection is performed on the target image to obtain key point information of the target object.
- the key point information of the target object may include location information of a key point of the target object.
- the key points of the target object may include one or more of an eye key point, an eyebrow key point, a nose key point, a mouth key point, and a face contour key point.
- the eye key points may include one or more of an eye contour key point, an eye corner key point, and a pupil key point.
- step S13 based on the depth information of the target object and the key point information of the target object, the living body detection result of the target object is obtained.
- the result of the living body detection of the target object may be that the target object is a living body or the target object is a prosthesis.
- the living body detection result of the target object may be that the target object is a living face or the target object is a prosthetic face.
- the embodiment of the present disclosure performs living body detection by combining the depth information of the target object and the target image, thereby enabling the living body detection using the depth information of the target object and the key point information of the target object in the target image, thereby improving the accuracy of the living body detection. .
- FIG. 2 illustrates an exemplary flowchart of a living body detecting method according to an embodiment of the present disclosure. As shown in FIG. 2, the method may include steps S21 to S24.
- step S21 the depth information of the target object sensed by the first sensor and the target image sensed by the second sensor are acquired.
- the depth information of the target object and the target image are aligned according to the parameters of the first sensor and the parameters of the second sensor.
- the depth information of the target image may be subjected to a conversion process such that the converted depth information is aligned with the target image. For example, if the depth information of the target object is the depth image of the target object, determining a conversion matrix of the parameter matrix of the first sensor to the parameter matrix of the second sensor according to the parameter matrix of the first sensor and the parameter matrix of the second sensor; The conversion matrix converts the depth image of the target object.
- the target image may be subjected to a conversion process such that the converted target image is aligned with the depth information. For example, if the depth information of the target object is the depth image of the target object, determining a conversion matrix of the parameter matrix of the second sensor to the parameter matrix of the first sensor according to the parameter matrix of the first sensor and the parameter matrix of the second sensor; The conversion matrix converts the target image.
- the parameters of the first sensor may include internal parameters and/or external parameters of the first sensor
- the parameters of the second sensor may include internal parameters and/or external parameters of the second sensor
- the depth information of the target object is the depth image of the target object
- the depth image of the target object and the corresponding portion of the target image can be made in two images. The location is the same.
- step S23 key point detection is performed on the target image to obtain key point information of the target object.
- step S12 the description of step S12 above is referred to step S23.
- step S24 based on the depth information of the target object and the key point information of the target object, the living body detection result of the target object is obtained.
- step S13 the description of step S13 above is referred to step S24.
- FIG. 3 illustrates an exemplary flowchart of the step S13 of the living body detecting method according to an embodiment of the present disclosure. As shown in FIG. 3, step S13 may include steps S131 to S133.
- step S131 first feature information is obtained based on the depth information of the target object and the key point information of the target object.
- the first feature information is obtained based on the depth information of the target object and the key point information of the target object, including: inputting the depth information of the target object and the key point information of the target object into the first neural network for processing, and obtaining First feature information.
- the first neural network may include a convolutional layer, a downsampling layer, and a fully connected layer.
- the first neural network may include a primary convolution layer, a primary downsampling layer, and a first level fully connected layer.
- the level convolutional layer may include one or more convolution layers
- the level downsampling layer may include one or more downsampling layers
- the level of fully connected layers may include one or more fully connected layers.
- the first neural network can include a multi-level convolutional layer, a multi-level downsampling layer, and a first-level fully connected layer.
- each level of convolutional layer may include one or more convolutional layers
- each level of the downsampling layer may include one or more downsampling layers
- the level of fully connected layers may include one or more fully connected layers.
- the i-th convolution layer is cascaded to the i-th down-sampling layer
- the i-th down-sampling layer is cascaded to the i+1th-order convolutional layer
- the n-th down-sampling layer is cascaded to the fully-connected layer, wherein , i and n are both positive integers, 1 ⁇ i ⁇ n, and n represents the number of stages of the convolutional layer and the downsampling layer in the first neural network.
- the first neural network may include a convolutional layer, a downsampling layer, a normalized layer, and a fully connected layer.
- the first neural network may include a first convolutional layer, a normalized layer, a primary downsampling layer, and a first level fully connected layer.
- the level convolutional layer may include one or more convolution layers
- the level downsampling layer may include one or more downsampling layers
- the level of fully connected layers may include one or more fully connected layers.
- the first neural network can include a multi-level convolutional layer, a plurality of normalized layers, and a multi-level downsampling layer and a first level fully connected layer.
- each level of convolutional layer may include one or more convolutional layers
- each level of the downsampling layer may include one or more downsampling layers
- the level of fully connected layers may include one or more fully connected layers.
- the i-th convolution layer is cascaded with the i-th normalization layer
- the i-th normalization layer is cascaded with the i-th sub-sampling layer
- the i-th sub-sampling layer is cascaded with the i+1th level Convolution layer
- the nth stage downsampling layer is cascaded to the fully connected layer, where i and n are positive integers, 1 ⁇ i ⁇ n, and n represents the number of levels of the convolutional layer and the downsampling layer in the first neural network. And the number of normalized layers.
- step S132 second feature information is obtained based on the key point information of the target object.
- the second feature information is obtained based on the key point information of the target object, including: inputting the target image and the key point information of the target object into the second neural network for processing, to obtain the second feature information.
- the second neural network may include a convolutional layer, a downsampling layer, and a fully connected layer.
- the second neural network can include a primary convolution layer, a primary downsampling layer, and a first level fully connected layer.
- the level convolutional layer may include one or more convolution layers
- the level downsampling layer may include one or more downsampling layers
- the level of fully connected layers may include one or more fully connected layers.
- the second neural network can include a multi-level convolutional layer, a multi-level downsampling layer, and a first-level fully connected layer.
- each level of convolutional layer may include one or more convolutional layers
- each level of the downsampling layer may include one or more downsampling layers
- the level of fully connected layers may include one or more fully connected layers.
- the jth level convolution layer is cascaded to the jth level downsampling layer
- the jth level downsampling layer is cascaded to the j+1th level convolutional layer
- the mth level downsampling layer is cascaded to the full connection layer
- j and m are both positive integers, 1 ⁇ j ⁇ m
- m represents the number of stages of the convolutional layer and the downsampling layer in the second neural network.
- the second neural network may include a convolutional layer, a downsampling layer, a normalized layer, and a fully connected layer.
- the second neural network can include a primary convolutional layer, a normalized layer, a primary downsampling layer, and a first level fully connected layer.
- the level convolutional layer may include one or more convolution layers
- the level downsampling layer may include one or more downsampling layers
- the level of fully connected layers may include one or more fully connected layers.
- the second neural network can include a multi-level convolutional layer, a plurality of normalized layers, and a multi-level downsampling layer and a first-level fully connected layer.
- each level of convolutional layer may include one or more convolutional layers
- each level of the downsampling layer may include one or more downsampling layers
- the level of fully connected layers may include one or more fully connected layers.
- the jth level convolution layer is cascaded to the jth normalization layer
- the jth normalization layer is cascaded to the jth level downsampling layer
- the jth level downsampling layer is cascaded to the j+1th level Convolution layer
- the m-th level downsampling layer is cascaded to the fully connected layer, where j and m are positive integers, 1 ⁇ j ⁇ m, and m represents the number of levels of the convolutional layer and the downsampling layer in the second neural network. And the number of normalized layers.
- the first neural network and the second neural network have the same network structure.
- step S133 the living body detection result of the target object is determined based on the first feature information and the second feature information.
- step S131 may be performed first and then step S132 may be performed, or step S132 may be performed first and then step S131 may be performed, or step S131 and step S132 may be performed simultaneously.
- FIG. 5 illustrates an exemplary flowchart of the living body detecting method step S131 according to an embodiment of the present disclosure. As shown in FIG. 5, step S131 may include steps S1311 through S1313.
- step S1311 the depth information of the target object and the key point information of the target object are convoluted to obtain a first convolution result.
- step S1312 the first convolution result is subjected to downsampling processing to obtain a first downsampling result.
- the depth information of the target object and the key point information of the target object may be subjected to convolution processing and downsampling processing by the first convolutional layer and the first level downsampling layer.
- the level convolution layer may comprise one or more convolution layers
- the level downsampling layer may comprise one or more downsampling layers.
- the depth information of the target object and the key point information of the target object may be convoluted and downsampled by the multi-level convolution layer and the multi-level downsampling layer.
- each level of the convolution layer may include one or more convolution layers
- each level of the downsampling layer may include one or more downsampling layers.
- performing a downsampling process on the first convolution result to obtain a first downsampling result may include: normalizing the first convolution result to obtain a first normalization result; The normalized result is subjected to downsampling processing to obtain a first downsampling result.
- step S1313 based on the first downsampling result, the first feature information is obtained.
- the first downsampling result may be input to the fully connected layer, and the first downsampling result is subjected to a fusion process (eg, a full join operation) through the fully connected layer to obtain first feature information.
- a fusion process eg, a full join operation
- FIG. 6 illustrates an exemplary flowchart of the living body detecting method step S132 according to an embodiment of the present disclosure. As shown in FIG. 6, step S132 may include steps S1321 through S1323.
- step S1321 convolution processing is performed on the target image and the key point information of the target object to obtain a second convolution result.
- step S1322 the second convolution result is subjected to downsampling processing to obtain a second downsampling result.
- the target image and the key point information of the target object may be subjected to convolution processing and downsampling processing by the first convolutional layer and the first level downsampling layer.
- the level convolution layer may include one or more convolution layers
- the level downsampling layer may include one or more downsampling layers.
- the key point information of the target image and the target object may be convoluted and downsampled by the multi-level convolution layer and the multi-level downsampling layer.
- each level of the convolution layer may include one or more convolution layers
- each level of the downsampling layer may include one or more downsampling layers.
- performing a downsampling process on the second convolution result to obtain a second downsampling result may include: normalizing the second convolution result to obtain a second normalization result; The normalized result is subjected to downsampling processing to obtain a second downsampling result.
- step S1323 based on the second down sampling result, the second feature information is obtained.
- the second downsampling result may be input to the fully connected layer, and the second downsampling result is subjected to a fusion process (eg, a full join operation) through the fully connected layer to obtain second feature information.
- a fusion process eg, a full join operation
- FIG. 7 illustrates an exemplary flowchart of the living body detecting method step S133 according to an embodiment of the present disclosure. As shown in FIG. 7, step S133 may include step S1331 and step S1332.
- step S1331 the first feature information and the second feature information are subjected to a fusion process (for example, a full join operation) to obtain third feature information.
- a fusion process for example, a full join operation
- the first feature information and the second feature information may be connected (eg, channel superimposed) or added to obtain third feature information.
- the first feature information and the second feature information are fully connected to each other through the fully connected layer to obtain third feature information.
- step S1332 based on the third feature information, the living body detection result of the target object is determined.
- FIG. 8 illustrates an exemplary flowchart of the living body detecting method step S1332 according to an embodiment of the present disclosure. As shown in FIG. 8, step S1332 may include step S13321 and step S13322.
- step S13321 based on the third feature information, a probability that the target object is a living body is obtained.
- the third feature information may be input into the Softmax layer, and the probability that the target object is a living body is obtained through the Softmax layer.
- the Softmax layer may include two neurons, where one neuron represents the probability that the target object is a living body and the other neuron represents the probability that the target object is a prosthesis.
- step S13322 the living body detection result of the target object is determined according to the probability that the target object is a living body.
- determining a living body detection result of the target object according to a probability that the target object is a living body including: if the probability that the target object is a living body is greater than the first threshold, determining that the living body detection result of the target object is the target object is a living body; If the probability that the target object is a living body is less than or equal to the first threshold, it is determined that the living body detection result of the target object is a prosthesis.
- the probability that the target object is a prosthesis may be obtained based on the third feature information, and the biometric detection result of the target object is determined according to the probability that the target object is a prosthesis. In this implementation, if the probability that the target object is a prosthesis is greater than the second threshold, determining that the target object's living body detection result is the target object is a prosthesis; if the target object is a prosthesis, the probability is less than or equal to the second threshold, then The living body detection result of the target object is determined to be a living body.
- the embodiment of the present disclosure performs living body detection by combining the depth information of the target object and the target image, thereby enabling the living body detection using the depth information of the target object and the key point information of the target object in the target image, thereby improving the accuracy of the living body detection. And the calculation complexity is low, and the accurate living body detection result can still be obtained when the camera is slightly shaken or shaken.
- face recognition With the development of face recognition technology, the accuracy of face recognition has been able to surpass fingerprints, so it is widely used in various scenes, such as video surveillance, face unlocking, face payment and other applications.
- face recognition is at risk of being hacked, and in vivo detection is an essential part of face recognition applications.
- the monocular living body detection uses an image captured by a normal camera as an input, and has a drawback that it is easily passed by a high-definition seamless hack image.
- the binocular in vivo detection technology uses two cameras (ordinary RGB camera or ordinary near-infrared camera) as input, and the performance is superior to monocular in vivo detection.
- the calculation of the depth distribution information of the face by binocular matching has the disadvantages of large computational complexity and low accuracy of depth information, and the camera may be subject to changes in camera parameters due to shaking, vibration, etc., resulting in computational failure.
- 3D, 3Dimensions In recent years, three-dimensional (3D, 3Dimensions) sensor technology has advanced by leaps and bounds, including Time Of Flight (TOF) sensors, structured light sensors, binocular sensors, etc., enabling users to easily obtain high-precision depth directly from sensors (Sensors). information.
- Embodiments of the present disclosure take 3D data and near-infrared data or RGB color mode data as input, use near-infrared image or RGB image to obtain face key point information, and then fuse face 3D depth map, near-infrared or RGB image, person One or more of face key information, eye corner features, pupil features, etc., using the deep learning model, can more effectively distinguish between real faces and hacks.
- FIG. 4A illustrates a schematic block diagram of a living body detecting device applied to a human face according to an embodiment of the present disclosure.
- the living body detecting device includes an input module 41, a data preprocessing module 42, a deep neural network module 43, and a detection result output module 44.
- the input module 41 is suitable for data input of different hardware modules.
- the data input form of the input module includes one or more of the following: a depth map, a pure near infrared image, a near infrared image with a spot, an RGB image, and the like. .
- the specific combination is determined by different hardware schemes.
- the data preprocessing module 42 is configured to preprocess the data input by the input module to obtain data required by the deep neural network.
- 4B shows an exemplary block diagram of one implementation of the data pre-processing module 42 of FIG. 4A, wherein the input of the data pre-processing module includes: a depth map obtained by the depth sensor and an image sensor, in accordance with an embodiment of the present disclosure.
- the obtained image pure near-infrared image, infrared image with spot, RGB image, etc.
- the depth map 421 and the near-infrared image 422 are used as inputs to the data pre-processing module 42.
- the data preprocessing module processes the input data as follows: image alignment correction 423 and face key detection 424, wherein face key detection can be implemented using a face keypoint model.
- the image alignment correction 423 if the depth map and the near-infrared map (or RGB map) are not synchronously aligned, the input depth map and the near-infrared map need to be aligned/corrected according to the camera's parameter matrix to achieve image alignment.
- a near-infrared image (or RGB image) is input to the face key point model for face key point detection, and face key point information 425 is obtained.
- the output of the data preprocessing module corresponds to the input, including the aligned corrected face depth map (corresponding to the input depth map 421) and the face near infrared map (corresponding to the input near infrared map 422) and the person Face key information.
- the deep neural network module 43 is a two-category model, for example, for a real face, the label of the classification is 0; for a human face of the hack, the label of the classification is 1. For another example, for a real face, the label of the classification is 1; for the face of the hack, the label of the classification is 0, and so on.
- 4C illustrates a block diagram of one example of the deep neural network module of FIG. 4A, as shown in FIG.
- the input of the deep neural network module includes a face depth map obtained after passing through the data preprocessing module, in accordance with an embodiment of the present disclosure. 431.
- a face near infrared map 432 (or other form of a two-dimensional face image) and face key point information 433.
- the output of the deep neural network module includes a discriminant score, ie, a probability of being determined as a real person or a hack.
- the output of the deep neural network is a binary value, and the output score is compared with a preset threshold, wherein the threshold setting can be adjusted according to the accuracy and the recall rate. For example, if the output score of the neural network is greater than the threshold, then It is judged as a hack, and if it is smaller than the threshold, it is judged to be a living body, and the like.
- the deep neural network is a multi-branch model, and the number of branches is determined by the number of input images.
- FIG. 4C takes a face depth map and a face near infrared image as an example, and the deep neural network includes Two branches, each branch includes a plurality of convolution layers 434, a downsampling layer 435, and a fully connected layer 436, wherein the face depth map 431 and the face key point information 433 are input to the first branch for feature extraction processing, The face near infrared map 432 and the face key information 433 are input to the second branch for feature extraction processing, and finally the features extracted by the multiple branches are connected together and input to the fully connected layer 437, and finally processed by the Softmax layer 438.
- the number of neurons in the output layer is 2, representing the probability of real people and hacks.
- the inputs of the two branches of FIG. 4C all include face key point information
- the all-connection layer 437 integrates the feature information of the output of the two-branch fully connected layer 436 by using the face key point information.
- the output of the fully connected layer 436 in the first branch is the first feature information
- the output of the fully connected layer 436 in the second branch is the second feature information
- the fully connected layer 437 passes the face keypoint information.
- the full join operation fuses the first feature information and the second feature information together.
- the face key point information is used to fuse the face depth map and the face near infrared image to obtain the final output result.
- the output is identified as 0; for a hacked face, the output is identified as 1, but this embodiment of the present disclosure does not limit this.
- a 3D sensor with depth information and other auxiliary images such as near infrared images, RGB images, etc.
- auxiliary images such as near infrared images, RGB images, etc.
- the proposed framework can be applied to a variety of 3D sensor input forms, including 3D depth map + near infrared image provided by TOF camera; 3D depth map provided by structured light camera + near infrared image with spot; 3D depth map + RGB map; 3D depth map + near infrared map + RGB map and other forms including 3D depth map and near infrared map or RGB map.
- the general camera and the binocular are mainly used, and the depth information of the face data is not fully exploited, and there is a disadvantage that the HD face is easily attacked by the high-definition hack, and the face collected by the 3D sensor in the embodiment of the present disclosure is used. Depth maps prevent flat hack attacks.
- 3D depth information, other near-infrared data or RGB data, face keypoint information, and eye corner and pupil features are blended to distinguish between real people and hacks through the training of deep learning models.
- single data is dominant, and the correlation and complementarity between multimodal data are not utilized. That is to say, the conventional binocular calculation depth method has the disadvantages of high computational complexity and low precision, and the embodiment of the present disclosure can effectively utilize the current 3D sensing technology to obtain more accurate 3D face data distribution.
- a multi-branch model is integrated, and the multi-branch model can fully integrate multi-modal data, and is compatible with various data forms, and can learn real facial information features through neural networks.
- the embodiment of the present disclosure combines face depth information, near-infrared face information or RGB map face information, face key point information, and multi-dimensional bio-feature fusion techniques such as eye corner, eye, pupil, etc., to make up for a single technology easy to be The shortcomings of hack.
- FIG. 9 illustrates a block diagram of a living body detecting apparatus according to an embodiment of the present disclosure.
- the device includes: an obtaining module 91 configured to acquire depth information of a target object sensed by the first sensor and a target image sensed by the second sensor; and a detecting module 92 configured to perform the target image
- the key point detection obtains the key point information of the target object;
- the determining module 93 is configured to obtain the living body detection result of the target object based on the depth information of the target object and the key point information of the target object.
- the target object is a human face.
- the second sensor is an RGB sensor or a near infrared sensor.
- FIG. 10 illustrates an exemplary block diagram of a living body detecting apparatus according to an embodiment of the present disclosure. As shown in Figure 10:
- the apparatus further includes an alignment module 94 configured to align the depth information and the target image of the target object based on the parameters of the first sensor and the parameters of the second sensor.
- the determining module 93 includes: a first determining submodule 931 configured to obtain first feature information based on depth information of the target object and key point information of the target object; and second determining submodule 932 configured to be based on The key point information of the target object obtains the second feature information; the third determining submodule 933 is configured to determine the living body detection result of the target object based on the first feature information and the second feature information.
- the first determining sub-module 931 is configured to: input the depth information of the target object and the key point information of the target object into the first neural network for processing to obtain the first feature information; and the second determining sub-module 932 is configured to And inputting the key image information of the target image and the target object into the second neural network for processing, to obtain second feature information.
- the first neural network and the second neural network have the same network structure.
- the first determining submodule 931 includes: a first convolution unit configured to perform convolution processing on the depth information of the target object and the key point information of the target object to obtain a first convolution result;
- the sampling unit is configured to perform a down sampling process on the first convolution result to obtain a first down sampling result.
- the first determining unit is configured to obtain the first feature information based on the first down sampling result.
- the second determining submodule 932 includes: a second convolution unit configured to perform convolution processing on the target image and the key point information of the target object to obtain a second convolution result; and a second downsampling unit,
- the second convolution result is configured to perform a down sampling process to obtain a second down sampling result.
- the second determining unit is configured to obtain second feature information based on the second down sampling result.
- the third determining submodule 933 includes: a fully connected unit configured to perform a fusion process (eg, a full join operation) on the first feature information and the second feature information to obtain third feature information; and a third determining unit And configured to determine a living body detection result of the target object according to the third feature information.
- a fully connected unit configured to perform a fusion process (eg, a full join operation) on the first feature information and the second feature information to obtain third feature information
- a third determining unit And configured to determine a living body detection result of the target object according to the third feature information.
- the third determining unit includes: a first determining subunit configured to obtain a probability that the target object is a living body based on the third feature information; and a second determining subunit configured to be a probability according to the target object being a living body, Determine the in vivo test results of the target object.
- the embodiment of the present disclosure performs living body detection by combining the depth information of the target object and the target image, thereby enabling the living body detection using the depth information of the target object and the key point information of the target object in the target image, thereby improving the accuracy of the living body detection. To prevent prosthetic image attacks.
- FIG. 11 is a block diagram of a living body detecting apparatus 800, according to an exemplary embodiment.
- device 800 can be a mobile phone, a computer, a digital broadcast terminal, a messaging device, a gaming console, a tablet device, a medical device, a fitness device, a personal digital assistant, and the like.
- apparatus 800 can include one or more of the following components: processing component 802, memory 804, power component 806, multimedia component 808, audio component 810, input/output (I/O) interface 812, sensor component 814, And a communication component 816.
- processing component 802 memory 804, power component 806, multimedia component 808, audio component 810, input/output (I/O) interface 812, sensor component 814, And a communication component 816.
- I/O input/output
- Processing component 802 typically controls the overall operation of device 800, such as operations associated with display, telephone calls, data communications, camera operations, and recording operations.
- Processing component 802 can include one or more processors 820 to execute instructions to perform all or part of the steps of the above described methods.
- processing component 802 can include one or more modules to facilitate interaction between component 802 and other components.
- processing component 802 can include a multimedia module to facilitate interaction between multimedia component 808 and processing component 802.
- Memory 804 is configured to store various types of data to support operation at device 800. Examples of such data include instructions for any application or method operating on device 800, contact data, phone book data, messages, pictures, videos, and the like.
- the memory 804 can be implemented by any type of volatile or non-volatile storage device, or a combination thereof, such as static random access memory (SRAM), electrically erasable programmable read only memory (EEPROM), erasable. Programmable Read Only Memory (EPROM), Programmable Read Only Memory (PROM), Read Only Memory (ROM), Magnetic Memory, Flash Memory, Disk or Optical Disk.
- Power component 806 provides power to various components of device 800. Power component 806 can include a power management system, one or more power sources, and other components associated with generating, managing, and distributing power for device 800.
- the multimedia component 808 includes a screen between the device 800 and the user that provides an output interface.
- the screen can include a liquid crystal display (LCD) and a touch panel (TP). If the screen includes a touch panel, the screen can be implemented as a touch screen to receive input signals from the user.
- the touch panel includes one or more touch sensors to sense touches, slides, and gestures on the touch panel. The touch sensor may sense not only the boundary of the touch or sliding action, but also the duration and pressure associated with the touch or slide operation.
- the multimedia component 808 includes a front camera and/or a rear camera. When the device 800 is in an operation mode, such as a shooting mode or a video mode, the front camera and/or the rear camera can receive external multimedia data.
- the audio component 810 is configured to output and/or input an audio signal.
- the audio component 810 includes a microphone (MIC) that is configured to receive an external audio signal when the device 800 is in an operational mode, such as a call mode, a recording mode, and a voice recognition mode.
- the received audio signal may be further stored in memory 804 or transmitted via communication component 816.
- the audio component 810 also includes a speaker for outputting an audio signal.
- the I/O interface 812 provides an interface between the processing component 802 and the peripheral interface module, which may be a keyboard, a click wheel, a button, or the like. These buttons may include, but are not limited to, a home button, a volume button, a start button, and a lock button.
- Sensor assembly 814 includes one or more sensors for providing device 800 with a status assessment of various aspects.
- sensor assembly 814 can detect an open/closed state of device 800, relative positioning of components, such as the display and keypad of device 800, and sensor component 814 can also detect a change in position of one component of device 800 or device 800. The presence or absence of user contact with device 800, device 800 orientation or acceleration/deceleration, and temperature variation of device 800.
- Sensor assembly 814 can include a proximity sensor configured to detect the presence of nearby objects without any physical contact.
- Sensor assembly 814 may also include a light sensor, such as a CMOS or CCD image sensor, for use in imaging applications.
- the sensor assembly 814 can also include an acceleration sensor, a gyro sensor, a magnetic sensor, a pressure sensor, or a temperature sensor.
- Communication component 816 is configured to facilitate wired or wireless communication between device 800 and other devices.
- the device 800 can access a wireless network based on a communication standard, such as WiFi, 2G or 3G, or a combination thereof.
- communication component 816 receives broadcast signals or broadcast associated information from an external broadcast management system via a broadcast channel.
- the communication component 816 also includes a near field communication (NFC) module to facilitate short range communication.
- NFC near field communication
- the NFC module can be implemented based on radio frequency identification (RFID) technology, infrared data association (IrDA) technology, ultra-wideband (UWB) technology, Bluetooth (BT) technology, and other technologies.
- RFID radio frequency identification
- IrDA infrared data association
- UWB ultra-wideband
- Bluetooth Bluetooth
- device 800 may be implemented by one or more application specific integrated circuits (ASICs), digital signal processors (DSPs), digital signal processing devices (DSPDs), programmable logic devices (PLDs), field programmable A gate array (FPGA), controller, microcontroller, microprocessor, or other electronic component implementation for performing the above methods.
- ASICs application specific integrated circuits
- DSPs digital signal processors
- DSPDs digital signal processing devices
- PLDs programmable logic devices
- FPGA field programmable A gate array
- controller microcontroller, microprocessor, or other electronic component implementation for performing the above methods.
- a non-transitory computer readable storage medium such as a memory 804 comprising computer program instructions executable by processor 820 of apparatus 800 to perform the above method.
- Embodiments of the present disclosure may be systems, methods, and/or computer program products.
- the computer program product can comprise a computer readable storage medium having computer readable program instructions embodied thereon for causing a processor to implement various aspects of the present disclosure.
- the computer readable storage medium can be a tangible device that can hold and store the instructions used by the instruction execution device.
- the computer readable storage medium can be, for example, but not limited to, an electrical storage device, a magnetic storage device, an optical storage device, an electromagnetic storage device, a semiconductor storage device, or any suitable combination of the foregoing.
- Non-exhaustive list of computer readable storage media include: portable computer disks, hard disks, random access memory (RAM), read only memory (ROM), erasable programmable read only memory (EPROM) Or flash memory), static random access memory (SRAM), portable compact disk read only memory (CD-ROM), digital versatile disk (DVD), memory stick, floppy disk, mechanical encoding device, for example, with instructions stored thereon A raised structure in the hole card or groove, and any suitable combination of the above.
- a computer readable storage medium as used herein is not to be interpreted as a transient signal itself, such as a radio wave or other freely propagating electromagnetic wave, an electromagnetic wave propagating through a waveguide or other transmission medium (eg, a light pulse through a fiber optic cable), or through a wire The electrical signal transmitted.
- the computer readable program instructions described herein can be downloaded from a computer readable storage medium to various computing/processing devices or downloaded to an external computer or external storage device over a network, such as the Internet, a local area network, a wide area network, and/or a wireless network.
- the network may include copper transmission cables, fiber optic transmissions, wireless transmissions, routers, firewalls, switches, gateway computers, and/or edge servers.
- a network adapter card or network interface in each computing/processing device receives computer readable program instructions from the network and forwards the computer readable program instructions for storage in a computer readable storage medium in each computing/processing device .
- the computer program instructions for performing the operations of the embodiments of the present disclosure may be assembly instructions, instruction set architecture (ISA) instructions, machine instructions, machine related instructions, microcode, firmware instructions, state setting data, or programmed in one or more Source code or object code written in any combination of languages, including object oriented programming languages such as Smalltalk, C++, etc., as well as conventional procedural programming languages such as the "C" language or similar programming languages.
- the computer readable program instructions can execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer, partly on the remote computer, or entirely on the remote computer or server. carried out.
- the remote computer can be connected to the user's computer through any kind of network, including a local area network (LAN) or wide area network (WAN), or can be connected to an external computer (eg, using an Internet service provider to access the Internet) connection).
- the customized electronic circuit such as a programmable logic circuit, a field programmable gate array (FPGA), or a programmable logic array (PLA), can be customized by utilizing state information of computer readable program instructions.
- Computer readable program instructions are executed to implement various aspects of the disclosed embodiments.
- the computer readable program instructions can be provided to a general purpose computer, a special purpose computer, or a processor of other programmable data processing apparatus to produce a machine such that when executed by a processor of a computer or other programmable data processing apparatus Means for implementing the functions/acts specified in one or more of the blocks of the flowcharts and/or block diagrams.
- the computer readable program instructions can also be stored in a computer readable storage medium that causes the computer, programmable data processing device, and/or other device to operate in a particular manner, such that the computer readable medium storing the instructions includes An article of manufacture that includes instructions for implementing various aspects of the functions/acts recited in one or more of the flowcharts.
- the computer readable program instructions can also be loaded onto a computer, other programmable data processing device, or other device to perform a series of operational steps on a computer, other programmable data processing device or other device to produce a computer-implemented process.
- instructions executed on a computer, other programmable data processing apparatus, or other device implement the functions/acts recited in one or more of the flowcharts and/or block diagrams.
- each block in the flowchart or block diagram can represent a module, a program segment, or a portion of an instruction that includes one or more components for implementing the specified logical functions.
- Executable instructions can also occur in a different order than those illustrated in the drawings. For example, two consecutive blocks may be executed substantially in parallel, and they may sometimes be executed in the reverse order, depending upon the functionality involved.
- each block of the block diagrams and/or flowcharts, and combinations of blocks in the block diagrams and/or flowcharts can be implemented in a dedicated hardware-based system that performs the specified function or function. Or it can be implemented by a combination of dedicated hardware and computer instructions.
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Health & Medical Sciences (AREA)
- Multimedia (AREA)
- General Health & Medical Sciences (AREA)
- Oral & Maxillofacial Surgery (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Human Computer Interaction (AREA)
- Evolutionary Computation (AREA)
- Software Systems (AREA)
- Artificial Intelligence (AREA)
- Computing Systems (AREA)
- General Engineering & Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Medical Informatics (AREA)
- Databases & Information Systems (AREA)
- Life Sciences & Earth Sciences (AREA)
- Computer Security & Cryptography (AREA)
- Biophysics (AREA)
- Molecular Biology (AREA)
- Biomedical Technology (AREA)
- Mathematical Physics (AREA)
- Computational Linguistics (AREA)
- Computer Hardware Design (AREA)
- Evolutionary Biology (AREA)
- Bioinformatics & Computational Biology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Image Analysis (AREA)
- Image Processing (AREA)
- Collating Specific Patterns (AREA)
- Measurement Of The Respiration, Hearing Ability, Form, And Blood Characteristics Of Living Organisms (AREA)
Abstract
Description
Claims (36)
- 一种活体检测方法,包括:获取第一传感器感测到的目标对象的深度信息和第二传感器感测到的目标图像;对所述目标图像进行关键点检测,得到所述目标对象的关键点信息;基于所述目标对象的深度信息和所述目标对象的关键点信息,得到所述目标对象的活体检测结果。
- 根据权利要求1所述的方法,所述目标对象为人脸。
- 根据权利要求1或2所述的方法,所述第二传感器为RGB传感器或者近红外传感器。
- 根据权利要求1至3中任意一项所述的方法,所述第一传感器为飞行时间TOF传感器或结构光传感器。
- 根据权利要求1至4中任意一项所述的方法,在对所述目标图像进行关键点检测之前,所述方法还包括:根据所述第一传感器的参数以及所述第二传感器的参数,对齐所述目标对象的深度信息和所述目标图像。
- 根据权利要求1至5中任意一项所述的方法,基于所述目标对象的深度信息和所述目标对象的关键点信息,得到所述目标对象的活体检测结果,包括:基于所述目标对象的深度信息和所述目标对象的关键点信息,得到第一特征信息;基于所述目标对象的关键点信息,得到第二特征信息;基于所述第一特征信息和所述第二特征信息,确定所述目标对象的活体检测结果。
- 根据权利要求6所述的方法,基于所述目标对象的深度信息和所述目标对象的关键点信息,得到第一特征信息,包括:将所述目标对象的深度信息和所述目标对象的关键点信息输入第一神经网络进行处理,得到第一特征信息;基于所述目标对象的关键点信息,得到第二特征信息,包括:将所述目标图像和所述目标对象的关键点信息输入第二神经网络进行处理,得到第二特征信息。
- 根据权利要求6或7所述的方法,基于所述目标对象的深度信息和所述目标对象的关键点信息,得到第一特征信息,包括:对所述目标对象的深度信息和所述目标对象的关键点信息进行卷积处理,得到第一卷积结果;对所述第一卷积结果进行下采样处理,得到第一下采样结果;基于所述第一下采样结果,得到第一特征信息。
- 根据权利要求6至8中任意一项所述的方法,基于所述目标对象的关键点信息,得到第二特征信息,包括:对所述目标图像和所述目标对象的关键点信息进行卷积处理, 得到第二卷积结果;对所述第二卷积结果进行下采样处理,得到第二下采样结果;基于所述第二下采样结果,得到第二特征信息。
- 根据权利要求6至9中任意一项所述的方法,基于所述第一特征信息和所述第二特征信息,确定所述目标对象的活体检测结果,包括:对所述第一特征信息和所述第二特征信息进行融合处理,得到第三特征信息;根据所述第三特征信息,确定所述目标对象的活体检测结果。
- 根据权利要求10所述的方法,根据所述第三特征信息,确定活体检测结果,包括:基于所述第三特征信息,得到所述目标对象为活体的概率;根据所述目标对象为活体的概率,确定所述目标对象的活体检测结果。
- 一种活体检测装置,包括:获取模块,配置为获取第一传感器感测到的目标对象的深度信息和第二传感器感测到的目标图像;检测模块,配置为对所述目标图像进行关键点检测,得到所述目标对象的关键点信息;确定模块,配置为基于所述目标对象的深度信息和所述目标对象的关键点信息,得到所述目标对象的活体检测结果。
- 根据权利要求12所述的装置,所述目标对象为人脸。
- 根据权利要求12或13所述的装置,所述第二传感器为RGB传感器或者近红外传感器。
- 根据权利要求12至14中任意一项所述的装置,所述第一传感器为飞行时间TOF传感器或结构光传感器。
- 根据权利要求12至15中任意一项所述的装置,所述装置还包括:对齐模块,配置为根据所述第一传感器的参数以及所述第二传感器的参数,对齐所述目标对象的深度信息和所述目标图像。
- 根据权利要求12至16中任意一项所述的装置,所述确定模块包括:第一确定子模块,配置为基于所述目标对象的深度信息和所述目标对象的关键点信息,得到第一特征信息;第二确定子模块,配置为基于所述目标对象的关键点信息,得到第二特征信息;第三确定子模块,配置为基于所述第一特征信息和所述第二特征信息,确定所述目标对象的活体检测结果。
- 根据权利要求17所述的装置,所述第一确定子模块配置为:将所述目标对象的深度信息和所述目标对象的关键点信息输入第一神经网络进行处理,得到第一特征信息;所述第二确定子模块配置为:将所述目标图像和所述目标对象的关键点信息输入第二神经网络进行处理,得到第二特征信息。
- 根据权利要求17或18所述的装置,所述第一确定子模块包括:第一卷积单元,配置为对所述目标对象的深度信息和所述目标对象的关键点信息进行卷积处理,得到第一卷积结果;第一下采样单元,配置为对所述第一卷积结果进行下采样处理,得到第一下采样结果;第一确定单元,配置为基于所述第一下采样结果,得到第一特征信息。
- 根据权利要求17至19中任意一项所述的装置,所述第二确定子模块包括:第二卷积单元,配置为对所述目标图像和所述目标对象的关键点信息进行卷积处理,得到第二卷积结果;第二下采样单元,配置为对所述第二卷积结果进行下采样处理,得到第二下采样结果;第二确定单元,配置为基于所述第二下采样结果,得到第二特征信息。
- 根据权利要求17至20中任意一项所述的装置,所述第三确定子模块包括:全连接单元,配置为对所述第一特征信息和所述第二特征信息进行融合处理,得到第三特征信息;第三确定单元,配置为根据所述第三特征信息,确定所述目标对象的活体检测结果。
- 根据权利要求21所述的装置,所述第三确定单元包括:第一确定子单元,配置为基于所述第三特征信息,得到所述目标对象为活体的概率;第二确定子单元,配置为根据所述目标对象为活体的概率,确定所述目标对象的活体检测结果。
- 一种活体检测装置,包括:存储器,配置为存储计算机可读指令;处理器,配置为执行所述存储器中存储的计算机可读指令,其中,对所述计算机可读指令的执行使得所述处理器执行权利要求1至11中任意一项所述的方法。
- 一种非易失性计算机可读存储介质,其上存储有计算机程序指令,所述计算机程序指令被处理器执行时实现权利要求1至11中任意一项所述的方法。
- 一种活体检测系统,包括:权利要求23所述的活体检测装置、所述第一传感器和所述第二传感器。
- 一种活体检测系统,包括:权利要求24所述的非易失性计算机可读存储介质、所述第一传感器和所述第二传感器。
- 一种电子设备,包括:第一传感器,配置为检测目标对象的深度信息;第二传感器,配置为采集包括所述目标对象的目标图像;处理器,配置为对所述第二传感器采集到的目标对象进行关键点检测,得到所述目标对象的关键点信息,并基于所述第一传感器检测到的所述目标对象的深度信息和所述目标对象的关键点信息,得到所述目标对象的活体检测结果。
- 根据权利要求27所述的电子设备,所述第二传感器为RGB传感器或者近红外 传感器。
- 根据权利要求27或28所述的电子设备,所述第一传感器为飞行时间TOF传感器或结构光传感器。
- 根据权利要求27至29中任意一项所述的电子设备,所述处理器还配置为:根据所述第一传感器的参数以及所述第二传感器的参数,对齐所述目标对象的深度信息和所述目标图像。
- 根据权利要求27至30中任意一项所述的电子设备,所述处理器配置为:基于所述目标对象的深度信息和所述目标对象的关键点信息,得到第一特征信息;基于所述目标对象的关键点信息,得到第二特征信息;基于所述第一特征信息和所述第二特征信息,确定所述目标对象的活体检测结果。
- 根据权利要求31所述的电子设备,所述处理器配置为:将所述目标对象的深度信息和所述目标对象的关键点信息输入第一神经网络进行处理,得到第一特征信息;基于所述目标对象的关键点信息,得到第二特征信息,包括:将所述目标图像和所述目标对象的关键点信息输入第二神经网络进行处理,得到第二特征信息。
- 根据权利要求31或32所述的电子设备,所述处理器配置为:对所述目标对象的深度信息和所述目标对象的关键点信息进行卷积处理,得到第一卷积结果;对所述第一卷积结果进行下采样处理,得到第一下采样结果;基于所述第一下采样结果,得到第一特征信息。
- 根据权利要求31至33中任意一项所述的电子设备,所述处理器配置为:对所述目标图像和所述目标对象的关键点信息进行卷积处理,得到第二卷积结果;对所述第二卷积结果进行下采样处理,得到第二下采样结果;基于所述第二下采样结果,得到第二特征信息。
- 根据权利要求31至34中任意一项所述的电子设备,所述处理器配置为:对所述第一特征信息和所述第二特征信息进行融合处理,得到第三特征信息;根据所述第三特征信息,确定所述目标对象的活体检测结果。
- 根据权利要求35所述的电子设备,所述处理器配置为:基于所述第三特征信息,得到所述目标对象为活体的概率;根据所述目标对象为活体的概率,确定所述目标对象的活体检测结果。
Priority Applications (4)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP2019515661A JP6852150B2 (ja) | 2018-05-10 | 2018-11-14 | 生体検知方法および装置、システム、電子機器、記憶媒体 |
| EP18839510.7A EP3584745A4 (en) | 2018-05-10 | 2018-11-14 | METHOD AND SYSTEM FOR DETECTING A LIVING BODY, SYSTEM, ELECTRONIC DEVICE AND STORAGE MEDIUM |
| KR1020197019442A KR20190129826A (ko) | 2018-05-10 | 2018-11-14 | 생체 검측 방법 및 장치, 시스템, 전자 기기, 저장 매체 |
| US16/234,434 US10930010B2 (en) | 2018-05-10 | 2018-12-27 | Method and apparatus for detecting living body, system, electronic device, and storage medium |
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201810444105.4A CN108764069B (zh) | 2018-05-10 | 2018-05-10 | 活体检测方法及装置 |
| CN201810444105.4 | 2018-05-10 |
Related Child Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US16/234,434 Continuation US10930010B2 (en) | 2018-05-10 | 2018-12-27 | Method and apparatus for detecting living body, system, electronic device, and storage medium |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2019214201A1 true WO2019214201A1 (zh) | 2019-11-14 |
Family
ID=64010059
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/CN2018/115499 Ceased WO2019214201A1 (zh) | 2018-05-10 | 2018-11-14 | 活体检测方法及装置、系统、电子设备、存储介质 |
Country Status (6)
| Country | Link |
|---|---|
| US (1) | US10930010B2 (zh) |
| EP (1) | EP3584745A4 (zh) |
| JP (1) | JP6852150B2 (zh) |
| KR (1) | KR20190129826A (zh) |
| CN (1) | CN108764069B (zh) |
| WO (1) | WO2019214201A1 (zh) |
Cited By (7)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN112926367A (zh) * | 2019-12-06 | 2021-06-08 | 杭州海康威视数字技术股份有限公司 | 一种活体检测的设备及方法 |
| CN113052034A (zh) * | 2021-03-15 | 2021-06-29 | 上海商汤智能科技有限公司 | 基于双目摄像头的活体检测方法及相关装置 |
| CN113128429A (zh) * | 2021-04-24 | 2021-07-16 | 新疆爱华盈通信息技术有限公司 | 基于立体视觉的活体检测方法和相关设备 |
| CN113486829A (zh) * | 2021-07-15 | 2021-10-08 | 京东科技控股股份有限公司 | 人脸活体检测方法、装置、电子设备及存储介质 |
| CN113989813A (zh) * | 2021-10-29 | 2022-01-28 | 北京百度网讯科技有限公司 | 提取图像特征的方法和图像分类方法、装置、设备和介质 |
| CN114333078A (zh) * | 2021-12-01 | 2022-04-12 | 马上消费金融股份有限公司 | 活体检测方法、装置、电子设备及存储介质 |
| CN115436893A (zh) * | 2021-06-01 | 2022-12-06 | 富士通株式会社 | 基于无线雷达信号的关键点修正装置和方法 |
Families Citing this family (37)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN110008783A (zh) * | 2018-01-04 | 2019-07-12 | 杭州海康威视数字技术股份有限公司 | 基于神经网络模型的人脸活体检测方法、装置及电子设备 |
| CN108764069B (zh) * | 2018-05-10 | 2022-01-14 | 北京市商汤科技开发有限公司 | 活体检测方法及装置 |
| CN110852134A (zh) * | 2018-07-27 | 2020-02-28 | 北京市商汤科技开发有限公司 | 活体检测方法、装置及系统、电子设备和存储介质 |
| CN109784149B (zh) * | 2018-12-06 | 2021-08-20 | 苏州飞搜科技有限公司 | 一种人体骨骼关键点的检测方法及系统 |
| CN111368601B (zh) * | 2018-12-26 | 2021-11-16 | 北京市商汤科技开发有限公司 | 活体检测方法和装置、电子设备和计算机可读存储介质 |
| WO2020159437A1 (en) * | 2019-01-29 | 2020-08-06 | Agency For Science, Technology And Research | Method and system for face liveness detection |
| CN111507131B (zh) * | 2019-01-31 | 2023-09-19 | 北京市商汤科技开发有限公司 | 活体检测方法及装置、电子设备和存储介质 |
| CN111783501B (zh) * | 2019-04-03 | 2025-01-07 | 北京地平线机器人技术研发有限公司 | 活体检测方法和装置以及相应的电子设备 |
| US20200342291A1 (en) * | 2019-04-23 | 2020-10-29 | Apical Limited | Neural network processing |
| CN111860078B (zh) * | 2019-04-30 | 2024-05-14 | 北京眼神智能科技有限公司 | 人脸静默活体检测方法、装置、可读存储介质及设备 |
| CN110287900B (zh) * | 2019-06-27 | 2023-08-01 | 深圳市商汤科技有限公司 | 验证方法和验证装置 |
| KR102766550B1 (ko) * | 2019-11-21 | 2025-02-12 | 삼성전자주식회사 | 라이브니스 검사 방법 및 장치, 생체 인증 방법 및 장치 |
| CN110942032B (zh) * | 2019-11-27 | 2022-07-15 | 深圳市商汤科技有限公司 | 活体检测方法及装置、存储介质 |
| CN111881706B (zh) * | 2019-11-27 | 2021-09-03 | 马上消费金融股份有限公司 | 活体检测、图像分类和模型训练方法、装置、设备及介质 |
| CN111325114B (zh) * | 2020-02-03 | 2022-07-19 | 重庆特斯联智慧科技股份有限公司 | 一种人工智能识别分类的安检图像处理方法和装置 |
| CN113255400A (zh) * | 2020-02-10 | 2021-08-13 | 深圳市光鉴科技有限公司 | 活体人脸识别模型的训练、识别方法、系统、设备及介质 |
| FR3109688B1 (fr) * | 2020-04-24 | 2022-04-29 | Idemia Identity & Security France | Procédé d’authentification ou d’identification d’un individu |
| CN111582381B (zh) * | 2020-05-09 | 2024-03-26 | 北京市商汤科技开发有限公司 | 确定性能参数的方法及装置、电子设备和存储介质 |
| CN113761983B (zh) * | 2020-06-05 | 2023-08-22 | 杭州海康威视数字技术股份有限公司 | 更新人脸活体检测模型的方法、装置及图像采集设备 |
| EP3965071B1 (en) * | 2020-09-08 | 2025-01-15 | Samsung Electronics Co., Ltd. | Method and apparatus for pose identification |
| CN112052830B (zh) * | 2020-09-25 | 2022-12-20 | 北京百度网讯科技有限公司 | 人脸检测的方法、装置和计算机存储介质 |
| CN112395963B (zh) * | 2020-11-04 | 2021-11-12 | 北京嘀嘀无限科技发展有限公司 | 对象识别方法和装置、电子设备及存储介质 |
| US11532182B2 (en) * | 2021-03-04 | 2022-12-20 | Lenovo (Singapore) Pte. Ltd. | Authentication of RGB video based on infrared and depth sensing |
| CN112926489A (zh) * | 2021-03-17 | 2021-06-08 | 北京市商汤科技开发有限公司 | 活体检测方法、装置、设备、介质、系统及交通工具 |
| CN113191189A (zh) * | 2021-03-22 | 2021-07-30 | 深圳市百富智能新技术有限公司 | 人脸活体检测方法、终端设备及计算机可读存储介质 |
| CN113128428B (zh) * | 2021-04-24 | 2023-04-07 | 新疆爱华盈通信息技术有限公司 | 基于深度图预测的活体检测方法和相关设备 |
| CN113361349B (zh) * | 2021-05-25 | 2023-08-04 | 北京百度网讯科技有限公司 | 人脸活体检测方法、装置、电子设备和存储介质 |
| CN113239887B (zh) * | 2021-06-04 | 2024-10-01 | Oppo广东移动通信有限公司 | 活体检测方法及装置、计算机可读存储介质和电子设备 |
| CN113449623B (zh) * | 2021-06-21 | 2022-06-28 | 浙江康旭科技有限公司 | 一种基于深度学习的轻型活体检测方法 |
| CN113469036A (zh) * | 2021-06-30 | 2021-10-01 | 北京市商汤科技开发有限公司 | 活体检测方法及装置、电子设备和存储介质 |
| CN113505682B (zh) * | 2021-07-02 | 2024-07-02 | 杭州萤石软件有限公司 | 活体检测方法及装置 |
| US20240289695A1 (en) * | 2021-09-17 | 2024-08-29 | Kyocera Corporation | Trained model generation method, inference apparatus, and trained model generation apparatus |
| CN113869212B (zh) * | 2021-09-28 | 2024-06-21 | 平安科技(深圳)有限公司 | 多模态活体检测方法、装置、计算机设备及存储介质 |
| CN114999003B (zh) * | 2022-05-20 | 2026-01-23 | 深圳市联洲国际技术有限公司 | 目标活体的识别方法及其装置、计算机可读存储介质 |
| CN116091875B (zh) * | 2023-04-11 | 2023-08-29 | 合肥的卢深视科技有限公司 | 模型训练方法、活体检测方法、电子设备及存储介质 |
| CN118711258B (zh) * | 2024-08-29 | 2025-02-18 | 浙江大华技术股份有限公司 | 活体检测方法、设备和存储介质 |
| CN119229510B (zh) * | 2024-12-05 | 2025-05-16 | 山东科技大学 | 一种基于多流注意力交互的面部表情识别方法 |
Citations (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN105956518A (zh) * | 2016-04-21 | 2016-09-21 | 腾讯科技(深圳)有限公司 | 一种人脸识别方法、装置和系统 |
| CN107590430A (zh) * | 2017-07-26 | 2018-01-16 | 百度在线网络技术(北京)有限公司 | 活体检测方法、装置、设备及存储介质 |
| CN108764069A (zh) * | 2018-05-10 | 2018-11-06 | 北京市商汤科技开发有限公司 | 活体检测方法及装置 |
Family Cites Families (17)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| KR101700595B1 (ko) | 2010-01-05 | 2017-01-31 | 삼성전자주식회사 | 얼굴 인식 장치 및 그 방법 |
| JP2013156680A (ja) * | 2012-01-26 | 2013-08-15 | Kumamoto Univ | フェーストラッキング方法、フェーストラッカおよび車両 |
| KR101444538B1 (ko) * | 2012-10-26 | 2014-09-24 | 주식회사 에스원 | 3차원 얼굴 인식 시스템 및 그의 얼굴 인식 방법 |
| JP6214334B2 (ja) * | 2013-10-23 | 2017-10-18 | 日本放送協会 | 電子機器、判定方法及びプログラム |
| GB2532003A (en) * | 2014-10-31 | 2016-05-11 | Nokia Technologies Oy | Method for alignment of low-quality noisy depth map to the high-resolution colour image |
| KR20170000748A (ko) | 2015-06-24 | 2017-01-03 | 삼성전자주식회사 | 얼굴 인식 방법 및 장치 |
| CN105518711B (zh) * | 2015-06-29 | 2019-11-29 | 北京旷视科技有限公司 | 活体检测方法、活体检测系统以及计算机程序产品 |
| CN105956572A (zh) * | 2016-05-15 | 2016-09-21 | 北京工业大学 | 一种基于卷积神经网络的活体人脸检测方法 |
| CN107451510B (zh) * | 2016-05-30 | 2023-07-21 | 北京旷视科技有限公司 | 活体检测方法和活体检测系统 |
| US10282530B2 (en) * | 2016-10-03 | 2019-05-07 | Microsoft Technology Licensing, Llc | Verifying identity based on facial dynamics |
| EP3534328B1 (en) * | 2016-10-31 | 2024-10-02 | Nec Corporation | Image processing device, image processing method, facial recogntion system, program, and recording medium |
| CN107368778A (zh) * | 2017-06-02 | 2017-11-21 | 深圳奥比中光科技有限公司 | 人脸表情的捕捉方法、装置及存储装置 |
| CN112861760B (zh) * | 2017-07-25 | 2024-12-27 | 虹软科技股份有限公司 | 一种用于表情识别的方法和装置 |
| CN107506696A (zh) * | 2017-07-29 | 2017-12-22 | 广东欧珀移动通信有限公司 | 防伪处理方法及相关产品 |
| US10679443B2 (en) * | 2017-10-13 | 2020-06-09 | Alcatraz AI, Inc. | System and method for controlling access to a building with facial recognition |
| CN108876833A (zh) * | 2018-03-29 | 2018-11-23 | 北京旷视科技有限公司 | 图像处理方法、图像处理装置和计算机可读存储介质 |
| US10733762B2 (en) * | 2018-04-04 | 2020-08-04 | Motorola Mobility Llc | Dynamically calibrating a depth sensor |
-
2018
- 2018-05-10 CN CN201810444105.4A patent/CN108764069B/zh active Active
- 2018-11-14 EP EP18839510.7A patent/EP3584745A4/en not_active Ceased
- 2018-11-14 KR KR1020197019442A patent/KR20190129826A/ko not_active Ceased
- 2018-11-14 WO PCT/CN2018/115499 patent/WO2019214201A1/zh not_active Ceased
- 2018-11-14 JP JP2019515661A patent/JP6852150B2/ja active Active
- 2018-12-27 US US16/234,434 patent/US10930010B2/en active Active
Patent Citations (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN105956518A (zh) * | 2016-04-21 | 2016-09-21 | 腾讯科技(深圳)有限公司 | 一种人脸识别方法、装置和系统 |
| CN107590430A (zh) * | 2017-07-26 | 2018-01-16 | 百度在线网络技术(北京)有限公司 | 活体检测方法、装置、设备及存储介质 |
| CN108764069A (zh) * | 2018-05-10 | 2018-11-06 | 北京市商汤科技开发有限公司 | 活体检测方法及装置 |
Non-Patent Citations (1)
| Title |
|---|
| See also references of EP3584745A4 * |
Cited By (8)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN112926367A (zh) * | 2019-12-06 | 2021-06-08 | 杭州海康威视数字技术股份有限公司 | 一种活体检测的设备及方法 |
| CN113052034A (zh) * | 2021-03-15 | 2021-06-29 | 上海商汤智能科技有限公司 | 基于双目摄像头的活体检测方法及相关装置 |
| CN113128429A (zh) * | 2021-04-24 | 2021-07-16 | 新疆爱华盈通信息技术有限公司 | 基于立体视觉的活体检测方法和相关设备 |
| CN115436893A (zh) * | 2021-06-01 | 2022-12-06 | 富士通株式会社 | 基于无线雷达信号的关键点修正装置和方法 |
| CN113486829A (zh) * | 2021-07-15 | 2021-10-08 | 京东科技控股股份有限公司 | 人脸活体检测方法、装置、电子设备及存储介质 |
| CN113486829B (zh) * | 2021-07-15 | 2023-11-07 | 京东科技控股股份有限公司 | 人脸活体检测方法、装置、电子设备及存储介质 |
| CN113989813A (zh) * | 2021-10-29 | 2022-01-28 | 北京百度网讯科技有限公司 | 提取图像特征的方法和图像分类方法、装置、设备和介质 |
| CN114333078A (zh) * | 2021-12-01 | 2022-04-12 | 马上消费金融股份有限公司 | 活体检测方法、装置、电子设备及存储介质 |
Also Published As
| Publication number | Publication date |
|---|---|
| JP2020522764A (ja) | 2020-07-30 |
| EP3584745A1 (en) | 2019-12-25 |
| US20190347823A1 (en) | 2019-11-14 |
| CN108764069B (zh) | 2022-01-14 |
| JP6852150B2 (ja) | 2021-03-31 |
| CN108764069A (zh) | 2018-11-06 |
| EP3584745A4 (en) | 2019-12-25 |
| KR20190129826A (ko) | 2019-11-20 |
| US10930010B2 (en) | 2021-02-23 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| WO2019214201A1 (zh) | 活体检测方法及装置、系统、电子设备、存储介质 | |
| JP7026225B2 (ja) | 生体検出方法、装置及びシステム、電子機器並びに記憶媒体 | |
| CN110688951B (zh) | 图像处理方法及装置、电子设备和存储介质 | |
| JP7110412B2 (ja) | 生体検出方法及び装置、電子機器並びに記憶媒体 | |
| TWI717146B (zh) | 圖像處理方法及裝置、電子設備和儲存介質 | |
| CN110674719B (zh) | 目标对象匹配方法及装置、电子设备和存储介质 | |
| CN109934275B (zh) | 图像处理方法及装置、电子设备和存储介质 | |
| CN111340766A (zh) | 目标对象的检测方法、装置、设备和存储介质 | |
| CN110532956B (zh) | 图像处理方法及装置、电子设备和存储介质 | |
| CN115035596B (zh) | 行为检测的方法及装置、电子设备和存储介质 | |
| WO2020010927A1 (zh) | 图像处理方法及装置、电子设备和存储介质 | |
| CN110717399A (zh) | 人脸识别方法和电子终端设备 | |
| CN111435422B (zh) | 动作识别方法、控制方法及装置、电子设备和存储介质 | |
| CN112270288A (zh) | 活体识别、门禁设备控制方法和装置、电子设备 | |
| CN108197585A (zh) | 脸部识别方法和装置 | |
| CN111626086A (zh) | 活体检测方法、装置及系统、电子设备和存储介质 | |
| CN112597944B (zh) | 关键点检测方法及装置、电子设备和存储介质 | |
| CN109034150B (zh) | 图像处理方法及装置 | |
| CN112613447A (zh) | 关键点检测方法及装置、电子设备和存储介质 | |
| CN114627356A (zh) | 网络训练、图像处理方法及装置、电子设备和存储介质 | |
| CN107977636B (zh) | 人脸检测方法及装置、终端、存储介质 | |
| CN110781842A (zh) | 图像处理方法及装置、电子设备和存储介质 | |
| CN111507131A (zh) | 活体检测方法及装置、电子设备和存储介质 | |
| CN113807369A (zh) | 目标重识别方法及装置、电子设备和存储介质 | |
| HK40016326A (zh) | 活体检测方法及装置、电子设备和存储介质 |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| ENP | Entry into the national phase |
Ref document number: 2019515661 Country of ref document: JP Kind code of ref document: A |
|
| ENP | Entry into the national phase |
Ref document number: 2018839510 Country of ref document: EP Effective date: 20190204 |
|
| ENP | Entry into the national phase |
Ref document number: 20197019442 Country of ref document: KR Kind code of ref document: A |
|
| WWE | Wipo information: entry into national phase |
Ref document number: KR1020197019442 Country of ref document: KR |
|
| NENP | Non-entry into the national phase |
Ref country code: DE |