WO2015074157A1 - System and method for face recognition - Google Patents

System and method for face recognition Download PDF

Info

Publication number
WO2015074157A1
WO2015074157A1 PCT/CA2014/051117 CA2014051117W WO2015074157A1 WO 2015074157 A1 WO2015074157 A1 WO 2015074157A1 CA 2014051117 W CA2014051117 W CA 2014051117W WO 2015074157 A1 WO2015074157 A1 WO 2015074157A1
Authority
WO
WIPO (PCT)
Prior art keywords
descriptor
image region
response vector
face
image
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/CA2014/051117
Other languages
French (fr)
Inventor
Ehsan Fazl Ersi
John Konstantine TSOTSOS
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Priority to CA2931348A priority Critical patent/CA2931348A1/en
Priority to EP14864771.2A priority patent/EP3074928A4/en
Priority to US15/038,812 priority patent/US9940506B2/en
Publication of WO2015074157A1 publication Critical patent/WO2015074157A1/en
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • G06V40/16Human faces, e.g. facial parts, sketches or expressions
    • G06V40/168Feature extraction; Face representation
    • G06V40/171Local features and components; Facial parts ; Occluding parts, e.g. glasses; Geometrical relationships
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/40Extraction of image or video features
    • G06V10/50Extraction of image or video features by performing operations within image blocks; by using histograms, e.g. histogram of oriented gradients [HoG]; by summing image-intensity values; Projection analysis
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • G06V40/16Human faces, e.g. facial parts, sketches or expressions
    • G06V40/168Feature extraction; Face representation
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • G06V40/16Human faces, e.g. facial parts, sketches or expressions
    • G06V40/172Classification, e.g. identification

Definitions

  • SYSTEM AND METHOD FOR FACE RECOGNITION TECHNICAL FIELD [0001 ] The following is related generally to image recognition and more specifically face recognition in an image. BACKGROUND [0002]
  • face recognition A wide range of applications can benefit from face recognition. Among such applications identity verification, logon interfaces, video indexing, and speaker verification are of particular interest. Face retrieval and ranking are other interesting applications of face recognition technology, since they could be used to efficiently organize, categorize, tag and manage large sets of digital images, based on the people who are present in them.
  • a method of generating a texture-encoding descriptor for an image region comprising: (a) filtering the image region by applying a derivative of a base filter to the image region to generate a response vector for each pixel in the image region; (b) determining a proximity of each response vector to each of a set of base orientations and a magnitude of each response vector; (c) determining an association of each response vector to each of the base orientations based on the proximity, the magnitude and a significance of the location within the image region from which the response vector was generated; and (d) aggregating the associations of each response vector in the image region to each of the base orientations to generate the descriptor.
  • a texture-encoding descriptor generator for an image region, the descriptor generator comprising: (a) a filter configured to apply a derivative of a base filter to the image region to generate a response vector for each pixel in the image region; and (b) a processor and memory, the memory having instructions encoded thereon which when executed by the processor cause the processor to: (i) determine a proximity of each response vector to each of a set of base orientations and a magnitude of each response vector; (ii) determine an association of each response vector to each of the base orientations based on the proximity, the magnitude and a significance of the location within the image region from which the response vector was generated; and (iii) aggregate the associations of each response vector in the image region to each of the base orientations to generate the descriptor.
  • a method for classifying a probe face image comprising: (a) producing a representation for the face image from a plurality of texture-encoding descriptors extracted from one or more informative locations of the face image; (b) generating scores by comparing the representation of the face image against correspondingly produced representations of further face images stored in a gallery database, based on models learned using Probabilistic Linear Discriminant Analysis (PLDA) or Support Vector Machine (SVM) learning algorithms; and (c) identifying a match, if any, from the gallery database to the probe face by analyzing the scores computed by the learned models.
  • PLDA Probabilistic Linear Discriminant Analysis
  • SVM Support Vector Machine
  • a system for recognizing a probe face image comprising: (a) a descriptor generation module configured to produce a representation for the face image from a plurality of texture-encoding descriptors extracted from one or more informative locations of the face image; and (b) a similarity analyzing module configured to: (i) generate scores by comparing the representation of the face image against correspondingly produced representations of further face images stored in a gallery database, based on models learned using Probabilistic Linear Discriminant Analysis (PLDA) or Support Vector Machine (SVM) learning algorithms; and (ii) identify a match, if any, from the gallery database to the probe face by analyzing the scores computed by the learned models.
  • PLDA Probabilistic Linear Discriminant Analysis
  • SVM Support Vector Machine
  • Fig. 1 is a block diagram of an image processing system
  • Fig. 2 is a flowchart representation of an image processing process
  • Fig. 3 is a flowchart representation of descriptor generation process
  • Fig. 4 is a set of images depicting a first common individual
  • Fig. 5 is a set of images depicting a second common individual.
  • any module, unit, component, server, computer, terminal or device exemplified herein that executes instructions may include or otherwise have access to computer readable media such as storage media, computer storage media, or data storage devices (removable and/or non-removable) such as, for example, magnetic disks, optical disks, or tape.
  • Computer storage media may include volatile and non-volatile, removable and non-removable media implemented in any method or technology for storage of information, such as computer readable
  • any processor or controller set out herein may be implemented as a singular processor or as a plurality of processors.
  • the plurality of processors may be arrayed or distributed, and any processing function referred to herein may be carried out by one or by a plurality of processors, even though a single processor may be exemplified. Any method, application or module herein described may be implemented using computer readable/executable instructions that may be stored or otherwise held by such computer readable media and executed by the one or more processors.
  • image is used to indicate a digital representation of a scene which might include an individual's face.
  • the scene may additionally comprise an object, collection of objects, human physical traits other than the face, and other physical manifestations that may not necessarily be relevant to the individual's face.
  • face image is used to indicate a region of the image that bounds an individual's face, whether aligned to a canonical coordinate frame for upfront viewpoint, or unaligned, as it might appear in the image.
  • the following provides a new image descriptor particularly suitable for but not limited to enabling face recognition.
  • the descriptor has low dimensionality relative to many existing descriptors providing similar or lesser face recognition accuracy.
  • each face image can be represented by a plurality of the descriptors described herein, where the number of such descriptors is determined by the number of facial feature regions considered most relevant and, therefore, considered in the face recognition.
  • the particular feature regions to be selected can be determined using a feature selection technique of the prior art.
  • each response vector to each of a plurality of predetermined base angles, the magnitude of each response vector, and the significance of each response vector based on the location it was generated from, are then combined in a principled way to form the descriptor.
  • the descriptors for a particular probe face image can then be compared to those of the existing gallery face images, for the purpose of matching and/or recognition.
  • the dimensionality of the descriptor is determined by the number of base angles considered in the generation of the descriptor. In the following description, eight such base angles are used. These correspond to the angles
  • a system and method for generating a descriptor for an image region is provided.
  • the descriptor is operable to generate information about the face image to enable face recognition.
  • the descriptor provided herein has low dimensionality relative to many existing descriptors providing similar or lower face recognition accuracy.
  • a system and method for face recognition is provided.
  • Fig. 1 an exemplary system is shown in which the descriptor described herein can be generated and used in the context of face recognition. It will be appreciated that the descriptor can be generated and be useful in systems other than the system described herein. Examples of such systems are object detection (particularly, face and pedestrian/people detection), pattern recognition and classification, face gender recognition, face age classification, and facial expression classification.
  • An image processing module 100 is communicatively linked to an image database 102.
  • the image database 102 stores a plurality of images 104 comprising a training set 106.
  • the images 104 may further comprise a probe set 108.
  • the images 104 may further comprise a gallery set 120.
  • the probe set 108 comprises probe face images depicting face images for which face recognition is desired, while the gallery set 120 comprises face images for which the identity of depicted faces are known, and are to be used to infer the unknown identity of the face images in the probe set.
  • the training set 106 comprises face images for which the identity of depicted faces are known, and are to be used for feature selection and/or to learn face recognition models. There is preferably no overlap between the images and/or identities of the gallery set 120 and the training set 106.
  • Recognition can be understood as having linked a probe face to some index in the gallery set, be it a person's name, alphanumeric index for a gallery face, the specific descriptors for a gallery face, or a hash or transformation of any of the above.
  • the indices may be stored in the image database along with the images, or can be stored in the image files themselves, as further described herein. [0025] Two examples of images from the LFW database, provided by Huang et al. (2007, Labeled Faces in the Wild: A Database for Studying Face Recognition in
  • Figs. 4 and 5 Unconstrained Environments, University of Massachusetts, Amherst, Technical Report), that are appropriate for the gallery set or query set are shown in Figs. 4 and 5.
  • two individuals' faces are shown at slightly various viewpoints with various backgrounds, shading and ancillary objects (e.g. , hairstyle, head coverings/hats, jewellery, print on backgrounds, clothing, etc.).
  • the following is operable to generate descriptors for such faces sufficient to enable the grouping and recognition of common faces. That is, faces shown in images of Fig. 4 can be matched to other faces as shown in Fig. 4, and similarly for the faces shown in images of Fig. 5.
  • the image processing module 100 comprises, or is linked to, a feature selection module 1 10, descriptor generation module 1 12 and similarity analyzing module 1 14. In additional implementations, the image processing module 100 may further comprise or be linked to a preprocessing module 1 16, a machine module 1 18 or both. [0027]
  • the image processing module 100 implements an image processing process that can be applied for training and recognition. Training may comprise the identification of one or more regions of the training images that are most informative in terms of face recognition. In particular implementations, the training may further comprise the learning by the machine module to perform recognition (i.e., on the probe set). Recognition determines the identity of the face in a probe image based on an analysis of the informative regions of the image. [0028] Referring now to Fig.
  • the image processing process may comprise, in some implementations as will be described herein, the preprocessing module 1 16 performing preprocessing on an image at block 200.
  • the preprocessing module 1 16 performing preprocessing on an image at block 200.
  • color images may be converted to greyscale or the contrast or illumination level of the image may be normalized.
  • the image processing module 100 directs the feature selection module 1 10 to perform feature selection.
  • Feature selection enables the identification of one or more regions of the images that are most informative (i.e. , indicative of the face recognition), for subsequent purposes comprising the representations of the training, gallery and probe images.
  • the system and method described herein do not require any specific technique for performing feature selection.
  • the feature selection module 1 10 applies feature selection so that only the descriptors generated for the most informative image regions contribute to the face representation.
  • the face images may be represented through a relatively sparse set of N features F - [ft n ⁇ n 6 [l, iV] ⁇ , covering only the most significant regions of the face, which may help to achieve relatively greater robustness against partial occlusion and non-uniform lighting variations.
  • F - [ft n ⁇ n 6 [l, iV] ⁇ covering only the most significant regions of the face, which may help to achieve relatively greater robustness against partial occlusion and non-uniform lighting variations.
  • the feature selection module implements the feature selection technique iteratively and in a greedy fashion, preferably adding additional features until the addition does not significantly increase the estimated information content of the set of selected features.
  • the feature selection module identifies the features whose similarity kernels (computed from pairs of training face images) have the highest alignment to the target kernel (generated by identifying the commonality of classifications of pairs of the training face images).
  • each feature ft may be represented by a triplet (ft. x, ft. y, ft.
  • ft. x and ft. y describe the location of the feature region and ft. s describes the size of the feature region.
  • ft. x and ft. y correspond to the center of the region while ft. s corresponds to half the height and width of the region, such that feature region
  • an image classification process comprises directing the descriptor generation module 1 12 to generate a descriptor for each of the selected features ft of the image.
  • the dimensionality of the descriptor is preferably kept substantially small to optimize (minimize) processing load while remaining suitable to provide an appropriate accuracy level for face recognition.
  • the descriptor is generated for each feature as an aggregation of certain characteristics of each pixel within the feature region. For each such region, the descriptor encodes the textural properties of the local neighbourhood of pixels by aggregating the magnitude of their filtered responses towards various orientations. For this reason, a suitable nomenclature for the descriptor is "Texel", though other terms may of course be used. [0038] Consequently, the dimensionality of the descriptor is determined by the number of orientations considered in its computation. In a straightforward
  • the dimensionality of the descriptor is equal to the number of such orientations.
  • the number of such orientations is 8, which correspond to the angles 0°, 45°, 90°, 135°, 180°, 225°, 270° and 315°.
  • different angles and different numbers of angles could be selected.
  • Fig. 3 at block 300, for feature region (ft. x. ft. y. ft. s) bounded by top-left corner (ft. x - ft. s, ft. y - ft. s) and bottom-right corner (ft. x + ft. s. ft.
  • the region is filtered for texture enhancement and discrimination, using the derivatives of a base filter with respect to the vertical direction (represented by baseKernel.X) and the horizontal direction (represented by baseKernel. Y), producing vertical and horizontal response maps for the feature region comprising all pixels therein.
  • the base filter may apply the mean of Gabor filters tuned to similar frequency but various orientations.
  • the base filter may apply a pillbox filter.
  • the base filter may apply a Laplacian of Gaussian filter.
  • a response vector is generated from its horizontal and vertical filter responses.
  • the contribution of each pixel in the feature region towards the descriptor of the entire feature region is determined based on the proximity and the association of the pixel's response vector to a set of pre-specified base orientations.
  • the direction of the response vector for each pixel in the feature region ⁇ (i,j) ⁇ ⁇ £ (ft. y - ft. s, ft. y + ft. s),j £ (ft. x - ft. s, ft. x + ft.
  • angularProximityi j ⁇ (cos(0) x cos(a) + s (9) x sin(a)) p
  • is the direction of the given pixel and p is a positive constant selected to enhance the proximity values.
  • each pixel (i,j) is computed, as follows: pixelAssociatiorii j — [weights ⁇ - x angularProximity j x M
  • the matrix weights may be defined using a rotationally symmetric Gaussian low-pass filter of the same size as the feature region.
  • the associations of all pixels in the feature region to each base orientation are aggregated to produce the descriptor, as follows:
  • the extracted descriptors are normalized to sum to 1.
  • the dimensionality of the descriptor can be increased or decreased, as desired, by selecting greater or lesser base angles, respectively.
  • Each face image is represented by a collection of descriptors extracted from the regions specified by the set of selected informative features.
  • the similarity between each pair of face images is measured by the weighted sum of the individual similarities computed between their corresponding descriptors. The weight of each feature may be determined according to its importance (e.g. , features at key facial landmarks such as eyes may be given greater weight than features at, say, the forehead).
  • the representations and identities of a large set of training face images are used to learn within-class and between-class subspaces using the Linear Discriminant Analysis (LDA) or Probabilistic Linear Discriminant Analysis (PLDA). These learned subspaces are then used to compare the LDA and PLDA.
  • LDA Linear Discriminant Analysis
  • PLDA Probabilistic Linear Discriminant Analysis
  • the representations of a set of training positive and negative pairs of faces are used to learn a model for distinguishing between similar and different faces, using the Support Vector Machine (SVM) learning method.
  • SVM Support Vector Machine
  • the learned model is then used to compare the representation of a probe face against those of the gallery faces to recognize the identity of the probe face.
  • the image processing module is operable to perform the recognition process to match a query face with one of the faces represented in the gallery set.
  • the descriptor generation module can be used to generate descriptors for the informative regions determined during the training. In a particular implementation, these descriptors are provided to the machine module for recognition. As with the training images, the preprocessing module 1 16 may perform preprocessing on the gallery and probe face images. [0057] Alternatively, probe images can be compared for similarity and grouping to other images in the probe set. [0058] It will be appreciated that the low dimensionality of the descriptors and the possibility of their implementation through MAC (multiply-accumulate) operations, make them particularly useful for integration into embedded systems, where in most cases only integer variables and operations can be used with relatively small memory.
  • MAC multiply-accumulate
  • Hardware systems can leverage the image processing module as an embedded module that offers computer vision capabilities for a computer system.
  • This embedded module can be used in applications such as, but not limited to: smart sensors, smart cameras, smart phones, mobile tablets, compact vision systems, machine vision cameras, graphics processing units or as a software development kit that enables a hardware resource to leverage the image processing module's processes.
  • this embedded module could be used within the automotive market for face recognition in motor vehicles. Specifically, the embedded module can enhance security features such as keyless entry, driver monitoring applications and driver interaction applications.
  • this embedded module could be used in the robotics industry for humanoid robots that interact with humans. [0062] In another embodiment, this embedded module could be used in the security industry for network cameras, network video recorders, digital video recorders and intelligent appliances. [0063] In another embodiment, this embedded module could be used in consumer electronics such as set-top boxes, smart televisions, personal computers, appliances, smartphones, tablets and other mobile digital devices. [0064]
  • the image processing module can also be leveraged within software systems through the use of a software development kit, application programming interface or other software development libraries. Specifically, video analytic software can leverage the image processing module in order to track and recognize faces for the above-listed embodiments.

Landscapes

  • Engineering & Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Oral & Maxillofacial Surgery (AREA)
  • General Health & Medical Sciences (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Theoretical Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Image Analysis (AREA)
  • Collating Specific Patterns (AREA)

Abstract

A system and method for generating a descriptor for a face is provided. The descriptor is operable to generate information about a given region in a face image to enable face recognition. The descriptor provided herein is a low dimension relative to many existing descriptors providing similar face recognition accuracy. In another aspect, a system and method for face recognition is provided.

Description

SYSTEM AND METHOD FOR FACE RECOGNITION TECHNICAL FIELD [0001 ] The following is related generally to image recognition and more specifically face recognition in an image. BACKGROUND [0002] A wide range of applications can benefit from face recognition. Among such applications identity verification, logon interfaces, video indexing, and speaker verification are of particular interest. Face retrieval and ranking are other interesting applications of face recognition technology, since they could be used to efficiently organize, categorize, tag and manage large sets of digital images, based on the people who are present in them. SUMMARY [0003] In one aspect, a method of generating a texture-encoding descriptor for an image region is provided, the method comprising: (a) filtering the image region by applying a derivative of a base filter to the image region to generate a response vector for each pixel in the image region; (b) determining a proximity of each response vector to each of a set of base orientations and a magnitude of each response vector; (c) determining an association of each response vector to each of the base orientations based on the proximity, the magnitude and a significance of the location within the image region from which the response vector was generated; and (d) aggregating the associations of each response vector in the image region to each of the base orientations to generate the descriptor. [0004] In another aspect, a texture-encoding descriptor generator for an image region is provided, the descriptor generator comprising: (a) a filter configured to apply a derivative of a base filter to the image region to generate a response vector for each pixel in the image region; and (b) a processor and memory, the memory having instructions encoded thereon which when executed by the processor cause the processor to: (i) determine a proximity of each response vector to each of a set of base orientations and a magnitude of each response vector; (ii) determine an association of each response vector to each of the base orientations based on the proximity, the magnitude and a significance of the location within the image region from which the response vector was generated; and (iii) aggregate the associations of each response vector in the image region to each of the base orientations to generate the descriptor. [0005] In a further aspect, a method for classifying a probe face image is provided, the method comprising: (a) producing a representation for the face image from a plurality of texture-encoding descriptors extracted from one or more informative locations of the face image; (b) generating scores by comparing the representation of the face image against correspondingly produced representations of further face images stored in a gallery database, based on models learned using Probabilistic Linear Discriminant Analysis (PLDA) or Support Vector Machine (SVM) learning algorithms; and (c) identifying a match, if any, from the gallery database to the probe face by analyzing the scores computed by the learned models. [0006] In an additional aspect, a system for recognizing a probe face image, comprising: (a) a descriptor generation module configured to produce a representation for the face image from a plurality of texture-encoding descriptors extracted from one or more informative locations of the face image; and (b) a similarity analyzing module configured to: (i) generate scores by comparing the representation of the face image against correspondingly produced representations of further face images stored in a gallery database, based on models learned using Probabilistic Linear Discriminant Analysis (PLDA) or Support Vector Machine (SVM) learning algorithms; and (ii) identify a match, if any, from the gallery database to the probe face by analyzing the scores computed by the learned models. [0007] These and other aspects are contemplated and described herein. It will be appreciated that the foregoing summary sets out representative aspects of systems and method for face recognition to assist skilled readers in understanding the following detailed description. DESCRIPTION OF THE DRAWINGS [0008] A greater understanding of the embodiments will be had with reference to the Figures, in which: [0009] Fig. 1 is a block diagram of an image processing system; [0010] Fig. 2 is a flowchart representation of an image processing process; [001 1 ] Fig. 3 is a flowchart representation of descriptor generation process; [0012] Fig. 4 is a set of images depicting a first common individual; and [0013] Fig. 5 is a set of images depicting a second common individual. DETAILED DESCRIPTION [0014] It will be appreciated that for simplicity and clarity of illustration, where considered appropriate, reference numerals may be repeated among the Figures to indicate corresponding or analogous elements. In addition, numerous specific details are set forth in order to provide a thorough understanding of the embodiments described herein. However, it will be understood by those of ordinary skill in the art that the embodiments described herein may be practised without these specific details. In other instances, well-known methods, procedures and components have not been described in detail so as not to obscure the embodiments described herein. Also, the description is not to be considered as limiting the scope of the embodiments described herein. [0015] It will be appreciated that various terms used throughout the present description may be read and understood as follows, unless the context indicates otherwise: "or" as used throughout is inclusive, as though written "and/or"; singular articles and pronouns as used throughout include their plural forms, and vice versa; similarly, gendered pronouns include their counterpart pronouns so that pronouns should not be understood as limiting anything described herein to use, implementation, performance, etc. by a single gender. Further definitions for terms may be set out herein; these may apply to prior and subsequent instances of those terms, as will be understood from a reading of the present description. [0016] It will be appreciated that any module, unit, component, server, computer, terminal or device exemplified herein that executes instructions may include or otherwise have access to computer readable media such as storage media, computer storage media, or data storage devices (removable and/or non-removable) such as, for example, magnetic disks, optical disks, or tape. Computer storage media may include volatile and non-volatile, removable and non-removable media implemented in any method or technology for storage of information, such as computer readable
instructions, data structures, program modules, or other data. Examples of computer storage media include RAM, ROM, EEPROM, flash memory or other memory technology, CD-ROM, digital versatile disks (DVD) or other optical storage, magnetic cassettes, magnetic tape, magnetic disk storage or other magnetic storage devices, or any other medium which can be used to store the desired information and which can be accessed by an application, module, or both. Any such computer storage media may be part of the device or accessible or connectable thereto. Further, unless the context clearly indicates otherwise, any processor or controller set out herein may be implemented as a singular processor or as a plurality of processors. The plurality of processors may be arrayed or distributed, and any processing function referred to herein may be carried out by one or by a plurality of processors, even though a single processor may be exemplified. Any method, application or module herein described may be implemented using computer readable/executable instructions that may be stored or otherwise held by such computer readable media and executed by the one or more processors. [0017] In the following description, the term "image" is used to indicate a digital representation of a scene which might include an individual's face. The scene may additionally comprise an object, collection of objects, human physical traits other than the face, and other physical manifestations that may not necessarily be relevant to the individual's face. Furthermore, the term "face image" is used to indicate a region of the image that bounds an individual's face, whether aligned to a canonical coordinate frame for upfront viewpoint, or unaligned, as it might appear in the image. [0018] The following provides a new image descriptor particularly suitable for but not limited to enabling face recognition. The descriptor has low dimensionality relative to many existing descriptors providing similar or lesser face recognition accuracy. In accordance with the following, each face image can be represented by a plurality of the descriptors described herein, where the number of such descriptors is determined by the number of facial feature regions considered most relevant and, therefore, considered in the face recognition. The particular feature regions to be selected can be determined using a feature selection technique of the prior art. Exemplary techniques are provided by Vidal-Naquet and Ullman (2003, Object Recognition with Informative Features and Linear Classification, Proceedings of the International Conference on Computer Vision), and Fazl-Ersi and Tsotsos (2012, Histogram of Oriented Uniform Patters for Robust Place Recognition and Categorization, The International Journal of Robotics), as described further herein. [0019] The descriptor for any given feature region is generated based on response vectors generated for each pixel within the region, where the response vectors are computed from filtering the image region with the derivatives of a base filter. The proximity of each response vector to each of a plurality of predetermined base angles, the magnitude of each response vector, and the significance of each response vector based on the location it was generated from, are then combined in a principled way to form the descriptor. The descriptors for a particular probe face image can then be compared to those of the existing gallery face images, for the purpose of matching and/or recognition. [0020] The dimensionality of the descriptor is determined by the number of base angles considered in the generation of the descriptor. In the following description, eight such base angles are used. These correspond to the angles
{0°, 45°, 90°, 135°, 180°, 225°, 270°, 315°}. However, any number of base angles could be selected depending on the desired trade-off between dimensionality and accuracy of face recognition. Further, it is not strictly required that the selected angles are spaced evenly apart. [0021 ] In accordance with the foregoing, in one aspect, a system and method for generating a descriptor for an image region is provided. The descriptor is operable to generate information about the face image to enable face recognition. The descriptor provided herein has low dimensionality relative to many existing descriptors providing similar or lower face recognition accuracy. For example, it may be possible to represent a face image by less than 1000 8-dimensional descriptors described herein with comparable accuracy to 2900 128-dimensional SIFT descriptors. [0022] In another aspect, a system and method for face recognition is provided. [0023] Referring now to Fig. 1 , an exemplary system is shown in which the descriptor described herein can be generated and used in the context of face recognition. It will be appreciated that the descriptor can be generated and be useful in systems other than the system described herein. Examples of such systems are object detection (particularly, face and pedestrian/people detection), pattern recognition and classification, face gender recognition, face age classification, and facial expression classification. [0024] An image processing module 100 is communicatively linked to an image database 102. The image database 102 stores a plurality of images 104 comprising a training set 106. The images 104 may further comprise a probe set 108. The images 104 may further comprise a gallery set 120. The probe set 108 comprises probe face images depicting face images for which face recognition is desired, while the gallery set 120 comprises face images for which the identity of depicted faces are known, and are to be used to infer the unknown identity of the face images in the probe set. The training set 106 comprises face images for which the identity of depicted faces are known, and are to be used for feature selection and/or to learn face recognition models. There is preferably no overlap between the images and/or identities of the gallery set 120 and the training set 106. Recognition can be understood as having linked a probe face to some index in the gallery set, be it a person's name, alphanumeric index for a gallery face, the specific descriptors for a gallery face, or a hash or transformation of any of the above. The indices may be stored in the image database along with the images, or can be stored in the image files themselves, as further described herein. [0025] Two examples of images from the LFW database, provided by Huang et al. (2007, Labeled Faces in the Wild: A Database for Studying Face Recognition in
Unconstrained Environments, University of Massachusetts, Amherst, Technical Report), that are appropriate for the gallery set or query set are shown in Figs. 4 and 5. In the examples, two individuals' faces are shown at slightly various viewpoints with various backgrounds, shading and ancillary objects (e.g. , hairstyle, head coverings/hats, jewellery, print on backgrounds, clothing, etc.). The following is operable to generate descriptors for such faces sufficient to enable the grouping and recognition of common faces. That is, faces shown in images of Fig. 4 can be matched to other faces as shown in Fig. 4, and similarly for the faces shown in images of Fig. 5. [0026] The image processing module 100 comprises, or is linked to, a feature selection module 1 10, descriptor generation module 1 12 and similarity analyzing module 1 14. In additional implementations, the image processing module 100 may further comprise or be linked to a preprocessing module 1 16, a machine module 1 18 or both. [0027] The image processing module 100 implements an image processing process that can be applied for training and recognition. Training may comprise the identification of one or more regions of the training images that are most informative in terms of face recognition. In particular implementations, the training may further comprise the learning by the machine module to perform recognition (i.e., on the probe set). Recognition determines the identity of the face in a probe image based on an analysis of the informative regions of the image. [0028] Referring now to Fig. 2, the image processing process may comprise, in some implementations as will be described herein, the preprocessing module 1 16 performing preprocessing on an image at block 200. For example, in certain examples, color images may be converted to greyscale or the contrast or illumination level of the image may be normalized. [0029] At block 202, the image processing module 100 directs the feature selection module 1 10 to perform feature selection. Feature selection enables the identification of one or more regions of the images that are most informative (i.e. , indicative of the face recognition), for subsequent purposes comprising the representations of the training, gallery and probe images. [0030] The system and method described herein do not require any specific technique for performing feature selection. Preferably, however, unlike prior techniques which use trial-and-error heuristics to determine arbitrary constants for implementation parameters (e.g. , the size and location of the features), the feature selection module 1 10 applies feature selection so that only the descriptors generated for the most informative image regions contribute to the face representation. [0031 ] Unlike many prior art approaches which apply fixed subdivision schemes to subdivide an entire face image into non-overlapping regions, and represent the face by combining the descriptors extracted from the resulting regions, in the following, the face images may be represented through a relatively sparse set of N features F - [ftn\n 6 [l, iV]}, covering only the most significant regions of the face, which may help to achieve relatively greater robustness against partial occlusion and non-uniform lighting variations. [0032] It has been found that the feature selection technique described in Vidal- Naquet and Ullman, incorporated herein by reference, is suitable. The feature selection module implements the feature selection technique iteratively and in a greedy fashion, preferably adding additional features until the addition does not significantly increase the estimated information content of the set of selected features. [0033] It has also been found that the feature selection technique in Fazl-Ersi and Tsotsos, incorporated herein by reference, is suitable. The feature selection module identifies the features whose similarity kernels (computed from pairs of training face images) have the highest alignment to the target kernel (generated by identifying the commonality of classifications of pairs of the training face images). [0034] Applying this technique, each feature ft may be represented by a triplet (ft. x, ft. y, ft. s) that specifies a region in the image, where ft. x and ft. y describe the location of the feature region and ft. s describes the size of the feature region. In a specific example, ft. x and ft. y correspond to the center of the region while ft. s corresponds to half the height and width of the region, such that feature region
(ft. x, ft. y, ft. s) is bounded by top-left corner (ft. x - ft. s, ft. y - ft. s) and bottom- right corner (ft. x + ft. s, ft. y + ft. s). [0035] Alternatively, ft may represent a circular region, where ft. x and ft. y describe the center of the feature region and ft. s describes the radius of the feature region. [0036] In block 204, an image classification process comprises directing the descriptor generation module 1 12 to generate a descriptor for each of the selected features ft of the image. The dimensionality of the descriptor is preferably kept substantially small to optimize (minimize) processing load while remaining suitable to provide an appropriate accuracy level for face recognition. [0037] The descriptor is generated for each feature as an aggregation of certain characteristics of each pixel within the feature region. For each such region, the descriptor encodes the textural properties of the local neighbourhood of pixels by aggregating the magnitude of their filtered responses towards various orientations. For this reason, a suitable nomenclature for the descriptor is "Texel", though other terms may of course be used. [0038] Consequently, the dimensionality of the descriptor is determined by the number of orientations considered in its computation. In a straightforward
implementation, the dimensionality of the descriptor is equal to the number of such orientations. [0039] In an example implementation, the number of such orientations is 8, which correspond to the angles 0°, 45°, 90°, 135°, 180°, 225°, 270° and 315°. However, different angles and different numbers of angles could be selected. [0040] Referring now to Fig. 3, at block 300, for feature region (ft. x. ft. y. ft. s) bounded by top-left corner (ft. x - ft. s, ft. y - ft. s) and bottom-right corner (ft. x + ft. s. ft. y + ft. s), the region is filtered for texture enhancement and discrimination, using the derivatives of a base filter with respect to the vertical direction (represented by baseKernel.X) and the horizontal direction (represented by baseKernel. Y), producing vertical and horizontal response maps for the feature region comprising all pixels therein. [0041 ] In an example embodiment, the base filter may apply the mean of Gabor filters tuned to similar frequency but various orientations. [0042] In another embodiment, the base filter may apply a pillbox filter. [0043] In yet another embodiment, the base filter may apply a Laplacian of Gaussian filter. [0044] For each pixel in the feature region, a response vector is generated from its horizontal and vertical filter responses. [0045] At blocks 302 and 304, the contribution of each pixel in the feature region towards the descriptor of the entire feature region is determined based on the proximity and the association of the pixel's response vector to a set of pre-specified base orientations. [0046] At block 302, the direction of the response vector for each pixel in the feature region {(i,j) \ ί £ (ft. y - ft. s, ft. y + ft. s),j £ (ft. x - ft. s, ft. x + ft. s)} is evaluated in respect of its proximity to each of the base orientations (the angles previously selected), as follows: angularProximityi j = {(cos(0) x cos(a) + s (9) x sin(a))p |Va £ angles} where Θ is the direction of the given pixel and p is a positive constant selected to enhance the proximity values. [0047] At block 304, the association of each pixel (i,j) to each element (each of the angles) of the descriptor is computed, as follows: pixelAssociatiorii j— [weights^- x angularProximity j x M |vn £ angles] where M is the magnitude of the response vector of the given pixel and weights indicates the significance of each pixel in the computation of the description vector. [0048] In an example embodiment, the matrix weights may be defined using a rotationally symmetric Gaussian low-pass filter of the same size as the feature region. [0049] At block 306, the associations of all pixels in the feature region to each base orientation are aggregated to produce the descriptor, as follows:
I Vn £ angles >
Figure imgf000013_0001
[0050] In a particular embodiment, the extracted descriptors are normalized to sum to 1. [0051 ] As will be appreciated, the dimensionality of the descriptor can be increased or decreased, as desired, by selecting greater or lesser base angles, respectively. [0052] Each face image is represented by a collection of descriptors extracted from the regions specified by the set of selected informative features. [0053] In a particular embodiment, the similarity between each pair of face images is measured by the weighted sum of the individual similarities computed between their corresponding descriptors. The weight of each feature may be determined according to its importance (e.g. , features at key facial landmarks such as eyes may be given greater weight than features at, say, the forehead). [0054] In another particular embodiment, the representations and identities of a large set of training face images are used to learn within-class and between-class subspaces using the Linear Discriminant Analysis (LDA) or Probabilistic Linear Discriminant Analysis (PLDA). These learned subspaces are then used to compare the
representation of a probe face with those of the gallery faces to recognize the identity of the probe face. [0055] In another particular embodiment, the representations of a set of training positive and negative pairs of faces (where a pair of faces is positive when the two faces belong to the same individual, and is negative, when the two faces belong to different individuals) are used to learn a model for distinguishing between similar and different faces, using the Support Vector Machine (SVM) learning method. The learned model is then used to compare the representation of a probe face against those of the gallery faces to recognize the identity of the probe face. [0056] The image processing module is operable to perform the recognition process to match a query face with one of the faces represented in the gallery set. For any particular probe face image, the descriptor generation module can be used to generate descriptors for the informative regions determined during the training. In a particular implementation, these descriptors are provided to the machine module for recognition. As with the training images, the preprocessing module 1 16 may perform preprocessing on the gallery and probe face images. [0057] Alternatively, probe images can be compared for similarity and grouping to other images in the probe set. [0058] It will be appreciated that the low dimensionality of the descriptors and the possibility of their implementation through MAC (multiply-accumulate) operations, make them particularly useful for integration into embedded systems, where in most cases only integer variables and operations can be used with relatively small memory. [0059] It will further be appreciated that the invention can be used within hardware systems, software systems or both in combination. Hardware systems can leverage the image processing module as an embedded module that offers computer vision capabilities for a computer system. This embedded module can be used in applications such as, but not limited to: smart sensors, smart cameras, smart phones, mobile tablets, compact vision systems, machine vision cameras, graphics processing units or as a software development kit that enables a hardware resource to leverage the image processing module's processes. [0060] In one embodiment, this embedded module could be used within the automotive market for face recognition in motor vehicles. Specifically, the embedded module can enhance security features such as keyless entry, driver monitoring applications and driver interaction applications. [0061 ] In another embodiment, this embedded module could be used in the robotics industry for humanoid robots that interact with humans. [0062] In another embodiment, this embedded module could be used in the security industry for network cameras, network video recorders, digital video recorders and intelligent appliances. [0063] In another embodiment, this embedded module could be used in consumer electronics such as set-top boxes, smart televisions, personal computers, appliances, smartphones, tablets and other mobile digital devices. [0064] The image processing module can also be leveraged within software systems through the use of a software development kit, application programming interface or other software development libraries. Specifically, video analytic software can leverage the image processing module in order to track and recognize faces for the above-listed embodiments. [0065] It will be appreciated that any of the foregoing examples may be applied to video images or still images. [0066] Although the foregoing has been described with reference to certain specific embodiments, various modifications thereto will be apparent to those skilled in the art without departing from the spirit and scope of the invention as outlined in the appended claims. The entire disclosures of all references recited above are incorporated herein by reference.

Claims

1 . A method of generating a texture-encoding descriptor for an image region
comprising: a) filtering the image region by applying a derivative of a base filter to the image region to generate a response vector for each pixel in the image region; b) determining a proximity of each response vector to each of a set of base
orientations and a magnitude of each response vector; c) determining an association of each response vector to each of the base
orientations based on the proximity, the magnitude and a significance of the location within the image region from which the response vector was generated; and d) aggregating the associations of each response vector in the image region to each of the base orientations to generate the descriptor.
2. The method of claim 1 , wherein the descriptor has a dimensionality corresponding to the number of base orientations.
3. The method of claim 2, wherein the number of base orientations is eight.
4. The method of claim 3, wherein the base orientations are evenly spaced apart at 45° increments.
5. The method of claim 1 , further comprising preprocessing the image region prior to the filtering.
6. The method of claim 5, wherein the preprocessing comprises at least one of
conversion to greyscale, contrast normalization and illumination normalization.
7. The method of claim 1 , wherein the descriptor encodes textural property of the
image region.
8. The method of claim 1 , wherein the base filter comprises any one of: a Gabor filter, a pillbox filter and a Laplacian of a Gaussian filter.
9. The method of claim 1 , wherein the significance is determined using a rotationally symmetric Gaussian low-pass filter of the same size as the image region.
10. A texture-encoding descriptor generator for an image region, the descriptor
generator comprising: a) a filter configured to apply a derivative of a base filter to the image region to generate a response vector for each pixel in the image region; and b) a processor and memory, the memory having instructions encoded thereon which when executed by the processor cause the processor to: i. determine a proximity of each response vector to each of a set of base orientations and a magnitude of each response vector; ii. determine an association of each response vector to each of the base orientations based on the proximity, the magnitude and a significance of the location within the image region from which the response vector was generated; and iii. aggregate the associations of each response vector in the image
region to each of the base orientations to generate the descriptor.
1 1. The descriptor generator of claim 10, wherein the descriptor has a dimensionality corresponding to the number of base orientations.
12. The descriptor generator of claim 1 1 , wherein the number of base orientations is eight.
13. The descriptor generator of claim 12, wherein the base orientations are evenly
spaced apart at 45° increments.
14. The descriptor generator of claim 10, further comprising or linked to a preprocessor for preprocessing the image region prior to the filtering.
15. The descriptor generator of claim 14, wherein the preprocessing comprises at least one of conversion to greyscale, contrast normalization and illumination
normalization.
16. The descriptor generator of claim 10, wherein the descriptor encodes textural
property of the image region.
17. The descriptor generator of claim 10, wherein the base filter comprises any one of: a Gabor filter, a pillbox filter and a Laplacian of a Gaussian filter.
18. The descriptor generator of claim 10, wherein the significance is determined using a rotationally symmetric Gaussian low-pass filter of the same size as the image region.
19. A method for classifying a probe face image, comprising: a) producing a representation for the face image from a plurality of texture- encoding descriptors extracted from one or more informative locations of the face image; b) generating scores by comparing the representation of the face image against correspondingly produced representations of further face images stored in a gallery database, based on models learned using Probabilistic Linear Discriminant Analysis (PLDA) or Support Vector Machine (SVM) learning algorithms; and c) identifying a match, if any, from the gallery database to the probe face by analyzing the scores computed by the learned models.
20. The method of claim 19, wherein the texture-encoding descriptor is generated by: a) filtering the image region by applying a derivative of a base filter to the image region to generate a response vector for each pixel in the image region; b) determining a proximity of each response vector to each of a set of base orientations and a magnitude of each response vector; c) determining an association of each response vector to each of the base
orientations based on the proximity, the magnitude and a significance of the location within the image region from which the response vector was generated; and d) aggregating the associations of each response vector in the image region to each of the base orientations to generate the descriptor.
21. The method of claim 19, wherein the number of descriptors is equal to the number of informative locations.
22. The method of claim 19, wherein the informative locations are determined by a
feature selection process.
23. The method of claim 22, wherein the feature selection process determines the
informative locations during a training process using a training set of face images.
24. The method of claim 23, wherein the feature selection process identifies features whose similarity kernels are most aligned to a target kernel.
25. The method of claim 24, wherein the similarity kernels are computed from pairs of training images.
26. The method of claim 24, wherein the target kernel is generated by identifying
commonality of classifications of pairs of the training images.
27. The method of claim 19, wherein the representation of the face image is generated by weighting the descriptors for each informative location, the weighting being determined according to importance.
28. The method of claim 19, wherein comparison is performed in pairwise manner and the identifying of a match comprises determining the similarity between pairs of face images.
29. The method of claim 28, wherein the classification results in image recognition
provided the one of the pair of matched face images is associated with a known individual.
30. A system for recognizing a probe face image, comprising: a) a descriptor generation module configured to produce a representation for the face image from a plurality of texture-encoding descriptors extracted from one or more informative locations of the face image; and b) a similarity analyzing module configured to: i. generate scores by comparing the representation of the face image against correspondingly produced representations of further face images stored in a gallery database, based on models learned using Probabilistic Linear Discriminant Analysis (PLDA) or Support Vector Machine (SVM) learning algorithms; and ii. identify a match, if any, from the gallery database to the probe face by analyzing the scores computed by the learned models.
31. The system of claim 30, wherein the descriptor generation module generates the texture-encoding descriptor by: a) filtering the image region by applying a derivative of a base filter to the image region to generate a response vector for each pixel in the image region; b) determining a proximity of each response vector to each of a set of base orientations and a magnitude of each response vector; c) determining an association of each response vector to each of the base orientations based on the proximity, the magnitude and a significance of the location within the image region from which the response vector was generated; and d) aggregating the associations of each response vector in the image region to each of the base orientations to generate the descriptor.
32. The system of claim 30, wherein the number of descriptors is equal to the number of informative locations.
33. The system of claim 30, wherein the informative locations are determined by a
feature selection process.
34. The system of claim 33, wherein the feature selection process determines the
informative locations during a training process using a training set of face images.
35. The system of claim 34, wherein the feature selection process identifies features whose similarity kernels are most aligned to a target kernel.
36. The system of claim 35, wherein the similarity kernels are computed from pairs of training images.
37. The system of claim 35, wherein the target kernel is generated by identifying
commonality of classifications of pairs of the training images.
38. The system of claim 30, wherein the representation of the face image is generated by weighting the descriptors for each informative location, the weighting being determined according to importance.
39. The system of claim 30, wherein comparison is performed in pairwise manner and the identifying of a match comprises determining the similarity between pairs of face images.
40. The system of claim 39, wherein the classification results in image recognition provided the one of the pair of matched face images is associated with a known individual.
PCT/CA2014/051117 2013-11-25 2014-11-24 System and method for face recognition Ceased WO2015074157A1 (en)

Priority Applications (3)

Application Number Priority Date Filing Date Title
CA2931348A CA2931348A1 (en) 2013-11-25 2014-11-24 System and method for face recognition
EP14864771.2A EP3074928A4 (en) 2013-11-25 2014-11-24 System and method for face recognition
US15/038,812 US9940506B2 (en) 2013-11-25 2014-11-24 System and method for face recognition

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US201361908212P 2013-11-25 2013-11-25
US61/908,212 2013-11-25

Publications (1)

Publication Number Publication Date
WO2015074157A1 true WO2015074157A1 (en) 2015-05-28

Family

ID=53178761

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CA2014/051117 Ceased WO2015074157A1 (en) 2013-11-25 2014-11-24 System and method for face recognition

Country Status (4)

Country Link
US (1) US9940506B2 (en)
EP (1) EP3074928A4 (en)
CA (1) CA2931348A1 (en)
WO (1) WO2015074157A1 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107330412A (en) * 2017-07-06 2017-11-07 湖北科技学院 A kind of face age estimation method based on depth rarefaction representation
CN107657239A (en) * 2017-09-30 2018-02-02 清华大学深圳研究生院 Palmprint image gender classification method and device, computer installation and readable storage medium storing program for executing

Families Citing this family (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10902243B2 (en) * 2016-10-25 2021-01-26 Deep North, Inc. Vision based target tracking that distinguishes facial feature targets
CN106778522B (en) * 2016-11-25 2020-08-04 江南大学 Single-sample face recognition method based on Gabor feature extraction and spatial transformation
US10621416B2 (en) * 2017-10-02 2020-04-14 Microsoft Technology Licensing, Llc Image processing for person recognition
CN108021863B (en) * 2017-11-01 2022-05-06 平安科技(深圳)有限公司 Electronic device, age classification method based on image and storage medium
US11538257B2 (en) * 2017-12-08 2022-12-27 Gatekeeper Inc. Detection, counting and identification of occupants in vehicles
CN109934198B (en) * 2019-03-22 2021-05-14 北京市商汤科技开发有限公司 Face recognition method and device
CN110222663A (en) * 2019-06-13 2019-09-10 红鼎互联(广州)信息科技有限公司 A kind of identity verification method and device based on recognition of face
US10867193B1 (en) 2019-07-10 2020-12-15 Gatekeeper Security, Inc. Imaging systems for facial detection, license plate reading, vehicle overview and vehicle make, model, and color detection
CN110705475B (en) * 2019-09-30 2022-05-17 北京地平线机器人技术研发有限公司 Method, apparatus, medium, and device for target object recognition
US11196965B2 (en) 2019-10-25 2021-12-07 Gatekeeper Security, Inc. Image artifact mitigation in scanners for entry control systems
CN114677607A (en) * 2020-12-10 2022-06-28 富泰华工业(深圳)有限公司 Real-time pedestrian counting method and device based on face recognition

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2064652B1 (en) * 2006-08-16 2010-10-06 Imperial Innovations Limited Method of image processing
WO2011069023A2 (en) * 2009-12-02 2011-06-09 Qualcomm Incorporated Fast subspace projection of descriptor patches for image recognition

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100303354A1 (en) * 2009-06-01 2010-12-02 Qualcomm Incorporated Efficient coding of probability distributions for image feature descriptors
US8798377B2 (en) * 2010-02-08 2014-08-05 Telefonica, S.A. Efficient scale-space extraction and description of interest points
US20120182442A1 (en) * 2011-01-14 2012-07-19 Graham Kirsch Hardware generation of image descriptors
CA2804439A1 (en) * 2012-12-13 2014-06-13 Ehsan Fazl Ersi System and method for categorizing an image
US20150016668A1 (en) * 2013-07-12 2015-01-15 Ut-Battelle, Llc Settlement mapping systems
US9483706B2 (en) * 2015-01-08 2016-11-01 Linear Algebra Technologies Limited Hardware accelerator for histogram of gradients

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2064652B1 (en) * 2006-08-16 2010-10-06 Imperial Innovations Limited Method of image processing
WO2011069023A2 (en) * 2009-12-02 2011-06-09 Qualcomm Incorporated Fast subspace projection of descriptor patches for image recognition

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See also references of EP3074928A4 *

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107330412A (en) * 2017-07-06 2017-11-07 湖北科技学院 A kind of face age estimation method based on depth rarefaction representation
CN107330412B (en) * 2017-07-06 2021-03-26 湖北科技学院 A face age estimation method based on deep sparse representation
CN107657239A (en) * 2017-09-30 2018-02-02 清华大学深圳研究生院 Palmprint image gender classification method and device, computer installation and readable storage medium storing program for executing

Also Published As

Publication number Publication date
US9940506B2 (en) 2018-04-10
US20160379043A1 (en) 2016-12-29
EP3074928A1 (en) 2016-10-05
CA2931348A1 (en) 2015-05-28
EP3074928A4 (en) 2017-11-15

Similar Documents

Publication Publication Date Title
US9940506B2 (en) System and method for face recognition
Fukui et al. Difference subspace and its generalization for subspace-based methods
Patil et al. 3-D face recognition: features, databases, algorithms and challenges
US9323980B2 (en) Pose-robust recognition
US20090010500A1 (en) Face Recognition Methods and Systems
KR100950776B1 (en) Method of face recognition
Ren et al. A chi-squared-transformed subspace of LBP histogram for visual recognition
Luevano et al. A study on the performance of unconstrained very low resolution face recognition: Analyzing current trends and new research directions
Filković et al. Deep metric learning for person Re-identification and De-identification
Gatto et al. Discriminative canonical correlation analysis network for image classification
Günay et al. A new facial age estimation method using centrally overlapped block based local texture features
Jiang et al. Patterns of Weber magnitude and orientation for uncontrolled face representation and recognition
Cui A scheme of human face recognition in complex environments
Saoud et al. Multimodal 2d+ 3d multi-descriptor tensor for face verification
Xu et al. Facial analysis with a Lie group kernel
Venkata Kranthi et al. Real-time facial recognition using deep learning and local binary patterns
Mousa Pasandi Face, Age and Gender Recognition Using Local Descriptors
Jana et al. Gender identification using ensemble linear discriminant analysis algorithm based on facial features
Forczmański et al. Improving the recognition of occluded faces by means of two-dimensional orthogonal projection into local subspaces
Nigam et al. Robust ear recognition using gradient ordinal relationship pattern
Li et al. An efficient robust eye localization by learning the convolution distribution using eye template
Al-Obaydy et al. Patch-based pose invariant features for single sample face recognition
Wei et al. An effective two-dimensional linear discriminant analysis with locality preserving approach for image recognition
Brown et al. Improved automatic face segmentation and recognition for applications with limited training data
Elaiwat et al. 3D face identification using Curvelet transform

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 14864771

Country of ref document: EP

Kind code of ref document: A1

ENP Entry into the national phase

Ref document number: 2931348

Country of ref document: CA

WWE Wipo information: entry into national phase

Ref document number: 15038812

Country of ref document: US

NENP Non-entry into the national phase

Ref country code: DE

REEP Request for entry into the european phase

Ref document number: 2014864771

Country of ref document: EP

WWE Wipo information: entry into national phase

Ref document number: 2014864771

Country of ref document: EP