CN108564016A - A kind of AU categorizing systems based on computer vision and method - Google Patents

A kind of AU categorizing systems based on computer vision and method Download PDF

Info

Publication number
CN108564016A
CN108564016A CN201810297489.1A CN201810297489A CN108564016A CN 108564016 A CN108564016 A CN 108564016A CN 201810297489 A CN201810297489 A CN 201810297489A CN 108564016 A CN108564016 A CN 108564016A
Authority
CN
China
Prior art keywords
key point
face
module
classification
training
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201810297489.1A
Other languages
Chinese (zh)
Inventor
黄环
陈东浩
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Hongyun Zhisheng Technology Co Ltd
Original Assignee
Beijing Hongyun Zhisheng Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Hongyun Zhisheng Technology Co Ltd filed Critical Beijing Hongyun Zhisheng Technology Co Ltd
Priority to CN201810297489.1A priority Critical patent/CN108564016A/en
Publication of CN108564016A publication Critical patent/CN108564016A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • G06V40/16Human faces, e.g. facial parts, sketches or expressions
    • G06V40/168Feature extraction; Face representation
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • G06V40/16Human faces, e.g. facial parts, sketches or expressions
    • G06V40/172Classification, e.g. identification

Landscapes

  • Engineering & Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Oral & Maxillofacial Surgery (AREA)
  • General Health & Medical Sciences (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Theoretical Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Image Analysis (AREA)

Abstract

A kind of AU categorizing systems based on computer vision and method, including:Face key point extraction module, power module, control switch, USB interface, key point characteristic extracting module, AU sort modules, feedback module and training module;The invention uses CLNF model inspection face key points position, Laplacian Eigenmaps Dimension-reduced Strategies are taken to carry out dimensionality reduction to feature, and reduce keypoint quantity to rational range, use the method for more efficient Gabor characteristic, so that the complexity of scheme substantially reduces, calculation amount is reduced, without manually participating in, it can automate and carry out FACS codings, that is AU classifies, the specific method applied is both for face picture, compared to other systems, the efficiency of the classification of this system and accuracy are obtained for prodigious raising.

Description

A kind of AU categorizing systems based on computer vision and method
Technical field
The invention belongs to computer approach control technology field, especially a kind of AU categorizing systems based on computer vision And method, it is proposed that a kind of new participates in without artificial, the system that automation carries out FACS codings (i.e. AU classification), in the system In, the specific method applied compares other systems, the efficiency of the classification of this system and accuracy both for face picture It is obtained for prodigious raising.
Background technology
Depression is a kind of common disease in the whole world, and estimation shares 3.5 hundred million patients, every year because depression is led to committed suicide people Number estimation is up to 1,000,000 people, and there is also many societies, pessimistic not self-confident and quite a lot of of abandoning for many slight patients with depression Somatization;Some are studies have shown that the facial expression of depressive patients plays an important role in emotion expression service, if very The facial expression of good analyzing processing patients with depression, it will be able to more effectively quickly make a definite diagnosis such disease, and obtain best Therapeutic effect;
Current correlative study uses automatic face Expression analysis technology, to investigate the clinical data of depression, collects suppression Influence data of the severity of strongly fragrant symptom to facial expression;Facial Action Coding System (FACS) is current facial expression annotation Gold standard;Facial expression is resolved into the component part of referred to as motor unit (AU) by FACS;Motor unit is based on dissection Face action corresponds to the contraction of specific facial muscles;For example, AU 12 encodes the contraction of greater zygomatic muscle, AU 6 encodes eye wheel The contraction of orbiculares;In order to annotate facial expression, AU individually occurs or combines generation with other AU, which realizes 12 AU classifies (i.e. FACS codings), is whether one human face expression picture of mark includes one or more of AU 1-12, if not Including which AU, which is labeled as -1, including being then labeled as 1;For example one picture comprising AU1 and AU3 is uniquely marked Note is:1 -1 1 -1 -1 -1 -1 -1 -1 -1 -1 -1;Because carrying out AU classification manually to need profession, accurately judge, So a large amount of interrogation times of clinician can be expended;
There is also a new invention, the patent No.s for the prior art:CN201710709113.2, title:The feature recognition sides AU Method, device and storage medium, a kind of AU characteristic recognition methods of invention offer, device and computer readable storage medium are described Method includes:The extract real-time facial image that is shot to camera using face recognition algorithms utilizes facial averaging model identification t A face feature point, according to the position extraction local feature of t face feature point, feature vector input respectively it is trained and with Whether the AU graders of this feature Region Matching are more than predetermined threshold value, judgement according to the probability of each AU features in prediction result AU classification results;
Although the patent is also using the automatic technology for carrying out AU classification, the specific side that above-mentioned patent modules use The problem of method classifies to face AU does not have specific aim, and what it is to uses such as critical point detection, feature extractions is all that general classification is asked Topic method used, classification it is inefficient, accuracy rate also cannot be satisfied the demand of practical application, concretely:
1. the patent identifies that face feature point, facial averaging model know all characteristic points together by facial averaging model It does not come out, is not concerned with the local feature of characteristic point, accuracy of identification is relatively low.And next our patent application uses CLNF models, in CLNF models, patch experts are that each face key point training such as eyes, lip, eyebrow is individual Point distribution collection, then each model inspection to key point be fitted to together according to full face key point overall distribution, accuracy is more It is high;
2. above-mentioned patent directly extracts the feature of 76 key points, more and intrinsic dimensionality of counting is high, computationally intensive.Ability 60-70 human face characteristic point of domain generally use optimization, not only can guarantee and has extracted enough features, but also be unlikely to calculation amount mistake Greatly;And the present invention carries out Laplacian Eigenmaps dimensionality reductions after extracting feature, to feature so that the dimension of feature It is reduced to ten orders of magnitude by thousand orders of magnitude.The efficiency of classification is effectively promoted;
3. the local feature of above-mentioned patent extraction is sift features, and this patent uses Gabor characteristic, with sift spies Sign is compared, and Gabor characteristic has stronger robustness to the error that face is aligned, and Gabor transformation simple principle is simple, easy In realization, improves accuracy rate and reduce complexity.
Invention content
In order to solve the above technical problem, the present invention provides a kind of AU categorizing systems based on computer vision and method, It is designed by the structure and algorithm of optimization, it is proposed that one kind is participated in without artificial, the system that automation carries out AU classification;This is The input of system is a face picture, has used the method based on object classification in computer vision to solve the above problems, has changed The accuracy rate of classification has been apt to it, which has that accuracy rate is high, the advantage without manually participating in i.e. codified mass data collection, can be with Classify for practical AU.
A kind of AU categorizing systems based on computer vision and method, wherein:
A kind of AU categorizing systems based on computer vision, including:Face key point extraction module, power module, control Switch, USB interface, key point characteristic extracting module, AU sort modules, feedback module and training module;
Further, the input terminal of the face key point extraction module is defeated for receiving AU face pictures to be sorted Outlet is connect with the input terminal electric signal of the key point characteristic extracting module, one end telecommunications at another end and the USB interface Number connection;
The face key point extraction module extracts structure using dlib human-face detectors, which uses the Dlib of c++ Face datection algorithm in library, the Face datection algorithm in the libraries Dlib use hog features and cascade classifier, it is very Classical human face region detection algorithm;
The dlib human-face detectors are used in a pictures, only detect the boundary of human face region, and ignore other Region;
Illustrate that the data set that the face key point extraction module uses is ck+ data set as a kind of applicating example;
Illustrate as an example, the ck+ data sets include 326 face pictures, per pictures all by professional Mark has got well the classification results of AU manually, which trains its system using these data;
Further, input traffic of the USB interface as face key point extraction module, the extraction of face key point Module is used to extract the coordinate position of picture key point;
Further, the power module for the face key point extraction module, key point characteristic extracting module with And the power supply of AU sort modules;One end of the power module is connect with the other end electric signal of the USB interface, the power supply The other end of module is electrically connected with control switch;
Further, the key point characteristic extracting module output end and the input terminal electric signal of the AU sort modules connect It connects;Key point characteristic extracting module receives the coordinate information for the key point that face key point extraction module extracts, in key point The nearby coordinates extracted region images feature at place, the foundation as subsequent image classification;
Further, after extracting feature, Laplacian Eigenmaps dimension reduction methods are used to feature so that feature Dimension be reduced to ten orders of magnitude by thousand orders of magnitude, then they are input in AU sort modules;Thus, AU sort modules Computation amount, the efficiency and effect of classification have obtained effective promotion simultaneously;
Illustrate as an example, the Laplacian Eigenmaps dimension reduction methods are to go to build from the angle of figure Relationship between data, it requires related point between each other, close as far as possible in the space after dimensionality reduction;
Further, the output end of the AU sort modules is for exporting AU classification results, another end and the training The input terminal electric signal of module connects, and AU sort modules receive near the face key point that key point characteristic extracting module extracts The feature in region carries out Mathematical treatment in AU sort modules to these features of picture;CLNF models, extraction Gabor is special Sign, Laplacian Eigenmaps dimensionality reductions and the classification of SVM bis- combine, and complete the AU classification of face picture;
This system ties CLNF models, extraction Gabor characteristic, Laplacian Eigenmaps dimensionality reductions and the classification of SVM bis- It closes, completes the AU classification of face picture;
Illustrating as an example, the CLNF models are local nerve field model, are used for face key point extraction module, CLNF includes two parts, PDM (points distribution models) and patch experts, PDM for capturing full face key point change in shape, Patch experts are used to capture the local appearance variation of each key point;Specifically, patch experts are 68 people Face key point training individually point distribution collection, then each model inspection to key point be fitted to together according to PDM;
Illustrate as an example, the SVM algorithm is a kind of two classification model, and solution, which aims at, determines one The hyperplane of classification, to maximize the interval on feature space.For the feature of the low-dimensional linearly inseparable of input, core skill is added It is ingeniously implicitly mapped in high-dimensional feature space, is allowed to linear separability;
Further, the output end of the training module is connect with the input terminal electric signal of the feedback module, training mould Block learns the classification of face AU using machine learning algorithm, and trained result is fed back to feedback module, by feedback module AU sort modules are passed to, the feedback result of AU sort module application trainings is right according to the feature of face picture to be sorted AU Picture carries out AU classification, final output AU classification results;
Illustrate as an example, whether the AU classification results are to judge this pictures comprising one in AU 1-12 Or 12 multiple bit digitals;
A kind of AU sorting techniques based on computer vision, including:
Step 1: the foundation of data set:326 face pictures and every figure corresponding have been handled in extraction ck+ data sets 326 AU classification results that dynamic classification is completed;
Step 2: the selection of characteristic point:Face key point extraction module uses CLNF models, and people is extracted from face picture The position coordinates of 68 key points of face;CLNF models train individually point distribution collection to each face key point, further according to complete Face overall distribution is fitted to together so that the accuracy rate of detection gets a greater increase;Then key point characteristic extracting module profit With the position coordinates of key point, the Gabor characteristic of the picture of key point near zone is extracted;
Further, the Gabor characteristic is a kind of feature being used for describing image texture information, to the mistake of face alignment Difference has very strong robustness, therefore extracts the Gabor characteristic of 68 key point near zones of picture;Gabor characteristic is main Adding window is carried out to signal in frequency domain by Gabor cores, so as to describe the local frequency information of signal;One Gabor core The response condition of some frequency neighborhood of image can be got, this response results is a feature of image, with multiple and different frequencies The Gabor cores of rate go obtain image different frequency neighborhood response condition, can be formed image each frequency band feature, This feature is used for describing the frequency information of image;In the present invention, we near each key point, with 5 kinds of sizes, 8 40 Gabor cores in direction extract Gabor characteristic, therefore the intrinsic dimensionality extracted per pictures is:5*8*68=2720;
As one kind in step 2 for example, the workflow of the face key point extraction module includes:
1. inputting a face picture, the training stage inputs the picture in ck+ data sets, and it is to be sorted that test phase inputs AU Face picture;
2. detecting face boundary using the human-face detector of dlib;
3. according to face border cuts picture;
4. to the picture after cutting, with CLNF model inspection face key points;
5. exporting the position coordinates of face key point.
As one kind in step 2 for example, the workflow of the key point characteristic extracting module includes:
1. receiving the coordinate position of face key point from face key point extraction module;
2. extracting Gabor characteristic in face key point near zone;
3. the Gabor characteristic extracted is done normalized;
4. the Gabor characteristic after normalization is written in txt file;
5. the txt file of output storage Gabor characteristic.
Step 3: parameter training:AU sort modules utilize the Gabor characteristic learning classification in step 2, pass through training mould Algorithm study in block, training and the model parameter for optimizing whole system, and fed back in AU sort modules by feedback module;
One kind as step 3 is for example, the training of the AU sort modules, testing process include:
Training flow:
1. corresponding Gabor characteristic in AU classification results and txt file that input ck+ data sets have marked;
2. choosing first AU, dimensionality reduction parameter is initialized, Laplacian Eigenmaps dimensionality reductions are carried out to feature;
3. calculating the quantity of training set and the positive and negative sample of test set, positive and negative sample is distinguished according to mark, according to quantitative requirement Extract training set and the corresponding feature of the positive and negative sample of test set;
4. using SVM model training graders, the accuracy rate of test set is calculated;
5. updating dimensionality reduction parameter in variation range, find out so that the maximum dimensionality reduction parameter of classification accuracy;
6. being repeated the above process respectively to other 11 AU.
Testing process:
1. inputting the Gabor characteristic of AU pictures to be sorted;
2. according to the corresponding dimensionality reduction parameter of maximum accuracy rate that training flow obtains, the Gabor characteristic extracted is carried out Laplacian Eigenmaps dimensionality reductions;
3. calculating classification results with SVM models;
4. 12 AU of pair whole classify;
5. exporting AU classification results, 12 classification results form final classification result.
Because the dimension per pictures is higher (2720 dimension), intrinsic dimensionality is reduced using dimension reduction method, facilitates meter It calculates.In the libraries sklearn, the SpectralEmbedding functions under Manifold learning (manifold learning) directly invoke Realize Laplacian Eigenmaps dimension reduction methods.Due to involved in the dimension reduction method to two parameters n_components, n_ Neighbors, n_components are used to specify the dimension of projection subspace, and it is nearest that n_neighbors is used for knn classification kinds Adjacent quantity (knn classification determines the classification results according to the classification results of several Neighbor Points of a point).The two Parameter is uncertain, so cycle is added on stream to find optimal parameter value;
In the present invention, we call the svm functions in the libraries sklearn, because to be sorted has 12 AU, and SVM is only It can be used for two classification;So in the present invention, carrying out two classification to this 12 AU respectively, i.e., individually being instructed to each AU Practice, to determine whether the picture contains this AU, finally this 12 classification results combine again;
In the training process, sample is divided into training set and test set by us, and training set quantity accounts for 75%, test set quantity Account for 25%.In order to ensure classifying quality, the number of the positive and negative sample used in training set is identical.The standard tested by comparing test set True rate is as a result, we choose suitable dimensionality reduction parameter.During the test, we drop feature using optimal dimensionality reduction parameter Dimension, then classifies to AU with trained SVM classifier.
Step 4: measuring and calculation:After training, AU face picture files to be sorted are input to face key point and are carried In modulus block, as a result being sequentially inputted in key point characteristic extracting module and AU sort modules step by step;
Step 5: feedback calculates:In AU sort modules, the training result passed back using feedback module, to test pictures Result carry out AU classification, export AU classification results.
Advantageous effect:
1. the process automation that AU classifies completely, without manually participating in can be completed the classification of all 12 AU;
2. carrying out dimensionality reduction to feature by using Laplacian Eigenmaps Dimension-reduced Strategies, and reduce crucial points Measure rational range, using the method for more efficient Gabor characteristic so that the complexity of scheme substantially reduces, and calculation amount subtracts It is few;
3. by the innovation and fusion of a variety of methods, the efficiency and effect of AU classification have obtained effective promotion simultaneously;
4. technology, the Gabor characteristic of application fetches key point near zone of application CLNF model inspection face key points Technology, using Laplacian Eigenmaps dimension reduction methods, using bis- sorting techniques of svm, by CLNF models, extraction Gabor Feature, Laplacian Eigenmaps dimensionality reductions and the classification of SVM bis- combine, and complete the AU classification of face picture.
Description of the drawings
Fig. 1 is a kind of overall structure diagram of AU categorizing systems based on computer vision of the present invention
Specific implementation mode
In the following, shown in refer to the attached drawing 1, a kind of AU categorizing systems based on computer vision and method, wherein:
A kind of AU categorizing systems based on computer vision, including:Face key point extraction module 101, power module 102, control switch 103, usb 1 04, key point characteristic extracting module 105, AU sort modules 106, feedback module 107 with And training module 108;
Further, the input terminal of the face key point extraction module 101 is used to receive AU face pictures to be sorted, Output end is connect with the input terminal electric signal of the key point characteristic extracting module 105, another end and the usb 1 04 One end electric signal connection;
The face key point extraction module 101 extracts structure using dlib human-face detectors, and the structure is using c++'s Face datection algorithm in the libraries Dlib, the Face datection algorithm in the libraries Dlib use hog features and cascade classifier, it is Very classical human face region detection algorithm;
The dlib human-face detectors are used in a pictures, only detect the boundary of human face region, and ignore other Region;
Illustrate that the data set that the face key point extraction module 101 uses is ck+ data as a kind of applicating example Collection;
Illustrate as an example, the ck+ data sets include 326 face pictures, per pictures all by professional Mark has got well the classification results of AU manually, which trains its system using these data;
Further, input traffic of the usb 1 04 as face key point extraction module 101, face are crucial Point extraction module 101 is used to extract the coordinate position of picture key point;
Further, the power module 102 is for the face key point extraction module 101, key point feature extraction The power supply of module 105 and AU sort modules 106;The other end of one end of the power module 102 and the usb 1 04 Electric signal connects, and the other end of the power module 102 is electrically connected with the control switch 103;
Further, the input terminal electricity of 105 output end of key point characteristic extracting module and the AU sort modules 106 Signal connects;Key point characteristic extracting module 105 receives the coordinate letter for the key point that face key point extraction module 101 extracts Breath, in the nearby coordinates extracted region images feature where key point, the foundation as subsequent image classification;
Further, after extracting feature, Laplacian Eigenmaps dimension reduction methods are used to feature so that feature Dimension be reduced to ten orders of magnitude by thousand orders of magnitude, then they are input in AU sort modules 106;Thus, which AU classifies The computation amount of module 106, the efficiency and effect of classification have obtained effective promotion simultaneously;
Illustrate as an example, the Laplacian Eigenmaps dimension reduction methods are to go to build from the angle of figure Relationship between data, it requires related point between each other, close as far as possible in the space after dimensionality reduction;
Further, the output end of the AU sort modules 106 is for exporting AU classification results, another end and the instruction Practice the input terminal electric signal connection of module 108, AU sort modules 106 receive the people that key point characteristic extracting module 105 extracts The feature of face key point near zone carries out Mathematical treatment in AU sort modules 106 to these features of picture;
This method ties CLNF models, extraction Gabor characteristic, Laplacian Eigenmaps dimensionality reductions and the classification of SVM bis- It closes, completes the AU classification of face picture;
Illustrating as an example, the CLNF models are local nerve field model, are used for face key point extraction module, CLNF includes two parts, PDM (points distribution models) and patch experts, PDM for capturing full face key point change in shape, Patch experts are used to capture the local appearance variation of each key point;Specifically, patch experts are 68 people Face key point training individually point distribution collection, then each model inspection to key point be fitted to together according to PDM;
Illustrate as an example, the SVM algorithm is a kind of two classification model, and solution, which aims at, determines one The hyperplane of classification, to maximize the interval on feature space;For the feature of the low-dimensional linearly inseparable of input, core skill is added It is ingeniously implicitly mapped in high-dimensional feature space, is allowed to linear separability;
Further, the output end of the training module 108 is connect with the input terminal electric signal of the feedback module 107, Training module 108 learns the classification of face AU using machine learning algorithm, and trained result is fed back to feedback module, by Feedback module 107 passes to AU sort modules 106, and the feedback result of 106 application training of AU sort modules is to be sorted according to AU The feature of face picture carries out AU classification, final output AU classification results to picture;
Illustrate as an example, whether the AU classification results are to judge this pictures comprising one in AU 1-12 Or 12 multiple bit digitals;
A kind of AU sorting techniques based on computer vision, including:
Step 1: the foundation of data set:326 face pictures and every figure corresponding have been handled in extraction ck+ data sets 326 AU classification results that dynamic classification is completed;
Step 2: the selection of characteristic point:Face key point extraction module 101 uses CLNF models, is carried from face picture Take the position coordinates of 68 key points of face;CLNF models train individually point distribution collection, then root to each face key point It is fitted to together according to full face overall distribution so that the accuracy rate of detection gets a greater increase;Then key point feature extraction mould Block 105 extracts the Gabor characteristic of the picture of key point near zone using the position coordinates of key point;
Further, the Gabor characteristic is a kind of feature being used for describing image texture information, to the mistake of face alignment Difference has very strong robustness, therefore extracts the Gabor characteristic of 68 key point near zones of picture;Gabor characteristic is main Adding window is carried out to signal in frequency domain by Gabor cores, so as to describe the local frequency information of signal;One Gabor core The response condition of some frequency neighborhood of image can be got, this response results is a feature of image, with multiple and different frequencies The Gabor cores of rate go obtain image different frequency neighborhood response condition, can be formed image each frequency band feature, This feature is used for describing the frequency information of image;In the present invention, we near each key point, with 5 kinds of sizes, 8 40 Gabor cores in direction extract Gabor characteristic, therefore the intrinsic dimensionality extracted per pictures is:5*8*68=2720;
As one kind in step 2 for example, the workflow of the face key point extraction module 101 includes:
1. inputting a face picture, the training stage inputs picture in ck+ data sets, and it is to be sorted that test phase inputs AU Face picture;
2. detecting face boundary using the human-face detector of dlib;
3. according to face border cuts picture;
4. the picture after pair cutting, with CLNF model inspection face key points;
5. exporting the position coordinates of face key point.
As one kind in step 2 for example, the workflow of the key point characteristic extracting module 105 includes:
1. receiving the coordinate position of face key point from face key point extraction module;
2. extracting Gabor characteristic in face key point near zone;
3. the Gabor characteristic extracted is done normalized;
4. the Gabor characteristic after normalization is written in txt file;
5. the txt file of output storage Gabor characteristic.
Step 3: parameter training:AU sort modules 106 pass through training using the Gabor characteristic learning classification in step 2 Algorithm study in module 108, training and the model parameter for optimizing whole system, and feed back to AU points by feedback module 107 In generic module 106;
One kind as step 3 is for example, the training of the AU sort modules 106, testing process include:
Training flow:
1. corresponding Gabor characteristic in AU classification results and txt file that input ck+ data sets have marked;
2. choosing first AU, dimensionality reduction parameter is initialized, Laplacian Eigenmaps dimensionality reductions are carried out to feature;
3. calculating the quantity of training set and the positive and negative sample of test set, positive and negative sample is distinguished according to mark, according to quantitative requirement Extract training set and the corresponding feature of the positive and negative sample of test set;
4. using SVM model training graders, the accuracy rate of test set is calculated;
5. updating dimensionality reduction parameter in variation range, find out so that the maximum dimensionality reduction parameter of classification accuracy;
6. being repeated the above process respectively to other 11 AU.
Testing process:
1. inputting the Gabor characteristic of AU pictures to be sorted;
2. according to the corresponding dimensionality reduction parameter of maximum accuracy rate that training flow obtains, the Gabor characteristic extracted is carried out Laplacian Eigenmaps dimensionality reductions;
3. calculating classification results with SVM models;
4. 12 AU of pair whole classify;
5. exporting AU classification results, 12 classification results form final classification result.
Because the dimension per pictures is higher (2720 dimension), intrinsic dimensionality is reduced using dimension reduction method, facilitates meter It calculates;In the libraries sklearn, the SpectralEmbedding functions under Manifold learning (manifold learning) directly invoke Realize Laplacian Eigenmaps dimension reduction methods.Due to involved in the dimension reduction method to two parameters n_components, n_ Neighbors, n_components are used to specify the dimension of projection subspace, and it is nearest that n_neighbors is used for knn classification kinds Adjacent quantity (knn classification determines the classification results according to the classification results of several Neighbor Points of a point).The two Parameter is uncertain, so cycle is added on stream to find optimal parameter value;
In the present invention, we call the svm functions in the libraries sklearn, because to be sorted has 12 AU, and SVM is only It can be used for two classification;So in the present invention, carrying out two classification to this 12 AU respectively, i.e., individually being instructed to each AU Practice, to determine whether the picture contains this AU, finally this 12 classification results combine again;
In the training process, sample is divided into training set and test set by us, and training set quantity accounts for 75%, test set quantity Account for 25%.In order to ensure classifying quality, the number of the positive and negative sample used in training set is identical.The standard tested by comparing test set True rate is as a result, we choose suitable dimensionality reduction parameter.During the test, we drop feature using optimal dimensionality reduction parameter Dimension, then classifies to AU with trained SVM classifier.
Step 4: measuring and calculation:After training, AU face picture files to be sorted are input to face key point and are carried In modulus block 101, as a result being sequentially inputted in key point characteristic extracting module 105 and AU sort modules 106 step by step;
Step 5: feedback calculates:In AU sort modules, the training result passed back using feedback module 107, to test chart The result of piece carries out AU classification, exports AU classification results.
The process automation that the present invention completely classifies AU, without manually participating in can be completed the classification of all 12 AU, Dimensionality reduction is carried out to feature by using Laplacian Eigenmaps Dimension-reduced Strategies, and reduces keypoint quantity to reasonably Range uses the method for more efficient Gabor characteristic so that the complexity of scheme substantially reduces, and calculation amount is reduced;By a variety of The innovation and fusion of method, the efficiency and effect of AU classification have obtained effective promotion simultaneously;Using CLNF model inspection faces The technology of key point, the technology of the Gabor characteristic of application fetches key point near zone, using Laplacian Eigenmaps Dimension reduction method, using bis- sorting techniques of svm, by CLNF models, extraction Gabor characteristic, Laplacian Eigenmaps dimensionality reductions and The classification of SVM bis- combines, and completes the AU classification of face picture.
Disclosed above is only the specific embodiment of the application, and however, this application is not limited to this, any this field Technical staff can think variation, should all fall in the protection domain of the application.

Claims (10)

1. a kind of AU categorizing systems based on computer vision, which is characterized in that including:Face key point extraction module, power supply Module, control switch, USB interface, key point characteristic extracting module, AU sort modules, feedback module and training module;
The input terminal of the face key point extraction module is for receiving AU face pictures to be sorted, output end and the key The input terminal electric signal of point feature extraction module connects, and another end is connect with one end electric signal of the USB interface;The USB Input traffic of the interface as face key point extraction module, face key point extraction module is for extracting picture key point Coordinate position;The power module is for the face key point extraction module, key point characteristic extracting module and AU classification The power supply of module;One end of the power module is connect with the other end electric signal of the USB interface, the power module it is another One end is electrically connected with control switch;The input terminal of the key point characteristic extracting module output end and the AU sort modules Electric signal connects;Key point characteristic extracting module receives the coordinate information for the key point that face key point extraction module extracts, The dimensionality reduction sides Laplacian Eigenmaps are used in the nearby coordinates extracted region images feature where key point, and to feature Method so that be input to after the dimension reduction of feature to ten orders of magnitude in AU sort modules, the output end of the AU sort modules is used In output AU classification results, another end is connect with the input terminal electric signal of the training module, the output of the training module End is connect with the input terminal electric signal of the feedback module.
2. a kind of AU categorizing systems based on computer vision according to claim 1, which is characterized in that described Laplacian Eigenmaps dimension reduction methods are from the relationship gone with the angle of figure between structure data, it requires mutual It is related, it is close as far as possible in the space after dimensionality reduction.
3. a kind of AU categorizing systems based on computer vision according to claim 1, which is characterized in that the AU classification Module receives the feature for the face key point near zone that key point characteristic extracting module extracts, right in AU sort modules These features of picture carry out Mathematical treatment, by CLNF models, extraction Gabor characteristic, Laplacian Eigenmaps dimensionality reductions Classify with SVM bis- and combine, completes the AU classification of face picture.
4. a kind of AU categorizing systems based on computer vision according to claim 3, which is characterized in that the CLNF moulds Type is local nerve field model, is used for face critical point detection, CLNF includes two parts, PDM (points distribution models) and patch For capturing full face key point change in shape, the part that patch experts are used to capture each key point is outer by experts, PDM See variation;Individually point distribution collection is trained specifically, patch experts are 68 face key points, then each model The key point detected is fitted to together according to PDM.
5. a kind of AU categorizing systems based on computer vision according to claim 1, which is characterized in that the trained mould Block learns the classification of face AU using machine learning algorithm, and trained result is fed back to feedback module, by feedback module AU sort modules are passed to, the feedback result of AU sort module application trainings is right according to the feature of face picture to be sorted AU Picture carries out AU classification.
6. a kind of AU categorizing systems based on computer vision according to claim 5, which is characterized in that the AU classification As a result judge whether this pictures includes 12 bit digitals of one or more of AU 1-12.
7. a kind of AU categorizing systems based on computer vision according to claim 1 or 3, which is characterized in that the people Face key point extraction module extracts structure using dlib human-face detectors, and the structure is using the Face datection in the libraries Dlib of c++ Algorithm, the Face datection algorithm in the libraries Dlib use hog features and cascade classifier, the face key point extraction module The data set used is ck+ data set;The ck+ data sets include 326 face pictures, per pictures all by professional's hand Dynamic mark has got well the classification results of AU.
8. a kind of AU sorting techniques based on computer vision, which is characterized in that including following processing step:
Step 1: the foundation of data set:326 face pictures and every figure corresponding divide manually in extraction ck+ data sets 326 AU classification results that class is completed;
Step 2: the selection of characteristic point:Face key point extraction module uses CLNF models, and face is extracted from face picture The position coordinates of 68 key points;CLNF models train individually point distribution collection to each face key point, whole further according to full face Body fitting of distribution is to together so that the accuracy rate of detection gets a greater increase;Then key point characteristic extracting module utilizes pass The position coordinates of key point extract the Gabor characteristic of the picture of key point near zone;
Step 3: parameter training:AU sort modules utilize the Gabor characteristic learning classification in step 2, by training module Algorithm study, training simultaneously optimizes the model parameter of whole system, and is fed back in AU sort modules by feedback module;
Step 4: measuring and calculation:After training, AU face picture files to be sorted are input to face key point and extract mould In block, as a result being sequentially inputted in key point characteristic extracting module and AU sort modules step by step;
Step 5: feedback calculates:In AU sort modules, the training result passed back using feedback module, to the knot of test pictures Fruit carries out AU classification, exports AU classification results.
9. a kind of AU sorting techniques based on computer vision according to claim 8, which is characterized in that
The workflow of the face key point extraction module includes:A face picture is inputted, the training stage inputs from ck+ numbers AU face pictures to be sorted are inputted according to picture, test phase is concentrated;Face boundary is detected using the human-face detector of dlib; According to face border cuts picture;To the picture after cutting, with CLNF model inspection face key points;Export face key point Position coordinates;
The workflow of the key point characteristic extracting module includes:Face key point is received from face key point extraction module Coordinate position;Gabor characteristic is extracted in face key point near zone;The Gabor characteristic extracted is done into normalized;Return Gabor characteristic after one change is written in txt file;The txt file of output storage Gabor characteristic;
The training of the AU sort modules, testing process include:
Training flow:Corresponding Gabor characteristic in AU classification results and txt file that input ck+ data sets have marked;Choose the One AU initializes dimensionality reduction parameter, and Laplacian Eigenmaps dimensionality reductions are carried out to feature;Calculate training set and test set just The quantity of negative sample distinguishes positive and negative sample according to mark, extracts training set according to quantitative requirement and the positive and negative sample of test set corresponds to Feature;With SVM model training graders, the accuracy rate of test set is calculated;Dimensionality reduction parameter is updated in variation range, finding out makes Obtain the maximum dimensionality reduction parameter of classification accuracy;Other 11 AU are repeated the above process respectively;
Testing process:Input the Gabor characteristic of AU pictures to be sorted;The corresponding drop of maximum accuracy rate obtained according to training flow Parameter is tieed up, Laplacian Eigenmaps dimensionality reductions are carried out to the Gabor characteristic extracted;Classification results are calculated with SVM models; Classify to all 12 AU;AU classification results are exported, 12 classification results form final classification result.
10. a kind of AU sorting techniques based on computer vision according to claim 8, which is characterized in that because every The dimension of picture is higher, so being reduced intrinsic dimensionality using dimension reduction method, in the libraries sklearn, under Manifold learning SpectralEmbedding functions, which directly invoke, realizes Laplacian Eigenmaps dimension reduction methods;Due in the dimension reduction method It is related to two parameters n_components, n_neighbors, n_components and is used to specify the dimension of projection subspace, N_neighbors is used for the quantity of knn classification kind arest neighbors;The two parameters are uncertain, so being added on stream It recycles to find optimal parameter value;
The svm functions in the libraries sklearn are called, because to be sorted there are 12 AU, and SVM is only used for two classification;So In the present invention, two classification are carried out to this 12 AU respectively, i.e., each AU are individually trained, whether to determine the picture Containing this AU, finally this 12 classification results are combined again.
CN201810297489.1A 2018-04-04 2018-04-04 A kind of AU categorizing systems based on computer vision and method Pending CN108564016A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810297489.1A CN108564016A (en) 2018-04-04 2018-04-04 A kind of AU categorizing systems based on computer vision and method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810297489.1A CN108564016A (en) 2018-04-04 2018-04-04 A kind of AU categorizing systems based on computer vision and method

Publications (1)

Publication Number Publication Date
CN108564016A true CN108564016A (en) 2018-09-21

Family

ID=63534059

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810297489.1A Pending CN108564016A (en) 2018-04-04 2018-04-04 A kind of AU categorizing systems based on computer vision and method

Country Status (1)

Country Link
CN (1) CN108564016A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110458001A (en) * 2019-06-28 2019-11-15 南昌大学 A Convolutional Neural Network Sight Estimation Method and System Based on Attention Mechanism
CN110992455A (en) * 2019-12-08 2020-04-10 北京中科深智科技有限公司 Real-time expression capturing method and system

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103065122A (en) * 2012-12-21 2013-04-24 西北工业大学 Facial expression recognition method based on facial motion unit combination features
CN106295678A (en) * 2016-07-27 2017-01-04 北京旷视科技有限公司 Neural metwork training and construction method and device and object detection method and device
CN106384083A (en) * 2016-08-31 2017-02-08 上海交通大学 Automatic face expression identification and information recommendation method
CN107633207A (en) * 2017-08-17 2018-01-26 平安科技(深圳)有限公司 AU characteristic recognition methods, device and storage medium

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103065122A (en) * 2012-12-21 2013-04-24 西北工业大学 Facial expression recognition method based on facial motion unit combination features
CN106295678A (en) * 2016-07-27 2017-01-04 北京旷视科技有限公司 Neural metwork training and construction method and device and object detection method and device
CN106384083A (en) * 2016-08-31 2017-02-08 上海交通大学 Automatic face expression identification and information recommendation method
CN107633207A (en) * 2017-08-17 2018-01-26 平安科技(深圳)有限公司 AU characteristic recognition methods, device and storage medium

Non-Patent Citations (5)

* Cited by examiner, † Cited by third party
Title
BALTRUSAITIS T ET AL: "Constrained Local Neural Fields for Robust Facial Landmark Detection in the Wild", 《IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS》 *
PAN S ET AL: "Facial action units for presentation attack detection", 《 INTERNATIONAL CONFERENCE ON EMERGING SECURITY TECHNOLOGIES》 *
S.MOHAMMAD MAVADATI ET AL: "Automatic detection of non-posed facial action units", 《IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING》 *
杨宇: "《机械故障诊断的变量预测模式识别方法》", 31 August 2017 *
牛新亚: "基于深度学习的人脸表情识别研究", 《中国优秀硕士学位论文全文数据库信息科技辑》 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110458001A (en) * 2019-06-28 2019-11-15 南昌大学 A Convolutional Neural Network Sight Estimation Method and System Based on Attention Mechanism
CN110992455A (en) * 2019-12-08 2020-04-10 北京中科深智科技有限公司 Real-time expression capturing method and system

Similar Documents

Publication Publication Date Title
Chen et al. Sckansformer: Fine-grained classification of bone marrow cells via kansformer backbone and hierarchical attention mechanisms
CN109543526B (en) True and false facial paralysis recognition system based on depth difference characteristics
Foggia et al. Benchmarking HEp-2 cells classification methods
CN107403142B (en) A kind of detection method of micro- expression
TWI687159B (en) Fry counting system and fry counting method
CN108229268A (en) Expression recognition and convolutional neural network model training method and device and electronic equipment
CN104598888B (en) A kind of recognition methods of face gender
CN106023151A (en) Traditional Chinese medicine tongue manifestation object detection method in open environment
CN102194108A (en) Smiley face expression recognition method based on clustering linear discriminant analysis of feature selection
CN115910338A (en) A method and device for assessing human health status based on multimodal biometrics
CN119027983B (en) Method, device and electronic equipment for identifying images of livestock and poultry diseases
CN103258216A (en) Regional deformation target detection method and system based on online learning
CN117503056A (en) Multi-scale respiratory event automatic positioning method and device based on dual-modal feature learning
Bilang et al. Cactaceae detection using MobileNet architecture
Wang et al. A ROI extraction method for wrist imaging applied in smart bone-age assessment system
CN108564016A (en) A kind of AU categorizing systems based on computer vision and method
CN119359783B (en) Posture detection method based on three-dimensional measurement of livestock
Ponraj et al. Deep learning with histogram of oriented gradients-based computer-aided diagnosis for breast cancer detection and classification
CN107506747A (en) Face identification system and method based on video data characteristic point
CN117636053A (en) Tongue surface image analysis method and device
CN117727091A (en) A human action recognition system and method based on computer vision
Garg et al. Automated detection and diagnosis of skin-lesion using transfer learning based YOLOv7 approach
CN119672296B (en) Fish disease detection method, device, computer equipment and storage medium
CN121281120B (en) Method for constructing obstructive sleep apnea feature recognition model
Ma et al. Research on Crop Leaf Disease Recognition Algorithm Based on Embedded Device

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20180921