WO2017179728A1 - Dispositif, procédé et programme de reconnaissance d'image - Google Patents

Dispositif, procédé et programme de reconnaissance d'image Download PDF

Info

Publication number
WO2017179728A1
WO2017179728A1 PCT/JP2017/015390 JP2017015390W WO2017179728A1 WO 2017179728 A1 WO2017179728 A1 WO 2017179728A1 JP 2017015390 W JP2017015390 W JP 2017015390W WO 2017179728 A1 WO2017179728 A1 WO 2017179728A1
Authority
WO
WIPO (PCT)
Prior art keywords
image
image recognition
actual size
feature
feature point
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/JP2017/015390
Other languages
English (en)
Japanese (ja)
Inventor
郁子 椿
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Sharp Corp
Original Assignee
Sharp Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Sharp Corp filed Critical Sharp Corp
Publication of WO2017179728A1 publication Critical patent/WO2017179728A1/fr
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis

Definitions

  • the following disclosure relates to an image recognition apparatus, an image processing method, and an image recognition program that perform image recognition using a local feature amount of an image.
  • Image recognition is a technique for identifying what an object is included in an image.
  • general object recognition is a technique for identifying a category of an object
  • specific object recognition is a technique for searching for the same object from an image database.
  • Patent Document 1 discloses an image recognition method in which a feature vector is extracted from an image of an object, the object is expressed by a large number of feature vectors, and an object having a matching feature is searched from an image database. .
  • One aspect of the present invention has been made in view of the above circumstances, and an object thereof is to provide an image recognition apparatus, an image recognition method, and an image recognition program that are unlikely to cause erroneous recognition in image recognition. is there.
  • an image recognition apparatus is an image recognition apparatus that identifies an object included in a query image, and is a standard image determined in advance from the query image.
  • a special object having a known size is detected, a detection unit that detects information indicating an actual size of a region corresponding to the special object in the query image, and a feature point of the query image is extracted from the query image.
  • An image recognition method is an image recognition method for identifying an object included in a query image, and a special standard size known in advance is known from the query image.
  • a detection step of detecting an object and detecting information indicating an actual size of a region corresponding to the special object in the query image; an extraction step of extracting a feature point of the query image from the query image; and the query image And an image recognition step for identifying an object included in the query image on the basis of information indicating the actual size of the region corresponding to the special object and a feature point of the query image.
  • FIG. 1 is a block diagram illustrating a schematic configuration example of an image recognition device according to a first embodiment of the present invention. It is a figure which shows the image containing a face image. It is a figure for demonstrating the feature point actual size size. It is a flowchart for demonstrating the flow of the process in the image recognition apparatus which concerns on 1st embodiment of this invention. It is a flowchart for demonstrating the flow of the process in the image recognition apparatus which concerns on 1st embodiment of this invention. It is a block diagram which shows the example of schematic structure of the image recognition apparatus which concerns on 2nd embodiment of this invention. It is a flowchart for demonstrating the flow of a process in the image recognition apparatus which concerns on 2nd embodiment of this invention. It is a flowchart for demonstrating the flow of a process in the image recognition apparatus which concerns on 2nd embodiment of this invention.
  • FIG. 1 is a block diagram showing a schematic configuration example of an image recognition apparatus 10 according to the first embodiment of the present invention.
  • an image recognition apparatus 10 includes a learning image input unit 11 that inputs a learning image including an image of an object to be compared, a special object detection unit 12 that performs object detection related to a special object, A feature amount extraction unit 13 that extracts a feature amount from an image, an actual size calculation unit 14 that calculates a feature point actual size, a storage unit 15 that stores a database, and a query image including an image of an object to be recognized are input.
  • a query image input unit 16 a feature amount matching unit 17 that matches feature amounts, an object ID selection unit 18 that selects an object ID using a matching result, and an output unit 19 that outputs a selection result of the object ID.
  • the actual size calculation unit 14, the feature amount matching unit 17, and the object ID selection unit 18 may be referred to as an image recognition unit.
  • the image recognition apparatus 10 performs object recognition by performing image recognition method processing in two phases, a learning phase and an identification phase.
  • the image recognition apparatus 10 performs a process of inputting a plurality of learning images and creating a database.
  • the identification phase the image recognition device 10 inputs a query image and performs a process of identifying an image similar to the query image from the learning image input in the learning phase.
  • the identification phase will be described.
  • the learning image input unit 11 captures a learning image from the outside, and outputs the captured learning image to the special object detection unit 12 and the feature amount extraction unit 13.
  • Each learning image is given an object ID in advance, and the learning image input unit 11 also captures the object ID from the outside and outputs it to the feature amount extraction unit 13.
  • the learning image input unit 11 may capture an image of an image file recorded in the recording device, or may capture an image acquired through a network.
  • the image captured by the learning image input unit 11 may be a still image or a moving image.
  • the learning image input unit 11 decomposes the moving image into frame images, and sequentially outputs the decomposed frame images to the special object detection unit 12 and the feature amount extraction unit 13.
  • the special object detection unit 12 first performs a special object detection process on the image received from the learning image input unit 11.
  • the special object is an object having an image recognition method for recognizing the object, such as a human face or a car license plate, and an object whose actual size is roughly determined.
  • the special object detection process is a process of detecting whether or not a special object designated in advance is included in the image and detecting information indicating the actual size of the detected special object region.
  • the special object detection unit 12 detects whether a face is included in the image as the special object detection process, and detects the face area size as the special object area size.
  • FIG. 2 shows an image including a face image which is an example of a special object.
  • the image A1 shown in FIG. 2 includes an image of the subject's face A2.
  • the face area size is a value indicating the size of the area of the face A2 in the entire image A1, and for example, the width of the area A3 in which the area of the face A2 in FIG. It is assumed that the number of pixels is A4) in FIG.
  • the face area size is set to the number of pixels of the width A4 of the area A3. Note that such face detection processing can be performed using, for example, the Viola-Jones method.
  • the special object detection unit 12 calculates the special object resolution using the detected special object region size.
  • the special object resolution is a value indicating how many mm of an area of the surface of the special object each pixel represents in the special object area, and is calculated by dividing the actual size of the special object by the special object area size. .
  • the special object detection unit 12 outputs the obtained special object resolution and the center coordinates of the special object region to the actual size calculation unit 14 as information indicating the actual size of the special object region.
  • the special object detection unit 12 calculates the special object resolution for each special object region, and the obtained special object resolution and the center of the special object region are calculated. The coordinates are output to the actual size calculation unit 14.
  • the special object detection unit 12 outputs a signal indicating that no special object exists to the actual size calculation unit 14.
  • the feature amount extraction unit 13 extracts a local feature amount of each learning image received from the learning image input unit 11.
  • SIFT Scale-Invariant Feature Transform
  • the feature quantity extraction unit 13 detects feature points using SIFT and obtains the coordinates of each feature point.
  • the feature amount extraction unit 13 calculates the scale of feature points at the same time.
  • the scale is a value indicating at which resolution the feature point is obtained.
  • the feature quantity extraction unit 13 calculates a SIFT feature quantity for each obtained feature point.
  • the SIFT feature amount is a 128-dimensional vector obtained from the luminance gradient in the peripheral area of the feature point.
  • the feature quantity extraction unit 13 outputs the SIFT feature quantity, coordinates, and scale of each feature point to the actual size calculation unit 14.
  • the feature quantity extraction unit 13 also outputs the object ID received from the learning image input unit 11 to the actual size calculation unit 14 together.
  • the actual size calculation unit 14 calculates the feature point actual size size of each feature point using the special object resolution received from the special object detection unit 12 and the scale received from the feature amount extraction unit 13.
  • the feature point actual size size is a value indicating the size of the texture characterizing the feature point on the actual subject.
  • the feature point actual size is a value obtained by multiplying the number of pixels on the actual subject by the line number connecting the center point of the special object region to each feature point and the actual object resolution.
  • the number of pixels on the actual subject other than the center point of the special object area and the line segment connecting any point in the special object area to each feature point may be used as the actual feature point size.
  • FIG. 3 is a diagram for explaining the actual size of feature points.
  • the center point of the special object area B1 is a point B3.
  • the feature point B2 can be detected using SIFT for each pixel included in the special object region B1.
  • the size of the segment B4 connecting the point B3 to the detected feature point B2 is the actual feature point size of the feature point B2.
  • the face area size is the number of pixels having the width A4 of the area A3
  • the size of the line segment B4 is also calculated as the horizontal size.
  • the actual size of the feature points can be obtained in the same manner by considering the horizontal direction in the vertical direction. Furthermore, even when the face region size is a two-dimensional size in the horizontal and vertical directions, the actual feature point size can be obtained in the same manner by calculating the vertical and horizontal sizes.
  • the image recognition apparatus 10 detects a special object whose actual size can be estimated even when the actual size of the texture is not known, and uses the size to make a feature The point size can be estimated.
  • the feature amount extraction unit 13 obtains feature points by SIFT.
  • the feature point by SIFT has the concept of scale, and in order to consider this, the actual size calculation unit 14 obtains the final feature point actual size by multiplying the actual feature size obtained above by the scale. Calculate as
  • the actual size calculation unit 14 compares the coordinates of the feature points received from the feature amount extraction unit 13 with the center coordinates of each special object region received from the special object detection unit 12, and the special object region with the closest distance. Is used to calculate the actual size of the feature point.
  • the actual size calculation unit 14 sets the actual size of the feature point to zero.
  • the actual size calculation unit 14 outputs the feature amount of each feature point and the feature point actual size to the storage unit 15.
  • the actual size calculation unit 14 also outputs the object ID received from the feature amount extraction unit 13 to the storage unit 15.
  • the storage unit 15 stores a database.
  • the database is composed of fields (columns) that hold feature point IDs, object IDs, feature amounts, and feature point actual size sizes, and one record (row) is data related to one feature point.
  • the storage unit 15 receives the feature amount, the feature point actual size size, and the object ID related to one feature point from the actual size calculation unit 14, the storage unit 15 creates a feature point ID and stores it in the feature point ID field of the storage unit 15.
  • the feature point ID is an arbitrary value that varies depending on the feature point.
  • the feature amount, the actual feature point size, the object ID, and the feature point ID are stored in each field of the storage unit 15. Note that object IDs of a plurality of feature points obtained from one learning image have the same value.
  • step S1 the learning image input unit 11 inputs a learning image and an object ID.
  • step S2 the special object detection unit 12 detects a special object.
  • step S3 the feature quantity extraction unit 13 detects feature points and extracts feature quantities of the feature points.
  • step S4 the actual size calculation unit 14 calculates the feature point actual size of each feature point, and combines the feature point actual size, the feature point ID, the object ID, and the feature amount together with the storage unit 15.
  • the image recognition apparatus 10 performs this process on each learning image.
  • the query image input unit 16 captures a query image from the outside and outputs it to the special object detection unit 12.
  • the query image input unit 16 may capture an image file recorded in the recording device as a query image, or may capture an image acquired through a network as a query image.
  • the special object detection unit 12 performs the same processing as in the learning phase on the image received from the query image input unit 16 and outputs the special object resolution to the actual size calculation unit 14.
  • the special object detection unit 12 calculates the special object resolution for each special object area, and the obtained special object resolution is calculated using the special object area.
  • the actual size is output to the actual size calculator 14 as information indicating the actual size.
  • the special object detection unit 12 outputs a signal indicating that no special object exists to the actual size calculation unit 14.
  • the feature amount extraction unit 13 performs the same processing as that in the learning phase on the query image to calculate the feature amount. Then, the feature amount extraction unit 13 outputs the feature amount, coordinates, and scale of each feature point to the actual size calculation unit 14.
  • the actual size calculation unit 14 performs the same processing as in the learning phase using the special object resolution received from the special object detection unit 12 and the scale received from the feature amount extraction unit 13, and calculates the actual feature size size. To do. Then, the actual size calculation unit 14 outputs the feature amount of each feature point and the feature point actual size size to the feature amount matching unit 17.
  • the feature amount collation unit 17 collates the feature amount of each feature point of the query image received from the actual size calculation unit 14 with the feature amount in the database of the storage unit 15. This feature amount is a feature amount of a feature point detected from each learning image.
  • the verification is performed according to the following procedure.
  • the feature amount matching unit 17 compares the feature point actual size size between the feature point of the query image and all feature points in the database, and the absolute value of the difference between the feature point actual size sizes is within a predetermined threshold value. Are extracted from the database as candidate feature points.
  • the feature point actual size of the feature point of the query image is 0, that is, when the special object detection unit 12 does not detect the special object, the feature amount matching unit 17 performs all of the features in the database.
  • the object ID selection unit 18 selects the object ID having the maximum number of votes from the number of votes for each object ID received from the feature amount matching unit 17 and outputs it to the output unit 19 as a recognition result.
  • the output unit 19 outputs the object ID received from the object ID selection unit 18 to the outside of the image recognition device 10.
  • step S21 the query image input unit 16 inputs a query image.
  • step S22 the special object detection unit 12 detects a special object.
  • step S23 the feature amount extraction unit 13 detects feature points and extracts feature amounts of the feature points.
  • step S24 the actual size calculation unit 14 calculates the actual feature size of each feature point.
  • the feature amount matching unit 17 extracts candidate feature points from the storage unit 15 for each feature point of the query image.
  • the feature amount matching unit 17 obtains a feature point having the smallest distance between the feature amounts from the candidate feature points for each feature point of the query image, and the object ID of the feature point.
  • step S27 the object ID selection unit 18 selects the object ID having the maximum number of votes in step S26 as a recognition result.
  • step S28 the output unit 19 outputs the recognition result.
  • the feature point actual size size is not taken into consideration, so the feature point having the smallest distance is selected from all the feature points in the database. Therefore, a feature point having a completely different feature amount size on the actual subject may be selected and voted for an object ID different from the object ID to be voted.
  • candidate feature points are extracted based on the actual feature point actual size, and the feature point having the smallest distance is selected from the candidate feature points. There is an effect that it becomes difficult to select completely different feature points. As a result, there is an effect that it is difficult for the object ID selection unit 18 to select an incorrect object ID.
  • the feature quantity extraction unit 13 employs a method of determining feature point coordinates by detecting feature points.
  • the present invention is not limited to this, and the feature point coordinates may be determined by other methods.
  • the feature quantity extraction unit 13 may pre-arrange the coordinates of the feature points at grid-like positions and extract the feature quantities at the coordinates.
  • the SIFT feature amount is used as the feature amount.
  • the present invention is not limited to this, and other local feature amounts such as SURF and HOG may be used.
  • the feature amount of each feature point is stored in the storage unit 15 as it is.
  • the present invention is not limited to this, and vector quantized data is recorded as a feature amount using the Bag-of-Features method. May be.
  • clustering is performed by incorporating the actual feature point size together with the feature amount in the Bag-of-Features clustering process.
  • the feature amount matching unit 17 calculates the candidate feature point that minimizes the distance between the feature amounts between the feature points of the query image from the candidate feature points.
  • the candidate feature point whose distance is approximately the minimum may be obtained.
  • the feature amount matching unit 17 can perform an approximate nearest neighbor search by using an ANN (Appearance Nearest Neighbor). Thereby, calculation time can be shortened.
  • ANN Appearance Nearest Neighbor
  • the output unit 19 outputs only the recognition result.
  • the feature point actual size may be output together with the recognition result. This may output the feature point actual size size of all the feature points of the query image, or may output a statistically obtained value such as an average value using the feature point actual size size.
  • erroneous recognition in image recognition can be reduced by considering the size of the feature amount on the actual subject.
  • FIG. 6 is a block diagram illustrating a schematic configuration example of the image recognition apparatus 30 according to the second embodiment of the present invention.
  • the image recognition apparatus 30 according to the present embodiment includes a learning image input unit 11, a special object detection unit 12, a depth structure estimation unit 31 that calculates the composition of an image, and a feature amount extraction unit 13.
  • An actual size calculation unit 32 for calculating the actual size of the feature point a storage unit 15, a query image input unit 16, a feature amount verification unit 17, an object ID selection unit 18, and an output unit 19.
  • the depth structure estimation unit 31 receives the learning image from the special object detection unit 12, and estimates the depth structure of the learning image.
  • the depth structure is a structure composed of depth values indicating the distance between the subject and the camera for each pixel.
  • This depth structure can be estimated using, for example, the technique disclosed in Japanese Patent Application Laid-Open No. 2005-151534.
  • the composition ratio is determined according to the value of the high-frequency component evaluation value of the luminance signal of the non-stereo image input from the high-frequency component evaluation unit at the top of the screen and the high-frequency component evaluation unit at the bottom of the screen.
  • Three types of basic depth models are synthesized according to the ratio. Further, the synthesized basic depth model is superimposed on the R signal of the non-stereo image to obtain final depth estimation data.
  • the depth structure estimation unit 31 outputs the estimated depth structure to the actual size calculation unit 32.
  • the actual size calculation unit 32 first uses the special object resolution received from the special object detection unit 12 and the scale received from the feature amount extraction unit 13 in the same manner as the actual size size calculation unit 14 of the first embodiment. The actual feature point size of each feature point is calculated. Thereafter, the actual size calculation unit 32 corrects the feature point actual size using the depth structure received from the depth structure estimation unit 31. The correction of the actual size of the feature point is performed by multiplying the actual size of the feature point by p / q, where p is the depth value at the center point of the special object region used to calculate the actual size of the feature point and q is the depth value of the feature point. It is better to do it.
  • the size in the captured image that occurs when the distance at which the special object and each feature point are imaged is different. Changes can be corrected.
  • the correction is not limited to this example, and may be performed by other methods such as reducing (qp).
  • step S41 the learning image input unit 11 inputs a learning image and an object ID.
  • step S42 the special object detection unit 12 detects a special object.
  • step S43 the depth structure estimation unit 31 estimates the depth structure.
  • step S44 the feature quantity extraction unit 13 detects feature points and extracts feature quantities of the feature points.
  • step S45 the actual size calculation unit 32 calculates the actual feature size of each feature point, and corrects the actual feature size using the depth structure.
  • the actual size calculation unit 32 stores the corrected feature point actual size, the feature point ID, the object ID, and the feature amount in the storage unit 15 together.
  • the image recognition device 30 performs this processing for each learning image.
  • the depth structure estimation unit 31 receives the query image from the special object detection unit 12, performs the same processing as in the learning phase, and estimates the depth structure of the query image.
  • the depth structure estimation unit 31 outputs the estimated depth structure to the actual size calculation unit 32.
  • the actual size calculation unit 32 uses the special object resolution received from the special object detection unit 12 and the scale received from the feature amount extraction unit 13 to perform the same processing as in the learning phase, and the actual feature size actual size. Is calculated. Thereafter, the actual size calculation unit 32 corrects the actual size of the feature point using the depth structure received from the depth structure estimation unit 31, as in the learning phase. Then, the actual size calculation unit 32 outputs the feature amount of each feature point and the corrected feature point actual size size to the feature amount matching unit 17.
  • step S51 a query image is input to the special object detection unit 12 and the feature amount extraction unit 13 through the query image input unit 16.
  • step S52 the special object detection unit 12 detects a special object.
  • step S53 the depth structure estimation unit 31 estimates the depth structure.
  • step S54 the feature amount extraction unit 13 detects feature points and extracts feature amounts of the feature points.
  • step S55 the actual size calculation unit 32 calculates the actual feature size of each feature point, and corrects the actual feature size using the depth structure.
  • step S56 the feature amount matching unit 17 extracts candidate feature points from the storage unit 15 using the corrected actual feature size for each feature point of the query image.
  • the feature amount matching unit 17 obtains a feature point having the smallest distance between the feature amounts from the candidate feature points for each feature point of the query image, and the object ID of the feature point.
  • step S58 the object ID selection unit 18 selects the object ID having the maximum number of votes in step S57 as the recognition result.
  • the output unit 19 outputs the recognition result.
  • the actual size calculation unit 32 generates an in-captured image that occurs when the distance at which the special object and each feature point are captured is different.
  • the change in size can be corrected. For this reason, erroneous recognition can be further reduced.
  • control blocks of the image recognition devices 10 and 30 (in particular, the special object detection unit 12, the feature amount extraction unit 13, the actual size calculation units 14 and 32, the feature amount comparison unit 17, the output unit 19, and the depth structure estimation unit 31) It may be realized by a logic circuit (hardware) formed in an integrated circuit (IC chip) or the like, or may be realized by software using a CPU (Central Processing Unit).
  • the image recognition device 30 includes a CPU that executes instructions of a program that is software that realizes each function, and a ROM (Read Only Memory) in which the program and various data are recorded so as to be readable by a computer (or CPU).
  • a storage device (these are referred to as “recording media”), a RAM (Random Access Memory) for expanding the program, and the like are provided.
  • the computer (or CPU) reads the program from the recording medium and executes the program, thereby achieving the object of one embodiment of the present invention.
  • a “non-temporary tangible medium” such as a tape, a disk, a card, a semiconductor memory, a programmable logic circuit, or the like can be used.
  • the program may be supplied to the computer via an arbitrary transmission medium (such as a communication network or a broadcast wave) that can transmit the program.
  • an arbitrary transmission medium such as a communication network or a broadcast wave
  • one embodiment of the present invention can also be realized in the form of a data signal embedded in a carrier wave, in which the program is embodied by electronic transmission.
  • the image recognition devices 10 and 30 are image recognition devices 10 and 30 that select an image similar to a query image from each learning image, and are specially selected in advance from the query image and each learning image.
  • a special object detection unit 12 that detects a special object determined as an object, a feature amount of a feature point detected from the query image, and a feature amount of a feature point detected from each learning image.
  • a size calculation unit 14, 32, and a feature amount matching unit 17 that compares the feature amount of the feature point of the query image with the feature amount of the feature point of each learning image. 7, the feature point actual size is compared with the feature quantity of the feature point of the query image that is similar, the feature quantity of the feature point of each learning image.
  • the actual size calculation units 4 and 32 are the number of pixels from a predetermined point in the special object region to each of the feature points. And the special object resolution indicating the actual size per pixel in the special object area calculated by the special object detection unit 12 and the scale of the characteristic points calculated by the feature amount extraction unit 13. Each of the actual size may be calculated.
  • the predetermined point is either the center in the area of the special object or the point in the area of the special object. May be.
  • a reference point that matches the special object can be selected.
  • the image recognition device 30 further includes a depth structure estimation unit 31 that estimates the depth structure of the query image and each learning image in any one of the aspects 1 to 3, and the actual size.
  • the size calculation unit 32 may correct the actual size of the feature points using the depth structure.
  • the image recognition devices 10 and 30 according to the aspect 5 of the present invention include a selection result obtained by selecting an image similar to the query image from the learning images, and the actual feature point actual size. You may further provide the output part 19 which outputs size.
  • the size of the object included in the image on the subject can be known.
  • An image recognition method is an image recognition method for selecting an image similar to a query image from each learning image, and is previously determined as a special object from the query image and each learning image.
  • a special object detecting step of detecting a special object, a feature amount of a feature point detected from the query image, a feature amount extracting step of extracting a feature amount of a feature point detected from each learning image, and the query image A size calculation step of calculating a feature point actual size size corresponding to the special object and feature point detected from each of the learning images, and a feature point actual size size corresponding to the special object and feature point detected from the respective learning images;
  • a feature amount matching step that compares the feature amount of the feature point with the feature amount of the feature point of each learning image, and in the feature amount matching step, And the feature quantity of the feature point of the query image feature points actual size are similar, comparing the feature quantity of the feature point of each learning image.
  • the image recognition apparatuses 10 and 30 according to each aspect of the present invention may be realized by a computer.
  • the image recognition is performed by causing the computer to operate as each unit (software element) included in the image recognition apparatus 30.
  • An image recognition program of the image recognition apparatuses 10 and 30 that realize the apparatus 30 by a computer and a computer-readable recording medium on which the image recognition program is recorded also fall within the category of one aspect of the present invention.
  • Image recognition device Special object detector (detector) 13 Feature extraction unit (extraction unit) 14, 32 Actual size calculation unit (image recognition unit) 17 Feature amount matching unit (image recognition unit) 19 Output unit 31 Depth structure estimation unit

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Image Analysis (AREA)

Abstract

L'invention concerne un dispositif de reconnaissance d'image (10), comprenant : une unité de détection (12) qui détecte, à partir d'une image d'interrogation, un objet particulier prédéterminé ayant une taille standard établie, et détecte des informations qui indiquent la taille exacte d'une région dans l'image d'interrogation qui correspond à l'objet particulier ; une unité d'extraction (13) qui extrait un point caractéristique de l'image d'interrogation à partir de l'image d'interrogation ; et des unités de reconnaissance d'image (14, 17, 18) qui reconnaissent l'objet qui est inclus dans l'image d'interrogation en se basant sur les informations qui indiquent la taille réelle de la région dans l'image d'interrogation qui correspond à l'objet particulier et au point caractéristique de l'image d'interrogation.
PCT/JP2017/015390 2016-04-14 2017-04-14 Dispositif, procédé et programme de reconnaissance d'image Ceased WO2017179728A1 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2016081433 2016-04-14
JP2016-081433 2016-04-14

Publications (1)

Publication Number Publication Date
WO2017179728A1 true WO2017179728A1 (fr) 2017-10-19

Family

ID=60041780

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2017/015390 Ceased WO2017179728A1 (fr) 2016-04-14 2017-04-14 Dispositif, procédé et programme de reconnaissance d'image

Country Status (1)

Country Link
WO (1) WO2017179728A1 (fr)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2020085805A (ja) * 2018-11-30 2020-06-04 Arithmer株式会社 寸法データ算出装置、プログラム、方法、製品製造装置、及び製品製造システム
US11922649B2 (en) 2018-11-30 2024-03-05 Arithmer Inc. Measurement data calculation apparatus, product manufacturing apparatus, information processing apparatus, silhouette image generating apparatus, and terminal apparatus

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2002222423A (ja) * 2001-01-25 2002-08-09 Fujitsu Ltd 物体認識方法および装置
JP2010271861A (ja) * 2009-05-20 2010-12-02 Canon Inc オブジェクト識別装置及びオブジェクト識別方法
WO2015099016A1 (fr) * 2013-12-26 2015-07-02 日本電気株式会社 Dispositif de traitement d'image, procédé d'identification de sujet et programme
JP2015191626A (ja) * 2014-03-28 2015-11-02 富士重工業株式会社 車外環境認識装置

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2002222423A (ja) * 2001-01-25 2002-08-09 Fujitsu Ltd 物体認識方法および装置
JP2010271861A (ja) * 2009-05-20 2010-12-02 Canon Inc オブジェクト識別装置及びオブジェクト識別方法
WO2015099016A1 (fr) * 2013-12-26 2015-07-02 日本電気株式会社 Dispositif de traitement d'image, procédé d'identification de sujet et programme
JP2015191626A (ja) * 2014-03-28 2015-11-02 富士重工業株式会社 車外環境認識装置

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2020085805A (ja) * 2018-11-30 2020-06-04 Arithmer株式会社 寸法データ算出装置、プログラム、方法、製品製造装置、及び製品製造システム
US11922649B2 (en) 2018-11-30 2024-03-05 Arithmer Inc. Measurement data calculation apparatus, product manufacturing apparatus, information processing apparatus, silhouette image generating apparatus, and terminal apparatus

Similar Documents

Publication Publication Date Title
Aldoma et al. Multimodal cue integration through hypotheses verification for rgb-d object recognition and 6dof pose estimation
JP6544900B2 (ja) オブジェクト識別装置、オブジェクト識別方法及びプログラム
CN101911116B (zh) 不变视觉场景和对象识别
KR101753360B1 (ko) 시점 변화에 강인한 특징점 정합 방법
US10832030B2 (en) Method and apparatus of selecting candidate fingerprint image for fingerprint recognition
CN108875542B (zh) 一种人脸识别方法、装置、系统及计算机存储介质
CN104537376B (zh) 一种识别台标的方法及相关设备、系统
CN113297963A (zh) 多人姿态的估计方法、装置、电子设备以及可读存储介质
KR20190018274A (ko) 이미지에 포함된 특징 포인트의 시간 또는 공간의 움직임에 기초하여 이미지에 존재하는 피사체를 인식하는 장치 및 방법
US10127681B2 (en) Systems and methods for point-based image alignment
JP5936561B2 (ja) 画像における外観及びコンテキストに基づく物体分類
JP2009129237A (ja) 画像処理装置及びその方法
KR101789979B1 (ko) 그래디언트 방향 정보에 기반한 하우스도르프 거리 계산 방법
JP2015007919A (ja) 異なる視点の画像間で高精度な幾何検証を実現するプログラム、装置及び方法
JP2013218605A (ja) 画像認識装置、画像認識方法及びプログラム
KR101521136B1 (ko) 얼굴 인식 방법 및 얼굴 인식 장치
WO2017179728A1 (fr) Dispositif, procédé et programme de reconnaissance d'image
US20030044067A1 (en) Apparatus and methods for pattern recognition based on transform aggregation
US9569681B2 (en) Methods and systems for efficient image cropping and analysis
JP5755516B2 (ja) 物体形状推定装置
JP6393495B2 (ja) 画像処理装置および物体認識方法
JP6717769B2 (ja) 情報処理装置及びプログラム
CN110383283A (zh) 用于分类指纹的方法和设备
Garcia et al. Automatic detection of heads in colored images
Wen et al. An improved SIFT operator-based image registration using cross-correlation information

Legal Events

Date Code Title Description
NENP Non-entry into the national phase

Ref country code: DE

121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 17782541

Country of ref document: EP

Kind code of ref document: A1

122 Ep: pct application non-entry in european phase

Ref document number: 17782541

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: JP