JP2022123734A

JP2022123734A - Image judgment method, image judgment system and image judgment program

Info

Publication number: JP2022123734A
Application number: JP2021021239A
Authority: JP
Inventors: 健太郎斉藤; Kentaro Saito; 大晃竹田; Hiroaki Takeda; 慧青柳; Kei Aoyagi
Original assignee: YE Digital Co Ltd
Current assignee: YE Digital Co Ltd
Priority date: 2021-02-12
Filing date: 2021-02-12
Publication date: 2022-08-24

Abstract

【課題】効率よく判定精度の向上を図る画像判定方法、画像判定システムおよび画像判定プログラムを提供する。【解決手段】画像判定方法は、画像判定装置が、製造ラインにおける製品の画像を取得し、深層学習ネットワークである判定モデルを用いて分類判定する判定工程Ｓ１０７と、学習装置が、画像が判定モデルへ入力された場合の画像に対する判定モデルの着目点を抽出して可視化することによって判定モデルの判定結果を解析する第１の解析工程Ｓ１１１と、画像が判定モデルへ入力された場合の高次元空間における画像の特徴量を次元圧縮による低次元表現へ変換して可視化することによって判定モデルの判定結果を解析する第２の解析工程Ｓ１１２と、第１の解析工程Ｓ１１１および第２の解析工程Ｓ１１２における解析結果に基づいて分類された前記画像に基づいて判定モデルを学習する学習工程Ｓ１０４と、を含む。【選択図】図１２[Problem] To provide an image judgment method, an image judgment system, and an image judgment program for efficiently improving judgment accuracy. [Solution] The image judgment method includes a judgment step S107 in which an image judgment device acquires an image of a product on a production line and performs classification judgment using a judgment model that is a deep learning network, a first analysis step S111 in which a learning device analyzes the judgment result of the judgment model by extracting and visualizing a focus point of the judgment model for an image when the image is input to the judgment model, a second analysis step S112 in which a feature amount of the image in a high-dimensional space when the image is input to the judgment model is converted into a low-dimensional representation by dimensional compression and visualized, and a learning step S104 in which a judgment model is learned based on the image classified based on the analysis results in the first analysis step S111 and the second analysis step S112. [Selected Figure] Fig. 12

Description

開示の実施形態は、画像判定方法、画像判定システムおよび画像判定プログラムに関する。 The disclosed embodiments relate to an image determination method, an image determination system, and an image determination program.

従来、ＡＩ（Artificial Intelligence）の分野において、ＣＮＮ（Convolutional Neural Network）等の深層学習ネットワークを判定モデルとして用いた画像判定により、画像中の物体を分類する技術が知られている（たとえば、特許文献１参照）。 Conventionally, in the field of AI (Artificial Intelligence), there has been known a technique for classifying objects in an image by image determination using a deep learning network such as a CNN (Convolutional Neural Network) as a determination model (for example, Patent Document 1).

こうした技術を利用することにより、たとえば製造ラインで製造された製品の画像から、かかる製品が良品であるか不良品であるか、また不良品であればどのような不良があるのかを分類することができる。 By using such technology, for example, from the image of a product manufactured on a production line, it is possible to classify whether the product is good or defective, and if it is a defective product, what kind of defect it has. can be done.

特開２０１８－０２２４８４号公報JP 2018-022484 A

しかしながら、上述した従来技術には、効率よく判定精度の向上を図るうえで、さらなる改善の余地がある。 However, the conventional technology described above has room for further improvement in terms of efficiently improving the determination accuracy.

たとえば、深層学習ネットワークは、言わば一種の関数であり、ブラックボックスである。このため、従来技術では、誤判定が生じた場合などに、判定精度の向上のために学習用画像を再分類して判定モデルの再学習を行いたくとも、そもそもの判定根拠が不明確なため、適切に学習用画像を再分類することが難しかった。 For example, a deep learning network is a kind of function, a black box. For this reason, in the conventional technology, even if an attempt is made to reclassify the learning images and relearn the determination model in order to improve the determination accuracy when an erroneous determination occurs, the basis for the determination is unclear in the first place. , it was difficult to properly reclassify the training images.

また、判定精度の向上のためには、大量の学習用画像を用いて判定モデルを学習し、かかる判定モデルを大量の検証用画像を用いて検証することが望ましいが、従来技術では、その検証の多くを人が目視で行う必要があり、煩雑であった。 In addition, in order to improve determination accuracy, it is desirable to learn a determination model using a large number of learning images and to verify the determination model using a large amount of verification images. Most of the operations had to be performed visually by a person, which was complicated.

実施形態の一態様は、上記に鑑みてなされたものであって、効率よく判定精度の向上を図ることができる画像判定方法、画像判定システムおよび画像判定プログラムを提供することを目的とする。 One aspect of the embodiments has been made in view of the above, and an object thereof is to provide an image determination method, an image determination system, and an image determination program capable of efficiently improving determination accuracy.

実施形態の一態様に係る画像判定方法は、判定工程と、第１の解析工程と、第２の解析工程と、学習工程とを含む。前記判定工程は、製造ラインにおける製品の画像を取得し、深層学習ネットワークである判定モデルを用いて分類判定する。前記第１の解析工程は、前記画像が前記判定モデルへ入力された場合の前記画像に対する前記判定モデルの着目点を抽出して可視化することによって前記判定モデルの判定結果を解析する。前記第２の解析工程は、前記画像が前記判定モデルへ入力された場合の高次元空間における前記画像の特徴量を次元圧縮による低次元表現へ変換して可視化することによって前記判定モデルの判定結果を解析する。前記学習工程は、前記第１の解析工程および前記第２の解析工程における解析結果に基づいて分類された前記画像に基づいて前記判定モデルを学習する。 An image determination method according to an aspect of an embodiment includes a determination process, a first analysis process, a second analysis process, and a learning process. In the determination step, an image of the product on the production line is acquired, and a classification determination is performed using a determination model that is a deep learning network. The first analysis step analyzes the determination result of the determination model by extracting and visualizing the points of interest of the determination model with respect to the image when the image is input to the determination model. In the second analysis step, when the image is input to the judgment model, the feature amount of the image in a high-dimensional space is converted into a low-dimensional representation by dimensional compression and visualized, thereby making the judgment result of the judgment model. to parse The learning step learns the determination model based on the images classified based on the analysis results in the first analysis step and the second analysis step.

実施形態の一態様によれば、効率よく判定精度の向上を図ることができる。 According to one aspect of the embodiment, it is possible to efficiently improve the determination accuracy.

図１は、実施形態に係る画像判定方法の概要説明図（その１）である。FIG. 1 is a schematic explanatory diagram (Part 1) of an image determination method according to an embodiment. 図２は、実施形態に係る画像判定方法の概要説明図（その２）である。FIG. 2 is a schematic explanatory diagram (Part 2) of the image determination method according to the embodiment. 図３は、実施形態に係る学習装置のブロック図である。FIG. 3 is a block diagram of the learning device according to the embodiment. 図４は、解析部のブロック図である。FIG. 4 is a block diagram of the analysis unit. 図５は、着目点抽出部による可視化の具体例の説明図（その１）である。FIG. 5 is an explanatory diagram (Part 1) of a specific example of visualization by the point-of-interest extraction unit. 図６は、着目点抽出部による可視化の具体例の説明図（その２）である。FIG. 6 is an explanatory diagram (part 2) of a specific example of visualization by the point-of-interest extraction unit. 図７は、次元圧縮部による可視化の具体例の説明図（その１）である。FIG. 7 is an explanatory diagram (part 1) of a specific example of visualization by the dimension compression unit. 図８は、次元圧縮部による可視化の具体例の説明図（その２）である。FIG. 8 is an explanatory diagram (part 2) of a specific example of visualization by the dimension compression unit. 図９は、次元圧縮部による可視化の具体例の説明図（その３）である。FIG. 9 is an explanatory diagram (part 3) of a specific example of visualization by the dimension compression unit. 図１０は、実施形態に係る画像判定装置のブロック図である。FIG. 10 is a block diagram of the image determination device according to the embodiment. 図１１は、実施形態に係るプロジェクタ制御装置のブロック図である。FIG. 11 is a block diagram of the projector control device according to the embodiment. 図１２は、実施形態に係る画像判定装置１００が実行する処理手順を示す処理シーケンスである。FIG. 12 is a processing sequence showing a processing procedure executed by the image determination device 100 according to the embodiment. 図１３は、学習装置の機能を実現するコンピュータの一例を示すハードウェア構成図である。FIG. 13 is a hardware configuration diagram showing an example of a computer that implements the functions of the learning device.

以下、添付図面を参照して、本願の開示する画像判定方法、画像判定システムおよび画像判定プログラムの実施形態を詳細に説明する。なお、以下に示す実施形態によりこの発明が限定されるものではない。 Hereinafter, embodiments of an image determination method, an image determination system, and an image determination program disclosed in the present application will be described in detail with reference to the accompanying drawings. In addition, this invention is not limited by embodiment shown below.

まず、実施形態に係る画像判定方法の概要について、図１および図２を参照して説明する。図１は、実施形態に係る画像判定方法の概要説明図（その１）である。また、図２は、実施形態に係る画像判定方法の概要説明図（その２）である。 First, an outline of an image determination method according to an embodiment will be described with reference to FIGS. 1 and 2. FIG. FIG. 1 is a schematic explanatory diagram (Part 1) of an image determination method according to an embodiment. FIG. 2 is a schematic explanatory diagram (part 2) of the image determination method according to the embodiment.

なお、以下では、製造ラインにおいて製品として丸形のクッキーが製造され、かかるクッキーの出荷前検査等において、欠けや焦げ、割れなどのある不良品を検知する場合を例に挙げて説明を行う。また、以下では、画像判定用の判定モデルが、深層学習ネットワークであるものとする。 In the following description, round cookies are manufactured as a product on a manufacturing line, and defective products with chips, scorches, cracks, etc. are detected in pre-shipment inspections of such cookies. In the following description, it is assumed that the determination model for image determination is a deep learning network.

図１に示すように、実施形態に係る画像判定システム１は、学習装置１０と、画像判定装置１００と、プロジェクタ制御装置２００とを含む。 As shown in FIG. 1, the image determination system 1 according to the embodiment includes a learning device 10, an image determination device 100, and a projector control device 200. FIG.

画像判定装置１００およびプロジェクタ制御装置２００は、いわゆるエッジコンピューティングにおけるエッジプラットフォームに相当する装置であり、カメラ１５０、コンベア装置３００、プロジェクタ４００（図２参照）等を含む製造ラインに設けられる。 The image determination device 100 and the projector control device 200 are devices corresponding to an edge platform in so-called edge computing, and are provided in a production line including a camera 150, a conveyor device 300, a projector 400 (see FIG. 2), and the like.

学習装置１０は、イントラネットやインターネット、携帯電話回線網等のネットワークＮを介して製造ラインと通信可能に設けられる。学習装置１０は、主たる機能として、たとえば製造ラインから学習用画像を収集し、収集した学習用画像を分類して学習用データセットを生成し、かかるデータセットを用いた深層学習により判定モデルを学習する（機能Ｆ１）。また、学習装置１０は、ネットワークＮを介し、学習した判定モデルを画像判定装置１００へ配信する。 The learning device 10 is provided so as to be able to communicate with the manufacturing line via a network N such as an intranet, the Internet, or a mobile phone network. The main functions of the learning device 10 are, for example, collecting learning images from a manufacturing line, classifying the collected learning images to generate a learning data set, and learning a judgment model by deep learning using the data set. (function F1). Also, the learning device 10 distributes the learned determination model to the image determination device 100 via the network N. FIG.

画像判定装置１００は、コンベア装置３００を流れるクッキーＰ１，Ｐ２，Ｐ３…の画像を取得し、学習装置１０によって学習された判定モデルを用いて画像判定を行い、学習装置１０およびプロジェクタ制御装置２００に対し、判定結果を出力する（機能Ｆ２）。 Image determination device 100 acquires images of cookies P1, P2, P3, . In response, the determination result is output (function F2).

たとえば、図１には、画像判定装置１００が画像判定により、クッキーＰ１は「欠け」のある不良品であり、クッキーＰ２は「焦げ」のある不良品であると判定した例を示している。なお、判定結果は少なくとも、判定された画像の分類クラスおよびそのスコア（類似度、確度等）を含む。 For example, FIG. 1 shows an example in which the image determination apparatus 100 determines that the cookie P1 is defective with "missing" and the cookie P2 is defective with "burnt" through image determination. Note that the determination result includes at least the determined classification class of the image and its score (similarity, accuracy, etc.).

また、学習装置１０に対する判定結果には、実際に判定された画像が学習用画像として含まれる。学習装置１０は、かかる判定結果を、たとえばオペレータ（「ユーザ」の一例に相当）等の人手を介して検証し、誤判定等があれば、判定精度の向上のために学習用画像を再分類して判定モデルを再学習する。 In addition, the determination result for the learning device 10 includes the actually determined image as the learning image. The learning device 10 verifies the determination results manually, for example, by an operator (equivalent to an example of a “user”). to relearn the decision model.

このようなフィードバックを繰り返すことにより、画像判定システム１は、判定モデルの判定精度を向上させることができる。 By repeating such feedback, the image determination system 1 can improve the determination accuracy of the determination model.

ところで、既に述べたが、深層学習ネットワークは、言わば一種の関数であり、ブラックボックスである。このため、従来は、誤判定が生じた場合などに、学習用画像を再分類して判定モデルの再学習を行いたくとも、そもそもの判定根拠が不明確なため、適切に学習用画像を再分類することが難しかった。 By the way, as already mentioned, a deep learning network is, so to speak, a kind of function, a black box. For this reason, conventionally, when an erroneous judgment occurs, even if it is desired to reclassify the training images and re-learn the judgment model, the basis for the judgment is unclear in the first place. It was difficult to categorize.

また、判定精度の向上のためには、大量の学習用画像を用いて判定モデルを学習し、かかる判定モデルを大量の検証用画像を用いて検証することが望ましいが、従来は、その検証の多くをオペレータ等が目視で行う必要があり、煩雑であった。 In addition, in order to improve the accuracy of determination, it is desirable to learn a determination model using a large number of training images and to verify the determination model using a large amount of verification images. Many of them had to be visually checked by an operator, which was troublesome.

そこで、実施形態に係る画像判定方法では、製造ラインにおける製品の画像を取得し、深層学習ネットワークである判定モデルを用いて分類判定し、上記画像が判定モデルへ入力された場合の上記画像に対する判定モデルの着目点を抽出して可視化することによって判定モデルの判定結果を解析し、上記画像が判定モデルへ入力された場合の高次元空間における上記画像の特徴量を次元圧縮による低次元表現へ変換して可視化することによって判定モデルの判定結果を解析し、着目点抽出および次元圧縮による解析結果に基づいて分類された上記画像に基づいて判定モデルを学習することとした。 Therefore, in the image determination method according to the embodiment, an image of a product in a manufacturing line is acquired, a classification determination is performed using a determination model that is a deep learning network, and the image is determined when the image is input to the determination model. Analyzing the judgment result of the judgment model by extracting and visualizing the points of interest of the model, and converting the feature amount of the image in the high-dimensional space when the image is input to the judgment model into a low-dimensional representation by dimensional compression. The judgment model is analyzed by visualizing it as a model, and the judgment model is learned based on the images classified based on the analysis results obtained by extracting the point of interest and dimensionality reduction.

具体的には、図１に示すように、実施形態に係る画像判定方法では、学習装置１０が、２つの手法により判定モデルの判定結果を解析する（ステップＳ１）。第１の手法では、学習装置１０は、勾配荷重クラス活性化マッピング手法（Ｇｒａｄ－ＣＡＭ：Gradient-weighted Class Activation Mapping）を用いた「着目点抽出」により、判定モデルによる判定根拠を可視化する。 Specifically, as shown in FIG. 1, in the image determination method according to the embodiment, the learning device 10 analyzes the determination result of the determination model using two methods (step S1). In the first technique, the learning device 10 visualizes the basis of judgment by the judgment model by "focus point extraction" using a gradient-weighted class activation mapping technique (Grad-CAM).

これにより、オペレータは、判定モデルが「どこを見て分類（判定）したか」を一目で把握できるようになるため、たとえば誤判定している場合に、対象画像を適切に分類し直すことが可能となる。なお、かかる第１の手法による可視化の具体例については、図５および図６を用いた説明で後述する。 As a result, the operator can grasp at a glance "where the judgment model was classified (judgment)". It becomes possible. A specific example of visualization by the first method will be described later with reference to FIGS. 5 and 6. FIG.

また、第２の手法では、学習装置１０は、ＵＭＡＰ（Uniform Manifold Approximation and Projection）を用いた「次元圧縮」（次元削減とも言う）により、判定結果をより見やすい形で可視化する。 In the second method, the learning device 10 visualizes the determination result in a more easily viewable form by “dimensionality reduction” (also referred to as dimensionality reduction) using UMAP (Uniform Manifold Approximation and Projection).

ＵＭＡＰは、機械学習による非線形次元圧縮手法であり、リーマン幾何学と代数トポロジーに基づき、高次元空間のデータ構造を保ち、トポロジー間のクロス・エントロピーを最小にしながら低次元のデータに変換する。すなわち、実施形態に係る画像判定方法では、かかるＵＭＡＰを用いて、画像が判定モデルへ入力された場合の高次元空間における画像の特徴量を次元圧縮による低次元表現へ変換して可視化する。 UMAP is a non-linear dimensionality compression technique based on machine learning, based on Riemannian geometry and algebraic topology, which preserves the data structure of high-dimensional space and converts it to low-dimensional data while minimizing cross-entropy between topologies. That is, in the image determination method according to the embodiment, UMAP is used to convert the feature amount of the image in the high-dimensional space when the image is input to the determination model into a low-dimensional expression by dimensional compression and visualize it.

このため、第２の手法によれば、低次元の埋め込み空間に、判定結果の分布をよりバラツキの少ない形で明示することが可能となり、誤判定している画像をオペレータが一目で分かるように可視化することが可能となる。かかる第２の手法による可視化の具体例については、図７～図９を用いた説明で後述する。 For this reason, according to the second method, it is possible to express the distribution of judgment results in a low-dimensional embedding space in a form with less variation. Visualization becomes possible. A specific example of visualization by the second method will be described later with reference to FIGS. 7 to 9. FIG.

そして、学習装置１０は、かかる２つの手法による解析結果に基づき、学習用画像を再分類して再学習を行い（ステップＳ２）、再学習した判定モデルを画像判定装置１００へ配信する。そして、画像判定装置１００は、再学習された判定モデルを用いて、以降の画像判定を行うこととなる。 Then, the learning device 10 reclassifies and re-learns the learning images based on the analysis results obtained by these two methods (step S2), and delivers the re-learned determination model to the image determination device 100. FIG. Then, the image determination apparatus 100 performs subsequent image determination using the re-learned determination model.

したがって、実施形態に係る画像判定方法によれば、効率よく判定精度の向上を図ることができる。 Therefore, according to the image determination method according to the embodiment, it is possible to efficiently improve the determination accuracy.

一方、プロジェクタ制御装置２００は、製造ラインに設けられたプロジェクタ４００を制御する装置である。具体的には、図２に示すように、実施形態に係る画像判定方法では、プロジェクタ制御装置２００は、画像判定装置１００の判定結果に応じたプロジェクタ投影を行う。 On the other hand, the projector control device 200 is a device that controls the projector 400 provided in the manufacturing line. Specifically, as shown in FIG. 2 , in the image determination method according to the embodiment, the projector control device 200 performs projector projection according to the determination result of the image determination device 100 .

より具体的には、プロジェクタ制御装置２００は、コンベア装置３００を流れるクッキーＰ１，Ｐ２，Ｐ３…に対し、プロジェクタ４００によりマーカーを投影させる（ステップＳ３）。 More specifically, projector control device 200 causes projector 400 to project markers onto cookies P1, P2, P3, . . . flowing on conveyor device 300 (step S3).

このとき、プロジェクタ制御装置２００は、コンベア装置３００のコンベアの搬送速度と同じ速度でマーカーをスクロールさせる（ステップＳ３１）。言い換えれば、プロジェクタ制御装置２００は、マーカーが対象のクッキーＰ１，Ｐ２，Ｐ３…をトラッキングするように、プロジェクタ４００によりマーカーを投影させる。 At this time, the projector control device 200 scrolls the marker at the same speed as the conveying speed of the conveyor of the conveyor device 300 (step S31). In other words, projector controller 200 causes projector 400 to project a marker such that the marker tracks target cookies P1, P2, P3, .

また、プロジェクタ制御装置２００は、分類クラスや取るべき処置に応じて、マーカーの色や形を変更させる（ステップＳ３２）。たとえば、プロジェクタ制御装置２００は、分類クラスが「欠け」のクッキーＰ１と、「焦げ」のクッキーＰ２とで、マーカーの色や形を変更させる。 Also, the projector control device 200 changes the color and shape of the marker according to the classification class and the action to be taken (step S32). For example, the projector control device 200 changes the color and shape of the marker between the cookie P1 with the classification class of "missing" and the cookie P2 with the classification class of "burnt".

また、たとえば、プロジェクタ制御装置２００は、「コンベアから除去すべき」や、「生産へフィードバックすべき」といった取るべき処置に応じて、マーカーの色や形を変更させる。また、プロジェクタ制御装置２００は、判定結果に含まれる上述のスコアに応じて、「ＡＩが判定に悩んだもの」、すなわちスコアがグレーゾーンのものに、それと分かるマーカーを投影し、ライン担当者の目視によるチェックを促すようにしてもよい。 Also, for example, the projector control device 200 changes the color or shape of the marker according to the action to be taken, such as "should be removed from the conveyor" or "should be fed back to production." In addition, according to the scores included in the determination result, the projector control device 200 projects a recognizable marker on "what the AI struggled to determine", that is, on the score in the gray zone, and the line manager A visual check may be prompted.

これにより、実施形態に係る画像判定方法によれば、判定モデルの高い判定精度に応じて、その結果を適切に製造ラインに反映させることが可能となる。なお、プロジェクタ制御装置２００は、「判定結果反映装置」の一例である。したがって、判定結果反映装置は、製造ラインに設けられ、画像判定装置１００の判定結果を反映すべき他の装置であってもよい。たとえば、判定結果反映装置は、画像判定装置１００の判定結果に応じて火加減を調節するクッキーのベイク装置等であってもよい。 Thus, according to the image determination method according to the embodiment, it is possible to appropriately reflect the result on the production line according to the high determination accuracy of the determination model. Note that the projector control device 200 is an example of a “determination result reflecting device”. Therefore, the determination result reflection device may be another device that is provided in the manufacturing line and that should reflect the determination result of the image determination device 100 . For example, the determination result reflection device may be a cookie baking device or the like that adjusts the heat level according to the determination result of the image determination device 100 .

以下、上述した実施形態に係る画像判定方法を適用した画像判定システム１の構成について、さらに具体的に説明する。 The configuration of the image determination system 1 to which the image determination method according to the above-described embodiment is applied will now be described more specifically.

図３は、実施形態に係る学習装置１０のブロック図である。また、図４は、解析部１３ｄのブロック図である。なお、図３、図４、および、後に示す図１０，１１では、本実施形態の特徴を説明するために必要な構成要素を機能ブロックで表しており、一般的な構成要素についての記載を省略している。 FIG. 3 is a block diagram of the learning device 10 according to the embodiment. Moreover, FIG. 4 is a block diagram of the analysis part 13d. 3, 4, and FIGS. 10 and 11 shown later, constituent elements necessary for explaining the features of this embodiment are represented by functional blocks, and descriptions of general constituent elements are omitted. is doing.

換言すれば、図３、図４、図１０および図１１に図示される各構成要素は機能概念的なものであり、必ずしも物理的に図示の如く構成されていることを要しない。たとえば、各機能ブロックの分散・統合の具体的形態は図示のものに限られず、その全部または一部を、各種の負荷や使用状況等に応じて、任意の単位で機能的または物理的に分散・統合して構成することが可能である。 In other words, each component illustrated in FIGS. 3, 4, 10 and 11 is functionally conceptual and does not necessarily need to be physically configured as illustrated. For example, the specific forms of distribution and integration of each functional block are not limited to those shown in the figure, and all or part of them can be functionally or physically distributed in arbitrary units according to various loads and usage conditions.・It is possible to integrate and configure.

なお、図３、図４、図１０および図１１を用いた説明では、これまでに既に述べた構成要素については、説明を簡略するか、省略する場合がある。 In the explanation using FIGS. 3, 4, 10 and 11, the explanation of the components already described may be simplified or omitted in some cases.

図３に示すように、実施形態に係る学習装置１０は、通信部１１と、記憶部１２と、制御部１３とを備える。また、学習装置１０は、操作部３と、表示部５とが接続される。操作部３は、キーボードやマウス、タッチパネル等によって実現される。表示部５は、ディスプレイ等によって実現される。 As shown in FIG. 3 , the learning device 10 according to the embodiment includes a communication section 11 , a storage section 12 and a control section 13 . Further, the learning device 10 is connected to the operation unit 3 and the display unit 5 . The operation unit 3 is implemented by a keyboard, mouse, touch panel, or the like. The display unit 5 is realized by a display or the like.

通信部１１は、たとえば、ＮＩＣ（Network Interface Card）等によって実現される。通信部１１は、ネットワークＮに対し有線または無線で接続され、画像判定装置１００を含む製造ラインとの間で情報の送受信を行う。 The communication unit 11 is implemented by, for example, a NIC (Network Interface Card) or the like. The communication unit 11 is wired or wirelessly connected to the network N, and transmits and receives information to and from the manufacturing line including the image determination device 100 .

記憶部１２は、たとえば、ＲＡＭ（Random Access Memory）、ＲＯＭ（Read Only Memory）、フラッシュメモリ（Flash Memory）等の半導体メモリ素子、または、ハードディスク、光ディスク等の記憶装置によって実現される。図３に示す例では、記憶部１２は、収集情報データベース（ＤＢ）１２ａと、学習用データセット１２ｂと、判定モデル１２ｃとを記憶する。 The storage unit 12 is realized by, for example, a semiconductor memory device such as a RAM (Random Access Memory), a ROM (Read Only Memory), a flash memory, or a storage device such as a hard disk or an optical disk. In the example shown in FIG. 3, the storage unit 12 stores a collected information database (DB) 12a, a learning data set 12b, and a determination model 12c.

収集情報データベース１２ａは、通信部１１を介し、後述する収集部１３ａによって収集される判定結果を含む各種のデータが格納されるデータベースである。学習用データセット１２ｂは、収集情報データベース１２ａへ格納された判定結果、および、操作部３を介したオペレータの操作等に基づき、後述する分類部１３ｂによって分類クラスごとに分類された学習用画像のデータセットである。判定モデル１２ｃは、後述する学習部１３ｃによって学習される深層学習ネットワークである。 The collected information database 12a is a database in which various data including determination results collected by a collection unit 13a (to be described later) via the communication unit 11 are stored. The learning data set 12b is a collection of learning images classified into classification classes by the classification unit 13b, which will be described later, based on the determination results stored in the collected information database 12a and the operator's operation via the operation unit 3. is a dataset. The judgment model 12c is a deep learning network learned by a learning unit 13c, which will be described later.

制御部１３は、コントローラ（controller）であり、たとえば、ＣＰＵ（Central Processing Unit）やＭＰＵ（Micro Processing Unit）等によって、記憶部１２に記憶されている図示略の各種プログラムがＲＡＭを作業領域として実行されることにより実現される。また、制御部１３は、たとえば、ＡＳＩＣ（Application Specific Integrated Circuit）やＦＰＧＡ（Field Programmable Gate Array）等の集積回路により実現することができる。 The control unit 13 is a controller, and various programs (not shown) stored in the storage unit 12 are executed by a CPU (Central Processing Unit), an MPU (Micro Processing Unit), or the like, using the RAM as a work area. It is realized by being Also, the control unit 13 can be implemented by an integrated circuit such as an ASIC (Application Specific Integrated Circuit) or an FPGA (Field Programmable Gate Array).

制御部１３は、収集部１３ａと、分類部１３ｂと、学習部１３ｃと、解析部１３ｄと、表示制御部１３ｅと、配信部１３ｆとを有し、以下に説明する情報処理の機能や作用を実現または実行する。 The control unit 13 includes a collection unit 13a, a classification unit 13b, a learning unit 13c, an analysis unit 13d, a display control unit 13e, and a distribution unit 13f. Realize or carry out.

収集部１３ａは、通信部１１を介し、画像判定装置１００からの判定結果を収集する。また、収集部１３ａは、収集した判定結果を収集情報データベース１２ａへ格納する。 The collection unit 13 a collects determination results from the image determination device 100 via the communication unit 11 . The collecting unit 13a also stores the collected determination results in the collected information database 12a.

分類部１３ｂは、収集情報データベース１２ａへ格納された判定結果、および、操作部３を介したオペレータの操作等に基づき、学習用画像を「正常」、「欠け」、「焦げ」、「割れ」といった分類クラスごとに分類し、学習用データセット１２ｂを生成する。学習部１３ｃは、学習用データセット１２ｂに基づき、判定モデル１２ｃを学習する。 Based on the determination result stored in the collected information database 12a and the operator's operation via the operation unit 3, the classification unit 13b classifies the learning image into "normal", "missing", "burnt", and "cracked". are classified for each classification class, and a learning data set 12b is generated. The learning unit 13c learns the judgment model 12c based on the learning data set 12b.

解析部１３ｄは、収集情報データベース１２ａへ格納された判定結果、および、判定モデル１２ｃに基づき、上述した第１の手法および第２の手法によって判定結果を解析する。図４に示すように、解析部１３ｄは、着目点抽出部１３ｄａと、次元圧縮部１３ｄｂとを有する。 The analysis unit 13d analyzes the determination result by the above-described first method and second method based on the determination result stored in the collected information database 12a and the determination model 12c. As shown in FIG. 4, the analysis unit 13d has a point-of-interest extraction unit 13da and a dimension compression unit 13db.

着目点抽出部１３ｄａは、上述したＧｒａｄ－ＣＡＭを用いた着目点抽出により、学習用画像を判定モデル１２ｃへ入力したときの判定モデル１２ｃによる判定根拠を可視化する。 The point-of-interest extraction unit 13da extracts the point of interest using Grad-CAM as described above, and visualizes the basis for determination by the determination model 12c when the learning image is input to the determination model 12c.

次元圧縮部１３ｄｂは、上述したＵＭＡＰを用いた次元圧縮により、低次元の埋め込み空間に、判定結果の分布を可視化する。なお、ＵＭＡＰは、次元圧縮手法の一例であり、他の手法を用いることを限定するものではない。たとえば、主成分分析や、ｔ分布型確率的近傍埋め込み法（ｔ－ＳＮＥ：t-distributed Stochastic Neighbor Embedding）等を用いてもよいが、計算速度は、ＵＭＡＰがより高速である。 The dimension compression unit 13db visualizes the distribution of the determination results in the low-dimensional embedding space by the dimension compression using UMAP described above. Note that UMAP is an example of a dimension compression method, and the use of other methods is not limited. For example, principal component analysis, t-distributed stochastic neighbor embedding (t-SNE), etc. may be used, but UMAP is faster in terms of calculation speed.

ここで、着目点抽出部１３ｄａおよび次元圧縮部１３ｄｂによる可視化の具体例について、図５～図９を用いて説明する。図５は、着目点抽出部１３ｄａによる可視化の具体例の説明図（その１）である。また、図６は、着目点抽出部１３ｄａによる可視化の具体例の説明図（その２）である。 Here, specific examples of visualization by the point-of-interest extraction unit 13da and the dimension compression unit 13db will be described with reference to FIGS. 5 to 9. FIG. FIG. 5 is an explanatory diagram (Part 1) of a specific example of visualization by the point-of-interest extraction unit 13da. FIG. 6 is an explanatory diagram (part 2) of a specific example of visualization by the point-of-interest extraction unit 13da.

また、図７は、次元圧縮部１３ｄｂによる可視化の具体例の説明図（その１）である。また、図８は、次元圧縮部１３ｄｂによる可視化の具体例の説明図（その２）である。また、図９は、次元圧縮部１３ｄｂによる可視化の具体例の説明図（その３）である。 FIG. 7 is an explanatory diagram (part 1) of a specific example of visualization by the dimension compression unit 13db. FIG. 8 is an explanatory diagram (part 2) of a specific example of visualization by the dimension compression unit 13db. FIG. 9 is an explanatory diagram (part 3) of a specific example of visualization by the dimension compression unit 13db.

まず、図６を用いた説明では、図５に示すように、判定モデル１２ｃが、欠けＣがあると判定するクッキーＰの画像について考える。着目点抽出部１３ｄａは、このような欠けＣがあると判定される画像ｐ１，ｐ２，ｐ３…を判定モデル１２ｃへ入力し、欠けＣがあると判定された判定根拠を可視化する。 First, in the description using FIG. 6, as shown in FIG. 5, consider an image of a cookie P for which the determination model 12c determines that there is a chipped portion C. FIG. The point-of-interest extraction unit 13da inputs the images p1, p2, p3, .

深層学習ネットワークは、畳み込み層とプーリング層を何層にもわたって積み重ねた特徴抽出部と、その特徴量出力を受け取ってクラスラベルと照合して教師あり学習を行う識別部との２つの部分に分けられる。また、識別部は通常、全結合の多層ニューラルネットワークで構成され、その最終層は特徴量を各分類クラスのスコアに変換するソフトマックス層になっている。 A deep learning network consists of two parts: a feature extraction part that stacks many layers of convolution layers and pooling layers, and a classification part that receives the feature value output and compares it with class labels to perform supervised learning. divided. In addition, the classifier is usually composed of a fully-connected multi-layer neural network, the final layer of which is a softmax layer that converts the feature quantity into a score for each classification class.

スコアは、入力画像に各分類クラスのタグが付与される確率（類似度と言い換えても可）や確度である。判定モデル１２ｃによる判定結果は、かかるスコアが最大となる分類クラスである。 The score is the probability (can also be referred to as similarity) or certainty that the tag of each classification class is assigned to the input image. The determination result of the determination model 12c is the classification class with the maximum score.

着目点抽出部１３ｄａは、Ｇｒａｄ－ＣＡＭにより、分類クラスごとのスコアへの影響が大きい画像箇所を微分係数（勾配と言い換えても可）の平均化によって特定し、ヒートマップ化する。 The point-of-interest extracting unit 13da uses Grad-CAM to identify image locations that greatly affect the score for each classification class by averaging differential coefficients (which can be called gradients), and converts them into a heat map.

図６には、かかるヒートマップの例を示している。図６の例では、画像ｐ１，ｐ２については、欠けＣの部分のみがヒートマップ化され、判定モデル１２ｃが、まさに欠けＣに着目して欠けＣがあると判定していることが分かる。したがって、画像ｐ１，ｐ２は、分類クラス「欠け」の学習用画像として適していることが一目で分かる。 FIG. 6 shows an example of such a heat map. In the example of FIG. 6, for images p1 and p2, only the portion of chipping C is heatmapped, and it can be seen that the determination model 12c just focuses on chipping C and determines that chipping C exists. Therefore, it can be seen at a glance that the images p1 and p2 are suitable as learning images for the classification class "missing".

一方で、画像ｐ３については、欠けＣの部分だけでなく、焦げＢの部分もヒートマップ化され、判定モデル１２ｃが、欠けＣだけでなく焦げＢにも着目していることが分かる。言い換えれば、画像ｐ３は、分類クラス「欠け」の学習用画像としては、ノイズ成分を含むものであることが一目で分かる。こうした場合に、かかるヒートマップは、オペレータに、画像ｐ３が分類クラス「欠け」の学習用画像としては適さないとして、学習用から除外させることができる。これにより、効率よく判定精度の向上に資することができる。 On the other hand, for the image p3, not only the chipped portion C but also the burnt portion B are heat-mapped. In other words, it can be seen at a glance that the image p3 contains noise components as a learning image of the classification class “missing”. In such a case, such a heat map can cause the operator to exclude image p3 from training as it is not suitable as a training image for the classification class "missing". As a result, it is possible to efficiently improve the determination accuracy.

また、図７～図９に示すように、次元圧縮部１３ｄｂは、たとえば判定モデル１２ｃの高次元空間の特徴量マップを低次元（ここでは、３次元）に次元圧縮し、低次元の埋め込み空間に判定結果の分布を可視化する。また、次元圧縮部１３ｄｂは、かかる可視化情報を、オペレータに操作可能なＧＵＩ（Graphic User Interface）ツールとして生成する。 Further, as shown in FIGS. 7 to 9, the dimension compression unit 13db dimension-compresses, for example, the high-dimensional space feature quantity map of the judgment model 12c to a low dimension (here, three dimensions) to obtain a low-dimensional embedding space. Visualize the distribution of judgment results. Also, the dimension compression unit 13db generates such visualization information as a GUI (Graphic User Interface) tool that can be operated by the operator.

たとえば、図７に示すように、次元圧縮部１３ｄｂは、各判定結果に対応するチェックボックスを有するＧＵＩツールを生成する。かかるＧＵＩツールにおいて、図７に示すように、「欠け」および「焦げ」がチェックされたものとする。 For example, as shown in FIG. 7, the dimension compression unit 13db generates a GUI tool having check boxes corresponding to each determination result. It is assumed that "missing" and "burning" are checked in such a GUI tool as shown in FIG.

すると、図７に示すように、「欠け」の判定結果を受けた各画像と、「焦げ」の判定を受けた各画像との、次元圧縮された低次元空間における分布が可視化される。なお、図中の低次元空間における丸印の各々は、各画像に対応しており、次元圧縮部１３ｄｂは、オペレータがその一つ一つを選択可能となるようにＧＵＩツールを生成する。 Then, as shown in FIG. 7, the distribution in the dimensionally compressed low-dimensional space of each image determined as "missing" and each image determined as "burned" is visualized. Each circle in the low-dimensional space in the drawing corresponds to each image, and the dimension compression unit 13db generates a GUI tool so that the operator can select one of them.

ここで、図中のカーソルＣｒが指すように、たとえば「欠け」と判定されているものの、特徴量としては「欠け」よりも「焦げ」の方にきわめて近い画像があり、オペレータがこれを選択したものとする。 Here, as indicated by the cursor Cr in the figure, there is an image that is determined to be, for example, "missing", but the feature amount is much closer to "burnt" than "missing", and the operator selects this image. shall be

すると、図８に示すように、次元圧縮部１３ｄｂは、かかる画像のファイル名や分類クラスのラベル名といった画像の詳細情報が示されるようにＧＵＩツールを生成する。同図の場合、その詳細情報によれば、該当の画像「ＩＭＧ＿１００１．ｐｎｇ」が分類クラス「欠け」であるにも関わらず、その特徴量は「焦げ」にきわめて近いため、本来であれば該当の画像が「焦げ」と判定されるべき誤判定であることが分かる。 Then, as shown in FIG. 8, the dimension compression unit 13db generates a GUI tool so that detailed information about the image, such as the file name of the image and the label name of the classification class, is displayed. In the case of the same figure, according to the detailed information, although the corresponding image "IMG_1001.png" is classified as "missing", its feature value is very close to "burnt". is an erroneous determination that should be determined as "burnt".

したがって、オペレータは、かかるＧＵＩツールにより、誤判定を一目で把握することができる。そして、オペレータは、分類部１３ｂに該当の画像の分類をやり直させたうえで、学習部１３ｃが判定モデル１２ｃを学習することにより、判定モデル１２ｃの判定精度を向上させることができる。 Therefore, the operator can grasp the erroneous judgment at a glance by such a GUI tool. Then, the operator causes the classification unit 13b to reclassify the corresponding image, and the learning unit 13c learns the determination model 12c, thereby improving the determination accuracy of the determination model 12c.

なお、図９に示すように、ＧＵＩツールにおいて、さらに「割れ」のチェックボックスがチェックされた場合には、低次元空間にさらに「割れ」の各画像の分布が可視化されることとなる。また、図７～図９には図示していないが、オペレータは、ＧＵＩツール上の低次元空間を任意に３６０°回転させたり、拡大したり、縮小したりすることが可能である。 As shown in FIG. 9, when the "crack" check box is further checked in the GUI tool, the distribution of each "crack" image is further visualized in the low-dimensional space. Also, although not shown in FIGS. 7 to 9, the operator can arbitrarily rotate the low-dimensional space on the GUI tool by 360°, enlarge it, and reduce it.

図３の説明に戻る。表示制御部１３ｅは、解析部１３ｄの解析結果を表示部５に表示させる。表示部５から操作部３へ破線の矢印で示すように、オペレータが、図５～図９に示したような解析部１３ｄの解析結果に基づいて再分類を指示すると、分類部１３ｂは、学習用データセット１２ｂの学習用画像を再分類し、学習部１３ｃに判定モデル１２ｃを学習させる。 Returning to the description of FIG. The display control unit 13e causes the display unit 5 to display the analysis result of the analysis unit 13d. As indicated by the dashed arrow from the display unit 5 to the operation unit 3, when the operator instructs reclassification based on the analysis results of the analysis unit 13d as shown in FIGS. The training images in the training data set 12b are reclassified, and the learning unit 13c learns the determination model 12c.

配信部１３ｆは、通信部１１を介し、学習部１３ｃによって学習された判定モデル１２ｃを画像判定装置１００へ配信する。 The distribution unit 13 f distributes the determination model 12 c learned by the learning unit 13 c to the image determination device 100 via the communication unit 11 .

次に、画像判定装置１００の構成について説明する。図１０は、実施形態に係る画像判定装置１００のブロック図である。 Next, the configuration of the image determination device 100 will be described. FIG. 10 is a block diagram of the image determination device 100 according to the embodiment.

図１０に示すように、実施形態に係る画像判定装置１００は、通信部１０１と、記憶部１０２と、制御部１０３とを備える。 As shown in FIG. 10, the image determination apparatus 100 according to the embodiment includes a communication section 101, a storage section 102, and a control section 103. FIG.

通信部１０１は、上述した通信部１１と同様に、たとえば、ＮＩＣ等によって実現される。通信部１０１は、ネットワークＮ、カメラ１５０およびプロジェクタ制御装置２００に対し有線または無線で接続され、学習装置１０、カメラ１５０およびプロジェクタ制御装置２００との間で情報の送受信を行う。 The communication unit 101 is realized by, for example, a NIC, like the communication unit 11 described above. The communication unit 101 is wired or wirelessly connected to the network N, the camera 150 and the projector control device 200 and transmits and receives information to and from the learning device 10 , the camera 150 and the projector control device 200 .

記憶部１０２は、上述した記憶部１２と同様に、たとえば、ＲＡＭ、ＲＯＭ、フラッシュメモリ等の半導体メモリ素子、または、ハードディスク、光ディスク等の記憶装置によって実現される。図１０に示す例では、記憶部１０２は、判定モデル１０２ａを記憶する。判定モデル１０２ａは、学習装置から配信される判定モデル１２ｃに相当する。 The storage unit 102 is realized by, for example, a semiconductor memory device such as a RAM, a ROM, a flash memory, or a storage device such as a hard disk or an optical disk, like the storage unit 12 described above. In the example shown in FIG. 10, the storage unit 102 stores a determination model 102a. The judgment model 102a corresponds to the judgment model 12c delivered from the learning device.

制御部１０３は、上述した制御部１３と同様に、コントローラであり、たとえば、ＣＰＵやＭＰＵ等によって、記憶部１０２に記憶されている図示略の各種プログラムがＲＡＭを作業領域として実行されることにより実現される。また、制御部１０３は、上述した制御部１３と同様に、たとえば、ＡＳＩＣやＦＰＧＡ等の集積回路により実現することができる。 The control unit 103 is a controller similar to the control unit 13 described above. Realized. Also, the control unit 103 can be implemented by an integrated circuit such as an ASIC or FPGA, like the control unit 13 described above.

制御部１０３は、取得部１０３ａと、判定部１０３ｂと、出力部１０３ｃとを有し、以下に説明する情報処理の機能や作用を実現または実行する。 The control unit 103 has an acquisition unit 103a, a determination unit 103b, and an output unit 103c, and implements or executes information processing functions and actions described below.

取得部１０３ａは、通信部１０１を介し、学習装置１０から配信される判定モデル１２ｃを取得し、判定モデル１０２ａとして記憶部１０２へ記憶させる。また、取得部１０３ａは、通信部１０１を介し、カメラ１５０によって撮影されるクッキーＰの画像を取得し、判定部１０３ｂへ出力する。 Acquisition unit 103a acquires judgment model 12c distributed from learning device 10 via communication unit 101, and stores it in storage unit 102 as judgment model 102a. Acquisition unit 103a acquires an image of cookie P captured by camera 150 via communication unit 101, and outputs the image to determination unit 103b.

判定部１０３ｂは、取得部１０３ａによって取得された画像を判定モデル１０２ａへ入力し、判定モデル１０２ａから判定結果を取得する。また、判定部１０３ｂは、取得した判定結果を出力部１０３ｃへ出力する。 The determination unit 103b inputs the image acquired by the acquisition unit 103a to the determination model 102a, and acquires the determination result from the determination model 102a. Further, the determination unit 103b outputs the acquired determination result to the output unit 103c.

出力部１０３ｃは、通信部１０１を介し、判定部１０３ｂからの判定結果を学習装置１０およびプロジェクタ制御装置２００に対し出力する。 The output unit 103 c outputs the determination result from the determination unit 103 b to the learning device 10 and the projector control device 200 via the communication unit 101 .

次に、プロジェクタ制御装置２００の構成について説明する。図１１は、実施形態に係るプロジェクタ制御装置２００のブロック図である。 Next, the configuration of the projector control device 200 will be described. FIG. 11 is a block diagram of the projector control device 200 according to the embodiment.

図１１に示すように、実施形態に係るプロジェクタ制御装置２００は、通信部２０１と、記憶部２０２と、制御部２０３とを備える。 As shown in FIG. 11, the projector control device 200 according to the embodiment includes a communication section 201, a storage section 202, and a control section 203.

通信部２０１は、上述した通信部１１，１０１と同様に、たとえば、ＮＩＣ等によって実現される。通信部２０１は、画像判定装置１００およびコンベア装置３００に対し有線または無線で接続され、画像判定装置１００およびコンベア装置３００との間で情報の送受信を行う。 The communication unit 201 is implemented by, for example, a NIC, like the communication units 11 and 101 described above. The communication unit 201 is connected to the image determination device 100 and the conveyor device 300 by wire or wirelessly, and transmits and receives information to and from the image determination device 100 and the conveyor device 300 .

記憶部２０２は、上述した記憶部１２，１０２と同様に、たとえば、ＲＡＭ、ＲＯＭ、フラッシュメモリ等の半導体メモリ素子、または、ハードディスク、光ディスク等の記憶装置によって実現される。図１１に示す例では、記憶部２０２は、投影設定情報２０２ａを記憶する。投影設定情報２０２ａは、画像判定装置１００からの判定結果に応じたマーカーの投影に関する設定情報である。 The storage unit 202 is realized by, for example, a semiconductor memory device such as a RAM, ROM, flash memory, or a storage device such as a hard disk or an optical disk, like the storage units 12 and 102 described above. In the example shown in FIG. 11, the storage unit 202 stores projection setting information 202a. The projection setting information 202 a is setting information regarding the projection of the marker according to the determination result from the image determination device 100 .

制御部２０３は、上述した制御部１３，１０３と同様に、コントローラであり、たとえば、ＣＰＵやＭＰＵ等によって、記憶部２０２に記憶されている図示略の各種プログラムがＲＡＭを作業領域として実行されることにより実現される。また、制御部２０３は、上述した制御部１３，１０３と同様に、たとえば、ＡＳＩＣやＦＰＧＡ等の集積回路により実現することができる。 The control unit 203 is a controller similar to the control units 13 and 103 described above, and various programs (not shown) stored in the storage unit 202 are executed by the CPU, MPU, etc., using the RAM as a work area. It is realized by Further, like the control units 13 and 103 described above, the control unit 203 can be implemented by an integrated circuit such as an ASIC or FPGA, for example.

制御部２０３は、取得部２０３ａと、投影制御部２０３ｂとを有し、以下に説明する情報処理の機能や作用を実現または実行する。 The control unit 203 has an acquisition unit 203a and a projection control unit 203b, and implements or executes information processing functions and actions described below.

取得部２０３ａは、通信部２０１を介し、画像判定装置１００から出力される判定結果を取得し、投影制御部２０３ｂへ出力する。また、取得部２０３ａは、通信部２０１を介し、コンベア装置３００からコンベアの搬送速度を取得し、投影制御部２０３ｂへ出力する。 The acquisition unit 203a acquires the determination result output from the image determination apparatus 100 via the communication unit 201, and outputs the determination result to the projection control unit 203b. Also, the acquisition unit 203a acquires the conveying speed of the conveyor from the conveyor device 300 via the communication unit 201, and outputs it to the projection control unit 203b.

投影制御部２０３ｂは、取得部２０３ａによって取得された判定結果、搬送速度および投影設定情報２０２ａに基づき、プロジェクタ４００によるマーカーの投影を制御する。 The projection control unit 203b controls projection of the marker by the projector 400 based on the determination result, the transport speed, and the projection setting information 202a acquired by the acquisition unit 203a.

次に、実施形態に係る画像判定システム１が実行する処理手順について、図１２を用いて説明する。図１２は、実施形態に係る画像判定装置１００が実行する処理手順を示す処理シーケンスである。 Next, a processing procedure executed by the image determination system 1 according to the embodiment will be described with reference to FIG. 12 . FIG. 12 is a processing sequence showing a processing procedure executed by the image determination device 100 according to the embodiment.

図１２に示すように、まず画像判定システム１の運用前等において、学習装置１０が学習用画像を収集する（ステップＳ１０１）。そして、学習装置１０は、学習用画像を分類し（ステップＳ１０２）、学習用データセット１２ｂを生成する（ステップＳ１０３）。 As shown in FIG. 12, before the image determination system 1 is operated, the learning device 10 collects learning images (step S101). Then, the learning device 10 classifies the learning images (step S102) and generates a learning data set 12b (step S103).

そして、学習装置１０は、学習用データセット１２ｂを用いて判定モデル１２ｃを学習し（ステップＳ１０４）、画像判定装置１００へ判定モデル１２ｃを配信する（ステップＳ１０５）。 Then, the learning device 10 learns the determination model 12c using the learning data set 12b (step S104), and distributes the determination model 12c to the image determination device 100 (step S105).

画像判定装置１００は、カメラ１５０によって撮影された画像を取得し（ステップＳ１０６）、判定モデル１０２ａを用いて画像を判定する（ステップＳ１０７）。そして、判定結果を学習装置１０およびプロジェクタ制御装置２００へ出力する（ステップＳ１０８，Ｓ１０９）。 The image determination device 100 acquires the image captured by the camera 150 (step S106), and determines the image using the determination model 102a (step S107). Then, the determination result is output to the learning device 10 and the projector control device 200 (steps S108 and S109).

学習装置１０は、画像判定装置１００からの判定結果を収集し（ステップＳ１１０）、着目点抽出による解析（ステップＳ１１１）、および、次元圧縮による解析（ステップＳ１１２）を実行する。 The learning device 10 collects determination results from the image determination device 100 (step S110), and performs analysis by extracting the point of interest (step S111) and analysis by dimension compression (step S112).

そして、学習装置１０は、それらの解析結果に基づき、学習用画像を再分類させる（ステップＳ１１３）。そして、ステップＳ１０４からの処理を繰り返す。 Then, the learning device 10 reclassifies the learning images based on those analysis results (step S113). Then, the processing from step S104 is repeated.

一方、プロジェクタ制御装置２００は、画像判定装置１００からの判定結果に応じたプロジェクタ投影を行うことを繰り返す（ステップＳ１１４）。 On the other hand, the projector control device 200 repeats projector projection according to the determination result from the image determination device 100 (step S114).

なお、上述してきた実施形態に係る学習装置１０、画像判定装置１００およびプロジェクタ制御装置２００は、たとえば図１３に示すような構成のコンピュータ１０００によって実現される。学習装置１０を例に挙げて説明する。図１３は、学習装置１０の機能を実現するコンピュータの一例を示すハードウェア構成図である。コンピュータ１０００は、ＣＰＵ１１００、ＲＡＭ１２００、ＲＯＭ１３００、ＨＤＤ（Hard Disk Drive）１４００、通信インタフェース（Ｉ／Ｆ）１５００、入出力インタフェース（Ｉ／Ｆ）１６００、および、メディアインタフェース（Ｉ／Ｆ）６７を備える。 Note that the learning device 10, the image determination device 100, and the projector control device 200 according to the above-described embodiments are implemented by a computer 1000 configured as shown in FIG. 13, for example. The learning device 10 will be described as an example. FIG. 13 is a hardware configuration diagram showing an example of a computer that implements the functions of the learning device 10. As shown in FIG. Computer 1000 includes CPU 1100 , RAM 1200 , ROM 1300 , HDD (Hard Disk Drive) 1400 , communication interface (I/F) 1500 , input/output interface (I/F) 1600 , and media interface (I/F) 67 .

ＣＰＵ１１００は、ＲＯＭ１３００またはＨＤＤ１４００に格納されたプログラムに基づいて動作し、各部の制御を行う。ＲＯＭ１３００は、コンピュータ１０００の起動時にＣＰＵ１１００によって実行されるブートプログラムや、コンピュータ１０００のハードウェアに依存するプログラム等を格納する。 The CPU 1100 operates based on programs stored in the ROM 1300 or HDD 1400 and controls each section. The ROM 1300 stores a boot program executed by the CPU 1100 when the computer 1000 is started up, a program depending on the hardware of the computer 1000, and the like.

ＨＤＤ１４００は、ＣＰＵ１１００によって実行されるプログラムおよび当該プログラムによって使用されるデータ等を格納する。通信インタフェース１５００は、通信ネットワークを介して他の機器からデータを受信してＣＰＵ１１００へ送り、ＣＰＵ１１００が生成したデータを、通信ネットワークを介して他の機器へ送信する。 HDD 1400 stores programs executed by CPU 1100 and data used by the programs. Communication interface 1500 receives data from other devices via a communication network, sends the data to CPU 1100, and transmits data generated by CPU 1100 to other devices via the communication network.

ＣＰＵ１１００は、入出力インタフェース１６００を介して、ディスプレイやプリンタ等の出力装置、および、キーボードやマウス等の入力装置を制御する。ＣＰＵ１１００は、入出力インタフェース１６００を介して、入力装置からデータを取得する。また、ＣＰＵ１１００は、生成したデータを、入出力インタフェース１６００を介して出力装置へ出力する。 The CPU 1100 controls output devices such as displays and printers, and input devices such as keyboards and mice, via an input/output interface 1600 . CPU 1100 acquires data from an input device via input/output interface 1600 . CPU 1100 also outputs the generated data to an output device via input/output interface 1600 .

メディアインタフェース１７００は、記録媒体１８００に格納されたプログラムまたはデータを読み取り、ＲＡＭ１２００を介してＣＰＵ１１００に提供する。ＣＰＵ１１００は、当該プログラムを、メディアインタフェース１７００を介して記録媒体１８００からＲＡＭ１２００上にロードし、ロードしたプログラムを実行する。記録媒体１８００は、たとえばＤＶＤ（Digital Versatile Disc）、ＰＤ（Phase change rewritable Disk）等の光学記録媒体、ＭＯ（Magneto-Optical disk）等の光磁気記録媒体、テープ媒体、磁気記録媒体、または、半導体メモリ等である。 Media interface 1700 reads programs or data stored in recording medium 1800 and provides them to CPU 1100 via RAM 1200 . CPU 1100 loads the program from recording medium 1800 onto RAM 1200 via media interface 1700 and executes the loaded program. The recording medium 1800 is, for example, an optical recording medium such as a DVD (Digital Versatile Disc) or a PD (Phase change rewritable disc), a magneto-optical recording medium such as an MO (Magneto-Optical disk), a tape medium, a magnetic recording medium, or a semiconductor. memory and the like.

たとえば、コンピュータ１０００が実施形態に係る学習装置１０として機能する場合、コンピュータ１０００のＣＰＵ１１００は、ＲＡＭ１２００上にロードされたプログラムを実行することにより、制御部１３の各機能を実現する。また、ＨＤＤ１４００には、記憶部１２内のデータが記憶される。コンピュータ１０００のＣＰＵ１１００は、これらのプログラムを、記録媒体１８００から読み取って実行するが、他の例として、他の装置から、通信ネットワークを介してこれらのプログラムを取得してもよい。 For example, when computer 1000 functions as learning device 10 according to the embodiment, CPU 1100 of computer 1000 implements each function of control unit 13 by executing a program loaded on RAM 1200 . Data in the storage unit 12 is also stored in the HDD 1400 . CPU 1100 of computer 1000 reads and executes these programs from recording medium 1800, but as another example, these programs may be obtained from another device via a communication network.

上述してきたように、実施形態に係る画像判定システム１は、判定部１０３ｂと、着目点抽出部１３ｄａ（「第１の解析部」の一例に相当）と、次元圧縮部１３ｄｂ（「第２の解析部」の一例に相当）と、学習部１３ｃとを含む。判定部１０３ｂは、製造ラインにおける製品の画像を取得し、深層学習ネットワークである判定モデルを用いて分類判定する。着目点抽出部１３ｄａは、上記画像が判定モデルへ入力された場合の上記画像に対する判定モデルの着目点を抽出して可視化することによって判定モデルの判定結果を解析する。次元圧縮部１３ｄｂは、上記画像が判定モデルへ入力された場合の高次元空間における上記画像の特徴量を次元圧縮による低次元表現へ変換して可視化することによって判定モデルの判定結果を解析する。学習部１３ｃは、着目点抽出部１３ｄａおよび次元圧縮部１３ｄｂによる解析結果に基づいて分類された上記画像に基づいて判定モデルを学習する。 As described above, the image determination system 1 according to the embodiment includes the determination unit 103b, the point-of-interest extraction unit 13da (corresponding to an example of the “first analysis unit”), and the dimension compression unit 13db (“second analysis unit”). (corresponding to an example of an “analysis unit”) and a learning unit 13c. The determination unit 103b acquires images of products on the production line, and classifies and determines them using a determination model that is a deep learning network. The point-of-interest extraction unit 13da analyzes the determination result of the determination model by extracting and visualizing the points of interest of the determination model with respect to the image when the image is input to the determination model. The dimension compression unit 13db analyzes the determination result of the determination model by converting the feature amount of the image in the high-dimensional space when the image is input to the determination model into a low-dimensional expression by dimensional compression and visualizing it. The learning unit 13c learns a determination model based on the images classified based on the analysis results by the point-of-interest extraction unit 13da and the dimension compression unit 13db.

したがって、実施形態に係る画像判定システム１によれば、効率よく判定精度の向上を図ることができる。 Therefore, according to the image determination system 1 according to the embodiment, it is possible to efficiently improve the determination accuracy.

なお、上述した実施形態では、学習用画像の再分類に際し、オペレータの操作を要することとしたが、これに限られるものではなく、解析部１３ｄの解析結果に基づいて分類部１３ｂが自動的に再分類を行うようにしてもよい。 In the above-described embodiment, an operator's operation is required to reclassify the learning images, but the present invention is not limited to this. Reclassification may be performed.

かかる場合、分類部１３ｂは、たとえば解析部１３ｄが可視化したヒートマップや低次元空間マップを画像解析する画像解析機能を有し、その画像解析結果に基づいて学習用画像の再分類を行うこととなる。 In such a case, the classification unit 13b has an image analysis function for image analysis of a heat map or a low-dimensional space map visualized by the analysis unit 13d, for example, and reclassifies the learning images based on the image analysis result. Become.

また、上述した実施形態では、着目点抽出のアルゴリズムとしてＧｒａｄ－ＣＡＭを用いることしたが、これに限られるものではなく、たとえばＧｕｉｄｅｄＢａｃｋｐｒｏｐａｇａｔｉｏｎの結果にＧｒａｄ－ＣＡＭの出力を重ねるＧｕｉｄｅｄＧｒａｄ－ＣＡＭと呼ばれるアルゴリズム等を用いることとしてもよい。 Further, in the above-described embodiment, Grad-CAM is used as an algorithm for extracting the point of interest, but it is not limited to this. An algorithm or the like may be used.

また、上述した実施形態では、製造ラインにおける製品がクッキーＰであることとしたが、無論、製品の種別を限定するものではない。 Also, in the above-described embodiment, the product in the production line is the cookie P, but the type of the product is of course not limited.

さらなる効果や変形例は、当業者によって容易に導き出すことができる。このため、本発明のより広範な態様は、以上のように表しかつ記述した特定の詳細および代表的な実施形態に限定されるものではない。したがって、添付の特許請求の範囲およびその均等物によって定義される総括的な発明の概念の精神または範囲から逸脱することなく、様々な変更が可能である。 Further effects and modifications can be easily derived by those skilled in the art. Therefore, the broader aspects of the invention are not limited to the specific details and representative embodiments so shown and described. Accordingly, various changes may be made without departing from the spirit or scope of the general inventive concept defined by the appended claims and equivalents thereof.

１画像判定システム
１０学習装置
１２ｃ判定モデル
１３制御部
１３ａ収集部
１３ｂ分類部
１３ｃ学習部
１３ｄ解析部
１３ｄａ着目点抽出部
１３ｄｂ次元圧縮部
１３ｅ表示制御部
１３ｆ配信部
１００画像判定装置
１０２ａ判定モデル
１０３制御部
１０３ａ取得部
１０３ｂ判定部
１０３ｃ出力部
２００プロジェクタ制御装置
２０３制御部
２０３ａ取得部
２０３ｂ投影制御部
３００コンベア装置
４００プロジェクタ 1 image determination system 10 learning device 12c determination model 13 control unit 13a collection unit 13b classification unit 13c learning unit 13d analysis unit 13da attention point extraction unit 13db dimension compression unit 13e display control unit 13f distribution unit 100 image determination device 102a determination model 103 control Unit 103a Acquisition unit 103b Determination unit 103c Output unit 200 Projector control device 203 Control unit 203a Acquisition unit 203b Projection control unit 300 Conveyor device 400 Projector

Claims

A judgment step of acquiring an image of a product in a manufacturing line and classifying it using a judgment model that is a deep learning network;
a first analysis step of analyzing a determination result of the determination model by extracting and visualizing a focus point of the determination model with respect to the image when the image is input to the determination model;
a second analysis step of analyzing the determination result of the determination model by converting the feature amount of the image in a high-dimensional space when the image is input to the determination model into a low-dimensional representation by dimensional compression and visualizing the image. When,
and a learning step of learning the judgment model based on the images classified based on the analysis results of the first analysis step and the second analysis step.

The learning step includes:
wherein, in the first analysis step, when the points of interest corresponding to a plurality of different classification classes are extracted for one of the images, the image is excluded from learning images and the judgment model is learned. The image determination method according to claim 1, wherein

The second analysis step includes
3. The image determination method according to claim 1, wherein the distribution of determination results of the determination model based on the low-dimensional representation is made into a GUI, and a user is allowed to classify the images via the GUI.

The first analysis step includes
4. The image determination method according to claim 1, wherein the point of interest is extracted and visualized using a Grad-CAM.

The second analysis step includes
The image determination method according to any one of claims 1 to 4, wherein the feature amount of the image is converted into the low-dimensional representation by the dimensional compression using UMAP and visualized.

The image determination method according to any one of claims 1 to 5, further comprising a determination result reflecting step of reflecting the determination result of the determination model on the production line.

The production line is
Having a projector that projects a marker onto the product on the production line,
The determination result reflection step includes:
7. The image determination method according to claim 6, wherein at least the color and shape of the marker are changed according to the classification class of the product and the action to be taken.

The determination result reflection step includes:
8. The image determination method according to claim 7, wherein the marker is scrolled at the same speed as the conveying speed of the product.

A determination unit that acquires an image of a product in a manufacturing line and uses a determination model that is a deep learning network to make a classification determination;
a first analysis unit that analyzes a determination result of the determination model by extracting and visualizing a focus point of the determination model with respect to the image when the image is input to the determination model;
a second analysis unit for analyzing the determination result of the determination model by converting the feature amount of the image in a high-dimensional space when the image is input to the determination model into a low-dimensional representation by dimensional compression and visualizing the image; When,
and a learning unit that learns the determination model based on the images classified based on the analysis results of the first analysis unit and the second analysis unit.

A judgment procedure for acquiring images of products in the manufacturing line and classifying and judging using a judgment model that is a deep learning network;
a first analysis procedure for analyzing a determination result of the determination model by extracting and visualizing a focus point of the determination model for the image when the image is input to the determination model;
A second analysis procedure for analyzing the judgment result of the judgment model by converting the feature amount of the image in a high-dimensional space when the image is input to the judgment model into a low-dimensional representation by dimensional compression and visualizing it. When,
and a learning procedure for learning the judgment model based on the images classified based on the analysis results in the first analysis procedure and the second analysis procedure.