JP2020003879A

JP2020003879A - Information processing device, information processing method, watermark detection device, watermark detection method, and program

Info

Publication number: JP2020003879A
Application number: JP2018120128A
Authority: JP
Inventors: 茂之酒澤; Shigeyuki Sakasawa; 絵美明堂; Emi Meido; 和之田坂; Kazuyuki Tasaka
Original assignee: KDDI Corp
Current assignee: KDDI Corp
Priority date: 2018-06-25
Filing date: 2018-06-25
Publication date: 2020-01-09
Anticipated expiration: 2038-06-25
Also published as: JP6892844B2

Abstract

【課題】学習モデルの出所を特定できるようにする。【解決手段】モデル取得部１１０は、入力された画像データが複数の識別対象それぞれの画像データである蓋然性を示す複数の指標値から構成される識別ラベルを出力するように機械学習された学習モデルを取得する。透かしパターン取得部１１１は、学習モデルに埋め込むための透かしパターンを取得する。画像取得部１１２は、学習モデルに透かしパターンを埋め込むための再学習に用いる再学習用学習データを構成する透かし用画像データセットを取得する。重畳用パターン取得部１１３は、透かし用画像データセットに含まれる画像データと同数のパターンであって、各パターンの重ね合わせが透かしパターンとなる複数のパターンを取得する。学習用データ生成部１１４は、透かし用画像データセットに含まれる画像データと複数のパターンとの組み合わせを再学習用学習データとして生成する。【選択図】図４PROBLEM TO BE SOLVED: To identify the source of a learning model. A model acquisition unit 110 is a learning model machine-learned to output an identification label composed of a plurality of index values indicating the probability that input image data is image data of a plurality of identification targets. To get. The watermark pattern acquisition unit 111 acquires a watermark pattern to be embedded in the learning model. The image acquisition unit 112 acquires a watermark image data set forming re-learning learning data used for re-learning for embedding a watermark pattern in a learning model. The superimposition pattern acquisition unit 113 acquires a plurality of patterns having the same number of patterns as the image data included in the watermark image data set, and the superposition of the patterns makes a watermark pattern. The learning data generation unit 114 generates a combination of image data included in the watermark image data set and a plurality of patterns as re-learning learning data. [Selection diagram] Fig. 4

Description

本発明は情報処理装置、情報処理方法、透かし検出装置、透かし検出方法、及びプログラムに関し、特に学習モデルに透かしを埋め込む技術及び学習モデルに埋め込んだ透かしを検出する技術に関する。 The present invention relates to an information processing device, an information processing method, a watermark detection device, a watermark detection method, and a program, and particularly to a technology for embedding a watermark in a learning model and a technology for detecting a watermark embedded in a learning model.

近年、ＣＰＵ（Central Processing Unit）及びＧＰＵ（Graphics Processing Unit）の高速化、メモリの大容量化、及び機械学習技術が急速に進んできている。このため、数十万から百万といったオーダーの学習データを用いる機械学習が可能となり、精度の高い識別技術や分類技術が確立されつつある（非特許文献１参照）。 In recent years, CPUs (Central Processing Units) and GPUs (Graphics Processing Units) have become faster, memories have a larger capacity, and machine learning techniques have been rapidly advanced. For this reason, machine learning using learning data on the order of hundreds of thousands to one million is possible, and a highly accurate identification technology and classification technology are being established (see Non-Patent Document 1).

Yangqing Jia, Evan Shelhamer, Jeff Donahue, Sergey Karayev, Jonathan Long, Ross Girshick, Sergio Guadarrama, and Trevor Darrell. Caffe: Convolutional architecture for fast feature embedding. In Proceedings of the 22nd ACM international conference on Multimedia (pp. 675-678). ACM.Yangqing Jia, Evan Shelhamer, Jeff Donahue, Sergey Karayev, Jonathan Long, Ross Girshick, Sergio Guadarrama, and Trevor Darrell.Caffe: Convolutional architecture for fast feature embedding.In Proceedings of the 22nd ACM international conference on Multimedia (pp. 675-678) ACM.

大量の学習データに基づく機械学習を実行するためには大量の計算コストがかかる。また、大量の学習データを用意すること、及び用意した学習データを機械学習に用いるために加工する前処理にも膨大な労力を要する。一方で、機械学習によって生成された学習モデルはデジタルデータであり、その複製は容易である。さらに、一般に学習モデル生成に用いられた学習データを、学習モデル自体から推測することは難しい。 Executing machine learning based on a large amount of learning data requires a large amount of calculation cost. In addition, a large amount of labor is required for preparing a large amount of learning data and for preprocessing for processing the prepared learning data for use in machine learning. On the other hand, a learning model generated by machine learning is digital data, and its copying is easy. Further, it is generally difficult to estimate learning data used for generating a learning model from the learning model itself.

このため、学習モデルを生成した者は、その学習モデルが第三者によって不正に利用されたとしても、不正を立証することが難しい。収集した学習データと、学習データに基づいて生成された学習モデルとはそれぞれ労力をかけて取得した価値あるものであり、不正利用から学習モデルを守ることが望まれている。 Therefore, it is difficult for a person who has generated the learning model to prove the fraud even if the learning model is fraudulently used by a third party. The collected learning data and the learning model generated based on the learning data are valuable values obtained with effort, and it is desired to protect the learning model from unauthorized use.

本発明はこれらの点に鑑みてなされたものであり、学習モデルの出所を特定できるようにする技術を提供することを目的とする。 The present invention has been made in view of these points, and an object of the present invention is to provide a technique that enables the source of a learning model to be specified.

本発明の第１の態様は、情報処理装置である。この装置は、入力された画像データが複数の識別対象それぞれの画像データである蓋然性を示す複数の指標値から構成される識別ラベルを出力するように機械学習された学習モデルを取得するモデル取得部と、前記学習モデルに埋め込むための透かしパターンを取得する透かしパターン取得部と、前記学習モデルに前記透かしパターンを埋め込むための再学習に用いる再学習用学習データを構成する透かし用画像データセットを取得する画像取得部と、前記透かし用画像データセットに含まれる画像データと同数のパターンであって、各パターンの重ね合わせが前記透かしパターンとなる複数のパターンを取得する重畳用パターン取得部と、前記透かし用画像データセットに含まれる画像データと前記複数のパターンとの組み合わせを前記再学習用学習データとして生成する学習用データ生成部と、を備える。 A first aspect of the present invention is an information processing device. This apparatus obtains a learning model machine-learned so as to output an identification label including a plurality of index values indicating the probability that the input image data is image data of a plurality of identification targets, respectively. A watermark pattern acquisition unit for acquiring a watermark pattern to be embedded in the learning model; and acquiring a watermark image data set constituting learning data for re-learning used for re-learning for embedding the watermark pattern in the learning model. An image acquisition unit that performs the same number of patterns as the image data included in the watermark image data set, and a superimposition pattern acquisition unit that acquires a plurality of patterns in which each pattern overlaps the watermark pattern; The combination of the image data included in the watermark image data set and the plurality of patterns is Comprising a learning data generating unit that generates a 習用 training data.

前記情報処理装置は、前記透かし用画像データセットに含まれる画像データそれぞれを入力とする前記学習モデルの出力である複数の識別ラベルを取得するモデル適用部をさらに備えてもよく、前記重畳用パターン取得部は、前記モデル適用部が取得した複数の識別ラベルの重ね合わせが前記透かしパターンとなるように、前記複数の識別ラベルのうちの少なくとも一部の識別ラベルを修正した識別ラベルを前記パターンとして取得してもよく、前記学習用データ生成部は、各識別ラベルと当該識別ラベルに対応する前記画像データとの組み合わせを前記再学習用学習データとして生成してもよい。 The information processing apparatus may further include a model application unit that obtains a plurality of identification labels that are outputs of the learning model, each of which receives image data included in the watermark image data set, and the superimposition pattern The acquiring unit is an identification label obtained by correcting at least a part of the plurality of identification labels so that a superimposition of the plurality of identification labels acquired by the model application unit becomes the watermark pattern. The learning data generator may generate a combination of each identification label and the image data corresponding to the identification label as the learning data for re-learning.

前記情報処理装置は、前記再学習用学習データを用いて前記学習モデルを再学習する学習実行部をさらに備えてもよい。 The information processing device may further include a learning execution unit that re-learns the learning model using the learning data for re-learning.

前記識別ラベルは、前記複数の識別対象と同数の要素から構成されるベクトルであってもよく、前記透かしパターンは、前記識別ラベルの要素と同数の画素から構成される画像であってもよく、前記学習用データ生成部は、前記複数の識別ラベルそれぞれの要素を所定の手順にしたがって２次元データに変換する変換部と、前記変換部が変換した複数の２次元データのそれぞれと前記透かしパターンとに基づいて、前記２次元データそれぞれを修正する修正部と、を備えてもよく、前記変換部は、修正された前記２次元データを前記識別ラベルの形式に変換してもよい。 The identification label may be a vector composed of the same number of elements as the plurality of identification targets, the watermark pattern may be an image composed of the same number of pixels as the elements of the identification label, The learning data generation unit includes: a conversion unit configured to convert each element of the plurality of identification labels into two-dimensional data according to a predetermined procedure; and each of the plurality of two-dimensional data converted by the conversion unit and the watermark pattern. And a correction unit that corrects each of the two-dimensional data based on the two-dimensional data, and the conversion unit may convert the corrected two-dimensional data into the format of the identification label.

前記学習用データ生成部は、前記変換部が変換した２次元データと前記透かしパターンとの差異が所定の除外条件を満たす場合、当該２次元データに対応する画像データを前記透かし用画像データセットから除外する画像選択部をさらに備えてもよい。 When the difference between the two-dimensional data converted by the conversion unit and the watermark pattern satisfies a predetermined exclusion condition, the learning data generation unit converts image data corresponding to the two-dimensional data from the watermark image data set. An image selection unit to be excluded may be further provided.

前記透かしパターン取得部は、前記学習モデルに埋め込むための２以上の異なる透かしパターンを取得してもよく、前記画像取得部は、前記異なる透かしパターンそれぞれに対応する複数の前記透かし用画像データセットを取得してもよく、前記学習用データ生成部は、前記異なる透かしパターンそれぞれに対応する前記再学習用学習データを生成してもよい。 The watermark pattern obtaining unit may obtain two or more different watermark patterns for embedding in the learning model, and the image obtaining unit may obtain a plurality of the watermark image data sets corresponding to the different watermark patterns, respectively. The learning data generating unit may generate the learning data for re-learning corresponding to each of the different watermark patterns.

本発明の第２の態様は、情報処理方法である。この方法において、プロセッサが、入力された画像データが複数の識別対象それぞれの画像データである蓋然性を示す複数の指標値から構成される識別ラベルを出力するように機械学習された学習モデルを取得するステップと、前記学習モデルに埋め込むための透かしパターンを取得するステップと、前記学習モデルに前記透かしパターンを埋め込むための再学習に用いる再学習用学習データを構成する透かし用画像データセットを取得するステップと、前記透かし用画像データセットに含まれる画像データと同数のパターンであって、各パターンの重ね合わせが前記透かしパターンとなる複数のパターンを取得するステップと、前記透かし用画像データセットに含まれる画像データと前記複数のパターンとの組み合わせを前記再学習用学習データとして生成するステップと、を実行する。 A second aspect of the present invention is an information processing method. In this method, the processor acquires a learning model machine-learned to output an identification label including a plurality of index values indicating the probability that the input image data is image data of each of the plurality of identification targets. Acquiring a watermark pattern for embedding in the learning model, and acquiring a watermark image data set constituting learning data for re-learning used for re-learning for embedding the watermark pattern in the learning model. Obtaining a plurality of patterns, the number of which is the same as the number of image data included in the watermark image data set, and each pattern being superimposed is the watermark pattern; and Learning for re-learning the combination of image data and the plurality of patterns Performing the steps of generating as over data, the.

本発明の第３の態様は、プログラムである。このプログラムは、コンピュータに、入力された画像データが複数の識別対象それぞれの画像データである蓋然性を示す複数の指標値から構成される識別ラベルを出力するように機械学習された学習モデルを取得する機能と、前記学習モデルに埋め込むための透かしパターンを取得する機能と、前記学習モデルに前記透かしパターンを埋め込むための再学習に用いる再学習用学習データを構成する透かし用画像データセットを取得する機能と、前記透かし用画像データセットに含まれる画像データと同数のパターンであって、各パターンの重ね合わせが前記透かしパターンとなる複数のパターンを取得する機能と、前記透かし用画像データセットに含まれる画像データと前記複数のパターンとの組み合わせを前記再学習用学習データとして生成する機能と、を実現させる。 A third aspect of the present invention is a program. This program acquires a learning model machine-learned to output, to a computer, an identification label composed of a plurality of index values indicating the probability that input image data is image data of each of a plurality of identification targets. A function for acquiring a watermark pattern for embedding in the learning model, and a function for acquiring a watermark image data set constituting learning data for re-learning used for re-learning for embedding the watermark pattern in the learning model. And a function of acquiring a plurality of patterns that are the same number as the image data included in the watermark image data set and in which each pattern is superimposed becomes the watermark pattern; and a function included in the watermark image data set. A combination of image data and the plurality of patterns is generated as the re-learning learning data. A function of, to realize.

本発明の第４の態様は、透かし検出装置である。この装置は、透かし検出の対象とする学習モデルであって、入力された画像データが複数の識別対象それぞれの画像データである蓋然性を示す複数の指標値から構成される識別ラベルを出力するように機械学習された学習モデルを取得するモデル取得部と、学習モデルに埋め込まれた透かしを検出するための透かし用画像データセットを取得する画像取得部と、前記透かし用画像データセットに含まれる画像データそれぞれを入力とする前記学習モデルの出力である複数の識別ラベルを取得するモデル適用部と、前記複数の識別ラベルを重ね合わせて生成されるパターンを出力するパターン出力部と、を備える。 A fourth aspect of the present invention is a watermark detection device. This apparatus outputs an identification label, which is a learning model to be subjected to watermark detection, and is composed of a plurality of index values indicating the probability that input image data is image data of each of a plurality of identification targets. A model acquisition unit that acquires a learning model that has been machine-learned; an image acquisition unit that acquires a watermark image data set for detecting a watermark embedded in the learning model; and image data included in the watermark image data set. A model application unit that acquires a plurality of identification labels that are outputs of the learning model, each of which is an input, and a pattern output unit that outputs a pattern generated by superimposing the plurality of identification labels.

前記透かし検出装置は、検証対象となる透かしパターンを取得する透かしパターン取得部と、前記パターン出力部が出力したパターンと前記透かしパターンとを比較可能な態様で提示するパターン提示部と、を備えてもよい。 The watermark detection device includes a watermark pattern acquisition unit that acquires a watermark pattern to be verified, and a pattern presentation unit that presents the pattern output by the pattern output unit and the watermark pattern in a comparable manner. Is also good.

本発明の第５の態様は、透かし検出方法である。この方法において、プロセッサが、透かし検出の対象とする学習モデルであって、入力された画像データが複数の識別対象それぞれの画像データである蓋然性を示す複数の指標値から構成される識別ラベルを出力するように機械学習された学習モデルを取得するステップと、学習モデルに埋め込まれた透かしを検出するための透かし用画像データセットを取得するステップと、前記透かし用画像データセットに含まれる画像データそれぞれを入力とする前記学習モデルの出力である複数の識別ラベルを取得するステップと、前記複数の識別ラベルを重ね合わせて生成されるパターンを出力するステップと、を実行する。 A fifth aspect of the present invention is a watermark detection method. In this method, the processor outputs an identification label, which is a learning model to be subjected to watermark detection, and includes a plurality of index values indicating the probability that the input image data is image data of each of the plurality of identification targets. Acquiring a learning model machine-learned so as to obtain a watermark image data set for detecting a watermark embedded in the learning model, and acquiring image data sets included in the watermark image data set. And a step of obtaining a plurality of identification labels that are outputs of the learning model, and outputting a pattern generated by superimposing the plurality of identification labels.

本発明の第６の態様は、プログラムである。このプログラムは、コンピュータに、透かし検出の対象とする学習モデルであって、入力された画像データが複数の識別対象それぞれの画像データである蓋然性を示す複数の指標値から構成される識別ラベルを出力するように機械学習された学習モデルを取得する機能と、学習モデルに埋め込まれた透かしを検出するための透かし用画像データセットを取得する機能と、前記透かし用画像データセットに含まれる画像データそれぞれを入力とする前記学習モデルの出力である複数の識別ラベルを取得する機能と、前記複数の識別ラベルを重ね合わせて生成されるパターンを出力する機能と、を実現させる。 A sixth aspect of the present invention is a program. This program outputs, to a computer, an identification label that is a learning model to be subjected to watermark detection and that is configured with a plurality of index values indicating the probability that the input image data is image data of a plurality of identification targets. A function of acquiring a learning model machine-learned so as to acquire a watermark image data set for detecting a watermark embedded in the learning model, and a function of acquiring a watermark image data set included in the watermark image data set. And a function of outputting a pattern generated by superimposing the plurality of identification labels, the function of acquiring a plurality of identification labels which are outputs of the learning model.

上記のプログラムを提供するため、あるいはプログラムの一部をアップデートするために、このプログラムを記録したコンピュータ読み取り可能な記録媒体が提供されてもよく、また、このプログラムが通信回線で伝送されてもよい。 In order to provide the above-mentioned program or update a part of the program, a computer-readable recording medium on which the program is recorded may be provided, and the program may be transmitted via a communication line. .

なお、以上の構成要素の任意の組み合わせ、本発明の表現を方法、装置、システム、コンピュータプログラム、データ構造、記録媒体などの間で変換したものもまた、本発明の態様として有効である。 Note that any combination of the above-described components and any conversion of the expression of the present invention between a method, an apparatus, a system, a computer program, a data structure, a recording medium, and the like are also effective as embodiments of the present invention.

本発明によれば、学習モデルの出所を特定できるようにする技術を提供することができる。 ADVANTAGE OF THE INVENTION According to this invention, the technique which can specify the source of a learning model can be provided.

画像判別用の学習モデルを説明するための模式図である。It is a schematic diagram for explaining the learning model for image discrimination. 学習モデルのファインチューニングを説明するための模式図である。FIG. 4 is a schematic diagram for explaining fine tuning of a learning model. 複数の識別ラベルを重ね合わせることで透かし画像が形成される様子を模式的に示す図である。It is a figure which shows typically a mode that a watermark image is formed by superimposing several identification labels. 実施の形態に係る情報処理装置の機能構成を模式的に示す図である。FIG. 2 is a diagram schematically illustrating a functional configuration of the information processing apparatus according to the embodiment; 実施の形態に係る学習用データ生成部による識別ラベルの修正の一例を説明するための図である。FIG. 7 is a diagram for explaining an example of correction of an identification label by the learning data generation unit according to the embodiment. 実施の形態に係る学習用データ生成部の内部構成を模式的に示す図である。It is a figure which shows typically the internal structure of the learning data generation part which concerns on embodiment. 実施の形態に係る透かし検出装置の機能構成を模式的に示す図である。FIG. 2 is a diagram schematically illustrating a functional configuration of the watermark detection device according to the embodiment. 実施の形態に係るパターン出力部が出力するパターンの一例を示す図である。FIG. 5 is a diagram illustrating an example of a pattern output by a pattern output unit according to the embodiment. 実施の形態に係るパターン提示部２１５が提示する比較画面の一例を模式的に示す図である。FIG. 9 is a diagram schematically illustrating an example of a comparison screen presented by a pattern presentation unit 215 according to the embodiment. 実施の形態に係る情報処理装置が実行する処理の流れを説明するためのフローチャートである。5 is a flowchart illustrating a flow of a process performed by the information processing device according to the embodiment.

＜実施の形態の概要＞
以下、図１、図２、及び図３を参照して、実施の形態の概要を述べる。 <Outline of Embodiment>
Hereinafter, an outline of an embodiment will be described with reference to FIGS. 1, 2, and 3. FIG.

図１（ａ）−（ｂ）は、画像判別用の学習モデルを説明するための模式図である。実施の形態に係る情報処理装置は、例えばニューラルネットワークやＳＶＭ（Support Vector Machine）等の既知の機械学習技術を用いて作成された画像判別用の学習モデルに、透かしデータを埋め込むための装置である。また、実施の形態に係る透かし検出装置は、情報処理装置が透かしを埋め込んだ学習モデルから、埋め込まれた透かしを検出するための装置である。 FIGS. 1A and 1B are schematic diagrams for explaining a learning model for image discrimination. The information processing apparatus according to the embodiment is an apparatus for embedding watermark data in a learning model for image discrimination created using a known machine learning technique such as a neural network or SVM (Support Vector Machine). . The watermark detection device according to the embodiment is a device for detecting an embedded watermark from a learning model in which the information processing device has embedded the watermark.

図１（ａ）−（ｂ）において、学習モデルＭは複数の動物を識別対象とするモデルである。図１（ａ）−（ｂ）に示す例では、学習モデルＭは、猫、猿、犬、象、羊、キリン等を含む１００種類の動物を識別することができる。学習モデルＭに画像データを入力すると、学習モデルＭは、入力データが複数の識別対象それぞれの画像データである蓋然性を示す複数の指標値から構成される識別ラベルＬを出力する。限定はしないが、学習モデルＭが出力する識別ラベルＬは、識別対象である１００種類の動物それぞれに対応する１００種類の値を並べたデータであり、典型的には１００個の要素を持つベクトルデータである。識別ラベルＬの各要素は０以上１以下の値を取り、学習モデルＭに入力された画像データが１００種類の動物それぞれである蓋然性を示している。 1A and 1B, a learning model M is a model in which a plurality of animals are identified. In the example shown in FIGS. 1A and 1B, the learning model M can identify 100 types of animals including cats, monkeys, dogs, elephants, sheep, giraffes, and the like. When image data is input to the learning model M, the learning model M outputs an identification label L including a plurality of index values indicating the probability that the input data is image data of each of the plurality of identification targets. Although not limited, the identification label L output by the learning model M is data in which 100 kinds of values corresponding to 100 kinds of animals to be identified are arranged, and typically a vector having 100 elements. Data. Each element of the identification label L takes a value of 0 or more and 1 or less, and indicates that the image data input to the learning model M is 100 kinds of animals.

例えば、学習モデルＭに猫画像Ｉｃを入力したときに学習モデルＭが出力する識別ラベルＬｃは、「猫」に対応する要素が大きな値となり、その他の要素は小さな値となる。図１（ａ）−（ｂ）では、識別ラベルの要素の値が大きいほど黒く、小さいほど白くなるように図示している。このため、図１（ａ）に示す識別ラベルＬｃでは、「猫」に対応する要素は黒くそれ以外の要素は白くなっている。同様に、学習モデルＭに猿画像Ｉｍを入力した時に学習モデルＭが出力する識別ラベルＬｍは、「猿」に対応する要素は黒くそれ以外の要素は白くなっている。 For example, in the identification label Lc output from the learning model M when the cat image Ic is input to the learning model M, the element corresponding to “cat” has a large value, and the other elements have small values. FIGS. 1A and 1B show that the larger the value of the element of the identification label is, the blacker it is, and the smaller the value of the element is, the whiter it is. Therefore, in the identification label Lc shown in FIG. 1A, the element corresponding to "cat" is black and the other elements are white. Similarly, in the identification label Lm output from the learning model M when the monkey image Im is input to the learning model M, the element corresponding to “monkey” is black, and the other elements are white.

一般に、学習モデルＭが識別対象としていない被写体が写っている画像データを学習モデルＭに入力した場合、学習モデルＭが出力する識別ラベルＬの各要素は、予測することが難しいランダムな値となる。図１（ｂ）では、学習モデルＭが識別対象としていない風景画像Ｉｌを学習モデルＭに入力すると、学習モデルＭは、各要素がランダムな値である識別ラベルＬｌを出力したことを示している。同様に、学習モデルＭが識別対象としていない雪だるま画像Ｉｓを学習モデルＭに入力しても、学習モデルＭは、各要素がランダムな値である識別ラベルＬｓを出力する。ここで、図１（ｂ）に示すように、学習モデルＭに入力する画像データが異なると、出力される識別ラベルＬのパターンも異なる。 In general, when image data including a subject that is not an object to be identified by the learning model M is input to the learning model M, each element of the identification label L output by the learning model M is a random value that is difficult to predict. . In FIG. 1B, when a landscape image Il that the learning model M is not to identify is input to the learning model M, the learning model M outputs an identification label Ll in which each element is a random value. . Similarly, even if the snow model image Is not input to the learning model M is input to the learning model M, the learning model M outputs an identification label Ls in which each element is a random value. Here, as shown in FIG. 1B, when the image data input to the learning model M is different, the pattern of the output identification label L is also different.

実施の形態に係る情報処理装置は、まず、学習モデルに埋め込むための透かし画像と、透かし画像を埋め込む対象とする学習モデルとを取得する。次に、情報処理装置は、学習モデルＭが識別対象としていない被写体が写っている画像データセットを用意する。最後に、情報処理装置は、画像データセットを構成する複数の画像データそれぞれを学習モデルＭに入力したときに出力される複数の識別ラベルＬを重ね合わせることで透かし画像となるように、学習モデルＭを再学習（ファインチューニング；Fine-tuning）する。 First, the information processing apparatus according to the embodiment acquires a watermark image to be embedded in a learning model and a learning model to which the watermark image is to be embedded. Next, the information processing apparatus prepares an image data set in which a subject not identified by the learning model M is identified. Lastly, the information processing apparatus superimposes a plurality of identification labels L output when each of the plurality of image data constituting the image data set is input to the learning model M, so that the learning model can be obtained as a watermark image. Re-learn M (fine-tuning).

図２は、学習モデルＭのファインチューニングを説明するための模式図である。図１（ｂ）に示す風景画像Ｉｌと雪だるま画像Ｉｓとはともに、透かし埋め込み及び透かし検出用として情報処理装置が取得した画像データセットＩｗの一部である。 FIG. 2 is a schematic diagram for explaining fine tuning of the learning model M. Both the landscape image Il and the snowman image Is shown in FIG. 1B are a part of the image data set Iw acquired by the information processing device for watermark embedding and watermark detection.

実施の形態に係る情報処理装置は、画像データセットＩｗと同数のランダム画像であって、各ランダム画像を重ね合わせると透かし画像Ｗとなるランダム画像群を用意する。情報処理装置は、各ランダム画像の要素から構成されるベクトルを生成して教師用識別ラベルＬｔとする。情報処理装置は、教師用識別ラベルＬｔと画像データセットＩｗを構成する画像データとを対応づけた再学習用学習データを生成する。 The information processing apparatus according to the embodiment prepares a random image group that is the same number of random images as the image data set Iw and becomes a watermark image W when each random image is superimposed. The information processing apparatus generates a vector including the elements of each random image and sets the vector as the teacher identification label Lt. The information processing apparatus generates learning data for re-learning in which the teacher identification label Lt is associated with the image data forming the image data set Iw.

情報処理装置は、学習モデルＭに画像データを入力すると、その画像データに対応づけられた識別ラベルを出力するように、学習モデルＭをファインチューニングする。例えば、学習モデルＭがニューラルネットワークを用いて生成されている場合、画像データセットＩｗを構成する画像データを学習モデルＭに入力したときの出力と、その画像データに対応づけた教師用識別ラベルＬｔとの誤差を取得する。情報処理装置は、取得した誤差に基づいて誤差逆伝播法により、学習モデルＭのパラメータを更新する。 When the image data is input to the learning model M, the information processing apparatus finely tunes the learning model M so as to output an identification label associated with the image data. For example, when the learning model M is generated using a neural network, an output when image data constituting the image data set Iw is input to the learning model M, and a teacher identification label Lt associated with the image data. And get the error. The information processing device updates the parameters of the learning model M by the error backpropagation method based on the acquired error.

なお、情報処理装置は、再学習前の学習モデルＭに画像データを入力したときに出力される識別ラベルと類似した識別ラベルを、その画像データと対応づけて再学習用学習データを生成してもよい。これにより、画像データセットＩｗを構成する画像データを学習モデルＭに入力したときの出力と、その画像データに対応づけた教師用識別ラベルＬｔとの誤差が小さくなるため、再学習の収束を早めることが期待できる。 The information processing apparatus generates learning data for re-learning by associating an identification label similar to an identification label output when image data is input to the learning model M before re-learning with the image data. Is also good. Thereby, the error between the output when the image data constituting the image data set Iw is input to the learning model M and the teacher identification label Lt associated with the image data is reduced, and the convergence of the re-learning is accelerated. Can be expected.

図３は、複数の識別ラベルＬを重ね合わせることで透かし画像Ｗが形成される様子を模式的に示す図である。実施の形態に係る透かし検出装置は、画像データセットＩｗを構成する各画像データを再学習された学習モデルＭに入力し、出力される複数の識別ラベルＬｏを取得する。識別ラベルＬｏは１００行１列のベクトルであるが、透かし検出装置は、図３に示すように、各識別ラベルＬｏの要素を並べ替えて１０行１０列の画像データを生成する。透かし検出装置は、識別ラベルＬｏの要素を並べ替えて生成した画像を重ね合わせることにより、透かし画像Ｗを取得する。図３は、透かし画像がアルファベットの「ａｕ」という文字が描かれた画像である場合の例を示している。 FIG. 3 is a diagram schematically illustrating a state in which a watermark image W is formed by overlapping a plurality of identification labels L. The watermark detection device according to the embodiment inputs each image data constituting the image data set Iw to the retrained learning model M, and acquires a plurality of identification labels Lo to be output. Although the identification label Lo is a vector having 100 rows and 1 column, the watermark detection apparatus rearranges the elements of each identification label Lo to generate image data of 10 rows and 10 columns, as shown in FIG. The watermark detection device obtains the watermark image W by superimposing images generated by rearranging the elements of the identification label Lo. FIG. 3 shows an example in which the watermark image is an image in which letters of the alphabet “au” are drawn.

情報処理装置は、学習モデルＭのファインチューニングに用いた画像データセットＩｗと透かし画像Ｗとを、情報処理装置も含めて外部から改竄できない安全な保管場所に保管する。透かし検出装置は、透かし検出の対象となる学習モデルＭを取得した場合、保管場所から画像データセットＩｗを取得して透かし検出の対象となる学習モデルＭに入力する。透かし検出装置は、学習モデルＭが出力した識別ラベルＬｏを重ね合わせて生成した画像と、保管場所から取得した透かし画像Ｗとを比較する。比較の結果、画像が一致すれば、透かし検出装置は、透かし検出の対象となる学習モデルＭは、情報処理装置１が透かし画像を埋め込んだ学習モデルＭであると判定する。 The information processing device stores the image data set Iw and the watermark image W used for the fine tuning of the learning model M in a safe storage location including the information processing device that cannot be falsified from the outside. When acquiring the learning model M to be subjected to watermark detection, the watermark detection device acquires the image data set Iw from the storage location and inputs the acquired image data set Iw to the learning model M to be subjected to watermark detection. The watermark detection device compares the image generated by superimposing the identification labels Lo output by the learning model M with the watermark image W obtained from the storage location. As a result of the comparison, if the images match, the watermark detection device determines that the learning model M to be subjected to the watermark detection is the learning model M in which the information processing device 1 has embedded the watermark image.

このように、実施の形態に係る情報処理装置は、学習モデルＭに透かし画像Ｗを埋め込むことができる。また、実施の形態に係る透かし検出装置は、透かし画像Ｗが埋め込まれた学習モデルＭから透かし画像Ｗを取り出すことができる。このように、実施の形態に係る情報処理装置及び透かし検出装置は、学習モデルＭの出所を特定できるようにすることができる。 As described above, the information processing apparatus according to the embodiment can embed the watermark image W in the learning model M. Further, the watermark detection device according to the embodiment can extract the watermark image W from the learning model M in which the watermark image W is embedded. Thus, the information processing device and the watermark detection device according to the embodiment can specify the source of the learning model M.

＜実施の形態に係る情報処理装置１の機能構成＞
図４は、実施の形態に係る情報処理装置１の機能構成を模式的に示す図である。情報処理装置１は、記憶部１０と制御部１１とを備える。図４において、矢印は主なデータの流れを示しており、図４に示していないデータの流れがあってもよい。図４において、各機能ブロックはハードウェア（装置）単位の構成ではなく、機能単位の構成を示している。そのため、図４に示す機能ブロックは単一の装置内に実装されてもよく、あるいは複数の装置内に別れて実装されてもよい。機能ブロック間のデータの授受は、データバス、ネットワーク、可搬記憶媒体等、任意の手段を介して行われてもよい。 <Functional Configuration of Information Processing Apparatus 1 According to Embodiment>
FIG. 4 is a diagram schematically illustrating a functional configuration of the information processing apparatus 1 according to the embodiment. The information processing device 1 includes a storage unit 10 and a control unit 11. In FIG. 4, arrows indicate main data flows, and there may be data flows not shown in FIG. In FIG. 4, each functional block shows not a configuration of a hardware (device) unit but a configuration of a functional unit. Therefore, the functional blocks shown in FIG. 4 may be implemented in a single device, or may be separately implemented in a plurality of devices. The transfer of data between the functional blocks may be performed via any means such as a data bus, a network, a portable storage medium, and the like.

記憶部１０は、情報処理装置１を実現するコンピュータのＢＩＯＳ（Basic Input Output System）等を格納するＲＯＭ（Read Only Memory）や情報処理装置１の作業領域となるＲＡＭ（Random Access Memory）、ＯＳ（Operating System）やアプリケーションプログラム、当該アプリケーションプログラムの実行時に参照される種々の情報を格納するＨＤＤ（Hard Disk Drive）やＳＳＤ（Solid State Drive）等の大容量記憶装置である。 The storage unit 10 includes a read only memory (ROM) for storing a basic input output system (BIOS) of a computer that implements the information processing device 1, a random access memory (RAM) serving as a work area of the information processing device 1, and an OS ( It is a large-capacity storage device such as a hard disk drive (HDD) or a solid state drive (SSD) that stores an operating system, an application program, and various information referred to when the application program is executed.

制御部１１は、情報処理装置１のＣＰＵやＧＰＵ等のプロセッサであり、記憶部１０に記憶されたプログラムを実行することによって、モデル取得部１１０、透かしパターン取得部１１１、画像取得部１１２、重畳用パターン取得部１１３、学習用データ生成部１１４、モデル適用部１１５、及び学習実行部１１６として機能する。 The control unit 11 is a processor such as a CPU or a GPU of the information processing device 1, and executes a program stored in the storage unit 10 to execute a model acquisition unit 110, a watermark pattern acquisition unit 111, an image acquisition unit 112, It functions as the use pattern acquisition unit 113, the learning data generation unit 114, the model application unit 115, and the learning execution unit 116.

なお、図４は、情報処理装置１が単一の装置で構成されている場合の例を示している。しかしながら、情報処理装置１は、例えばクラウドコンピューティングシステムのように複数のプロセッサやメモリ等の計算リソースによって実現されてもよい。この場合、制御部１１を構成する各部は、複数の異なるプロセッサの中の少なくともいずれかのプロセッサがプログラムを実行することによって実現される。 FIG. 4 shows an example in which the information processing device 1 is configured by a single device. However, the information processing apparatus 1 may be implemented by a plurality of processors and computation resources such as memories, such as a cloud computing system. In this case, each unit constituting the control unit 11 is realized by at least one of a plurality of different processors executing a program.

モデル取得部１１０は、学習モデルＭを取得する。モデル取得部１１０が取得する学習モデルＭは、画像が入力されると識別ラベルＬを出力するように機械学習された学習モデルＭである。ここで、学習モデルＭが出力する識別ラベルＬは、入力された画像データが複数の識別対象それぞれの画像データである蓋然性を示す複数の指標値から構成される識別ラベルである。 The model acquisition unit 110 acquires a learning model M. The learning model M acquired by the model acquiring unit 110 is a learning model M that has been machine-learned so as to output an identification label L when an image is input. Here, the identification label L output by the learning model M is an identification label composed of a plurality of index values indicating the probability that the input image data is the image data of each of the plurality of identification targets.

透かしパターン取得部１１１は、学習モデルＭに埋め込むための透かしパターンを取得する。透かしパターン取得部１１１が取得する透かしパターンは、例えば上述した透かし画像Ｗである。しかしながら、透かしパターン取得部１１１が取得する透かしパターンは画像データに限らず、例えば１次元のベクトルであってもよい。 The watermark pattern acquisition unit 111 acquires a watermark pattern to be embedded in the learning model M. The watermark pattern acquired by the watermark pattern acquisition unit 111 is, for example, the watermark image W described above. However, the watermark pattern acquired by the watermark pattern acquisition unit 111 is not limited to image data, and may be, for example, a one-dimensional vector.

画像取得部１１２は、学習モデルＭに透かしパターンを埋め込むための再学習に用いる再学習用学習データを構成する透かし用画像データセットを取得する。画像取得部１１２が取得する透かし用画像データセットの具体例は、上述した画像データセットＩｗである。 The image obtaining unit 112 obtains a watermark image data set constituting learning data for re-learning used for re-learning for embedding a watermark pattern in the learning model M. A specific example of the watermark image data set acquired by the image acquisition unit 112 is the above-described image data set Iw.

重畳用パターン取得部１１３は、透かし用画像データセットに含まれる画像データと同数のパターンであって、各パターンの重ね合わせが透かしパターンとなる複数のパターンを取得する。重畳用パターン取得部１１３が取得する複数のパターンの詳細は後述する。 The superimposition pattern acquisition unit 113 acquires a plurality of patterns that are the same number as the image data included in the watermark image data set, and in which the superposition of each pattern becomes a watermark pattern. Details of the plurality of patterns acquired by the superimposition pattern acquisition unit 113 will be described later.

学習用データ生成部１１４は、透かし用画像データセットに含まれる画像データと複数のパターンとの組み合わせを再学習用学習データとして生成する。これにより、情報処理装置１は、透かしパターンを埋め込む対象とする学習モデルＭに透かしパターンを埋め込むためのファインチューニングで用いる再学習用学習データを生成することができる。このため、情報処理装置１は、学習モデルＭの出所を特定できるようにすることができる。 The learning data generating unit 114 generates a combination of the image data included in the watermark image data set and a plurality of patterns as learning data for re-learning. Thereby, the information processing apparatus 1 can generate learning data for re-learning used in fine tuning for embedding the watermark pattern in the learning model M in which the watermark pattern is to be embedded. For this reason, the information processing apparatus 1 can specify the source of the learning model M.

ここで、モデル適用部１１５は、透かし用画像データセットに含まれる画像データそれぞれを入力とする学習モデルＭの出力である複数の識別ラベルＬを取得する。重畳用パターン取得部１１３は、モデル適用部１１５が取得した複数の識別ラベルの重ね合わせが透かしパターンとなるように、複数の識別ラベルＬのうちの少なくとも一部の識別ラベルを修正した識別ラベルを上述のパターンとして取得する。学習用データ生成部１１４は、各識別ラベルとその識別ラベルに対応する画像データとの組み合わせを再学習用学習データとして生成する。 Here, the model applying unit 115 acquires a plurality of identification labels L which are outputs of the learning model M which receives image data included in the watermark image data set, respectively. The superimposition pattern acquisition unit 113 converts an identification label obtained by correcting at least a part of the plurality of identification labels L so that the superposition of the plurality of identification labels acquired by the model application unit 115 becomes a watermark pattern. It is acquired as the above-mentioned pattern. The learning data generation unit 114 generates a combination of each identification label and image data corresponding to the identification label as learning data for re-learning.

図５（ａ）−（ｂ）は、実施の形態に係る学習用データ生成部１１４による識別ラベルの修正の一例を説明するための図である。また、図６は、実施の形態に係る学習用データ生成部１１４の内部構成を模式的に示す図である。 FIGS. 5A and 5B are diagrams illustrating an example of correction of an identification label by the learning data generation unit 114 according to the embodiment. FIG. 6 is a diagram schematically illustrating an internal configuration of the learning data generation unit 114 according to the embodiment.

上述したように、学習モデルＭが出力する識別ラベルＬは、学習モデルＭが識別可能な複数の識別対象と同数の要素から構成されるベクトルである。透かしパターンが透かし画像Ｗである場合、透かし画像Ｗは識別ラベルＬの要素と同数の画素から構成される画像である。 As described above, the identification label L output by the learning model M is a vector including the same number of elements as the plurality of identification targets that can be identified by the learning model M. When the watermark pattern is the watermark image W, the watermark image W is an image including the same number of pixels as the elements of the identification label L.

識別ラベルを修正するため、学習用データ生成部１１４は、変換部１１４０と修正部１１４１とを備えている。変換部１１４０は、複数の識別ラベルそれぞれの要素を所定の手順にしたがって透かしパターンである透かし画像Ｗの画素配列と同じ配列の２次元データに変換する。図５（ａ）−（ｂ）において、符号Ｌが付された画像は、変換部１１４０が識別ラベルＬから変換した画像である。図５（ａ）−（ｂ）に示すように、透かし用画像データセットに含まれる画像データは学習モデルＭの識別対象ではないため、画像に変換された識別ラベルＬは、ノイズ画像のような画像となっている。 To correct the identification label, the learning data generation unit 114 includes a conversion unit 1140 and a correction unit 1141. The conversion unit 1140 converts each element of the plurality of identification labels into two-dimensional data having the same array as the pixel array of the watermark image W, which is a watermark pattern, according to a predetermined procedure. In FIGS. 5A and 5B, the image denoted by reference symbol L is an image converted by the conversion unit 1140 from the identification label L. As shown in FIGS. 5A and 5B, since the image data included in the watermark image data set is not an identification target of the learning model M, the identification label L converted into an image is similar to a noise image. It is an image.

修正部１１４１は、変換部１１４０が変換した複数の２次元データのそれぞれと透かしパターンである透かし画像Ｗとに基づいて、各２次元データを修正する。具体的には、図５（ａ）−（ｂ）において、白抜きの丸で示す記号はアダマール積を表す。変換部１１４０は、透かし画像Ｗと画像化した識別ラベルＬとのアダマール積を、上述のパターン（すなわち、再学習に用いる教師用識別ラベルＬｔ）とする。これにより、モデル適用部１１５は、各識別ラベルＬの特徴を残しつつ、かつ透かし画像Ｗの特徴を含めた教師用識別ラベルＬｔを取得することができる。 The correction unit 1141 corrects each two-dimensional data based on each of the plurality of two-dimensional data converted by the conversion unit 1140 and the watermark image W that is a watermark pattern. Specifically, in FIGS. 5A and 5B, symbols indicated by white circles represent Hadamard products. The conversion unit 1140 sets the Hadamard product of the watermark image W and the imaged identification label L as the above-described pattern (that is, the teacher identification label Lt used for re-learning). Accordingly, the model applying unit 115 can acquire the teacher identification label Lt including the feature of the watermark image W while leaving the feature of each identification label L.

なお、透かし画像Ｗの背景（図３の例では「ａｕ」という文字以外の領域）の画素値を０とした場合、教師用識別ラベルＬｔの背景に対応する領域も０となる。透かし画像Ｗの背景の画素値を０に近い実数とすると、教師用識別ラベルＬｔの背景に対応する領域も０以外の値を持つことになる。図３は、透かし画像Ｗの背景の画素値が０以外の値である場合の例を示している。 When the pixel value of the background of the watermark image W (the area other than the character “au” in the example of FIG. 3) is 0, the area corresponding to the background of the teacher identification label Lt is also 0. If the pixel value of the background of the watermark image W is a real number close to 0, the area corresponding to the background of the teacher identification label Lt also has a value other than 0. FIG. 3 shows an example where the background pixel value of the watermark image W is a value other than 0.

図４の説明に戻る。学習実行部１１６は、学習用データ生成部１１４が生成した再学習用学習データを用いて学習モデルＭをファインチューニングによって再学習する。これにより、学習実行部１１６は、学習モデルＭに透かし画像Ｗを埋め込むことができる。 Returning to the description of FIG. The learning execution unit 116 re-learns the learning model M by fine tuning using the re-learning learning data generated by the learning data generation unit 114. Thereby, the learning execution unit 116 can embed the watermark image W in the learning model M.

ここで、学習実行部１１６は、学習モデルＭの再学習時における学習率（学習モデルＭのモデルパラメータの更新幅）を、学習モデルＭの生成時における学習率より小さくしてもよい。これにより、学習実行部１１６は、再学習用学習データによって学習モデルＭが過学習されることを抑制できる。 Here, the learning execution unit 116 may set the learning rate at the time of re-learning the learning model M (the update width of the model parameters of the learning model M) to be smaller than the learning rate at the time of generating the learning model M. Accordingly, the learning execution unit 116 can suppress the learning model M from being over-learned by the learning data for re-learning.

また、学習実行部１１６は、学習モデルＭの再学習時に、学習用データ生成部１１４が生成した再学習用学習データに加えて、学習モデルＭの生成時に用いた学習データを加えたデータを用いて、学習モデルＭを再学習してもよい。このとき、学習実行部１１６は、再学習用学習データによって学習モデルＭが過学習されることを抑制するために、学習用データ生成部１１４が生成した再学習用学習データに加える画像データの数が、学習用データ生成部１１４が生成した再学習用学習データに含まれる画像データよりも多くなるようにしてもよい。学習用データ生成部１１４が生成した再学習用学習データの数と、加算する学習用データの数との比率は、過学習の抑制と再学習にかかるコスト等を勘案して実験により定めればよいが、例えば１対１０である。これにより、学習実行部１１６は、再学習用学習データによって学習モデルＭが過学習されることを抑制できる。 The learning execution unit 116 uses data obtained by adding the learning data used when generating the learning model M, in addition to the learning data for re-learning generated by the learning data generation unit 114 when re-learning the learning model M. Then, the learning model M may be re-learned. At this time, the learning execution unit 116 determines the number of image data to be added to the re-learning learning data generated by the learning data generation unit 114 in order to prevent the learning model M from being over-learned by the re-learning learning data. May be larger than the image data included in the re-learning learning data generated by the learning data generation unit 114. The ratio between the number of learning data for re-learning generated by the learning data generation unit 114 and the number of learning data to be added is determined by experiments in consideration of suppression of over-learning and costs for re-learning. Good, but for example one to ten. Accordingly, the learning execution unit 116 can suppress the learning model M from being over-learned by the learning data for re-learning.

図２を参照して説明したように、学習実行部１１６は、再学習用学習データを入力とした場合に学習モデルＭが出力する識別ラベルＬと、教師用識別ラベルＬｔとの誤差に基づいて、誤差逆伝播法に基づいて学習モデルＭのパラメータを更新する。このため、識別ラベルＬと教師用識別ラベルＬｔとの誤差が小さい方が再学習の収束が早く、かつ再学習後の学習モデルＭが学習前に持っていた識別性能を維持しやすいと考えられる。 As described with reference to FIG. 2, the learning execution unit 116 determines based on the error between the identification label L output by the learning model M when the learning data for re-learning is input and the identification label Lt for teacher. , The parameters of the learning model M are updated based on the backpropagation method. For this reason, it is considered that the smaller the error between the identification label L and the teacher identification label Lt, the faster the re-learning convergence and the easier it is to maintain the identification performance that the learning model M after re-learning had before learning. .

そのため、学習用データ生成部１１４は、画像選択部１１４２も備えている。画像選択部１１４２は、変換部１１４０が変換した２次元データと透かしパターンである透かし画像Ｗとの差異が所定の除外条件を満たす場合、その２次元データに対応する画像データを透かし用画像データセットから除外する。 Therefore, the learning data generation unit 114 also includes an image selection unit 1142. When the difference between the two-dimensional data converted by the conversion unit 1140 and the watermark image W as the watermark pattern satisfies a predetermined exclusion condition, the image selection unit 1142 converts the image data corresponding to the two-dimensional data into a watermark image data set. Exclude from

ここで「所定の除外条件」とは、画像選択部１１４２が再学習用画像データを構成する画像データを選択する際に参照する「画像選択条件」である。所定の除外条件の具体的な内容は、学習の収束性や透かし画像Ｗの特徴等を考慮して定めればよい。一例としては、２次元データと透かし画像Ｗとの差分画像の強度（例えば差分画像の画素値の絶対値の和や、差分画像の画素値の二乗和）が、透かし画像Ｗの強度の半分以上となることを、所定の除外条件としてもよい。 Here, the “predetermined exclusion condition” is an “image selection condition” that is referred to when the image selection unit 1142 selects the image data constituting the re-learning image data. The specific contents of the predetermined exclusion condition may be determined in consideration of the convergence of learning, the characteristics of the watermark image W, and the like. As an example, the intensity of the difference image between the two-dimensional data and the watermark image W (for example, the sum of the absolute values of the pixel values of the difference image or the sum of the squares of the pixel values of the difference image) is equal to or more than half of the intensity of the watermark image W. May be a predetermined exclusion condition.

これにより、学習用データ生成部１１４は、学習実行部１１６による学習モデルＭのファインチューニングの収束性を向上することができ、再学習にかかるコストを低減することができる。 Accordingly, the learning data generation unit 114 can improve the convergence of the fine tuning of the learning model M by the learning execution unit 116, and can reduce the cost for re-learning.

以上、学習モデルＭに埋め込む透かしパターンが１つであることを前提として説明したが、透かしパターンは複数であってもよい。この場合、透かしパターン取得部１１１は、学習モデルＭに埋め込むための２以上の異なる透かしパターンを取得することになる。画像取得部１１２は、異なる透かしパターンそれぞれに対応する複数の透かし用画像データセットを取得する。これにより、学習用データ生成部１１４は、異なる透かしパターンそれぞれに対応する再学習用学習データを生成することができる。モデル適用部１１５は、学習用データ生成部１１４が生成した再学習用学習データを用いて学習モデルＭをファインチューニングすることにより、学習モデルＭに複数種類の透かしパターンを埋め込むことができる。 Although the description has been made on the assumption that one watermark pattern is embedded in the learning model M, a plurality of watermark patterns may be provided. In this case, the watermark pattern acquisition unit 111 acquires two or more different watermark patterns to be embedded in the learning model M. The image acquisition unit 112 acquires a plurality of watermark image data sets corresponding to different watermark patterns. Thereby, the learning data generation unit 114 can generate learning data for re-learning corresponding to each of the different watermark patterns. The model application unit 115 can embed a plurality of types of watermark patterns in the learning model M by fine tuning the learning model M using the learning data for re-learning generated by the learning data generation unit 114.

以上、情報処理装置１が学習モデルＭに透かしパターンを埋め込む工程について説明した。続いて、透かしパターンが埋め込まれた学習モデルＭから透かしパターンを検出する工程について説明する。 The process in which the information processing apparatus 1 embeds the watermark pattern in the learning model M has been described above. Next, a process of detecting a watermark pattern from the learning model M in which the watermark pattern is embedded will be described.

＜実施の形態に係る透かし検出装置２の機能構成＞
図７は、実施の形態に係る透かし検出装置２の機能構成を模式的に示す図である。透かし検出装置２は、記憶部２０と制御部２１とを備える。図７において、矢印は主なデータの流れを示しており、図７に示していないデータの流れがあってもよい。図７において、各機能ブロックはハードウェア（装置）単位の構成ではなく、機能単位の構成を示している。そのため、図７に示す機能ブロックは単一の装置内に実装されてもよく、あるいは複数の装置内に別れて実装されてもよい。機能ブロック間のデータの授受は、データバス、ネットワーク、可搬記憶媒体等、任意の手段を介して行われてもよい。 <Functional Configuration of Watermark Detection Apparatus 2 According to Embodiment>
FIG. 7 is a diagram schematically illustrating a functional configuration of the watermark detection device 2 according to the embodiment. The watermark detection device 2 includes a storage unit 20 and a control unit 21. In FIG. 7, arrows indicate main data flows, and there may be data flows not shown in FIG. In FIG. 7, each functional block shows a configuration of a function unit, not a configuration of a hardware (device) unit. Therefore, the functional blocks illustrated in FIG. 7 may be implemented in a single device, or may be separately implemented in a plurality of devices. The transfer of data between the functional blocks may be performed via any means such as a data bus, a network, a portable storage medium, and the like.

記憶部２０は、透かし検出装置２を実現するコンピュータのＢＩＯＳ等を格納するＲＯＭや透かし検出装置２の作業領域となるＲＡＭ、ＯＳやアプリケーションプログラム、当該アプリケーションプログラムの実行時に参照される種々の情報を格納するＨＤＤやＳＳＤ等の大容量記憶装置である。 The storage unit 20 stores a ROM for storing a BIOS or the like of a computer that implements the watermark detection device 2, a RAM serving as a work area of the watermark detection device 2, an OS, an application program, and various information referred to when the application program is executed. It is a large-capacity storage device such as an HDD or SSD for storing.

制御部２１は、透かし検出装置２のＣＰＵやＧＰＵ等のプロセッサであり、記憶部２０に記憶されたプログラムを実行することによって、モデル取得部２１０、画像取得部２１１、モデル適用部２１２、パターン出力部２１３、透かしパターン取得部２１４、及びパターン提示部２１５として機能する。 The control unit 21 is a processor such as a CPU and a GPU of the watermark detection device 2, and executes a program stored in the storage unit 20 to execute a model acquisition unit 210, an image acquisition unit 211, a model application unit 212, and a pattern output unit. Function as a unit 213, a watermark pattern acquisition unit 214, and a pattern presentation unit 215.

なお、図７は、透かし検出装置２が単一の装置で構成されている場合の例を示している。しかしながら、透かし検出装置２は、例えばクラウドコンピューティングシステムのように複数のプロセッサやメモリ等の計算リソースによって実現されてもよい。この場合、制御部１１を構成する各部は、複数の異なるプロセッサの中の少なくともいずれかのプロセッサがプログラムを実行することによって実現される。また、透かし検出装置２の機能の少なくとも一部又は全部を、情報処理装置１が備えていてもよい。 FIG. 7 shows an example in which the watermark detection device 2 is configured by a single device. However, the watermark detection device 2 may be realized by a plurality of processors and computational resources such as memories as in a cloud computing system. In this case, each unit constituting the control unit 11 is realized by at least one of a plurality of different processors executing a program. The information processing device 1 may include at least a part or all of the functions of the watermark detection device 2.

モデル取得部２１０は、透かし検出の対象とする学習モデルＭを取得する。学習モデルＭは、入力された画像データが複数の識別対象それぞれの画像データである蓋然性を示す複数の指標値から構成される識別ラベルＬを出力するように機械学習された学習モデルである。 The model acquisition unit 210 acquires a learning model M to be subjected to watermark detection. The learning model M is a learning model that has been machine-learned so as to output an identification label L including a plurality of index values indicating the probability that the input image data is image data of each of a plurality of identification targets.

画像取得部２１１は、学習モデルＭに埋め込まれた透かしを検出するための透かし用画像データセットを取得する。具体的には、画像取得部２１１が取得する透かし用画像データセットは、モデル適用部１１５が学習モデルＭをファインチューニングする際に使用した再学習用学習データを構成する透かし用画像データセットと同一である。 The image acquisition unit 211 acquires a watermark image data set for detecting a watermark embedded in the learning model M. Specifically, the watermark image data set acquired by the image acquisition unit 211 is the same as the watermark image data set constituting the learning data for re-learning used when the model application unit 115 fine-tunes the learning model M. It is.

モデル適用部２１２は、透かし用画像データセットに含まれる画像データそれぞれを入力とする学習モデルＭの出力である複数の識別ラベルＬを取得する。パターン出力部２１３は、複数の識別ラベルＬを重ね合わせて生成されるパターンを出力する。 The model applying unit 212 acquires a plurality of identification labels L which are outputs of the learning model M which receives image data included in the watermark image data set as input. The pattern output unit 213 outputs a pattern generated by overlapping a plurality of identification labels L.

図８（ａ）−（ｅ）は、実施の形態に係るパターン出力部２１３が出力するパターンの一例を示す図である。具体的には、図８（ａ）は、６０個の異なる識別ラベルＬを重ね合わせてできるパターンを示す図であり、図８（ｂ）は、２４０個の異なる識別ラベルＬを重ね合わせてできるパターンを示す図である。同様に、図８（ｃ）は、約２０００個の異なる識別ラベルＬを重ね合わせてできるパターンを示す図であり、図８（ｄ）は、３０００個の異なる識別ラベルＬを重ね合わせてできるパターンを示す図である。なお、図８（ｅ）は、透かし画像Ｗの元画像である。図８（ｄ）に示すパターンに２値化処理を施すと、図８（ｅ）に示す画像と一致する。 FIGS. 8A to 8E are diagrams illustrating an example of a pattern output by the pattern output unit 213 according to the embodiment. Specifically, FIG. 8A is a diagram illustrating a pattern formed by superimposing 60 different identification labels L, and FIG. 8B is a diagram illustrating a pattern obtained by superimposing 240 different identification labels L. It is a figure showing a pattern. Similarly, FIG. 8C is a diagram showing a pattern formed by overlapping about 2000 different identification labels L, and FIG. 8D is a pattern formed by overlapping 3000 different identification labels L. FIG. FIG. 8E shows the original image of the watermark image W. When the pattern shown in FIG. 8D is subjected to the binarization process, the pattern matches the image shown in FIG.

図８（ａ）−（ｄ）に示すように、パターン出力部２１３が重ね合わせる識別ラベルＬの数が少ないときは、識別ラベルＬを重ね合わせても透かし画像Ｗを再現することができない。これにより、万が一透かし用画像データセットに含まれる画像データの一部が第三者に漏洩したとしても、漏洩した画像データの数が少なければ、第三者は透かし画像Ｗを特定することが困難である。このため、実施の形態に係る情報処理装置１は、第三者が推定することが困難な態様で学習モデルＭに透かしパターンを埋め込むことができる。 As shown in FIGS. 8A to 8D, when the number of identification labels L to be superimposed by the pattern output unit 213 is small, the watermark image W cannot be reproduced even when the identification labels L are superimposed. Thereby, even if some of the image data included in the watermark image data set leaks to a third party, it is difficult for the third party to specify the watermark image W if the number of leaked image data is small. It is. For this reason, the information processing apparatus 1 according to the embodiment can embed the watermark pattern in the learning model M in a manner that is difficult for a third party to estimate.

図７の説明に戻る。透かしパターン取得部２１４は、検証対象となる透かしパターンを取得する。上述したように、透かしパターンである透かし画像Ｗは、透かし用画像データセットとともに外部から改竄できない安全な保管場所に保管されている。透かしパターン取得部２１４は、この保管場所から透かし画像Ｗを読み出して取得する。 Returning to the description of FIG. The watermark pattern acquisition unit 214 acquires a watermark pattern to be verified. As described above, the watermark image W, which is a watermark pattern, is stored together with the watermark image data set in a secure storage location that cannot be tampered with from the outside. The watermark pattern acquisition unit 214 reads and acquires the watermark image W from this storage location.

パターン提示部２１５は、パターン出力部２１３が出力したパターンと透かしパターン取得部２１４が取得した透かしパターンとを比較可能な態様で提示する。
図９は、実施の形態に係るパターン提示部２１５が提示する比較画面の一例を模式的に示す図である。図９は、学習モデルＭに「透かしＡ」と「透かしＢ」との２種類の透かしパターンが埋め込まれている場合の例を示している。図９において、「透かしＡ」はアルファベットの「ａｕ」という文字が描かれた画像であり、「透かしＢ」は人の顔が描かれた画像である。 The pattern presentation unit 215 presents the pattern output by the pattern output unit 213 and the watermark pattern acquired by the watermark pattern acquisition unit 214 in a manner that allows comparison.
FIG. 9 is a diagram schematically illustrating an example of a comparison screen presented by the pattern presentation unit 215 according to the embodiment. FIG. 9 shows an example in which two types of watermark patterns “watermark A” and “watermark B” are embedded in the learning model M. In FIG. 9, “watermark A” is an image in which the letters “au” of the alphabet are drawn, and “watermark B” is an image in which a human face is drawn.

図９に示すように、パターン提示部２１５は、情報処理装置１が学習モデルＭに埋め込んだ透かしパターンである透かし画像Ｗと、透かし検出装置２が透かし検出の対象とする学習モデルＭから検出した画像とを並べて表示部Ｄに表示する。これにより、透かし検出装置２のユーザは、透かし検出の対象とする学習モデルＭから検出した画像が透かし画像Ｗと一致するか否かを一見して確認することができる。 As illustrated in FIG. 9, the pattern presenting unit 215 detects a watermark image W that is a watermark pattern embedded in the learning model M by the information processing device 1 and a learning model M that is a target of the watermark detection by the watermark detection device 2. The images are displayed side by side on the display unit D. Thereby, the user of the watermark detection device 2 can see at a glance whether or not the image detected from the learning model M as the target of the watermark detection matches the watermark image W.

また、パターン提示部２１５は、透かし検出の対象とする学習モデルＭから検出した画像と透かし画像Ｗとが一致するか否かを演算によって判定し、その結果を表示部Ｄに提示するようにしてもよい。具体的には、パターン提示部２１５は透かし検出の対象とする学習モデルＭから検出した画像と透かし画像Ｗとの差分画像の強度が所定の条件（例えば、透かし画像Ｗの強度の１０％以下となること）を満たす場合、両者が一致すると判定すればよい。パターン提示部２１５が算出した判定結果を表示部Ｄに表示させることにより、透かし検出装置２のユーザは、透かし検出の対象とする学習モデルＭから検出した画像と透かし画像Ｗとが一致するか否かの判断の一助とすることができる。 Further, the pattern presenting unit 215 determines whether or not the image detected from the learning model M to be subjected to the watermark detection matches the watermark image W by calculation, and presents the result to the display unit D. Is also good. Specifically, the pattern presentation unit 215 determines that the strength of the difference image between the watermark image W and the image detected from the learning model M that is the target of the watermark detection is a predetermined condition (for example, 10% or less of the watermark image W strength). Satisfies), it may be determined that they match. By displaying the determination result calculated by the pattern presentation unit 215 on the display unit D, the user of the watermark detection device 2 can determine whether the image detected from the learning model M to be subjected to the watermark detection matches the watermark image W. This can help in the determination.

＜情報処理装置１が実行する学習方法の処理フロー＞
図１０は、実施の形態に係る情報処理装置１が実行する処理の流れを説明するためのフローチャートである。本フローチャートにおける処理は、例えば情報処理装置１が起動したときに開始する。 <Processing Flow of Learning Method Performed by Information Processing Apparatus 1>
FIG. 10 is a flowchart illustrating a flow of a process performed by information processing device 1 according to the embodiment. The process in this flowchart is started, for example, when the information processing device 1 is started.

モデル取得部１１０は、透かしパターンの埋め込み対象とする学習モデルＭを取得する（Ｓ２）。透かしパターン取得部１１１は、学習モデルＭに埋め込むための透かしパターンを取得する（Ｓ４）。画像取得部１１２は、学習モデルＭに透かしパターンを埋め込むための再学習に用いる再学習用学習データを構成する透かし用画像データセットを取得する（Ｓ６）。 The model acquisition unit 110 acquires a learning model M for embedding a watermark pattern (S2). The watermark pattern acquisition unit 111 acquires a watermark pattern to be embedded in the learning model M (S4). The image acquisition unit 112 acquires a watermark image data set that constitutes learning data for re-learning used for re-learning for embedding a watermark pattern in the learning model M (S6).

重畳用パターン取得部１１３は、透かし用画像データセットに含まれる画像データと同数のパターンであって、各パターンの重ね合わせが透かしパターンを再現するように構成された複数のパターンを取得する（Ｓ８）。学習用データ生成部１１４は、透かし用画像データセットに含まれる画像データと複数のパターンとの組み合わせを再学習用学習データとして生成する（Ｓ１０）。 The superimposition pattern acquisition unit 113 acquires a plurality of patterns that are the same number as the image data included in the watermark image data set and are configured so that the superposition of the patterns reproduces the watermark pattern (S8). ). The learning data generation unit 114 generates a combination of image data and a plurality of patterns included in the watermark image data set as re-learning learning data (S10).

学習実行部１１６は、再学習用学習データを用いてファインチューニングにより学習モデルＭを再学習する（Ｓ１２）。これにより、学習実行部１１６は、学習モデルＭに透かしパターンを埋め込むことができる。学習実行部１１６が再学習を実行すると、本フローチャートにおける処理は終了する。 The learning execution unit 116 re-learns the learning model M by fine tuning using the learning data for re-learning (S12). Thereby, the learning execution unit 116 can embed the watermark pattern in the learning model M. When the learning execution unit 116 executes the re-learning, the processing in this flowchart ends.

＜実施の形態に係る情報処理装置１が奏する効果＞
以上説明したように、実施の形態に係る情報処理装置１によれば、学習モデルＭの出所を特定できるようにすることができる。 <Effects of Information Processing Apparatus 1 According to Embodiment>
As described above, according to the information processing apparatus 1 according to the embodiment, the source of the learning model M can be specified.

以上、本発明を実施の形態を用いて説明したが、本発明の技術的範囲は上記実施の形態に記載の範囲には限定されず、その要旨の範囲内で種々の変形及び変更が可能である。例えば、装置の分散・統合の具体的な実施の形態は、以上の実施の形態に限られず、その全部又は一部について、任意の単位で機能的又は物理的に分散・統合して構成することができる。また、複数の実施の形態の任意の組み合わせによって生じる新たな実施の形態も、本発明の実施の形態に含まれる。組み合わせによって生じる新たな実施の形態の効果は、もとの実施の形態の効果を合わせ持つ。 As described above, the present invention has been described using the embodiment. However, the technical scope of the present invention is not limited to the scope described in the above embodiment, and various modifications and changes can be made within the scope of the gist. is there. For example, the specific embodiment of the distribution / integration of the apparatus is not limited to the above-described embodiment, and all or a part of the apparatus may be functionally or physically distributed / integrated in an arbitrary unit. Can be. Further, new embodiments that are generated by arbitrary combinations of the plurality of embodiments are also included in the embodiments of the present invention. The effect of the new embodiment caused by the combination has the effect of the original embodiment.

＜第１の変形例＞
上記では、情報処理装置１が学習モデルＭに透かしパターンを埋め込む場合について説明した。ここで、透かしパターンは一つの独立した図柄でなくてもよく、例えば、一つの画像を２以上の部分画像に分割し、各部分画像をそれぞれ透かしパターンとしてもよい。このとき、情報処理装置１は、少なくとも１つの部分画像を学習モデルＭに埋め込まずに保管用画像として保管する。この場合、透かし検出装置２は、学習モデルＭから各透かしパターンを検出した上でそれらを組み合わせ、かつ保管用画像と組み合わせることで初めて完全な画像を得る。各部分画像は、いわば電子的な「割符」として機能する。これにより、万が一第三者に透かしパターンの一部が漏洩したとしても、第三者はその透かしパターンから全体画像を構成することは困難であるため、安全性をより向上することができる。 <First Modification>
The case where the information processing apparatus 1 embeds the watermark pattern in the learning model M has been described above. Here, the watermark pattern does not have to be one independent pattern. For example, one image may be divided into two or more partial images, and each partial image may be used as a watermark pattern. At this time, the information processing apparatus 1 stores at least one partial image as a storage image without embedding it in the learning model M. In this case, the watermark detection device 2 obtains a complete image for the first time by detecting each watermark pattern from the learning model M, combining them, and combining with the storage image. Each partial image functions as an electronic "tally". Thus, even if a part of the watermark pattern leaks to the third party, it is difficult for the third party to compose the whole image from the watermark pattern, so that the security can be further improved.

＜第２の変形例＞
上記では、学習用データ生成部１１４が、複数の識別ラベルＬそれぞれの要素を所定の手順にしたがって透かしパターンの画素配列と同じ配列の２次元データに変換し、変換した複数の２次元データのそれぞれと透かしパターンとに基づいて、２次元データそれぞれを修正する場合について説明した。これに替えて、学習用データ生成部１１４は、透かしパターンの画素配列を所定の手順に従って識別ラベルＬと同じ配列の１次元データに変換し、変換した複数の１次元データのそれぞれと透かしパターンとに基づいて、識別ラベルＬを修正してもよい。 <Second Modification>
In the above description, the learning data generation unit 114 converts the elements of each of the plurality of identification labels L into two-dimensional data having the same array as the pixel array of the watermark pattern according to a predetermined procedure. The case has been described in which each of the two-dimensional data is corrected based on the watermark pattern. Instead, the learning data generation unit 114 converts the pixel array of the watermark pattern into one-dimensional data having the same array as the identification label L according to a predetermined procedure, and outputs each of the plurality of converted one-dimensional data and the watermark pattern. May be used to modify the identification label L.

＜第３の変形例＞
上記では、変換部１１４０が、透かし画像Ｗと画像化した識別ラベルＬとのアダマール積を、再学習に用いる教師用識別ラベルＬｔとする場合について説明した。しかしながら、教師用識別ラベルＬｔの取得方法は上記に限られない。例えば、変換部１１４０は、透かし画像Ｗと画像化した識別ラベルＬとのアダマール積の結果、画素値を大きい順に所定の数だけ選択し、残りは０又は０に近い実数（例えば、選択した画素値の中の最小値の１０％）としてもよい。さらに、変換部１１４０は、選択した画素の値を１とし、それ以外の画素の値は０又は０に近い実数として教師用識別ラベルＬｔを２値化してもよい。 <Third Modification>
In the above, the case where the converting unit 1140 sets the Hadamard product of the watermark image W and the imaged identification label L as the teacher identification label Lt used for re-learning has been described. However, the method of acquiring the teacher identification label Lt is not limited to the above. For example, as a result of the Hadamard product of the watermark image W and the imaged identification label L, the conversion unit 1140 selects a predetermined number of pixel values in descending order, and the rest are 0 or a real number close to 0 (for example, the selected pixel (10% of the minimum value among the values). Furthermore, the conversion unit 1140 may binarize the teacher identification label Lt assuming that the value of the selected pixel is 1 and the values of the other pixels are 0 or a real number close to 0.

１・・・情報処理装置
１０・・・記憶部
１１・・・制御部
１１０・・・モデル取得部
１１１・・・透かしパターン取得部
１１２・・・画像取得部
１１３・・・重畳用パターン取得部
１１４・・・学習用データ生成部
１１４０・・・変換部
１１４１・・・修正部
１１４２・・・画像選択部
１１５・・・モデル適用部
１１６・・・学習実行部
２・・・透かし検出装置
２０・・・記憶部
２１・・・制御部
２１０・・・モデル取得部
２１１・・・画像取得部
２１２・・・モデル適用部
２１３・・・パターン出力部
２１４・・・透かしパターン取得部
２１５・・・パターン提示部
Ｄ・・・表示部
DESCRIPTION OF SYMBOLS 1 ... Information processing apparatus 10 ... Storage part 11 ... Control part 110 ... Model acquisition part 111 ... Watermark pattern acquisition part 112 ... Image acquisition part 113 ... Superposition pattern acquisition part 114 learning data generation unit 1140 conversion unit 1141 correction unit 1142 image selection unit 115 model application unit 116 learning execution unit 2 watermark detection device 20 ... Storage unit 21 ... Control unit 210 ... Model acquisition unit 211 ... Image acquisition unit 212 ... Model application unit 213 ... Pattern output unit 214 ... Watermark pattern acquisition unit 215 ... .Pattern presentation section D: Display section

Claims

A model acquisition unit that acquires a learning model machine-learned to output an identification label including a plurality of index values indicating the probability that the input image data is the image data of each of the plurality of identification targets,
A watermark pattern acquisition unit that acquires a watermark pattern for embedding in the learning model,
An image acquisition unit that acquires a watermark image data set that constitutes learning data for re-learning used for re-learning for embedding the watermark pattern in the learning model;
A superimposition pattern acquisition unit that acquires the same number of patterns as the image data included in the watermark image data set, and acquires a plurality of patterns in which the superposition of the patterns is the watermark pattern.
A learning data generation unit that generates a combination of the image data and the plurality of patterns included in the watermark image data set as the re-learning learning data,
An information processing device comprising:

The image processing apparatus further includes a model application unit that obtains a plurality of identification labels that are outputs of the learning model, each of which includes input image data included in the watermark image data set,
The superimposing pattern acquisition unit is an identification label obtained by correcting at least a part of the plurality of identification labels so that the superposition of the plurality of identification labels acquired by the model application unit becomes the watermark pattern. Is obtained as the pattern,
The learning data generation unit generates a combination of each identification label and the image data corresponding to the identification label as the learning data for re-learning,
The information processing device according to claim 1.

Further comprising a learning execution unit for re-learning the learning model using the learning data for re-learning,
The information processing device according to claim 1.

The identification label is a vector composed of the same number of elements as the plurality of identification targets,
The watermark pattern is an image composed of the same number of pixels as the elements of the identification label,
The learning data generation unit includes:
A conversion unit configured to convert each element of the plurality of identification labels into two-dimensional data according to a predetermined procedure;
A correction unit that corrects each of the two-dimensional data based on each of the plurality of two-dimensional data converted by the conversion unit and the watermark pattern,
The conversion unit converts the corrected two-dimensional data into a format of the identification label.
The information processing device according to claim 1.

The learning data generation unit includes:
When the difference between the two-dimensional data converted by the conversion unit and the watermark pattern satisfies a predetermined exclusion condition, the image processing apparatus further includes an image selection unit that excludes image data corresponding to the two-dimensional data from the watermark image data set. ,
The information processing device according to claim 4.

The watermark pattern acquisition unit acquires two or more different watermark patterns for embedding in the learning model,
The image acquisition unit acquires a plurality of watermark image data sets corresponding to each of the different watermark patterns,
The learning data generation unit generates the re-learning learning data corresponding to each of the different watermark patterns.
The information processing device according to claim 1.

The processor
A step of acquiring a learning model machine-learned to output an identification label composed of a plurality of index values indicating the probability that the input image data is the image data of each of the plurality of identification targets,
Obtaining a watermark pattern for embedding in the learning model;
Acquiring a watermark image data set constituting retraining learning data used for retraining for embedding the watermark pattern in the learning model,
Acquiring the same number of patterns as the image data included in the watermark image data set, and obtaining a plurality of patterns in which each pattern is the watermark pattern,
Generating a combination of the image data and the plurality of patterns included in the watermark image data set as the learning data for re-learning,
Information processing method for executing.

On the computer,
A function of acquiring a learning model machine-learned to output an identification label composed of a plurality of index values indicating the probability that the input image data is the image data of each of the plurality of identification targets,
A function of acquiring a watermark pattern for embedding in the learning model;
A function of acquiring a watermark image data set constituting learning data for re-learning used for re-learning for embedding the watermark pattern in the learning model,
A function of acquiring a plurality of patterns, which are the same number of patterns as the image data included in the watermark image data set, and in which each pattern is the watermark pattern,
A function of generating a combination of the image data and the plurality of patterns included in the watermark image data set as the learning data for re-learning,
The program that realizes.

A learning model to be subjected to watermark detection, wherein the input image data is machine-learned so as to output an identification label including a plurality of index values indicating probabilities that are image data of a plurality of identification targets. A model acquisition unit for acquiring a learning model,
An image acquisition unit that acquires a watermark image data set for detecting a watermark embedded in the learning model;
A model application unit that obtains a plurality of identification labels that are outputs of the learning model, each of which receives image data included in the watermark image data set,
A pattern output unit that outputs a pattern generated by overlapping the plurality of identification labels,
A watermark detection device comprising:

A watermark pattern acquisition unit that acquires a watermark pattern to be verified;
A pattern presenting unit that presents the pattern output by the pattern output unit and the watermark pattern in a comparable manner,
The watermark detection device according to claim 9, further comprising:

The processor
A learning model to be subjected to watermark detection, wherein the input image data is machine-learned so as to output an identification label including a plurality of index values indicating probabilities that are image data of a plurality of identification targets. Obtaining a learning model;
Obtaining a watermark image data set for detecting a watermark embedded in the learning model;
Acquiring a plurality of identification labels that are outputs of the learning model with each of the image data included in the watermark image data set as an input,
Outputting a pattern generated by superimposing the plurality of identification labels,
Perform watermark detection method.

On the computer,
A learning model to be subjected to watermark detection, wherein the input image data is machine-learned so as to output an identification label including a plurality of index values indicating probabilities that are image data of a plurality of identification targets. A function to acquire a learning model,
A function of acquiring a watermark image data set for detecting a watermark embedded in the learning model,
A function of acquiring a plurality of identification labels that are outputs of the learning model with each of the image data included in the watermark image data set as an input,
A function of outputting a pattern generated by superimposing the plurality of identification labels,
The program that realizes.