JPH0346080A

JPH0346080A - Feature extraction device

Info

Publication number: JPH0346080A
Application number: JP1181255A
Authority: JP
Inventors: Koichi Higuchi; 浩一樋口; Yoshiyuki Yamashita; 山下　義征
Original assignee: Oki Electric Industry Co Ltd
Current assignee: Oki Electric Industry Co Ltd
Priority date: 1989-07-13
Filing date: 1989-07-13
Publication date: 1991-02-27
Anticipated expiration: 2012-06-04
Also published as: JP2616994B2

Abstract

PURPOSE:To extract a feature with few fluctuation by converting a distance between the remarking point of a subpattern and a character line to a scale following the local position fluctuation of the character line by using the projection distribution (marginal distribution) of the character line of a character/graphic pattern, and extracting the sum of K-the power of a converted distance as the feature of a character. CONSTITUTION:A feature extraction device 10 is comprised of a subpattern extraction part 12, a character projection part 14, a centroid coordinate system extraction part 16, and a feature extraction part 18. And the distance between the remarking point of the subpattern and the position of the character line is expressed in a conversion value setting a centroid coordinate number attached on a centroid coordinate system in sequence of size of a coordi nate value as the scale by using the projection distribution (marginal distribution) of the character line of the character/graphic pattern, and first and second features can be extracted from the sum of K-th power of the distance expressed in the conversion value. The scale of the conversion value follows the local position conversion of the character line, therefore, the fluctuation of the character recognized as the same character can be suppressed even when the fluctuation of the position of the character line occurs, which stabilizes the feature. In such a way, the feature extraction device with few quantity of fluctuation can be obtained.

Description

【発明の詳細な説明】（産業上の利用分野）この発明は特徴抽出装置、特に文字認識におｆする特徴
抽出に用いる装置に関する。DETAILED DESCRIPTION OF THE INVENTION (Field of Industrial Application) The present invention relates to a feature extraction device, and particularly to a device used for feature extraction for character recognition.

（従来の技術）従来の文字認識装置では、文字パタンよりストロークを
抽出し、これら抽出されたストロークの位置、長さ、ス
トローク間の相互位置関係等を用いて認識処理を行なう
方式が多く採用されている。この種の装置においては、
（１）文字図形の輪郭を追跡することにより検出された
輪郭点系列について曲率を計算し、その曲率の大きな値
の点を分割点として輪郭系列を分割し、分割された系列
を組合せることによりストロークを抽出するか、又は（
２）文字図形パタンに細線化処理を行なって骨格化しそ
の骨格パタンの連結性や骨格パタンを追跡することによ
って得られる急激な角度の変化点等を検出してストロー
クを抽出し、抽出したストロークについて幾何学的な特
徴等を抽出し文字図形の識別を行なっていた。しかしな
がら（１）の方法は文字図形パタンか大きくなったり文
字図形パタンか複雑化したりすると、処理量が増大しそ
のため処理速度の低下を招くという欠点を有し、（２）
の方法は文字図形パタンを細線化するという複雑な処理
を行なう必要があり、またこの細線化によってパタンの
びずみ、ヒゲ等が発生しこれらひずみ、ヒゲ等を除去す
る処理が処理全体を複雑化するという欠点があった。(Prior art) Conventional character recognition devices often employ a method of extracting strokes from character patterns and performing recognition processing using the positions and lengths of these extracted strokes, mutual positional relationships between strokes, etc. ing. In this type of device,
(1) By calculating the curvature of a contour point series detected by tracing the contour of a character figure, dividing the contour series using points with large values of curvature as dividing points, and combining the divided series. Extract strokes or (
2) Extract strokes by performing thinning processing on character and figure patterns to create a skeleton, detecting the connectivity of the skeleton pattern and points of sudden angle changes obtained by tracing the skeleton pattern, and extracting strokes. Characters and shapes were identified by extracting geometric features. However, method (1) has the disadvantage that when the character/graphic pattern becomes large or complex, the amount of processing increases, resulting in a decrease in processing speed.
The method requires complicated processing of thinning the character/figure pattern, and this thinning causes distortions, whiskers, etc. in the pattern, and the process to remove these distortions, whiskers, etc. complicates the entire process. There was a drawback.

そこでこれらの欠点を除去するために文字図形パタン内
の各点にの点を着目点と称す）から各走査方向へ走査線
を出し、走査線と文字線との交差数を当該走査線の着目
点に関する特徴として抽出する方法が提案されている。Therefore, in order to eliminate these defects, scanning lines are drawn in each scanning direction from each point in the character figure pattern (called a point of interest), and the number of intersections between the scanning line and the character line is calculated as the point of interest for the scanning line. A method of extracting points as features has been proposed.

しかしながら特徴として抽出される交差数は文字線の傾
斜の変化に伴ない変化してばらつき、従って特徴が不安
定となる。また単に交差数のみでは文字の構造を反映す
るのに不十分である。However, the number of intersections extracted as a feature changes and varies as the slope of the character line changes, and therefore the feature becomes unstable. Furthermore, the mere number of intersections alone is insufficient to reflect the structure of characters.

これらの問題を回避する方法として、特開昭６０−５７
４７４号公報に開示されている特徴抽出方法がある。こ
の方法では文字図形パタンの所定方向のスト口−り成分
を表すサブパタンを抽出し、サブパタンの各点、若しく
は任意の点を着目点とし着目点から所定の他の方向をみ
たときの文字線分布状態を表す特徴を抽出する。このた
め着目点と、当該着目点を通る他の方向の走査線上で当
該着目点より一方の側に位置するすべての文字線との距
Ｍ％求め、これら一方の側の文字線との距離のに乗和を
当該着目点に関する第一の特徴とし、ざらに着目点と、
当該着目点を通る他の方向の走査線上で当該着目点より
他方の側に位置する文字線との距離を求め、これら他方
の側の文字線との距離のに乗和を当該着目点に関する第
二の特徴として抽出する。　これら第一及び第二の特徴
を用いて抽出した文字図形パタンの特徴に基づき文字図
形パタンを認識できる。As a way to avoid these problems, JP-A-60-57
There is a feature extraction method disclosed in Japanese Patent No. 474. In this method, a subpattern representing a stroke component in a predetermined direction of a character figure pattern is extracted, and each point or arbitrary point of the subpattern is set as a point of interest, and the character line distribution when viewed from the point of interest in another predetermined direction. Extract features that represent the state. Therefore, the distance M% between the point of interest and all the character lines located on one side of the point of interest on the scanning line in the other direction passing through the point of interest is calculated, and the distance from the character line on one side is calculated. Let the sum of the products be the first feature regarding the point of interest, and roughly the point of interest,
Find the distance to the character line located on the other side of the point of interest on the scanning line in the other direction passing through the point of interest, and calculate the sum of the products of the distances to the character line on the other side to the point of interest related to the point of interest. It is extracted as the second feature. The character/graphic pattern can be recognized based on the characteristics of the character/graphic pattern extracted using these first and second features.

（発明が解決しようとする課題）しかしながら上述した特開昭６０−５７４７４号公報に
開示の従来技術では第一及び第二の特徴を印刷文字や手
書き文字から抽出した場合に、第一及び第二の特徴が文
字の大きさの変動や文字線位置の変動によって変化する
という問題点があった。(Problem to be Solved by the Invention) However, in the prior art disclosed in Japanese Patent Application Laid-Open No. 60-57474 mentioned above, when the first and second features are extracted from printed characters or handwritten characters, There was a problem in that the characteristics of the characters changed with changes in the size of the characters and the position of the character lines.

印刷文字の場合、活字の大きさや印字の傾斜が変動しこ
の結果着目点と文字線との位置が変動して第一及び第二
の特徴が変化する。また手書き文字の場合には、文字の
書き手によって文字の大きさや文字の傾斜文字線位置が
変動しこの結果着目点と文字線と距離が変動して第一及
び第二の特徴が変化する。In the case of printed characters, the size of the type and the inclination of the printed characters change, and as a result, the position between the point of interest and the character line changes, and the first and second features change. Furthermore, in the case of handwritten characters, the size of the characters and the position of the slanted character line of the characters change depending on the writer of the characters, and as a result, the distance between the point of interest and the character line changes, and the first and second characteristics change.

文字認識において、このような特徴の変動に対応する辞
Ｉ！用意しないと認識率が低下し、また特徴の変動に対
応する辞Ｎを用意すると辞書量が増大しその結果辞書照
合に要する時間が長くなり認識速度が低下する。In character recognition, the word I! corresponds to such variation in characteristics. If the dictionary is not prepared, the recognition rate will decrease, and if the dictionary N corresponding to the variation of the feature is prepared, the amount of the dictionary will increase, and as a result, the time required for dictionary matching will become longer and the recognition speed will decrease.

この発明の目的は上述した従来の問題点を解決するため
、変動の少ない特徴を抽出できる特徴抽出部Ｗを提供す
ることにある。An object of the present invention is to provide a feature extraction unit W that can extract features with little variation in order to solve the above-mentioned conventional problems.

（課題を解決するための手段）この目的の達成を図るため、この発明の特徴抽出装置は
、文字図形パタンを所定の複数の方向に走査してそれぞれ
の走査方向における文字線の断面を検出し、断面長が文
字図形パタンに関する文字線幅より十分長い断面の文字
線を抽出することによってそれぞれの走査方向のストロ
ーク成分を表すサブパタンを抽出するサブパタン抽出部
と、互いに交差する第一及び第二の投影軸上にそれぞれ
文字図形パタンを投影して文字線の第一及び第二投影分
布を求める文字投影部と、文字図形パタンの文字外接枠内の第一及び第二投影分布
の投影軸上における重心座標を求め、文字外接枠内の第
一投影分布の重心座標を最初の分割点座標とし、文字外
接枠の領域を第一の投影軸の方向に分割点座標で分割し
て第一の分割領域を設定し、第一の分割領域内の投影分
布の投影軸上における重心座標を求め、求めた第一投影
分布の重心座標を新な９害１）点座標として第一の分割
領域内の第一投影分布の重心座標を求める処理を所定回
数繰り返して第一の投影軸上の重心座標系列を求め、文字外接枠内の第二投影分布の重心座標を最初の他の分
割点座標とし、文字外接枠の領域を第二の投影軸の方向
に他の分割点座標で分割して第二の分割領域を設定し、
第二の分割領域内の第二投影分布の投影軸上における重
心座標を求め、求めた第二投影分布の重心座標を新な他
の分割点座標として第二の分割領域内の第二投影分布の
重心座標を求める処理を所定回数繰り返して第二の投影
軸上の重心座標系列を求める重心座標系列抽出部と、抽出したサブパタンのそれぞれについて文字外接枠内の
各点又は任意の点を着目点とし、サブパタンを、当該サ
ブパタン抽出時の走査方向に直交し着目点を通る他の方
向に走査して当該走査線上の文字線位置を検出し、これら着目点及び文字線位置と重心座標系列どの位１１
関係に基づいて、これら着目点及び文字線位＠を重心座
標系列に座標値の大きさの順に付した重心座標番号を尺
度とする変換値に変換し、着目点と、当該着目点の一方
の側の走査線上のすべての文字線位置のそれぞれとの変
換語ｍを変換値に基づいて求め、これら一方の側の各変
換距離のに乗の和（Ｋは定数）を当該着目点に関する第
一の特徴として抽出し、着目点と、当該着目点の他方の側の走査線上のすべての
文字線位置のそれぞれとの変換距離を変換値に基づいて
求め、これら他方の側の各変換距離のに乗の和（Ｋは定
数）を当該着目点に関する第一の特徴として抽出する特
徴抽出部とを備えて成ることを特徴とする。(Means for Solving the Problem) In order to achieve this object, the feature extraction device of the present invention scans a character figure pattern in a plurality of predetermined directions and detects a cross section of a character line in each scanning direction. , a sub-pattern extraction unit that extracts sub-patterns representing stroke components in each scanning direction by extracting a character line whose cross-sectional length is sufficiently longer than the character line width of the character-figure pattern; a character projection unit that projects each character figure pattern onto a projection axis to obtain first and second projection distributions of character lines; Find the coordinates of the center of gravity, set the coordinates of the center of gravity of the first projection distribution within the character circumscribing frame as the coordinates of the first division point, and divide the area of the character circumscription frame in the direction of the first projection axis using the coordinates of the division point to perform the first division. Set the area, find the barycentric coordinates on the projection axis of the projected distribution in the first divided area, and use the barycentric coordinates of the first projected distribution as new point coordinates in the first divided area. Repeat the process of determining the barycenter coordinates of the first projection distribution a predetermined number of times to obtain the barycenter coordinate series on the first projection axis, and set the barycenter coordinates of the second projection distribution within the character circumscribing frame as the coordinates of the first other division point, Divide the area of the character circumscribing frame at other dividing point coordinates in the direction of the second projection axis to set a second divided area,
Find the barycenter coordinates on the projection axis of the second projection distribution in the second divided area, and use the barycenter coordinates of the second projection distribution as the coordinates of another new division point for the second projection distribution in the second divided area. a barycenter coordinate series extraction unit that repeats the process of obtaining the barycenter coordinates a predetermined number of times to obtain a barycenter coordinate series on the second projection axis; Then, the subpattern is scanned in another direction that is perpendicular to the scanning direction when extracting the subpattern and passes through the point of interest, and the position of the character line on the scanning line is detected. 11
Based on the relationship, these points of interest and character line positions @ are converted into conversion values using barycenter coordinate numbers added to the barycenter coordinate series in order of the size of the coordinate values, and the points of interest and one of the points of interest are converted. The conversion word m for each of all character line positions on the scanning line on one side is calculated based on the conversion value, and the sum of the powers of each conversion distance on one side (K is a constant) is calculated as the first The conversion distances between the point of interest and all character line positions on the scanning line on the other side of the point of interest are determined based on the conversion values, and the conversion distances of each of these conversion distances on the other side are calculated. The feature extraction unit extracts the sum of powers (K is a constant) as the first feature regarding the point of interest.

（作用）このような構成の特徴抽出装置によれば、文字図形パタ
ンの文字線の投影分布（周辺分布）を求める。そして、
文字外接枠内の周辺分布の重心座標を最初の分割点座標
とし、文字外接枠の領域を分割点座標で分割して分割領
域を設定し、分割領域内の周辺分布の重心座標を求める
。そして求めた重心座標を新な分割点座標とし新な分割
領域を設定して重心座標を求める処理を所定回数繰り返
し、よって所定個数の重心座標から成る重心座標系列を
得る。(Operation) According to the feature extraction device having such a configuration, the projection distribution (marginal distribution) of character lines of a character figure pattern is determined. and,
The coordinates of the center of gravity of the peripheral distribution within the character circumscribing frame are set as the coordinates of the first dividing point, the area of the character circumscribing frame is divided at the coordinates of the dividing point to set divided regions, and the coordinates of the center of gravity of the peripheral distribution within the divided region are determined. Then, the process of determining the barycenter coordinates by setting a new divided region with the obtained barycenter coordinates as new division point coordinates is repeated a predetermined number of times, thereby obtaining a barycenter coordinate series consisting of a predetermined number of barycenter coordinates.

そして着目点と文字線位置との間の距離を、重心座標系
列に座標値の大きさの順に付した重心座標番号を尺度と
する変換値で表し、この変換値で表した距離のに乗和か
ら第一及び第二の特徴を抽出する。変換値の尺度は文字
線の局所的な位置変動に追従するので、文字線位置の変
動かあっても同一文字と認識されるべき文字の特徴の変
動を小ざくでき、特徴が安定する。Then, the distance between the point of interest and the character line position is expressed as a converted value using the barycentric coordinate number added to the barycentric coordinate series in order of the size of the coordinate value, and the sum of the distances expressed by this converted value is Extract the first and second features from. Since the scale of the conversion value follows the local positional variation of the character line, even if there is a variation in the position of the character line, the variation in the characteristics of characters to be recognized as the same character can be reduced, and the characteristics are stabilized.

重心座標系列の重心座標の個数が増えるに従って、同一
文字と認識されるべき文字の特徴の変動をより少なくし
、特徴をより安定にすることができる。As the number of barycenter coordinates in the barycenter coordinate series increases, variations in the characteristics of characters to be recognized as the same character can be reduced and the characteristics can be made more stable.

（実施例）以下、図面を譬照し、この発明の実施例につき説明する
。尚、′図面はこの発明が理解できる程度に概略的に示
しであるにすぎず、従って各構成成分の接続間係、入出
力信号の流れ、配設位置、形状及び寸法を図示例に限定
するものではない。(Embodiments) Hereinafter, embodiments of the present invention will be described with reference to the drawings. It should be noted that the drawings are only schematic illustrations to the extent that the present invention can be understood, and therefore, the connections, input/output signal flows, arrangement positions, shapes, and dimensions of each component are limited to the illustrated examples. It's not a thing.

第１図はこの発明の詳細な説明に供する機能７０９９図
であり、この発明の特徴抽出部Ｎを用いて構成した文字
認識装置の一例を示す。FIG. 1 is a functional diagram 7099 for explaining the present invention in detail, and shows an example of a character recognition device configured using the feature extraction section N of the present invention.

同図において１０はこの発明の実施例の特徴抽出装置を
示し、この特徴抽出部１０は、サブパタン抽出部１２、
文字投影部１４、重心座標系列抽出部１６及び特徴抽出
部１８から成る。In the figure, reference numeral 10 indicates a feature extraction device according to an embodiment of the present invention, and this feature extraction section 10 includes a sub-pattern extraction section 12,
It consists of a character projection section 14, a barycentric coordinate sequence extraction section 16, and a feature extraction section 18.

サブパタン抽出部１２は、文字図形パタンを所定の複数
の方向に走査してそれぞれの走査方向における文字線の
断面を検出し、断面長が文字図形パタンに関する文字線
幅より十分長い断面の文字線を抽出することによってそ
れぞれの走査方向のストローク成分を表すサブパタンを
抽出する。The sub-pattern extraction unit 12 scans the character/figure pattern in a plurality of predetermined directions, detects the cross-section of the character line in each scanning direction, and extracts a character line whose cross-sectional length is sufficiently longer than the character line width related to the character/figure pattern. By extracting, sub-patterns representing stroke components in each scanning direction are extracted.

文字投影部１４は、互いに交差する例えば直交する第一
及び第二の投影軸上にそれぞれ文字図形パタンを投影し
て文字線の第一及び第二投影分布を求める。The character projection unit 14 projects character graphic patterns onto first and second projection axes that intersect, for example, are perpendicular to each other, to obtain first and second projection distributions of character lines.

重心座標系列抽出部１６は、文字図形パタンの文字外接
枠内の第一及び第二投影分布の投影軸上における重心座
標を求める。そして文字外接枠内の第一投影分布の重心
座標を最初の分割点座標とし、文字外接枠の領Ｋｔを第
一の投影軸の方向に分割点座標で分割して第一の分割領
域を設定し、第一の分割領域内の投影分布の投影軸上に
おける重心座標を求め、この第一投影分布の重心座標を
新な分割点座標として第一の分割領域内の第一投影分布
の重心座標を求める処理を所定回数繰り返して第一の投
影軸上の重心座標系列を求める。また文字外接枠内の第
二投影分布の重心座標を最初の他の分割点座標とし、文
字外接枠の領域を第二の投影軸の方向に他の分割点座標
で分割して第二の分割類ｔｉを設定し、第二の分割領域
内の第二投影分布の投影軸上における重心座標を求め、
第二投影分布の重心座標を新な他の分割点座標として第
二の分割領域内の第二投影分布の重心座標を求める処理
を所定回数繰り返して第二・の投影軸上の重心座標系列
を求める。The barycenter coordinate series extraction unit 16 obtains the barycenter coordinates on the projection axis of the first and second projection distributions within the character circumscribing frame of the character graphic pattern. Then, the barycenter coordinates of the first projection distribution in the character circumscribing frame are set as the first dividing point coordinates, and the region Kt of the character circumscribing frame is divided in the direction of the first projection axis by the dividing point coordinates to set the first divided area. Then, find the barycenter coordinates on the projection axis of the projection distribution in the first divided area, and use the barycenter coordinates of the first projection distribution as the new division point coordinates to calculate the barycenter coordinates of the first projection distribution in the first divided area. The barycentric coordinate series on the first projection axis is obtained by repeating the process of obtaining , a predetermined number of times. In addition, the centroid coordinates of the second projection distribution within the character circumscribing frame are set as the first other dividing point coordinates, and the area of the character circumscribing frame is divided in the direction of the second projection axis using the other dividing point coordinates, and the second division is performed. Set the class ti, find the coordinates of the center of gravity on the projection axis of the second projection distribution in the second divided area,
The process of determining the barycenter coordinates of the second projection distribution in the second divided area is repeated a predetermined number of times by using the barycenter coordinates of the second projection distribution as new other division point coordinates to obtain the barycenter coordinate series on the second projection axis. demand.

特徴抽出部１８は、抽出したサブパタンのそれぞれにつ
いて文字外接枠内の各点又は任意の点を着目点とし、サ
ブパタンを、当該サブパタン抽出時の走査方向に直交し
着目点を通る他の方向に走査して当該走査線上の文字線
位Ｍ％検出する。そしてこれら着目点及び文字線位置と
重心座標系列との位１！１１１１係に基づいて、これら
着目点及び文字線位″Ｍを重心座標系列に座標値の大き
さの順に付した重心座標番号を尺度とする変換値に変換
する。The feature extraction unit 18 takes each point or arbitrary point within the character circumscribing frame as a point of interest for each of the extracted sub-patterns, and scans the sub-pattern in another direction that is orthogonal to the scanning direction at the time of extracting the sub-pattern and passes through the point of interest. Then, the character line position M% on the scanning line is detected. Then, based on the order 1!1111 of these points of interest and character line positions and the barycenter coordinate series, the barycenter coordinate numbers are obtained by adding these points of interest and character line positions "M" to the barycenter coordinate series in order of the magnitude of the coordinate values. Convert to a converted value to be used as a scale.

そして着目点と、当該着目点の一方の側の走査線上のす
べての文字線位置のそれぞれとの変換距離を変換値に基
づいて求め、これら一方の側の各変換距離のに乗の和（
Ｋは定数）を当該着目点に関する第一の特徴として抽出
する。また着目点と、当該着目点の他方の側の走査線上
のすべての文字線位置のそれぞれとの変換路ＭＩＦｒ変
換値に基づいて求め、これら他方の側の各変換距離のに
乗の和（Ｋは定数）を当該着目点に関する第二の特徴と
して抽出する。Then, the conversion distance between the point of interest and each of all character line positions on the scanning line on one side of the point of interest is determined based on the conversion value, and the sum of the powers of each conversion distance on one side (
K is a constant) is extracted as the first feature regarding the point of interest. In addition, the conversion path MIFr conversion value between the point of interest and each of all character line positions on the scanning line on the other side of the point of interest is calculated, and the sum of the powers of each conversion distance on the other side (K is a constant) is extracted as the second feature regarding the point of interest.

また第１図において２０は特徴抽出装置１０ヲ用いて構
成した文字認識装置の一例を示し、この文字認識装Ｎ２
０は、認識対象の文字図形パタンを得る前処理部２２と
、文字図形パタンの文字外接枠を検出する文字枠検出部
２４と、文字図形パタンを走査して文字図形パタンに関
する線幅を検出する線幅計算部２６と、特徴抽出部！１
０と、認識対象の特徴ベクトルを抽出する特徴ベクトル
抽出部２８と、特徴ベクトルに基づいて認識対象の識別
を行なう識別部３０と、標準文字パタンの特徴ベクトル
を格納する辞′Ｎ３２とから成る。Further, in FIG. 1, numeral 20 indicates an example of a character recognition device configured using the feature extraction device 10, and this character recognition device N2
0 includes a preprocessing unit 22 for obtaining a character/figure pattern to be recognized, a character frame detection unit 24 for detecting a character circumscribing frame of the character/figure pattern, and a character frame detection unit 24 for scanning the character/figure pattern to detect line widths related to the character/figure pattern. Line width calculation section 26 and feature extraction section! 1
0, a feature vector extraction unit 28 that extracts feature vectors of recognition targets, an identification unit 30 that identifies recognition targets based on the feature vectors, and a word 'N32 that stores feature vectors of standard character patterns.

以下、この実施例の特徴抽出装置１０と、第１図に示す
例の文字認識装置１２とにつきより詳細に説明する。Hereinafter, the feature extraction device 10 of this embodiment and the character recognition device 12 of the example shown in FIG. 1 will be explained in more detail.

（前処理部）この例では、前処理部２２を、白黒２値に量子化したデ
ィジタル信号の文字図形パタンを出力する光電変換部３
４と、文字図形パタンを格納するパタンレジスタ３６と
から構成する。(Preprocessing unit) In this example, the preprocessing unit 22 is a photoelectric conversion unit 3 that outputs a character/figure pattern of a digital signal quantized into black and white binary.
4, and a pattern register 36 for storing character/graphic patterns.

光電変換部３４は帳票等の情報媒体の所定の読取領域を
例えば１２８Ｘ１２８の画素に分解し、情報媒体からの
反射光りを各画素毎に白黒２Ｊａのディジタル信号（文
字図形パタン）に変換し、ディジタル信号をパタンレジ
スタ３６及び線幅計算部２６に対し出力する。文字図形
パタンの白ビ・ント及び黒ビットはそれぞれ文字線及び
文字背景部を表すものとする。尚、この例の光電変換部
３４は、−文字単位に光学的に読取りを行なって一文字
分の文字図形パタンを出力するものである。The photoelectric conversion unit 34 decomposes a predetermined reading area of an information medium such as a form into, for example, 128 x 128 pixels, converts the reflected light from the information medium into a black and white 2Ja digital signal (character/figure pattern) for each pixel, A signal is output to the pattern register 36 and line width calculation section 26. The white bits and black bits of the character figure pattern represent the character line and character background, respectively. The photoelectric conversion section 34 in this example optically reads each negative character and outputs a character/figure pattern for one character.

パタンレジスタ３６は光電変換部３４からの文字図形パ
タンを格納する。パタンレジスタ３６上にはＸ−Ｙ座標
系を仮想的に設定しており、このＸ−Ｙ座標系で表現さ
れる画素位置の画素データをパタンレジスタ３６に書込
みまたパタンレジスタ３６から読出すことが自由自在に
行なえるように威している。パタンレジスタ３６は読取
領域の画素数に対応して例えば１２８Ｘ１２８ビツトの
メモリ容量を有する。尚、平均的な文字−文字性の大き
さは６０ｘ６０ビツトである。The pattern register 36 stores the character/figure pattern from the photoelectric conversion section 34. An X-Y coordinate system is virtually set on the pattern register 36, and pixel data at pixel positions expressed in this X-Y coordinate system can be written to or read from the pattern register 36. It gives them the freedom to do what they want. The pattern register 36 has a memory capacity of, for example, 128×128 bits, corresponding to the number of pixels in the reading area. Note that the average character-to-character size is 60x60 bits.

第２図は文字図形パタンの一例を示す図である。同図に
おいで０８Ｇはパタンレジスタ３６に格納した文字図形
パタンであり、この例では文字図形パタンを一単位の文
字図形のみを含むパタンとする。また同図のＸ及びＹ軸
はパタンレジスタ３６上に設定した座標軸であり、パタ
ンレジスタ３６における２次元平面の左下を原点とする
。FIG. 2 is a diagram showing an example of a character/figure pattern. In the figure, 08G is a text/figure pattern stored in the pattern register 36, and in this example, the text/figure pattern is a pattern containing only one unit of text/figure. Further, the X and Y axes in the figure are coordinate axes set on the pattern register 36, and the origin is at the lower left of the two-dimensional plane in the pattern register 36.

（文字枠検出部）文字枠検出部２４はパタンレジスタ３６の文字図形パタ
ンを走査して文字図形パタンの一単位の文字図形に外接
する方形枠（文字外接枠）の左端座標ＸＬ、右端座標Ｘ
Ｒ１上端座標Ｙ□及び下端座標Ｙ、を検出する。第２図
において文字外接枠を符号Ｇを付した一点鎖線で示した
。文字外接枠は（ＸＬ　、ＹＴ　）、（ＸＬ　、　Ｙｌ
ｌ　）、（×６、ＹＴ）及び（Ｘｓ　、Ｙｅ　）の４点
を結ぶ矩形枠となる。(Character frame detection unit) The character frame detection unit 24 scans the character figure pattern in the pattern register 36 and determines the left end coordinate XL and right end coordinate X of a rectangular frame (character circumscribing frame) circumscribing one unit of character figure of the character figure pattern.
The R1 upper end coordinate Y□ and lower end coordinate Y are detected. In FIG. 2, the character circumscribing frame is shown by a dashed line with the symbol G. Character circumscribing frames are (XL, YT), (XL, Yl
It becomes a rectangular frame connecting four points: l ), (×6, YT), and (Xs , Ye ).

文字枠検出部２４はこれら座標ＸＬ　、Ｘｓ　、　Ｙｔ
及びＹ８を重心座標系列抽出部１６及び特徴ベクトル抽
出部２８に出力する。The character frame detection unit 24 detects these coordinates XL, Xs, Yt
and Y8 are output to the barycenter coordinate sequence extraction section 16 and the feature vector extraction section 28.

（線幅計算部）線幅計算部２６は光電変換部３４からの文字図形パタン
を入力し、例えば２×２の窓の全ての点が黒ビットとな
る状態の個数Ｑと、文字図形パタン中の全黒ビットの個
数とを計数し、従来周知の式（１）に従って線幅Ｗを算
出する。(Line Width Calculation Unit) The line width calculation unit 26 inputs the character/figure pattern from the photoelectric conversion unit 34 and calculates, for example, the number Q of states in which all points in a 2×2 window are black bits, and the character/figure pattern in the character/figure pattern. The number of all black bits is counted, and the line width W is calculated according to the conventionally well-known formula (1).

ｗ＝Ａ／　（Ａ−Ｑ）　　−・−（１）尚、線幅があら
かじめわかっている場合には線幅計算部を省略してもよ
い。w=A/(A-Q) -.-(1) Note that if the line width is known in advance, the line width calculation section may be omitted.

（サブパタン抽出部）サブパタン抽出部１２文字図形パタンを複数の方向に走
査して各走査列毎の黒ビットの連続個数を検出し当該黒
ビツト連続個数と線幅とに基づいて複数の走査方向毎に
対応した複数のサブパタンを抽出するサブパタン抽出部
３０と、サブパタン抽出部１２は、パタンレジスタ３６上に設定
したＸ軸方向に垂直な方向（垂直方向）及び平行な方向
（水平方向）と、Ｘ軸から反時計方向４５°の方向（右
斜め４５°方向）及び時計方向４５°の方向（左斜め４
５°方向）とを、主走査方向としてパタンレジスタ３６
の文字図形パタンを走査し各主走査方向に対応する垂直
、水平、有斜め及び左斜めサブパタンを抽出する。(Sub-pattern extraction unit) Sub-pattern extraction unit 12 Scans the character/figure pattern in multiple directions, detects the number of consecutive black bits in each scanning line, and extracts the number of consecutive black bits in each scanning direction based on the number of consecutive black bits and the line width. The sub-pattern extraction unit 30 extracts a plurality of sub-patterns corresponding to the 45° counterclockwise from the axis (45° diagonal to the right) and 45° clockwise (45° diagonal to the left)
5° direction) is set as the main scanning direction in the pattern register 36.
The character/figure pattern is scanned and vertical, horizontal, diagonal, and left diagonal sub-patterns corresponding to each main scanning direction are extracted.

従ってサブパタン抽出部１２は、図示せずも、垂直サブ
パタン抽出部、水平サブパタン抽出部、右斜めサブパタ
ン抽出部及び左斜めサブパタン抽出部から成る。Therefore, the sub-pattern extraction section 12 includes a vertical sub-pattern extraction section, a horizontal sub-pattern extraction section, a right diagonal sub-pattern extraction section, and a left diagonal sub-pattern extraction section, although not shown.

垂直サブパタン抽出部は垂直方向を主走査方向としてパ
タンレジスタ３６の文字図形パタンを全面走査し、垂直
方向の走査線上で連続する黒ビット（黒ラン）を検出す
る。この黒ランが文字線の断面を表す、そして、検出し
た黒ラフのなかがら次式（２）を満足する長ざｌの黒ラ
ンを抽出する。The vertical sub-pattern extraction section scans the entire character/figure pattern of the pattern register 36 with the vertical direction as the main scanning direction, and detects continuous black bits (black runs) on the vertical scanning line. This black run represents the cross section of the character line, and a black run with a length l that satisfies the following equation (2) is extracted from the detected black rough.

ｌ≧Ｎ−ｗ　　　　　・・・・・・（２）但し、ｌは主
走査方向における黒ランの長さ（文字線の断面層）、Ｎ
は各サブパタンについでそれぞれ任意好適に設定される
定数である。この実施例では全てのサブパタンについて
例えばＮ＝２とする。l≧N-w (2) However, l is the length of the black run in the main scanning direction (cross-sectional layer of the character line), N
is a constant set arbitrarily and suitably for each sub-pattern. In this embodiment, for example, N=2 for all subpatterns.

垂直サブパタン抽出部は式（２）を満足する黒ランをサ
ブパタンを構成する黒ランとみなして図示しない垂直サ
ブパタンメモリに格納する。The vertical sub-pattern extraction section regards the black runs that satisfy equation (2) as black runs constituting a sub-pattern and stores them in a vertical sub-pattern memory (not shown).

式（２）を満足しない黒ランは白ヒツトとみなす。Black runs that do not satisfy equation (2) are considered white hits.

同様に水平、右斜め及び左斜めサブバタシ抽出部は、水
平、右斜め及び左斜め方向を主走査方向としてパタンレ
ジスタ３６の文字図形パタンを走査し、それぞれの主走
査方向の走査線上の黒ランのなかから（２）式を満足す
る黒ランを抽出し、抽出した黒ランをサブパタンを構成
する黒ランとみなして図示しない水平、右斜め及び左斜
めサブパタンメモリに格納する。Similarly, the horizontal, right diagonal, and left diagonal sub-battery extraction sections scan the character/figure patterns in the pattern register 36 using the horizontal, right diagonal, and left diagonal directions as the main scanning directions, and detect black runs on the scanning lines in the respective main scanning directions. A black run that satisfies equation (2) is extracted from among them, and the extracted black run is regarded as a black run constituting a subpattern and is stored in horizontal, right diagonal, and left diagonal subpattern memories (not shown).

各サブパタンメモリ上にもパタンレジスタ３６と同様に
Ｘ−Ｙ座標系を設定しており、パタンレジスタ３６上の
画素位置と対応する画素位置に、各サブパタンを構成す
る黒ランを格納する。尚、各サブパタンメモリは、パタ
ンレジスタ３６と同様例えば１２８Ｘ１２８ビツトのメ
モリ容量を有する。An X-Y coordinate system is set on each sub-pattern memory in the same way as the pattern register 36, and black runs forming each sub-pattern are stored at pixel positions corresponding to pixel positions on the pattern register 36. Note that each sub-pattern memory has a memory capacity of, for example, 128×128 bits, similar to the pattern register 36.

第３図は第２図に示す文字図形パタンから抽出したサブ
パタンを示す図であり、第３図（Ａ）、（Ｂ）、（Ｃ）
及び（０）はそれぞれ垂直サブパタンｖＳＰ、水平サブ
パタン日ＳＰ、右斜めサブパタンＲ８Ｐ及び左斜めサブ
パタンＬＳＰｔ示す、これら図のＸ及びＹ軸はパタンレ
ジスタ３６のＸ及びＹ軸に対応させてサブパタンメモリ
上に設定した座標軸である。Fig. 3 is a diagram showing sub-patterns extracted from the character/figure pattern shown in Fig. 2, and Fig. 3 (A), (B), (C)
and (0) respectively indicate vertical sub-pattern vSP, horizontal sub-pattern SP, right diagonal sub-pattern R8P and left diagonal sub-pattern LSPt. This is the coordinate axis set to .

（パタン投影部）この実施例では第一及び第二の投影軸をパタンレジスタ
３６上に設定したＸ軸及びＹ軸とする。(Pattern Projection Unit) In this embodiment, the first and second projection axes are the X-axis and Y-axis set on the pattern register 36.

この実施例の文字投影部１４は文字外接枠内の文字図形
パタンを互いに直交するＸ軸及びＹ軸上に投影して第一
投影分布として黒ビツト分布５Ｘ（ｘ）及び第二投影分
布として黒ビツト分布５Ｙ（ｙ）　ｌｊ＆求める。The character projection unit 14 of this embodiment projects the character figure pattern within the character circumscribing frame onto the X-axis and Y-axis that are orthogonal to each other, and obtains a black bit distribution 5X(x) as a first projection distribution and a black bit distribution as a second projection distribution. Find bit distribution 5Y(y) lj&.

黒ビツト分布５Ｘ（ｘ）及びＳＹ（ｙ）は次式（３）に
従って得られる。The black bit distributions 5X(x) and SY(y) are obtained according to the following equation (3).

ｙ＝Ｙｔ但しｘ、ｙはパタンレジスタ３６上のＸ−Ｙ座標系にお
ける座標を表し例えばＯ〜１２７の整数値を取り、Ｐ（
ｘ、ｙ）はＰ（ｘ、ｙ）　＝　１のとき黒ビット（有意
色）及びＰ（ｘ、ｙ）＝Ｏのとき白ビット（背景色）を
表し、ＸＬ　、ＸＲ、ＹＴ及びＹ８は一単位の文字図形
の外接枠に間する左端、右端、上端及び下端座標を表す
。y=Yt However, x and y represent the coordinates in the X-Y coordinate system on the pattern register 36, and take integer values from 0 to 127, for example, and P(
x,y) represents the black bit (significant color) when P(x,y) = 1 and the white bit (background color) when P(x,y) = O, and XL, XR, YT and Y8 are the same. Represents the coordinates of the left end, right end, top end, and bottom end of the circumscribed frame of the character figure of the unit.

（重心座標系列抽出部）この実施例の重心座標系列抽出部１６は、第一の投影軸
上の重心座標系列として黒ビツト分布５Ｘ（ｘ）の重心
座標系列Ｘ　（Ｍ、）を、また第二の投影軸上の重心座
標系列として黒ビツト分布ＳＹ（ｙ）の重心座標系列Ｙ
　（Ｍ、）１Ｆｒ求める０重心座標系列Ｘ（Ｍ、）、Ｙ
　（Ｍ、）は、文字外接枠内の領域又は分割領域におけ
る一次モーメントの和をその領域内の黒ビットの個数（
黒ビツト分布５Ｘ（ｘ）　、５Ｙ（ｙ）　）で除すこと
によって求められるものである。但し、Ｍ、及びＭ、は
座標値の大きさの順に重心座標に付す重心座標番号であ
り、この例では座標値の小ざい順に付す０Ｍ、は１≦Ｍ
、≦ＭＸｔ満足する自然数でありＭＸは奇数であり重心
座標系列Ｘ（Ｍ、）として検出される重心座標の総個数
を表し、またＭ、は１≦Ｍ、≦ＭＹヲ満足する自然数で
ありＭＹは奇数であり重心座標系列Ｙ　（Ｍ、）として
検出される重心座標の総個数を表す、　ＭＸ及びＭＹは
任意好適な値に設定されるが、好ましくは文字図形パタ
ンの大きさに近い値とするのがよく例えば文字図形パタ
ンの大きざを６０ｘ６０ビツトとした場合には１５又は
３１個程度を採用すればよい。(Barycenter coordinate series extraction unit) The barycenter coordinate series extraction unit 16 of this embodiment extracts the barycenter coordinate series X (M,) of the black bit distribution 5X(x) as the barycenter coordinate series on the first projection axis, and The barycenter coordinate series Y of the black bit distribution SY(y) is the barycenter coordinate series on the second projection axis.
(M,)1Fr 0 barycentric coordinate series X(M,), Y
(M,) is the sum of the first moments in the area within the character circumscribing frame or the divided area, and the number of black bits in that area (
It is obtained by dividing by the black bit distribution 5X(x), 5Y(y). However, M and M are barycenter coordinate numbers that are attached to the barycenter coordinates in order of the size of the coordinate value, and in this example, 0M, which is attached in order of the coordinate value from small to large, is 1≦M.
, ≦MXt, MX is an odd number and represents the total number of barycenter coordinates detected as the barycenter coordinate series X(M,), and M is a natural number that satisfies 1≦M, ≦MY. is an odd number and represents the total number of barycenter coordinates detected as the barycenter coordinate series Y (M,). MX and MY are set to any suitable value, but preferably a value close to the size of the character figure pattern. For example, if the size of the character/graphic pattern is 60 x 60 bits, then approximately 15 or 31 pieces may be used.

以下、黒ビツト分布５Ｘ（ｘ）　、５Ｙ（ｙ）の投影軸
上における重心座標の検出につき説明するが、以下の説
明では説明の簡略化のためにＭＸ＝ＭＹ＝７とし７個の
重心座標Ｘ（Ｍ、）及び７個の重心座標Ｙ（Ｍ、）を検
出するものとする。Below, we will explain how to detect the barycenter coordinates on the projection axes of the black bit distributions 5X(x) and 5Y(y).In the following explanation, we will assume that MX=MY=7 and use the seven barycenter coordinates to simplify the explanation. It is assumed that X(M,) and seven barycenter coordinates Y(M,) are detected.

重心座標系列抽出部１６は、重心座標系列Ｘ（Ｍ、）を
求めるため、まず、外接文字枠のＸ軸方向の全範囲ＸＬ
−Ｘ、における黒ビツト分布５Ｘ（ｘ）の−次モーメン
ト和を、当該範囲内の黒ビツト全個数で除すことによっ
て、中央の重心座標番号４の重心座標×（４）を求める
。In order to obtain the barycenter coordinate series
-X, by dividing the sum of the -th moments of the black bit distribution 5X(x) by the total number of black bits within the range, the barycenter coordinates x (4) of the center barycenter coordinate number 4 are obtained.

次いで重心座標Ｘ（４）１Ｆｒ最初の分割点座標として
文字外接枠の領域×、〜×１．ｌを座標×（４）で分割
し、第一の分割領域として領域ＸＬ−Ｘ（４）と×（４
）〜Ｘ＋ｔとを設定する。Next, the center of gravity coordinates X (4) 1Fr are used as the coordinates of the first division point, and the area of the character circumscribing frame x, ~ x 1. Divide l by the coordinates ×(4), and divide the region XL-X(4) and ×(4) as the first divided region.
) to X+t.

そして分割類＠ｘ、〜Ｘ（４）内の黒ピット分布の重心
座標×（２）と、分割領域×（４）〜Ｘ５の重心座標×
（６）とを求める。Then, the barycentric coordinates of the black pit distribution in the division class @x, ~X(4) x (2), and the barycenter coordinates of the divided area x (4) to X5 x
(6) Find.

次いで重心座標×（２）及び×（６）を新な分割点座標
として加え、分割点座標Ｘ（２）　、Ｘ（４）及び×（
６）で文字外接枠の領域ＸＬ〜×、を分割し、第一の分
割領域として領域ＸＬ−Ｘ（２）と、Ｘ（２）　〜Ｘ（
４）と、Ｘ　（４）　〜Ｘ　（６）と、×（６）〜ｘ、
Ｉとを設定する。Next, add the barycenter coordinates ×(2) and ×(6) as new division point coordinates, and divide the division point coordinates X(2), X(4), and ×(
6), divide the area XL~x of the character circumscribing frame, and divide the area XL-X(2) as the first divided area and the area X(2)~X(
4), X (4) ~X (6), and ×(6) ~x,
Set I.

そして分割領域ＸＬ〜Ｘ（２）、内の重心座標×（１）
と、Ｘ　（２）　〜Ｘ　（４）内の重心座標×（３）と
、×（４）〜Ｘ（６）内の重心座標×（５）と、×（６
）〜ｘ、ｌ内の重心座標×（７）とを求める。Then, the coordinates of the center of gravity in the divided area XL~X(2) x (1)
, the barycentric coordinates in X (2) to X (4) × (3), the barycenter coordinates in × (4) to X (6) × (5), and × (6
) to x, barycentric coordinates in l×(7).

同様にして重心座標系列抽出部１６は、重心座標系列Ｙ
　（Ｍ、）を求めるため、まず、外接文字枠のＸ軸方向
の全範囲Ｙ＠〜ＹＴにおける黒ビツト分布５Ｙ（ｙ）の
−次モーメント和を、当該範囲内の黒ビツト全個数で除
すことによって、中央の重心座標番号４の重心座標Ｙ（
４）を求める。Similarly, the barycenter coordinate series extraction unit 16 extracts the barycenter coordinate series Y
To find (M,), first, divide the sum of the -th moments of the black bit distribution 5Y(y) in the entire range Y@~YT of the circumscribed character frame in the X-axis direction by the total number of black bits within the range. By doing this, the center of gravity coordinate Y(
Find 4).

次いで重心座標Ｙ（４）を最初の分割点座標として文字
外接枠の領域Ｙ８〜ＹＴを座標Ｙ（４）で分割し、第二
の分割領域として領域Ｙ、〜Ｙ（４）とＹ（４）〜ＹＴ
とを設定し、分割領域Ｙ、〜Ｙ（４）の重心座標Ｙ（２
）と、Ｙ（４）〜Ｙ、の重心座標Ｙ（６）とを求める。Next, the area Y8 to YT of the character circumscribing frame is divided by the coordinate Y(4) using the centroid coordinate Y(4) as the first dividing point coordinate, and the areas Y, ~Y(4) and Y(4) are divided as the second divided area. )~YT
and set the centroid coordinates Y(2) of the divided areas Y, ~Y(4)
) and the center of gravity coordinates Y(6) of Y(4) to Y.

次いで重心座標Ｙ（２）及びＹ（６）！新な分割点座標
としで加え、文字外接枠の領域Ｙ１１＝Ｙｙｔ分割し、
第二の分割領域として領域Ｙ、〜Ｙ（２）と、Ｙ（２）
〜Ｙ（４）と、Ｙ（４）〜Ｙ（６）と、Ｙ（６）〜ＹＴ
とを設定し、分割領域Ｙ、〜Ｙ（２）内の重心座標Ｙ（
１）と、Ｙ（２）〜Ｙ（４）内の重心座標Ｙ（３）と、
Ｙ（４）〜Ｙ（６）内の重心座標Ｙ（５）と、Ｙ（６）
〜ＹＴ内の重心座標Ｙ（７）とを求める。Next, the barycentric coordinates Y(2) and Y(6)! Add new dividing point coordinates, divide area Y11 = Yyt of character circumscribing frame,
Region Y, ~Y(2) and Y(2) as the second divided region
~Y(4), Y(4)~Y(6), and Y(6)~YT
and set the centroid coordinates Y(
1), the center of gravity coordinates Y(3) within Y(2) to Y(4),
Centroid coordinates Y(5) and Y(6) within Y(4) to Y(6)
- Find the center of gravity coordinates Y(7) in YT.

第４図及び第５図は文字図形パタン０日Ｇと黒ビツト分
布５Ｙ（ｙ）　、重心座標系列Ｙ　（Ｍ、）との間係を
示す図である。これら図の（Ａ）、（８）及び（Ｃ）は
それぞれ文字図形パタン、図（Ａ）の文字図形パタンに
関する黒ビツト分布５Ｙ（ｙ）及び図（Ａ）の文字図形
パタンに間する重心座標系列Ｙ（Ｍ、）を示す。4 and 5 are diagrams showing the relationship between the character/figure pattern 0th day G, the black bit distribution 5Y(y), and the barycenter coordinate series Y(M,). (A), (8), and (C) of these figures are the character figure pattern, the black bit distribution 5Y (y) for the character figure pattern of figure (A), and the center of gravity coordinates between the character figure pattern of figure (A). The series Y(M,) is shown.

第４図（Ａ）及び第５図（Ａ）の文字図形パタン０８Ｇ
は同一文字「上」として認識されるべき文字であって文
字線位置が部分的に異なる文字のパタンを示し、これら
図からも明かなように文字線位置の変動に追従して重心
座標Ｙ　（Ｍ、）が移動することが理解できる。Character figure pattern 08G in Figure 4 (A) and Figure 5 (A)
shows a pattern of characters that should be recognized as the same character "upper" but whose character line positions are partially different, and as is clear from these figures, the centroid coordinate Y ( It can be understood that M,) moves.

（特徴抽出部）特徴抽出部１８は、サブパタンを、当該サブパタン抽出
時の走査方向と直交する他の方向に走査し、着目点から
当該他の方向をみたときの文字線の分布状態を表す第一
及び第二の特徴を抽出する。これら第一及び第二の特徴
を各サブパタンにつき抽出する。(Feature Extraction Unit) The feature extraction unit 18 scans the sub-pattern in another direction orthogonal to the scanning direction at the time of extracting the sub-pattern, and extracts a number representing the distribution state of character lines when viewed from the point of interest in the other direction. Extract the first and second features. These first and second features are extracted for each subpattern.

従ってこの実施例では第一及び第二の特徴を抽出するた
めの他の方向は、垂直サブパタンｖＳＰに間して水平方
向（Ｘ軸方向）、水平サブパタンＨ８Ｐに関し垂直方向
（Ｙ軸方向）、右斜めサブパタンＲ３Ｐに関し左斜め方
向（Ｘ軸から時計方向に４５゛ずれる方向）及び左斜め
サブパタンＬＳＰに間して右斜め方向（Ｘ軸から半時針
方向に４５°ずれる方向）となる、そしてこの実施例の
特徴抽出部１８は図示せずも、ＶＳＰ水平特徴抽出回路
、Ｈ３Ｐ垂直特徴抽出回路、日ＳＰ左斜め特徴抽出回路
及びＬＳＰ右斜め特徴抽出回路から成る。ｖｓｐ水平特
徴抽出回路はｖＳＰにつき水平方向の第一及び第二の特
＠（水平特徴）を、口ＳＰ垂直特徴抽出回路はＨ４Ｆに
つき垂直方向の第一及び第二の特徴（垂直特徴）を、Ｒ
３Ｐ左斜め特徴抽出回路は日ＳＰにつき左斜め方向の第
一及び第二の特徴（左斜め特ｒ！１）％、及びＬＳＰ右
斜め特徴抽出回路はＬＳＰにつき右斜め方向の第一及び
第二の特徴（右斜め特徴）を抽出する。Therefore, in this embodiment, the other directions for extracting the first and second features are the horizontal direction (X-axis direction) with respect to the vertical sub-pattern vSP, the vertical direction (Y-axis direction) with respect to the horizontal sub-pattern H8P, and the right direction with respect to the horizontal sub-pattern H8P. Regarding the diagonal sub-pattern R3P, there is a left diagonal direction (a direction shifted by 45 degrees clockwise from the X axis) and a right diagonal direction (a direction shifted by 45 degrees from the X axis in the direction of the half hour hand) between the left diagonal sub patterns LSP, and this implementation. Although not shown, the feature extraction unit 18 in the example includes a VSP horizontal feature extraction circuit, an H3P vertical feature extraction circuit, a SP left diagonal feature extraction circuit, and an LSP right diagonal feature extraction circuit. The vsp horizontal feature extraction circuit extracts the first and second horizontal features (horizontal features) for vSP, and the mouth SP vertical feature extraction circuit extracts the first and second vertical features (vertical features) for H4F. R
The 3P left diagonal feature extraction circuit extracts the first and second features in the left diagonal direction per SP (left diagonal feature r!1)%, and the LSP right diagonal feature extraction circuit extracts the first and second features in the right diagonal direction per LSP. Extract the features (right diagonal features).

第６図はｖＳＰ水平特徴抽出回路の説明に供する機能ブ
ロック図であり、ＶＳＰ水平特徴抽出回路とこの回路に
間違するサブパタン抽出部の垂直サブパタンメモリ及び
重心系列抽出部の重心座標系列メモリとを示す。FIG. 6 is a functional block diagram for explaining the vSP horizontal feature extraction circuit, and includes the VSP horizontal feature extraction circuit, the vertical sub-pattern memory of the sub-pattern extraction section that is mistaken for this circuit, and the barycenter coordinate series memory of the barycenter series extraction section. shows.

同図において３８はｖＳＰ水平特徴抽出回路、４０は垂
直サブパタンを格納するサブパタン抽出部の垂直サブパ
タンメモリ、及び４２は重心座標系列Ｘ（Ｍ、）及びＹ
　（Ｍ、）を格納する重心座標系列抽出部の重心座標系
列メモリを示す。In the same figure, 38 is a vSP horizontal feature extraction circuit, 40 is a vertical sub-pattern memory of a sub-pattern extraction unit that stores vertical sub-patterns, and 42 is a barycenter coordinate series X (M, ) and Y
The barycenter coordinate sequence memory of the barycenter coordinate sequence extraction unit that stores (M,) is shown.

ＶＳＰ水平特徴抽出回路３日は、特徴抽出部１８の全体
を制御する制御回路４４と、サブパタンメモリを走査中
にサブパタンメモリの内容を参照して白ビットから黒ビ
ットへ変化する変化点を検出する変化点検出回路４６と
、白ビットから黒ビットへ変化した時の黒ビットの座標
を変化点座標として格納するための変化点メモリ４８と
、変化点メモリに格納されでいる変化点の個数分変化点
メモリの番地を指定する変化点カウンタ５０とを備える
。The VSP horizontal feature extraction circuit 3 includes a control circuit 44 that controls the entire feature extraction section 18, and a control circuit 44 that controls the entire feature extraction section 18, and while scanning the subpattern memory, refers to the contents of the subpattern memory and detects a change point where a white bit changes to a black bit. A change point detection circuit 46 for detecting a change point, a change point memory 48 for storing the coordinates of a black bit when changing from a white bit to a black bit as change point coordinates, and the number of change points stored in the change point memory. and a change point counter 50 that specifies the address of the minute change point memory.

ざらにｖＳＰ水平特徴抽出回路３８は、Ｘ、　Ｙ座標を
発生するパタンメモリアドレスカウンタ５２と、変化点
が着目点の一方及び他方の側のいずれの側に位置するか
判定するため変化点の座標と着目点の座標とを比較する
比較器５４と、着目点及び変化点の間の変換距離のに乗
を算出する演算回路５６と、着目点の一方の側の特徴（
第一の特徴）を抽出するためのレジスタ４５８と、着目
点の他方の側の特＠（Ｍ二の特徴）を抽出するためのレ
ジスタＢ６０と、各注目点毎に第一及び第二の特徴を格
納するｖＳＰ水平特徴パタンメモリ６２とを備えて成る
。レジスタＡ５８及びレジスタＢ６０の容量は例えば１
６ビツト宅ある。Roughly speaking, the vSP horizontal feature extraction circuit 38 uses a pattern memory address counter 52 that generates X and Y coordinates, and coordinates of a changing point to determine whether the changing point is located on one side or the other side of the point of interest. A comparator 54 that compares the coordinates of the point of interest with the coordinates of the point of interest, an arithmetic circuit 56 that calculates the power of the transformed distance between the point of interest and the point of change, and a feature (
A register 458 for extracting the first feature), a register B60 for extracting the feature on the other side of the point of interest (M2 feature), and a register B60 for extracting the first and second features for each point of interest. and a vSP horizontal feature pattern memory 62 that stores the vSP horizontal feature pattern memory 62. For example, the capacity of register A58 and register B60 is 1
I have a 6-bit house.

第７図はＶＳＰ水平特徴パタンメモリの説明に供する図
、及び第８図は１個の着目点に関する特徴を表す図であ
る。FIG. 7 is a diagram for explaining the VSP horizontal feature pattern memory, and FIG. 8 is a diagram showing features regarding one point of interest.

第７図にも示すようにｖＳＰ水平特徴パタンメモリ６２
は、垂直サブパタンメモリ４０上の点に対応する位置の
着目点の特徴を格納できるように例えばＸ軸方向に１２
８ビツト及びＹ軸方向に１２８ビツトのメモリを例えば
３２個有する。この場合、第８図にも示すように１個の
着目点の特徴を３２ビツトで表すことができ、例えば上
位１６ビツトで着目点の第一の特徴を及び下位１６ビツ
トで着目点の第二の特徴を表せる。ｖＳＰ水平特徴パタ
ンメモリ６２！構成する３２個の各メモリ上には垂直サ
ブパタン４０上のＸ、Ｙ軸に対応するＸ、Ｙ軸を設定し
ており、従ってサブパタンメモリ４０上の着目点に対応
する位置の格納場所に第一及び第二の特徴を格納できる
。As shown in FIG. 7, the vSP horizontal feature pattern memory 62
is, for example, 12 in the
For example, it has 32 memories of 8 bits and 128 bits in the Y-axis direction. In this case, as shown in Fig. 8, the feature of one point of interest can be represented by 32 bits. For example, the upper 16 bits represent the first feature of the point of interest, and the lower 16 bits represent the second feature of the point of interest. can express the characteristics of vSP horizontal feature pattern memory 62! The X and Y axes corresponding to the X and Y axes on the vertical sub-pattern 40 are set on each of the 32 memories making up the memory. The first and second characteristics can be stored.

次にｖＳＰ水平特徴抽出回路３８の全体的な動作につき
概略的に説明する。Next, the overall operation of the vSP horizontal feature extraction circuit 38 will be schematically explained.

ＶＳＰ水平特徴抽出回路３８は、まず変化点を検出する
ため垂直サブパタンを走査する０文字外接枠の左辺上の
点（ＸＬ、Ｙ）（但しＹ、≦ｙ≦ＹＴ）から垂直サブパ
タンの水平走査を開始し、検出した変化点座標を変化点
メモリ４８に登録しながら、文字外接枠の右辺上の点（
ｘ＊　、Ｙ）まで走査する。走査線上で白ビットから黒
ビットに変化した時の黒ビットの座標を変化点座標とし
で検出し、変化点座標を各走査線毎に検出する。The VSP horizontal feature extraction circuit 38 first scans the vertical subpattern horizontally from a point (XL, Y) on the left side of the 0 character circumscribing frame (where Y, ≦y≦YT) to scan the vertical subpattern in order to detect a change point. While registering the detected change point coordinates in the change point memory 48, select the point (
x*, Y). The coordinates of a black bit when it changes from a white bit to a black bit on a scanning line are detected as change point coordinates, and the change point coordinates are detected for each scanning line.

次に、ｖＳＰ水平特徴抽出回路３８は第一及び第二の特
徴を抽出するため着目点を走査する。Next, the vSP horizontal feature extraction circuit 38 scans the point of interest to extract the first and second features.

水平特徴の抽出では、着目点及び変化点の位置関係を判
定するため、着目点のＸ座標ＴＸと変化点のＸ座標Ｃｘ
との大小間係を比較する。座標ＴＸが座標Ｃｘよりも大
きい場合変化点が着目点の一方の側に位置することを表
し、また座標ＴＸが座標Ｃｘよりも小さい場合変化点が
着目点の他方の側に位置することを表す。In horizontal feature extraction, in order to determine the positional relationship between the point of interest and the point of change, the X coordinate TX of the point of interest and the X coordinate Cx of the point of change are
Compare the size and size of the booth. If the coordinate TX is larger than the coordinate Cx, it means that the changing point is located on one side of the point of interest, and if the coordinate TX is smaller than the coordinate Cx, it means that the changing point is located on the other side of the point of interest. .

また着目点のＸ座標ＴＸと重心座標系列Ｘ（Ｍ、）とを
比較し、座標下Ｘをこの比較結果に応じて重心座標番号
凪ヲ尺度とする値（変換＠ＴＸ、４）に変換する。Also, compare the X coordinate TX of the point of interest with the barycenter coordinate series .

・Ｘ（Ｍ、）≦ＴＸ＜　Ｘ　（Ｍ、−１）（７）　ト！
・ＴＸ＝ＸＬのときＴＸ、＝０・ｒｘ＝ｘ＊のときＴＸ、＝ＭＸ＋　１但し、ここではＭ、を０≦Ｍ２≦ＭＸの範囲の整数とし
、Ｘ（０）＝ＸＬ及びＸ　（Ｍｘ◆Ｉ）＝　Ｘ　＊とす
る。・X(M,)≦TX<X(M,-1)(7) To!
・When TX=XL, TX,=0 ・When rx=x*, TX,=MX+1 However, here, M is an integer in the range of 0≦M2≦MX, and X(0)=XL and X (Mx ◆I)=X*.

着目点の場合と同様にして、変化点のＸ座標Ｃｘと重心
座標系列とを比較し、座標Ｃｘをこの比較結果に応じて
重心番号Ｍ、を尺度とする＠（変換値ＣＸＨ）に変換す
る。In the same way as for the point of interest, compare the X coordinate Cx of the changing point with the barycenter coordinate series, and convert the coordinate Cx to @(conversion value CXH) using the barycenter number M as a scale according to the comparison result. .

そして着目点及び変化点の変換距離りのに乗を求めるに
の実施例ではに＝２とする）。水平特徴の抽出ではＤ＝
　ｌ　ＣＸＨ−ＴＸ、　　ｌ　テある。In this embodiment, to calculate the transformed distance of the point of interest and the point of change to the power of 2, 2 is used. For horizontal feature extraction, D=
There are l CXH-TX and l Te.

そして着目点及び変化点の位置関係の判定結果に基づい
て、着目点の一方の側に位置する変化点及び着目点の変
換距離りのに乗を順次に加算してゆき第一の特徴を求め
、及び着目点の他方の側に位置する変化点及び着目点の
変換路ＨＤのに乗を順次に加算してゆき第二の特徴を求
める。Then, based on the determination result of the positional relationship between the point of interest and the point of change, the first feature is determined by sequentially adding the transformed distances of the points of interest and points of interest located on one side of the point of interest. , the change point located on the other side of the point of interest, and the power of the conversion path HD of the point of interest are sequentially added to obtain the second feature.

次にｖＳＰ水平特徴抽出回路３８のより具体的な動作に
つき一例を挙げて説明する。Next, a more specific operation of the vSP horizontal feature extraction circuit 38 will be explained using an example.

垂直サブパタンの走査が始ると、パタンメモリアドレス
カウンタ５２は垂直サブパタンの水平走査を行なうべく
Ｘ、Ｙ座標を発生し、これら座標を垂直サブパタンメモ
リ４０及び変化点メモリ４８に対して出力する。垂直サ
ブパタンメモリ４０は入力するＸ、Ｙ座標で指定される
走査点の画素データを変化点検出回路４６に出力する。When scanning of the vertical sub-pattern begins, pattern memory address counter 52 generates X and Y coordinates for horizontal scanning of the vertical sub-pattern, and outputs these coordinates to vertical sub-pattern memory 40 and change point memory 48. The vertical sub-pattern memory 40 outputs pixel data of the scanning point specified by the input X and Y coordinates to the change point detection circuit 46.

変化点検出回路４６は入力した画素データが白ヒツト及
び黒ビットのいずれであるかを判定し白ビットの次に黒
ビットを検出したときの当該黒ビットを変化点として検
出する。The change point detection circuit 46 determines whether the input pixel data is a white hit or a black bit, and when a black bit is detected next to the white bit, the black bit is detected as a change point.

変化点検出回路４６が変化点を検出すると、変化点カウ
ンタ５０は一本の走査線上で検出された変化点の個数を
計数すべく計数値ｉを１カウントアツプしく計数値ｉは
１本の走査線の走査開始時点で初期値零となっている）
、次いで計数［ｌｉを変化点座標の格納場所を指定する
番地として変化点メモリ４８に対し出力する。変化メモ
リ４８は指定番地ｉを入力すると、パタンメモリアドレ
スカウンタ５２から入力した座標を１本の走査線上で第
１番目に検出された変化点の座標として指定番地ｉに格
納する。When the change point detection circuit 46 detects a change point, the change point counter 50 increments the count value i by 1 to count the number of change points detected on one scanning line. The initial value is zero at the start of line scanning)
Then, the count [li is outputted to the change point memory 48 as an address specifying the storage location of the change point coordinates. When the change memory 48 receives the specified address i, it stores the coordinates input from the pattern memory address counter 52 in the specified address i as the coordinates of the first detected change point on one scanning line.

１本の走査線上の文字外接枠内の変化点の検出が終ると
、当該走査線上の着目点を走査する。When the detection of the change point within the character circumscribing frame on one scanning line is completed, the point of interest on the scanning line is scanned.

この例では文字外接枠内の全点を着目点とし、従って走
査線上の各点を着目点として順次に着目点を水平走査す
る。In this example, all points within the character circumscribing frame are set as points of interest, and therefore, each point on the scanning line is set as a point of interest, and the points of interest are sequentially scanned horizontally.

第９図は変化点を検出したのちの、１個の着目点に関す
る水平特徴の抽出過程の説明図である。FIG. 9 is an explanatory diagram of the horizontal feature extraction process for one point of interest after detecting a change point.

以下第９図を参照し、１個の着目点に関する水平特徴の
抽出過程につき一例を挙げて説明する。Hereinafter, with reference to FIG. 9, an example of the horizontal feature extraction process for one point of interest will be described.

１本の走査線上の変化点の検出が終ると、当該走査線上
の１個の着目点に関する水平特徴を抽出する処理が始ま
る（　５ＴＡＲＴ）、尚、５ＴＡＲＴ時点で変化点カウ
ンタ５０の計数ｉＩｉは変化点検出を終えた走査線上の
変化点総個数を表す値工となっている。When the detection of the change point on one scanning line is completed, the process of extracting the horizontal feature regarding one point of interest on the scanning line starts (5TART).The count ii of the change point counter 50 changes at the time of 5TART. This value represents the total number of changing points on the scanning line for which point detection has been completed.

処理が始まると、制御回路４４はレジスタＡ５８及びレ
ジスタＢ６１初期化する（Ｓｌ、Ｓ２）、これと共に比
較器５４、演算回路５６及びｖＳＰ水平特徴パタンメモ
リ６２はパタンメモリアドレスカラシタ５２から当該着
目点の座標を入力する。When the process starts, the control circuit 44 initializes the register A 58 and the register B 61 (Sl, S2), and at the same time, the comparator 54, the arithmetic circuit 56, and the vSP horizontal feature pattern memory 62 select the target point from the pattern memory address cursor 52. Enter the coordinates of

次いで制御回路４４は１個の着目点に間する水平特徴の
抽出１Ｆｒ終了したか否か判定する（Ｓ３）。Next, the control circuit 44 determines whether extraction 1Fr of horizontal features between one point of interest has been completed (S3).

Ｓ３の判定で変化点カウンタ５０の計数値ｉがｉ＝Ｏで
あれば１個の着目点に関する水平特徴抽出ヲ終了しこの
ときのレジスタＡ５８及びレジスタ８６０の格納値がそ
れぞれ当該着目点に間する第一及び第二の特徴であるこ
とを表すので、制御回路４４はレジスタＡ５８及びレジ
スタＢ６０の格納値をそれぞれｖＳＰ水平特徴パタンメ
モリ６２の着目点座標で指定される格納場所に保存する
。その後１個の着目点に関する水平特徴抽出処理を終了
する（ＥＮＤ　）。If the count value i of the change point counter 50 is i=O in the determination in S3, the horizontal feature extraction for one point of interest is completed, and the values stored in the register A58 and register 860 at this time are respectively related to the point of interest. Since these represent the first and second features, the control circuit 44 stores the values stored in the register A58 and register B60 in the storage locations designated by the coordinates of the point of interest in the vSP horizontal feature pattern memory 62, respectively. Thereafter, the horizontal feature extraction process for one point of interest is ended (END).

Ｓ３の判定で変化点カウンタ５０の計数値ｉが１≠Ｏで
あれば１個の着目点に間する水平特徴抽出を未だ終了し
でいないことを表すので、変化点メモリ４８はこのとき
の変化点カウンタ５０の計数値ｌで指定される番地の変
化点座標を比較器５４及び演算回路５６に入力する。If the count value i of the change point counter 50 is 1≠O in the determination in S3, this indicates that horizontal feature extraction between one point of interest has not been completed, so the change point memory 48 stores the change at this time. The changing point coordinates of the address specified by the count value l of the point counter 50 are input to the comparator 54 and the arithmetic circuit 56.

次いで演算回路５６は入力した着目点及び変化点の座標
と、重心座標系列メモリ４２からの重心座標系列とに基
づいて着目点及び変化点の座標の変換Ｊａミラめ（Ｓ４
）、ざらに求めた変換値に基づいて着目点及び変化点の
変換語ＭＤを求める。Next, the arithmetic circuit 56 converts the coordinates of the point of interest and the point of change based on the input coordinates of the point of interest and the point of change and the barycenter coordinate series from the barycenter coordinate series memory 42 (S4
), the conversion word MD of the point of interest and the point of change is determined based on the roughly determined conversion value.

また比較器５４は入力した着目点及び変化点の座標を比
較し、比較結果を制御回路４４に入力する。Further, the comparator 54 compares the input coordinates of the point of interest and the point of change, and inputs the comparison result to the control circuit 44 .

次いで制御回路４４は比較器５４からの比較結果に基づ
いて変化点が着目点の一方及び他方の側のいずれにある
かを判定する（Ｓ５）。Next, the control circuit 44 determines whether the change point is on one side or the other side of the point of interest based on the comparison result from the comparator 54 (S5).

Ｓ５の判定でＣＸ＜ＴＸであれば変化点は着目点の一方
の側に位置するとみなし、制御回路４４は演算回路５６
が算出した変換距離りのに乗（ＤＫ）をレジスタ４５８
の格納値に加算しく８６）、次いでＳ８の処理を行なう
。If CX<TX in the determination in S5, the change point is considered to be located on one side of the point of interest, and the control circuit 44 uses the calculation circuit 56
The converted distance calculated by DK is stored in the register 458.
86), and then performs the process of S8.

Ｓ５の判定でＣｘ≧ＴＸであれば変化点は着目点の他方
の側に位置するとみなし、制御回路４４は演算回路５６
が算出した変換距離りのに乗（Ｄに）をレジスタＢ６０
の格納値に加算しくＳ７）、次いてＳ８の処理を行なう
。If Cx≧TX in the determination in S5, it is assumed that the change point is located on the other side of the point of interest, and the control circuit 44 uses the arithmetic circuit 56
The converted distance R to the power (D) calculated by is stored in register B60.
(S7), and then the process of S8 is performed.

Ｓ８で、制御回路４４は次の他の変化点と着目点との変
換距離りを求めて特徴を抽出すべく変化点カウンタ５０
の計数［１ｉを１だけ減じ、次いでＳ３の処理に戻る。In S8, the control circuit 44 calculates the conversion distance between the next other change point and the point of interest and uses the change point counter 50 to extract features.
The count [1i is decreased by 1, and then the process returns to S3.

変化点カウンタ５０の計数値１を走査線上で検出した変
化点の総個数Ｉからｉ＝０となるまで順次歩進して各変
化点と着目点との変換語ｊｌｆｆｆＤのに乗を順次レジ
スタＡ５８或はレジスタＢ６０の格納値に加算してゆく
ことによって第一或は第二の特ｍを抽出できる。The count value 1 of the change point counter 50 is sequentially incremented from the total number I of change points detected on the scanning line until i=0, and the conversion word jlfffD of each change point and the point of interest is raised to the power of the conversion word jlfffD in order. Alternatively, the first or second characteristic m can be extracted by adding it to the value stored in the register B60.

走査線上の各着目点につき上述の処理を行なって第一及
び第二の特徴を抽出し、走査線上の全着目点につき第一
及び第二の特徴を抽出し終えると、当該走査線の次の走
査線に関して変化点を検出し着目点の第一及び第二の特
徴を抽出する。The above process is performed for each point of interest on the scanning line to extract the first and second features, and once the first and second features have been extracted for all points of interest on the scanning line, the next A point of change is detected on the scanning line, and first and second features of the point of interest are extracted.

Ｈ３Ｐ垂直、Ｒ８Ｐ左斜め及びしＳＰ右斜め特徴抽出回
路は、第６図に示すｖＳＰ水平特徴抽出回路と同様の構
成を有し、上述したＶＳＰ水平特徴抽出回路と同様にし
て水平サブパタン、右斜めサブパタン及び左斜めサブパ
タンの着目点に間する第一及び第二の特徴を抽出する。The H3P vertical, R8P left diagonal, and SP right diagonal feature extraction circuits have the same configuration as the vSP horizontal feature extraction circuit shown in FIG. First and second features between the points of interest of the subpattern and the left diagonal subpattern are extracted.

以下、水平サブパタン、右斜めサブパタン及び左斜めサ
ブパタンの着目点に閉する第一及び第一の特徴抽出につ
き、概略的に説明する。Hereinafter, the first and first feature extractions that close to the points of interest of the horizontal sub-pattern, right diagonal sub-pattern, and left diagonal sub-pattern will be schematically explained.

Ｈ９Ｐ垂直特徴抽出回路は、まず変化点を検出するため
水平サブパタンを走査する。文字外接枠の下辺上の点（
Ｘ、ＶＢ　）（但しＸＬ≦Ｘ≦Ｘｌ＋）から水平サブパ
タンの垂直走査を開始し、検出した変化点座標を登録し
ながら、文字外接枠の上辺上の点（Ｘ、Ｙア）まで走査
する。The H9P vertical feature extraction circuit first scans the horizontal sub-pattern to detect changing points. A point on the bottom edge of the character circumscribing frame (
Vertical scanning of the horizontal sub-pattern is started from X, VB) (where XL≦X≦Xl+), and while registering the coordinates of the detected change point, it scans to the point (X, YA) on the upper side of the character circumscribing frame.

次に、口ＳＰ垂垂直特徴油抽出回路着目点を走査する。Next, the point of interest of the mouth SP vertical feature oil extraction circuit is scanned.

垂直特徴の抽出では、着目点のＹ座標ＴＹと変化点のＹ
座標ＣＹとの大小関係を比較し、変化点が着目点の一方
及び他方の側のいずれの側に位置するか判定する。In vertical feature extraction, the Y coordinate TY of the point of interest and the Y coordinate of the point of change are
The magnitude relationship with the coordinate CY is compared to determine whether the changing point is located on one side or the other side of the point of interest.

また着目点のＹ座標ＴＹと重心座標系列Ｙ　（Ｍ、）と
を比較し、座標ＴＶをこの比較結果に応じて重心座標番
号Ｍ、壱尺度とする［（変換！ＴＶ、　）に変換する。Also, the Y coordinate TY of the point of interest is compared with the barycenter coordinate series Y (M,), and the coordinate TV is converted into [(conversion!TV, ) with barycenter coordinate number M and 1 scale according to the comparison result.

・Ｘ　（Ｍ、）≦ＴＹ＜　Ｘ　（Ｍ、◆１）のとき・Ｔ
Ｖ＝Ｙ８のときＴＹＨ”０・ＴＶ＝ＹＴのときＴＶ、＝ＭＹ＋　１但し、ここではＭ、を０５Ｍ、≦ＭＹの範囲の整数とし
、Ｙ（０）”Ｙａ及びＸ（ＭＹ◆Ｉ）＝ＹＴとする。・When X (M,)≦TY<X (M,◆1)・T
When V=Y8, TYH”0 ・When TV=YT, TV, = MY+ 1 However, here, M is an integer in the range of 05M, ≦MY, and Y(0)”Ya and X(MY◆I)= Let it be YT.

同様にしで、変化点のＹ座標ＣＹと重心座標系列Ｙ（Ｍ
、）とを比較し、座標ＣＹをこの比較結果に応じて重心
番号Ｍ、を尺度とする値（変換値ＣＶ＋　）に変換する
。Similarly, the Y coordinate CY of the change point and the barycenter coordinate series Y(M
, ), and the coordinate CY is converted into a value (converted value CV+) using the barycenter number M as a scale according to the comparison result.

そして着目点及び変化点の変換距離りのに乗を求める。Then, the transformed distance of the point of interest and the point of change is calculated.

垂直特徴の抽出ではＤ＝　ｌ　ＣＹ、　−Ｔｖ。For vertical feature extraction, D = l CY, -Tv.

である。It is.

そして着目点及び変化点の位置間係の判定結果に基づい
て、変換語ＭＤのに乗を順次ｆこ加算してゆき第一及び
第二の特徴を求める。Then, based on the determination result of the relationship between the positions of the point of interest and the point of change, the powers of the conversion word MD are sequentially added by f to obtain the first and second features.

またＲ３Ｐ左斜め特徴抽出回路は、まず変化点ヲ検出す
るため右斜めサブパタンを走査する。文字外接枠の左辺
上及び下辺上の点から右斜めサブパタンの左斜め走査を
開始し、検出した変化点座標を登録しながら、文字外接
枠の上辺上及び右辺上の点まで走査する。Further, the R3P left diagonal feature extraction circuit first scans the right diagonal sub-pattern to detect a change point. The left diagonal scan of the right diagonal sub-pattern is started from the points on the left side and the bottom side of the character circumscribing frame, and is scanned to the points on the top and right side of the character circumscribing frame while registering the detected change point coordinates.

次に、日ＳＰ左斜め特徴徴抽出回路は着目点を走査する
。Next, the SP left diagonal feature extraction circuit scans the point of interest.

左斜め特徴の抽出では、着目点の座標ＴＸ又はＴＶと変
化点の座標Ｃｘ又はＣＹとの大小関係を比較し、変化点
が着目点の一方及び他方の側のいずれの側に位置するか
判定する。In extracting the left diagonal feature, the magnitude relationship between the coordinates TX or TV of the point of interest and the coordinates Cx or CY of the point of change is compared, and it is determined whether the point of change is located on one side or the other side of the point of interest. do.

また着目点の座標ＴＸ及び丁Ｙと重心座標系列Ｘ（Ｍ、
）及びＹ　（Ｍ、）とを比較し、座標ＴＸ及びＴＹ’８
コの比較結果に応じて重心座標番号Ｍ、及びＭ９を尺度
とする変換値ＴＸ、及びＴＹ、に変換する。In addition, the coordinates TX and Y of the point of interest and the barycentric coordinate series X (M,
) and Y (M,), coordinates TX and TY'8
According to the comparison results of 1 and 2, the barycenter coordinate numbers M and M9 are converted into converted values TX and TY using M9 as a scale.

同様にして、変化点の座標Ｃｘ及びＣＹと重心座標系列
Ｘ（Ｍ、）及びＹ　（Ｍ、）とを比較し、この比較結果
に応して座標Ｃｘ及びＣＹを変換値Ｃｘ８及びＣＹＨに
変換する。Similarly, the coordinates Cx and CY of the change point are compared with the barycenter coordinate series X (M,) and Y (M,), and the coordinates Cx and CY are converted into converted values Cx8 and CYH according to the comparison result. do.

そして着目点及び変化点の変換語ＨＤのに乗を求める。Then, the power of the converted word HD at the point of interest and the point of change is calculated.

左斜め特徴の抽出では変換語ＭＤは次式のように表され
る。In extracting the left diagonal feature, the conversion word MD is expressed as in the following equation.

そして着目点及び変化点の位Ｍ関係の判定結果に基づい
て、変換語ＭＤのに乗を順次に加算してゆき第一及び第
二の特＠を求める。Then, based on the determination result of the magnitude M relationship between the point of interest and the point of change, the powers of the conversion word MD are sequentially added to obtain the first and second characteristics.

またＬＳＰ右斜め特徴抽出回路は、まず変化点を検出す
るため左斜めサブパタンを走査する０文字外接枠の下辺
上及び右辺上の点から左斜めサブパタンの右斜め走査を
開始し、検出した変化点座標をＷ録しながら、文字外接
枠の左辺上及び上辺上の点まで走査する。In addition, the LSP right diagonal feature extraction circuit first scans the left diagonal subpattern in order to detect a change point.The left diagonal subpattern is first scanned from points on the lower side and on the right side of the 0 character circumscribing frame, and the detected change point is detected. While recording the coordinates, scan to the points on the left and top sides of the character circumscribing frame.

次に、ＬＳＰ右斜め特徴徴抽出回路は着目点を走査する
。Next, the LSP right diagonal feature extraction circuit scans the point of interest.

右斜め特徴の抽出ては、左斜め特徴抽出の場合と同様に
して、変化点が着目点の一方及び他方の側のいずれの側
に位置するか判定し、また着目点の座標ＴＸ及びＴＹと
変化点の座標Ｃｘ及びＣＹとを、変換値ＴＸ、及び丁Ｙ
ｓと変換値ＣＸＨ及びＣＹ、とに変換する。To extract the right diagonal feature, in the same way as the left diagonal feature extraction, it is determined which side of the point of interest the change point is located on, one side or the other, and the coordinates TX and TY of the point of interest are determined. The coordinates Cx and CY of the change point are converted into the converted value TX and the coordinate Y
s and converted values CXH and CY.

そして左斜め特徴抽出の場合と同様にして、着目点及び
変化点の変換語ＭＤのに乗を求め、第一及び第二の特徴
を求める。Then, in the same manner as in the case of left diagonal feature extraction, the power of the conversion word MD of the point of interest and the point of change is determined, and the first and second features are determined.

尚、変換距離りは一般に次式のように表せる。Note that the conversion distance can generally be expressed as shown in the following equation.

しかしながら、水平特徴抽出においては着目点及び変化
点のＹ座標が及び垂直特徴抽出においては着目点及び変
化点のＸ座標が等しいので変換距離りを表す式を上述の
ように簡略化できる。However, in horizontal feature extraction, the Y coordinates of the point of interest and the point of change are equal, and in vertical feature extraction, the X coordinates of the point of interest and the point of change are the same, so the equation representing the transformation distance can be simplified as described above.

（特徴マトリクス抽出部）特徴マトリ２ス抽出部２８は、特徴マトリクスを作成す
るための分割点座標に基づいて文字外接枠内をＭＸＮ個
の部分領域に分割する。そして垂直、水平、右斜め及び
左斜めサブパタンについてそれぞれ、各部分領域内に存
在する着目点の着目点の第一の特徴の相加平均及び第二
の特徴の相加平均を要素とするＭｘＮｘ２次元の特徴ベ
クトルである水平、垂直、左斜め及び右斜め特徴ベクト
ルを抽出し、よってひとつの認識対象につきこれら特徴
ベクトルから成るＭｘＮｘ２Ｘ４次元の特徴ベクトルを
抽出する。(Feature matrix extraction unit) The feature matrix 2 extraction unit 28 divides the inside of the character circumscribing frame into MXN partial regions based on the coordinates of division points for creating a feature matrix. Then, for the vertical, horizontal, right diagonal, and left diagonal sub-patterns, MxNx two dimensions whose elements are the arithmetic mean of the first feature and the arithmetic mean of the second feature of the points of interest existing in each partial region. The horizontal, vertical, left diagonal, and right diagonal feature vectors, which are feature vectors of

特徴マドｌノウスを作成するための分割点座標としては
従来公知の手法によって求められるものを用いてもよい
が、この実施例では重心座標系列Ｘ（ＭＰ）及び’／　
ＣＭ、）ｌＦｒ特徴マトリクスを作成するための分割点
座標に用い従ってＭ＝ＭＸ＋１及びＮ＝ＭＹ＋１として
文字外接枠内を分割するものとする。Although the dividing point coordinates for creating the feature map may be obtained by a conventionally known method, in this embodiment, the barycenter coordinate series X(MP) and '/
CM, )lFr The inside of the character circumscribing frame is divided using the dividing point coordinates for creating the feature matrix, and M=MX+1 and N=MY+1.

以下、水平特徴から得られる水平特徴ベクトルの抽出を
例として特徴ベクトルの求め方につき説明する。Hereinafter, a method for obtaining a feature vector will be explained by taking as an example the extraction of a horizontal feature vector obtained from horizontal features.

文字外接枠内を重心座標系列を用いてＭＸＸＭＹ個の部
分領域に分割する。そして垂直サブパタンの各部分領域
の、第一の特徴の相加平均及び第二の特徴の相加平均を
求める。第ｍ行ｎ列の部分９Ｎ域の第一の特徴の相加平
均及び第二の特徴の相加平均を）ＩＬｌ、１．　、及び
ＨＲ−、、（ｍ＝１．２、・・・ＭＸ及びｎ＝１．２、
・・・、ＭＹ）　、第ｍ行ｎ列の部分領域内の位置（ｘ
、ｙ）にある着目点の第一の特徴及び第二の特徴を（ｌ
　Ｘ　＊　Ｖ及びｒＸ＋Ｖ、及び第ｍ行第ｎ列の部分領
域内の着目点の個数をＴ　Ｉｌｌ　＋　ｎとすれば、）ＩＬ□ 及びＨＲ，、、、。The inside of the character circumscribing frame is divided into MXXMY partial regions using the barycenter coordinate series. Then, the arithmetic mean of the first feature and the arithmetic mean of the second feature are determined for each partial region of the vertical sub-pattern. ILl,1. , and HR-, (m=1.2,...MX and n=1.2,
..., MY), position (x
, y), the first feature and the second feature of the point of interest in (l
If X*V and rX+V, and the number of points of interest in the partial region of the m-th row and n-th column are T Ill + n, then )IL□ and HR, , .

壱次式のように定義できる。By definition like Ichiji expression Wear.

各部分領域において求めたＨＬ、、、　、及びＨＲ，、
、。が（ＭＸ＋　１）　Ｘ　（ＭＹ＋　１　）　Ｘ　２
次元の水平特徴ベクトルを表す。HL, , , and HR, found in each partial region,
,. is (MX+ 1) X (MY+ 1) X 2
Represents a dimensional horizontal feature vector.

特徴ベクトル抽出部２８は水平特徴ベクトルの場合と同
様にして口ＳＰ垂直特徴パタンメモリ、８ＳＰ左斜め特
徴パタンメモリ、ＬＳＰ右斜め特徴パタンメモリを参照
して、（ＭＸ＋１　）　Ｘ　（ＭＹ＋１）×２次元の垂
直、左斜め及び右斜め特徴ベクトルを求める。The feature vector extraction unit 28 refers to the mouth SP vertical feature pattern memory, the 8SP left diagonal feature pattern memory, and the LSP right diagonal feature pattern memory in the same way as the horizontal feature vector, and extracts (MX+1) x (MY+1) x 2 dimensions. Find vertical, left diagonal, and right diagonal feature vectors.

これら（ＭＸ＋１）ｘ　（ＭＹ＋１）ｘ２次元の水平、
垂直、左斜め及び右斜め特徴ベクトルが認識対象（７）
（ＭＸ＋１）ｘ　（ＭＹ＋１）ｘ２ｘ４次元の特徴ベク
トルを表す。These (MX+1)x (MY+1)x2-dimensional horizontal,
Vertical, left diagonal, and right diagonal feature vectors are recognized (7)
(MX+1)x (MY+1)x2x4-dimensional feature vector.

（識別部）識別部３０は特徴ベラトル抽出部２８が抽出した認識対
象（７）　（ＭＸ＋　１）　ｘ　（閘Ｙ＋１）Ｘ２Ｘ４
次元の特徴ベクトルと、辞書３２に格納されている標準
文字の特徴ベクトルとを照合し認識対象の識別を行なう
。(Identification unit) The identification unit 30 recognizes the recognition target (7) (MX+1) x (Y+1)X2X4 extracted by the feature bellator extraction unit 28.
The dimensional feature vector is compared with the standard character feature vector stored in the dictionary 32 to identify the recognition target.

辞書３２は、特徴ベクトル抽出部２８が抽出する（ＭＸ
＋１）ｘ　（ＭＹ＋　１）ｘ２ｘ４次元の特徴ベクトル
と同一形式で記述された複数の標準文字の特徴ベクトル
と、標準文字の文字名例えば例えばｊＩＳ規格に定めら
れた文字コードとを格納する。The dictionary 32 is extracted by the feature vector extraction unit 28 (MX
+1)x(MY+1)x2x4-dimensional feature vectors, feature vectors of a plurality of standard characters written in the same format, and character names of the standard characters, such as character codes defined in the JIS standard, are stored.

識別部３２は認識対象の特徴ベクトルと標準文字の特徴
ベクトルとの間のユークリッド距Ｍを計算し、その距離
が最小となる標準文字の文字名を認識結果Ｃとして出力
する。The identification unit 32 calculates the Euclidean distance M between the feature vector to be recognized and the feature vector of the standard character, and outputs the name of the standard character with the minimum distance as the recognition result C.

この発明は上述した実施例にのみ限定されるものではな
く、従って各構成成分の構成、動作、動作の流れ、入出
力信号、入出力信号の流れ、数値的条件、形状、寸法及
び配設位Ｉｌを任意好適に変更できる。The present invention is not limited only to the embodiments described above, and therefore, the configuration, operation, flow of operation, input/output signals, flow of input/output signals, numerical conditions, shape, dimensions, and arrangement position of each component. Il can be changed as desired.

（発明の効果）上述した説明からも明らかなように、この発明の特徴抽
出装置によれば、文字図形パタンの文字線の投影分布（
周辺分布）を用いて、サブパタンの着目点と文字線との
距Ｍを、文字線の局所的な位置変動に追従した尺度に変
換し、この変換した距離のに乗和を文字の特徴として抽
出するので、文字線位置の変動があっても同一文字と認
識されるべき文字の特徴の変動を小さくでき、特徴が安
定する。(Effects of the Invention) As is clear from the above description, the feature extraction device of the present invention can detect the projection distribution (
(marginal distribution), convert the distance M between the subpattern's point of interest and the character line into a measure that follows local positional fluctuations of the character line, and extract the sum of the powers of this converted distance as a character feature. Therefore, even if there is a change in the character line position, the change in the characteristics of characters that should be recognized as the same character can be reduced, and the characteristics are stabilized.

また特徴抽出の処理も簡単な処理で行なえ、処理時間を
短くできる。Furthermore, the feature extraction process can be performed easily and the processing time can be shortened.

従ってこの発明の特徴抽出袋Ｈを用いて文字認識袋Ｍを
構成することにより、文字線位置の変動に対応する大量
の辞書データを用意しなくとも、認識率の低下を招かず
、従って文字認識を高精度に行ないかつ辞書照合時間を
従来より短縮できる文字認識装置を提供できる。Therefore, by configuring the character recognition bag M using the feature extraction bag H of the present invention, the recognition rate does not deteriorate without having to prepare a large amount of dictionary data corresponding to variations in the character line position, and therefore the character recognition Therefore, it is possible to provide a character recognition device that can perform dictionary matching with high accuracy and shorten dictionary matching time compared to conventional ones.

[Brief explanation of drawings]

第１図はこの発明の詳細な説明に供する機能ブロック図
、第２図は認識対象となる文字図形バクンの一例を示す図
、第３図（Ａ）〜（Ｄ）は第２図に示す文字図形パタンか
ら抽出されるサブパタンを示す図、第４図（Ａ）〜（Ｃ
）及び第５図（Ａ）〜（Ｇ）は文字図形パタンと投影分
布５Ｙ（Ｖ）　、重心座標系列Ｙ（Ｍ、）との間係を示
す図、第６図はｖＳＰ水平特徴抽出回路の説明に供する
機能ブロック図、第７図は特徴パタンメモリの説明に供する図、第８図は
１個の着目点に関する特徴の説明の説明に供する図、第９図は１個の着目点ｆこ関する第一及び第二の特徴の
抽出過程の説明に供する図である。ＩＯ・・・特徴抽出装置、　１２・・・サブパタン抽出
部１４・・・文字投影部、　　１６・−重心座標系列抽
出部＋　８−・・特徴抽出部。１個の着目点に間する第一及び第二の特徴の抽出過程第
９図Ｘ〉〉〉Fig. 1 is a functional block diagram providing a detailed explanation of the present invention, Fig. 2 is a diagram showing an example of character/figure Bakun to be recognized, and Figs. 3 (A) to (D) are characters shown in Fig. 2. Diagrams showing sub-patterns extracted from figure patterns, Figures 4 (A) to (C)
) and FIGS. 5(A) to 5(G) are diagrams showing the relationship between the character figure pattern, the projection distribution 5Y(V), and the barycentric coordinate series Y(M, ), and FIG. 6 is the diagram of the vSP horizontal feature extraction circuit. 7 is a diagram for explaining the feature pattern memory, FIG. 8 is a diagram for explaining the feature regarding one point of interest, and FIG. 9 is a diagram for explaining the feature of one point of interest. FIG. 4 is a diagram illustrating a process of extracting first and second features related to FIG. IO...Feature extraction device, 12...Sub pattern extraction unit 14...Character projection unit, 16.-Barycenter coordinate series extraction unit+8-...Feature extraction unit. Figure 9: Extraction process of first and second features between one point of interest

Claims

[Claims]

(1) Scan the character/figure pattern in a plurality of predetermined directions, detect the cross-section of the character line in each scanning direction, and extract the character line of the cross-section whose cross-sectional length is sufficiently longer than the character line width related to the character/figure pattern. a sub-pattern extraction unit that extracts sub-patterns representing stroke components in each scanning direction; a character projection section for determining the projection distribution; determining the barycentric coordinates on the projection axis of the first and second projection distributions within the character circumscribing frame of the character figure pattern; and determining the barycentric coordinates of the first projection distribution within the character circumscribing frame of the character figure pattern set the coordinates of the first dividing point, divide the area of the character circumscribing frame in the direction of the first projection axis by the dividing point coordinates, and set a first divided area;
Find the barycentric coordinates on the projection axis of the projection distribution in the first divided area, and set the barycentric coordinates of the first projected distribution as new dividing point coordinates to determine the barycentric coordinates of the first projected distribution in the first divided area. The barycentric coordinate series on the first projection axis is obtained by repeating the obtaining process a predetermined number of times, and the barycentric coordinates of the second projection distribution within the character circumscribing frame are set as the coordinates of the first other division point, and the area of the character circumscribing frame is A second divided area is set by dividing it in the direction of the second projection axis using other dividing point coordinates, and the center of gravity coordinates on the projection axis of the second projection distribution in the second divided area are determined. The process of calculating the barycenter coordinates of the second projection distribution in the second divided area is repeated a predetermined number of times by using the barycenter coordinates of the two projection distributions as new other division point coordinates, and the barycenter coordinate series on the second projection axis is obtained. A coordinate series extraction unit, for each of the extracted sub-patterns, takes each point or arbitrary point within the character circumscribing frame as a point of interest, and converts the sub-pattern into another direction that is perpendicular to the scanning direction at the time of extracting the sub-pattern and passes through the point of interest. to detect the character line position on the scanning line, and based on the positional relationship between these points of interest and character line positions and the barycentric coordinate series, coordinate values of these points of interest and character line positions are added to the barycentric coordinate series. Convert the barycenter coordinate numbers assigned in order of size to a conversion value as a scale, and calculate the conversion distance between the point of interest and each of all character line positions on the scanning line on one side of the point of interest as the conversion value. The sum of the K powers of each transformation distance on one side (K
is a constant) as the first feature regarding the point of interest, and calculate the converted distance between the point of interest and each of all character line positions on the scanning line on the other side of the point of interest based on the converted value. , the sum of the K powers of each transformation distance on the other side (K
is a constant) as a second feature regarding the point of interest.