JPH05342012A

JPH05342012A - Compiling method and compiler

Info

Publication number: JPH05342012A
Application number: JP17627392A
Authority: JP
Inventors: Eiji Iwata; 英次岩田
Original assignee: Sony Corp
Current assignee: Sony Corp
Priority date: 1992-06-10
Filing date: 1992-06-10
Publication date: 1993-12-24

Abstract

(57) [Abstract]

[Purpose] To compile a program independently of the digital signal processor and of the language.

[Structure] A front-end unit 1_n for language L_n performs preprocessing specific to language L_n on the source code of a program written in language L_n (n = 1, 2, ..., N) and outputs intermediate code common to languages L_1 to L_N. A common module unit 2 applies to that intermediate code processing that depends on neither the languages L_1 to L_N nor the processors DSP_1 to DSP_M. A back-end unit 3_m for digital signal processor DSP_m (m = 1, 2, ..., M) post-processes the output of the common module unit 2 and outputs object code for DSP_m.

Description

Detailed Description of the Invention

[0001]

[Field of Industrial Application] The present invention relates to a compiling method and a compiler suitable for compiling programs for a DSP (digital signal processor) used in digital signal processing such as audio processing and image processing.

[0002]

[Prior Art] In recent audio and image processing, a signal (an audio or image signal) is in many cases sampled, that is, digitized, and the time-domain signal is then converted into a frequency-domain signal (a spectrum) by, for example, the DFT (discrete Fourier transform), FFT (fast Fourier transform), or DCT (discrete cosine transform), after which parameter extraction or compression coding is performed.

[0003] Digital signal processing of this kind, typified by conversion between the time axis and the frequency axis of a digital signal (DFT (inverse DFT), FFT (inverse FFT), DCT (inverse DCT), and so on), requires a very large number of multiply-accumulate operations, and real-time processing was difficult when it was implemented on, for example, a general-purpose CPU.

[0004] DSPs (digital signal processors) were therefore developed. They have an architecture suited to pipeline and parallel processing, can perform a multiply-accumulate operation in, for example, several tens to several hundreds of nanoseconds, and are thus specialized for multiply-accumulate processing. In recent years many digital signal processing devices (for example, image processing devices and audio processing devices) have been built using such DSPs.

[0005]

[Problems to Be Solved by the Invention] To have such a DSP execute digital signal processing, a digital signal processing program must first be written in a language for that DSP, and its source code must then be compiled by a compiler into object code.

[0006] However, the language used to write digital signal processing programs often differs from DSP to DSP, and several languages may be provided even for the same DSP. Consequently, when programs for M DSPs can be written in N languages, for example, M × N compilers, one for each combination of DSP and language, must be prepared, which is inconvenient.

[0007] Here, the case where the language differs from DSP to DSP includes not only the case where the kind of language differs but also the case where the language is the same and its dialect differs from DSP to DSP (in terms of computer languages, for example, the same C language differing between compiler vendors).

[0008] Furthermore, a compiler consists of a number of logical units of operation (phases) into which the process of compiling a program is divided. The compilers for the individual DSPs and languages described above contain a considerable number of phases that depend on neither the DSP nor the language and could be shared, so providing these phases separately in every compiler is wasteful.

[0009] The present invention has been made in view of this situation and makes it possible to compile a program without depending on the DSP or the language.

[0010]

[Means for Solving the Problems] The compiling method according to claim 1 converts the source code of a program for a digital signal processor into intermediate code and converts the intermediate code into object code for the digital signal processor, and is characterized in that the intermediate code is common to a plurality of languages for writing programs for the digital signal processor, or to the object codes corresponding respectively to a plurality of digital signal processors.

[0011] The compiler according to claim 2 comprises: a front-end unit 1 (front-end unit 1_1 for language L_1 through front-end unit 1_N for language L_N) serving as preprocessing means that converts the source code of programs for digital signal processors DSP_1 to DSP_M into intermediate code; a common module unit 2 serving as common processing means that analyzes and optimizes the intermediate code output from the front-end unit 1; and a back-end unit 3 (back-end unit 3_1 for DSP_1 through back-end unit 3_M for DSP_M) serving as post-processing means that converts the output of the common module unit 2 into object code for DSP_1 to DSP_M. The compiler is characterized in that the front-end unit 1 (front-end units 1_1 to 1_N) outputs intermediate code common to the plurality of languages L_1 to L_N used to write programs for DSP_1 to DSP_M, and the back-end unit 3 (back-end units 3_1 to 3_M) converts the intermediate code into object code corresponding to each of the plurality of digital signal processors DSP_1 to DSP_M.

[0012] The compiler according to claim 3 is characterized in that the common module unit 2 applies to the intermediate code an optimization that does not depend on any of DSP_1 to DSP_M.

[0013] The compiler according to claim 4 is characterized in that the back-end unit 3 (back-end unit 3_1 for DSP_1 through back-end unit 3_M for DSP_M) divides the intermediate code into units that can be executed in parallel and assigns them to DSP_1 to DSP_M.

[0014] The compiler according to claim 5 is characterized in that the back-end unit 3 (back-end unit 3_1 for DSP_1 through back-end unit 3_M for DSP_M) performs register allocation.

[0015]

[Operation] In the compiling method according to claim 1, the source code of a program for a digital signal processor is converted into intermediate code that is common to a plurality of languages for writing programs for the digital signal processor, or to the object codes corresponding respectively to a plurality of digital signal processors, and that intermediate code is converted into object code for the digital signal processor. A program can therefore be compiled without depending on the DSP (digital signal processor) or the language.

[0016] In the compiler according to claim 2, the source code of programs for DSP_1 to DSP_M is converted into intermediate code common to the languages L_1 to L_N, and the intermediate code is analyzed and optimized. The optimized intermediate code is then converted into object code corresponding to each of DSP_1 to DSP_M. A program can therefore be compiled without depending on the DSP (digital signal processor) or the language.

[0017] In the compiler according to claim 3, the common module unit 2 applies to the intermediate code an optimization that does not depend on any of DSP_1 to DSP_M, so wasteful processing caused by duplicated phases is prevented.

[0018] In the compiler according to claim 4, the back-end unit 3 (back-end unit 3_1 for DSP_1 through back-end unit 3_M for DSP_M) divides the intermediate code into units that can be executed in parallel and assigns them to DSP_1 to DSP_M, so the capabilities of a DSP (digital signal processor) that can perform parallel processing are used effectively.

[0019] In the compiler according to claim 5, the back-end unit 3 (back-end unit 3_1 for DSP_1 through back-end unit 3_M for DSP_M) performs register allocation, so the registers of the DSP (digital signal processor) are used efficiently.

[0020]

[Embodiment] FIG. 1 is a block diagram showing the configuration of one embodiment of the compiler of the present invention. The front-end unit 1 consists of a front-end unit 1_1 for language L_1 through a front-end unit 1_N for language L_N. Each of these converts the source code of a program written in one of the program description languages L_1 to L_N for any of M digital signal processors DSP_1 to DSP_M (hereinafter, DSP_1 to DSP_M) into intermediate code written in an intermediate language common to L_1 to L_N. Each front-end unit 1_n for language L_n (n = 1, 2, ..., N) consists of the phases (units of compilation processing) shown in FIG. 2, namely a lexical analysis unit 11_n, a syntax analysis unit 12_n, and a semantic analysis unit 13_n, and performs processing that depends on language L_n.

[0021] That is, the lexical analysis unit 11_n of the front-end unit 1_n for language L_n splits the source code of a program written in language L_n into tokens, the smallest character strings that can be handled logically. The syntax analysis unit 12_n analyzes the structure of the constructs formed by the tokens output from the lexical analysis unit 11_n and checks whether they are permitted as syntactic structures of language L_n (syntax check). The semantic analysis unit 13_n analyzes the meaning of the constructs formed by the tokens, checks them for semantic errors, and, if there are none, converts the source code into intermediate code common to all of the languages L_1 to L_N.

[0022] The common module unit 2 consists of the phases shown in FIG. 3, namely a dependency analysis unit 21 and an optimization unit 22, and applies to the intermediate code output from the front-end units 1_1 to 1_N processing that depends on neither the languages L_1 to L_N nor DSP_1 to DSP_M.

[0023] That is, the dependency analysis unit 21 of the common module unit 2 analyzes the dependences among references to and definitions of data (data dependences) in the intermediate code output from the front-end unit 1 and converts these data dependences into a data structure known as a data flow graph. The optimization unit 22, referring as necessary to the data flow graph created by the dependency analysis unit 21, applies to the intermediate code output from the front-end unit 1 optimizations that depend on neither the languages L_1 to L_N nor DSP_1 to DSP_M, such as constant folding and identification of common subexpressions.

[0024] The back-end unit 3 consists of a back-end unit 3_1 for DSP_1 through a back-end unit 3_M for DSP_M, each of which generates object code for the corresponding one of DSP_1 to DSP_M from the intermediate code optimized by the common module unit 2. Each back-end unit 3_m for DSP_m (m = 1, 2, ..., M) consists of the phases shown in FIG. 4, namely a scheduling unit 31_m, a code generation/register allocation unit 32_m, and an optimization unit 33_m, and performs processing that depends on DSP_m.
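The three-part structure described in paragraphs [0020] to [0024] can be sketched roughly as follows. This is only an illustration under stated assumptions, not the patent's implementation: the class `Compiler`, its `register_front_end` and `register_back_end` methods, the list-of-strings intermediate code, and the toy front ends and back ends are all hypothetical. The point is that N front ends and M back ends sharing one common module cover every language/DSP pair.

```python
from typing import Callable, Dict, List

IR = List[str]  # hypothetical stand-in for the common intermediate code


class Compiler:
    """Toy model: N front ends + one common module + M back ends."""

    def __init__(self) -> None:
        self.front_ends: Dict[str, Callable[[str], IR]] = {}
        self.back_ends: Dict[str, Callable[[IR], str]] = {}

    def register_front_end(self, language: str, fe: Callable[[str], IR]) -> None:
        self.front_ends[language] = fe

    def register_back_end(self, dsp: str, be: Callable[[IR], str]) -> None:
        self.back_ends[dsp] = be

    @staticmethod
    def common_module(ir: IR) -> IR:
        # Placeholder for language- and DSP-independent analysis/optimization.
        return [line.strip() for line in ir if line.strip()]

    def compile(self, source: str, language: str, dsp: str) -> str:
        ir = self.front_ends[language](source)  # language-dependent step
        ir = self.common_module(ir)             # independent of both
        return self.back_ends[dsp](ir)          # DSP-dependent step


compiler = Compiler()
# Two toy "languages": one statement per line, or ';'-separated statements.
compiler.register_front_end("L1", lambda src: src.splitlines())
compiler.register_front_end("L2", lambda src: src.split(";"))
# Two toy "DSPs" that differ only in how they label their output.
compiler.register_back_end("DSP1", lambda ir: "\n".join("dsp1: " + s for s in ir))
compiler.register_back_end("DSP2", lambda ir: "\n".join("dsp2: " + s for s in ir))

# Every (language, DSP) combination is reachable with N + M registered parts.
pairs = [(lang, dsp) for lang in compiler.front_ends for dsp in compiler.back_ends]
```

With two front ends and two back ends registered, `pairs` holds all four combinations, where separate compilers would have required four complete implementations.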

[0025] That is, if the architecture of DSP_m is capable of parallel processing, for example, the scheduling unit 31_m divides the intermediate code optimized by the common module unit 2 into units (tasks) that can be executed in parallel and assigns them to the components of DSP_m, such as an ALU, a multiplier, an adder, or a shifter. The code generation/register allocation unit 32_m generates object code for DSP_m from the intermediate code output from the common module unit 2 (optimization unit 22) and performs register allocation so that variables are assigned efficiently to the registers of DSP_m. The optimization unit 33_m, referring as necessary to the data flow graph created by the dependency analysis unit 21 of the common module unit 2, applies DSP_m-dependent optimization to the object code for DSP_m generated by the code generation/register allocation unit 32_m.

[0026] When a program for DSP_m written in language L_n is compiled by a compiler configured in this way, the program is first read into the front-end unit 1_n for language L_n within the front-end unit 1. The lexical analysis unit 11_n (FIG. 2) of the front-end unit 1_n then splits the source code of the program written in language L_n into tokens, the smallest character strings that can be handled logically, such as keywords (for example, do, while, if, and for in C or FORTRAN), identifiers (for example, variables in the program), constants, and operators (for example, +, -, *, and /).

[0027] That is, the lexical analysis unit 11_n splits, for example, the FORTRAN statement

    IF(5.EQ.MAX)GOTO100

into the eight tokens IF, (, 5, .EQ., MAX, ), GOTO, and 100.
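The tokenization of that FORTRAN statement can be illustrated with a small sketch. The token pattern `TOKEN_RE` and the function `tokenize` are hypothetical and cover only this example; a real FORTRAN lexer has to handle far more (for instance, an identifier that merely begins with IF would be split incorrectly here).

```python
import re
from typing import List

# Hypothetical token pattern for this one example only.
TOKEN_RE = re.compile(
    r"""
    \.[A-Z]+\.      # dotted operators such as .EQ.
  | IF | GOTO       # keywords (tried before identifiers)
  | [A-Z][A-Z0-9]*  # identifiers
  | \d+             # integer constants
  | [()+\-*/]       # parentheses and arithmetic operators
    """,
    re.VERBOSE,
)


def tokenize(statement: str) -> List[str]:
    """Split a statement into tokens, rejecting unrecognized characters."""
    tokens: List[str] = []
    text = statement.replace(" ", "")
    pos = 0
    while pos < len(text):
        match = TOKEN_RE.match(text, pos)
        if match is None:
            raise SyntaxError(f"unexpected character {text[pos]!r}")
        tokens.append(match.group())
        pos = match.end()
    return tokens


tokens = tokenize("IF(5.EQ.MAX)GOTO100")
# tokens == ['IF', '(', '5', '.EQ.', 'MAX', ')', 'GOTO', '100']
```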

[0028] The tokens separated by the lexical analysis unit 11_n are input to the syntax analysis unit 12_n, which analyzes the structure of the constructs formed by these tokens and checks whether they are permitted as syntactic structures of language L_n (syntax check). For example, when the tokens A, +, and B are input, the syntax analysis unit 12_n determines that the construct A + B formed by these tokens has the syntactic structure called an expression and checks whether A + B is permitted as an expression written in language L_n. If the syntax analysis unit 12_n finds in the syntax check that any construct is not permitted, compilation (all subsequent processing) is aborted.

[0029] Constructs that pass the syntax check in the syntax analysis unit 12_n undergo semantic analysis in the semantic analysis unit 13_n, which checks them for semantic errors. For example, the semantic analysis unit 13_n determines that the construct A + B, formed by the three tokens integer variable A, operator +, and real variable B, is an addition of an integer and a real number, and checks whether adding an integer and a real number is an error. If the specification of language L_n does not permit the addition of an integer and a real number, the semantic analysis unit 13_n rejects the construct A + B and compilation (all subsequent processing) is aborted.
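The semantic check on A + B can be sketched as a table lookup per language. The language names `L_strict` and `L_lenient`, the `MIXED_ALLOWED` table, and the promotion of a mixed sum to `real` are assumptions made for illustration; the patent only states that the check depends on the specification of language L_n.

```python
# Hypothetical rule table: does the language allow integer + real?
MIXED_ALLOWED = {"L_strict": False, "L_lenient": True}


def check_add(left_type: str, right_type: str, language: str) -> str:
    """Return the result type of left + right, or raise on a semantic error."""
    if left_type == right_type:
        return left_type
    if {left_type, right_type} == {"integer", "real"} and MIXED_ALLOWED[language]:
        return "real"  # assumed promotion rule, not taken from the patent
    # Corresponds to rejecting the construct and aborting compilation.
    raise TypeError(f"{left_type} + {right_type} is not allowed in {language}")
```

Calling `check_add("integer", "real", "L_strict")` raises `TypeError`, which models the aborted compilation, while the same call with `"L_lenient"` returns `"real"`.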

[0030] If the semantic analysis unit 13_n finds no semantic errors in the constructs it checks, the source code is converted into intermediate code common to all of the languages L_1 to L_N and output to the dependency analysis unit 21 (FIG. 3) of the common module unit 2.

[0031] The dependency analysis unit 21 analyzes the dependences among references to and definitions of data (data dependences) in the intermediate code output from the semantic analysis unit 13_n and converts these data dependences into a data structure known as a data flow graph.

[0032] For example, suppose a program is written so that the three expressions

    A = B + C   (1)
    B = A + E   (2)
    B = F + G   (3)

are executed in the order (1), (2), (3). The dependency analysis unit 21 then finds the following dependences:

- A, defined in expression (1), is referenced in expression (2) (flow dependence).
- B, referenced in expression (1), is defined in expression (2) (anti-dependence).
- B, referenced in expression (1), is defined in expression (3) (anti-dependence).
- B, defined in expression (2), is redefined in expression (3) (output dependence).

These dependences among references to and definitions of data (data dependences) are converted into a data structure known as a data flow graph.
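The classification into flow, anti-, and output dependences can be reproduced with a short sketch. The representation of a statement as (label, defined variable, referenced variables) and the function `dependences` are hypothetical. As a simplification, every ordered pair of statements is compared directly, without accounting for intervening redefinitions, which is enough for this three-statement example but not for general programs.

```python
from typing import List, Set, Tuple

# Each statement: (label, defined variable, set of referenced variables).
Stmt = Tuple[int, str, Set[str]]

program: List[Stmt] = [
    (1, "A", {"B", "C"}),  # A = B + C
    (2, "B", {"A", "E"}),  # B = A + E
    (3, "B", {"F", "G"}),  # B = F + G
]


def dependences(stmts: List[Stmt]) -> List[Tuple[str, str, int, int]]:
    """List (kind, variable, earlier, later) for every ordered statement pair."""
    found = []
    for i, (s1, def1, use1) in enumerate(stmts):
        for s2, def2, use2 in stmts[i + 1:]:
            if def1 in use2:   # defined earlier, referenced later
                found.append(("flow", def1, s1, s2))
            if def2 in use1:   # referenced earlier, defined later
                found.append(("anti", def2, s1, s2))
            if def1 == def2:   # defined earlier, redefined later
                found.append(("output", def1, s1, s2))
    return found


deps = dependences(program)
```

For the three expressions above, `deps` contains exactly the four dependences listed in the text; these tuples could serve as the edges of a data flow graph.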

[0033] After the dependency analysis unit 21 has created the data flow graph, the optimization unit 22, referring to it as necessary, applies to the intermediate code output from the semantic analysis unit 13_n optimizations that do not depend on DSP_1 to DSP_M, such as constant folding and identification of common subexpressions.

[0034] For example, if the construct

    I = 4
    A = I * B

is written in language L_n in the program, the optimization unit 22 optimizes it to

    A = 4 * B

which reduces the number of definitions (I = 4).
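A minimal sketch of this transformation for straight-line code follows. The statement representation, the function `propagate_constants`, and the `live_out` parameter (the variables still needed after the fragment) are assumptions for illustration. The patent does not say how the optimization unit 22 decides that the definition of I can be dropped, so here that information is supplied by the caller.

```python
from typing import Dict, List, Set, Tuple

# A statement is (target, tokens): ("I", ["4"]) means I = 4 and
# ("A", ["I", "*", "B"]) means A = I * B.
Stmt = Tuple[str, List[str]]


def propagate_constants(stmts: List[Stmt], live_out: Set[str]) -> List[Stmt]:
    """Substitute known constants; drop constant definitions not in live_out."""
    constants: Dict[str, str] = {}
    result: List[Stmt] = []
    for target, tokens in stmts:
        tokens = [constants.get(tok, tok) for tok in tokens]
        if len(tokens) == 1 and tokens[0].isdigit():
            constants[target] = tokens[0]
            if target not in live_out:
                continue  # the definition itself is no longer needed
        else:
            constants.pop(target, None)  # target is no longer a known constant
        result.append((target, tokens))
    return result


optimized = propagate_constants(
    [("I", ["4"]), ("A", ["I", "*", "B"])], live_out={"A"}
)
# optimized == [("A", ["4", "*", "B"])]
```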

[0035] Similarly, if the construct

    A[I+1] = B[I+1] * C[I+1]

is written in language L_n in the program, the optimization unit 22 optimizes it to

    J = I + 1
    A[J] = B[J] * C[J]

which reduces the number of additions (I + 1) from three to one.
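The common-subexpression step can be sketched on the statement text itself. The function `hoist_common_index` and the temporary name `J` are hypothetical; the sketch assumes that the temporary name is not already in use and that the index variable is not modified within the statement. A real optimizer would work on the intermediate code rather than on strings.

```python
import re
from collections import Counter
from typing import List


def hoist_common_index(statement: str, temp: str = "J") -> List[str]:
    """Hoist an index expression that occurs more than once into a temporary."""
    indices = re.findall(r"\[([^\[\]]+)\]", statement)
    # Plain variable indices such as [I] need no temporary.
    counts = Counter(idx for idx in indices if not idx.isidentifier())
    if not counts:
        return [statement]
    expr, uses = counts.most_common(1)[0]
    if uses < 2:
        return [statement]
    return [f"{temp} = {expr}", statement.replace(f"[{expr}]", f"[{temp}]")]


result = hoist_common_index("A[I+1] = B[I+1] * C[I+1]")
# result == ["J = I+1", "A[J] = B[J] * C[J]"]
```

Counting the `+` signs before and after confirms the reduction from three additions to one.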

[0036] The intermediate code optimized by the optimization unit 22 of the common module unit 2 is output to the scheduling unit 31_m (FIG. 4) of the back-end unit 3_m for DSP_m. If the architecture of DSP_m is capable of parallel processing, the scheduling unit 31_m divides the intermediate code optimized by the common module unit 2 into units (tasks) that can be executed in parallel and assigns (schedules) each task to a component of DSP_m, such as an ALU, a multiplier, an adder, or a shifter.

[0037] The code generation/register allocation unit 32_m then generates object code for DSP_m from the intermediate code output from the optimization unit 22 of the common module unit 2, performs register allocation so that variables are assigned efficiently to the registers of DSP_m, and outputs the object code for DSP_m to the optimization unit 33_m. The optimization unit 33_m, referring as necessary to the data flow graph created by the dependency analysis unit 21 of the common module unit 2, applies DSP_m-dependent optimization to the object code output from the code generation/register allocation unit 32_m, and compilation of the program for DSP_m written in language L_n is complete.

[0038] The operation of the scheduling unit 31_m, the code generation/register allocation unit 32_m, and the optimization unit 33_m of the back-end unit 3_m for DSP_m is described further below, taking as an example the following program, which implements a 3-tap FIR digital filter on DSP_m:

    y(0) = h(0) * x(0) + h(1) * x(1) + h(2) * x(2)
    y(1) = h(0) * x(1) + h(1) * x(2) + h(2) * x(3)
    y(2) = h(0) * x(2) + h(1) * x(3) + h(2) * x(4)

In this program, h(0), h(1), and h(2) are the filter coefficients, x(0), x(1), ... are the input signals to the filter, and y(0), y(1), ... are the filter outputs.

[0039] Expressed as intermediate code, the above source code becomes:

    temp1  = h(0) * x(0)       (a1)
    temp2  = h(1) * x(1)       (a2)
    temp3  = h(2) * x(2)       (a3)
    temp4  = temp1 + temp2     (a4)
    y(0)   = temp3 + temp4     (a5)
    temp5  = h(0) * x(1)       (b1)
    temp6  = h(1) * x(2)       (b2)
    temp7  = h(2) * x(3)       (b3)
    temp8  = temp5 + temp6     (b4)
    y(1)   = temp7 + temp8     (b5)
    temp9  = h(0) * x(2)       (c1)
    temp10 = h(1) * x(3)       (c2)
    temp11 = h(2) * x(4)       (c3)
    temp12 = temp9 + temp10    (c4)
    y(2)   = temp11 + temp12   (c5)
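As an illustration of how such intermediate code might be produced, the sketch below emits the same fifteen statements for the 3-tap filter. The function `fir_intermediate_code` and its text-based output are hypothetical; the patent does not specify the format of the intermediate language beyond the listing above. The sketch assumes at least two taps.

```python
from typing import List


def fir_intermediate_code(num_outputs: int = 3, taps: int = 3) -> List[str]:
    """Emit three-address code for y(n) = sum over k of h(k) * x(n + k)."""
    code: List[str] = []
    t = 0  # number of temporaries issued so far
    for n in range(num_outputs):
        # One multiplication per tap, each into a fresh temporary.
        products = []
        for k in range(taps):
            t += 1
            products.append(f"temp{t}")
            code.append(f"temp{t} = h({k}) * x({n + k})")
        # Sum the products pairwise; the final sum is stored in y(n).
        acc = products[0]
        for i in range(1, taps):
            rhs = f"{acc} + {products[1]}" if i == 1 else f"{products[i]} + {acc}"
            if i == taps - 1:
                code.append(f"y({n}) = {rhs}")
            else:
                t += 1
                acc = f"temp{t}"
                code.append(f"{acc} = {rhs}")
    return code


code = fir_intermediate_code()
```

With the default arguments, `code` has fifteen entries that match expressions (a1) to (c5) in order.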

[0040] If DSP_m has a sufficient number of registers, the only data dependences among expressions (a1) to (a5), (b1) to (b5), and (c1) to (c5) (hereinafter, expressions (a1) to (c5)) are the flow dependences described above. Writing the flow dependence in which the definitions in expressions (a1) and (a2) are referenced in expression (a4) as

    (a1), (a2) → (a4)

the data dependences among expressions (a1) to (a5), (b1) to (b5), and (c1) to (c5) are:

    (a1), (a2) → (a4)
    (a3), (a4) → (a5)
    (b1), (b2) → (b4)
    (b3), (b4) → (b5)
    (c1), (c2) → (c4)
    (c3), (c4) → (c5)
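These flow dependences can be derived mechanically from the intermediate code by recording which expression defines each temporary. The sketch below is an illustration only; the names `LABELS`, `CODE`, and `flow_dependences` are hypothetical, and the code is given as text lines rather than in whatever form the compiler actually uses.

```python
import re
from typing import Dict, List

LABELS = [f"{row}{i}" for row in "abc" for i in range(1, 6)]  # a1 ... c5

CODE = [
    "temp1 = h(0) * x(0)", "temp2 = h(1) * x(1)", "temp3 = h(2) * x(2)",
    "temp4 = temp1 + temp2", "y(0) = temp3 + temp4",
    "temp5 = h(0) * x(1)", "temp6 = h(1) * x(2)", "temp7 = h(2) * x(3)",
    "temp8 = temp5 + temp6", "y(1) = temp7 + temp8",
    "temp9 = h(0) * x(2)", "temp10 = h(1) * x(3)", "temp11 = h(2) * x(4)",
    "temp12 = temp9 + temp10", "y(2) = temp11 + temp12",
]


def flow_dependences(labels: List[str], code: List[str]) -> Dict[str, List[str]]:
    """Map each expression label to the labels whose results it references."""
    defined_by: Dict[str, str] = {}
    deps: Dict[str, List[str]] = {}
    for label, line in zip(labels, code):
        target, rhs = [part.strip() for part in line.split("=")]
        sources = [defined_by[name] for name in re.findall(r"temp\d+", rhs)]
        if sources:
            deps[label] = sources
        defined_by[target] = label
    return deps


deps = flow_dependences(LABELS, CODE)
# deps["a4"] == ["a1", "a2"], deps["a5"] == ["a3", "a4"], and so on
```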

[0041] When DSP_m does not have an architecture capable of parallel execution, that is, when DSP_m has, for example, only one two-input, one-output arithmetic unit, the scheduling unit 31_m assigns (schedules) the operations of expressions (a1) to (c5) to that single arithmetic unit as shown in FIG. 5.

[0042] That is, assuming that the arithmetic unit in DSP_m completes an operation in one clock, the scheduling unit 31_m assigns the operations of expressions (a1) to (c5) so that they are performed by the arithmetic unit at clocks 1 to 15, respectively.
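The single-unit case amounts to running the expressions in program order, one per clock. The short sketch below builds that schedule and checks that it respects the flow dependences of paragraph [0040]; the names `DEPS`, `LABELS`, and `clock_of` are hypothetical.

```python
# Flow dependences of expressions (a1) to (c5): consumer -> producers.
DEPS = {}
for row in "abc":
    DEPS[f"{row}4"] = [f"{row}1", f"{row}2"]
    DEPS[f"{row}5"] = [f"{row}3", f"{row}4"]

# Program order a1..a5, b1..b5, c1..c5.
LABELS = [f"{row}{i}" for row in "abc" for i in range(1, 6)]

# One unit, one operation per clock: the k-th expression runs at clock k.
clock_of = {label: clock for clock, label in enumerate(LABELS, start=1)}

# A schedule is valid if every producer runs before its consumer.
valid = all(
    clock_of[src] < clock_of[dst] for dst, srcs in DEPS.items() for src in srcs
)
```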

[0043] When DSP_m has an architecture capable of parallel execution, that is, when DSP_m has as arithmetic units, for example, nine two-input, one-output multipliers X_1 to X_9 and six two-input, one-output adders Y_1 to Y_6, the scheduling unit 31_m assigns (schedules) the operations of expressions (a1) to (c5) to the multipliers X_1 to X_9 and adders Y_1 to Y_6 in DSP_m as shown in FIG. 6.

[0044] That is, assuming that the multipliers X_1 to X_9 and the adders Y_1 to Y_6 each complete an operation in one clock, the scheduling unit 31_m assigns the operations as follows. At clock 1, the multiplications of expressions (a1) to (a3), (b1) to (b3), and (c1) to (c3) are performed by multipliers X_1 to X_9, respectively. At clock 2, the additions of expressions (a4), (b4), and (c4) are performed by adders Y_1, Y_3, and Y_5, respectively. At clock 3, the additions of expressions (a5), (b5), and (c5) are performed by adders Y_2, Y_4, and Y_6, respectively.

[0045] Further, when DSP_m has as arithmetic units, for example, three two-input, one-output multipliers X_1 to X_3 and two two-input, one-output adders Y_1 and Y_2, the scheduling unit 31_m assigns (schedules) the operations of expressions (a1) to (c5) to the multipliers X_1 to X_3 and adders Y_1 and Y_2 in DSP_m as shown in FIG. 7.

[0046] That is, assuming that the multipliers X₁ to X₃ and the adders Y₁ and Y₂ each operate in one clock, the scheduling unit 31_m makes the following assignments. In the first clock, the operations of expressions (a1) to (a3) are assigned to the multipliers X₁ to X₃, respectively. In the second clock, the operations of expressions (b1) to (b3) and expression (a4) are assigned to the multipliers X₁ to X₃ and the adder Y₁, respectively. In the third clock, the operations of expressions (c1) to (c3), expression (b4), and expression (a5) are assigned to the multipliers X₁ to X₃, the adder Y₁, and the adder Y₂, respectively. In the fourth clock, the operations of expressions (c4) and (b5) are assigned to the adders Y₁ and Y₂, respectively, and in the fifth clock, the operation of expression (c5) is assigned to the adder Y₂.

[0047] When DSP_m has as arithmetic units, for example, two 2-input, 1-output multipliers X₁ and X₂ and one 2-input, 1-output adder Y₁, the scheduling unit 31_m assigns (schedules) the operations represented by expressions (a1) to (c5) to the multipliers X₁ and X₂ and the adder Y₁ built into DSP_m, as shown in FIG. 8.

[0048] That is, assuming that the multipliers X₁ and X₂ and the adder Y₁ each operate in one clock, the scheduling unit 31_m assigns the operations of expressions (a1), (a3), (b2), (c1), and (c3) to the multiplier X₁ in the first to fifth clocks, respectively; the operations of expressions (a2), (b1), (b3), and (c2) to the multiplier X₂ in the first to fourth clocks, respectively; and the operations of expressions (a4), (a5), (b4), (b5), (c4), and (c5) to the adder Y₁ in the second to seventh clocks, respectively.
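The three schedules described above can be reproduced with a simple greedy list scheduler. The following Python sketch is only an illustration, not the scheduling unit 31_m itself: the names `Op`, `fir_ops`, and `list_schedule` are hypothetical, and the dependencies among expressions (a1) to (c5) are assumed from the object-code discussion (three multiplications, then an addition of the first two products, then an addition of the third product to that sum).

```python
from collections import namedtuple

# Hypothetical model of one intermediate-code operation.
Op = namedtuple("Op", "name kind deps")

def fir_ops():
    """Operations (a1)..(c5): per group, three multiplies and two chained adds."""
    ops = []
    for g in "abc":
        ops += [
            Op(g + "1", "mul", ()),
            Op(g + "2", "mul", ()),
            Op(g + "3", "mul", ()),
            Op(g + "4", "add", (g + "1", g + "2")),
            Op(g + "5", "add", (g + "3", g + "4")),
        ]
    return ops

def list_schedule(ops, n_mul, n_add):
    """Greedy list scheduling, one clock per operation.

    Returns {operation name: (clock, unit name)}, clocks starting at 1.
    """
    done = {}              # operations finished in an earlier clock
    pending = list(ops)    # kept in program order, which acts as the priority
    clock = 0
    while pending:
        clock += 1
        free = {
            "mul": ["X%d" % i for i in range(1, n_mul + 1)],
            "add": ["Y%d" % i for i in range(1, n_add + 1)],
        }
        issued, waiting = {}, []
        for op in pending:
            ready = all(dep in done for dep in op.deps)
            if ready and free[op.kind]:
                issued[op.name] = (clock, free[op.kind].pop(0))
            else:
                waiting.append(op)
        if not issued:
            raise ValueError("no unit can make progress")
        done.update(issued)   # results become visible from the next clock on
        pending = waiting
    return done

def total_clocks(schedule):
    return max(clock for clock, _unit in schedule.values())

if __name__ == "__main__":
    for n_mul, n_add in [(9, 6), (3, 2), (2, 1)]:
        schedule = list_schedule(fir_ops(), n_mul, n_add)
        print(n_mul, "multipliers,", n_add, "adders:",
              total_clocks(schedule), "clocks")
```

Under these assumptions the sketch needs 3, 5, and 7 clocks for the three configurations, matching the text. The clock in which each operation runs agrees with the description, although the particular adder chosen within a clock may differ from the figures.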

[0049] When scheduling by the scheduling unit 31_m is complete, the code generation/register allocation unit 32_m generates object code for DSP_m from the intermediate code shown in expressions (a1) to (c5), while performing register allocation to assign variables efficiently to the registers of DSP_m.

[0050] When DSP_m has three registers reg1 to reg3, the code generation/register allocation unit 32_m allocates the variables temp1 to temp12 of expressions (a1) to (c5) to the registers reg1 to reg3 and generates object code for DSP_m from the intermediate code of expressions (a1) to (c5), for example as follows:

  MUL h(0), x(0), reg1    (A1)
  MUL h(1), x(1), reg2    (A2)
  MUL h(2), x(2), reg3    (A3)
  ST  reg3, mem1          (A4)
  ADD reg1, reg2, reg3    (A5)
  LD  mem1, reg1          (A6)
  ADD reg1, reg3, y(0)    (A7)
  MUL h(0), x(1), reg1    (B1)
  MUL h(1), x(2), reg2    (B2)
  MUL h(2), x(3), reg3    (B3)
  ST  reg3, mem2          (B4)
  ADD reg1, reg2, reg3    (B5)
  LD  mem2, reg1          (B6)
  ADD reg1, reg3, y(1)    (B7)
  MUL h(0), x(2), reg1    (C1)
  MUL h(1), x(3), reg2    (C2)
  MUL h(2), x(4), reg3    (C3)
  ST  reg3, mem3          (C4)
  ADD reg1, reg2, reg3    (C5)
  LD  mem3, reg1          (C6)
  ADD reg1, reg3, y(2)    (C7)

[0051] Object codes (A1) to (A3) represent reg1 = h(0) * x(0), reg2 = h(1) * x(1), and reg3 = h(2) * x(2), respectively, and correspond to expressions (a1) to (a3). Because DSP_m has only the three registers reg1 to reg3, in order to compute expression (a4), object code (A4) saves (stores) the contents of register reg3 (the contents of variable temp3 of expression (a3)) to the stack location mem1, and object code (A5) computes reg3 = reg1 + reg2, which corresponds to expression (a4).

[0052] Object code (A6) then loads the contents of variable temp3 of expression (a3), that is, h(2) * x(2), from the stack location mem1 into register reg1, and object code (A7) computes y(0) = reg1 + reg3, which corresponds to expression (a5).

[0053] Object codes (B1) to (B7) and (C1) to (C7) perform the processing corresponding to expressions (b1) to (b5) and (c1) to (c5), respectively, in the same manner as object codes (A1) to (A7).
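To check that the spill through mem1 to mem3 still yields the intended sums, the object code can be run on a small interpreter. The following Python sketch is an assumption-laden illustration: `fir_object_code` and `run` are hypothetical helpers, operands are taken to be written sources first and destination last as in the listing, and the instructions are executed strictly one after another rather than in parallel.

```python
import re

def fir_object_code(n):
    """Object code for output y(n), following the pattern of (A1) to (A7)."""
    slot = "mem%d" % (n + 1)
    return [
        ("MUL", "h(0)", "x(%d)" % n, "reg1"),
        ("MUL", "h(1)", "x(%d)" % (n + 1), "reg2"),
        ("MUL", "h(2)", "x(%d)" % (n + 2), "reg3"),
        ("ST", "reg3", slot),                  # spill the third product
        ("ADD", "reg1", "reg2", "reg3"),
        ("LD", slot, "reg1"),                  # reload the spilled product
        ("ADD", "reg1", "reg3", "y(%d)" % n),
    ]

def run(program, h, x):
    """Execute the instructions in order; returns all written locations."""
    store = {}  # registers, stack slots and outputs, keyed by operand text

    def read(operand):
        match = re.fullmatch(r"([hx])\((\d+)\)", operand)
        if match:
            table = h if match.group(1) == "h" else x
            return table[int(match.group(2))]
        return store[operand]

    for opcode, *sources, dest in program:
        values = [read(s) for s in sources]
        if opcode == "MUL":
            store[dest] = values[0] * values[1]
        elif opcode == "ADD":
            store[dest] = values[0] + values[1]
        elif opcode in ("LD", "ST"):
            store[dest] = values[0]
        else:
            raise ValueError("unknown opcode: " + opcode)
    return store

def fir_outputs(h, x):
    program = fir_object_code(0) + fir_object_code(1) + fir_object_code(2)
    result = run(program, h, x)
    return [result["y(%d)" % n] for n in range(3)]

if __name__ == "__main__":
    print(fir_outputs([1, 2, 3], [4, 5, 6, 7, 8]))  # prints [32, 38, 44]
```

With h = [1, 2, 3] and x = [4, 5, 6, 7, 8], each output equals h(0)*x(n) + h(1)*x(n+1) + h(2)*x(n+2), which is what expressions (a1) to (c5) compute.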

[0054] When DSP_m has, as arithmetic units that operate in one clock, for example, three 2-input, 1-output multipliers X₁ to X₃ and two 2-input, 1-output adders Y₁ and Y₂, and performs a data load or store in one clock, the object codes (A1) to (A7), (B1) to (B7), and (C1) to (C7) above are executed as shown in FIG. 9.

[0055] That is, in the first clock, the operations of expressions (a1) to (a3), corresponding to object codes (A1) to (A3), are performed by the multipliers X₁ to X₃, respectively. In the second clock, the operations of expressions (b1) to (b3), corresponding to object codes (B1) to (B3), are performed by the multipliers X₁ to X₃, respectively, and register reg3 is stored to the stack location mem1, corresponding to object code (A4).

[0056] In the third clock, the operations of expressions (c1) to (c3) and expression (a4), corresponding to object codes (C1) to (C3) and (A5), are performed by the multipliers X₁ to X₃ and the adder Y₁, respectively, and register reg3 is stored to the stack location mem2, corresponding to object code (B4). In the fourth clock, the operation of expression (b4), corresponding to object code (B5), is performed by the adder Y₁; register reg3 is stored to the stack location mem3, corresponding to object code (C4); and the stack location mem1 is loaded into register reg1, corresponding to object code (A6).

[0057] In the fifth clock, the operations of expressions (c4) and (a5), corresponding to object codes (C5) and (A7), are performed by the adders Y₁ and Y₂, respectively, and the stack location mem2 is loaded into register reg1, corresponding to object code (B6). In the sixth clock, the operation of expression (b5), corresponding to object code (B7), is performed by the adder Y₂, and the stack location mem3 is loaded into register reg1, corresponding to object code (C6).

[0058] Then, in the seventh clock, the operation of expression (c5), corresponding to object code (C7), is performed by the adder Y₂, and the processing ends.

[0059] Next, when the object code for DSP_m generated as described above is supplied to the optimization unit 33_m, the optimization unit 33_m applies DSP_m-dependent optimization to that object code.

[0060] That is, consider, for example, object code such as

  LD mem, reg    (D1)
  ST reg, mem    (D2)

which loads data from memory mem into register reg (D1) and then immediately stores the data of register reg back to memory mem (D2). Storing the same contents (data) to memory mem is wasteful, so the optimization unit 33_m deletes object code (D2) (elimination of redundant load/store instructions).

[0061] Likewise, consider, for example, object code such as

  ADD 0, reg, reg    (E1)
  MUL 1, reg, reg    (E2)

which adds 0 to register reg and stores the result in register reg (E1), or multiplies register reg by 1 and stores the result in register reg (E2). Adding 0 or multiplying by 1 does not change the result and is therefore also wasteful, so the optimization unit 33_m deletes both object codes (E1) and (E2) (algebraic simplification).
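The two DSP-dependent clean-ups just described, deleting a store that immediately follows a load of the same location, as in (D1)/(D2), and deleting additions of 0 or multiplications by 1, as in (E1)/(E2), are commonly implemented as a peephole pass. The Python sketch below is one possible form under stated assumptions: the tuple encoding of instructions and the function name `peephole` are hypothetical, constants are encoded as integers, and registers and memory locations as strings.

```python
def peephole(program):
    """Return a copy of program with the redundant instructions removed.

    Instruction encoding (an assumption for this sketch):
      ("LD", mem, reg), ("ST", reg, mem), ("ADD" or "MUL", src1, src2, dest)
    """
    result = []
    for ins in program:
        opcode = ins[0]
        # (E1): reg + 0 written back to the same register changes nothing.
        if opcode == "ADD" and ins[1] == 0 and ins[2] == ins[3]:
            continue
        # (E2): reg * 1 written back to the same register changes nothing.
        if opcode == "MUL" and ins[1] == 1 and ins[2] == ins[3]:
            continue
        # (D2): a store of reg to mem directly after LD mem, reg is redundant.
        if opcode == "ST" and result and result[-1] == ("LD", ins[2], ins[1]):
            continue
        result.append(ins)
    return result

if __name__ == "__main__":
    code = [
        ("LD", "mem", "reg"),             # (D1)
        ("ST", "reg", "mem"),             # (D2), removed
        ("ADD", 0, "reg", "reg"),         # (E1), removed
        ("MUL", 1, "reg", "reg"),         # (E2), removed
        ("ADD", "reg1", "reg2", "reg3"),  # kept
    ]
    for ins in peephole(code):
        print(ins)
```

The pass looks only at the previously kept instruction, so a store to a different memory location, or a store separated from the load by an instruction that does real work, is left untouched.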

[0062] As described above, in this compiler, preprocessing is performed by the language L₁ front-end unit 1₁ to the language L_N front-end unit 1_N, which correspond to the languages L₁ to L_N in which programs are written; post-processing is performed by the DSP₁ back-end unit 3₁ to the DSP_M back-end unit 3_M, which output object code corresponding to DSP₁ to DSP_M; and the processing (phases) that depends on neither the languages L₁ to L_N nor DSP₁ to DSP_M is performed by the common module unit 2. Consequently, a program written in any of the languages L₁ to L_N, and targeting any of DSP₁ to DSP_M, can be compiled.
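The structural point, that N front ends and M back ends sharing one intermediate code cover all N × M language/DSP combinations, can be illustrated with a toy driver. Everything in the following Python sketch is hypothetical: the two miniature "languages", the two instruction styles, and all function names are invented for illustration and are not taken from the patent.

```python
def front_end_infix(source):
    """Toy language L1, e.g. 'y = h * x'."""
    dest, expr = [part.strip() for part in source.split("=")]
    a, symbol, b = expr.split()
    return [({"*": "mul", "+": "add"}[symbol], a, b, dest)]

def front_end_prefix(source):
    """Toy language L2, e.g. '(set y (mul h x))'."""
    tokens = source.replace("(", " ").replace(")", " ").split()
    _set, dest, op, a, b = tokens
    return [(op, a, b, dest)]

def common_module(ir):
    """Placeholder for language- and DSP-independent analysis/optimization."""
    return list(ir)

def back_end_three_operand(ir):
    """Toy DSP1: three-operand instructions."""
    return ["%s %s, %s, %s" % (op.upper(), a, b, dest) for op, a, b, dest in ir]

def back_end_accumulator(ir):
    """Toy DSP2: accumulator-style instructions."""
    code = []
    for op, a, b, dest in ir:
        code += ["LOAD " + a, "%s %s" % (op.upper(), b), "STORE " + dest]
    return code

FRONT_ENDS = {"L1": front_end_infix, "L2": front_end_prefix}
BACK_ENDS = {"DSP1": back_end_three_operand, "DSP2": back_end_accumulator}

def compile_program(source, language, dsp):
    ir = FRONT_ENDS[language](source)   # language-specific preprocessing
    ir = common_module(ir)              # shared, independent phases
    return BACK_ENDS[dsp](ir)           # DSP-specific post-processing

if __name__ == "__main__":
    print(compile_program("y = h * x", "L1", "DSP1"))
    print(compile_program("(set y (mul h x))", "L2", "DSP2"))
```

Adding a further language in this sketch means adding one entry to `FRONT_ENDS`, and adding a further DSP means adding one entry to `BACK_ENDS`; neither change touches the common module.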

[0063]

[Effects of the Invention] According to the compiling method of claim 1, the source code of a digital signal processor (DSP) program is converted into intermediate code that is common to a plurality of languages for writing DSP programs, or to object codes respectively corresponding to a plurality of DSPs, and the intermediate code is converted into object code corresponding to the DSP. A program can therefore be compiled independently of the DSP and the language.

[0064] According to the compiler of claim 2, the source code of a digital signal processor (DSP) program is converted into intermediate code common to the languages, and the intermediate code is analyzed and optimized. The optimized intermediate code is then converted into object code corresponding to each of a plurality of DSPs. A program can therefore be compiled independently of the DSP and the language.

[0065] According to the compiler of claim 3, the common processing means applies to the intermediate code an optimization that does not depend on any of the plurality of DSPs, so wasteful processing caused by duplicated phases is prevented.

[0066] In the compiler of claim 4, the post-processing means divides the intermediate code into units that can be executed in parallel and assigns them to the DSP, so the capabilities of a DSP capable of parallel processing can be used effectively.

[0067] In the compiler of claim 5, the post-processing means performs register allocation, so the registers of the DSP can be used efficiently.

[Brief description of drawings]

FIG. 1 is a block diagram showing the configuration of an embodiment of the compiler of the present invention.

FIG. 2 is a diagram showing in more detail the language L₁ front-end unit 1₁ to the language L_N front-end unit 1_N of the embodiment of FIG. 1.

FIG. 3 is a diagram showing in more detail the common module unit 2 of the embodiment of FIG. 1.

FIG. 4 is a diagram showing in more detail the DSP₁ back-end unit 3₁ to the DSP_M back-end unit 3_M of the embodiment of FIG. 1.

FIG. 5 is a diagram for explaining the scheduling performed by the scheduling unit of FIG. 4.

FIG. 6 is a diagram for explaining the scheduling performed by the scheduling unit of FIG. 4.

FIG. 7 is a diagram for explaining the scheduling performed by the scheduling unit of FIG. 4.

FIG. 8 is a diagram for explaining the scheduling performed by the scheduling unit of FIG. 4.

FIG. 9 is a diagram for explaining how the object code corresponding to DSP_m, generated by the code generation/register allocation unit 32_m of FIG. 4, is executed.

[Explanation of symbols]

1  Front-end unit
1₁ to 1_N  Language L₁ front-end unit to language L_N front-end unit
2  Common module unit
3  Back-end unit
3₁ to 3_M  DSP₁ back-end unit to DSP_M back-end unit
11_n  Lexical analysis unit
12_n  Syntax analysis unit
13_n  Semantic analysis unit
21  Dependency analysis unit
22  Optimization unit
31_m  Scheduling unit
32_m  Code generation/register allocation unit
33_m  Optimization unit

Claims

[Claims]

1. A compiling method that converts source code of a program for a digital signal processor into intermediate code and converts the intermediate code into object code corresponding to the digital signal processor, characterized in that the intermediate code is common to a plurality of languages for describing programs of the digital signal processor, or to object codes respectively corresponding to a plurality of digital signal processors.

2. A compiler comprising: preprocessing means for converting source code of a program for a digital signal processor into intermediate code; common processing means for analyzing and optimizing the intermediate code output from the preprocessing means; and post-processing means for converting the output of the common processing means into object code corresponding to the digital signal processor, characterized in that the preprocessing means outputs intermediate code common to a plurality of languages for describing programs of the digital signal processor, and the post-processing means converts the intermediate code into object codes respectively corresponding to a plurality of digital signal processors.

3. The compiler according to claim 2, wherein the common processing means performs an optimization process on the intermediate code that does not depend on any of the plurality of digital signal processors.

4. The compiler according to claim 2, wherein the post-processing means divides the intermediate code into units that can be executed in parallel and assigns the units to the digital signal processor.

5. The compiler according to claim 2, 3 or 4, wherein the post-processing means performs register allocation.