TWI876580B

TWI876580B - System and method for obtaining accurate surname

Info

Publication number: TWI876580B
Application number: TW112138579A
Authority: TW
Inventors: 游政儒; 黃至德
Original assignee: 中華電信股份有限公司
Priority date: 2023-10-06
Filing date: 2023-10-06
Publication date: 2025-03-11
Also published as: TW202516494A

Abstract

A system and a method for obtaining an accurate surname are provided. The method includes following steps: when an accurate-surname-confirmation-module judges that a text does not include a precise-surname-statement and does not include a confirmation-surname-sentence, extracting, by the accurate-surname-confirmation-module, a surname from the text; using, by a similar-sound-acquisition-module, to obtain a similar-note-text-sequence; using, by a TOP-N-surname-generation-module, the similar-note-text-sequence to obtain a TOP-N-surname; and generating, by a precise-surname-answer-sentence-generation-module, a precise-surname-querying-back-sentence corresponding to the TOP-N-surname.

Description

System and method for obtaining accurate surname

本發明是有關於一種用於獲得精準姓氏的系統及方法。The present invention relates to a system and method for obtaining an accurate surname.

當客戶連線/致電至一般現有的對話服務系統時，若此時對話服務系統需要確認客戶的姓氏，則現有的文字辨識技術常因為中文存在高機率的同音字而誤判，造成客戶的觀感不佳。When a customer connects/calls a general existing conversation service system, if the conversation service system needs to confirm the customer's surname at this time, the existing text recognition technology often misjudges because there is a high probability of homophones in Chinese, resulting in a bad impression for the customer.

本發明的用於獲得精準姓氏的系統包括儲存媒體以及處理器。儲存媒體儲存多個模組，其中多個模組包括精準姓氏確認模組、相似音取得模組、TOP N姓氏生成模組以及精準姓氏回問語句生成模組。處理器耦接儲存媒體，其中處理器存取和執行多個模組來執行以下各步驟：當精準姓氏確認模組判斷文字不包括精準姓氏說法且不包括確認姓氏語句時，精準姓氏確認模組從文字中擷取出姓氏；相似音取得模組利用姓氏獲得相似音符文字序列；TOP N姓氏生成模組利用相似音符文字序列獲得TOP N姓氏；以及精準姓氏回問語句生成模組產生對應於TOP N姓氏的精準姓氏回問問句。The system for obtaining an accurate surname of the present invention includes a storage medium and a processor. The storage medium stores a plurality of modules, wherein the plurality of modules include an accurate surname confirmation module, a similar sound acquisition module, a TOP N surname generation module, and an accurate surname answer sentence generation module. The processor is coupled to the storage medium, wherein the processor accesses and executes the plurality of modules to execute the following steps: when the accurate surname confirmation module determines that the text does not include an accurate surname statement and does not include a surname confirmation sentence, the accurate surname confirmation module extracts the surname from the text; the similar sound acquisition module uses the surname to obtain a similar musical note text sequence; the TOP N surname generation module uses the similar musical note text sequence to obtain the TOP N surnames; and the accurate surname answer sentence generation module generates an accurate surname answer sentence corresponding to the TOP N surnames.

本發明的用於獲得精準姓氏的方法包括以下步驟：當精準姓氏確認模組判斷文字不包括精準姓氏說法且不包括確認姓氏語句時，由精準姓氏確認模組從文字中擷取出姓氏；由相似音取得模組利用姓氏獲得相似音符文字序列；由TOP N姓氏生成模組利用相似音符文字序列獲得TOP N姓氏；以及由精準姓氏回問語句生成模組產生對應於TOP N姓氏的精準姓氏回問問句。 The method for obtaining an accurate surname of the present invention comprises the following steps: when the accurate surname confirmation module determines that the text does not include an accurate surname statement and does not include a surname confirmation sentence, the accurate surname confirmation module extracts the surname from the text; the similar sound acquisition module uses the surname to obtain a similar musical note text sequence; the TOP N surname generation module uses the similar musical note text sequence to obtain the TOP N surnames; and the accurate surname question answer sentence generation module generates an accurate surname question answer sentence corresponding to the TOP N surnames .

圖1是根據本發明的一實施例繪示的用於獲得精準姓氏的系統1的示意圖。系統1可包括儲存媒體以及處理器。Fig. 1 is a schematic diagram of a system 1 for obtaining an accurate surname according to an embodiment of the present invention. The system 1 may include a storage medium and a processor.

儲存媒體例如是任何型態的固定式或可移動式的隨機存取記憶體（random access memory，RAM）、唯讀記憶體（read-only memory，ROM）、快閃記憶體（flash memory）、硬碟（hard disk drive，HDD）、固態硬碟（solid state drive，SSD）或類似元件或上述元件的組合，而用於儲存可由處理器執行的多個模組或各種應用程式。在本實施例中，儲存媒體可儲存多個模組，其中所述多個模組可包括精準姓氏確認模組20、相似音取得模組30、TOP N姓氏生成模組40以及精準姓氏回問語句生成模組50。在其他實施例中，所述多個模組還可包括語音辨別與文字轉換模組10。在其他實施例中，儲存媒體還可儲存中文文字字庫11、姓氏說法資料庫21以及百家姓資料庫41。The storage medium is, for example, any type of fixed or removable random access memory (RAM), read-only memory (ROM), flash memory, hard disk drive (HDD), solid state drive (SSD) or similar components or a combination of the above components, and is used to store multiple modules or various applications that can be executed by the processor. In this embodiment, the storage medium can store multiple modules, wherein the multiple modules may include an accurate surname confirmation module 20, a similar sound acquisition module 30, a TOP N surname generation module 40, and an accurate surname question sentence generation module 50. In other embodiments, the multiple modules may also include a voice recognition and text conversion module 10. In other embodiments, the storage medium can also store the Chinese character database 11, the surname database 21, and the Hundred Family Surnames database 41.

處理器例如是中央處理單元（central processing unit，CPU），或是其他可程式化之一般用途或特殊用途的微控制單元（micro control unit，MCU）、微處理器（microprocessor）、數位信號處理器（digital signal processor，DSP）、可程式化控制器、特殊應用積體電路（application specific integrated circuit，ASIC）、圖形處理器（graphics processing unit，GPU）、影像訊號處理器（image signal processor，ISP）、影像處理單元（image processing unit，IPU）、算數邏輯單元（arithmetic logic unit，ALU）、複雜可程式邏輯裝置（complex programmable logic device，CPLD）、現場可程式化邏輯閘陣列（field programmable gate array，FPGA）或其他類似元件或上述元件的組合。處理器可耦接至儲存媒體，並且存取和執行儲存於儲存媒體中的多個模組和各種應用程式。The processor is, for example, a central processing unit (CPU), or other programmable general-purpose or special-purpose micro control unit (MCU), microprocessor, digital signal processor (DSP), programmable controller, application specific integrated circuit (ASIC), graphics processing unit (GPU), image signal processor (ISP), image processing unit (IPU), arithmetic logic unit (ALU), complex programmable logic device (CPLD), field programmable gate array (FPGA), or other similar components or combinations of the above components. The processor may be coupled to the storage medium and access and execute a plurality of modules and various applications stored in the storage medium.

圖2是圖1所示的系統1的運作流程圖。請同時參照圖1及圖2。FIG2 is an operation flow chart of the system 1 shown in FIG1. Please refer to FIG1 and FIG2 at the same time.

在步驟S100中，語音辨別與文字轉換模組10可將語音轉換為文字。以下將以圖3進一步說明。In step S100, the speech recognition and text conversion module 10 can convert speech into text. This will be further described below with reference to FIG.

圖3是圖2所示的步驟S100的詳細流程圖。請同時參照圖1、圖2及圖3。FIG3 is a detailed flow chart of step S100 shown in FIG2. Please refer to FIG1, FIG2 and FIG3 at the same time.

在步驟S101中，語音辨別與文字轉換模組10可從客戶設備接收語音。詳細而言，圖1所示的系統1還可包括收發器（圖1未繪示）。語音辨別與文字轉換模組10可通過收發器從客戶設備（圖1未繪示）接收語音。客戶設備例如是電話或智慧音箱設備，然而本發明不限於此。舉例來說，當客戶操作客戶設備以連線至系統1時，系統1可詢問「請問客戶貴姓？」。以下實施例將以語音辨別與文字轉換模組10從客戶設備接收的語音為「我姓王」來繼續說明。In step S101, the speech recognition and text conversion module 10 can receive speech from a client device. In detail, the system 1 shown in FIG1 may further include a transceiver (not shown in FIG1 ). The speech recognition and text conversion module 10 can receive speech from a client device (not shown in FIG1 ) via the transceiver. The client device is, for example, a telephone or a smart speaker device, but the present invention is not limited thereto. For example, when a customer operates a client device to connect to the system 1, the system 1 may ask, "What is your last name?" The following embodiment will be described assuming that the speech received by the speech recognition and text conversion module 10 from the client device is "My last name is Wang".

在步驟S102中，語音辨別與文字轉換模組10可對語音執行語音識別操作以獲得識別後語音。詳細而言，語音辨別與文字轉換模組10可對語音進行前處理。由於不同客戶身處的環境不同，客戶距離客戶設備的收音設備/麥克風的距離也不同，且各種噪音會影響收音結果。因此，語音辨別與文字轉換模組10可將環境的雜音去除，並識別出可用的識別後語音（人聲音頻）。In step S102, the speech recognition and text conversion module 10 can perform speech recognition operation on the speech to obtain recognized speech. In detail, the speech recognition and text conversion module 10 can perform pre-processing on the speech. Since different customers are in different environments, the distances between the customers and the sound receiving device/microphone of the customer equipment are also different, and various noises will affect the sound receiving results. Therefore, the speech recognition and text conversion module 10 can remove the noise of the environment and recognize the usable recognized speech (human voice frequency).

在步驟S103中，語音辨別與文字轉換模組10可利用識別後語音以及中文文字字庫11獲得對應於語音的文字。詳細而言，語音辨別與文字轉換模組10可利用中文文字字庫11內的中文文字內容來產生對應於識別後語音的文字。承前述實施例，語音辨別與文字轉換模組10可獲得文字「我姓王」。In step S103, the speech recognition and text conversion module 10 can use the recognized speech and the Chinese character library 11 to obtain text corresponding to the speech. Specifically, the speech recognition and text conversion module 10 can use the Chinese character content in the Chinese character library 11 to generate text corresponding to the recognized speech. Based on the above embodiment, the speech recognition and text conversion module 10 can obtain the text "My surname is Wang".

請回到圖2。在步驟S200中，當精準姓氏確認模組20判斷文字不包括精準姓氏說法且不包括確認姓氏語句時，精準姓氏確認模組20可從文字中擷取出姓氏。以下將以圖4進一步說明。Please return to FIG. 2. In step S200, when the precise surname confirmation module 20 determines that the text does not include the precise surname statement and does not include the surname confirmation sentence, the precise surname confirmation module 20 can extract the surname from the text. This will be further explained below with reference to FIG. 4.

圖4是圖2所示的步驟S200的詳細流程圖。請同時參照圖1、圖2及圖4。FIG4 is a detailed flow chart of step S200 shown in FIG2. Please refer to FIG1, FIG2 and FIG4 at the same time.

在步驟S201中，精準姓氏確認模組20可利用姓氏說法資料庫21判斷文字是否包括精準姓氏說法。In step S201, the accurate surname confirmation module 20 may use the surname saying database 21 to determine whether the text includes an accurate surname saying.

當精準姓氏確認模組20判斷文字包括精準姓氏說法時（步驟S201的判斷結果為「是」），在步驟S202中，精準姓氏確認模組20可採納精準姓氏說法。舉例來說，若此時文字包括精準姓氏說法「我姓張，弓長張」或者精準姓氏說法「我姓黃，草頭黃」，則精準姓氏確認模組20可採納此精準姓氏說法。然後，精準姓氏確認模組20可執行步驟S203以結束與客戶設備的對話。換言之，此時系統1將不執行步驟S300、S400及S500。When the precise surname confirmation module 20 determines that the text includes the precise surname statement (the determination result of step S201 is "yes"), in step S202, the precise surname confirmation module 20 can adopt the precise surname statement. For example, if the text includes the precise surname statement "My surname is Zhang, and my bow is Zhang" or the precise surname statement "My surname is Huang, and my grass is Huang", the precise surname confirmation module 20 can adopt the precise surname statement. Then, the precise surname confirmation module 20 can execute step S203 to end the dialogue with the client device. In other words, the system 1 will not execute steps S300, S400 and S500 at this time.

當精準姓氏確認模組20判斷文字不包括精準姓氏說法時，精準姓氏確認模組20可判斷文字是否包括確認姓氏語句。詳細而言，當精準姓氏確認模組20判斷文字包括確認姓氏語句時（步驟S204的判斷結果為「是」），在步驟S203中，精準姓氏確認模組20可結束與客戶的對話。舉例來說，假設在經過與特定客戶的多輪對話後文字已包括了確認姓氏語句（例如「對，沒錯」）。由於文字雖然不包括精準姓氏說法但已經包括了確認姓氏語句，精準姓氏確認模組20可執行步驟S203來結束與客戶設備的對話。換言之，此時系統1將不執行步驟S300、S400及S500。When the precise surname confirmation module 20 determines that the text does not include the precise surname statement, the precise surname confirmation module 20 can determine whether the text includes a surname confirmation sentence. In detail, when the precise surname confirmation module 20 determines that the text includes a surname confirmation sentence (the judgment result of step S204 is "yes"), in step S203, the precise surname confirmation module 20 can end the dialogue with the customer. For example, assume that after multiple rounds of dialogue with a specific customer, the text has included a surname confirmation sentence (such as "Yes, that's right"). Since the text does not include the precise surname statement but has included the surname confirmation sentence, the precise surname confirmation module 20 can execute step S203 to end the dialogue with the customer device. In other words, at this time, the system 1 will not execute steps S300, S400 and S500.

另一方面，當精準姓氏確認模組20判斷文字不包括確認姓氏語句時（步驟S204的判斷結果為「否」），在步驟S205中，精準姓氏確認模組20可從文字中擷取出姓氏。承前述實施例，由於文字「我姓王」不包括確認姓氏語句，精準姓氏確認模組20可執行步驟S205。換言之，精準姓氏確認模組20可從文字「我姓王」中擷取出可能的姓氏「王」。On the other hand, when the precise surname confirmation module 20 determines that the text does not include a surname confirmation sentence (the determination result of step S204 is "No"), in step S205, the precise surname confirmation module 20 can extract a surname from the text. Based on the above embodiment, since the text "My surname is Wang" does not include a surname confirmation sentence, the precise surname confirmation module 20 can execute step S205. In other words, the precise surname confirmation module 20 can extract a possible surname "Wang" from the text "My surname is Wang".

請回到圖2。在步驟S300中，相似音取得模組30可利用姓氏獲得相似音符文字序列。以下將以圖5進一步說明。Please return to FIG. 2. In step S300, the similar sound acquisition module 30 can use the surname to obtain a similar phonetic character sequence. FIG. 5 will be used to further illustrate this.

圖5是圖2所示的步驟S300的詳細流程圖。請同時參照圖1、圖2及圖5。FIG5 is a detailed flow chart of step S300 shown in FIG2. Please refer to FIG1, FIG2 and FIG5 at the same time.

在步驟S301中，相似音取得模組30可去除姓氏中的中文音調以獲得保留注音。中文音調可以是中文的1234聲音調。承前述實施例，相似音取得模組30可去除姓氏「王」中的中文音調以獲得保留注音「ㄨㄤ」。In step S301, the similar sound acquisition module 30 can remove the Chinese tones in the surname to obtain the retained phonetic notation. The Chinese tones can be the 1234 tones of Chinese. Based on the above embodiment, the similar sound acquisition module 30 can remove the Chinese tones in the surname "王" to obtain the retained phonetic notation "ㄨㄤ".

在步驟S302中，相似音取得模組30可利用（如下表1所示的）注音子音母音表以及保留注音來獲得相似音符文字序列。承前述實施例，相似音取得模組30可利用注音子音母音表以及保留注音「ㄨㄤ」獲得相似音符文字序列「ㄨㄤ、ㄅㄤ、ㄈㄤ、ㄓㄤ、ㄔㄤ、ㄏㄨㄤ、ㄎㄨㄤ…」。In step S302, the similar sound acquisition module 30 can use the phonetic consonant vowel table and the reserved phonetic notation (as shown in Table 1 below) to obtain a similar musical note text sequence. Based on the above embodiment, the similar sound acquisition module 30 can use the phonetic consonant vowel table and the reserved phonetic notation "ㄨㄤ" to obtain a similar musical note text sequence "ㄨㄤ、ㄅㄤ、ㄈㄤ、ㄓㄤ、ㄔㄤ、ㄏㄨㄤ、ㄎㄨㄤ...".

表1注音子音母音表的範例子音ㄅㄆㄇㄈㄉㄊㄋㄌㄍㄎㄏㄐㄑㄒ可獨立存在子音ㄓㄔㄕㄖㄗㄘㄙ母音ㄧㄨㄩㄚㄛㄜㄝㄞㄟㄠㄡㄢㄣㄤㄥㄦ Table 1 Example of a phonetic consonant and vowel table consonant ㄅㄆㄇㄈㄉㄊㄋㄌㄍㄎㄏㄐㄑㄒ Consonants can exist independently ㄓㄔㄕㄖㄗㄘㄙ vowel ㄧㄨㄩㄚㄛㄜㄝㄞㄟㄠㄡㄢㄣㄤㄥㄦ

請回到圖2。在步驟S400中，TOP N姓氏生成模組40可利用相似音符文字序列獲得TOP N姓氏。以下將以圖6進一步說明。Please return to FIG. 2. In step S400, the TOP N surname generation module 40 can obtain the TOP N surnames using similar phonetic character sequences. This will be further described below with reference to FIG. 6.

圖6是圖2所示的步驟S400的詳細流程圖。請同時參照圖1、圖2及圖6。FIG6 is a detailed flow chart of step S400 shown in FIG2. Please refer to FIG1, FIG2 and FIG6 at the same time.

在步驟S401中，TOP N姓氏生成模組40可利用相似音符文字序列以及百家姓資料庫41來產生可能姓氏序列。承前述實施例，TOP N姓氏生成模組40可比對相似音符文字序列「ㄨㄤ、ㄅㄤ、ㄈㄤ、ㄓㄤ、ㄔㄤ、ㄏㄨㄤ、ㄎㄨㄤ…」以及百家姓資料庫41來產生可能姓氏序列「汪、王、邦、方、房、張、常、黃、匡」。In step S401, the TOP N surname generation module 40 can generate a possible surname sequence using similar phonetic character sequences and the Hundred Family Surnames database 41. Based on the above-mentioned embodiment, the TOP N surname generation module 40 can compare the similar phonetic character sequences "ㄨㄤ、ㄅㄤ、ㄈㄤ、ㄓㄤ、ㄔㄤ、ㄏㄨㄤ、ㄎㄨㄤ..." and the Hundred Family Surnames database 41 to generate a possible surname sequence "汪、王、邦、方、房、张、常、黄、匡".

在步驟S402中，TOP N姓氏生成模組40可從可能姓氏序列中選出TOP N姓氏。承前述實施例，TOP N姓氏生成模組40可從可能姓氏序列「汪、王、邦、方、房、張、常、黃、匡」中選出TOP N姓氏「王」及「黃」。In step S402, the TOP N surname generation module 40 can select TOP N surnames from the possible surname sequence. In the above embodiment, the TOP N surname generation module 40 can select the TOP N surnames "王" and "黄" from the possible surname sequence "汪、王、邦、方、房、张、常、黄、匡".

請回到圖2。在步驟S500中，精準姓氏回問語句生成模組50可產生對應於TOP N姓氏的精準姓氏回問問句。以下將以圖7進一步說明。Please return to FIG. 2. In step S500, the precise surname answer sentence generation module 50 can generate precise surname answer sentence corresponding to the TOP N surnames. FIG. 7 will be used to further illustrate the above.

圖7是圖2所示的步驟S500的詳細流程圖。請同時參照圖1、圖2及圖7。FIG7 is a detailed flow chart of step S500 shown in FIG2. Please refer to FIG1, FIG2 and FIG7 at the same time.

在步驟S501中，精準姓氏回問語句生成模組50可利用姓氏說法資料庫21來選出對應於TOP N姓氏的精準姓氏說法。承前述實施例，精準姓氏回問語句生成模組50可利用姓氏說法資料庫21來選出對應於TOP N姓氏（「王」及「黃」）的精準姓氏說法「三橫一豎王」及「草頭黃」。In step S501, the precise surname answer sentence generation module 50 can use the surname saying database 21 to select the precise surname sayings corresponding to the TOP N surnames. Based on the above embodiment, the precise surname answer sentence generation module 50 can use the surname saying database 21 to select the precise surname sayings "三橫一豎王" and "草头黄" corresponding to the TOP N surnames ("王" and "黄") .

在步驟S502中，精準姓氏回問語句生成模組50可利用精準姓氏說法來產生對應於TOP N姓氏的精準姓氏回問問句。承前述實施例，精準姓氏回問語句生成模組50可利用精準姓氏說法「三橫一豎王」及「草頭黃」來產生精準姓氏回問問句「請問是三橫一豎王，還是草頭黃呢」。In step S502, the precise surname answer sentence generation module 50 can use the precise surname utterance to generate a precise surname answer sentence corresponding to the TOP N surnames. Based on the above embodiment, the precise surname answer sentence generation module 50 can use the precise surname utterance "三橫一竖王" and "草头黄" to generate a precise surname answer sentence "Excuse me, is it 三橫一竖王, or 草头黄?".

在產生精準姓氏回問問句之後，系統1可通過收發器（圖1未繪示）傳送精準姓氏回問問句至客戶設備。待客戶回應之後，便可進入下一輪的對話（即再度回到步驟S100）。若用戶回應了語音「是三橫一豎王」，則在步驟S201中，精準姓氏確認模組20可判斷文字「是三橫一豎王」包括精準姓氏說法「三橫一豎王」，並且可執行步驟S202以及步驟S203（結束與客戶的多輪對話）。After generating the precise surname query sentence, the system 1 can transmit the precise surname query sentence to the client device via a transceiver (not shown in FIG. 1 ). After the client responds, the next round of dialogue can be entered (i.e., returning to step S100 again). If the user responds with the voice "It is three horizontal and one vertical king", then in step S201, the precise surname confirmation module 20 can determine that the text "It is three horizontal and one vertical king" includes the precise surname statement "three horizontal and one vertical king", and can execute steps S202 and S203 (ending multiple rounds of dialogue with the client).

另一方面，若此時用戶回應了語音「都不是，是郎，郎中的郎」，則在步驟S201中，精準姓氏確認模組20可判斷文字「都不是，是郎，郎中的郎」包括了其它的精準姓氏說法，並且可執行步驟S202以及步驟S203（結束與客戶的多輪對話）。On the other hand, if the user responds with the voice "Neither, it's Lang, Lang in Lang" at this time, then in step S201, the precise surname confirmation module 20 can determine that the text "Neither, it's Lang, Lang in Lang" includes other precise surname statements, and can execute steps S202 and S203 (ending multiple rounds of dialogue with the customer).

另一方面，若此時用戶回應了語音「都不是，我姓杭」，則在步驟S201中，精準姓氏確認模組20可判斷文字「都不是，我姓杭」不包括精準姓氏說法。接著，精準姓氏確認模組20可在步驟S204中判斷文字「都不是，我姓杭」不包括確認姓氏語句，並且可執行步驟S205。然後，系統1將繼續執行步驟S300、步驟S400及步驟S500，並且在執行完步驟S500之後，再開始下一輪的對話（即回到步驟S100）。On the other hand, if the user responds with the voice "neither, my surname is Hang", then in step S201, the precise surname confirmation module 20 can determine that the text "neither, my surname is Hang" does not include the precise surname statement. Then, the precise surname confirmation module 20 can determine in step S204 that the text "neither, my surname is Hang" does not include the surname confirmation sentence, and can execute step S205. Then, the system 1 will continue to execute steps S300, S400 and S500, and after executing step S500, start the next round of dialogue (i.e. return to step S100).

另一方面，若此時用戶回應了語音「不素啦，偶姓荒啦（方姓用戶因口音，將『方』念為『荒』）」，且若經過多輪的對話之後系統1仍然無法獲得精準姓氏，當多輪的對話的次數等於次數門檻值時，系統1將結束與客戶之間的多輪的對話。On the other hand, if the user responds with the voice message "Not Su La, my surname is Huang La (the user with the surname Fang pronounces 'Fang' as 'Huang' due to his accent)" at this time, and if after multiple rounds of dialogue, System 1 still cannot obtain the accurate surname, when the number of multiple rounds of dialogue is equal to the threshold value, System 1 will end the multiple rounds of dialogue with the customer.

本發明另提供了一種用於獲得精準姓氏的方法，其中所述方法可由圖1所示的系統1實施。所述方法包括以下步驟：The present invention further provides a method for obtaining an accurate surname, wherein the method can be implemented by the system 1 shown in FIG. 1 . The method comprises the following steps:

(a)當精準姓氏確認模組判斷文字不包括精準姓氏說法且不包括確認姓氏語句時，由精準姓氏確認模組從文字中擷取出姓氏。(a) When the precise surname confirmation module determines that the text does not include a precise surname statement and does not include a surname confirmation sentence, the precise surname confirmation module extracts the surname from the text.

(b)由相似音取得模組利用姓氏獲得相似音符文字序列。(b) The similar sound acquisition module uses the surname to obtain a similar phonetic character sequence.

(c)由TOP N姓氏生成模組利用相似音符文字序列獲得TOP N姓氏。(c) The TOP N surname generation module obtains the TOP N surnames using similar phonetic character sequences.

(d)由精準姓氏回問語句生成模組產生對應於TOP N姓氏的精準姓氏回問問句。(d) The precise surname answer sentence generation module generates precise surname answer question sentences corresponding to the TOP N surnames.

所述方法已於前述實施例說明，於此不再贅述。The method has been described in the above embodiments and will not be described again here.

綜上所述，本發明的用於獲得精準姓氏的系統及方法可藉由精準姓氏說法、相似音符文字序列以及精準姓氏回問問句來獲得客戶的精準姓氏。除此之外，還可藉由與客戶之間多輪的對話來提高獲得姓氏的準確度。基此，對於需要客戶的姓氏的對話服務系統來說，可以更有效地獲得客戶的精準姓氏。In summary, the system and method for obtaining an accurate surname of the present invention can obtain the accurate surname of a customer through accurate surname sayings, similar phonetic character sequences, and accurate surname answer sentences. In addition, the accuracy of the surname obtained can be improved through multiple rounds of dialogue with the customer. Based on this, for a dialogue service system that requires a customer's surname, the accurate surname of the customer can be obtained more effectively.

1:用於獲得精準姓氏的系統 10:語音辨別與文字轉換模組 20:精準姓氏確認模組 30:相似音取得模組 40:TOP N姓氏生成模組 50:精準姓氏回問語句生成模組 11:中文文字字庫 21:姓氏說法資料庫 41:百家姓資料庫 S100~S500、S101~S103、S201~S205、S301~S302、S401~S402、S501~S502:步驟 1: System for obtaining accurate surnames 10: Voice recognition and text conversion module 20: Accurate surname confirmation module 30: Similar sound acquisition module 40: TOP N surname generation module 50: Accurate surname question sentence generation module 11: Chinese character library 21: Surname saying database 41: Hundred Family Surname Database S100~S500, S101~S103, S201~S205, S301~S302, S401~S402, S501~S502: Steps

圖1是根據本發明的一實施例繪示的用於獲得精準姓氏的系統的示意圖。圖2是圖1所示的系統的運作流程圖。圖3是圖2所示的步驟S100的詳細流程圖。圖4是圖2所示的步驟S200的詳細流程圖。圖5是圖2所示的步驟S300的詳細流程圖。圖6是圖2所示的步驟S400的詳細流程圖。圖7是圖2所示的步驟S500的詳細流程圖。 FIG. 1 is a schematic diagram of a system for obtaining an accurate surname according to an embodiment of the present invention. FIG. 2 is an operation flow chart of the system shown in FIG. 1 . FIG. 3 is a detailed flow chart of step S100 shown in FIG. 2 . FIG. 4 is a detailed flow chart of step S200 shown in FIG. 2 . FIG. 5 is a detailed flow chart of step S300 shown in FIG. 2 . FIG. 6 is a detailed flow chart of step S400 shown in FIG. 2 . FIG. 7 is a detailed flow chart of step S500 shown in FIG. 2 .

S100~S500:步驟 S100~S500: Steps

Claims

A system for obtaining an accurate surname comprises: a storage medium storing a plurality of modules, wherein the plurality of modules comprises an accurate surname confirmation module, a similar sound acquisition module, a TOP N surname generation module and an accurate surname answer sentence generation module; and a processor coupled to the storage medium, wherein the processor accesses and executes the plurality of modules to perform the following steps: when the accurate surname confirmation module determines that a text does not include an accurate surname statement and does not include a surname confirmation sentence, the accurate surname confirmation module extracts a surname from the text; the similar sound acquisition module obtains a similar musical note text sequence using the surname; the TOP N surname generation module obtains a TOP N surname using the similar musical note text sequence; and the accurate surname answer sentence generation module generates a sentence corresponding to the TOP An accurate surname answer question sentence for the surname N, wherein the accurate surname answer question sentence includes a first surname and a second surname, and the first surname and the second surname have similar pronunciations.

A system as described in claim 1, wherein the storage medium further stores a surname statement database, wherein the precise surname confirmation module uses the surname statement database to determine whether the text includes the precise surname statement; When the precise surname confirmation module determines that the text includes the precise surname statement, the precise surname confirmation module ends the dialogue with the client device; when the precise surname confirmation module determines that the text does not include the precise surname statement, the precise surname confirmation module determines whether the text includes the confirmation surname sentence; when the precise surname confirmation module determines that the text includes the confirmation surname sentence, the precise surname confirmation module ends the dialogue with the client device.

A system as described in claim 1, wherein the similar sound acquisition module removes the Chinese tones in the surname to obtain the retained Zhuyin; the similar sound acquisition module uses the Zhuyin consonant vowel table and the retained Zhuyin to obtain the similar phonetic character sequence.

A system as described in claim 1, wherein the storage medium further stores a database of Hundred Family Surnames, wherein the TOP N Surname generation module uses the similar phonetic character sequence and the Hundred Family Surnames database to generate a possible surname sequence; the TOP N Surname generation module selects the TOP N surnames from the possible surname sequence.

The system as described in claim 1, wherein the storage medium further stores a surname statement database, wherein the precise surname answer sentence generation module uses the surname statement database to select the precise surname statement corresponding to the TOP N surnames; The precise surname answer sentence generation module uses the precise surname statement to generate the precise surname answer sentence corresponding to the TOP N surnames.

A system as described in claim 1, wherein the processor repeatedly executes each of the steps; when the number of times the processor repeatedly executes each of the steps is equal to a number threshold value, the processor stops executing each of the steps.

The system as described in claim 1 further includes a speech recognition and text conversion module, wherein the storage medium further stores a Chinese character library, wherein the speech recognition and text conversion module receives speech from a client device; the speech recognition and text conversion module performs a speech recognition operation on the speech to obtain recognized speech; the speech recognition and text conversion module uses the recognized speech and the Chinese character library to obtain the text corresponding to the speech.

A method for obtaining an accurate surname is suitable for being executed by a system including an accurate surname confirmation module, a similar sound acquisition module, a TOP N surname generation module and an accurate surname answer sentence generation module, wherein the method comprises the following steps: when the accurate surname confirmation module determines that a text does not include an accurate surname statement and does not include a surname confirmation sentence, the accurate surname confirmation module extracts a surname from the text; the similar sound acquisition module uses the surname to obtain a similar musical note character sequence; the TOP N surname generation module uses the similar musical note character sequence to obtain the TOP N surnames; and the accurate surname answer sentence generation module generates an accurate surname answer sentence corresponding to the TOP N surnames, wherein the accurate surname answer sentence includes a first surname and a second surname, and the first surname and the second surname have similar pronunciations.