TWI876580B - System and method for obtaining accurate surname - Google Patents

System and method for obtaining accurate surname

Info

Publication number
TWI876580B
Authority
TW
Taiwan
Prior art keywords
surname
accurate
precise
text
module
Prior art date
Application number
TW112138579A
Other languages
Chinese (zh)
Other versions
TW202516494A (en)
Inventor
游政儒
黃至德
Original Assignee
中華電信股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 中華電信股份有限公司
Priority to TW112138579A
Application granted
Publication of TWI876580B
Publication of TW202516494A

Landscapes

  • Machine Translation (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Document Processing Apparatus (AREA)

Abstract

A system and a method for obtaining an accurate surname are provided. The method includes the following steps: when an accurate surname confirmation module determines that a text includes neither an accurate surname statement nor a surname confirmation sentence, extracting, by the accurate surname confirmation module, a surname from the text; obtaining, by a similar sound acquisition module, a similar phonetic symbol sequence from the surname; obtaining, by a TOP N surname generation module, TOP N surnames from the similar phonetic symbol sequence; and generating, by an accurate surname query-back sentence generation module, an accurate surname query-back sentence corresponding to the TOP N surnames.

Description

System and method for obtaining accurate surname

The present invention relates to a system and method for obtaining an accurate surname.

When a customer connects to or calls an existing conversation service system and the system needs to confirm the customer's surname, existing text recognition technology often misrecognizes the surname because Chinese has many homophones, which leaves the customer with a poor impression.

The system for obtaining an accurate surname of the present invention includes a storage medium and a processor. The storage medium stores a plurality of modules, including an accurate surname confirmation module, a similar sound acquisition module, a TOP N surname generation module, and an accurate surname query-back sentence generation module. The processor is coupled to the storage medium and accesses and executes the modules to perform the following steps: when the accurate surname confirmation module determines that a text includes neither an accurate surname statement nor a surname confirmation sentence, the accurate surname confirmation module extracts a surname from the text; the similar sound acquisition module uses the surname to obtain a similar phonetic symbol sequence; the TOP N surname generation module uses the similar phonetic symbol sequence to obtain TOP N surnames; and the accurate surname query-back sentence generation module generates an accurate surname query-back sentence corresponding to the TOP N surnames.

The method for obtaining an accurate surname of the present invention includes the following steps: when the accurate surname confirmation module determines that a text includes neither an accurate surname statement nor a surname confirmation sentence, extracting, by the accurate surname confirmation module, a surname from the text; obtaining, by the similar sound acquisition module, a similar phonetic symbol sequence from the surname; obtaining, by the TOP N surname generation module, TOP N surnames from the similar phonetic symbol sequence; and generating, by the accurate surname query-back sentence generation module, an accurate surname query-back sentence corresponding to the TOP N surnames.

FIG. 1 is a schematic diagram of a system 1 for obtaining an accurate surname according to an embodiment of the present invention. The system 1 may include a storage medium and a processor.

The storage medium is, for example, any type of fixed or removable random access memory (RAM), read-only memory (ROM), flash memory, hard disk drive (HDD), solid state drive (SSD), a similar component, or a combination of the above, and stores a plurality of modules or applications executable by the processor. In this embodiment, the storage medium may store a plurality of modules, which may include an accurate surname confirmation module 20, a similar sound acquisition module 30, a TOP N surname generation module 40, and an accurate surname query-back sentence generation module 50. In other embodiments, the modules may further include a speech recognition and text conversion module 10. In other embodiments, the storage medium may further store a Chinese character library 11, a surname statement database 21, and a Hundred Family Surnames database 41.

The processor is, for example, a central processing unit (CPU), or another programmable general-purpose or special-purpose micro control unit (MCU), microprocessor, digital signal processor (DSP), programmable controller, application specific integrated circuit (ASIC), graphics processing unit (GPU), image signal processor (ISP), image processing unit (IPU), arithmetic logic unit (ALU), complex programmable logic device (CPLD), field programmable gate array (FPGA), a similar component, or a combination of the above. The processor may be coupled to the storage medium, and may access and execute the modules and applications stored in the storage medium.

FIG. 2 is an operation flowchart of the system 1 shown in FIG. 1. Please refer to FIG. 1 and FIG. 2 together.

In step S100, the speech recognition and text conversion module 10 may convert speech into text. This is further described below with reference to FIG. 3.

FIG. 3 is a detailed flowchart of step S100 shown in FIG. 2. Please refer to FIG. 1, FIG. 2 and FIG. 3 together.

In step S101, the speech recognition and text conversion module 10 may receive speech from a client device. Specifically, the system 1 shown in FIG. 1 may further include a transceiver (not shown in FIG. 1), and the speech recognition and text conversion module 10 may receive the speech from the client device (not shown in FIG. 1) through the transceiver. The client device is, for example, a telephone or a smart speaker, but the present invention is not limited thereto. For example, when a customer operates the client device to connect to the system 1, the system 1 may ask "請問客戶貴姓?" ("May I ask your surname?"). The following embodiment continues with the case in which the speech received from the client device is "我姓王" ("My surname is Wang").

In step S102, the speech recognition and text conversion module 10 may perform a speech recognition operation on the speech to obtain recognized speech. Specifically, the module may pre-process the speech. Customers are in different environments and at different distances from the microphone of the client device, and various noises affect the captured audio. The speech recognition and text conversion module 10 may therefore remove the ambient noise and identify the usable recognized speech (the human voice audio).

In step S103, the speech recognition and text conversion module 10 may use the recognized speech and the Chinese character library 11 to obtain the text corresponding to the speech. Specifically, the module may use the Chinese characters in the Chinese character library 11 to generate the text corresponding to the recognized speech. Continuing the above embodiment, the speech recognition and text conversion module 10 obtains the text "我姓王" ("My surname is Wang").

Returning to FIG. 2, in step S200, when the accurate surname confirmation module 20 determines that the text includes neither an accurate surname statement nor a surname confirmation sentence, the accurate surname confirmation module 20 may extract a surname from the text. This is further described below with reference to FIG. 4.

FIG. 4 is a detailed flowchart of step S200 shown in FIG. 2. Please refer to FIG. 1, FIG. 2 and FIG. 4 together.

In step S201, the accurate surname confirmation module 20 may use the surname statement database 21 to determine whether the text includes an accurate surname statement.

When the accurate surname confirmation module 20 determines that the text includes an accurate surname statement (the determination result of step S201 is "Yes"), the module may adopt the accurate surname statement in step S202. For example, if the text includes the accurate surname statement "我姓張,弓長張" ("My surname is Zhang, the Zhang written with 弓 (bow) and 長 (long)") or "我姓黃,草頭黃" ("My surname is Huang, the Huang with the grass radical on top"), the accurate surname confirmation module 20 may adopt that statement. The module may then execute step S203 to end the dialogue with the client device. In other words, the system 1 does not execute steps S300, S400 and S500 in this case.

When the accurate surname confirmation module 20 determines that the text does not include an accurate surname statement (the determination result of step S201 is "No"), the module may determine in step S204 whether the text includes a surname confirmation sentence. When the module determines that the text includes a surname confirmation sentence (the determination result of step S204 is "Yes"), it may end the dialogue with the customer in step S203. For example, suppose that after multiple rounds of dialogue with a particular customer the text includes a surname confirmation sentence (such as "對,沒錯", "Yes, that's right"). Although the text does not include an accurate surname statement, it does include a surname confirmation sentence, so the accurate surname confirmation module 20 may execute step S203 to end the dialogue with the client device. In other words, the system 1 does not execute steps S300, S400 and S500 in this case.

On the other hand, when the accurate surname confirmation module 20 determines that the text does not include a surname confirmation sentence (the determination result of step S204 is "No"), the module may extract a surname from the text in step S205. Continuing the above embodiment, because the text "我姓王" does not include a surname confirmation sentence, the accurate surname confirmation module 20 may execute step S205. In other words, the module may extract the candidate surname "王" (Wang) from the text "我姓王".
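The decision flow of steps S201 to S205 can be sketched as follows. This is a minimal illustration, not the implementation of the accurate surname confirmation module 20: the dictionary `SURNAME_STATEMENTS`, the tuple `CONFIRMATION_PHRASES`, the `Action` enum, the function `confirm_surname`, and the rule "take the character after 姓" are all hypothetical stand-ins, since the source does not specify how matching or extraction is performed.

```python
from enum import Enum
from typing import Optional, Tuple

# Hypothetical stand-in for the surname statement database 21:
# disambiguating phrase -> the surname it identifies.
SURNAME_STATEMENTS = {
    "弓長張": "張",
    "草頭黃": "黃",
    "三橫一豎王": "王",
}

# Hypothetical confirmation phrases; the source only gives "對,沒錯" as an example.
CONFIRMATION_PHRASES = ("沒錯", "是的")


class Action(Enum):
    ADOPT_AND_END = "adopt the accurate surname, then end the dialogue"  # S202, S203
    END = "end the dialogue"                                             # S203
    EXTRACT = "extract a candidate surname and continue"                 # S205


def confirm_surname(text: str) -> Tuple[Action, Optional[str]]:
    # S201: does the text contain an accurate surname statement?
    for phrase, surname in SURNAME_STATEMENTS.items():
        if phrase in text:
            return Action.ADOPT_AND_END, surname
    # S204: does the text contain a surname confirmation sentence?
    if any(phrase in text for phrase in CONFIRMATION_PHRASES):
        return Action.END, None
    # S205: naive extraction (an assumption): the character right after "姓".
    idx = text.find("姓")
    if idx != -1 and idx + 1 < len(text):
        return Action.EXTRACT, text[idx + 1]
    return Action.EXTRACT, None
```

With these assumed tables, the text "我姓王" falls through both checks and yields the candidate "王", which is then handed to steps S300 to S500.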

Returning to FIG. 2, in step S300, the similar sound acquisition module 30 may use the surname to obtain a similar phonetic symbol sequence. This is further described below with reference to FIG. 5.

FIG. 5 is a detailed flowchart of step S300 shown in FIG. 2. Please refer to FIG. 1, FIG. 2 and FIG. 5 together.

In step S301, the similar sound acquisition module 30 may remove the Chinese tone from the pronunciation of the surname to obtain the retained Zhuyin. The Chinese tones may be the first to fourth tones of Mandarin. Continuing the above embodiment, the similar sound acquisition module 30 removes the tone from the surname "王" to obtain the retained Zhuyin "ㄨㄤ".
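Step S301 can be sketched as a simple character filter. The lookup table `ZHUYIN_OF` and the function names are hypothetical; a real system would need a full pronunciation dictionary, which the source does not describe.

```python
# Zhuyin tone marks: second (ˊ), third (ˇ), fourth (ˋ) and neutral (˙).
# The first tone is usually unmarked, but "ˉ" is sometimes written.
TONE_MARKS = "ˊˇˋ˙ˉ"

# Hypothetical pronunciation lookup for a few surnames.
ZHUYIN_OF = {
    "王": "ㄨㄤˊ",
    "黃": "ㄏㄨㄤˊ",
    "杭": "ㄏㄤˊ",
    "方": "ㄈㄤ",
    "張": "ㄓㄤ",
}


def strip_tone(zhuyin: str) -> str:
    """Drop tone marks and keep only the Zhuyin letters."""
    return "".join(ch for ch in zhuyin if ch not in TONE_MARKS)


def retained_zhuyin(surname: str) -> str:
    """S301: look up the surname's pronunciation and remove its tone."""
    return strip_tone(ZHUYIN_OF[surname])
```

Dropping the tone makes "王" (ㄨㄤˊ) and "汪" (ㄨㄤ) compare as equal, which is what lets the later steps treat them as candidates for the same utterance.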

In step S302, the similar sound acquisition module 30 may use a Zhuyin consonant-vowel table (as shown in Table 1 below) and the retained Zhuyin to obtain the similar phonetic symbol sequence. Continuing the above embodiment, the similar sound acquisition module 30 uses the Zhuyin consonant-vowel table and the retained Zhuyin "ㄨㄤ" to obtain the similar phonetic symbol sequence "ㄨㄤ、ㄅㄤ、ㄈㄤ、ㄓㄤ、ㄔㄤ、ㄏㄨㄤ、ㄎㄨㄤ…".

Table 1. Example of a Zhuyin consonant-vowel table
Consonants: ㄅㄆㄇㄈㄉㄊㄋㄌㄍㄎㄏㄐㄑㄒ
Consonants that can stand alone: ㄓㄔㄕㄖㄗㄘㄙ
Vowels: ㄧㄨㄩㄚㄛㄜㄝㄞㄟㄠㄡㄢㄣㄤㄥㄦ
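The source does not state the exact rule by which Table 1 and the retained Zhuyin produce the similar phonetic symbol sequence. One plausible reading, sketched here purely as an assumption, keeps the vowel part of the syllable and varies the initial consonant, both for the full vowel part (ㄨㄤ) and for its last vowel alone (ㄤ). The function name `similar_zhuyin` is hypothetical.

```python
from typing import List

# The three rows of Table 1.
CONSONANTS = "ㄅㄆㄇㄈㄉㄊㄋㄌㄍㄎㄏㄐㄑㄒ"
STANDALONE_CONSONANTS = "ㄓㄔㄕㄖㄗㄘㄙ"
VOWELS = "ㄧㄨㄩㄚㄛㄜㄝㄞㄟㄠㄡㄢㄣㄤㄥㄦ"


def similar_zhuyin(retained: str) -> List[str]:
    """S302 (assumed rule): vary the initial while keeping the vowel part."""
    initials = CONSONANTS + STANDALONE_CONSONANTS
    # Vowel part of the syllable, e.g. "ㄨㄤ" for both "ㄨㄤ" and "ㄏㄨㄤ".
    final = "".join(ch for ch in retained if ch in VOWELS)
    if not final:
        # A bare standalone consonant such as "ㄓ" has nothing to vary here.
        return [retained]
    candidates = [retained]  # the original pronunciation always comes first
    # Try the last vowel alone, then the full vowel part; skip duplicates.
    for rime in dict.fromkeys([final[-1], final]):
        for initial in [""] + list(initials):
            candidate = initial + rime
            if candidate not in candidates:
                candidates.append(candidate)
    return candidates
```

This over-generates on purpose: combinations that are not valid Mandarin syllables are harmless because they simply match nothing when the sequence is later compared against the surname database.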

Returning to FIG. 2, in step S400, the TOP N surname generation module 40 may use the similar phonetic symbol sequence to obtain TOP N surnames. This is further described below with reference to FIG. 6.

FIG. 6 is a detailed flowchart of step S400 shown in FIG. 2. Please refer to FIG. 1, FIG. 2 and FIG. 6 together.

In step S401, the TOP N surname generation module 40 may use the similar phonetic symbol sequence and the Hundred Family Surnames database 41 to generate a possible surname sequence. Continuing the above embodiment, the TOP N surname generation module 40 compares the similar phonetic symbol sequence "ㄨㄤ、ㄅㄤ、ㄈㄤ、ㄓㄤ、ㄔㄤ、ㄏㄨㄤ、ㄎㄨㄤ…" with the Hundred Family Surnames database 41 to generate the possible surname sequence "汪、王、邦、方、房、張、常、黃、匡".

In step S402, the TOP N surname generation module 40 may select the TOP N surnames from the possible surname sequence. Continuing the above embodiment, the TOP N surname generation module 40 selects the TOP N surnames "王" (Wang) and "黃" (Huang) from the possible surname sequence "汪、王、邦、方、房、張、常、黃、匡".
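Steps S401 and S402 can be sketched as a lookup followed by a ranking. The source does not say how the TOP N surnames are ranked, so the `PREFERENCE` list below is an illustrative ordering chosen only to reproduce the example, not real statistics. The table `SURNAMES_BY_ZHUYIN` is a hypothetical excerpt standing in for the Hundred Family Surnames database 41.

```python
from typing import Dict, List

# Hypothetical excerpt of the Hundred Family Surnames database 41,
# keyed by toneless Zhuyin.
SURNAMES_BY_ZHUYIN: Dict[str, List[str]] = {
    "ㄨㄤ": ["汪", "王"],
    "ㄅㄤ": ["邦"],
    "ㄈㄤ": ["方", "房"],
    "ㄓㄤ": ["張"],
    "ㄔㄤ": ["常"],
    "ㄏㄨㄤ": ["黃"],
    "ㄎㄨㄤ": ["匡"],
}

# Illustrative preference order only (earlier means preferred); not real data.
PREFERENCE = ["王", "黃", "張", "方", "汪", "常", "房", "匡", "邦"]


def possible_surnames(similar: List[str]) -> List[str]:
    """S401: collect every surname whose toneless Zhuyin is in the sequence."""
    result: List[str] = []
    for zhuyin in similar:
        for surname in SURNAMES_BY_ZHUYIN.get(zhuyin, []):
            if surname not in result:
                result.append(surname)
    return result


def top_n(candidates: List[str], n: int = 2) -> List[str]:
    """S402: keep the n most preferred candidates (assumed ranking rule)."""
    rank = {surname: i for i, surname in enumerate(PREFERENCE)}
    # Surnames missing from PREFERENCE sort after all known ones.
    return sorted(candidates, key=lambda s: rank.get(s, len(rank)))[:n]
```

A production ranking might combine phonetic distance from the recognized syllable with surname frequency; the fixed list here only keeps the sketch deterministic.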

Returning to FIG. 2, in step S500, the accurate surname query-back sentence generation module 50 may generate an accurate surname query-back sentence corresponding to the TOP N surnames. This is further described below with reference to FIG. 7.

FIG. 7 is a detailed flowchart of step S500 shown in FIG. 2. Please refer to FIG. 1, FIG. 2 and FIG. 7 together.

In step S501, the accurate surname query-back sentence generation module 50 may use the surname statement database 21 to select the accurate surname statements corresponding to the TOP N surnames. Continuing the above embodiment, the module uses the surname statement database 21 to select the accurate surname statements "三橫一豎王" (the Wang written with three horizontal strokes and one vertical stroke) and "草頭黃" (the Huang with the grass radical on top) corresponding to the TOP N surnames "王" and "黃".

In step S502, the accurate surname query-back sentence generation module 50 may use the accurate surname statements to generate the accurate surname query-back sentence corresponding to the TOP N surnames. Continuing the above embodiment, the module uses the accurate surname statements "三橫一豎王" and "草頭黃" to generate the accurate surname query-back sentence "請問是三橫一豎王,還是草頭黃呢" ("Is it the Wang with three horizontal strokes and one vertical stroke, or the Huang with the grass radical on top?").
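Steps S501 and S502 amount to a lookup plus a sentence template. The sketch below assumes a hypothetical mapping `STATEMENT_OF` in place of the surname statement database 21 and a template inferred from the single example sentence in the text; the handling of one or of more than two surnames is an assumption.

```python
from typing import Dict, List

# Hypothetical excerpt of the surname statement database 21:
# surname -> the phrase that identifies it unambiguously.
STATEMENT_OF: Dict[str, str] = {
    "王": "三橫一豎王",
    "黃": "草頭黃",
    "張": "弓長張",
}


def query_back_sentence(top_surnames: List[str]) -> str:
    if not top_surnames:
        raise ValueError("at least one candidate surname is required")
    # S501: select the accurate surname statement for each candidate,
    # falling back to the bare surname when no statement is stored.
    phrases = [STATEMENT_OF.get(s, s) for s in top_surnames]
    # S502: fill the question template.
    if len(phrases) == 1:
        return "請問是" + phrases[0] + "嗎"
    return "請問是" + "、".join(phrases[:-1]) + ",還是" + phrases[-1] + "呢"
```

Asking with the disambiguating phrases rather than the bare surnames matters because the candidates are near-homophones; the customer can only tell them apart by how each character is written.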

After generating the accurate surname query-back sentence, the system 1 may transmit it to the client device through the transceiver (not shown in FIG. 1). Once the customer responds, the next round of dialogue begins (that is, the flow returns to step S100). If the customer replies with the speech "是三橫一豎王" ("It is the Wang with three horizontal strokes and one vertical stroke"), then in step S201 the accurate surname confirmation module 20 may determine that the text "是三橫一豎王" includes the accurate surname statement "三橫一豎王", and may execute step S202 and step S203 (ending the multi-round dialogue with the customer).

On the other hand, if the customer replies with the speech "都不是,是郎,郎中的郎" ("Neither. It is Lang, the Lang in 郎中"), then in step S201 the accurate surname confirmation module 20 may determine that the text "都不是,是郎,郎中的郎" includes a different accurate surname statement, and may execute step S202 and step S203 (ending the multi-round dialogue with the customer).

On the other hand, if the customer replies with the speech "都不是,我姓杭" ("Neither. My surname is Hang"), then in step S201 the accurate surname confirmation module 20 may determine that the text "都不是,我姓杭" does not include an accurate surname statement. Next, in step S204, the module may determine that the text does not include a surname confirmation sentence, and may execute step S205. The system 1 then continues with steps S300, S400 and S500, and after step S500 starts the next round of dialogue (that is, returns to step S100).

On the other hand, suppose the customer replies with the speech "不素啦,偶姓荒啦" (a customer surnamed 方 (Fang) who, because of an accent, pronounces "方" as "荒" (huang), so the reply sounds like "No, my surname is Huang"). If the system 1 still cannot obtain the accurate surname after multiple rounds of dialogue, the system 1 ends the multi-round dialogue with the customer when the number of rounds equals a count threshold.
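The multi-round behaviour, including the count threshold, can be sketched as a bounded loop. The names `run_dialogue`, `ask`, `handle_turn` and the default `max_rounds=3` are hypothetical; the source does not give a threshold value. `ask` stands for one round of S100 (send a question, receive recognized text) and `handle_turn` stands for S200 to S500.

```python
from typing import Callable, Optional, Tuple

# A turn handler returns (finished, surname, next_question).
TurnResult = Tuple[bool, Optional[str], Optional[str]]


def run_dialogue(
    ask: Callable[[str], str],
    handle_turn: Callable[[str], TurnResult],
    max_rounds: int = 3,  # hypothetical count threshold
    opening: str = "請問客戶貴姓?",
) -> Optional[str]:
    question = opening
    for _ in range(max_rounds):
        reply = ask(question)  # S100: one round of dialogue
        finished, surname, next_question = handle_turn(reply)  # S200 to S500
        if finished:
            # S203: dialogue ended; surname may be None if only confirmed.
            return surname
        question = next_question or opening
    # Threshold reached without an accurate surname: stop asking.
    return None
```

A caller might treat a `None` result as a cue to fall back to another channel, for example a human agent, although the source does not describe what happens after the threshold is reached.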

The present invention further provides a method for obtaining an accurate surname, which may be implemented by the system 1 shown in FIG. 1. The method includes the following steps:

(a) When the accurate surname confirmation module determines that the text includes neither an accurate surname statement nor a surname confirmation sentence, the accurate surname confirmation module extracts a surname from the text.

(b) The similar sound acquisition module uses the surname to obtain a similar phonetic symbol sequence.

(c) The TOP N surname generation module uses the similar phonetic symbol sequence to obtain the TOP N surnames.

(d) The accurate surname query-back sentence generation module generates an accurate surname query-back sentence corresponding to the TOP N surnames.

The method has been described in the foregoing embodiments and is not repeated here.

In summary, the system and method for obtaining an accurate surname of the present invention can obtain a customer's accurate surname by means of accurate surname statements, similar phonetic symbol sequences, and accurate surname query-back sentences. In addition, multiple rounds of dialogue with the customer can improve the accuracy of the obtained surname. Accordingly, a conversation service system that needs a customer's surname can obtain the accurate surname more effectively.

1: system for obtaining an accurate surname
10: speech recognition and text conversion module
20: accurate surname confirmation module
30: similar sound acquisition module
40: TOP N surname generation module
50: accurate surname query-back sentence generation module
11: Chinese character library
21: surname statement database
41: Hundred Family Surnames database
S100~S500, S101~S103, S201~S205, S301~S302, S401~S402, S501~S502: steps

FIG. 1 is a schematic diagram of a system for obtaining an accurate surname according to an embodiment of the present invention.
FIG. 2 is an operation flowchart of the system shown in FIG. 1.
FIG. 3 is a detailed flowchart of step S100 shown in FIG. 2.
FIG. 4 is a detailed flowchart of step S200 shown in FIG. 2.
FIG. 5 is a detailed flowchart of step S300 shown in FIG. 2.
FIG. 6 is a detailed flowchart of step S400 shown in FIG. 2.
FIG. 7 is a detailed flowchart of step S500 shown in FIG. 2.

S100~S500: steps

Claims (8)

1. A system for obtaining an accurate surname, comprising: a storage medium storing a plurality of modules, wherein the plurality of modules comprise an accurate surname confirmation module, a similar sound acquisition module, a TOP N surname generation module, and an accurate surname query-back sentence generation module; and a processor coupled to the storage medium, wherein the processor accesses and executes the plurality of modules to perform the following steps: when the accurate surname confirmation module determines that a text includes neither an accurate surname statement nor a surname confirmation sentence, the accurate surname confirmation module extracts a surname from the text; the similar sound acquisition module uses the surname to obtain a similar phonetic symbol sequence; the TOP N surname generation module uses the similar phonetic symbol sequence to obtain TOP N surnames; and the accurate surname query-back sentence generation module generates an accurate surname query-back sentence corresponding to the TOP N surnames, wherein the accurate surname query-back sentence includes a first surname and a second surname, and the first surname and the second surname have similar pronunciations.
2. The system of claim 1, wherein the storage medium further stores a surname statement database, wherein the accurate surname confirmation module uses the surname statement database to determine whether the text includes the accurate surname statement; when the accurate surname confirmation module determines that the text includes the accurate surname statement, the accurate surname confirmation module ends a dialogue with a client device; when the accurate surname confirmation module determines that the text does not include the accurate surname statement, the accurate surname confirmation module determines whether the text includes the surname confirmation sentence; and when the accurate surname confirmation module determines that the text includes the surname confirmation sentence, the accurate surname confirmation module ends the dialogue with the client device.

3. The system of claim 1, wherein the similar sound acquisition module removes the Chinese tone from the surname to obtain a retained Zhuyin, and the similar sound acquisition module uses a Zhuyin consonant-vowel table and the retained Zhuyin to obtain the similar phonetic symbol sequence.

4. The system of claim 1, wherein the storage medium further stores a Hundred Family Surnames database, wherein the TOP N surname generation module uses the similar phonetic symbol sequence and the Hundred Family Surnames database to generate a possible surname sequence, and the TOP N surname generation module selects the TOP N surnames from the possible surname sequence.
5. The system of claim 1, wherein the storage medium further stores a surname statement database, wherein the accurate surname query-back sentence generation module uses the surname statement database to select the accurate surname statements corresponding to the TOP N surnames, and the accurate surname query-back sentence generation module uses the accurate surname statements to generate the accurate surname query-back sentence corresponding to the TOP N surnames.

6. The system of claim 1, wherein the processor repeatedly executes the steps, and when the number of times the processor has repeated the steps equals a count threshold, the processor stops executing the steps.

7. The system of claim 1, further comprising a speech recognition and text conversion module, wherein the storage medium further stores a Chinese character library, wherein the speech recognition and text conversion module receives speech from a client device; the speech recognition and text conversion module performs a speech recognition operation on the speech to obtain recognized speech; and the speech recognition and text conversion module uses the recognized speech and the Chinese character library to obtain the text corresponding to the speech.
8. A method for obtaining an accurate surname, adapted to be executed by a system comprising an accurate surname confirmation module, a similar sound acquisition module, a TOP N surname generation module, and an accurate surname query-back sentence generation module, wherein the method comprises the following steps: when the accurate surname confirmation module determines that a text includes neither an accurate surname statement nor a surname confirmation sentence, extracting, by the accurate surname confirmation module, a surname from the text; obtaining, by the similar sound acquisition module, a similar phonetic symbol sequence using the surname; obtaining, by the TOP N surname generation module, TOP N surnames using the similar phonetic symbol sequence; and generating, by the accurate surname query-back sentence generation module, an accurate surname query-back sentence corresponding to the TOP N surnames, wherein the accurate surname query-back sentence includes a first surname and a second surname, and the first surname and the second surname have similar pronunciations.
TW112138579A 2023-10-06 2023-10-06 System and method for obtaining accurate surname TWI876580B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
TW112138579A TWI876580B (en) 2023-10-06 2023-10-06 System and method for obtaining accurate surname

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
TW112138579A TWI876580B (en) 2023-10-06 2023-10-06 System and method for obtaining accurate surname

Publications (2)

Publication Number Publication Date
TWI876580B true TWI876580B (en) 2025-03-11
TW202516494A TW202516494A (en) 2025-04-16

Family

ID=95830252

Family Applications (1)

Application Number Title Priority Date Filing Date
TW112138579A TWI876580B (en) 2023-10-06 2023-10-06 System and method for obtaining accurate surname

Country Status (1)

Country Link
TW (1) TWI876580B (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TW200937214A (en) * 2008-01-04 2009-09-01 Yahoo Inc Identifying and employing social network relationships
CN101542591A (en) * 2006-09-25 2009-09-23 维里逊商务网络服务有限公司 Method and system for providing speech recognition
US20170011024A1 (en) * 2014-02-04 2017-01-12 Nigel Henry CANNINGS System and method for contextualising a stream of unstructured text representative of spoken word
CN110678884A (en) * 2016-09-23 2020-01-10 欧佩科技股份公司 System and method for customizable pre-dispatch monotony for transportation services

Also Published As

Publication number Publication date
TW202516494A (en) 2025-04-16

Similar Documents

Publication Publication Date Title
US11069336B2 (en) Systems and methods for name pronunciation
US20130110511A1 (en) System, Method and Program for Customized Voice Communication
CN107871502A (en) Voice dialogue system and voice dialogue method
US12424223B2 (en) Voice-controlled communication requests and responses
JPH096390A (en) Speech recognition dialogue processing method and speech recognition dialogue apparatus
JP5196199B2 (en) Keyword display system, keyword display method, and program
JP2004101901A (en) Speech interaction system and speech interaction program
JP2020056972A (en) Language identification program, language identification method and language identification device
JP2021033260A (en) Learning method, speaker identification method, and program
US20200320976A1 (en) Information processing apparatus, information processing method, and program
JP2018197924A (en) Information processing apparatus, dialogue processing method, and dialogue processing program
WO2014203328A1 (en) Voice data search system, voice data search method, and computer-readable storage medium
JP4859125B2 (en) Pronunciation rating device and program
JP6254504B2 (en) Search server and search method
TWI876580B (en) System and method for obtaining accurate surname
JP2007328283A (en) Interaction system, program and interactive method
JP6721676B2 (en) Interactive device, interactive device control method, and control program
JP2002132291A (en) Natural language interaction processor and method for the same as well as memory medium for the same
JP2020184183A (en) Information processing device, information processing system, and method for controlling information processing device
JP5877823B2 (en) Speech recognition apparatus, speech recognition method, and program
JP4445371B2 (en) Recognition vocabulary registration apparatus, speech recognition apparatus and method
JP7287459B2 (en) Speech imaging device, speech imaging method and program
JP2018128576A (en) Business determination device, business determination method and program
WO2018043139A1 (en) Information processing device, information processing method, and program
JP2002258889A (en) Dictionary-editable speech recognition device