TWI876580B - System and method for obtaining accurate surname - Google Patents
System and method for obtaining accurate surname
- Publication number
- TWI876580B (application TW112138579A)
- Authority
- TW
- Taiwan
- Prior art keywords
- surname
- accurate
- precise
- text
- module
- Prior art date
Landscapes
- Machine Translation (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Document Processing Apparatus (AREA)
Abstract
Description
The present invention relates to a system and method for obtaining an accurate surname.
When a customer connects to or calls an existing conversation service system and the system needs to confirm the customer's surname, existing text recognition techniques often pick the wrong character because Chinese has many homophones, which leaves the customer with a poor impression.
The system for obtaining an accurate surname of the present invention includes a storage medium and a processor. The storage medium stores a plurality of modules, including an accurate surname confirmation module, a similar sound acquisition module, a TOP N surname generation module, and an accurate surname follow-up question generation module. The processor is coupled to the storage medium and accesses and executes the modules to perform the following steps: when the accurate surname confirmation module determines that a text includes neither an accurate surname description nor a surname confirmation statement, the accurate surname confirmation module extracts a surname from the text; the similar sound acquisition module uses the surname to obtain a similar phonetic symbol sequence; the TOP N surname generation module uses the similar phonetic symbol sequence to obtain TOP N surnames; and the accurate surname follow-up question generation module generates an accurate surname follow-up question corresponding to the TOP N surnames.
The method for obtaining an accurate surname of the present invention includes the following steps: when the accurate surname confirmation module determines that a text includes neither an accurate surname description nor a surname confirmation statement, the accurate surname confirmation module extracts a surname from the text; the similar sound acquisition module uses the surname to obtain a similar phonetic symbol sequence; the TOP N surname generation module uses the similar phonetic symbol sequence to obtain TOP N surnames; and the accurate surname follow-up question generation module generates an accurate surname follow-up question corresponding to the TOP N surnames.
FIG. 1 is a schematic diagram of a system 1 for obtaining an accurate surname according to an embodiment of the present invention. The system 1 may include a storage medium and a processor.
The storage medium is, for example, any type of fixed or removable random access memory (RAM), read-only memory (ROM), flash memory, hard disk drive (HDD), solid state drive (SSD), a similar component, or a combination of these, and stores a plurality of modules or applications executable by the processor. In this embodiment, the storage medium may store a plurality of modules, which may include an accurate surname confirmation module 20, a similar sound acquisition module 30, a TOP N surname generation module 40, and an accurate surname follow-up question generation module 50. In other embodiments, the modules may further include a speech recognition and text conversion module 10. In other embodiments, the storage medium may further store a Chinese character library 11, a surname description database 21, and a Hundred Family Surnames database 41.
The processor is, for example, a central processing unit (CPU), or another programmable general-purpose or special-purpose micro control unit (MCU), microprocessor, digital signal processor (DSP), programmable controller, application specific integrated circuit (ASIC), graphics processing unit (GPU), image signal processor (ISP), image processing unit (IPU), arithmetic logic unit (ALU), complex programmable logic device (CPLD), field programmable gate array (FPGA), a similar component, or a combination of these. The processor may be coupled to the storage medium and may access and execute the modules and applications stored in it.
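FIG. 2 is a flowchart of the operation of the system 1 shown in FIG. 1. Please refer to FIG. 1 and FIG. 2 together.
In step S100, the speech recognition and text conversion module 10 may convert speech into text. This is described further below with reference to FIG. 3.
FIG. 3 is a detailed flowchart of step S100 shown in FIG. 2. Please refer to FIG. 1, FIG. 2 and FIG. 3 together.
In step S101, the speech recognition and text conversion module 10 may receive speech from a customer device. Specifically, the system 1 shown in FIG. 1 may further include a transceiver (not shown in FIG. 1), through which the speech recognition and text conversion module 10 receives speech from the customer device (not shown in FIG. 1). The customer device is, for example, a telephone or a smart speaker, but the invention is not limited thereto. For example, when a customer operates the customer device to connect to the system 1, the system 1 may ask 「請問客戶貴姓?」 ("May I ask your surname?"). The following embodiments continue with the example in which the speech received from the customer device is 「我姓王」 ("My surname is Wang").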
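In step S102, the speech recognition and text conversion module 10 may perform a speech recognition operation on the speech to obtain recognized speech. Specifically, the module may first preprocess the speech. Customers are in different environments and at different distances from the microphone of the customer device, and various kinds of noise affect the captured audio. The speech recognition and text conversion module 10 may therefore remove the environmental noise and identify the usable recognized speech (the human voice audio).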
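The source does not say how the environmental noise is removed. As a minimal sketch of one common preprocessing idea (a frame-energy gate, which is an assumption and not necessarily the technique used in step S102), the following keeps only frames whose mean energy reaches a threshold. The function name `voiced_frames`, the frame size, and the threshold are all hypothetical.

```python
from typing import List, Sequence


def voiced_frames(
    samples: Sequence[float],
    frame_size: int = 4,      # hypothetical frame length, in samples
    threshold: float = 0.01,  # hypothetical mean-energy threshold
) -> List[List[float]]:
    """Split the signal into frames and keep those loud enough to be voice."""
    frames = [
        list(samples[i:i + frame_size])
        for i in range(0, len(samples), frame_size)
    ]
    kept = []
    for frame in frames:
        # Mean squared amplitude is a simple proxy for frame energy.
        energy = sum(x * x for x in frame) / len(frame)
        if energy >= threshold:
            kept.append(frame)
    return kept


# Silence, then a loud burst, then faint background noise.
signal = [0.0] * 4 + [0.5, -0.5, 0.5, -0.5] + [0.001] * 4
print(voiced_frames(signal))
```

A production system would more likely rely on a dedicated voice activity detector; the gate above only illustrates the idea of discarding low-energy audio before recognition.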
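In step S103, the speech recognition and text conversion module 10 may use the recognized speech and the Chinese character library 11 to obtain the text corresponding to the speech. Specifically, the module may use the Chinese characters in the Chinese character library 11 to produce text corresponding to the recognized speech. Continuing the example, the speech recognition and text conversion module 10 obtains the text 「我姓王」.
Please return to FIG. 2. In step S200, when the accurate surname confirmation module 20 determines that the text includes neither an accurate surname description nor a surname confirmation statement, the accurate surname confirmation module 20 may extract a surname from the text. This is described further below with reference to FIG. 4.
FIG. 4 is a detailed flowchart of step S200 shown in FIG. 2. Please refer to FIG. 1, FIG. 2 and FIG. 4 together.
In step S201, the accurate surname confirmation module 20 may use the surname description database 21 to determine whether the text includes an accurate surname description.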
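When the accurate surname confirmation module 20 determines that the text includes an accurate surname description (the result of step S201 is "Yes"), the module may adopt that description in step S202. For example, if the text includes the accurate surname description 「我姓張,弓長張」 ("My surname is Zhang, the Zhang written with 弓 and 長") or 「我姓黃,草頭黃」 ("My surname is Huang, the Huang with the grass radical on top"), the accurate surname confirmation module 20 may adopt it. The module may then execute step S203 to end the dialogue with the customer device. In other words, the system 1 does not execute steps S300, S400 and S500 in this case.
When the accurate surname confirmation module 20 determines that the text does not include an accurate surname description, the module may determine in step S204 whether the text includes a surname confirmation statement. Specifically, when the module determines that the text includes a surname confirmation statement (the result of step S204 is "Yes"), it may end the dialogue with the customer in step S203. For example, suppose that after several rounds of dialogue with a particular customer the text includes a surname confirmation statement (for example, 「對,沒錯」, "Yes, that's right"). Because the text includes a surname confirmation statement even though it does not include an accurate surname description, the accurate surname confirmation module 20 may execute step S203 to end the dialogue with the customer device. In other words, the system 1 does not execute steps S300, S400 and S500 in this case.
On the other hand, when the accurate surname confirmation module 20 determines that the text does not include a surname confirmation statement (the result of step S204 is "No"), the module may extract a surname from the text in step S205. Continuing the example, because the text 「我姓王」 does not include a surname confirmation statement, the accurate surname confirmation module 20 may execute step S205. In other words, the module extracts the possible surname 「王」 from the text 「我姓王」.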
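The branching of steps S201 to S205 can be sketched as a small decision function. This is only an illustration under stated assumptions: the dictionary `SURNAME_DESCRIPTIONS` is a hypothetical stand-in for the surname description database 21, `CONFIRMATION_PHRASES` is a hypothetical list of confirmation statements, and taking the character after 「姓」 is a deliberately naive extraction rule that the source does not specify.

```python
from enum import Enum
from typing import Optional, Tuple

# Hypothetical excerpt of the surname description database 21:
# description phrase -> the surname character it pins down.
SURNAME_DESCRIPTIONS = {
    "弓長張": "張",
    "草頭黃": "黃",
    "三橫一豎王": "王",
}

# Hypothetical surname confirmation statements.
CONFIRMATION_PHRASES = ("對", "沒錯")


class Action(Enum):
    ADOPT = "adopt"      # S202, then S203: adopt the description and end
    END = "end"          # S203: the customer confirmed, end the dialogue
    EXTRACT = "extract"  # S205: continue with S300, S400 and S500


def confirm_surname(text: str) -> Tuple[Action, Optional[str]]:
    # S201: does the text contain an accurate surname description?
    for description, surname in SURNAME_DESCRIPTIONS.items():
        if description in text:
            return Action.ADOPT, surname
    # S204: does the text contain a surname confirmation statement?
    if any(phrase in text for phrase in CONFIRMATION_PHRASES):
        return Action.END, None
    # S205: naive extraction, assuming the surname follows the character 姓.
    index = text.find("姓")
    if index != -1 and index + 1 < len(text):
        return Action.EXTRACT, text[index + 1]
    return Action.EXTRACT, None


print(confirm_surname("我姓王"))
print(confirm_surname("我姓張,弓長張"))
print(confirm_surname("對,沒錯"))
```

Simple substring matching is used here for brevity; it would misread a reply such as 「不對」 as a confirmation, so a real system would need a more careful matcher.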
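Please return to FIG. 2. In step S300, the similar sound acquisition module 30 may use the surname to obtain a similar phonetic symbol sequence. This is described further below with reference to FIG. 5.
FIG. 5 is a detailed flowchart of step S300 shown in FIG. 2. Please refer to FIG. 1, FIG. 2 and FIG. 5 together.
In step S301, the similar sound acquisition module 30 may remove the Chinese tone from the surname to obtain the retained zhuyin. The Chinese tone may be any of the first, second, third and fourth tones of Mandarin. Continuing the example, the similar sound acquisition module 30 removes the tone from the surname 「王」 to obtain the retained zhuyin 「ㄨㄤ」.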
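Step S301 can be illustrated with a short sketch that strips zhuyin tone marks from a reading. The lookup table `ZHUYIN_READINGS` is a hypothetical three-entry stand-in (the source does not say where readings come from), and the function names are illustrative. Tone marks are written as Unicode escapes so that the code does not depend on how the marks are rendered.

```python
# Zhuyin tone marks: first tone (usually unwritten), second, third,
# fourth, and the neutral tone.
TONE_MARKS = "\u02c9\u02ca\u02c7\u02cb\u02d9"

# Hypothetical reading table: surname -> zhuyin with tone mark.
ZHUYIN_READINGS = {
    "王": "ㄨㄤ\u02ca",    # second tone
    "黃": "ㄏㄨㄤ\u02ca",  # second tone
    "方": "ㄈㄤ",          # first tone, no mark written
}


def strip_tone(zhuyin: str) -> str:
    """Drop every tone mark and keep the remaining zhuyin symbols."""
    return "".join(ch for ch in zhuyin if ch not in TONE_MARKS)


def retained_zhuyin(surname: str) -> str:
    """S301: look up the reading and remove its tone."""
    return strip_tone(ZHUYIN_READINGS[surname])


print(retained_zhuyin("王"))
```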
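In step S302, the similar sound acquisition module 30 may use a zhuyin consonant and vowel table (as shown in Table 1 below) together with the retained zhuyin to obtain the similar phonetic symbol sequence. Continuing the example, the similar sound acquisition module 30 uses the zhuyin consonant and vowel table and the retained zhuyin 「ㄨㄤ」 to obtain the similar phonetic symbol sequence 「ㄨㄤ、ㄅㄤ、ㄈㄤ、ㄓㄤ、ㄔㄤ、ㄏㄨㄤ、ㄎㄨㄤ…」.
Table 1: Example of the zhuyin consonant and vowel table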
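The contents of Table 1 are not reproduced in this text, so the exact rule for generating similar sounds is unknown. The sketch below shows one plausible reading, offered purely as an assumption: keep the vowel part of the retained zhuyin and try a set of substitute initial consonants, both with and without the medial. `SUBSTITUTE_INITIALS` is a hypothetical excerpt, not the patent's table, and the function name is illustrative.

```python
from typing import List

# The 21 zhuyin initial consonants, used to detect a leading initial.
INITIALS = "ㄅㄆㄇㄈㄉㄊㄋㄌㄍㄎㄏㄐㄑㄒㄓㄔㄕㄖㄗㄘㄙ"
# The three zhuyin medials.
MEDIALS = "ㄧㄨㄩ"
# Hypothetical excerpt of initials to try as substitutes.
SUBSTITUTE_INITIALS = ["ㄅ", "ㄈ", "ㄓ", "ㄔ", "ㄏ", "ㄎ"]


def similar_sequence(toneless: str) -> List[str]:
    """S302 sketch: vary the initial while keeping the vowel part."""
    # Body: the syllable without its initial consonant, if it has one.
    if toneless and toneless[0] in INITIALS:
        body = toneless[1:]
    else:
        body = toneless
    # Rime: the body without a leading medial (fall back to the body
    # when the syllable consists of a medial only).
    rime = body.lstrip(MEDIALS) or body

    candidates = [toneless]
    if body and body not in candidates:
        candidates.append(body)
    for initial in SUBSTITUTE_INITIALS:
        for candidate in (initial + rime, initial + body):
            if candidate not in candidates:
                candidates.append(candidate)
    return candidates


print(similar_sequence("ㄨㄤ"))
```

The result contains every entry of the example sequence in the text, plus some combinations that are not real Mandarin syllables; those are harmless under this design because they simply match nothing when the sequence is compared with the surname database in step S401.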
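Please return to FIG. 2. In step S400, the TOP N surname generation module 40 may use the similar phonetic symbol sequence to obtain the TOP N surnames. This is described further below with reference to FIG. 6.
FIG. 6 is a detailed flowchart of step S400 shown in FIG. 2. Please refer to FIG. 1, FIG. 2 and FIG. 6 together.
In step S401, the TOP N surname generation module 40 may use the similar phonetic symbol sequence and the Hundred Family Surnames database 41 to produce a possible surname sequence. Continuing the example, the TOP N surname generation module 40 compares the similar phonetic symbol sequence 「ㄨㄤ、ㄅㄤ、ㄈㄤ、ㄓㄤ、ㄔㄤ、ㄏㄨㄤ、ㄎㄨㄤ…」 with the Hundred Family Surnames database 41 to produce the possible surname sequence 「汪、王、邦、方、房、張、常、黃、匡」.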
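Step S401 amounts to a lookup of each phonetic candidate in a surname table. In the sketch below, `SURNAME_DB` is a hypothetical excerpt standing in for the Hundred Family Surnames database 41, keyed by toneless zhuyin; the real database layout is not described in the source.

```python
from typing import Dict, List

# Hypothetical excerpt of database 41: toneless zhuyin -> surnames.
SURNAME_DB: Dict[str, List[str]] = {
    "ㄨㄤ": ["汪", "王"],
    "ㄅㄤ": ["邦"],
    "ㄈㄤ": ["方", "房"],
    "ㄓㄤ": ["張"],
    "ㄔㄤ": ["常"],
    "ㄏㄨㄤ": ["黃"],
    "ㄎㄨㄤ": ["匡"],
}


def possible_surnames(sequence: List[str]) -> List[str]:
    """S401 sketch: collect surnames for each candidate, in order."""
    result: List[str] = []
    for zhuyin in sequence:
        # Candidates with no matching surname are silently skipped.
        for surname in SURNAME_DB.get(zhuyin, []):
            if surname not in result:
                result.append(surname)
    return result


sequence = ["ㄨㄤ", "ㄅㄤ", "ㄈㄤ", "ㄓㄤ", "ㄔㄤ", "ㄏㄨㄤ", "ㄎㄨㄤ"]
print(possible_surnames(sequence))
```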
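In step S402, the TOP N surname generation module 40 may select the TOP N surnames from the possible surname sequence. Continuing the example, the TOP N surname generation module 40 selects the TOP N surnames 「王」 and 「黃」 from the possible surname sequence 「汪、王、邦、方、房、張、常、黃、匡」.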
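The source does not state the criterion by which the TOP N surnames are ranked. The sketch below therefore takes the ranking scores as an input. The values in `ILLUSTRATIVE_SCORES` are made-up numbers chosen only so that the example reproduces the result in the text; they are not real statistics, and the function name is hypothetical.

```python
from typing import Dict, List

# Made-up scores for illustration only (not real frequency data).
ILLUSTRATIVE_SCORES: Dict[str, float] = {
    "王": 1.0, "黃": 0.9, "張": 0.5, "汪": 0.4, "方": 0.3,
    "常": 0.2, "房": 0.2, "邦": 0.1, "匡": 0.1,
}


def top_n_surnames(
    candidates: List[str], scores: Dict[str, float], n: int = 2
) -> List[str]:
    """S402 sketch: keep the n highest-scoring candidates."""
    # sorted() is stable, so ties keep their original candidate order.
    ranked = sorted(candidates, key=lambda s: scores.get(s, 0.0), reverse=True)
    return ranked[:n]


candidates = ["汪", "王", "邦", "方", "房", "張", "常", "黃", "匡"]
print(top_n_surnames(candidates, ILLUSTRATIVE_SCORES, n=2))
```

Passing the scores in as a parameter keeps the selection step independent of whatever signal a real deployment would use, such as surname frequency or phonetic closeness to the recognized sound.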
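Please return to FIG. 2. In step S500, the accurate surname follow-up question generation module 50 may generate an accurate surname follow-up question corresponding to the TOP N surnames. This is described further below with reference to FIG. 7.
FIG. 7 is a detailed flowchart of step S500 shown in FIG. 2. Please refer to FIG. 1, FIG. 2 and FIG. 7 together.
In step S501, the accurate surname follow-up question generation module 50 may use the surname description database 21 to select the accurate surname descriptions corresponding to the TOP N surnames. Continuing the example, the module uses the surname description database 21 to select the accurate surname descriptions 「三橫一豎王」 ("the Wang written with three horizontal strokes and one vertical stroke") and 「草頭黃」 ("the Huang with the grass radical on top") for the TOP N surnames 「王」 and 「黃」.
In step S502, the accurate surname follow-up question generation module 50 may use the accurate surname descriptions to generate the accurate surname follow-up question corresponding to the TOP N surnames. Continuing the example, the module uses the accurate surname descriptions 「三橫一豎王」 and 「草頭黃」 to generate the accurate surname follow-up question 「請問是三橫一豎王,還是草頭黃呢」 ("Is it the Wang with three horizontal strokes and one vertical stroke, or the Huang with the grass radical on top?").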
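Steps S501 and S502 can be sketched as a lookup followed by string assembly. The dictionary `DESCRIPTIONS` is a hypothetical excerpt of the surname description database 21, and the single-candidate wording (「請問是…嗎」) is an assumption, since the source only shows the two-candidate form.

```python
from typing import List, Optional

# Hypothetical excerpt of database 21: surname -> description phrase.
DESCRIPTIONS = {
    "王": "三橫一豎王",
    "黃": "草頭黃",
    "張": "弓長張",
}


def build_follow_up_question(top_surnames: List[str]) -> Optional[str]:
    # S501: select the description for each TOP N surname that has one.
    phrases = [DESCRIPTIONS[s] for s in top_surnames if s in DESCRIPTIONS]
    if not phrases:
        return None
    # S502: assemble the question from the selected descriptions.
    if len(phrases) == 1:
        return "請問是" + phrases[0] + "嗎"  # assumed single-choice wording
    return "請問是" + ",還是".join(phrases) + "呢"


print(build_follow_up_question(["王", "黃"]))
```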
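After the accurate surname follow-up question is generated, the system 1 may send it to the customer device through the transceiver (not shown in FIG. 1). Once the customer responds, the next round of dialogue begins (that is, the flow returns to step S100). If the user replies with the speech 「是三橫一豎王」 ("It is the Wang with three horizontal strokes and one vertical stroke"), then in step S201 the accurate surname confirmation module 20 may determine that the text 「是三橫一豎王」 includes the accurate surname description 「三橫一豎王」, and may execute steps S202 and S203 (ending the multi-round dialogue with the customer).
On the other hand, if the user instead replies with the speech 「都不是,是郎,郎中的郎」 ("Neither, it is Lang, the Lang in 郎中"), then in step S201 the accurate surname confirmation module 20 may determine that the text 「都不是,是郎,郎中的郎」 includes a different accurate surname description, and may execute steps S202 and S203 (ending the multi-round dialogue with the customer).
On the other hand, if the user instead replies with the speech 「都不是,我姓杭」 ("Neither, my surname is Hang"), then in step S201 the accurate surname confirmation module 20 may determine that the text 「都不是,我姓杭」 does not include an accurate surname description. The module may then determine in step S204 that the text does not include a surname confirmation statement, and may execute step S205. The system 1 then continues with steps S300, S400 and S500, and after step S500 starts the next round of dialogue (that is, returns to step S100).
On the other hand, suppose the user replies with the speech 「不素啦,偶姓荒啦」 (a user surnamed 方 (Fang) who, because of an accent, pronounces 「方」 as 「荒」 (Huang)). If the system 1 still cannot obtain the accurate surname after multiple rounds of dialogue, the system 1 ends the multi-round dialogue with the customer when the number of rounds equals a round-count threshold.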
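The multi-round behaviour with a round-count threshold can be sketched as a bounded loop. The names `run_dialogue`, `resolve` and `max_rounds` are hypothetical, and `resolve` stands in for one full pass through steps S100 to S500 that returns a surname only when it has been pinned down.

```python
from typing import Callable, Iterable, Optional


def run_dialogue(
    replies: Iterable[str],
    resolve: Callable[[str], Optional[str]],
    max_rounds: int = 3,  # hypothetical round-count threshold
) -> Optional[str]:
    """Process customer replies until a surname is found or rounds run out."""
    rounds = 0
    for reply in replies:
        rounds += 1
        surname = resolve(reply)
        if surname is not None:
            return surname  # accurate surname obtained, end the dialogue
        if rounds == max_rounds:
            return None     # threshold reached, end without a surname
    return None


# Toy resolver: succeeds only on an accurate surname description.
def toy_resolve(text: str) -> Optional[str]:
    return "王" if "三橫一豎王" in text else None


print(run_dialogue(["我姓王", "是三橫一豎王"], toy_resolve))
```

With the toy resolver, the first reply yields nothing and the second resolves the surname; if three replies in a row fail, the loop stops without consulting any further replies.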
The present invention further provides a method for obtaining an accurate surname, which may be implemented by the system 1 shown in FIG. 1. The method includes the following steps:
(a) When the accurate surname confirmation module determines that the text includes neither an accurate surname description nor a surname confirmation statement, the accurate surname confirmation module extracts a surname from the text.
(b) The similar sound acquisition module uses the surname to obtain a similar phonetic symbol sequence.
(c) The TOP N surname generation module uses the similar phonetic symbol sequence to obtain the TOP N surnames.
(d) The accurate surname follow-up question generation module generates an accurate surname follow-up question corresponding to the TOP N surnames.
The method has been described in the foregoing embodiments and is not repeated here.
In summary, the system and method for obtaining an accurate surname of the present invention obtain the customer's accurate surname by means of accurate surname descriptions, similar phonetic symbol sequences, and accurate surname follow-up questions. In addition, multiple rounds of dialogue with the customer improve the accuracy of the surname obtained. A conversation service system that needs the customer's surname can therefore obtain the accurate surname more effectively.
1: system for obtaining an accurate surname
10: speech recognition and text conversion module
20: accurate surname confirmation module
30: similar sound acquisition module
40: TOP N surname generation module
50: accurate surname follow-up question generation module
11: Chinese character library
21: surname description database
41: Hundred Family Surnames database
S100~S500, S101~S103, S201~S205, S301~S302, S401~S402, S501~S502: steps
FIG. 1 is a schematic diagram of a system for obtaining an accurate surname according to an embodiment of the present invention.
FIG. 2 is a flowchart of the operation of the system shown in FIG. 1.
FIG. 3 is a detailed flowchart of step S100 shown in FIG. 2.
FIG. 4 is a detailed flowchart of step S200 shown in FIG. 2.
FIG. 5 is a detailed flowchart of step S300 shown in FIG. 2.
FIG. 6 is a detailed flowchart of step S400 shown in FIG. 2.
FIG. 7 is a detailed flowchart of step S500 shown in FIG. 2.
S100~S500: steps
Claims (8)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| TW112138579A TWI876580B (en) | 2023-10-06 | 2023-10-06 | System and method for obtaining accurate surname |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| TW112138579A TWI876580B (en) | 2023-10-06 | 2023-10-06 | System and method for obtaining accurate surname |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| TWI876580B (en) | 2025-03-11 |
| TW202516494A (en) | 2025-04-16 |
Family
ID=95830252
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| TW112138579A TWI876580B (en) | System and method for obtaining accurate surname | 2023-10-06 | 2023-10-06 |
Country Status (1)
| Country | Link |
|---|---|
| TW (1) | TWI876580B (en) |
Citations (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| TW200937214A (en) * | 2008-01-04 | 2009-09-01 | Yahoo Inc | Identifying and employing social network relationships |
| CN101542591A (en) * | 2006-09-25 | 2009-09-23 | 维里逊商务网络服务有限公司 | Method and system for providing speech recognition |
| US20170011024A1 (en) * | 2014-02-04 | 2017-01-12 | Nigel Henry CANNINGS | System and method for contextualising a stream of unstructured text representative of spoken word |
| CN110678884A (en) * | 2016-09-23 | 2020-01-10 | 欧佩科技股份公司 | System and method for customizable pre-dispatch monotony for transportation services |
- 2023-10-06: TW application TW112138579A, patent TWI876580B, status active
Also Published As
| Publication number | Publication date |
|---|---|
| TW202516494A (en) | 2025-04-16 |