TW201939391A

TW201939391A - Voice service system for allowing consumers to input data by using voice

Info

Publication number: TW201939391A
Application number: TW107109329A
Authority: TW
Inventors: 賴志揚
Original assignee: 安源資訊股份有限公司
Priority date: 2018-03-19
Filing date: 2018-03-19
Publication date: 2019-10-01

Abstract

A voice service system includes a cloud server apparatus and at least one multimedia terminal machine having a sound receiving device, wherein the multimedia terminal machine includes a touch display screen, an operation interface module, an activation module, a voice recognition module, a service activation module and a service operation module. The sound receiving device is capable of receiving at least one voice signal. The voice recognition module may recognize a text message from the voice signal so received. Then, after the service activation module determines and activates a type of service based on the content of the recognized text message, the service operation module further performs an operation of the determined type of service, so as to input data required by the operation of different types of service into the content of the text message recognized from the received voice signal.

Description

Voice service system

本發明是有關一種語音服務系統，特別是一種讓消費者能夠以語音方式輸入資料以完成交易之系統。The invention relates to a voice service system, in particular to a system that enables consumers to input data in a voice manner to complete transactions.

目前電子科技發達的現今，語音辨識系統與機制已發展的相對成熟，因此，語音辨識功能由於具有方便的使用性、提高效率、節省時間成本、廣泛應用、趣味性等特性與優勢，故近年更成為熱門的產品功能之一。At present, with the development of electronic technology, speech recognition systems and mechanisms have been relatively mature. Therefore, the speech recognition function has the characteristics and advantages of convenient usability, improved efficiency, time and cost saving, wide application, and fun. Become one of the hot product features.

而對於多媒體終端機台來說，目前常用的選擇服務與使用服務大多是透過觸控螢幕的輸入來進行操作並提供服務，然而語音辨識系統的發展，若能夠將該多媒體終端機台結合語音辨識系統，將能夠透過消費者所發出之語音進行辨識所選擇之服務的種類，並能夠於服務的運作過程中，更能夠接收消費者發出之語音所辨識之文字內容，以達到全語音服務的狀態，如此本發明應為一最佳解決方案。For multimedia terminals, most of the commonly used selection services and services are operated and provided through touch screen input. However, the development of speech recognition systems, if the multimedia terminals can be combined with speech recognition The system will be able to identify the type of service selected by the voice issued by the consumer, and be able to receive the text content recognized by the voice emitted by the consumer during the operation of the service to achieve the status of full voice service Therefore, the present invention should be an optimal solution.

一種語音服務系統，係至少包括：一雲端伺服設備；至少一個具有收音裝置之多媒體終端機台，係與該雲端伺服設備進行連線，而該多媒體終端機台係包含一觸控顯示螢幕；一操作介面模組，係用以提供至少一個能夠顯示於該觸控顯示螢幕上的操作頁面，以讓一消費者能夠觸控使用該多媒體終端機台；一啟動模組，係與該操作介面模組及該收音裝置電性連接，用以能夠啟動語音服務機制，以使該收音裝置能夠進行接收至少一個語音訊號；一語音辨識模組，係與該操作介面模組及該啟動模組電性連接，用以能夠將所接收之語音訊號進行辨識出一文字訊息；一服務啟動模組，係與該操作介面模組及該語音辨識模組電性連接，用以能夠依據所辨識出之文字訊息的內容進行判斷並啟動服務類別；以及一服務運作模組，係與該操作介面模組及該服務啟動模組電性連接，用以能夠依據所辨識出之文字訊息的內容進行所判斷服務類別之運作。A voice service system includes at least: a cloud servo device; at least one multimedia terminal with a radio device, which is connected to the cloud servo device, and the multimedia terminal includes a touch display screen; The operation interface module is used to provide at least one operation page that can be displayed on the touch display screen, so that a consumer can use the multimedia terminal with touch; a startup module is connected to the operation interface module. And the radio device are electrically connected to enable the voice service mechanism to enable the radio device to receive at least one voice signal; a voice recognition module is electrically connected to the operation interface module and the activation module A connection for recognizing a received text signal as a text message; a service activation module electrically connected to the operation interface module and the speech recognition module to enable the text message to be recognized based on Content to determine and start the service category; and a service operation module, related to the operation interface module and the service startup module Connection to be able to operate the service categories judged based on the content of the identified text messages.

更具體的說，所述多媒體終端機台更包含有一與該啟動模組電性連接之感測裝置，用以偵測於該多媒體終端機台前方具有至少一位使用者，並能夠透過該啟動模組於該觸控顯示螢幕上的操作頁面，進行確認是否啟動該語音服務機制。More specifically, the multimedia terminal further includes a sensing device electrically connected to the startup module, and is used to detect that there is at least one user in front of the multimedia terminal, and can be started through the startup. The module confirms whether to activate the voice service mechanism on the operation page on the touch display screen.

更具體的說，所述多媒體終端機台更包含有一與該操作介面模組、服務啟動模組及該服務運作模組電性連接之觸控輸入模組，而該觸控輸入模組用以能夠於該操作頁面上進行輸入資料或/及確認操作。More specifically, the multimedia terminal further includes a touch input module electrically connected to the operation interface module, the service start module, and the service operation module, and the touch input module is used for Able to enter data or confirm operations on this operation page.

更具體的說，所述多媒體終端機台更包含有一與該啟動模組、服務啟動模組及該服務運作模組電性連接之聲音輸出裝置，而該聲音輸出裝置用以能夠於該啟動模組、服務啟動模組及該服務運作模組運作時播放至少一個語音檔。More specifically, the multimedia terminal further includes a sound output device electrically connected to the startup module, the service startup module, and the service operation module, and the sound output device is used to enable the At least one voice file is played when the group, the service startup module, and the service operation module operate.

更具體的說，所述服務啟動模組能夠於該觸控顯示螢幕上的操作頁面，進行確認服務類別。More specifically, the service activation module can confirm a service type on an operation page on the touch display screen.

更具體的說，所述服務運作模組更能夠與該語音辨識模組電性連接，用以於該操作頁面上進行不同服務類別之運作，更能夠將所接收之語音訊號所辨識出之文字訊息的內容進行輸入不同服務類別運作所需之資料。More specifically, the service operation module is more capable of being electrically connected to the voice recognition module, and is used to operate different service categories on the operation page, and is able to recognize the text recognized by the received voice signal. The content of the message is used to enter the information required for the operation of different service categories.

有關於本發明其他技術內容、特點與功效，在以下配合參考圖式之較佳實施例的詳細說明中，將可清楚的呈現。Regarding other technical contents, features and effects of the present invention, they will be clearly presented in the following detailed description of the preferred embodiments with reference to the drawings.

請參閱第1圖，為本發明語音服務系統之整體架構示意圖，由圖中可知，本發明之語音服務系統係包含一雲端伺服設備1及一多媒體終端機台2，其中該雲端伺服設備1係用以運算不同服務類型所需之處理作業，而該多媒體終端機台2係與該雲端伺服設備1進行連線，並能夠與該雲端伺服設備1進行資料交換。Please refer to FIG. 1, which is a schematic diagram of the overall architecture of the voice service system of the present invention. As can be seen from the figure, the voice service system of the present invention includes a cloud server device 1 and a multimedia terminal 2. The cloud server device 1 is It is used to calculate processing operations required for different service types, and the multimedia terminal 2 is connected to the cloud server device 1 and can exchange data with the cloud server device 1.

而該多媒體終端機台2係包含一觸控顯示螢幕201、一操作介面模組202、一啟動模組203、一語音辨識模組204、一收音裝置205、一服務啟動模組206、一服務運作模組207、一觸控輸入模組208、一感測模組209及一聲音輸出裝置210，其中該操作介面模組202用以提供至少一個能夠顯示於該觸控顯示螢幕201上的操作頁面，以讓一消費者能夠觸控使用該多媒體終端機台2。The multimedia terminal 2 includes a touch display screen 201, an operation interface module 202, a startup module 203, a voice recognition module 204, a radio device 205, a service startup module 206, and a service. The operation module 207, a touch input module 208, a sensing module 209, and a sound output device 210, wherein the operation interface module 202 is used to provide at least one operation that can be displayed on the touch display screen 201 Page so that a consumer can use the multimedia terminal 2 with touch.

其中該啟動模組203用以能夠啟動語音服務機制，以使該收音裝置205能夠進行接收至少一個語音訊號，而該語音辨識模組204用以能夠將所接收之語音訊號進行辨識出一文字訊息。The activation module 203 is used to enable a voice service mechanism so that the radio device 205 can receive at least one voice signal, and the voice recognition module 204 is used to recognize a received voice signal as a text message.

其中該服務啟動模組206用以能夠依據所辨識出之文字訊息的內容進行判斷並啟動服務類別，因此能夠於該觸控顯示螢幕201上的操作頁面進行確認服務類別。The service activation module 206 is capable of determining and activating a service type according to the content of the recognized text message, and thus can confirm the service type on the operation page on the touch display screen 201.

其中該服務運作模組207用以能夠依據所辨識出之文字訊息的內容進行所判斷服務類別之運作，且更能夠與該語音辨識模組204電性連接，故於該操作頁面上進行不同服務類別之運作，更能夠將所接收之語音訊號所辨識出之文字訊息的內容進行輸入不同服務類別運作所需之資料。The service operation module 207 is capable of performing the operation of the determined service type according to the content of the identified text message, and can be electrically connected to the voice recognition module 204, so different services are performed on the operation page. The operation of the category can further input the content of the text message recognized by the received voice signal into the data required for the operation of different service categories.

其中該觸控輸入模組208用以能夠於該操作頁面上進行輸入資料或/及確認操作，而該感測模組209用以偵測於該多媒體終端機台2前方具有至少一位使用者，並能夠透過該啟動模組203於該觸控顯示螢幕201上的操作頁面，進行確認是否啟動該語音服務機制，且該聲音輸出裝置210用以能夠於該啟動模組203、服務啟動模組206及該服務運作模組207運作時播放至少一個語音檔。The touch input module 208 is used to input data or / and confirm operations on the operation page, and the sensing module 209 is used to detect that there is at least one user in front of the multimedia terminal 2 And can use the operation page of the activation module 203 on the touch display screen 201 to confirm whether to activate the voice service mechanism, and the sound output device 210 is used to enable the activation module 203 and the service activation module 206 and the service operation module 207 play at least one voice file during operation.

由於本發明能夠達到服務查找及全語音服務的服務，先以服務查找做為實施說明，如第3A~3C圖所示，當按下「關鍵字/語音查詢」，則能夠進一步選擇「語音查詢」，之後，於操作頁面2021上顯示選擇[語音查詢]按鈕後文字提醒消費者拿起話筒；Since the present invention can achieve the service search and full voice service, the service search is used as an implementation description. As shown in Figures 3A to 3C, when "Keyword / Voice Search" is pressed, the "Voice Search" can be further selected. ”, After that, the text reminds consumers to pick up the microphone after selecting the [Voice Search] button on the operation page 2021.

之後，如第3D圖所示，當多媒體終端機台2偵測話筒(收音裝置205)被拿起後則會跳出語音文字對話框，並能夠搭配該聲音輸出裝置210進行發出特定的語音檔詢問消費者需要何種服務，當辨識確認發出之語音是代表哪一類的服務之後，則會連結至不同服務的頁面，如第3E及3F圖所示，則能夠陸續選擇及輸入車種及車牌號碼，而該多媒體終端機台2則能夠連線至外部，以進行查找該車牌號碼所應繳之停車費，最後如第3G~3I圖所示，消費者則能夠選取要繳費的項目，並於確定之後，就直接將繳費單列印出來。Then, as shown in FIG. 3D, when the multimedia terminal 2 detects that the microphone (radio device 205) is picked up, a voice text dialog box will pop up, and it can be used with the sound output device 210 to issue a specific voice file query What kind of services do consumers need? After identifying and confirming which types of services the voices represent, they will be linked to the pages of different services. As shown in Figures 3E and 3F, they can successively select and enter vehicle types and license plate numbers. The multimedia terminal 2 can be connected to the outside to find the parking fee payable by the license plate number. Finally, as shown in Figures 3G to 3I, consumers can select the items to be paid and determine After that, print the payment slip directly.

而本發明當應用於全語音服務的服務時，與前述實施不同之處，如第4A~4B圖所示，則是一步一步以語音引導消費者回答，並將回答辨識後的結果顯示如第4C圖所示，而消費者更能夠透過下方的虛擬鍵盤進行修正，而之後選擇繳費項目並列印繳費單與前一實施例相同，故不重複贅述。When the present invention is applied to a full voice service, the difference from the previous implementation, as shown in Figures 4A to 4B, is to guide the consumer to answer step by step with voice, and display the result of the recognition as shown in the figure. As shown in Figure 4C, consumers can use the virtual keyboard below to make corrections, and then select the payment items and print the payment slip, which is the same as the previous embodiment, so it will not be repeated.

本發明所提供之語音服務系統，與其他習用技術相互比較時，其優點如下： (1) 本發明能夠將該多媒體終端機台結合語音辨識系統，將能夠透過消費者所發出之語音進行辨識所選擇之服務的種類。 (2) 本發明更能夠於服務的運作過程中，更能夠接收消費者發出之語音所辨識之文字內容，以達到全語音服務的狀態。Compared with other conventional technologies, the voice service system provided by the present invention has the following advantages: (1) The present invention can integrate the multimedia terminal with a voice recognition system, and can recognize the voice recognition system through the voices sent by consumers. The type of service selected. (2) The present invention is more capable of receiving the text content recognized by the voice emitted by the consumer during the operation of the service, so as to achieve the state of the full voice service.

本發明已透過上述之實施例揭露如上，然其並非用以限定本發明，任何熟悉此一技術領域具有通常知識者，在瞭解本發明前述的技術特徵及實施例，並在不脫離本發明之精神和範圍內，當可作些許之更動與潤飾，因此本發明之專利保護範圍須視本說明書所附之請求項所界定者為準。The present invention has been disclosed as above through the above-mentioned embodiments, but it is not intended to limit the present invention. Anyone with ordinary knowledge in this technical field will understand the aforementioned technical features and embodiments of the present invention without departing from the scope of the present invention. Within the spirit and scope, some changes and retouching can be made. Therefore, the scope of patent protection of the present invention shall be subject to the definition in the claims attached to this specification.

1‧‧‧雲端伺服設備1‧‧‧ Cloud Servo Equipment

2‧‧‧多媒體終端機台2‧‧‧Multimedia Terminal

201‧‧‧觸控顯示螢幕201‧‧‧Touch display

202‧‧‧操作介面模組202‧‧‧operation interface module

2021‧‧‧操作頁面2021‧‧‧ operation page

203‧‧‧啟動模組203‧‧‧Activation module

204‧‧‧語音辨識模組204‧‧‧Speech recognition module

205‧‧‧收音裝置205‧‧‧Radio

206‧‧‧服務啟動模組206‧‧‧Service activation module

207‧‧‧服務運作模組207‧‧‧Service Operation Module

208‧‧‧觸控輸入模組208‧‧‧Touch Input Module

209‧‧‧感測模組209‧‧‧Sensor Module

210‧‧‧聲音輸出裝置 210‧‧‧ sound output device

[第1圖]係本發明語音服務系統之整體架構示意圖。 [第2圖]係本發明語音服務系統之多媒體終端機台之架構示意圖。 [第3A圖]係本發明語音服務系統之第一實施示意圖。 [第3B圖]係本發明語音服務系統之第一實施示意圖。 [第3C圖]係本發明語音服務系統之第一實施示意圖。 [第3D圖]係本發明語音服務系統之第一實施示意圖。 [第3E圖]係本發明語音服務系統之第一實施示意圖。 [第3F圖]係本發明語音服務系統之第一實施示意圖。 [第3G圖]係本發明語音服務系統之第一實施示意圖。 [第3H圖]係本發明語音服務系統之第一實施示意圖。 [第3I圖]係本發明語音服務系統之第一實施示意圖。 [第4A圖]係本發明語音服務系統之第一實施示意圖。 [第4B圖]係本發明語音服務系統之第一實施示意圖。 [第4C圖]係本發明語音服務系統之第一實施示意圖。[Figure 1] is a schematic diagram of the overall architecture of the voice service system of the present invention. [Figure 2] Schematic diagram of the architecture of a multimedia terminal in the voice service system of the present invention. [Figure 3A] is a schematic diagram of a first implementation of the voice service system of the present invention. [Figure 3B] is a schematic diagram of a first implementation of the voice service system of the present invention. [Figure 3C] is a schematic diagram of the first implementation of the voice service system of the present invention. [Figure 3D] is a schematic diagram of the first implementation of the voice service system of the present invention. [Figure 3E] is a schematic diagram of a first implementation of the voice service system of the present invention. [Figure 3F] is a schematic diagram of a first implementation of the voice service system of the present invention. [Figure 3G] is a schematic diagram of the first implementation of the voice service system of the present invention. [Figure 3H] is a schematic diagram of the first implementation of the voice service system of the present invention. [Figure 3I] is a schematic diagram of the first implementation of the voice service system of the present invention. [FIG. 4A] A schematic diagram of the first implementation of the voice service system of the present invention. [FIG. 4B] It is a schematic diagram of the first implementation of the voice service system of the present invention. [Figure 4C] is a schematic diagram of a first implementation of the voice service system of the present invention.

Claims

A voice service system includes: a cloud servo device; at least one multimedia terminal with a radio device, which is connected to the cloud servo device, and the multimedia terminal includes: a touch display screen; The operation interface module is used to provide at least one operation page that can be displayed on the touch display screen, so that a consumer can use the multimedia terminal with touch; a startup module is connected to the operation interface module. And the radio device are electrically connected to enable the voice service mechanism to enable the radio device to receive at least one voice signal; a voice recognition module is electrically connected to the operation interface module and the activation module A connection for recognizing a text message from the received voice signal; and a service activation module electrically connected to the operation interface module and the voice recognition module to enable the text message to be recognized based on Content to determine and start the service category; and a service operation module, related to the operation interface module and the service start Group of electrical connection to be able to operate the service categories judged based on the content of the identified text messages.

The voice service system according to claim 1, wherein the multimedia terminal further includes a sensing device electrically connected to the activation module, and is configured to detect that there is at least one user in front of the multimedia terminal. , And can confirm whether to activate the voice service mechanism through the operation page of the activation module on the touch display screen.

The voice service system according to claim 1, wherein the multimedia terminal further includes a touch input module electrically connected to the operation interface module, the service start module, and the service operation module, and the touch The control input module is used to input data and / or confirm operations on the operation page.

The voice service system according to claim 1, wherein the multimedia terminal further includes a sound output device electrically connected to the startup module, the service startup module, and the service operation module, and the sound output device is used for It is capable of playing at least one voice file when the startup module, the service startup module and the service operation module are operating.

The voice service system according to claim 1, wherein the service activation module is capable of confirming a service type on an operation page on the touch display screen.

The voice service system according to claim 1, wherein the service operation module is further capable of being electrically connected to the voice recognition module, and is used to operate different service categories on the operation page, and is able to better receive the received voice The content of the text message identified by the signal is used to enter the data required for the operation of different service categories.