WO2021134358A1 - 人机交互方法、装置和系统 - Google Patents
人机交互方法、装置和系统 Download PDFInfo
- Publication number
- WO2021134358A1 WO2021134358A1 PCT/CN2019/130195 CN2019130195W WO2021134358A1 WO 2021134358 A1 WO2021134358 A1 WO 2021134358A1 CN 2019130195 W CN2019130195 W CN 2019130195W WO 2021134358 A1 WO2021134358 A1 WO 2021134358A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- touch operation
- operated
- touch
- voice instruction
- position information
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/048—Interaction techniques based on graphical user interfaces [GUI]
- G06F3/0481—Interaction techniques based on graphical user interfaces [GUI] based on specific properties of the displayed interaction object or a metaphor-based environment, e.g. interaction with desktop elements like windows or icons, or assisted by a cursor's changing behaviour or appearance
- G06F3/0482—Interaction with lists of selectable items, e.g. menus
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/048—Interaction techniques based on graphical user interfaces [GUI]
- G06F3/0484—Interaction techniques based on graphical user interfaces [GUI] for the control of specific functions or operations, e.g. selecting or manipulating an object, an image or a displayed text element, setting a parameter value or selecting a range
- G06F3/04842—Selection of displayed objects or displayed text elements
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/048—Interaction techniques based on graphical user interfaces [GUI]
- G06F3/0484—Interaction techniques based on graphical user interfaces [GUI] for the control of specific functions or operations, e.g. selecting or manipulating an object, an image or a displayed text element, setting a parameter value or selecting a range
- G06F3/04845—Interaction techniques based on graphical user interfaces [GUI] for the control of specific functions or operations, e.g. selecting or manipulating an object, an image or a displayed text element, setting a parameter value or selecting a range for image manipulation, e.g. dragging, rotation, expansion or change of colour
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/048—Interaction techniques based on graphical user interfaces [GUI]
- G06F3/0484—Interaction techniques based on graphical user interfaces [GUI] for the control of specific functions or operations, e.g. selecting or manipulating an object, an image or a displayed text element, setting a parameter value or selecting a range
- G06F3/04847—Interaction techniques to control parameter settings, e.g. interaction with sliders or dials
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/048—Interaction techniques based on graphical user interfaces [GUI]
- G06F3/0487—Interaction techniques based on graphical user interfaces [GUI] using specific features provided by the input device, e.g. functions controlled by the rotation of a mouse with dual sensing arrangements, or of the nature of the input device, e.g. tap gestures based on pressure sensed by a digitiser
- G06F3/0488—Interaction techniques based on graphical user interfaces [GUI] using specific features provided by the input device, e.g. functions controlled by the rotation of a mouse with dual sensing arrangements, or of the nature of the input device, e.g. tap gestures based on pressure sensed by a digitiser using a touch-screen or digitiser, e.g. input of commands through traced gestures
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/048—Interaction techniques based on graphical user interfaces [GUI]
- G06F3/0487—Interaction techniques based on graphical user interfaces [GUI] using specific features provided by the input device, e.g. functions controlled by the rotation of a mouse with dual sensing arrangements, or of the nature of the input device, e.g. tap gestures based on pressure sensed by a digitiser
- G06F3/0488—Interaction techniques based on graphical user interfaces [GUI] using specific features provided by the input device, e.g. functions controlled by the rotation of a mouse with dual sensing arrangements, or of the nature of the input device, e.g. tap gestures based on pressure sensed by a digitiser using a touch-screen or digitiser, e.g. input of commands through traced gestures
- G06F3/04883—Interaction techniques based on graphical user interfaces [GUI] using specific features provided by the input device, e.g. functions controlled by the rotation of a mouse with dual sensing arrangements, or of the nature of the input device, e.g. tap gestures based on pressure sensed by a digitiser using a touch-screen or digitiser, e.g. input of commands through traced gestures for inputting data by handwriting, e.g. gesture or text
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/16—Sound input; Sound output
- G06F3/167—Audio in a user interface, e.g. using voice commands for navigating, audio feedback
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F2203/00—Indexing scheme relating to G06F3/00 - G06F3/048
- G06F2203/038—Indexing scheme relating to G06F3/038
- G06F2203/0381—Multimodal input, i.e. interface arrangements enabling the user to issue commands by simultaneous use of input devices of different nature, e.g. voice plus gesture on digitizer
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F2203/00—Indexing scheme relating to G06F3/00 - G06F3/048
- G06F2203/048—Indexing scheme relating to G06F3/048
- G06F2203/04808—Several contacts: gestures triggering a specific function, e.g. scrolling, zooming, right-click, when the user establishes several contacts with the surface simultaneously; e.g. using several fingers or a combination of fingers and pen
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/223—Execution procedure of a spoken command
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02D—CLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
- Y02D10/00—Energy efficient computing, e.g. low power processors, power management or thermal management
Definitions
- This application relates to the field of human-computer interaction technology, and in particular to human-computer interaction methods, devices and systems.
- HMI Human computer interaction
- HMI Human computer interaction
- the system here can be a variety of machines or computerized systems and software.
- Multi-modality (modality) human-computer interaction in layman's terms, refers to the integration of multiple senses. Multi-modal interaction refers to human-computer interaction through multiple methods such as text, voice, vision, motion, and environment. How to achieve multi-modal human-computer interaction to improve user experience is a technical problem that needs to be solved urgently.
- the human-computer interaction method, device, and system provided in the present application help realize multi-modal human-computer interaction and improve user experience.
- the present application provides a human-computer interaction method, which is applied to a human-computer interaction system, and the human-computer interaction system includes a touch screen.
- the method includes: receiving a first touch operation and a second touch operation having overlapping touch durations, and receiving a voice instruction, the voice instruction being a voice received within the overlapping touch duration of the first touch operation and the second touch operation instruction.
- the position information of the object to be operated for the voice instruction on the touch screen is determined. Based on the determined position information of the object to be operated on the touch screen, the voice command is executed.
- the to-be-executed voice command (that is, the above-mentioned voice command) is determined.
- “wake words need to be frequently used to wake up the human-computer interaction system Only the “voice instructions to be executed” can be determined based on the awakened human-computer interaction system.
- This technical solution does not require frequent use of wake-up words to wake up the system, thus helping to improve the user’s experience in the human-computer interaction process.
- the position information of the object to be operated on the touch screen for the voice command to be executed is determined. In this way, in a large screen scene, the problem of long-distance sliding on the large screen can be alleviated, thereby helping to improve the user's experience in the human-computer interaction process.
- the distance between the contact point of the first touch operation and the contact point of the second touch operation is greater than or equal to a preset threshold.
- the human-computer interaction method provided in this application can be applied to a large-screen scene. By using multiple touch operations to trigger and monitor voice commands in a large screen scene, the problem of long-distance sliding on the large screen can be effectively alleviated, thereby improving the user's experience in the human-computer interaction process.
- the aforementioned "determining the position information of the object to be operated by the voice instruction on the touch screen” may include: determining the area to be operated on the touch screen of the voice instruction, The to-be-operated area contains the to-be-operated object.
- the above “executing the voice instruction based on the determined position information of the object to be operated on the touch screen” may include: determining the area to be operated on the touch screen according to the content of the received voice instruction, and selecting the area to be operated on the touch screen. Perform voice commands in the area.
- the aforementioned "executing a voice instruction in the area to be operated" may include: when the voice instruction is used to instruct the selected object, selecting the object in the area to be operated; or, When the voice instruction is used to instruct to modify the attribute information of the object to the target attribute information, the attribute information of the object in the area to be operated is modified to the target attribute information; or when the voice instruction is used to instruct to delete the object, the area to be operated is deleted Or, when the voice instruction is used to instruct to create an object, create an object in the area to be operated.
- This possible design provides several examples of the human-computer interaction system executing the voice command.
- the human-computer interaction method may further include: the human-computer interaction system further displays the determined area to be operated on the touch screen. In this way, by displaying the area to be operated on the touch screen, the process of the human-computer interaction system executing the voice command is visualized, and the user experience is improved.
- determining the position information of the object to be operated by the voice instruction in the touch screen may include: determining the position information of the first endpoint on the touch screen based on the position information of the contact point of the first touch operation, and determining the position information of the first endpoint on the touch screen based on the second touch.
- the position information of the operated contact determines the position information of the second end point in the touch screen.
- the position information of the touch-operated contact point can be used to determine the position in the voice instruction that indicates the creation of the object to be operated, that is, the position information of the touch-operated contact point can be used as a parameter of the voice instruction.
- the above-mentioned object to be operated includes at least one of the following: a straight line, a line segment, a polyline, a curve, or a polygon. That is to say, the human-computer interaction system of the present application can create objects to be operated including a first end point and a second end point, such as a straight line, a line segment, a polyline, a curve, or a polygon, based on this method.
- the aforementioned "executing a voice command based on the determined position information of the object to be operated on the touch screen” may include: based on the position information of the object to be operated on the touch screen, and According to the sequence of the first touch operation and the start touch time of the second touch operation, the voice command is executed.
- the sequence of the start touch time of the first touch operation and the second touch operation may be used as a parameter of the voice command.
- the voice command can be executed Including: when a voice command is used to instruct to create a circle, create the circle based on the sequence of the first touch operation and the start touch time of the second touch operation.
- the position of the center of the circle is based on the contact point of the first touch operation
- the position is determined, and the position of the point in the circle of the circle is determined based on the position of the contact point of the second touch operation.
- the start touch time of the first touch operation precedes or follows the second touch operation.
- the sequence of the first touch operation and the start time of the second touch operation is used as a parameter of the voice instruction for creating a circle.
- the above-mentioned "execute a voice command based on the position information of the object to be operated on the touch screen and the sequence of the first touch operation and the start touch time of the second touch operation” It may include: when a voice instruction is used to instruct to create a line with a one-way arrow, creating the line with a one-way arrow based on the sequence of the start touch time of the first touch operation and the second touch operation.
- a voice instruction is used to instruct to create a line with a one-way arrow
- the position of the end point pointed by the arrow in the line is determined based on the position of the contact point of the first touch operation
- the position of the end point not pointed by the arrow in the line is determined based on the position of the contact point of the second touch operation.
- the start touch time of the first touch operation precedes or follows the second touch operation.
- the sequence of the start touch time of the first touch operation and the second touch operation is used as a parameter of the voice instruction for creating a line with a one-way arrow.
- the present application provides a human-computer interaction method, which is applied to a computer device, and the human-computer interaction method includes: acquiring position information of a first contact point based on a first touch operation, and The position information of the second contact of the second touch operation, the first touch operation and the second touch operation have overlapping touch durations.
- a voice instruction is received, and the voice instruction is a voice instruction received within the overlapped touch duration of the first touch operation and the second touch operation.
- the position information of the object to be operated for the voice command is determined.
- the position information of the object to be operated is the position information of the object to be operated on the touch screen in the human-computer interaction system.
- the touch screen can be integrated on the computer device or set separately from the computer device. Based on the position information of the object to be operated, the voice command is executed.
- the distance between the contact point of the first touch operation and the contact point of the second touch operation is greater than or equal to a preset threshold.
- the above-mentioned "determining the position information of the object to be operated by the voice instruction” includes: determining the area to be operated for the voice instruction, and the area to be operated contains the object to be operated.
- the aforementioned "executing a gauge voice instruction based on the position information of the object to be operated” includes: executing the voice instruction in the area to be operated.
- the above-mentioned "execute the voice instruction in the area to be operated” includes: when the voice instruction is used to instruct the selected object, select the object in the area to be operated; or, when The voice command is used to instruct to modify the attribute information of the object to target attribute information, and to modify the attribute information of the object in the area to be operated to target attribute information; or, when the voice command is used to instruct to delete the object, the computer device deletes the pending operation Objects in the area; or, when a voice instruction is used to instruct to create an object, create an object in the area to be operated.
- determining the position information of the object to be operated by the voice command includes: determining the position information of the first endpoint on the touch screen based on the position information of the contact point of the first touch operation, and determining the position information of the contact point based on the second touch operation
- the location information determines the location information of the second endpoint on the touch screen.
- the above-mentioned object to be operated includes at least one of the following: a straight line, a line segment, a polyline, a curve, or a polygon.
- the above-mentioned "execute the voice command based on the position information of the object to be operated” includes: based on the position information of the object to be operated on the touch screen, and the first touch operation and the first touch operation 2. Execute the voice command in the sequence of the start touch time of the touch operation.
- the above execute voice commands based on the position information of the object to be operated on the touch screen and the sequence of the first touch operation and the start touch time of the second touch operation
- the computer device creates the circle based on the sequence of the first touch operation and the start touch time of the second touch operation.
- the position of the center of the circle is determined based on the position of the contact point of the first touch operation, and the position of the point on the circumference of the circle is determined based on the position of the contact point of the second touch operation.
- the start touch time of the first touch operation precedes or follows the second touch operation.
- the above "execute voice commands based on the position information of the object to be operated on the touch screen and the sequence of the first touch operation and the start touch time of the second touch operation” Including: when a voice command is used to instruct to create a line with a one-way arrow, create the line with a one-way arrow based on the sequence of the start touch time of the first touch operation and the second touch operation.
- the position of the end point pointed by the arrow in the line is determined based on the position of the contact point of the first touch operation
- the position of the end point not pointed by the arrow in the line is determined based on the position of the contact point of the second touch operation.
- the start touch time of the first touch operation precedes or follows the second touch operation.
- the present application provides a human-computer interaction method, which is applied to a human-computer interaction system, and the human-computer interaction system includes a touch screen.
- the human-computer interaction method includes: receiving a first touch operation and a second touch operation with overlapping touch durations, and receiving a voice instruction, the voice instruction being received within the overlapped touch duration of the first touch operation and the second touch operation Voice instructions to the destination.
- the voice command is executed.
- the to-be-executed voice instruction ie the above-mentioned voice instruction
- the distance between the contact point of the first touch operation and the contact point of the second touch operation is greater than or equal to a preset threshold.
- the present application provides a human-computer interaction method, which is applied to a human-computer interaction system, and the human-computer interaction system includes a touch screen.
- the human-computer interaction method includes: receiving a first touch operation and a second touch operation, and receiving a voice instruction.
- the voice instruction is determined based on the start touch duration of the first touch operation or the start touch duration of the second touch operation.
- the person determines the position information of the object to be operated by the voice instruction on the touch screen. Based on the position information of the object to be operated on the touch screen, the above-mentioned voice command is executed.
- the position information of the object to be operated on the touch screen for the voice command to be executed is determined.
- the problem of long-distance sliding on the large screen can be alleviated, thereby helping to improve the user's experience in the human-computer interaction process.
- the voice command is a voice command received within a preset time period starting from the start touch time of the target touch operation.
- the target touch operation is a touch operation with a touch start time before the first touch operation and the second touch operation.
- the target touch operation is a touch operation in which the touch start time is later in the first touch operation and the second touch operation. That is, the voice processing time window of the voice command may be determined based on the single-touch operation.
- the distance between the contact point of the first touch operation and the contact point of the second touch operation is greater than or equal to a preset threshold.
- the above-mentioned "determining the position information of the object to be operated on the touch screen of the voice instruction” may include: determining the area to be operated on the touch screen of the voice instruction.
- the operation area contains the object to be operated.
- the “executing the voice instruction based on the position information of the object to be operated on the touch screen” may include: determining the area to be operated on the touch screen according to the content of the voice instruction, and executing the voice instruction in the area to be operated .
- the present application provides a human-computer interaction system, which can be used to implement any of the methods provided in the first aspect, the third aspect, or the fourth aspect.
- the human-computer interaction system may include a touch screen and a processor.
- the above-mentioned touch screen is used to receive a first touch operation and a second touch operation, and the touch duration of the first touch operation and the second touch operation overlap.
- the foregoing processor is configured to receive a voice instruction, and the voice instruction is a voice instruction received within the overlapped touch duration of the first touch operation and the second touch operation.
- the above-mentioned processor is further configured to respond to the first touch operation and the second touch operation to determine the position information of the object to be operated by the voice instruction on the touch screen.
- the above-mentioned processor is further configured to execute voice instructions based on the position information of the object to be operated on the touch screen.
- the above-mentioned processor is configured to receive a first touch operation and a second touch operation with overlapping touch durations, and receive a voice command, which is performed between the first touch operation and the second touch operation.
- the above-mentioned processor is further configured to execute the voice instruction in response to the first touch operation and the second touch operation.
- the above-mentioned processor is used for the voice command received within a preset time period starting from the start touch time of the target touch operation.
- the target touch operation is a touch operation whose start time is earlier in the first touch operation and the second touch operation.
- the target touch operation is a touch operation in which the touch start time is later in the first touch operation and the second touch operation. That is, the voice processing time window of the voice command may be determined based on the single-touch operation.
- the human-computer interaction system further includes a voice collector.
- the above-mentioned voice collector is used to collect voice instructions in real time, and send the collected voice instructions to the processor.
- the above-mentioned voice collector performs information interaction with the above-mentioned processor to determine the collection and sending of voice instructions.
- the above-mentioned voice collector is configured to send a first instruction at the beginning of the overlapping touch duration of the above-mentioned first touch operation and the above-mentioned second touch operation, and the first instruction is used to instruct the voice collector to start collecting voice instructions.
- the processor is configured to send a second instruction to the voice collector at the end of the overlapping touch duration of the first touch operation and the second touch operation, and the second instruction is used to instruct the voice collector to end collecting voice instructions.
- this application provides a computer device that can be used to execute any of the methods provided in the second aspect.
- the computer device may specifically be a processor or a device containing a processor.
- the device can be divided into functional modules according to any of the methods provided in the second aspect.
- each function module can be divided corresponding to each function, or two or more functions can be integrated into one processing module.
- the computer device includes: a memory and one or more processors; the memory and the processor are coupled.
- the above-mentioned memory is used to store computer program code.
- the computer program code includes computer instructions.
- the computer device executes the human-computer interaction as described in the second aspect and any of its possible design methods. method.
- the present application provides a chip system, which is applied to a human-computer interaction system.
- the chip system includes one or more interface circuits and one or more processors.
- the interface circuit and the processor are interconnected by wires; the interface circuit is used to receive a signal from the memory of the human-computer interaction system and send the signal to the processor.
- the signal includes computer instructions stored in the memory; when the processor executes When the computer is instructed, the human-computer interaction system executes the human-computer interaction method described in any of the possible design methods provided in the first aspect, the third aspect, or the fourth aspect.
- the present application provides a chip system, which is applied to computer equipment.
- the chip system includes one or more interface circuits and one or more processors;
- the above-mentioned interface circuit and the above-mentioned processor are interconnected by wires; the above-mentioned interface circuit is used to receive a signal from the memory of the computer device and send the signal to the above-mentioned processor, and the signal includes computer instructions stored in the memory; when the above-mentioned processor executes the computer When instructed, the computer device executes the human-computer interaction method described in the second aspect and any of its possible design methods.
- the present application provides a computer-readable storage medium that includes computer instructions.
- the human-computer interaction system can realize the first aspect or the first aspect.
- this application provides a computer-readable storage medium that includes computer instructions that, when the computer instructions run on a computer device, enable the computer device to implement the second aspect and any of its possibilities The human-computer interaction method described in the design method.
- this application provides a computer program product, which when the computer program product runs on a human-computer interaction system, enables the human-computer interaction system to implement any one of the aspects provided in the first aspect, the third aspect, or the fourth aspect.
- this application provides a computer program product that, when the computer program product runs on a computer device, enables the computer device to implement the human-computer interaction method described in the second aspect and any of its possible design methods .
- FIG. 1 is a hardware structure diagram of a human-computer interaction system provided by an embodiment of the application
- FIG. 2 is a schematic diagram 1 of a structure of a human-computer interaction system provided by an embodiment of this application;
- FIG. 3 is a schematic diagram 2 of a structure of a human-computer interaction system provided by an embodiment of the application;
- FIG. 4 is a first schematic flowchart of a human-computer interaction method provided by an embodiment of this application.
- FIG. 5 is a first schematic diagram of a human-computer interaction method provided by an embodiment of this application.
- FIG. 6 is a second schematic diagram of a method for finding human-computer interaction in an embodiment of this application.
- FIG. 7 is a third schematic diagram of a human-computer interaction method provided by an embodiment of this application.
- FIG. 8 is a fourth schematic diagram of a human-computer interaction method provided by an embodiment of this application.
- FIG. 9 is a schematic diagram five of a human-computer interaction method provided by an embodiment of this application.
- FIG. 10 is a sixth schematic diagram of a human-computer interaction method provided by an embodiment of this application.
- FIG. 11 is a schematic diagram 7 of a human-computer interaction method provided by an embodiment of this application.
- FIG. 12 is a schematic diagram eight of a human-computer interaction method provided by an embodiment of this application.
- FIG. 13 is a second schematic flowchart of a human-computer interaction method provided by an embodiment of this application.
- FIG. 14 is a third schematic flowchart of a human-computer interaction method provided by an embodiment of this application.
- FIG. 15 is a fourth schematic flowchart of a human-computer interaction method provided by an embodiment of this application.
- FIG. 16 is a schematic structural diagram of a computer device provided by an embodiment of the application.
- FIG. 17 is a schematic structural diagram of a chip system provided by an embodiment of the application.
- FIG. 18 is a schematic diagram of the structure of a computer program product provided by an embodiment of the application.
- words such as “exemplary” or “for example” are used as examples, illustrations, or illustrations. Any embodiment or design solution described as “exemplary” or “for example” in the embodiments of the present application should not be construed as being more preferable or advantageous than other embodiments or design solutions. To be precise, words such as “exemplary” or “for example” are used to present related concepts in a specific manner.
- first and second are only used for descriptive purposes, and cannot be understood as indicating or implying relative importance or implicitly indicating the number of indicated technical features. Thus, the features defined with “first” and “second” may explicitly or implicitly include one or more of these features. In the description of the embodiments of the present application, unless otherwise specified, "plurality" means two or more.
- the embodiments of the present application provide a human-computer interaction method, device, and system.
- the touch duration of the first touch operation received by the human-computer interaction system overlaps the touch duration of the second touch operation.
- the human-computer interaction system executes the monitoring voice command.
- the human-computer interaction system executes the voice instruction according to the instruction of the voice instruction and the position information of the first touch operation and the second touch operation.
- the above-mentioned human-computer interaction method can be implemented by an application installed on the device, such as a human-computer interaction application.
- the above-mentioned application program may be an embedded application program installed in the device (that is, a system application of the device), or a downloadable application program.
- an embedded application is an application provided as a part of the implementation of a device (such as a mobile phone).
- a downloadable application is an application that can provide its own Internet protocol multimedia subsystem (IMS) connection.
- the downloadable application is an application that can be pre-installed in the device or can be downloaded and installed by the user.
- Third-party applications in the device is an embedded application program installed in the device.
- Fig. 1 shows the hardware structure of the human-computer interaction system provided by an embodiment of the present application.
- the human-computer interaction system 10 includes a processor 11, a memory 12, a touch screen 13 and a voice collector 14.
- the processor 11, the memory 12, the touch screen 13, and the voice collector 14 may be integrated on one device, or may be integrated on different devices.
- the processor 11, the memory 12, the touch screen 13, and the voice collector 14 are integrated on one device as an example.
- the human-computer interaction system 10 further includes a bus 15.
- the processor 11, the memory 12, the touch screen 13 and the voice collector 14 are connected by a bus 15.
- the processor 11 is the control center of the human-computer interaction system 10, and may be a general-purpose central processing unit (central processing unit, CPU), or other general-purpose processors.
- the general-purpose processor may be a microprocessor or any conventional processor.
- the processor 11 may include one or more CPUs, such as CPU 0 and CPU 1 shown in FIG. 1.
- the memory 12 can be a read-only memory (ROM) or other types of static storage devices that can store static information and instructions, a random access memory (RAM), or other types that can store information and instructions.
- ROM read-only memory
- RAM random access memory
- dynamic storage devices can also be electrically erasable programmable read-only memory (EEPROM), magnetic disk storage media or other magnetic storage devices, or can be used to carry or store instructions or data structures The form of the desired program code and any other medium that can be accessed by the computer, but not limited to this.
- the memory 12 may be independent of the processor 11.
- the memory 12 may be connected to the processor 11 through the bus 15 for storing data, instructions or program codes.
- the processor 11 calls and executes the instructions or program codes stored in the memory 12, it can implement the human-computer interaction method provided in the embodiments of the present application.
- the memory 12 may also be integrated with the processor 11.
- the touch screen 13 may specifically include a touch pad 131 and a display screen 132.
- the touchpad 131 can collect the user's touch events on or near it (for example, the user uses a finger, a stylus or any other suitable object to operate on the touchpad or near the touchpad), and collects The touch information is sent to other devices (for example, the processor 11).
- the user's touch event near the touchpad can be called floating touch; floating touch can mean that the user does not need to directly touch the touchpad in order to select, move or drag a target (such as an icon, etc.), but only The user is required to be near the device in order to perform the desired function.
- multiple types such as resistive, capacitive, infrared, and surface acoustic wave can be used to realize the touch panel.
- the display screen 132 may be used to display information input by the user or information provided to the user.
- the display screen 132 can be configured in the form of a liquid crystal display screen, an organic light emitting diode, or the like.
- the touchpad 131 can cover the display screen 132. When the touchpad 131 detects a touch event on or near it, it transmits it to the processor 11 to determine the type of the touch event.
- the processor 11 can determine the type of the touch event according to the touch event. The type provides a corresponding visual output on the display 132.
- the voice collector 14 is used to receive voice signals, convert the voice signals into electrical signals, and send them to other devices (for example, the processor 11) for processing.
- the voice collector can be a microphone, also called a "microphone”, or a "microphone”, etc.
- the bus 15 may be an industry standard architecture (ISA) bus, a peripheral component interconnect (PCI) bus, or an extended industry standard architecture (EISA) bus.
- ISA industry standard architecture
- PCI peripheral component interconnect
- EISA extended industry standard architecture
- the bus can be divided into address bus, data bus, control bus and so on. For ease of presentation, only one thick line is used in FIG. 1, but it does not mean that there is only one bus or one type of bus.
- FIG. 1 does not constitute a limitation to the human-computer interaction system 10.
- the human-computer interaction system 10 may include more or less than those shown in the figure. Component, or combination of some components, or different component arrangements.
- the human-computer interaction system 10 may be an electronic whiteboard, a smart phone, a notebook computer with a touch screen, a computer with a touch screen, a tablet, or a netbook. , Vehicle and other equipment.
- a processor 11, a memory 12, a touch screen 13 and a voice collector 14 are integrated in the electronic whiteboard 20.
- the above-mentioned human-computer interaction application program can be run in the electronic whiteboard 20.
- the human-computer interaction system 10 may further include a touch pen 21, and the touch pen 21 is used to input touch operations on the touch screen 13 of the electronic whiteboard 20.
- the human-computer interaction system 10 may include multiple devices to execute the human-computer interaction method provided in the embodiment of the present application.
- the human-computer interaction system 10 shown in FIG. 3 may include: an electronic whiteboard 20, a computer 32, and a projector 33.
- the human-computer interaction system 10 may further include a touch pen 21, and the touch pen 21 is used to input touch operations on the touch screen 13 of the electronic whiteboard 20.
- the processor 11 may be the processor of the computer 32.
- the memory 12 may be the memory of the computer 32, and the processor 11 and the memory 12 are connected through a bus 15. At this time, the above-mentioned human-computer interaction application program can be run in the computer 32.
- the touch screen 13 may be a touch screen of the electronic whiteboard 20.
- the voice collector 14 may be integrated in the electronic whiteboard 20.
- the voice collector 14 may also be integrated in the computer 32, the projector 33 or the touch pen 21, which is not limited in the implementation of this application.
- FIG. 4 shows a schematic flowchart of a human-computer interaction method provided by an embodiment of the present application.
- the human-computer interaction method includes the following steps:
- the touch screen receives the first touch operation and the second touch operation, and determines the position information of the first contact point and the position information of the second contact point.
- the first touch operation and the second touch operation may be operations in which the user touches on the touch screen with a finger or a touch pen.
- the first touch operation and the second touch operation each have a certain touch duration.
- the touch screen can receive the first touch operation and the second touch operation at the same time, and can also receive the first touch operation and the second touch operation at different times.
- the first contact is the contact point between the first touch operation and the touch screen.
- the second contact point is the contact point between the second touch operation and the touch screen.
- the position information of the contact point is used to characterize the position of the contact point on the touch screen.
- the embodiment of the present application does not limit the specific content of the position information of the contact point.
- the position information of the contact point may be the coordinates of the contact point in the touch screen. Exemplarily, as shown in FIG. 5, FIG. 5 shows the determined coordinates (x 1 , y 1 ) of the first contact A after the touch screen 50 receives the first touch operation, and the second touch operation is received Then, the coordinates (x 2 , y 2 ) of the second contact B are determined.
- the distance between the first contact and the second contact may be greater than or equal to a preset threshold. That is to say, the embodiment of the present application supports a scene with a relatively large distance between two touch points (such as a large-screen scene), which helps to solve the problem of high difficulty in one-handed operation in a large-screen scene.
- the preset threshold can be set according to actual conditions, which is not specifically limited in the embodiment of the present application.
- the size of the touch screen (that is, the size of the long side and/or the size of the short side of the touch screen) is greater than a threshold, that is, the embodiment of the present application supports a large-screen scene.
- the touch screen may also determine the position information of the first contact group after receiving the first touch operation.
- the touch screen can also determine the position information of the second contact group after receiving the second touch operation.
- the distance between every two contacts in a contact group is less than or equal to a preset distance.
- the following description of the embodiments of the present application takes the touch screen to determine the position information of the first contact point according to the first touch operation and the position information of the second contact point according to the second touch operation as an example for description.
- the touch screen sends the position information of the first contact point and the position information of the second contact point to the processor.
- the touch screen may periodically or trigger or real-time send the determined position information of the contact point to the processor.
- the touch screen can send one frame of data to the processor every cycle.
- the frame data includes the position information of the contact points of the touch operation determined in the period, for example, includes the position information of the first contact point and the second contact point in the period.
- the processor determines a voice processing time window in response to the first touch operation and the second touch operation.
- the processor may determine the voice processing time window according to any method:
- the processor may determine the voice processing time window according to the overlapped touch duration of the first touch operation and the second touch operation and a preset rule.
- the preset rule may indicate that the overlapped touch duration is a voice processing time window, or the preset rule may indicate that after the preset duration from the start of the overlapped touch duration, to the overlapped touch duration
- the end time is the voice processing time window.
- the voice processing time window is included in the overlapping touch duration, and the end time of the overlapping touch duration is the time when the processor ends the voice processing time window. That is, the voice processing time window may be the overlapped touch duration, or the duration of the voice processing time window may be less than the overlapped touch duration.
- FIG. 6 shows a schematic diagram of the relationship between the touch duration of the first touch operation, the touch duration of the second touch operation, and the voice processing time window.
- the touch duration of the first touch operation corresponding to the first contact is L 1
- the touch duration of the second touch operation corresponding to the second contact is L 2
- the duration of the voice processing time window is the first The touch duration L during which the one touch operation and the second touch operation overlap.
- the processor may send a first instruction to the voice collector at the beginning of the overlapping touch duration of the first touch operation and the second touch operation, where the first instruction is used to instruct the voice collector to start collecting voice instructions.
- the processor sends a second instruction to the voice collector at the end of the overlapping touch duration of the first touch operation and the second touch operation, and the second instruction is used to instruct the voice collector to end the collection of the voice instruction.
- the processor may determine that the moment when the first instruction is sent to the voice collector to the moment when the second instruction is sent to the voice collector is the voice Processing time window.
- the processor may not send the first instruction and the second instruction to the voice collector.
- the processor may determine the touch duration of the first touch operation by tracking the first touch point.
- the processor may determine the touch duration of the second touch operation by tracking the second touch point.
- the voice collector collects the voice instruction, and sends the voice instruction to the processor.
- the voice collector collects voice instructions in real time and sends the voice instructions to the processor.
- This voice collection mode corresponds to the mode in S403 above.
- the voice collector collects voice instructions according to the first instruction received from the processor, and sends the collected voice instructions to the processor.
- the voice collector also stops the collection of voice commands according to the second command received from the processor.
- This voice collection method corresponds to the second method in S403.
- the voice command can be a voice command issued by a user or a certain device and collected by a voice collector.
- the content of the voice instruction may include selecting an object, modifying an object, deleting an object, or creating an object, etc.
- the voice instruction for the selected object may include "selected text”
- the voice instruction for modifying the object may include “bold text color”
- the voice instruction for deleting the object may include “delete picture”
- the voice for creating the object above Instructions can include "create a table with 3 rows and 2 columns", or "create a circle” and so on.
- S405 In response to the first touch operation and the second touch operation, the processor determines the position information of the object to be operated on the touch screen of the voice instruction received within the voice processing time window.
- the processor may receive one or more voice commands.
- the processor may discard the voice command or invalidate the voice command.
- the processor may receive one or more voice instructions and execute the one or more voice instructions.
- one voice command cannot span multiple voice processing time windows.
- the end point of the first voice processing time window is the start point of the second voice processing time window.
- the processor receives part of the voice command in the first voice processing time window
- the voice processing time window corresponding to the first contact point and the second contact point is the first voice processing time window
- the voice processing time window corresponding to the first contact point and the third contact point is the third voice processing time window.
- the first speech processing time window and the third speech processing time window do not exist at the same time, or the end of the first speech processing time window is the start of the third speech processing time window, or the start of the first speech processing time window is The end of the third speech processing time window.
- the processor may determine the position information of the object to be operated by the voice command on the touch screen in the following manner 1 or manner 2.
- the object to be operated may be an object included in the voice instruction, or an object to be created indicated in the voice instruction.
- Manner 1 The processor determines the area to be operated according to the position information of the first contact and the position information of the second contact, and according to the area to be operated, determines the position information of the voice commanded object to be operated on the touch screen. Among them, the object to be operated is contained in the area to be operated.
- the object to be operated can be completely contained in the area to be operated, or partly contained in the area to be operated.
- the object to be operated is completely contained in the area to be operated as an example for description.
- the object to be operated may be the text object in the voice command "selected text", the text object to be bolded in the voice command "bold text color”, or the voice command "delete”.
- the picture object in “Picture” can also be the table object in the voice command "Create a table with 3 rows and 2 columns”.
- the processor may create a circle based on the position information of the first contact and the position information of the second contact, using the line between the first contact and the second contact as the radius, or using the first contact
- the line between the point and the second contact creates a circle for the diameter, and a circular area to be operated is obtained.
- FIG. 7 shows a circular to-be-operated area 70 obtained by creating a circle with the diameter of the connecting line between the first contact and the second contact.
- the processor may use the connection line between the first contact and the second contact as a diagonal line to obtain a rectangular or regular polygon according to the position information of the first contact and the position information of the second contact. Area to be operated.
- the processor may use the line between the first contact and the second contact as the side length according to the position information of the first contact and the position information of the second contact to obtain a regular polygon or diamond-shaped area to be operated.
- the area to be operated of a regular polygon or a rhombus, the relative position of the connection line with the first contact and the second contact, and the inner angle of the rhombus can be specified by the processor according to a preset rule. The embodiment of the application does not limit this.
- FIG. 8 shows a rectangular area to be operated 80 obtained by taking the line between the first contact and the second contact as a diagonal line.
- the processor may also instruct the display screen to display the area to be operated according to the determined area to be operated.
- the display screen may display the area to be operated in the form of a frame.
- the frame of the area to be operated may be a black dashed frame as shown in Figs. 7 and 8, or a dashed frame of other colors, or a solid black frame or a solid line frame of other colors.
- the embodiment of the application does not limit this.
- the display screen can also display the area to be operated by changing the background color of the area to be operated.
- the display screen can also display the area to be operated in any manner that can distinguish the area to be operated from the background currently displayed on the display screen.
- the embodiment of the application does not limit this.
- Manner 2 The processor determines the position information of the first end point and the position information of the second end point according to the position information of the first contact point and the position information of the second contact point, and determines the position information of the first end point according to the determined position information of the first end point. And the position information of the second endpoint to determine the position information of the object to be operated by the voice command on the touch screen.
- the first end point and the second end point are both included in the object to be operated, or the object to be operated can be constructed through the first end point and the second end point.
- the attributes of the first endpoint and the second endpoint may be the same or different.
- the voice command is "create circle”. If both the first end point and the second end point are points on the circumference of the "circle" of the object to be operated, then both the first end point and the second end point are included in the to-be-operated object "circle".
- the operation object has the same attributes; if the first end point is the center of the "circle” of the object to be operated, and the second end point is a point on the circumference of the "circle” of the object to be operated, the processor can pass the first end and the second end.
- the two end points construct the "circle” of the object to be operated, and for the "circle", the attributes of the first end point and the second end point are different.
- the processor may use the first contact point and the second contact point as the first end point and the second end point, respectively, so as to determine the position information of the first end point and the position information of the second end point.
- the processor uses the first contact as the first end point and the second contact as the second end point.
- the position information of the first end point is the position information of the first contact point
- the position information of the second end point is the position information of the second contact point.
- the processor may also determine the position information of the first end point and the position information of the second end point according to preset rules and the position information of the first contact point and the position information of the second contact point.
- the preset rule may be the sequence of the start touch time of the first touch operation and the second touch operation, or there may be presets between the first contact point and the second contact point, and between the first end point and the second end point. distance.
- the preset rule may also be other rules, which are not limited in the embodiment of the present application.
- the processor uses the first touch point as the first end point and the second touch point as the second end point.
- the position information of the first end point is the position information of the first contact point
- the position information of the second end point is the position information of the second contact point.
- the processor may also use the first touch point as the second end point and the second touch point as the first end point.
- the position information of the first end point is the position information of the second contact point
- the position information of the second end point is the position information of the first contact point.
- S406 The processor executes the voice instruction according to the position information of the object to be operated on the touch screen of the voice instruction.
- the voice instructions received by the processor in the voice processing time window include any one of selecting the object to be operated, deleting the object to be operated, modifying the attribute information of the object to be operated, or creating a new object to be operated.
- the processor executes the voice instruction in the to-be-operated area determined in mode one in S405.
- the processor selects the object to be operated in the area to be operated, and instructs the display screen to display the selected result.
- the selection result is displayed on the display screen.
- the selection result can be displayed by surrounding the frame of the object to be operated, or the selected result can be displayed by changing the background color of the object to be operated, which is not limited in this application.
- the processor selects the banana icon in the area 80 to be operated according to the voice command, and instructs the display screen 50 to display the selection result in a square frame 90.
- the processor deletes the object to be operated in the area to be operated. And instruct the display to show the delete result.
- the processor deletes the apple icon in the area to be operated 80 according to the voice command, and instructs the display screen to display the deletion result.
- the processor creates a new designated object in the area to be operated, and instructs the display screen to display the newly created object.
- the processor will create a table with 3 rows and 2 columns in the area to be operated 80 according to the voice command.
- the row width and row height of the table may be specified by the preset rules of the system, or may be set by the user, which is not specifically limited in the embodiment of the present application.
- the processor can ask the user for the row height and row width through a voice external device such as a speaker, and then the user enters the voice through the microphone to respond to the processing Device. Further, the processor creates a new table according to the user's instruction. Then, the processor instructs the display screen to display the newly created form.
- the processor will modify the attribute information of the object to be operated in the area to be operated, and instruct the display screen to display the object to be operated after the attribute information is modified.
- the processor will thicken the text "1000 grams” in the area to be operated 80 according to the voice command, and instruct the display screen to display bold text.
- the bold text is "bold text”.
- voice commands described above are merely illustrative of the voice commands executed by the human-computer interaction device in the area to be operated, and do not constitute a limitation on the protection scope of the embodiments of the present application.
- the voice instruction received by the processor within the voice processing time window includes creating an object, which can be determined by two points, and the attributes of the object at these two points are the same.
- objects such as straight lines without arrows, straight lines with double arrows, line segments, polylines without arrows, polylines with double arrows, curves, circles, polygons, etc.
- the polyline here can be a right-angle polyline.
- the circle here is a circle created with a known line segment as the diameter.
- the types of polygons can include rhombuses or regular polygons.
- the processor executes the voice command according to the location information of the first endpoint and the location information of the second endpoint.
- the first end point is the first contact and the second end point is the second contact as an example for description.
- the processor connects the first contact A and the second contact B to obtain a straight line, and instructs the display screen to display the straight line.
- the processor uses the first contact A and the second contact B as the two end points to create a double-sided arrow Straight or broken line, and instruct the display to show the straight or broken line with a double-headed arrow.
- the processor uses the first contact A as one end point and the second contact B as the other end point to create a line segment, and instruct the display screen to display the line segment.
- the processor uses the first contact A as one end point and the second contact B as the other end point to create a polyline, and instruct the display screen to display the polyline.
- the specific shape of the broken line may be specified by a preset rule of the processor, which is not limited in the embodiment of the present application.
- the processor uses the first contact A as one end point and the second contact B as the other end point to create a curve, and instruct the display screen to display the curve.
- the curvature of the curve and the curved shape may be specified by a preset rule of the processor, which is not limited in the embodiment of the present application.
- the processor will create a circle with the diameter of the connecting line between the first contact A and the second contact B, and instruct the display screen to display the circle.
- the processor uses the connection between the first contact A and the second contact B as the side length of the rhombus to create a rhombus, and instruct the display screen to display the rhombus.
- the relative position of the connection line between the first contact A and the second contact B to the rhombus, and the number of inner angles of the rhombus may be specified by a preset rule of the processor, which is not limited in the embodiment of the present application.
- the processor creates a rhombus by taking the connection between the first contact A and the second contact B as a diagonal line of the rhombus, and instructs the display screen to display the rhombus. At this time, the length of the other diagonal line of the rhombus may be specified by a preset rule of the processor, which is not limited in the embodiment of the present application.
- the processor creates a diamond 101 with the line AB between contact A and contact B as a diagonal line and the other diagonal line is (AB)/2 according to the received voice command. , And a diamond 101 is displayed on the display screen 100.
- the processor uses the connection line between the first contact A and the second contact B as the diagonal of the regular pentagon to create the regular pentagon , And instruct the display to show the pentagon.
- the processor creates the regular pentagon by taking the line connecting the first contact A and the second contact B as the side length of the regular pentagon, and instructs the display screen to display the pentagon.
- the relative position between the connection line of the first contact A and the second contact B and the regular pentagon may be specified by a preset rule of the processor, which is not limited in the embodiment of the present application.
- voice commands described above are merely illustrative of the voice commands executed by the human-computer interaction device in the area to be operated, and do not constitute a limitation on the protection scope of the embodiments of the present application.
- the voice instruction received by the processor in the voice processing time window is to create an object, and the object can be determined by two points, and the attributes of the object at these two points are different. For example, objects such as rays, lines with one-way arrows, or circles.
- the processor can create the object based on the sequence of the first touch operation and the start touch time of the second touch operation (for example, create a circle or a line with a one-way arrow). ).
- the embodiments of the present application are not limited to this.
- the position of the center of the circle may be determined based on the position of the contact point of the first touch operation, and the position of the point on the circumference of the circle may be determined based on the position of the contact point of the second touch operation.
- the position of the end point pointed by the arrow in the line may be determined based on the position of the contact point of the first touch operation, and the position of the end point not pointed by the arrow in the line may be determined based on the second touch Determined by the position of the operating contact.
- the start touch time of the first touch operation precedes or follows the second touch operation.
- the processor determines the position information of the first endpoint and the position information of the second endpoint according to the sequence of the first touch operation and the start touch time of the second touch operation, and executes the voice command.
- the first touch operation precedes the second touch operation
- the processor uses the first contact as the first endpoint and the second contact as the second endpoint as an example.
- the processor can use the first end point as the end point of the ray, create a ray passing through the second end point, and instruct the display screen to display the ray.
- the end point of the ray may also be the second end point. At this time, the ray also passes through the first end point.
- the processor can use the first end point as the arrow vertex of the line with one-way arrow, and the second end point is the one-way arrow
- the line of is away from the end point on the side of the arrow, creating a line with a one-way arrow, and instructing the display to show the line with a one-way arrow.
- the arrow vertex of the line with a one-way arrow may also be the second end point.
- the first end point is the end point of the line with the one-way arrow on the side away from the arrow.
- the processor can create a circle with the line connecting the first end point and the second end point as the radius, and instruct the display screen to display the circle.
- the processor creates a circle with the first end point as the center of the circle, the second end point is a point on the circle.
- the processor creates a circle with the second end point as the center of the circle, the first end point is a point on the circumference of the circle.
- the voice command received by the processor in the voice monitoring window is "create a circle with a radius".
- the human-computer interaction device determines that the contact point A is the first end point, and that the contact point B is the second end point.
- the processor creates a circle with the line connecting the first end point and the second end point as the radius, and the second end point as the center of the circle, to obtain a circle 111, and instructs the display screen 110 to display the circle 111.
- the first end point is a point on the circumference of the circle 111.
- Figure 12 shows the first contact (marked as contact A) and the second contact (marked as contact B) received by the processor in the previous frame of the current frame location information. Among them, the coordinates of A are (x a , y a ), and the coordinates of B are (x b , y b ). In addition, the processor assigns IDa to contact A and IDb to contact B.
- FIG. 12 shows the contact C received by the processor in the current frame. Among them, the coordinates of C are (x c , y c ).
- the previous frame of the current frame is the previous time of the current time or the frame received by the processor in the previous cycle of the current cycle.
- the current frame is the frame received by the processor at the current moment or current cycle.
- the process of the processor tracking the contacts may include the process of assigning IDs to the contacts in the current frame, and the process of determining the missing contacts in the previous frame of the current frame according to the IDs of the contacts in the current frame.
- FIG. 13 shows a schematic diagram of a process in which the processor assigns IDs to contacts in the current frame.
- the process includes:
- the processor determines the distance D1 between the contact A and the contact C, and determines the distance D2 between the contact B and the contact C according to the position information of the contact A, the contact B, and the contact C.
- the processor determines the distance D1 between the contact A and the contact C according to the position information of the contact A and the contact B in the previous frame, and the position information of the contact C in the current frame, and determines the contact The distance between B and contact C is D2.
- the processor judges whether D1 is less than or equal to a threshold value.
- the threshold value may be preset by the processor, and the size of the threshold value may be set according to actual conditions, which is not specifically limited in the embodiment of the present application.
- touch misoperations such as a palm touching the touch screen, or a finger or stylus sliding during the process of touching the touch screen.
- the distance between the touch point in the current frame and the touch point in the previous frame of the current frame is usually small.
- the contact generated by the misoperation in the current frame and the contact in the previous frame of the current frame whose distance is less than or equal to the threshold value are determined as the same contact coordinate tracking Sequence, the contact coordinate tracking sequence shares one ID and corresponds to one touch operation. In this way, it is helpful to reduce the occurrence of accidental start or end caused by the "voice processing time window determined according to the touch operation" due to misoperation.
- the processor allocates the ID of the contact A to the contact C.
- the processor assigns the ID of the contact A to the contact C, that is, the ID of the contact C is IDa.
- contact A and contact C belong to the same contact coordinate tracking sequence, and the ID of the contact coordinate tracking sequence is IDa.
- the contact C serves as the first contact.
- the processor judges whether D2 is less than or equal to a threshold value.
- the processor assigns the ID of the contact B to the contact C.
- the processor assigns the ID of the contact B to the contact C, that is, the ID of the contact C is IDb.
- contact B and contact C belong to the same contact coordinate tracking sequence, and the ID of the contact coordinate tracking sequence is IDb.
- the contact C serves as the second contact.
- the processor assigns a new ID to the contact C.
- the processor newly assigns an ID to the contact C, such as IDc. At this time, it indicates that the contact C is a new contact currently appearing. In this case, the contact C is neither the first contact nor the second contact.
- the contact C may be a contact generated by a misoperation, or the time when the contact C appears may be the starting point of the next processing time window, and so on.
- the processor determines that there are contact A (ie the first contact) and contact B (the second contact) in the current frame.
- contact A ie the first contact
- contact B the second contact
- the contact C is the contact generated by the user's misoperation.
- the processor determines that there is no contact A (that is, the first contact) or the contact B (the second contact) in the current frame.
- the newly generated contact point C will generate a new overlapping touch duration with the contact point B or the contact point A existing in the current frame, that is, the starting touch time of the contact point C may be the starting point of the next voice processing time window.
- the processor can track the contact points of multiple consecutive frames to obtain the coordinate tracking sequence of multiple contacts.
- One contact coordinate tracking sequence shares one ID, and one contact ID corresponds to one touch action. Therefore, a contact tracking sequence can describe the sliding track of a touch operation on the touch screen.
- the processor determines whether the contact in the previous frame of the current frame is missing according to the ID of the contact in the current frame. Referring to FIG. 14, the process of determining whether the contact in the previous frame of the current frame is missing specifically includes:
- the processor determines the ID of the contact in the current frame.
- the processor may determine the ID of the contact in the current frame through the foregoing S131-S136, which will not be described in detail here.
- the processor determines whether there is an ID value different from the ID value of the contact in the current frame among the ID values of the contacts in the previous frame of the current frame. If it exists, execute S143; if it does not exist, execute S144.
- the processor compares the ID value of the contact in the previous frame with the ID value of the contact in the current frame to determine whether the ID value of the contact in the previous frame is related to the contact in the current frame.
- the ID value of the point is different from the ID value.
- the ID of the contact in the previous frame includes IDa and IDb
- the processor assigns IDa to the contact C in the current frame.
- the processor can determine that the ID value of the contact in the current frame does not include the IDb in the previous frame. That is, the processor determines that among the ID values of the contacts in the previous frame, there is an ID value that is different from the ID values of the contacts in the current frame, that is, IDb.
- the processor determines that the contact corresponding to the ID value is missing.
- the processor determines that among the ID values of the contacts in the previous frame of the current frame, the contacts corresponding to the ID values that are different from the ID values of the contacts in the current frame are missing. At this time, it indicates that the touch duration of the touch operation corresponding to the contact point ends, that is, the voice processing time window corresponding to the touch operation ends.
- the processor continues to execute the monitoring voice command, and continues to track the contact point of the next frame of the current frame.
- the processor continues to execute the monitoring voice command according to the touch operation. And, the processor continues to track the touch point of the next frame of the current frame.
- the human-computer interaction method can also track multiple touch points. If the current frame includes m contacts, in the m contacts, the processor determines that the distance between n contacts and a contact in the previous frame (for example, contact A) is less than the threshold .
- m and n are positive integers greater than 0, and m is greater than or equal to n.
- the processor determines the closest contact point (for example, contact point C) to the contact point A in the previous frame of the current frame among the n contact points in the current frame.
- the processor assigns the ID value of contact A to contact C. At this time, contact A and contact C belong to the same contact coordinate tracking sequence, and the contact coordinate tracking sequence shares one ID. Then, the processor discards the n-1 contacts except for the contact A among the n contacts in the current frame, or invalidates them.
- the processor can determine the start and end of the voice processing time window, and execute the monitoring voice command within the voice processing time window.
- FIG. 15 shows the relationship between touch point tracking and voice processing time window.
- the state where the processor does not execute the monitoring voice command is called the idle state
- the state where the processor executes the monitoring voice command is called the monitoring state.
- the arrows in Figure 15 indicate that the processor tracks the contacts in real time.
- the processor determines that there are at least two contacts (for example, the first contact and the second contact) in the current frame according to the current frame sent by the touch screen.
- the processor assigns ID1 to the first contact and ID2 to the second contact.
- the processor confirms that the first moment is the starting point of the voice processing time window, and starts to execute the monitoring voice command.
- the processor determines that ID1 or ID2 does not exist in the ID value corresponding to the touch point in the current frame according to the current frame sent by the touch screen. If ID1 does not exist, the processor determines that the first contact point corresponding to ID1 is missing. At this time, it indicates that the touch duration of the touch operation corresponding to the first contact point is over. If ID2 does not exist, the processor determines that the second contact point corresponding to ID2 is missing. At this time, it indicates that the duration of the touch operation corresponding to the second contact point is over. Therefore, the processor determines that the second moment is the end of the voice processing time window, and ends the execution of the monitoring voice command.
- the processor returns to the idle state and continues to track whether a new touch point appears, so as to determine the starting point of the next voice processing time window.
- touch misoperations such as a palm touching the touch screen, or a finger or stylus sliding during the process of touching the touch screen.
- the contact tracking method provided by the embodiments of the present application can not only determine the start and end of the voice processing time window, but also avoid the problem of accidental start or end of the voice processing time window caused by touch misoperation.
- the voice command to be executed is determined based on the overlapping touch duration of multi-touch. In this way, there is no need to frequently use wake-up words to wake up the system, which is helpful To improve the user experience in the process of human-computer interaction.
- the position information of the object to be operated for the voice command on the touch screen is determined. In this way, in a large screen scene, the problem of long-distance sliding on the large screen can be alleviated, thereby helping to improve the user's experience in the human-computer interaction process.
- the embodiment of the present application may divide the human-computer interaction system according to the foregoing method example.
- the human-computer interaction system shown in FIG. 1 can be used to execute a human-computer interaction method, for example, to execute the methods shown in FIG. 4, FIG. 13, and FIG. 14.
- the touch screen 13 is configured to receive a first touch operation and a second touch operation, and the touch duration of the first touch operation and the second touch operation overlap.
- the processor 11 is configured to receive a voice command, the voice command being a voice command received within the overlapping touch duration of the first touch operation and the second touch operation.
- the processor 11 is further configured to respond to the first touch operation and the second touch operation to determine the position information of the object to be operated by the voice instruction on the touch screen 13.
- the processor 11 is also configured to execute voice instructions based on the position information of the object to be operated on the touch screen 13.
- the touch screen 13 may be used to perform S401 and S402, and the processor 11 may be used to perform S403, S405, and S406.
- the processor 11 is specifically configured to determine the area to be operated for the voice command; the area to be operated is an area on the touch screen 13, and the object to be operated is contained in the area to be operated.
- the processor 11 is also used to execute voice instructions in the area to be operated. For example, in conjunction with FIG. 4, the processor 11 may be used to execute S405.
- the processor 11 is specifically configured to: when the voice instruction is used to instruct the selected object, select the object in the area to be operated; or, when the voice instruction is used to instruct to modify the attribute information of the object to the target attribute information, change The attribute information of the object in the area to be operated is modified to the target attribute information; or, when the voice instruction is used to instruct to delete the object, delete the object in the area to be operated; or, when the voice instruction is used to instruct to create the object, in Create an object in the area to be operated.
- the processor 11 may be used to perform S406.
- the display screen 132 is used to display the area to be operated.
- the aforementioned voice instruction is used to instruct to create the object to be operated, and the object to be operated includes a first endpoint and a second endpoint.
- the processor 11 is further configured to determine the position information of the first end point in the touch screen 13 based on the position information of the contact point of the first touch operation; and determine the position information of the second end point in the touch screen 13 based on the position information of the contact point of the second touch operation Position information in the touch screen 13.
- the processor 11 may be used to execute S405.
- the object to be operated includes at least one of the following: a straight line, a line segment, a polyline, a curve, or a polygon.
- the processor 11 is specifically configured to execute a voice command based on the position information of the object to be operated on the touch screen 13 and the sequence of the start touch time of the first touch operation and the second touch operation.
- the processor 11 may be used to perform S406.
- the processor 11 is specifically configured to, when a voice command is used to instruct to create a circle, create the circle based on the sequence of the start touch time of the first touch operation and the second touch operation.
- the position of the center of the circle is determined based on the position of the contact point of the first touch operation
- the position of the point on the circumference of the circle is determined based on the position of the contact point of the second touch operation.
- the start touch time of the first touch operation precedes or follows the second touch operation.
- the processor 11 may be used to perform S406.
- the processor 11 is specifically configured to create the one-way arrow based on the sequence of the first touch operation and the start touch time of the second touch operation when the voice instruction is used to instruct the creation of a line with a one-way arrow Of the line.
- the position of the end point pointed by the arrow in the line is determined based on the position of the contact point of the first touch operation
- the position of the end point not pointed by the arrow in the line is determined based on the position of the contact point of the second touch operation.
- the start touch time of the first touch operation precedes or follows the second touch operation.
- the processor 11 may be used to perform S406.
- the voice collector 14 may be used to collect voice instructions.
- the embodiment of the present application may divide the above-mentioned processor or the computer device containing the processor into functional modules according to the above-mentioned method examples.
- each functional module may be divided corresponding to each function, or two or more The functions are integrated in a processing module.
- the above-mentioned integrated modules can be implemented in the form of hardware or software functional modules. It should be noted that the division of modules in the embodiments of the present application is illustrative, and is only a logical function division, and there may be other division methods in actual implementation.
- FIG. 16 shows a schematic structural diagram of a processor or computer device provided by an embodiment of the present application.
- the processor or computer device is used to execute the above-mentioned human-computer interaction method, for example, to execute the methods shown in FIG. 4, FIG. 13, FIG. 14, and FIG. 15.
- the processor or computer device may include an acquiring unit 161, a receiving unit 162, a determining unit 163, and an executing unit 164.
- the acquiring unit 161 is configured to acquire the position information of the first contact point and the position information of the second contact point.
- the first contact point is a contact point based on the first touch operation
- the second contact point is a contact point based on the second touch operation
- the touch durations of the first touch operation and the second touch operation overlap.
- the receiving unit 162 is configured to receive a voice instruction, the voice instruction being a voice instruction received within the overlapped touch duration of the first touch operation and the second touch operation.
- the determining unit 163 is configured to respond to the acquiring operation of the acquiring unit 161 to determine the position information of the object to be operated by the voice instruction.
- the execution unit 164 is configured to execute a voice command based on the position information of the object to be operated determined by the determination unit 163.
- the acquiring unit 161 may be used to perform S403
- the determining unit 163 may be used to perform S405
- the performing unit 164 may be used to perform S406.
- the determining unit 163 is specifically configured to determine the area to be operated for the voice instruction, and the object to be operated is included in the area to be operated.
- the execution unit 164 is specifically configured to execute a voice instruction in the region to be operated determined by the determination unit 163.
- the determining unit 163 may be used to execute S405.
- the execution unit 164 is specifically configured to select the object in the area to be operated when the voice instruction is used to instruct to select the object; or, when the voice instruction is used to instruct to modify the attribute information of the object to the target attribute information, change The attribute information of the object in the area to be operated is modified to the target attribute information; or, when the voice instruction is used to instruct to delete the object, delete the object in the area to be operated; or, when the voice instruction is used to instruct to create the object, in the to-be operated Create objects in the area.
- the execution unit 164 may be used to execute S406.
- the voice command is used to instruct to create an object to be operated, and the object to be operated includes a first endpoint and a second endpoint.
- the determining unit 163 is specifically configured to determine the position information of the first end point based on the position information of the first contact point; and determine the position information of the second end point based on the position information of the second contact point.
- the determining unit 163 may be used to execute S405.
- the execution unit 164 is specifically configured to execute the voice command based on the position information of the object to be operated and the sequence of the start touch time of the first touch operation and the second touch operation.
- the execution unit 164 may be used to execute S406.
- the execution unit 164 is specifically configured to create a circle based on the sequence of the start touch time of the first touch operation and the second touch operation when the voice instruction is used to instruct to create a circle.
- the position of the center of the circle is determined based on the position of the contact point of the first touch operation
- the position of the point on the circumference of the circle is determined based on the position of the contact point of the second touch operation.
- the start touch time of the first touch operation precedes or follows the second touch operation.
- the execution unit 164 may be used to execute S406.
- the execution unit 164 is specifically configured to create a line with a one-way arrow based on the sequence of the first touch operation and the start touch time of the second touch operation when the voice instruction is used to instruct the creation of a line with a one-way arrow. line.
- the position of the end point pointed by the arrow in the line is determined based on the position of the contact point of the first touch operation
- the position of the end point not pointed by the arrow in the line is determined based on the position of the contact point of the second touch operation.
- the start touch time of the first touch operation precedes or follows the second touch operation.
- the execution unit 164 may be used to execute S406.
- the computer equipment provided in the embodiments of the present application includes but is not limited to the above-mentioned units.
- the computer equipment may further include a storage unit 165.
- the storage unit 165 may be used to store the program code of the computer device.
- the functions implemented by the acquiring unit 161, the receiving unit 162, the determining unit 163, and the executing unit 164 in the computer device are the same as those of the processor 11 in FIG.
- the function implemented by the storage unit 165 is the same as that of the memory 12 in FIG. 1.
- the embodiment of the present application also provides a chip system.
- the chip system includes at least one processor 171 and at least one interface circuit 172.
- the processor 171 and the interface circuit 172 may be interconnected by wires.
- the interface circuit 172 may be used to receive signals from other devices (for example, the memory of a human-computer interaction system, or the memory of a computer device).
- the interface circuit 172 may be used to send signals to other devices (for example, the processor 171).
- the interface circuit 172 may read instructions stored in the memory and send the instructions to the processor 171.
- the human-computer interaction system or the computer device can be made to execute the steps in the above-mentioned embodiments.
- the chip system may also include other discrete devices, which are not specifically limited in the embodiment of the present application.
- Another embodiment of the present application further provides a computer-readable storage medium that stores instructions in the computer-readable storage medium.
- the human-computer interaction system or computer device executes The steps performed by the human-computer interaction system or computer equipment in the method flow shown in the foregoing method embodiments.
- the disclosed methods may be implemented as computer program instructions encoded on a computer-readable storage medium in a machine-readable format or encoded on other non-transitory media or articles.
- FIG. 18 schematically shows a conceptual partial view of a computer program product provided by an embodiment of the present application.
- the computer program product includes a computer program for executing a computer process on a computing device.
- the computer program product is provided using the signal bearing medium 180.
- the signal bearing medium 180 may include one or more program instructions, which, when executed by one or more processors, can provide the functions or part of the functions described above with respect to FIG. 4. Therefore, for example, referring to one or more features of S401 to S406 in FIG. 4 may be undertaken by one or more instructions associated with the signal bearing medium 180.
- the program instructions in FIG. 18 also describe example instructions.
- the signal-bearing medium 180 may include a computer-readable medium 181, such as, but not limited to, a hard drive, compact disc (CD), digital video disc (DVD), digital tape, memory, read-only storage memory (read -only memory, ROM) or random access memory (RAM), etc.
- a computer-readable medium 181 such as, but not limited to, a hard drive, compact disc (CD), digital video disc (DVD), digital tape, memory, read-only storage memory (read -only memory, ROM) or random access memory (RAM), etc.
- the signal bearing medium 180 may include a computer recordable medium 182, such as, but not limited to, memory, read/write (R/W) CD, R/W DVD, and so on.
- a computer recordable medium 182 such as, but not limited to, memory, read/write (R/W) CD, R/W DVD, and so on.
- the signal-bearing medium 180 may include a communication medium 183, such as, but not limited to, digital and/or analog communication media (eg, fiber optic cables, waveguides, wired communication links, wireless communication links, etc.).
- a communication medium 183 such as, but not limited to, digital and/or analog communication media (eg, fiber optic cables, waveguides, wired communication links, wireless communication links, etc.).
- the signal bearing medium 180 may be communicated by a wireless communication medium 183 (for example, a wireless communication medium that complies with the IEEE 802.11 standard or other transmission protocols).
- the one or more program instructions may be, for example, computer-executable instructions or logic-implemented instructions.
- the computer device may be configured to respond to the passage of one or more of the computer readable medium 181, the computer recordable medium 182, and/or the communication medium 183
- Each program instruction provides various operations, functions, or actions.
- the above embodiments it may be implemented in whole or in part by software, hardware, firmware, or any combination thereof.
- a software program it can be implemented in the form of a computer program product in whole or in part.
- the computer program product includes one or more computer instructions.
- the computer execution instructions When the computer execution instructions are loaded and executed on the computer, the processes or functions according to the embodiments of the present application are generated in whole or in part.
- the computer can be a general-purpose computer, a special-purpose computer, a computer network, or other programmable devices.
- Computer instructions may be stored in a computer-readable storage medium, or transmitted from one computer-readable storage medium to another computer-readable storage medium.
- computer instructions may be transmitted from a website, computer, server, or data center through a cable (such as Coaxial cable, optical fiber, digital subscriber line (digital subscriber line, DSL) or wireless (such as infrared, wireless, microwave, etc.) transmission to another website site, computer, server or data center.
- the computer-readable storage medium may be any available medium that can be accessed by a computer, or may include one or more data storage devices such as a server or a data center that can be integrated with the medium.
- the usable medium may be a magnetic medium (for example, a floppy disk, a hard disk, and a magnetic tape), an optical medium (for example, a DVD), or a semiconductor medium (for example, a solid state disk (SSD)).
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- General Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Human Computer Interaction (AREA)
- General Physics & Mathematics (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Health & Medical Sciences (AREA)
- Multimedia (AREA)
- General Health & Medical Sciences (AREA)
- Computational Linguistics (AREA)
- Acoustics & Sound (AREA)
- User Interface Of Digital Computer (AREA)
Abstract
一种人机交互方法、装置和系统,涉及人机交互技术领域,有助于实现多模态的人机交互,提高用户体验。该人机交互方法应用于人机交互系统(10),该人机交互系统(10)包括触摸屏(13)。该人机交互方法包括:接收具有重叠的触摸时长的第一触摸操作和第二触摸操作;在接收语音指令,该语音指令是在第一触摸操作和第二触摸操作的重叠的触摸时长内接收到的语音指令;响应第一触摸操作和第二触摸操作,确定该语音指令的待操作对象在触摸屏中的位置信息;基于待操作对象在触摸屏中的位置信息,执行该语音指令。
Description
本申请涉及人机交互技术领域,尤其涉及人机交互方法、装置和系统。
人机交互(human computer interaction,HCI;或者,human machine interaction,HMI)是一门研究系统与用户之间的交互关系的学问。这里的系统可以是各种各样的机器,也可以是计算机化的系统和软件。多模态(modality)人机交互,通俗地讲,是指多种感官融合的人机交互。多模态交互,就是通过文字、语音、视觉、动作、环境等多种方式进行人机交互。如何实现多模态人机交互,从而提高用户体验,是亟待解决的技术问题。
发明内容
本申请提供的人机交互方法、装置和系统,有助于实现多模态的人机交互,提高了用户体验。
为达上述目的,本申请提供如下技术方案:
第一方面,本申请提供了一种人机交互方法,该人机交互方法应用于人机交互系统,该人机交互系统包括触摸屏。该方法包括:接收具有重叠的触摸时长的第一触摸操作和第二触摸操作,并接收语音指令,该语音指令是在第一触摸操作和第二触摸操作的重叠的触摸时长内接收到的语音指令。响应于第一触摸操作和第二触摸操作,确定该语音指令的待操作对象在上述触摸屏中的位置信息。基于确定出的待操作对象在触摸屏中的位置信息,执行该语音指令。本技术方案中,一方面,基于多点触控的重叠触摸时长,确定待执行语音指令(即上述语音指令),这样,相比现有技术中“需要频繁使用唤醒词唤醒人机交互系统,才能基于唤醒后的人机交互系统确定待执行语音指令”的方案,本技术方案无需频繁使用唤醒词来唤醒系统,因此有助于提高用户在人机交互过程中的体验。另一方面,基于多点触控,确定待执行语音指令的待操作对象在触摸屏中的位置信息。这样,在大屏场景中,可以缓解在大屏上长距离滑动的问题,从而有助于提高用户在人机交互过程中的体验。
结合第一方面,在一种可能的设计方式中,上述第一触摸操作的触点与上述第二触摸操作的触点之间的距离大于或等于预设阈值。也就是说,本申请提供的人机交互方法可以应用于大屏场景。通过在大屏场景中使用多个触摸操作来触发监听语音指令,可以有效缓解在大屏上长距离滑动的问题,从而提高了用户在人机交互过程中的体验。
结合第一方面,在另一种可能的设计方式中,上述“确定该语音指令的待操作对象在上述触摸屏中的位置信息”,可以包括:确定该语音指令的在触摸屏中的待操作区域,该待操作区域中包含待操作对象。该情况下,上述“基于确定出的待操作对象在触摸屏中的位置信息,执行该语音指令”可以包括:根据接收到的语音指令的内容在触摸屏中确定出待操作区域,并在该待操作区域中执行语音指令。
结合第一方面,在另一种可能的设计方式中,上述“在该待操作区域中执行语音指令”可以包括:当语音指令用于指示选中对象时,选中待操作区域中的对象;或者,当语音指令用于指示将对象的属性信息修改为目标属性信息时,将待操作区域中的对象 的属性信息修改为目标属性信息;或者,当语音指令用于指示删除对象时,删除待操作区域中的对象;或者,当语音指令用于指示创建对象时,在待操作区域中创建对象。该可能的设计提供了人机交互系统执行该语音指令的几种示例。
结合第一方面,在另一种可能的设计方式中,该人机交互方法还可以包括:人机交互系统还在触摸屏上显示上述确定出的待操作区域。这样,通过在触摸屏上显示待操作区域,使得人机交互系统执行语音指令的过程可视化,提高了用户的体验。
结合第一方面,在另一种可能的设计方式中,若上述语音指令用于指示创建包括第一端点和第二端点的待操作对象。上述“确定该语音指令的待操作对象在上述触摸屏中的位置信息”可以包括:基于第一触摸操作的触点的位置信息,确定第一端点在触摸屏中的位置信息,以及基于第二触摸操作的触点的位置信息,确定第二端点在所述触摸屏中的位置信息。在这种实现方式中,触摸操作的触点的位置信息可以用于确定语音指令中指示创建待操作对象的位置,也就是说,触摸操作的触点的位置信息可以作为语音指令的参数。
结合第一方面,在另一种可能的设计方式中,上述待操作对象包括以下至少一种:直线、线段、折线、曲线或多边形。也就是说,本申请的人机交互系统可以基于该方法创建直线、线段、折线、曲线或多边形等包括第一端点和第二端点的待操作对象。
结合第一方面,在另一种可能的设计方式中,上述“基于确定出的待操作对象在触摸屏中的位置信息,执行语音指令”可以包括:基于待操作对象在触摸屏中的位置信息,以及第一触摸操作与第二触摸操作的开始触摸时间的先后顺序,执行语音指令。在这种实现方式中,第一触摸操作与第二触摸操作的开始触摸时间的先后顺序可以作为语音指令的参数。
结合第一方面,在另一种可能的设计方式中,上述“基于待操作对象在触摸屏中的位置信息,以及第一触摸操作与第二触摸操作的开始触摸时间的先后顺序,执行语音指令可以包括:当语音指令用于指示创建圆时,基于第一触摸操作与第二触摸操作的开始触摸时间的先后顺序,创建该圆。该圆的圆心的位置是基于第一触摸操作的触点的位置确定的,该圆的圆周中的点的位置是基于第二触摸操作的触点的位置确定的。第一触摸操作的开始触摸时间先于或后于第二触摸操作。在这种实现方式中,第一触摸操作与第二触摸操作的开始触摸时间的先后顺序作为创建圆的语音指令的参数。
结合第一方面,在另一种可能的设计方式中,上述“基于待操作对象在触摸屏中的位置信息,以及第一触摸操作与第二触摸操作的开始触摸时间的先后顺序,执行语音指令”可以包括:当语音指令用于指示创建带单向箭头的线时,基于第一触摸操作与第二触摸操作的开始触摸时间的先后顺序,创建该带单向箭头的线。这里,该线中箭头指向的端点的位置是基于第一触摸操作的触点的位置确定的,该线中非箭头指向的端点的位置是基于第二触摸操作的触点的位置确定的。第一触摸操作的开始触摸时间先于或后于第二触摸操作。在这种实现方式中,第一触摸操作与第二触摸操作的开始触摸时间的先后顺序作为创建带单向箭头的线的语音指令的参数。
第二方面,本申请提供了一种人机交互方法,该人机交互方法应用于计算机设备,该人机交互方法包括:获取基于第一触摸操作的第一触点的位置信息,以及基于第二触摸操作的第二触点的位置信息,第一触摸操作和第二触摸操作具有重叠的触摸时长。 接收语音指令,该语音指令是在第一触摸操作和第二触摸操作的重叠的触摸时长内接收到的语音指令。响应获取操作,确定该语音指令的待操作对象的位置信息。其中,待操作对象的位置信息,是待操作对象在人机交互系统中的触摸屏中的位置信息,触摸屏可以集成在计算机设备上,也可以与计算机设备分开设置。基于待操作对象的位置信息,执行该语音指令。
结合第二方面,在一种可能的设计方式中,上述第一触摸操作的触点与上述第二触摸操作的触点之间的距离大于或等于预设阈值。
结合第二方面,在另一种可能的设计方式中,上述“确定该语音指令的待操作对象的位置信息”包括:确定该语音指令的待操作区域,该待操作区域中包含待操作对象。该情况下,上述“基于待操作对象的位置信息,执行gauge语音指令”包括:在待操作区域中执行该语音指令。
结合第二方面,在另一种可能的设计方式中,上述“在待操作区域中执行该语音指令”包括:当语音指令用于指示选中对象时,选中待操作区域中的对象;或者,当语音指令用于指示将对象的属性信息修改为目标属性信息时,将待操作区域中的对象的属性信息修改为目标属性信息;或者,当语音指令用于指示删除对象时,计算机设备删除待操作区域中的对象;或者,当语音指令用于指示创建对象时,在待操作区域中创建对象。
结合第二方面,在另一种可能的设计方式中,若上述语音指令用于指示创建包括第一端点和第二端点的待操作对象。上述“确定该语音指令的待操作对象的位置信息”包括:基于第一触摸操作的触点的位置信息,确定第一端点在触摸屏中的位置信息,以及基于第二触摸操作的触点的位置信息,确定第二端点在所述触摸屏中的位置信息。
结合第二方面,在另一种可能的设计方式中,上述待操作对象包括以下至少一种:直线、线段、折线、曲线或多边形。
结合第二方面,在另一种可能的设计方式中,上述“基于待操作对象的位置信息,执行该语音指令”包括:基于待操作对象在触摸屏中的位置信息,以及第一触摸操作与第二触摸操作的开始触摸时间的先后顺序,执行该语音指令。
结合第二方面,在另一种可能的设计方式中,上述“基于待操作对象在触摸屏中的位置信息,以及第一触摸操作与第二触摸操作的开始触摸时间的先后顺序,执行语音指令”包括:当语音指令用于指示创建圆时,计算机设备基于第一触摸操作与第二触摸操作的开始触摸时间的先后顺序,创建该圆。该圆的圆心的位置是基于第一触摸操作的触点的位置确定的,该圆的圆周中的点的位置是基于第二触摸操作的触点的位置确定的。第一触摸操作的开始触摸时间先于或后于第二触摸操作。
结合第二方面,在另一种可能的设计方式中,上述“基于待操作对象在触摸屏中的位置信息,以及第一触摸操作与第二触摸操作的开始触摸时间的先后顺序,执行语音指令”包括:当语音指令用于指示创建带单向箭头的线时,基于第一触摸操作与第二触摸操作的开始触摸时间的先后顺序,创建该带单向箭头的线。这里,该线中箭头指向的端点的位置是基于第一触摸操作的触点的位置确定的,该线中非箭头指向的端点的位置是基于第二触摸操作的触点的位置确定的。第一触摸操作的开始触摸时间先于或后于第二触摸操作。
第二方面及其任一种可能的设计提供的技术方案的相关内容的解释和有益效果的描述均可以参考上述第一方面或其相应的可能的设计提供的技术方案,此处不再赘述。
第三方面,本申请提供一种人机交互的方法,该人机交互方法应用于人机交互系统,该人机交互系统包括触摸屏。该人机交互方法包括:接收具有重叠的触摸时长的第一触摸操作和第二触摸操作,并接收语音指令,该语音指令是在第一触摸操作和第二触摸操作的重叠的触摸时长内接收到的语音指令。响应于第一触摸操作述第二触摸操作,执行该语音指令。本技术方案中,基于多点触控的重叠触摸时长,确定待执行语音指令(即上述语音指令),这样,相比现有技术中“需要频繁使用唤醒词唤醒人机交互系统,才能基于唤醒后的人机交互系统确定待执行语音指令”的方案,本技术方案无需频繁使用唤醒词来唤醒系统,因此有助于提高用户在人机交互过程中的体验。并且,相比现有技术中基于单点触控和预设时间确定待执行语音指令(如将从单点触控的触摸开始时刻开始的预设时间段内接收到的语音指令作为待执行语音指令),本技术方案,有助于灵活控制接收待执行语音指令的时间,从而提高用户体验。
结合第三方面,在一种可能的设计方式中,上述第一触摸操作的触点与上述第二触摸操作的触点之间的距离大于或等于预设阈值。
第四方面,本申请提供一种人机交互的方法,该人机交互方法应用于人机交互系统,该人机交互系统包括触摸屏。该人机交互方法包括:接收第一触摸操作和第二触摸操作,以及接收语音指令。其中,该语音指令是基于第一触摸操作的开始触摸时长或第二触摸操作的开始触摸时长确定的。响应于该第一触摸操作和第二触摸操作,人确定该语音指令的待操作对象在触摸屏中的位置信息。基于待操作对象在触摸屏中的位置信息,执行上述语音指令。本技术方案中,基于多点触控,确定待执行语音指令的待操作对象在触摸屏中的位置信息。这样,在大屏场景中,可以缓解在大屏上长距离滑动的问题,从而有助于提高用户在人机交互过程中的体验。
结合第四方面,在一种可能的设计方式中,语音指令是从目标触摸操作的开始触摸时间开始的预设时间段内接收到的语音指令。在一种实现方式中,目标触摸操作是第一触摸操作和第二触摸操作中开始触摸时间在前的触摸操作。在另一种实现方式中,目标触摸操作是第一触摸操作和第二触摸操作中开始触摸时间在后的触摸操作。也就是说,语音指令的语音处理时间窗可以是基于单点触控操作确定的。
结合第四方面,在另一种可能的设计方式中,上述第一触摸操作的触点与上述第二触摸操作的触点之间的距离大于或等于预设阈值。
结合第四方面,在另一种可能的设计方式中,上述“确定该语音指令的待操作对象在触摸屏中的位置信息”可以包括:确定该语音指令的在触摸屏中的待操作区域,该待操作区域中包含待操作对象。该情况下,上述“基于待操作对象在触摸屏中的位置信息,执行上述语音指令”可以包括:根据该语音指令的内容在触摸屏中确定出待操作区域,并在该待操作区域中执行语音指令。
第四方面的其他可能的设计提供的技术方案,以及所提供的技术方案的相关内容的解释和有益效果的描述均可以参考上述第一方面或其相应的可能的设计提供的技术方案,此处不再赘述。
第五方面,本申请提供一种人机交互系统,该人机交互系统可以用于执行上述第 一方面、第三方面或第四方面提供的任一种方法。该人机交互系统可以包括触摸屏和处理器。
在一种可能的设计方式中,上述触摸屏,用于接收第一触摸操作和第二触摸操作,该第一触摸操作和该第二触摸操作的触摸时长有重叠。上述处理器,用于接收语音指令,该语音指令是在第一触摸操作和第二触摸操作的重叠的触摸时长内接收到的语音指令。上述处理器,还用于响应第一触摸操作和第二触摸操作,确定语音指令的待操作对象在触摸屏中的位置信息。上述处理器,还用于基于待操作对象在触摸屏中的位置信息,执行语音指令。
在一另种可能的设计方式中,上述处理器,用于接收具有重叠的触摸时长的第一触摸操作和第二触摸操作,并接收语音指令,该语音指令是在第一触摸操作和第二触摸操作的重叠的触摸时长内接收到的语音指令。上述处理器,还用于响应于第一触摸操作述第二触摸操作,执行该语音指令。
在一另种可能的设计方式中,上述处理器,用于从目标触摸操作的开始触摸时间开始的预设时间段内接收到的语音指令。其中,在一种实现方式中,目标触摸操作是第一触摸操作和第二触摸操作中开始触摸时间在前的触摸操作。在另一种实现方式中,目标触摸操作是第一触摸操作和第二触摸操作中开始触摸时间在后的触摸操作。也就是说,语音指令的语音处理时间窗可以是基于单点触控操作确定的。
在一另种可能的设计方式中,该人机交互系统还包括语音采集器。
在一种实现方式中,上述语音采集器用于实时的采集语音指令,并向处理器发送采集到的语音指令。在另一种实现方式中,上述语音采集器通过与上述处理器之间进行信息交互,以确定采集和发送语音指令。例如,上述语音采集器,用于在上述第一触摸操作和上述第二触摸操作的重叠的触摸时长的开始时刻发送第一指令,第一指令用于指示语音采集器开始采集语音指令。上述处理器,用于在上述第一触摸操作和上述第二触摸操作的重叠触摸时长的结束时刻,向语音采集器发送第二指令,第二指令用于指示语音采集器结束采集语音指令。
第五方面中,处理器和触摸屏执行的其他可能的技术方案和有益效果的描述均可以参考上述第一方面或其相应的可能的设计提供的技术方案,此处不再赘述。
第六方面,本申请提供了一种计算机设备,该计算机设备可以用于执行上述第二方面提供的任一种方法,该情况下,该计算机设备具体可以是处理器或包含处理器的设备。
在一种可能的设计中,可以根据上述第二方面提供的任一种方法,对该装置进行功能模块的划分。例如,可以对应各个功能划分各个功能模块,也可以将两个或两个以上的功能集成在一个处理模块中。
在另一种可能的设计中,该计算机设备包括:存储器和一个或多个处理器;存储器和处理器耦合。上述存储器用于存储计算机程序代码,该计算机程序代码包括计算机指令,当该计算机指令被计算机设备执行时,使得计算机设备执行如第二方面及其任一种可能的设计方式所述的人机交互方法。
第七方面,本申请提供一种芯片系统,该芯片系统应用于人机交互系统。该芯片系统包括一个或多个接口电路和一个或多个处理器。
上述接口电路和上述处理器通过线路互联;上述接口电路用于从人机交互系统的存储器接收信号,并向上述处理器发送该信号,该信号包括存储器中存储的计算机指令;当上述处理器执行该计算机指令时,人机交互系统执行如第一方面或第三方面或第四方面提供的提供的任一种可能的设计方式所述的人机交互方法。
第八方面,本申请提供一种芯片系统,该芯片系统应用于计算机设备。该芯片系统包括一个或多个接口电路和一个或多个处理器;
上述接口电路和上述处理器通过线路互联;上述接口电路用于从计算机设备的存储器接收信号,并向上述处理器发送该信号,该信号包括存储器中存储的计算机指令;当上述处理器执行该计算机指令时,计算机设备执行如第二方面及其任一种可能的设计方式所述的人机交互方法。
第九方面,本申请提供一种计算机可读存储介质,该计算机可读存储介质包括计算机指令,当该计算机指令在人机交互系统上运行时,使得人机交互系统实现如第一方面或第三方面或第四方面提供的任一种可能的设计方式所述的人机交互方法。
第十方面,本申请提供一种计算机可读存储介质,该计算机可读存储介质包括计算机指令,当该计算机指令在计算机设备上运行时,使得计算机设备实现如第二方面及其任一种可能的设计方式所述的人机交互方法。
第十一方面,本申请提供一种计算机程序产品,当该计算机程序产品在人机交互系统上运行时,使得人机交互系统实现如第一方面或第三方面或第四方面提供的任一种可能的设计方式所述的人机交互方法。
第十二方面,本申请提供一种计算机程序产品,当该计算机程序产品在计算机设备上运行时,使得计算机设备实现如第二方面及其任一种可能的设计方式所述的人机交互方法。
本申请中第二方面到第十二方面及其各种实现方式的具体描述,可以参考第一方面及其各种实现方式中的详细描述;并且,第二方面到第十二方面及其各种实现方式的有益效果,可以参考第一方面及其各种实现方式中的有益效果分析,此处不再赘述。
在本申请中,上述人机交互系统的名字对设备或功能模块本身不构成限定,在实际实现中,这些设备或功能模块可以以其他名称出现。只要各个设备或功能模块的功能和本申请类似,属于本申请权利要求及其等同技术的范围之内。
本申请的这些方面或其他方面在以下的描述中会更加简明易懂。
图1为本申请实施例提供的人机交互系统的一种硬件结构图;
图2为本申请实施例提供的人机交互系统的一种结构示意图一;
图3为本申请实施例提供的人机交互系统的一种结构示意图二;
图4为本申请实施例提供的人机交互方法的一种流程示意图一;
图5为本申请实施例提供的人机交互方法的一种示意图一;
图6为本申请实施例中查找人机交互方法的一种示意图二;
图7为本申请实施例提供的人机交互方法的一种示意图三;
图8为本申请实施例提供的人机交互方法的一种示意图四;
图9为本申请实施例提供的人机交互方法的一种示意图五;
图10为本申请实施例提供的人机交互方法的一种示意图六;
图11为本申请实施例提供的人机交互方法的一种示意图七;
图12为本申请实施例提供的人机交互方法的一种示意图八;
图13为本申请实施例提供的人机交互方法的一种流程示意图二;
图14为本申请实施例提供的人机交互方法的一种流程示意图三;
图15为本申请实施例提供的人机交互方法的一种流程示意图四;
图16为本申请实施例提供的计算机设备的结构示意图;
图17为本申请实施例提供的一种芯片系统的结构示意图;
图18为本申请实施例提供的计算机程序产品的结构示意图。
在本申请实施例中,“示例性的”或者“例如”等词用于表示作例子、例证或说明。本申请实施例中被描述为“示例性的”或者“例如”的任何实施例或设计方案不应被解释为比其它实施例或设计方案更优选或更具优势。确切而言,使用“示例性的”或者“例如”等词旨在以具体方式呈现相关概念。
在本申请的实施例中,术语“第一”、“第二”仅用于描述目的,而不能理解为指示或暗示相对重要性或者隐含指明所指示的技术特征的数量。由此,限定有“第一”、“第二”的特征可以明示或者隐含地包括一个或者更多个该特征。在本申请实施例的描述中,除非另有说明,“多个”的含义是两个或两个以上。
本申请实施例提供一种人机交互方法、装置和系统,人机交互系统接收的第一触摸操作的触摸时长和第二触摸操作的触摸时长有重叠。在重叠的触摸时长内,人机交互系统执行监听语音指令。然后,人机交互系统根据语音指令的指示,以及第一触摸操作和第二触摸操作的位置信息,执行语音指令。通过本申请实施例提供的人机交互方法,利用触摸操作和语音监听两种模态进行人机交互,提高了用户在人机交互过程中的体验。
上述人机交互方法可以通过安装在设备上的应用程序实现,例如人机交互应用程序。
上述应用程序可以是安装在设备中的嵌入式应用程序(即设备的系统应用),也可以是可下载应用程序。其中,嵌入式应用程序是作为设备(如手机)实现的一部分提供的应用程序。可下载应用程序是一个可以提供自己的因特网协议多媒体子系统(internet protocol multimedia subsystem,IMS)连接的应用程序,该可下载应用程序是可以预先安装在设备中的应用或可以由用户下载并安装在设备中的第三方应用。
参考图1,图1示出了本申请实施例提供的人机交互系统的硬件结构。如图1所示,人机交互系统10包括处理器11、存储器12、触摸屏13以及语音采集器14。处理器11、存储器12、触摸屏13以及语音采集器14可以集成在一个设备上,也可以分别集成在不同的设备上。
下面对人机交互系统10的硬件结构的描述以处理器11、存储器12、触摸屏13以及语音采集器14集成在一个设备上为例进行说明。当处理器11、存储器12、触摸 屏13以及语音采集器14集成在一个设备上时,人机交互系统10还包括总线15。处理器11、存储器12、触摸屏13以及语音采集器14通过总线15连接。
处理器11是人机交互系统10的控制中心,可以是一个通用中央处理单元(central processing unit,CPU),也可以是其他通用处理器等。其中,通用处理器可以是微处理器或者是任何常规的处理器等。
作为一个示例,处理器11可以包括一个或多个CPU,例如图1中所示的CPU 0和CPU 1。
存储器12,可以是只读存储器(read-only memory,ROM)或可存储静态信息和指令的其他类型的静态存储设备,随机存取存储器(random access memory,RAM)或者可存储信息和指令的其他类型的动态存储设备,也可以是电可擦可编程只读存储器(electrically erasable programmable read-only memory,EEPROM)、磁盘存储介质或者其他磁存储设备、或者能够用于携带或存储具有指令或数据结构形式的期望的程序代码并能够由计算机存取的任何其他介质,但不限于此。
一种可能的实现方式中,存储器12可以独立于处理器11。存储器12可以通过总线15与处理器11相连接,用于存储数据、指令或者程序代码。处理器11调用并执行存储器12中存储的指令或程序代码时,能够实现本申请实施例提供的人机交互方法。
另一种可能的实现方式中,存储器12也可以和处理器11集成在一起。
触摸屏13,具体可以包括触控板131和显示屏132。
其中,触控板131可采集用户在其上或附近的触摸事件(比如用户使用手指、触控笔等任何适合的物体在触控板上或在触控板附近的操作),并将采集到的触摸信息发送给其他器件(例如处理器11)。其中,用户在触控板附近的触摸事件可以称之为悬浮触控;悬浮触控可以是指,用户无需为了选择、移动或拖动目标(例如图标等)而直接接触触控板,而只需用户位于设备附近以便执行所想要的功能。此外,可以采用电阻式、电容式、红外线以及表面声波等多种类型来实现触控板。
显示屏132可用于显示由用户输入的信息或提供给用户的信息。可以采用液晶显示屏、有机发光二极管等形式来配置显示屏132。触控板131可以覆盖在显示屏132之上,当触控板131检测到在其上或附近的触摸事件后,传送给处理器11以确定触摸事件的类型,处理器11可以根据触摸事件的类型在显示屏132上提供相应的视觉输出。
语音采集器14,用于接收语音信号,并将语音信号转换为电信号后发送给其他器件(例如处理器11)处理。语音采集器可以是麦克风,也称“话筒”,或“传声器”等。
总线15,可以是工业标准体系结构(industry standard architecture,ISA)总线、外部设备互连(peripheral component interconnect,PCI)总线或扩展工业标准体系结构(extended industry standard architecture,EISA)总线等。该总线可以分为地址总线、数据总线、控制总线等。为便于表示,图1中仅用一条粗线表示,但并不表示仅有一根总线或一种类型的总线。
需要指出的是,图1中示出的结构并不构成对该人机交互系统10的限定,除图1所示部件之外,该人机交互系统10可以包括比图示更多或更少的部件,或者组合某些部件,或者不同的部件布置。
当上述处理器11、存储器12、触摸屏13以及语音采集器14集成在一个设备上时, 人机交互系统10可以是电子白板、智能手机、带触摸屏的笔记本电脑、带触摸屏的计算机、平板、上网本、车载等设备。示例性的,如图2所示,电子白板20中集成了处理器11、存储器12、触摸屏13以及语音采集器14。这种情况下,上述人机交互应用程序可以在电子白板20内运行。可选的,该人机交互系统10还可以包括触摸笔21,触摸笔21用于在电子白板20的触摸屏13上输入触摸操作。
当上述处理器11、存储器12、触摸屏13以及语音采集器14分别集成在不同的设备上时,上述人机交互系统10可以包括多个设备,以执行本申请实施例提供的人机交互方法。示例性的,如图3所示的人机交互系统10可以包括:电子白板20、计算机32和投影机33。可选的,人机交互系统10还可以包括触摸笔21,触摸笔21用于在电子白板20的触摸屏13上输入触摸操作。其中,处理器11可以是计算机32的处理器。存储器12可以是计算机32的存储器,处理器11和存储器12通过总线15连接。这时,上述人机交互应用程序可以在计算机32内运行。另外,触摸屏13可以是电子白板20的触摸屏。语音采集器14可以集成在电子白板20中。或者,语音采集器14也可以集成在计算机32、投影机33或者触摸笔21中,本申请实施对此不作限定。
可以理解的是,上述对本申请实施例提供的人机交互系统10的描述仅为示例性说明,并不构成对本实施例的限定。可以理解的是,任何能够实现本申请实施例提供的人机交互方法的设备,或设备的组合,都在本申请实施例的保护范围之内。
下面结合附图对本申请实施例提供的人机交互方法进行描述。
请参考图4,图4示出了本申请实施例提供的人机交互方法的流程示意图,该人机交互方法包括以下步骤:
S401、触摸屏接收第一触摸操作和第二触摸操作,并确定第一触点的位置信息和第二触点的位置信息。
第一触摸操作和第二触摸操作,可以是用户通过手指或触摸笔在触摸屏上触摸的操作。第一触摸操作和第二触摸操作分别具有一定的触摸时长。触摸屏可以同时接收第一触摸操作和第二触摸操作,也可以在不同时刻接收第一触摸操作和第二触摸操作。
第一触点是第一触摸操作与触摸屏的接触点。第二触点是第二触摸操作与触摸屏的接触点。触点的位置信息,用于表征该触点在触摸屏中的位置。本申请实施例对触点的位置信息的具体内容不进行限定,例如,触点的位置信息可以是该触点在触摸屏中的坐标。示例性的,如图5所示,图5示出了触摸屏50接收到第一触摸操作后,所确定的第一触点A的坐标(x
1,y
1),以及接收到第二触摸操作后,所确定的第二触点B的坐标(x
2,y
2)。
可选的,第一触点和第二触点之间的距离可以大于或等于预设阈值。也就是说,本申请实施例支持两个触点之间的距离比较大的场景(如大屏场景),这样,有助于解决大屏场景中单手操作困难度高的问题。预设的阈值可以根据实际情况进行设置,本申请实施例对此不作具体限定。可选的,触摸屏的尺寸(即触摸屏的长边尺寸和/或短边尺寸)大于一个阈值,也就是说,本申请实施例支持大屏场景。
可选的,触摸屏还可以在接收第一触摸操作后,确定第一触点组的位置信息。同样的,触摸屏还可以在接收第二触摸操作后,确定第二触点组的位置信息。这里,一个触点组中的每两个触点之间的距离小于或等于预设距离。通过用触点组表示触摸操 作对应的触点位置,能够有效的避免用户在进行触摸操作时发生误操作,例如手掌碰触触摸屏。其中,预设距离可以由人机交互系统根据实际情况设置,本申请实施例对此不作具体限定。
本申请实施例下面的描述,以触摸屏根据第一触摸操作确定第一触点的位置信息,以及根据第二触摸操作确定第二触点的位置信息为例进行说明。
S402、触摸屏向处理器发送第一触点的位置信息和第二触点的位置信息。
具体的,触摸屏可以周期性或触发性或实时性地向处理器发送所确定的触点的位置信息。例如,触摸屏可以在每个周期向处理器发送一帧数据。该帧数据包括该周期内确定的触摸操作的触点的位置信息,如包括该周期内第一触点和第二触点的位置信息。
S403、处理器响应于第一触摸操作和第二触摸操作,确定语音处理时间窗。
具体的,处理器可以根据任意一种方式确定语音处理时间窗:
方式一、处理器可以根据第一触摸操作和第二触摸操作的重叠的触摸时长,以及预设规则确定语音处理时间窗。该预设规则可以是指示该重叠的触摸时长为语音处理时间窗,或者,该预设规则可以是指示在该重叠的触摸时长的开始时刻起的预设时长之后,到该重叠的触摸时长的结束时刻为语音处理时间窗。
也就是说,语音处理时间窗包含于该重叠的触摸时长,且该重叠的触摸时长的结束时刻为处理器结束语音处理时间窗的时刻。即语音处理时间窗可以与该重叠的触摸时长,或者语音处理时间窗的时长小于该重叠的触摸时长。
示例性的,参考图6,图6示出了第一触摸操作的触摸时长、第二触摸操作的触摸时长、以及语音处理时间窗之间的关系示意图。如图6所示,第一触点所对应的第一触摸操作的触摸时长为L
1,第二触点所对应的第二触摸操作的触摸时长为L
2,语音处理时间窗的时长为第一触摸操作和第二触摸操作重叠的触摸时长L。
方式二、处理器可以在第一触摸操作和第二触摸操作的重叠触摸时长的开始时刻,向语音采集器发送第一指令,第一指令用于指示语音采集器开始采集语音指令。并且,处理器在第一触摸操作和第二触摸操作的重叠触摸时长的结束时刻,向语音采集器发送第二指令,第二指令用于指示语音采集器结束采集语音指令。该情况下,作为示例,针对第一触摸操作和第二触摸操作来说,处理器可以将向语音采集器发送第一指令的时刻,到向语音采集器发送第二指令的时刻,确定为语音处理时间窗。
可选的,在方式二中,当第一触摸操作和第二触摸操作的重叠触摸时长的结束时刻,与第一触摸操作和第三触摸操作的重叠触摸时长的开始时刻相同时,在该时刻,处理器可以不向语音采集器发送第一指令和第二指令。
可选的,处理器可以通过对第一触点进行跟踪,确定第一触摸操作的触摸时长。处理器可以通过对第二触点进行跟踪,确定第二触摸操作的触摸时长。触点的跟踪过程具体可以参考下文S131-S136、以及S141-S144描述。
S404、语音采集器采集语音指令,并向处理器发送该语音指令。
在一种实现方式中,语音采集器实时采集语音指令,并将该语音指令发送至处理器。该语音采集方式与上述S403中的方式一对应。
在另一种实现方式中,语音采集器根据从处理器接收到的第一指令,进行语音指 令采集,并将采集到的语音指令发送至处理器。语音采集器还根据从处理器接收到的第二指令,停止语音指令的采集。该语音采集方式与上述S403中的方式二对应。
语音指令可以是用户发出的或者是某一设备发出的,并由语音采集器采集到的语音指令。
语音指令的内容可以包括选中对象、修改对象、删除对象、或者创建对象等。
示例性的,上述选中对象的语音指令可以包括“选中文本”,上述修改对象的语音指令可以包括“加粗文本颜色”,上述删除对象的语音指令可以包括“删除图片”,上述创建对象的语音指令可以包括“创建3行2列的表格”、或者“创建圆”等。
可以理解的是,语音采集器实时采集语音指令,并向处理器发送该语音指令的过程,和上述S401-S403的过程是独立并行的两个过程,二者的执行没有时序限制。
S405、处理器响应于第一触摸操作和第二触摸操作,确定语音处理时间窗内接收到的语音指令的待操作对象在触摸屏中的位置信息。
在语音处理时间窗内,处理器可以接收到一个或多个语音指令。
可选的,如果处理器接收到的某一语音指令不在语音处理时间窗内,则处理器可以丢弃该语音指令,或者将该语音指令置为无效。
可以理解的是,在语音处理时间窗内,处理器可以接收一个或多个语音指令,并执行该一个或多个语音指令。在一种实现方式中,一个语音指令无法跨越多个语音处理时间窗。
例如,第一语音处理时间窗的终点为第二语音处理时间窗的起点,当处理器在第一语音处理时间窗内接收到语音指令的一部分内容,在第二语音处理时间窗内接收到该语音指令的另一部分内容,这种情况下,处理器不执行该语音指令。
再例如,第一触点与第二触点对应的语音处理时间窗为第一语音处理时间窗,第一触点与第三触点对应的语音处理时间窗为第三语音处理时间窗。这里,第一语音处理时间窗和第三语音处理时间窗不同时存在,或者,第一语音处理时间窗的终点为第三语音处理时间窗的起点,或者,第一语音处理时间窗的起点为第三语音处理时间窗的终点。当处理器在第一语音处理时间窗内接收到语音指令的一部分内容,在第三语音处理时间窗内接收到该语音指令的另一部分内容,这种情况下,处理器不执行该语音指令。
具体的,处理器可以通过下述的方式一或方式二确定语音指令的待操作对象在触摸屏中的位置信息。这里,待操作对象可以是语音指令中包含的对象,也可是语音指令中指示的待创建的对象。
方式一、处理器根据第一触点的位置信息和第二触点的位置信息,确定待操作区域,并根据该待操作区域,确定语音指令的待操作对象在触摸屏中的位置信息。其中,待操作对象包含于待操作区域。
其中,待操作对象可以完全包含在待操作区域中,也可以部分包含在待操作区域中。本申请实施例以待操作对象完全包含于待操作区域中为例进行说明。
示例性的,待操作对象可以为上述语音指令“选中文本”中的文本对象,也可以为上述语音指令“加粗文本颜色”中的待加粗的文本对象,也可以为上述语音指令“删除图片”中的图片对象,还可以为上述语音指令“创建3行2列的表格”中的表格对象。
可选的,处理器可以根据第一触点的位置信息和第二触点的位置信息,以第一触点和第二触点之间的连线为半径创建圆,或者,以第一触点和第二触点之间的连线为直径创建圆,得到圆形的待操作区域。
示例性的,结合图5,参考图7,图7示出了以第一触点和第二触点之间的连线为直径创建圆,得到的圆形的待操作区域70。
可选的,处理器可以根据第一触点的位置信息和第二触点的位置信息,以第一触点和第二触点之间的连线为对角线,得到矩形或正多边形的待操作区域。或者,处理器可以根据第一触点的位置信息和第二触点的位置信息,以第一触点和第二触点之间的连线为边长,得到正多边形或菱形的待操作区域。这里,正多边形或菱形的待操作区域,和第一触点及第二触点连线的相对位置,以及菱形的内角角度,均可以由处理器根据预设规则指定。本申请实施例对此不作限定。
示例性的,结合图5,参考图8,图8示出了以第一触点和第二触点之间的连线为对角线,得到的矩形的待操作区域80。
可以理解的是,上述根据第一触点的位置信息和第二触点的位置信息,确定待操作区域的方式仅为示例性说明,并不构成对本申请实施例保护范围的限定。
可选的,处理器还可以根据确定出的待操作区域,指示显示屏显示该待操作区域。
可选的,显示屏可以以边框的形式显示待操作区域。待操作区域的边框可以是如图7、图8所示的黑色虚线框,也可以是其他颜色的虚线框,还可以是黑色实线框或其他颜色的实线框。本申请实施例对此不作限定。
可选的,显示屏也可以通过改变待操作区域的背景颜色来显示待操作区域。
当然,显示屏还可以通过任意能够将待操作区域与显示屏上当前显示的背景区分开的方式,显示待操作区域。本申请实施例对此不作限定。
方式二、处理器根据第一触点的位置信息和第二触点的位置信息,确定第一端点的位置信息和第二端点的位置信息,并根据确定出的第一端点的位置信息和第二端点的位置信息,确定语音指令的待操作对象在触摸屏中的位置信息。
其中,第一端点和第二端点均包含于待操作对象,或者,通过第一端点和第二端点,可以构建待操作对象。对于待操作对象而言,第一端点和第二端点的属性可以相同,也可以不同。示例性的,语音指令为“创建圆”,若第一端点和第二端点均为待操作对象“圆”的圆周上的点,这时,第一端点和第二端点均包含于待操作对象,且属性相同;若第一端点为待操作对象“圆”的圆心,第二端点为待操作对象“圆”的圆周上一点,这时,处理器可以通过第一端点和第二端点构建待操作对象“圆”,且对于该“圆”而言,第一端点和第二端点属性不同。
具体的,处理器可以将第一触点和第二触点,分别作为第一端点和第二端点,从而确定出第一端点的位置信息和第二端点的位置信息。例如,处理器将第一触点作为第一端点,将第二触点作为第二端点。这时,第一端点的位置信息为第一触点的位置信息,第二端点的位置信息为第二触点的位置信息。
处理器也可以根据预设规则,以及第一触点的位置信息和第二触点的位置信息,确定第一端点的位置信息和第二端点的位置信息。该预设规则可以是第一触摸操作和第二触摸操作的开始触摸时间的先后顺序,也可以是第一触点和第二触点、以及第一 端点和第二端点之间存在预设距离。当然,该预设规则还可以是其他规则,本申请实施例对此不作限定。
示例性的,若第一触摸操作先于第二触摸操作,则处理器将第一触点作为第一端点,将第二触点作为第二端点。这时,第一端点的位置信息为第一触点的位置信息,第二端点的位置信息为第二触点的位置信息。当然,若第一触摸操作先于第二触摸操作,处理器也可以将第一触点作为第二端点,将第二触点作为第一端点。这时,第一端点的位置信息为第二触点的位置信息,第二端点的位置信息为第一触点的位置信息。
S406、处理器根据语音指令的待操作对象在触摸屏中的位置信息执行该语音指令。
以下对处理器执行不同语音指令的场景进行描述。
场景一
处理器在语音处理时间窗内接收到的语音指令包括:选中待操作对象、删除待操作对象、修改待操作对象的属性信息或新建待操作对象中的任意一种。
处理器根据上述语音指令,处理器在S405中方式一确定的待操作区域中,执行语音指令。
具体的,下面分别对处理器执行上述语音指令的过程进行描述。
1)、若上述语音指令用于选中待操作对象时,则处理器在待操作区域中选中该待操作对象,并指示显示屏显示选中结果。
可选的,在显示屏上显示选中结果,可以通过包围待操作对象的边框来显示选中结果,也可以通过改变待操作对象的背景颜色来显示中选结果,本申请对此不作限定。
示例性的,结合图8,参考图9。若上述语音指令为“选中香蕉图标”,则处理器根据该语音指令,选中待操作区域80中的香蕉图标,并指示显示屏50以方形边框90显示选中结果。
2)、若上述语音指令用于指示删除待操作对象时,则处理器在待操作区域中删除该待操作对象。并指示显示屏显示删除结果。
示例性的,结合图9,若上述语音指令为“删除苹果图标”,则处理器根据该语音指令,删除待操作区域80中的苹果图标,并指示显示屏显示删除结果。
3)、若上述语音指令用于指示新建待操作对象时,则处理器在待操作区域中新建指定的对象,并指示显示屏显示新建的对象。
示例性的,结合图9,若上述语音指令为“新建一个3行2列的表格”,则处理器根据该语音指令,在待操作区域80中新建一个3行2列的表格。其中,表格的行宽和行高可以是系统的预设规则指定的,也可以是由用户设置的,本申请实施例对此不作具体限定。这里,若新建表格的行高和行宽是用户设置的,具体的,处理器可以通过扬声器等语音外放设备询问用户需要的行高和行宽,然后,用户通过麦克风输入语音,以应答处理器。进一步的,处理器根据用户的指示,新建表格。然后,处理器指示显示屏显示新建的表格。
4)、若上述语音指令用于指示修改待操作对象的属性信息时,则处理器在待操作区域中修改该待操作对象的属性信息,以及指示显示屏显示属性信息修改后的待操作对象。
示例性的,结合图9,若上述语音指令为“加粗文本”,则处理器根据该语音指令, 将待操作区域80中的文本“1000克”进行加粗处理,并指示显示屏显示加粗后的文本。
当然,上面描述的语音指令,仅为示例性说明人机交互装置在待操作区域中所执行的语音指令,并不构成对本申请实施例保护范围的限定。
场景二
处理器在语音处理时间窗内接收到的语音指令包括创建对象,该对象可以通过两点确定,并且,该对象在这两点上的属性是相同的。例如不带箭头的直线、带双向箭头的直线、线段、不带箭头的折线、带双向箭头的折线、曲线、圆形、多边形等对象。这里的折线可以是直角折线。这里的圆形为将已知线段作为直径所创建的圆形。种类的多边形可以包括菱形或正多边形等。
根据上述语音指令,处理器根据第一端点的位置信息和第二端点的位置信息,执行该语音指令。本申请实施例在下面描述中,以第一端点为第一触点,第二端点为第二触点为例进行说明。
具体的,下面结合图5,分别对处理器执行上述语音指令的过程进行描述。
1)、若上述语音指令为“创建直线”,则处理器连接第一触点A和第二触点B,得到一条直线,并指示显示屏显示该直线。
2)、若上述语音指令为“创建带双向箭头的直线”或“创建带双向箭头的折线”,则处理器以第一触点A和第二触点B为两个端点,创建带双向箭头的直线或折线,并指示显示屏显示该带双向箭头的直线或折线。
3)、若上述语音指令为“创建线段”,则处理器以第一触点A为一个端点,以第二触点B为另一个端点,创建线段,并指示显示屏显示该线段。
4)、若上述语音指令为“创建折线”,则处理器以第一触点A为一个端点,以第二触点B为另一个端点,创建折线,并指示显示屏显示该折线。其中,折线的具体形状,可以是处理器的预设规则指定的,本申请实施例对此不作限定。
5)、若上述语音指令为“创建曲线”,则处理器以第一触点A为一个端点,以第二触点B为另一个端点,创建曲线,并指示显示屏显示该曲线。其中,曲线的曲率以及弯曲的形状,可以是处理器的预设规则指定的,本申请实施例对此不作限定。
6)、若上述语音指令为“以直径创建圆”,则处理器以第一触点A和第二触点B之间的连线为直径创建圆,并指示显示屏显示该圆。
7)、若上述语音指令为“创建菱形”,则处理器以第一触点A和第二触点B的连线为菱形的边长,创建菱形,并指示显示屏显示该菱形。第一触点A和第二触点B的连线与该菱形的相对位置、以及菱形的内角度数,可以是处理器的预设规则指定的,本申请实施例对此不作限定。或者,处理器以第一触点A和第二触点B的连线为菱形的一条对角线,创建菱形,并指示显示屏显示该菱形。这时,菱形的另一条对角线的长度可以是处理器的预设规则指定的,本申请实施例对此不作限定。
示例性的,参考图10,若上述语音指令为“以对角线创建菱形”,且处理器的预设规则为,未知对角线长度为已知对角线长度的1/2。如图10所示,处理器根据接收到的语音指令,创建一个以触点A和触点B的连线AB为一条对角线,且另一条对角线为(AB)/2的菱形101,并在显示屏100上显示菱形101。
8)、若上述语音指令为“创建正五边形”,则处理器以第一触点A和第二触点B 的连线为正五边形的对角线,创建该正五边形,并指示显示屏显示该五边形。或者,处理器以第一触点A和第二触点B的连线为正五边形的边长,创建该正五边形,并指示显示屏显示该五边形。第一触点A和第二触点B的连线与该正五边形的相对位置,可以是处理器的预设规则指定的,本申请实施例对此不作限定。
当然,上面描述的语音指令,仅为示例性说明人机交互装置在待操作区域中所执行的语音指令,并不构成对本申请实施例保护范围的限定。
场景三
处理器在语音处理时间窗内接收到的语音指令为创建对象,该对象可以通过两点确定,并且,该对象在这两点上的属性不同。例如射线、带单向箭头的线或圆形等对象。
由于上述对象在两个点上的属性不同,因此,处理器可以基于第一触摸操作与第二触摸操作的开始触摸时间的先后顺序,创建该对象(例如,创建圆或带单向箭头的线)。当然,本申请实施例并不限定于此。
当创建圆时,该圆的圆心的位置可以是基于第一触摸操作的触点的位置确定的,该圆的圆周中的点的位置可以是基于第二触摸操作的触点的位置确定的。当创建带单向箭头的线时,该线中箭头指向的端点的位置可以是基于第一触摸操作的触点的位置确定的,该线中非箭头指向的端点的位置可以是基于第二触摸操作的触点的位置确定的。这里,第一触摸操作的开始触摸时间先于或后于第二触摸操作。
因此,根据上述语音指令,处理器根据第一触摸操作与第二触摸操作的开始触摸时间的先后顺序,确定第一端点的位置信息和第二端点的位置信息,并执行该语音指令。本申请实施例在下面描述中,以第一触摸操作先于第二触摸操作,且处理器将第一触点作为第一端点,将第二触点作为第二端点为例进行说明。
具体的,下面分别对处理器执行上述语音指令的过程进行描述。
1)、若上述语音指令为“创建射线”,则处理器可以以第一端点为该射线的端点,创建经过第二端点的射线,并指示显示屏显示该射线。当然,该射线的端点也可以是第二端点,这时,该射线还经过第一端点。
2)、若上述语音指令为“创建带单向箭头的线”,则处理器可以以第一端点为该带单向箭头的线的箭头顶点,并且,第二端点为该带单向箭头的线远离箭头一侧的端点,创建带单向箭头的线,并指示显示屏显示该带单向箭头的线。当然,该带单向箭头的线的箭头顶点也可以是第二端点,这时,第一端点为该带单向箭头的线远离箭头一侧的端点。
3)、若上述语音指令为“以半径创建圆”,则处理器可以以第一端点和第二端点的连线为半径创建圆,并指示显示屏显示该圆。其中,若处理器以第一端点为圆心创建圆,则第二端点为圆周上一点。若处理器以第二端点为圆心创建圆,则第一端点为该圆形圆周上一点。
示例性的,参考图11。若处理器在语音监听窗口接收到的语音指令为“以半径创建圆”。此时,如图11所示,人机交互装置确定触点A为第一端点,确定触点B为第二端点。处理器根据该语音指令,以第一端点和第二端点的连线为半径,且以第二端点为圆心创建圆,得到圆111,并指示显示屏110显示圆111。这时,第一端点为圆 111圆周上的一点。
下面对处理器跟踪触点的过程进行描述。
参考图12,图12中的(a)示出了处理器在当前帧的上一帧接收到的第一触点(标记为触点A)和第二触点(标记为触点B)的位置信息。其中,A的坐标为(x
a,y
a),B的坐标为(x
b,y
b)。并且,处理器为触点A分配IDa,为触点B分配IDb。图12中的(b)示出了处理器在当前帧接收到的触点C。其中,C的坐标为(x
c,y
c)。
这里,当前帧的上一帧为处理器在当前时刻的上一时刻,或在当前周期的上一个周期内接收到的帧。当前帧为处理器在当前时刻,或当前周期接收到的帧。
具体的,处理器跟踪触点的过程,可以包括为当前帧中的触点分配ID的过程,以及包括根据当前帧中触点的ID确定当前帧的上一帧中的触点失踪的过程。
参考图13,图13示出了处理器为当前帧中的触点分配ID的流程示意图,该流程包括:
S131、处理器根据触点A、触点B以及触点C的位置信息,确定触点A和触点C之间的距离D1,以及确定触点B和触点C之间的距离D2。
具体的,处理器根据上一帧中触点A、触点B的位置信息,以及当前帧中触点C的位置信息,确定触点A和触点C之间的距离D1,并确定触点B和触点C之间的距离D2。
S132、处理器判断D1是否小于或等于门限值。
这里,门限值可以为处理器预先设置的,门限值的大小可以根据实际情况设定,本申请实施例对此不作具体限定。
若D1小于或等于门限值,则执行S133。若D1大于门限值,则执行S134。
在实际应用中,可能存在触摸误操作,例如手掌碰触触摸屏,或者,手指或触摸笔在触摸触摸屏的过程中滑动。这时,当前帧中的触点和当前帧的上一帧的中的触点之间的距离通常较小。本申请实施例通过设置门限值,将此类当前帧中误操作生成的触点,与当前帧的上一帧中距离小于或等于门限值的触点,确定为同一个触点坐标跟踪序列,该触点坐标跟踪序列共用一个ID,且对应一个触摸操作。这样,有助于降低“根据触摸操作确定的语音处理时间窗”因误操作而而导致意外开始或结束的发生。
S133、处理器将触点A的ID分配给触点C。
处理器将触点A的ID分配给触点C,即触点C的ID为IDa。这时,触点A和触点C属于同一个触点坐标跟踪序列,该触点坐标跟踪序列的ID为IDa。该情况下,触点C作为第一触点。
执行S133之后,本次为当前帧中的触点分配ID流程结束。
S134、处理器判断D2是否小于或等于门限值。
若D2小于或等于门限值,则执行S135。若D2大于门限值,则执行S136。
S135、处理器将触点B的ID分配给触点C。
处理器将触点B的ID分配给触点C,即触点C的ID为IDb。这时,触点B和触点C属于同一个触点坐标跟踪序列,该触点坐标跟踪序列的ID为IDb。该情况下,触点C作为第二触点。
执行S135之后,本次为当前帧中的触点分配ID流程结束。
S136、处理器为触点C新分配ID。
处理器为触点C新分配ID,例如IDc。这时,表明触点C为当前新出现的触点。该情况下,触点C既不是第一触点,也不是第二触点。例如,触点C可以是误操作产生的触点,或者,触点C出现的时刻可以是下一个处理时间窗的起点等。
示例性的,处理器确定当前帧中存在触点A(即第一触点)和触点B(第二触点),该情况下,可以认为用户在通过手指触摸触摸屏时,手掌意外接触触摸屏而导致产生了新的触点C,此时,触点C是用户的误操作产生的触点,
示例性的,处理器确定当前帧中不存在触点A(即第一触点)或触点B(第二触点)。这时,新产生的触点C将与当前帧中存在的触点B或触点A产生新的重叠的触摸时长,即触点C的开始触摸时刻可以是下一个语音处理时间窗的起点。
通过S131-S136描述的触点跟踪方法,处理器可以对连续多帧的触点跟踪,得到多个触点的坐标跟踪序列。一个触点坐标跟踪序列共用一个ID,一个触点ID对应一个触摸动作。因此,一个触点跟踪序列可以描述一个触摸操作在触摸屏上的滑动轨迹。
进一步的,处理器根据当前帧中触点的ID,确定当前帧的上一帧中的触点是否失踪。参考图14,该确定当前帧的上一帧中的触点是否失踪的过程具体包括:
S141、处理器确定当前帧中的触点的ID。
具体的,处理器可以通过上述S131-S136确定当前帧中的触点的ID,此处不再详细赘述。
S142、处理器判断当前帧的上一帧中的触点的ID值中,是否存在与当前帧中的触点的ID值不同的ID值。若存在,则执行S143;若不存在,则执行S144。
处理器将上一帧中的触点的ID值与当前帧中触点的ID值一一比对,用以判断上一帧中的触点的ID值中,是否存在与当前帧中的触点的ID值不同的ID值。
示例性的,结合图12及图13,上一帧中触点的ID包括IDa和IDb,并且,处理器为当前帧中的触点C分配了IDa。这样的话,处理器可以确定当前帧中的触点的ID值不包括上一帧中的IDb。也就是说,处理器确定上一帧中的触点的ID值中,存在与当前帧中的触点的ID值不同的ID值,即IDb。
S143、处理器确定该ID值对应的触点失踪。
处理器确定当前帧的上一帧中的触点的ID值中,与当前帧中的触点的ID值不同的ID值所对应的触点失踪。此时,表明该触点对应的触摸操作的触摸时长结束,即与该触摸操作对应的语音处理时间窗结束。
S144、处理器继续执行监听语音指令,并且继续跟踪当其帧的下一帧的触点。
当其帧的上一帧中的触点的ID值中,不存在与当前帧中的触点的ID值不同的ID值,也就是说,当其帧的上一帧中的触点ID依旧在当前帧中出现。这时,说明当其帧的上一帧中的触点均没有失踪,即当其帧的上一帧中的触点所对应的触摸操作的触摸时长并未结束。这种情况下,处理器根据触摸操作,继续执行监听语音指令。并且,处理器继续跟踪当其帧的下一帧的触点。
此外,本申请实施例提供的人机交互方法还能对多触点进行跟踪。若当前帧中的包括m个触点,在该m个触点中,处理器确定n个触点与上一帧中的一个触点(例如触点A)之间的距离均小于门限值。这里,m和n分别为大于0的正整数,且m大于 或等于n。这时,处理器在当前帧中的n个触点中,确定出与当前帧的上一帧中的触点A距离最近的一个触点(例如触点C)。处理器将触点A的ID值分配给触点C,这时,触点A和触点C属于同一个触点坐标跟踪序列,该触点坐标跟踪序列共用一个ID。然后,处理器将当前帧里n个触点中,除触点A之外的n-1个触点丢弃,或将其置为无效。
可以理解的是,通过上述S131-S136,以及S141-S144描述的触点跟踪流程,处理器可以确定语音处理时间窗的起点和终点,并在语音处理时间窗内,执行监听语音指令。
具体的,参考图15,图15示出了触点跟踪和语音处理时间窗的关系。如图15所示,本申请实施例将处理器不执行监听语音指令的状态称作空闲状态,将处理器执行监听语音指令的状态称作监听状态。图15中的箭头表示处理器实时跟踪触点。
如图15所示,在第一时刻,处理器根据触摸屏发送的当前帧,确定当前帧中至少存在两个触点(例如第一触点和第二触点)。处理器为第一触点分配ID1,为第二触点分配ID2。这时,处理器确认第一时刻为语音处理时间窗的起点,并开始执行监听语音指令。
在第二时刻,处理器根据触摸屏发送的当前帧,确定当前帧中的触点所对应的ID值中不存在ID1或ID2。若不存在ID1,则处理器确定ID1对应的第一触点失踪,这时,表明第一触点对应的触摸操作的触摸时长结束。若不存在ID2,则处理器确定ID2对应的第二触点失踪,这时,表明第二触点对应的触摸操作的时长结束。因此,处理器确定第二时刻为语音处理时间窗的终点,并结束执行监听语音指令。
这时,处理器回到空闲态,并继续跟踪是否有新的触点出现,以便确定下个一语音处理时间窗的起点。
在实际应用中,可能存在触摸误操作,例如手掌碰触触摸屏,或者,手指或触摸笔在触摸触摸屏的过程中滑动。通过本申请实施例提供的触点跟踪方法,不仅能确定语音处理时间窗的起点和终点,还能避免触摸误操作所造成的语音处理时间窗意外开始或结束的问题。
综上,本申请实施例提供一种人机交互方法中,一方面,基于多点触控的重叠触摸时长,确定待执行的语音指令,这样,无需频繁使用唤醒词来唤醒系统,因此有助于提高用户在人机交互过程中的体验。另一方面,基于多点触控,确定该语音指令的待操作对象在触摸屏中的位置信息。这样,在大屏场景中,可以缓解在大屏上长距离滑动的问题,从而有助于提高用户在人机交互过程中的体验。
上述主要从方法的角度对本申请实施例提供的方案进行了介绍。为了实现上述功能,其包含了执行各个功能相应的硬件结构和/或软件模块。本领域技术人员应该很容易意识到,结合本文中所公开的实施例描述的各示例的单元及算法步骤,本申请能够以硬件或硬件和计算机软件的结合形式来实现。某个功能究竟以硬件还是计算机软件驱动硬件的方式来执行,取决于技术方案的特定应用和设计约束条件。专业技术人员可以对每个特定的应用来使用不同方法来实现所描述的功能,但是这种实现不应认为超出本申请的范围。
本申请实施例可以根据上述方法示例对人机交互系统进行划分。参考图1,图1 示出的人机交互系统可以用于执行人机交互方法,例如用于执行图4、图13和图14所示的方法。触摸屏13,用于接收第一触摸操作和第二触摸操作,该第一触摸操作和该第二触摸操作的触摸时长有重叠。处理器11,用于接收语音指令,该语音指令是在第一触摸操作和第二触摸操作的重叠的触摸时长内接收到的语音指令。处理器11,还用于响应第一触摸操作和所述第二触摸操作,确定语音指令的待操作对象在触摸屏13中的位置信息。处理器11,还用于基于待操作对象在触摸屏13中的位置信息,执行语音指令。例如,结合图4,触摸屏13可以用于执行S401和S402,处理器11可以用于执行S403、S405以及S406。
可选的,处理器11,具体用于确定语音指令的待操作区域;待操作区域是触摸屏13中的区域,待操作对象包含于待操作区域。处理器11,还用于在待操作区域中执行语音指令。例如,结合图4,处理器11可以用于执行S405。
可选的,处理器11具体用于:当语音指令用于指示选中对象时,选中待操作区域中的对象;或者,当语音指令用于指示将对象的属性信息修改为目标属性信息时,将待操作区域中的对象的属性信息修改为所述目标属性信息;或者,当语音指令用于指示删除对象时,删除待操作区域中的对象;或者,当语音指令用于指示创建对象时,在待操作区域中创建对象。例如,结合图4,处理器11可以用于执行S406。
可选的,显示屏132,用于显示待操作区域。
可选的,上述语音指令用于指示创建所述待操作对象,该待操作对象包括第一端点和第二端点。处理器11,还用于基于第一触摸操作的触点的位置信息,确定第一端点在触摸屏13中的位置信息;以及基于第二触摸操作的触点的位置信息,确定第二端点在触摸屏13中的位置信息。例如,结合图4,处理器11可以用于执行S405。
可选的,待操作对象包括以下至少一种:直线、线段、折线、曲线或多边形。
可选的,处理器11,具体用于基于待操作对象在触摸屏13中的位置信息,以及第一触摸操作与第二触摸操作的开始触摸时间的先后顺序,执行语音指令。例如,结合图4,处理器11可以用于执行S406。
可选的,处理器11,具体用于当语音指令用于指示创建圆时,基于第一触摸操作与第二触摸操作的开始触摸时间的先后顺序,创建该圆。其中,该圆的圆心的位置是基于第一触摸操作的触点的位置确定的,该圆的圆周中的点的位置是基于第二触摸操作的触点的位置确定的。第一触摸操作的开始触摸时间先于或后于第二触摸操作。例如,结合图4,处理器11可以用于执行S406。
可选的,处理器11,具体用于当语音指令用于指示创建带单向箭头的线时,基于第一触摸操作与第二触摸操作的开始触摸时间的先后顺序,创建该带单向箭头的线。其中,该线中箭头指向的端点的位置是基于第一触摸操作的触点的位置确定的,该线中非箭头指向的端点的位置是基于第二触摸操作的触点的位置确定的。第一触摸操作的开始触摸时间先于或后于第二触摸操作。例如,结合图4,处理器11可以用于执行S406。
可选的,语音采集器14可以用于采集语音指令。
关于上述可选方式的具体描述参见前述的方法实施例,此处不再赘述。此外,上述提供的任一种人机交互系统的解释以及有益效果的描述均可参考上述对应的方法实 施例,不再赘述。
另外,本申请实施例可以根据上述方法示例对上述处理器,或者包含处理器的计算机设备进行功能模块的划分,例如,可以对应各个功能划分各个功能模块,也可以将两个或两个以上的功能集成在一个处理模块中。上述集成的模块既可以采用硬件的形式实现,也可以采用软件功能模块的形式实现。需要说明的是,本申请实施例中对模块的划分是示意性的,仅仅为一种逻辑功能划分,实际实现时可以有另外的划分方式。
如图16所示,图16示出了本申请实施例提供的一种处理器或计算机设备的结构示意图。处理器或计算机设备用于执行上述人机交互方法,例如用于执行图4、图13、图14和图15所示的方法。其中,处理器或计算机设备可以包括获取单元161、接收单元162、确定单元163以及执行单元164。
获取单元161,用于获取第一触点的位置信息和第二触点的位置信息。这里,第一触点是基于第一触摸操作的触点,第二触点是基于第二触摸操作的触点,第一触摸操作和第二触摸操作的触摸时长有重叠。接收单元162,用于接收语音指令,该语音指令是在第一触摸操作和第二触摸操作的重叠的触摸时长内接收到的语音指令。确定单元163,用于响应获取单元161的获取操作,确定语音指令的待操作对象的位置信息。执行单元164,用于基于确定单元163确定的待操作对象的位置信息,执行语音指令。例如,结合图4,获取单元161可以用于执行S403,确定单元163可以用于执行S405,执行单元164可以用于执行S406。
可选的,确定单元163,具体用于确定语音指令的待操作区域,待操作对象包含于待操作区域。执行单元164,具体用于在确定单元163确定出的待操作区域中执行语音指令。例如,结合图4,确定单元163可以用于执行S405。
可选的,执行单元164具体用于,当语音指令用于指示选中对象时,选中待操作区域中的对象;或者,当语音指令用于指示将对象的属性信息修改为目标属性信息时,将待操作区域中的对象的属性信息修改为目标属性信息;或者,当语音指令用于指示删除对象时,删除待操作区域中的对象;或者,当语音指令用于指示创建对象时,在待操作区域中创建对象。例如,结合图4,执行单元164可以用于执行S406。
可选的,语音指令用于指示创建待操作对象,该待操作对象包括第一端点和第二端点。确定单元163,具体用于基于第一触点的位置信息,确定第一端点的位置信息;基于第二触点的位置信息,确定第二端点的位置信息。例如,结合图4,确定单元163可以用于执行S405。
可选的,执行单元164,具体用于基于待操作对象的位置信息,以及第一触摸操作与第二触摸操作的开始触摸时间的先后顺序,执行语音指令。例如,结合图4,执行单元164可以用于执行S406。
可选的,执行单元164,具体用于当语音指令用于指示创建圆时,基于第一触摸操作与第二触摸操作的开始触摸时间的先后顺序,创建圆。其中,该圆的圆心的位置是基于第一触摸操作的触点的位置确定的,该圆的圆周中的点的位置是基于第二触摸操作的触点的位置确定的。第一触摸操作的开始触摸时间先于或后于第二触摸操作。例如,结合图4,执行单元164可以用于执行S406。
可选的,执行单元164,具体用于当语音指令用于指示创建带单向箭头的线时,基于第一触摸操作与第二触摸操作的开始触摸时间的先后顺序,创建带单向箭头的线。其中,该线中箭头指向的端点的位置是基于第一触摸操作的触点的位置确定的,该线中非箭头指向的端点的位置是基于第二触摸操作的触点的位置确定的。第一触摸操作的开始触摸时间先于或后于第二触摸操作。例如,结合图4,执行单元164可以用于执行S406。
当然,本申请实施例提供的计算机设备包括但不限于上述单元,例如该计算机设备还可以包括存储单元165。存储单元165可以用于存储该计算机设备的程序代码。
作为一个示例,结合图1,计算机设备中的获取单元161、接收单元162、确定单元163以及执行单元164实现的功能与图1中的处理器11的功能相同。存储单元165实现的功能与图1中的存储器12的功能相同。
关于上述可选方式的具体描述参见前述的方法实施例,此处不再赘述。此外,上述提供的任一种处理器或计算机设备的解释以及有益效果的描述均可参考上述对应的方法实施例,不再赘述。
本申请实施例还提供一种芯片系统,如图17所示,该芯片系统包括至少一个处理器171和至少一个接口电路172。处理器171和接口电路172可通过线路互联。例如,接口电路172可用于从其它装置(例如人机交互系统的存储器,或者计算机设备的存储器)接收信号。又例如,接口电路172可用于向其它装置(例如处理器171)发送信号。示例性的,接口电路172可读取存储器中存储的指令,并将该指令发送给处理器171。当所述指令被处理器171执行时,可使得人机交互系统或者计算机设备执行上述实施例中的各个步骤。当然,该芯片系统还可以包含其他分立器件,本申请实施例对此不作具体限定。
本申请另一实施例还提供一种计算机可读存储介质,该计算机可读存储介质中存储有指令,当指令在人机交互系统或者计算机设备上运行时,该人机交互系统或者计算机设备执行上述方法实施例所示的方法流程中人机交互系统或者计算机设备执行的各个步骤。
在一些实施例中,所公开的方法可以实施为以机器可读格式被编码在计算机可读存储介质上的或者被编码在其它非瞬时性介质或者制品上的计算机程序指令。
图18示意性地示出本申请实施例提供的计算机程序产品的概念性局部视图,所述计算机程序产品包括用于在计算设备上执行计算机进程的计算机程序。
在一个实施例中,计算机程序产品是使用信号承载介质180来提供的。所述信号承载介质180可以包括一个或多个程序指令,其当被一个或多个处理器运行时可以提供以上针对图4描述的功能或者部分功能。因此,例如,参考图4中S401~S406的一个或多个特征可以由与信号承载介质180相关联的一个或多个指令来承担。此外,图18中的程序指令也描述示例指令。
在一些示例中,信号承载介质180可以包含计算机可读介质181,诸如但不限于,硬盘驱动器、紧密盘(CD)、数字视频光盘(DVD)、数字磁带、存储器、只读存储记忆体(read-only memory,ROM)或随机存储记忆体(random access memory,RAM)等等。
在一些实施方式中,信号承载介质180可以包含计算机可记录介质182,诸如但不限于,存储器、读/写(R/W)CD、R/W DVD、等等。
在一些实施方式中,信号承载介质180可以包含通信介质183,诸如但不限于,数字和/或模拟通信介质(例如,光纤电缆、波导、有线通信链路、无线通信链路、等等)。
信号承载介质180可以由无线形式的通信介质183(例如,遵守IEEE 802.11标准或者其它传输协议的无线通信介质)来传达。一个或多个程序指令可以是,例如,计算机可执行指令或者逻辑实施指令。
在一些示例中,诸如针对图4描述的人机交互系统好或者计算机设备可以被配置为,响应于通过计算机可读介质181、计算机可记录介质182、和/或通信介质183中的一个或多个程序指令,提供各种操作、功能、或者动作。
应该理解,这里描述的布置仅仅是用于示例的目的。因而,本领域技术人员将理解,其它布置和其它元素(例如,机器、接口、功能、顺序、和功能组等等)能够被取而代之地使用,并且一些元素可以根据所期望的结果而一并省略。另外,所描述的元素中的许多是可以被实现为离散的或者分布式的组件的、或者以任何适当的组合和位置来结合其它组件实施的功能实体。
在上述实施例中,可以全部或部分地通过软件、硬件、固件或者其任意组合来实现。当使用软件程序实现时,可以全部或部分地以计算机程序产品的形式来实现。该计算机程序产品包括一个或多个计算机指令。在计算机上加载和执行计算机执行指令时,全部或部分地产生按照本申请实施例的流程或功能。计算机可以是通用计算机、专用计算机、计算机网络、或者其他可编程装置。计算机指令可以存储在计算机可读存储介质中,或者从一个计算机可读存储介质向另一个计算机可读存储介质传输,例如,计算机指令可以从一个网站站点、计算机、服务器或者数据中心通过有线(例如同轴电缆、光纤、数字用户线(digital subscriber line,DSL))或无线(例如红外、无线、微波等)方式向另一个网站站点、计算机、服务器或数据中心进行传输。计算机可读存储介质可以是计算机能够存取的任何可用介质或者是包含一个或多个可以用介质集成的服务器、数据中心等数据存储设备。可用介质可以是磁性介质(例如,软盘、硬盘、磁带),光介质(例如,DVD)、或者半导体介质(例如固态硬盘(solid state disk,SSD))等。
以上所述,仅为本申请的具体实施方式。熟悉本技术领域的技术人员根据本申请提供的具体实施方式,可想到变化或替换,都应涵盖在本申请的保护范围之内。
Claims (37)
- 一种人机交互方法,其特征在于,应用于人机交互系统,所述人机交互系统包括触摸屏,所述方法包括:接收第一触摸操作和第二触摸操作,所述第一触摸操作和所述第二触摸操作的触摸时长有重叠;接收语音指令,所述语音指令是在所述第一触摸操作和所述第二触摸操作的重叠的触摸时长内接收到的语音指令;响应所述第一触摸操作和所述第二触摸操作,确定所述语音指令的待操作对象在所述触摸屏中的位置信息;基于所述待操作对象在所述触摸屏中的位置信息,执行所述语音指令。
- 根据权利要求1所述的方法,其特征在于,所述确定所述语音指令的待操作对象在所述触摸屏中的位置信息,包括:确定所述语音指令的待操作区域;所述待操作区域是所述触摸屏中的区域,所述待操作对象包含于所述待操作区域;所述基于所述待操作对象在所述触摸屏中的位置信息,执行所述语音指令,包括:在所述待操作区域中执行所述语音指令。
- 根据权利要求2所述的方法,其特征在于,所述在所述待操作区域中执行所述语音指令,包括:当所述语音指令用于指示选中对象时,选中所述待操作区域中的对象;或者,当所述语音指令用于指示将对象的属性信息修改为目标属性信息时,将所述待操作区域中的对象的属性信息修改为所述目标属性信息;或者,当所述语音指令用于指示删除对象时,删除所述待操作区域中的对象;或者,当所述语音指令用于指示创建对象时,在所述待操作区域中创建对象。
- 根据权利要求2或3所述的方法,其特征在于,所述方法还包括:显示所述待操作区域。
- 根据权利要求1所述的方法,其特征在于,所述语音指令用于指示创建所述待操作对象,所述待操作对象包括第一端点和第二端点;所述确定所述语音指令的待操作对象在所述触摸屏中的位置信息,包括:基于所述第一触摸操作的触点的位置信息,确定所述第一端点在所述触摸屏中的位置信息;基于所述第二触摸操作的触点的位置信息,确定所述第二端点在所述触摸屏中的位置信息。
- 根据权利要求5所述的方法,其特征在于,所述待操作对象包括以下至少一种:直线、线段、折线、曲线或多边形。
- 根据权利要求1所述的方法,其特征在于,所述基于所述待操作对象在所述触摸屏中的位置信息,执行所述语音指令,包括:基于所述待操作对象在所述触摸屏中的位置信息,以及所述第一触摸操作与所述第二触摸操作的开始触摸时间的先后顺序,执行所述语音指令。
- 根据权利要求7所述的方法,其特征在于,所述基于所述待操作对象在所述触 摸屏中的位置信息,以及所述第一触摸操作与所述第二触摸操作的开始触摸时间的先后顺序,执行所述语音指令,包括:当所述语音指令用于指示创建圆时,基于所述第一触摸操作与所述第二触摸操作的开始触摸时间的先后顺序,创建所述圆;其中,所述圆的圆心的位置是基于所述第一触摸操作的触点的位置确定的,所述圆的圆周中的点的位置是基于所述第二触摸操作的触点的位置确定的;所述第一触摸操作的开始触摸时间先于或后于所述第二触摸操作。
- 根据权利要求7所述的方法,其特征在于,所述基于所述待操作对象在所述触摸屏中的位置信息,以及所述第一触摸操作与所述第二触摸操作的开始触摸时间的先后顺序,执行所述语音指令,包括:当所述语音指令用于指示创建带单向箭头的线时,基于所述第一触摸操作与所述第二触摸操作的开始触摸时间的先后顺序,创建所述带单向箭头的线;其中,所述线中箭头指向的端点的位置是基于所述第一触摸操作的触点的位置确定的,所述线中非箭头指向的端点的位置是基于所述第二触摸操作的触点的位置确定的;所述第一触摸操作的开始触摸时间先于或后于所述第二触摸操作。
- 一种人机交互方法,其特征在于,应用于计算机设备,所述方法包括:获取第一触点的位置信息和第二触点的位置信息;所述第一触点是基于第一触摸操作的触点,所述第二触点是基于第二触摸操作的触点,所述第一触摸操作和所述第二触摸操作的触摸时长有重叠;接收语音指令,所述语音指令是在所述第一触摸操作和所述第二触摸操作的重叠的触摸时长内接收到的语音指令;响应所述获取操作,确定所述语音指令的待操作对象的位置信息;基于所述待操作对象的位置信息,执行所述语音指令。
- 根据权利要求10所述的方法,其特征在于,所述确定所述语音指令的待操作对象的位置信息,包括:确定所述语音指令的待操作区域,所述待操作对象包含于所述待操作区域;所述基于所述待操作对象的位置信息,执行所述语音指令,包括:在所述待操作区域中执行所述语音指令。
- 根据权利要求11所述的方法,其特征在于,所述在所述待操作区域中执行所述语音指令,包括:当所述语音指令用于指示选中对象时,选中所述待操作区域中的对象;或者,当所述语音指令用于指示将对象的属性信息修改为目标属性信息时,将所述待操作区域中的对象的属性信息修改为所述目标属性信息;或者,当所述语音指令用于指示删除对象时,删除所述待操作区域中的对象;或者,当所述语音指令用于指示创建对象时,在所述待操作区域中创建对象。
- 根据权利要求10所述的方法,其特征在于,所述语音指令用于指示创建所述待操作对象,所述待操作对象包括第一端点和第二端点;所述确定所述语音指令的待操作对象的位置信息,包括:基于所述第一触点的位置信息,确定所述第一端点的位置信息;基于所述第二触点的位置信息,确定所述第二端点的位置信息。
- 根据权利要求10所述的方法,其特征在于,所述基于所述待操作对象的位置信息,执行所述语音指令,包括:基于所述待操作对象的位置信息,以及所述第一触摸操作与所述第二触摸操作的开始触摸时间的先后顺序,执行所述语音指令。
- 根据权利要求14所述的方法,其特征在于,所述基于所述待操作对象的位置信息,以及所述第一触摸操作与所述第二触摸操作的开始触摸时间的先后顺序,执行所述语音指令,包括:当所述语音指令用于指示创建圆时,基于所述第一触摸操作与所述第二触摸操作的开始触摸时间的先后顺序,创建所述圆;其中,所述圆的圆心的位置是基于所述第一触摸操作的触点的位置确定的,所述圆的圆周中的点的位置是基于所述第二触摸操作的触点的位置确定的;所述第一触摸操作的开始触摸时间先于或后于所述第二触摸操作。
- 根据权利要求14所述的方法,其特征在于,所述基于所述待操作对象的位置信息,以及所述第一触摸操作与所述第二触摸操作的开始触摸时间的先后顺序,执行所述语音指令,包括:当所述语音指令用于指示创建带单向箭头的线时,基于所述第一触摸操作与所述第二触摸操作的开始触摸时间的先后顺序,创建所述带单向箭头的线;其中,所述线中箭头指向的端点的位置是基于所述第一触摸操作的触点的位置确定的,所述线中非箭头指向的端点的位置是基于所述第二触摸操作的触点的位置确定的;所述第一触摸操作的开始触摸时间先于或后于所述第二触摸操作。
- 一种人机交互系统,其特征在于,所述人机交互系统包括触摸屏和处理器;所述触摸屏,用于接收第一触摸操作和第二触摸操作,所述第一触摸操作和所述第二触摸操作的触摸时长有重叠;所述处理器,用于接收语音指令,所述语音指令是在所述第一触摸操作和所述第二触摸操作的重叠的触摸时长内接收到的语音指令;所述处理器,还用于响应所述第一触摸操作和所述第二触摸操作,确定所述语音指令的待操作对象在所述触摸屏中的位置信息;以及用于基于所述待操作对象在所述触摸屏中的位置信息,执行所述语音指令。
- 根据权利要求17所述的系统,其特征在于,所述处理器,具体用于确定所述语音指令的待操作区域;所述待操作区域是所述触摸屏中的区域,所述待操作对象包含于所述待操作区域;以及用于在所述待操作区域中执行所述语音指令。
- 根据权利要求18所述的系统,其特征在于,所述处理器具体用于,当所述语音指令用于指示选中对象时,选中所述待操作区域中的对象;或者,当所述语音指令用于指示将对象的属性信息修改为目标属性信息时,将所述待操作区域中的对象的属性信息修改为所述目标属性信息;或者,当所述语音指令用于指示删除对象时,删除所述待操作区域中的对象;或者,当所述语音指令用于指示创建对象时,在所述待操作区域中创建对象。
- 根据权利要求18或19所述的系统,其特征在于,所述系统还包括显示屏;所述显示屏,用于显示所述待操作区域。
- 根据权利要求17所述的系统,其特征在于,所述语音指令用于指示创建所述待操作对象,所述待操作对象包括第一端点和第二端点;所述处理器,还用于基于所述第一触摸操作的触点的位置信息,确定所述第一端点在所述触摸屏中的位置信息;基于所述第二触摸操作的触点的位置信息,确定所述第二端点在所述触摸屏中的位置信息。
- 根据权利要求21所述的系统,其特征在于,所述待操作对象包括以下至少一种:直线、线段、折线、曲线或多边形。
- 根据权利要求17所述的系统,其特征在于,所述处理器,具体用于基于所述待操作对象在所述触摸屏中的位置信息,以及所述第一触摸操作与所述第二触摸操作的开始触摸时间的先后顺序,执行所述语音指令。
- 根据权利要求23所述的系统,其特征在于,所述处理器,具体用于当所述语音指令用于指示创建圆时,基于所述第一触摸操作与所述第二触摸操作的开始触摸时间的先后顺序,创建所述圆;其中,所述圆的圆心的位置是基于所述第一触摸操作的触点的位置确定的,所述圆的圆周中的点的位置是基于所述第二触摸操作的触点的位置确定的;所述第一触摸操作的开始触摸时间先于或后于所述第二触摸操作。
- 根据权利要求23所述的系统,其特征在于,所述处理器,具体用于当所述语音指令用于指示创建带单向箭头的线时,基于所述第一触摸操作与所述第二触摸操作的开始触摸时间的先后顺序,创建所述带单向箭头的线;其中,所述线中箭头指向的端点的位置是基于所述第一触摸操作的触点的位置确定的,所述线中非箭头指向的端点的位置是基于所述第二触摸操作的触点的位置确定的;所述第一触摸操作的开始触摸时间先于或后于所述第二触摸操作。
- 一种计算机设备,其特征在于,所述计算机设备包括:获取单元,接收单元,确定单元以及执行单元;所述获取单元,用于获取第一触点的位置信息和第二触点的位置信息;所述第一触点是基于第一触摸操作的触点,所述第二触点是基于第二触摸操作的触点,所述第一触摸操作和所述第二触摸操作的触摸时长有重叠;所述接收单元,用于接收语音指令,所述语音指令是在所述第一触摸操作和所述第二触摸操作的重叠的触摸时长内接收到的语音指令;所述确定单元,用于响应所述获取单元的获取操作,确定所述语音指令的待操作对象的位置信息;所述执行单元,用于基于所述确定单元确定的所述待操作对象的位置信息,执行所述语音指令。
- 根据权利要求26所述的计算机设备,其特征在于,所述确定单元,具体用于确定所述语音指令的待操作区域,所述待操作对象包含于所述待操作区域;所述执行单元,具体用于在所述确定单元确定出的所述待操作区域中执行所述语音指令。
- 根据权利要求27所述的计算机设备,其特征在于,所述执行单元具体用于,当所述语音指令用于指示选中对象时,选中所述待操作区域中的对象;或者,当所述语音指令用于指示将对象的属性信息修改为目标属性信息时,将所述待操作区域中的对象的属性信息修改为所述目标属性信息;或者,当所述语音指令用于指示删除对象时,删除所述待操作区域中的对象;或者,当所述语音指令用于指示创建对象时,在所述待操作区域中创建对象。
- 根据权利要求26所述的计算机设备,其特征在于,所述语音指令用于指示创建所述待操作对象,所述待操作对象包括第一端点和第二端点;所述确定单元,具体用于基于所述第一触点的位置信息,确定所述第一端点的位置信息;基于所述第二触点的位置信息,确定所述第二端点的位置信息。
- 根据权利要求26所述的计算机设备,其特征在于,所述执行单元,具体用于基于所述待操作对象的位置信息,以及所述第一触摸操作与所述第二触摸操作的开始触摸时间的先后顺序,执行所述语音指令。
- 根据权利要求30所述的计算机设备,其特征在于,所述执行单元,具体用于当所述语音指令用于指示创建圆时,基于所述第一触摸操作与所述第二触摸操作的开始触摸时间的先后顺序,创建所述圆;其中,所述圆的圆心的位置是基于所述第一触摸操作的触点的位置确定的,所述圆的圆周中的点的位置是基于所述第二触摸操作的触点的位置确定的;所述第一触摸操作的开始触摸时间先于或后于所述第二触摸操作。
- 根据权利要求30所述的计算机设备,其特征在于,所述执行单元,具体用于当所述语音指令用于指示创建带单向箭头的线时,基于所述第一触摸操作与所述第二触摸操作的开始触摸时间的先后顺序,创建所述带单向箭头的线;其中,所述线中箭头指向的端点的位置是基于所述第一触摸操作的触点的位置确定的,所述线中非箭头指向的端点的位置是基于所述第二触摸操作的触点的位置确定的;所述第一触摸操作的开始触摸时间先于或后于所述第二触摸操作。
- 一种计算机设备,其特征在于,包括:存储器和一个或多个处理器;所述存储器和所述处理器耦合;所述存储器用于存储计算机程序代码,所述计算机程序代码包括计算机指令,当所述计算机指令被所述计算机设备执行时,使得所述计算机设备执行如权利要求10-16中任一项所述的人机交互方法。
- 一种芯片系统,其特征在于,所述芯片系统应用于人机交互系统;所述芯片系统包括一个或多个接口电路和一个或多个处理器;所述接口电路和所述处理器通过线路互联;所述接口电路用于从所述人机交互系统的存储器接收信号,并向所述处理器发送所述信号,所述信号包括所述存储器中存储的计算机指令;当所述处理器执行所述计算机指令时,所述人机交互系统执行如权利要求1-9中任一项所述的人机交互方法。
- 一种芯片系统,其特征在于,所述芯片系统应用于计算机设备;所述芯片系统包括一个或多个接口电路和一个或多个处理器;所述接口电路和所述处理器通过线路互联;所述接口电路用于从所述计算机设备的存储器接收信号,并向所述处理器发送所述信号,所述信号包括所述存储器中存储的计算机指令;当所述处理器执行所述计算机指令时,所述计算机设备执行如权利要求10-16中任一项所述的人机交互方法。
- 一种计算机可读存储介质,其特征在于,包括计算机指令,当所述计算机指令在人机交互系统上运行时,使得所述人机交互系统实现如权利要求1-9中任意一项所述的人机交互方法。
- 一种计算机可读存储介质,其特征在于,包括计算机指令,当所述计算机指令在计算机设备上运行时,使得所述计算机设备实现如权利要求10-16中任意一项所述的人机交互方法。
Priority Applications (5)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP2022513988A JP7413513B2 (ja) | 2019-12-30 | 2019-12-30 | ヒューマンコンピュータインタラクション方法、装置、及びシステム |
| PCT/CN2019/130195 WO2021134358A1 (zh) | 2019-12-30 | 2019-12-30 | 人机交互方法、装置和系统 |
| EP19958385.7A EP3992768A4 (en) | 2019-12-30 | 2019-12-30 | METHOD, DEVICE AND SYSTEM FOR HUMAN-COMPUTER INTERACTION |
| CN201980079293.8A CN113330409B (zh) | 2019-12-30 | 2019-12-30 | 人机交互方法、装置和系统 |
| US17/854,094 US20220335943A1 (en) | 2019-12-30 | 2022-06-30 | Human-computer interaction method, apparatus, and system |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| PCT/CN2019/130195 WO2021134358A1 (zh) | 2019-12-30 | 2019-12-30 | 人机交互方法、装置和系统 |
Related Child Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US17/854,094 Continuation US20220335943A1 (en) | 2019-12-30 | 2022-06-30 | Human-computer interaction method, apparatus, and system |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2021134358A1 true WO2021134358A1 (zh) | 2021-07-08 |
Family
ID=76686232
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/CN2019/130195 Ceased WO2021134358A1 (zh) | 2019-12-30 | 2019-12-30 | 人机交互方法、装置和系统 |
Country Status (5)
| Country | Link |
|---|---|
| US (1) | US20220335943A1 (zh) |
| EP (1) | EP3992768A4 (zh) |
| JP (1) | JP7413513B2 (zh) |
| CN (1) | CN113330409B (zh) |
| WO (1) | WO2021134358A1 (zh) |
Cited By (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20230094042A1 (en) * | 2021-09-24 | 2023-03-30 | Google Llc | Personalized autonomous spreadsheets |
| US20230106058A1 (en) * | 2021-09-24 | 2023-04-06 | Google Llc | Autonomous spreadsheet creation |
| US12547823B2 (en) | 2021-09-24 | 2026-02-10 | Google Llc | Dynamically and selectively updated spreadsheets based on knowledge monitoring and natural language processing |
Citations (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN105045481A (zh) * | 2015-06-26 | 2015-11-11 | 深圳市金立通信设备有限公司 | 一种操作方法及终端 |
| CN105677223A (zh) * | 2016-01-07 | 2016-06-15 | 广东欧珀移动通信有限公司 | 一种按压触控方法和装置 |
| CN105988707A (zh) * | 2015-06-30 | 2016-10-05 | 乐视致新电子科技(天津)有限公司 | 一种控制电子设备的方法及装置 |
| CN107436698A (zh) * | 2016-05-27 | 2017-12-05 | 深圳市科潮达科技有限公司 | 触控系统及方法 |
| US20180318705A1 (en) * | 2017-05-04 | 2018-11-08 | Roblox Corporation | Touch-Screen Based Game Controller |
| CN109388318A (zh) * | 2018-09-30 | 2019-02-26 | 联想(北京)有限公司 | 一种控制方法及电子设备 |
Family Cites Families (12)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP3203061B2 (ja) * | 1992-09-07 | 2001-08-27 | シャープ株式会社 | 音声電子黒板及び音声認識機能を備える表示装置 |
| US5726688A (en) * | 1995-09-29 | 1998-03-10 | Ncr Corporation | Predictive, adaptive computer interface |
| JPH11184669A (ja) * | 1997-12-24 | 1999-07-09 | Sharp Corp | 情報処理装置及び情報処理方法並びに情報処理装置制御プログラムを記憶した媒体 |
| JP4515653B2 (ja) | 2001-03-15 | 2010-08-04 | 株式会社リコー | 情報入力装置、情報入力装置の制御方法、プログラムおよびプログラムを記録したコンピュータ読み取り可能な記録媒体 |
| US9223471B2 (en) * | 2010-12-28 | 2015-12-29 | Microsoft Technology Licensing, Llc | Touch screen control |
| WO2013170383A1 (en) | 2012-05-16 | 2013-11-21 | Xtreme Interactions Inc. | System, device and method for processing interlaced multimodal user input |
| JP5921703B2 (ja) | 2012-10-16 | 2016-05-24 | 三菱電機株式会社 | 情報表示装置および情報表示装置における操作制御方法 |
| CN103970460A (zh) * | 2013-01-30 | 2014-08-06 | 三星电子(中国)研发中心 | 基于触摸屏的操作方法和使用该方法的终端设备 |
| JP2014157520A (ja) | 2013-02-18 | 2014-08-28 | Fujitsu Ten Ltd | 表示装置、及び、表示方法 |
| US20160103655A1 (en) | 2014-10-08 | 2016-04-14 | Microsoft Corporation | Co-Verbal Interactions With Speech Reference Point |
| US10558341B2 (en) * | 2017-02-20 | 2020-02-11 | Microsoft Technology Licensing, Llc | Unified system for bimanual interactions on flexible representations of content |
| CN109542323B (zh) * | 2018-11-23 | 2021-03-02 | 网易(杭州)网络有限公司 | 基于虚拟场景的交互控制方法及装置、存储介质、电子设备 |
-
2019
- 2019-12-30 JP JP2022513988A patent/JP7413513B2/ja active Active
- 2019-12-30 EP EP19958385.7A patent/EP3992768A4/en active Pending
- 2019-12-30 CN CN201980079293.8A patent/CN113330409B/zh active Active
- 2019-12-30 WO PCT/CN2019/130195 patent/WO2021134358A1/zh not_active Ceased
-
2022
- 2022-06-30 US US17/854,094 patent/US20220335943A1/en not_active Abandoned
Patent Citations (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN105045481A (zh) * | 2015-06-26 | 2015-11-11 | 深圳市金立通信设备有限公司 | 一种操作方法及终端 |
| CN105988707A (zh) * | 2015-06-30 | 2016-10-05 | 乐视致新电子科技(天津)有限公司 | 一种控制电子设备的方法及装置 |
| CN105677223A (zh) * | 2016-01-07 | 2016-06-15 | 广东欧珀移动通信有限公司 | 一种按压触控方法和装置 |
| CN107436698A (zh) * | 2016-05-27 | 2017-12-05 | 深圳市科潮达科技有限公司 | 触控系统及方法 |
| US20180318705A1 (en) * | 2017-05-04 | 2018-11-08 | Roblox Corporation | Touch-Screen Based Game Controller |
| CN109388318A (zh) * | 2018-09-30 | 2019-02-26 | 联想(北京)有限公司 | 一种控制方法及电子设备 |
Non-Patent Citations (1)
| Title |
|---|
| See also references of EP3992768A4 * |
Cited By (7)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20230094042A1 (en) * | 2021-09-24 | 2023-03-30 | Google Llc | Personalized autonomous spreadsheets |
| US20230106058A1 (en) * | 2021-09-24 | 2023-04-06 | Google Llc | Autonomous spreadsheet creation |
| US12001782B2 (en) * | 2021-09-24 | 2024-06-04 | Google Llc | Personalized autonomous spreadsheets |
| US12001783B2 (en) * | 2021-09-24 | 2024-06-04 | Google Llc | Autonomous spreadsheet creation |
| US12493738B2 (en) | 2021-09-24 | 2025-12-09 | Google Llc | Personalized autonomous spreadsheets |
| US12499307B2 (en) | 2021-09-24 | 2025-12-16 | Google Llc | Autonomous spreadsheet creation |
| US12547823B2 (en) | 2021-09-24 | 2026-02-10 | Google Llc | Dynamically and selectively updated spreadsheets based on knowledge monitoring and natural language processing |
Also Published As
| Publication number | Publication date |
|---|---|
| US20220335943A1 (en) | 2022-10-20 |
| CN113330409B (zh) | 2024-10-11 |
| JP2022547667A (ja) | 2022-11-15 |
| EP3992768A1 (en) | 2022-05-04 |
| EP3992768A4 (en) | 2022-08-24 |
| CN113330409A (zh) | 2021-08-31 |
| JP7413513B2 (ja) | 2024-01-15 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN111625179A (zh) | 图形绘制方法、电子设备及计算机存储介质 | |
| US20220335943A1 (en) | Human-computer interaction method, apparatus, and system | |
| EP2843531A2 (en) | Electronic device and method for providing content according to field attribute | |
| CN107924286A (zh) | 电子设备及电子设备的输入方法 | |
| WO2019184490A1 (zh) | 用于显示寄宿应用的图标的方法、设备和存储介质 | |
| WO2020073980A1 (zh) | 寄宿应用的处理方法、设备及计算机可读存储介质 | |
| CN105580024A (zh) | 一种截屏方法及装置 | |
| CN108595220A (zh) | 应用组件的处理方法、设备及计算机可读存储介质 | |
| US10983625B2 (en) | Systems and methods for measurement of unsupported user interface actions | |
| CN112218134A (zh) | 一种输入方法以及相关设备 | |
| WO2020200263A1 (zh) | 信息流中图片的处理方法、设备及计算机可读存储介质 | |
| WO2019033655A1 (zh) | 一种防误触的方法、装置、设备及存储介质 | |
| CN107748639A (zh) | 一种曲线的编辑方法、装置、设备和存储介质 | |
| CN104679392A (zh) | 基于状态栏的页面显示方法及装置 | |
| CN107735762A (zh) | 屏幕控制方法以及用于支持该方法的电子设备 | |
| WO2019105391A1 (zh) | 无线局域网络的连接方法、设备及计算机可读存储介质 | |
| CN108492349B (zh) | 书写笔画的处理方法、装置、设备和存储介质 | |
| WO2020135010A1 (zh) | 寄宿应用的处理方法、设备及计算机可读存储介质 | |
| CN112073301A (zh) | 删除聊天群组成员的方法、设备及计算机可读介质 | |
| CN108885556A (zh) | 控制数字输入 | |
| WO2019218684A1 (zh) | 应用分区的处理方法、设备及计算机可读存储介质 | |
| CN110955787B (zh) | 用户头像的设置方法、计算机设备及计算机可读存储介质 | |
| CN116521028A (zh) | 光标定位方法、装置、设备及存储介质 | |
| WO2023000867A1 (zh) | 页面配置方法和装置 | |
| WO2022206171A1 (zh) | 远程书写显示的方法、装置和存储介质 |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 19958385 Country of ref document: EP Kind code of ref document: A1 |
|
| ENP | Entry into the national phase |
Ref document number: 2019958385 Country of ref document: EP Effective date: 20220128 |
|
| ENP | Entry into the national phase |
Ref document number: 2022513988 Country of ref document: JP Kind code of ref document: A |
|
| NENP | Non-entry into the national phase |
Ref country code: DE |