WO2025001769A1 - 一种数字助手虚拟形象的显示方法及相关装置 - Google Patents

一种数字助手虚拟形象的显示方法及相关装置 Download PDF

Info

Publication number
WO2025001769A1
WO2025001769A1 PCT/CN2024/097229 CN2024097229W WO2025001769A1 WO 2025001769 A1 WO2025001769 A1 WO 2025001769A1 CN 2024097229 W CN2024097229 W CN 2024097229W WO 2025001769 A1 WO2025001769 A1 WO 2025001769A1
Authority
WO
WIPO (PCT)
Prior art keywords
interface
feature
digital assistant
state
displayed
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/CN2024/097229
Other languages
English (en)
French (fr)
Inventor
王一凡
姜顺吉
林俐
郑江震
王宗波
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Huawei Technologies Co Ltd
Original Assignee
Huawei Technologies Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from CN202310786258.8A external-priority patent/CN117008762A/zh
Application filed by Huawei Technologies Co Ltd filed Critical Huawei Technologies Co Ltd
Priority to EP24830416.4A priority Critical patent/EP4645047A4/en
Publication of WO2025001769A1 publication Critical patent/WO2025001769A1/zh
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/048Interaction techniques based on graphical user interfaces [GUI]
    • G06F3/0481Interaction techniques based on graphical user interfaces [GUI] based on specific properties of the displayed interaction object or a metaphor-based environment, e.g. interaction with desktop elements like windows or icons, or assisted by a cursor's changing behaviour or appearance
    • G06F3/04817Interaction techniques based on graphical user interfaces [GUI] based on specific properties of the displayed interaction object or a metaphor-based environment, e.g. interaction with desktop elements like windows or icons, or assisted by a cursor's changing behaviour or appearance using icons
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F1/00Details not covered by groups G06F3/00 - G06F13/00 and G06F21/00
    • G06F1/26Power supply means, e.g. regulation thereof
    • G06F1/32Means for saving power
    • G06F1/3203Power management, i.e. event-based initiation of a power-saving mode
    • G06F1/3206Monitoring of events, devices or parameters that trigger a change in power modality
    • G06F1/3231Monitoring the presence, absence or movement of users
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/011Arrangements for interaction with the human body, e.g. for user immersion in virtual reality
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/017Gesture based interaction, e.g. based on a set of recognized hand gestures
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/048Interaction techniques based on graphical user interfaces [GUI]
    • G06F3/0481Interaction techniques based on graphical user interfaces [GUI] based on specific properties of the displayed interaction object or a metaphor-based environment, e.g. interaction with desktop elements like windows or icons, or assisted by a cursor's changing behaviour or appearance
    • G06F3/04815Interaction with a metaphor-based environment or interaction object displayed as three-dimensional [3D], e.g. changing the user viewpoint with respect to the environment or object
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/048Interaction techniques based on graphical user interfaces [GUI]
    • G06F3/0484Interaction techniques based on graphical user interfaces [GUI] for the control of specific functions or operations, e.g. selecting or manipulating an object, an image or a displayed text element, setting a parameter value or selecting a range
    • G06F3/04845Interaction techniques based on graphical user interfaces [GUI] for the control of specific functions or operations, e.g. selecting or manipulating an object, an image or a displayed text element, setting a parameter value or selecting a range for image manipulation, e.g. dragging, rotation, expansion or change of colour
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/048Interaction techniques based on graphical user interfaces [GUI]
    • G06F3/0484Interaction techniques based on graphical user interfaces [GUI] for the control of specific functions or operations, e.g. selecting or manipulating an object, an image or a displayed text element, setting a parameter value or selecting a range
    • G06F3/04847Interaction techniques to control parameter settings, e.g. interaction with sliders or dials
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/048Interaction techniques based on graphical user interfaces [GUI]
    • G06F3/0487Interaction techniques based on graphical user interfaces [GUI] using specific features provided by the input device, e.g. functions controlled by the rotation of a mouse with dual sensing arrangements, or of the nature of the input device, e.g. tap gestures based on pressure sensed by a digitiser
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/16Sound input; Sound output
    • G06F3/167Audio in a user interface, e.g. using voice commands for navigating, audio feedback
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/44Arrangements for executing specific programs
    • G06F9/451Execution arrangements for user interfaces
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/44Arrangements for executing specific programs
    • G06F9/451Execution arrangements for user interfaces
    • G06F9/453Help systems
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/442Monitoring of processes or resources, e.g. detecting the failure of a recording device, monitoring the downstream bandwidth, the number of times a movie has been viewed, the storage space available from the internal hard disk
    • H04N21/44213Monitoring of end-user related data
    • H04N21/44218Detecting physical presence or behaviour of the user, e.g. using sensors to detect if the user is leaving the room or changes his face expression during a TV programme

Definitions

  • the present application relates to the field of human-computer interaction technology, and in particular to a method for displaying a virtual image of a digital assistant and related devices.
  • the user interface of the digital assistants will display an assistant image, which is usually a two-dimensional pattern that is static or continuously moves in a certain pattern.
  • the digital assistant In the user interface of the digital assistant, after the user further inputs voice commands or gesture commands, it is usually necessary to wait for the digital assistant to output the recognition results before confirming whether the digital assistant has received the voice commands or gesture commands. Later, the digital assistant's form does not change during the process of recognizing or processing commands. Therefore, after the user inputs voice commands or gesture commands, it is usually necessary to wait for the digital assistant to output the recognition results before confirming whether the digital assistant has received the voice commands or gesture commands. That is, when the digital assistant recognizes or processes the commands input by the user, the user cannot get feedback from the digital assistant, and it is difficult to identify the current working status of the digital assistant. It is easy to mistakenly believe that the digital assistant has not received the command, which causes the user to repeat the command.
  • the present application provides a method for displaying a digital assistant virtual image, which can improve the naturalness of the interaction between the user and the digital assistant.
  • the first aspect of the present application provides a method for displaying a digital assistant virtual image, which is applied to an electronic device, and the virtual image of the digital assistant includes multiple feature bodies.
  • the virtual image is first displayed in a first form on a first interface; then, a signal instruction generated by the electronic device is detected, and the virtual image is displayed in a second form.
  • the signal instruction is generated by the electronic device based on the received first user input, and the user input can be before or after the first form is displayed. That is, after the electronic device receives the first user input, a signal instruction is generated, thereby triggering the virtual image to switch from the first form to the second form, so that the form of the virtual image displayed on the interface of the electronic device changes.
  • first form and the second form both display at least one of a plurality of characteristic bodies, and the change from the first form to the second form displays that at least one characteristic body undergoes at least one of a change in shape, a change in position, a change in movement pattern, and a change in display area.
  • the different forms of the virtual image represent the different working states of the digital assistant, that is, the digital assistant uses the form of the virtual image to indicate the current working state of itself.
  • the digital assistant uses the form of the virtual image to indicate the current working state of itself.
  • the virtual image of the digital assistant is set to include multiple features, and after the electronic device obtains user input, the electronic device generates a corresponding signal instruction, thereby triggering the change of the form of the virtual image displayed on the interface, thereby achieving the use of different forms to represent the different working states of the digital assistant, so that the user can confirm the current state of interaction with the digital assistant through the changes in the form of the digital assistant, thereby improving the naturalness of the interaction between the user and the digital assistant.
  • the plurality of feature bodies include a first feature body
  • the plurality of feature bodies include the first feature body
  • the shape change of the first feature body includes at least one of the following changes: enlargement, reduction, stretching, compression, rotation, and switching between a main form, a geometric array, or a strip. That is, the shape of the first feature body includes: a main form, a geometric array, and a strip.
  • the shape change of the first feature body includes switching between multiple shapes, and enlarging, reducing, stretching, compressing, or rotating each shape. That is, in addition to switching between different shapes, the shape change of the first feature body may also include enlarging, reducing, stretching, compressing, or rotating under the same shape.
  • the main form refers to the form of the first feature body under normal circumstances, while the geometric array and the long strip are the forms of the first feature body under certain specific circumstances.
  • the plurality of feature bodies further include a second feature body.
  • the main form of the first feature body is a ring
  • the second feature body is a sphere
  • the center of the ring coincides with the center of the sphere.
  • the position change of the first feature body includes at least one of a change in orbital position after flipping around the sphere and a change in position caused by the overall movement or rotation of the virtual image.
  • the motion law of the first feature body includes the ring rotating around the sphere with the center of the ring as the fulcrum according to a preset trajectory.
  • the motion law of the second feature body includes the sphere reciprocating along a straight line passing through the center of the ring and perpendicular to the plane where the ring is located according to a preset law.
  • a third feature is displayed on the first feature, and the third feature can move on the first feature and indicate the position of the user.
  • the virtual image has a breathing state
  • the virtual image is displayed in a first form on the first interface, including: if the first interface is an off-screen interface, the first form is a breathing state including a first feature body and a second feature body, and the first feature body is displayed in a main form.
  • the first interface is a screen-off interface
  • the digital assistant in a breathing state moves back and forth on the screen according to a preset pattern.
  • a third feature body is displayed on the first feature body, and the third feature body can move on the first feature body and indicate the user's position.
  • the first feature body is a ring, and the third feature body can be a light effect area on the ring.
  • the virtual image has a waiting state and a listening state
  • the first interface is a lock screen interface or any screen interface after unlocking.
  • Displaying the virtual image in a first form on the first interface specifically includes: displaying the waiting state as the first form on the first interface, the waiting state includes a first feature body, and the first feature body is displayed in a long strip as a navigation prompt line, and the navigation prompt line is used as a reference line for screen gesture navigation; wherein, when no input for waking up the digital assistant is received, the waiting state is permanently displayed on the first interface.
  • a third user input is detected and displayed on the first interface in a listening state as the first form, and the third user input is used to wake up the digital assistant.
  • the listening state includes a first feature body and a second feature body, and the first feature body is displayed in the main form.
  • the first interface is a lock screen interface or any screen interface after unlocking, and the first form is a listening state.
  • displaying the virtual image in the first form on the first interface specifically includes: the first feature body is first displayed in a geometric array; the first feature body is then changed from the geometric array to the main form, and the second feature body is displayed. That is, in the initial stage, only the first feature body is displayed, and the second feature body is not displayed, and the first feature body is initially displayed in a geometric array, covering the entire screen; then the first feature body is changed from the geometric array to the main form through shape display, and the second feature body is displayed.
  • the digital assistant when waking up the digital assistant, the digital assistant first appears in the form of a geometric array (such as a dot matrix) and gradually fills the screen; then, the digital assistant gradually transforms from the geometric array to the main form of the first feature body (i.e., a ring) and displays the second feature body (i.e., a sphere), thereby realizing the awakening process of the digital assistant in the form of an animation, increasing the intuitiveness of the interaction process, and helping users to identify the current working status of the digital assistant.
  • a geometric array such as a dot matrix
  • the first feature body is first displayed in a geometric array, specifically: a portion of the array of geometric figures is displayed at an initial display position, and then the geometric figures are displayed on the entire screen. That is, the initial display position of the geometric array is first determined; secondly, the portion of the array of geometric figures is displayed at the initial display position, and then the geometric figures are gradually spread over the entire screen. For example, a portion of the dot matrix is first displayed from an initial display position, and then the dot matrix is gradually spread to cover the entire screen.
  • the user input refers to the input used to wake up the digital assistant.
  • the initial display position may be near the user input that triggers the digital assistant to wake up.
  • the initial display position of the geometric array may be on the screen near the power button.
  • the initial position is a default position, which is an area where the virtual image is displayed, such as the bottom of the screen.
  • the initial display position of the geometric array is determined by user input, which can make the display process of the digital assistant more closely aligned with the user input, enhance the intuitiveness of the interaction between the user and the digital assistant, and improve the user experience.
  • the first interface is the lock screen interface or any screen interface after unlocking
  • the first form is the wake-up state
  • the second form is the listening state
  • the signal instruction is generated based on the user performing a preset gesture on the digital assistant in the first form.
  • the plurality of feature bodies further include a second feature body; in the second form, the first feature body is in the main form of a ring, the second feature body is a sphere, and the ring is located on the equator of the sphere in a top-down perspective.
  • the virtual image is displayed in a first form on the first interface, specifically: the virtual image is displayed at a preset position on the first interface; or the virtual image is displayed by overlaying a target content element in the first interface, wherein the target content element is a content element of a preset type.
  • the virtual image is covered on the target content element, so that the target content element is no longer displayed on the first interface.
  • the gesture event interface of the target content element is copied to the virtual image so that performing a gesture on the virtual image can realize the gesture corresponding function of the target content element.
  • the virtual image inherits the original function of the target content element, so that when a specific gesture (such as single-click) is performed on the virtual image, the gesture-corresponding function of the target content element can still be executed.
  • the first interface is a camera interface
  • the target content element is a shutter control in the camera interface
  • the virtual image also has a task execution state, which includes a first feature body; in at least one process interface during the digital assistant's execution of an automated task, the first feature body in the task execution state corresponds to at least one of a shape change and a position change.
  • the digital assistant in the process interface of the digital assistant performing automated tasks, the digital assistant is actually always in the task execution state, and the first feature body of the digital assistant in the task execution state can undergo various changes to show the user the automated process of the digital assistant performing tasks, making the automated process of the digital assistant performing tasks more intuitive and clear, making it easier for users to observe whether there are any errors in the process of the digital assistant performing automated tasks.
  • the first feature body is a geometric array
  • the display position of the geometric array matches the target control in the process interface
  • the next process interface is entered by simulating the click event of the target control.
  • the digital assistant can use the first feature body as the geometric array to simulate the user's operation of clicking the control on the interface, thereby visualizing the entire process of the digital assistant simulating the user to execute the automation task.
  • the first form is a listening state
  • the second form is a task execution state.
  • Both the listening state and the task execution state include a first feature body, and the first feature body in the listening state and the task execution state has different forms respectively; in the task execution state, the forms of the first feature body include: a geometric array within the interface, an outer frame of a digital assistant dialogue card, an outer frame of an input box when calling a digital assistant input, or an outer frame of a task result page.
  • the first feature body may be represented in different forms so as to be more in line with the content displayed on the current interface.
  • the first feature body has different morphological changes at different task progress of the automation task. That is, in the process of the digital assistant performing the automation task, at different task progress, the first feature body may change into different forms, so as to ensure that the form of the first feature body always closely fits the current interface.
  • the first feature body is changed from a geometric array covering the entire screen to a geometric array displayed only on the recognized executable object.
  • the morphological changes of the first feature body include: initially, the first feature body is displayed in the main form of a ring at an initial position relative to the second feature body (for example, the periphery of the second feature body), and the initial position is the same as or different from the position of the first feature body in the listening state; the first feature body is separated from the second feature body by the ring, and is enlarged and displayed as the outer frame of the digital assistant dialogue card; the first feature body changes from the outer frame of the dialogue card to a geometric array in the dialogue bubble of the digital assistant; after the automated task is executed, the virtual image is displayed in the listening state.
  • the method may further include: the electronic device acquires scene information, determines a target form of the first feature body based on the scene information, and displays the first feature body in the target form. That is, the target form is related to the scene information.
  • the electronic device may display the first feature body in a specific form.
  • the virtual image has a state to be awakened, which includes at least a first feature body, and the first feature body is displayed in a long strip; in response to detecting that the focus of human eye sight falls on the area where the screen is located, the electronic device displays a second feature body.
  • the digital assistant when the digital assistant is in the wake-up state, only a long bar is displayed; when the digital assistant detects that the user's eyes fall on the screen, it can be considered that the user has an interactive tendency, so the second feature body is displayed, thereby indicating that the digital assistant detects that the user has an interactive tendency and is waiting for interaction, so as to improve the naturalness of human-computer interaction.
  • the method further includes: in response to detecting a human voice, the second feature body moves closer to the first feature body. The direction of the vocal sound source.
  • the electronic device is a smart car
  • the method further includes: indicating a vehicle function or warning information by deformation or movement of a first feature body among the multiple feature bodies.
  • a second aspect of the present application provides a display device for a digital assistant virtual image, which is applied to an electronic device, wherein the virtual image includes a plurality of feature bodies.
  • the device includes:
  • a display module configured to display a virtual image in a first form on a first interface
  • the display module is further used to detect the signal instruction generated by the electronic device and display the virtual image in the second form;
  • the signal instruction is generated based on the first user input, the first form and the second form display at least one of the multiple feature bodies, and the change from the first form to the second form means that at least one feature body undergoes at least one of a shape change, a position change, a motion law change, and a display area change.
  • the plurality of feature bodies include a first feature body, and the shape change of the first feature body includes at least one of the following changes: enlargement, reduction, stretching, compression, rotation, and switching between a main form, a geometric array, or a strip.
  • the plurality of feature bodies further include a second feature body, wherein the main shape of the first feature body is a ring, the second feature body is a sphere, and the center of the ring and the center of the sphere are co-pointed;
  • the position change of the first feature body includes at least one of a change in orbital position after flipping around the sphere and a change in position caused by overall movement or rotation of the virtual image;
  • the motion law of the first characteristic body includes that the ring rotates around the sphere according to a preset trajectory with the center of the circle as the fulcrum;
  • the motion law of the second characteristic body includes that the sphere moves back and forth according to a preset law along a straight line passing through the center of the ring and perpendicular to the plane where the ring is located.
  • the display module is further configured to detect a second user input and display a third feature body on the first feature body, where the third feature body can move on the first feature body and indicate the user's position.
  • the virtual image has a breathing state
  • the virtual image is displayed in a first form on the first interface, including:
  • the first interface is a screen-off interface
  • the first form is a breathing form including a first feature body and a second feature body, and the first feature body is displayed in a main form.
  • the virtual image in a breathing state reciprocates on the screen according to a preset rule.
  • the display module is further used to:
  • a second user input is detected, and a third feature is displayed on the first feature, the third feature being movable on the first feature and indicating the user's position.
  • the virtual image has a waiting state and a listening state
  • the first interface is a lock screen interface or any screen interface after unlocking
  • Displaying a virtual image in a first form on a first interface includes:
  • the first state to be awakened is displayed on the first interface as the first form, the state to be awakened includes a first feature body, and the first feature body is displayed in a long strip as a navigation prompt line, and the navigation prompt line is used as a reference line for screen gesture navigation; wherein, when no input for waking up the digital assistant is received, the state to be awakened is permanently displayed on the first interface;
  • a third user input is detected and displayed on the first interface in a listening state as a first form.
  • the third user input is used to wake up the digital assistant.
  • the listening state includes a first feature body and a second feature body, and the first feature body is displayed in a main form.
  • the first interface is a lock screen interface or any screen interface after unlocking, and the first state is a listening state;
  • Displaying the virtual image in the first form on the first interface specifically includes:
  • the first feature is first displayed as a geometric array
  • the first feature body is then transformed from a geometric array into a main form, and a second feature body is displayed.
  • the first feature body is first displayed in a geometric array, specifically:
  • the initial display position is determined based on user input
  • the initial position is the area where the virtual image is displayed.
  • the virtual image is displayed in the first form on the first interface, specifically:
  • the virtual image is overlaid and displayed on a target content element in the first interface, wherein the target content element is a content element of a preset type.
  • the gesture event interface of the target content element is copied to the virtual image so that performing a gesture on the virtual image can realize the gesture corresponding function of the target content element.
  • the first interface is a camera interface
  • the target content element is a shutter control in the camera interface
  • the first interface is the lock screen interface or any screen interface after unlocking
  • the first form is the wake-up state
  • the second form is the listening state
  • the signal instruction is generated based on the user performing a preset gesture on the digital assistant in the first form.
  • the plurality of feature bodies further includes a second feature body
  • the first feature body is in the main form of a ring
  • the second feature body is a sphere
  • the ring is located on the equator of the sphere in a top-down perspective.
  • the virtual image further has a task execution state, and the task execution state includes a first feature body
  • the first feature body in the task execution state corresponds to at least one of a shape change and a position change.
  • the first feature body is a geometric array
  • the display position of the geometric array matches the target control in the process interface, so as to simulate a click event of the target control to enter the next process interface.
  • the first state is a listening state
  • the second state is a task execution state.
  • Both the listening state and the task execution state include a first feature body, and the first feature bodies in the listening state and the task execution state have different forms respectively.
  • the shape of the first feature body includes: a geometric array within the interface, an outer frame of a dialog card, an outer frame of an input box when calling a digital assistant input, or an outer frame of a task result page.
  • the first feature body has different morphological changes at different task progresses of the automation task.
  • the first feature body is changed from a geometric array covering the entire screen to a geometric array displayed only on the recognized executable object.
  • the morphological change of the first feature body includes:
  • the first feature body is first displayed in the main form of a ring on the periphery of the second feature body;
  • the first feature body is then separated from the second feature body by the ring and is enlarged and displayed to serve as the outer frame of the conversation card;
  • the first feature body changes from the outer frame of the dialogue card to the geometric array inside the dialogue bubble;
  • the avatar is displayed in a listening state.
  • the device further includes:
  • the acquisition module is used to acquire scene information and display a first feature body among multiple feature bodies in a target form based on the scene information.
  • the virtual image has a waiting-to-be-awakened state
  • the waiting-to-be-awakened state includes at least a first feature body, and the first feature body is displayed in a long strip;
  • the display module is also used to display a second feature body in response to detecting that the focus of the human eye falls on the area where the screen is located.
  • the display module is further configured to, in response to detecting a human voice, cause the second feature body to move relative to the first feature body toward a direction close to a source of the human voice.
  • the electronic device is a smart car
  • the display module is further used for:
  • the vehicle function or warning information is indicated by the deformation or movement of the first feature body among the multiple feature bodies.
  • the third aspect of the present application provides a display device for a digital assistant virtual image, which may include a processor, the processor and a memory are coupled, the memory stores program instructions, and when the program instructions stored in the memory are executed by the processor, the method of the first aspect or any implementation of the first aspect is implemented.
  • a display device for a digital assistant virtual image which may include a processor, the processor and a memory are coupled, the memory stores program instructions, and when the program instructions stored in the memory are executed by the processor, the method of the first aspect or any implementation of the first aspect is implemented.
  • the fourth aspect of the present application provides a computer-readable storage medium, in which a computer program is stored.
  • the computer-readable storage medium is run on a computer, the computer executes the method of any implementation manner of the first aspect.
  • a fifth aspect of the present application provides a circuit system, the circuit system includes a processing circuit, and the processing circuit is configured to execute a method in any implementation manner of the above-mentioned first aspect.
  • the sixth aspect of the present application provides a computer program product, which, when executed on a computer, enables the computer to execute a method implemented in any one of the first aspects.
  • the seventh aspect of the present application provides a chip system, which includes a processor for supporting a server or a threshold value acquisition device to implement the functions involved in any implementation of the first aspect, for example, sending or processing the data and/or information involved in the above method.
  • the chip system also includes a memory, which is used to store program instructions and data necessary for the server or communication device.
  • the chip system can be composed of chips, or it can include chips and other discrete devices.
  • FIG1 is a schematic diagram of an architecture of an application scenario provided in an embodiment of the present application.
  • FIG. 2 is a schematic diagram of the structure of an electronic device 101 provided in an embodiment of the present application.
  • FIG3 is a schematic diagram of a flow chart of a method for displaying a digital assistant virtual image provided in an embodiment of the present application
  • FIG4 is a schematic diagram of a main body image of a digital assistant provided in an embodiment of the present application.
  • FIG5 is a schematic diagram of a movement law of a digital assistant in a breathing state provided by an embodiment of the present application.
  • FIG6 is a schematic diagram of a digital assistant switching from a breathing state to a waiting-to-be-awakened state provided by an embodiment of the present application;
  • FIG7 is a schematic diagram of a digital assistant provided by an embodiment of the present application in a wake-up state
  • FIG8 is a schematic diagram of a digital assistant provided by an embodiment of the present application in a listening state
  • FIG9 is a schematic diagram of a digital assistant provided by an embodiment of the present application in a thinking state
  • FIG10 is a schematic diagram of a digital assistant provided by an embodiment of the present application in a task execution state
  • FIG11 is a schematic diagram of various forms of a digital assistant provided in an embodiment of the present application.
  • FIG12 is a schematic diagram of a digital assistant provided in an embodiment of the present application displayed in a waiting-to-be-awakened state;
  • FIG13 is a schematic diagram of a digital assistant switching from a waiting state to a listening state provided by an embodiment of the present application.
  • FIG14 is a schematic diagram of a virtual image overlaid on a camera shutter control provided by an embodiment of the present application.
  • FIG15 is a schematic diagram of a virtual image overlaid on an assistant suggestion component provided by an embodiment of the present application.
  • FIG16 is a schematic diagram of a process of a digital assistant performing an automated task provided by an embodiment of the present application.
  • FIG17 is a schematic diagram of a digital assistant provided in an embodiment of the present application, in which a circular ring is converted into a dot matrix and laid out on a task execution interface;
  • FIG18 is a schematic diagram of using a digital assistant to input text on a desktop according to an embodiment of the present application.
  • FIG19 is a schematic diagram of a ring of a digital assistant provided by an embodiment of the present application deforming and returning to its original shape;
  • FIG20 is a schematic diagram of the morphological changes of a digital assistant in a smart recognition scenario provided by an embodiment of the present application.
  • FIG21 is a schematic diagram of a digital assistant appearing in a smart recognition scenario provided by an embodiment of the present application.
  • FIG22 is a schematic diagram of a digital assistant that generates deformation based on scene information provided by an embodiment of the present application
  • FIG23 is a schematic diagram of a change of a digital assistant on a vehicle-mounted terminal provided in an embodiment of the present application.
  • FIG24 is a schematic diagram of a flow chart of a display method of a digital assistant provided in an embodiment of the present application.
  • FIG25 is a schematic diagram of an application scenario of a display method of a digital assistant provided in an embodiment of the present application.
  • FIG26 is a schematic diagram of a digital assistant in a dormant state provided by an embodiment of the present application.
  • FIG27 is a schematic diagram of a region division provided in an embodiment of the present application.
  • FIG28 is a schematic diagram of a form of switching display of a digital assistant provided in an embodiment of the present application.
  • FIG29 is a schematic diagram of changes in a digital assistant in a wake-up state provided by an embodiment of the present application.
  • FIG30 is a schematic diagram of another digital assistant in a wake-up state provided by an embodiment of the present application.
  • FIG31 is a schematic diagram of the structure of a display device for a digital assistant virtual image provided in an embodiment of the present application.
  • FIG32 is a schematic diagram of a structure of an execution device provided in an embodiment of the present application.
  • FIG33 is a schematic diagram of the structure of a computer-readable storage medium provided in an embodiment of the present application.
  • the division of units in this application is a logical division. There may be other division methods when it is implemented in actual applications. For example, multiple units can be combined or integrated into another system, or some features can be ignored or not executed.
  • the mutual coupling or direct coupling or communication connection shown or discussed can be through some interfaces, and the indirect coupling or communication connection between units can be electrical or other similar forms, which are not limited in this application.
  • the units or sub-units described as separate components may or may not be physically separated, may or may not be physical units, or may be distributed in multiple circuit units, and some or all of the units may be selected according to actual needs to achieve the purpose of the present application.
  • a digital assistant is a virtual assistant that receives user commands to replace manual operations and helps users complete a series of operations on smart devices.
  • most digital assistants are voice assistants that complete the operation of smart devices by receiving user voice commands. For example, when the smart device is a smartphone or smart TV, the digital assistant can receive user voice commands to open an application or play a song.
  • a distance sensor also called a displacement sensor, is a type of sensor that is used to sense the distance between it and an object to complete a preset function and has been widely used.
  • Distance sensors can be divided into optical distance sensors, infrared distance sensors, ultrasonic distance sensors and other types according to their working principles.
  • Most distance sensors used on mobile phones are infrared distance sensors, which have an infrared transmitting tube and an infrared receiving tube. When the infrared rays emitted by the transmitting tube are received by the receiving tube, it indicates that the distance is close and the screen needs to be turned off to avoid misoperation. When the receiving tube cannot receive the infrared rays emitted by the transmitting tube, it indicates that the distance is far and there is no need to turn off the screen.
  • the working principles of other types of distance sensors are similar. They also judge the distance through the emission and reception of a certain substance. The emitted substance can be ultrasonic waves, light pulses, etc.
  • Figure 1 is a schematic diagram of the architecture of an application scenario provided in an embodiment of the present application.
  • the application scenario includes a user and an electronic device with a digital assistant.
  • the user can issue instructions to the digital assistant in the electronic device through voice or gestures, so that the digital assistant assists the user to automatically complete a series of operations of the electronic device, such as automatically executing an application in the electronic device.
  • the electronic device is equipped with a digital assistant, and the electronic device may be equipped with a command receiving device such as a microphone or a camera, which can receive the user's voice commands or gesture commands, and then automatically complete a series of operations by processing the user's input commands through the digital assistant, and feedback the processing results to the user.
  • the electronic device also has a display screen, which can display the form of the digital assistant on the display screen.
  • the electronic device when the electronic device is at a certain distance from the user, the electronic device recognizes the user's spatial interaction behavior by sensing the user's position change or posture change, and displays the morphological changes of the digital assistant in response to the user's spatial interaction behavior on the display screen.
  • the electronic device adjusts the form of the digital assistant by obtaining the spatial interaction behavior of the user in the real physical world.
  • the electronic device can also be a virtual reality (VR) device or an augmented reality (AR) device.
  • the electronic device adjusts the form of the digital assistant by acquiring the spatial interaction behavior of the user in a virtual world such as a VR world or an AR world.
  • the user interacting with the electronic device may be a virtual image representing the real user in the virtual world.
  • the real user controls the virtual image to perform spatial interaction behavior, thereby changing the form of the digital assistant in the virtual world.
  • electronic devices with digital assistants may be, for example, smartphones (mobile phones), personal computers (PCs), laptops, tablet computers, smart TVs, mobile internet devices (MIDs), display devices in autonomous vehicles, smart speakers, wearable devices, VR devices, AR devices, display devices in industrial control, display devices in remote medical surgery, display devices in smart grids, wireless communication devices in smart cities, display devices in smart homes, etc.
  • smartphones mobile phones
  • PCs personal computers
  • laptops laptops
  • tablet computers smart TVs
  • MIDs mobile internet devices
  • display devices in autonomous vehicles smart speakers
  • wearable devices VR devices
  • AR devices display devices in industrial control
  • display devices in remote medical surgery display devices in smart grids
  • wireless communication devices in smart cities display devices in smart homes, etc.
  • FIG. 2 is a schematic diagram of the structure of an electronic device 101 provided in an embodiment of the present application.
  • the electronic device 101 includes a processor 103, and the processor 103 is coupled to a system bus 105.
  • the processor 103 may be one or more processors, each of which may include one or more processor cores.
  • a display adapter (video adapter) 107 which may drive a display 109, and the display 109 is coupled to the system bus 105.
  • the system bus 105 is coupled to an input/output (I/O) bus via a bus bridge 111.
  • An I/O interface 115 is coupled to the I/O bus.
  • the I/O interface 115 communicates with a variety of I/O devices, such as an input device 117 (such as a touch screen, etc.), an external memory 121 (for example, a hard disk, a floppy disk, an optical disk or a USB flash drive), a multimedia interface, etc.).
  • I/O devices such as an input device 117 (such as a touch screen, etc.), an external memory 121 (for example, a hard disk, a floppy disk, an optical disk or a USB flash drive), a multimedia interface, etc.).
  • a transceiver 123 which may send and/or receive radio communication signals
  • a camera 155 which may capture static and dynamic digital video images
  • an external USB port 125 may be a USB interface.
  • the processor 103 may be any conventional processor, including a reduced instruction set computing (RISC) processor, a complex instruction set computing (CISC) processor, or a combination thereof.
  • the processor may be a dedicated device such as an ASIC.
  • the electronic device 101 can communicate with the software deployment server 149 through the network interface 129.
  • the network interface 129 is a hardware network interface, such as a network card.
  • the network 127 can be an external network, such as the Internet, or an internal network, such as Ethernet or a virtual private network (VPN).
  • the network 127 can also be a wireless network, such as a WiFi network, a cellular network, etc.
  • the hard disk drive interface 131 is coupled to the system bus 105.
  • the hard disk drive interface is connected to the hard disk drive 133.
  • the internal memory 135 is coupled to the system bus 105.
  • the data running in the internal memory 135 may include an operating system (OS) 137 of the electronic device 101, an application 143, and a scheduler.
  • OS operating system
  • the operating system consists of a shell 139 and a kernel 141.
  • Shell 139 is an interface between the user and the kernel of the operating system.
  • the shell is the outermost layer of the operating system.
  • the shell manages the interaction between the user and the operating system: it waits for user input, interprets user input to the operating system, and processes various operating system output results.
  • the kernel 141 consists of those parts of the operating system that manage memory, files, peripherals, and system resources.
  • the kernel 141 interacts directly with the hardware.
  • the operating system kernel usually runs processes and provides communication between processes, provides CPU time slice management, interrupts, memory management, IO management, etc.
  • Figure 3 is a flow chart of a method for displaying a digital assistant virtual image provided in an embodiment of the present application.
  • the method for displaying a digital assistant virtual image includes the following steps 301-302, and the method for displaying a digital assistant virtual image is applied to an electronic device equipped with a digital assistant.
  • Step 301 displaying the virtual image of the digital assistant in a first form on a first interface.
  • the virtual image of the digital assistant may include multiple features.
  • the multiple features included in the virtual image will change.
  • the first form is one of the multiple forms of the virtual image of the digital assistant, which is displayed on the first interface of the electronic device.
  • Step 302 based on the signal instruction generated by the electronic device, display the virtual image in a second form.
  • the signal instruction is generated by the electronic device based on the received first user input. After the electronic device receives the first user input, the signal instruction is generated, thereby triggering the virtual image to switch from the first form to the second form, so that the form of the virtual image displayed on the interface of the electronic device changes.
  • the virtual image in the first form and the second form each includes at least one of the plurality of features.
  • the virtual image changes from the first form to the second form in particular when at least one of the features changes in shape, position, motion pattern, or display area. One less change.
  • the first user input received by the electronic device may be before the electronic device displays the first form of the virtual image, or after the first form of the virtual image is displayed. If the first user input received may be before the electronic device displays the first form of the virtual image, it means that the first user input will prompt the electronic device to display the first form of the virtual image, and change from the first form to the second form, that is, cause at least two form changes. If the first user input received may be after the electronic device displays the first form of the virtual image, it means that the first user input will prompt the digital assistant's virtual image to change from the first form to the first form, that is, cause at least one form change.
  • different forms of the virtual image represent different working states of the digital assistant, that is, the digital assistant uses the form of the virtual image to indicate the current working state of the digital assistant.
  • the digital assistant uses the form of the virtual image to indicate the current working state of the digital assistant.
  • the plurality of feature bodies include a first feature body, and the shape of the first feature body includes: a main form, a geometric array, and a long strip.
  • the shape change of the first feature body includes switching between multiple shapes, and enlarging, reducing, stretching, compressing, or rotating each shape. That is, in addition to switching between different shapes, the shape change of the first feature body may also include enlarging, reducing, stretching, compressing, or rotating the same shape.
  • the main form refers to the form of the first feature body under normal circumstances, while the geometric array and the long strip are the forms of the first feature body under certain specific circumstances.
  • the plurality of feature bodies further include a second feature body.
  • the main form of the first feature body is a ring
  • the second feature body is a sphere
  • the center of the ring coincides with the center of the sphere.
  • the position change of the first feature body includes at least one of the orbital position change after flipping around the sphere and the position change caused by the overall movement or rotation of the virtual image;
  • the motion law of the first feature body includes the ring rotating around the sphere with the center of the ring as the fulcrum according to a preset trajectory;
  • the motion law of the second feature body includes the sphere reciprocating along a straight line passing through the center of the ring and perpendicular to the plane where the ring is located according to a preset law.
  • the digital assistant can have the following six different forms.
  • Breathing state corresponds to the state of the digital assistant when there is no task processing, that is, it is used to indicate that the digital assistant is in a state of no task processing.
  • the breathing state is the state of the digital assistant in the screen-off interface.
  • Waiting state corresponds to the state of the digital assistant waiting to be awakened (essentially, the state of the digital assistant when there is no task to be processed), that is, it is used to indicate that the digital assistant is in a state of waiting for a wake-up command.
  • the waiting state is the state of the digital assistant in the bright screen interface, where the bright screen interface includes the lock screen interface or the interface after the screen is unlocked.
  • Wake-up state corresponds to the state when the digital assistant is awakened and ready to receive commands, that is, it is used to indicate that the digital assistant is in the state of waiting to receive commands.
  • Listening state corresponds to the state of the digital assistant during the process of receiving commands, that is, it is used to indicate that the digital assistant is in the command receiving state.
  • Task execution state The state of the task obtained when the digital assistant executes the parsed instruction, which is used to indicate that the digital assistant is in the task execution state.
  • the above-mentioned multiple forms are arranged from low to high according to the degree of interaction intention: breathing state, waiting to be awakened state, awakened state, listening state, thinking state, and task execution state.
  • the breathing state can be transformed into the awakened state.
  • the awakened state can be transformed into the listening state instead of the awakened state.
  • the forms of digital assistants may also be different.
  • the forms of digital assistants when applied to smartphones, the forms of digital assistants include the above-mentioned breathing state, waiting to be awakened state, listening state, thinking state, and task execution state, and the forms can be switched back and forth in sequence; when applied to smart TVs, the forms of digital assistants include the above-mentioned breathing state, awakened state, listening state, thinking state, and task execution state, and the forms can be switched back and forth in sequence.
  • the main image of the digital assistant includes at least two features, and the different forms of the digital assistant are distinguished by the changes in the shape, position, and movement law of the feature bodies.
  • each feature body can be dynamically presented to represent the concept of the digital assistant's living body.
  • Different forms of digital assistants are distinguished by the changes in the shapes, positions, and/or movement laws of different feature bodies.
  • the change in the shape of the feature body can mean that the feature body maintains its current position, while adjusting the position of the feature body presented on the display screen (for example, adjusting the display of the front of the feature body to the display of the back of the feature body), switching between a plurality of pre-set different shapes (including enlarging, reducing, stretching, compressing or rotating each shape), adjusting the position or movement of certain parts of the feature body (for example, For example, the arm on the feature body is adjusted from being stationary to waving.
  • the change in the position of the feature body may refer to the change in the position where the feature body is displayed, for example, from being displayed at the bottom of the screen to being displayed at the top of the screen or in the center of the screen.
  • the change in the motion law of the feature body may refer to the change in the motion of the entire feature body on the display screen, for example, the feature body changes from being stationary to moving along a preset trajectory, or the feature body changes from moving along a sine curve trajectory to the feature body rotating around a certain point.
  • Figure 4 is a schematic diagram of the main image of a digital assistant provided in an embodiment of the present application.
  • the main image of the digital assistant includes three characteristic bodies: a ring, a sphere, and an image body.
  • the image body is wrapped in the sphere; the ring surrounds the sphere, similar to the ring of a planet.
  • the ring, sphere, and image body can have different postures and different motion laws.
  • the posture and/or motion law of at least one characteristic body has changed.
  • a reference system is defined for the main image of the digital assistant: in the breathing state, the plane where the ring is located is the xoy plane, and the x-axis points to the screen of the electronic device, and the vertical direction is the z-axis.
  • the width direction of the electronic device is the y-axis
  • the height direction of the electronic device is the z-axis.
  • the ring, sphere and image body When the digital assistant is in the breathing state, the ring, sphere and image body all move regularly along the preset trajectory and according to the preset period. During the movement, the relative positions between the ring, sphere and image body remain unchanged, that is, the center points of the ring, sphere and image body always coincide.
  • the image body and the sphere can rotate around an axis or a point, for example, around the center of the sphere. That is, the image body and the sphere rotate while moving along the preset trajectory (similar to the earth rotating around the sun while rotating itself).
  • the avatar when the avatar has a breathing state, the avatar is displayed in the first form on the first interface, which may specifically include: if the first interface is a screen-off interface, the first form is a breathing state.
  • the breathing state includes a first feature body and a second feature body, and the first feature body is displayed in the main form.
  • the digital assistant in a breathing state makes reciprocating motion on the screen according to a preset pattern.
  • Figure 5 is a schematic diagram of the movement law of a digital assistant in a breathing state provided by an embodiment of the present application.
  • Figure 5 shows the breathing state image of the digital assistant at time t, and the ring moves back and forth along the z-axis direction, and the function of the moving distance and time conforms to a sine curve.
  • the sphere and the image body rotate around the center of the sphere.
  • corresponding light and shadow effects can be added to the sphere to indicate the rotation of the sphere.
  • the circular ring of the digital assistant i.e., the first characteristic body mentioned above
  • a gesture prompt line which can specifically be a long strip.
  • the gesture prompt line is designed to match the full-screen gestures.
  • the gesture prompt line is a global control that can be displayed at the bottom of the screen to provide a reference for full-screen gestures. For example, swiping up from the bottom usually starts from the position of the gesture prompt line.
  • the digital assistant only displays the first feature body, and the shape of the first feature body is a long strip. Other feature bodies of the digital assistant (such as the sphere and image body of the digital assistant) are hidden.
  • gesture prompt lines are usually displayed at the bottom of the screen and have specific functions.
  • the digital assistant in the wake-up state also has the function of gesture prompt lines, such as swiping left and right to switch applications.
  • Figure 6 is a schematic diagram of a digital assistant switching from a breathing state to a to-be-awakened state provided in an embodiment of the present application.
  • the virtual image of the digital assistant includes a ring, a sphere, and an image body.
  • the digital assistant is converted from the breathing state to the to-be-awakened state.
  • the ring of the digital assistant is transformed into a long strip at the bottom of the screen, and the sphere and image body of the digital assistant are hidden.
  • the ring When the digital assistant is in the awake state, the ring is fixed on the equatorial plane of the sphere and rotates around the z-axis as the user moves.
  • part of the area on the ring is set as a light effect area.
  • the brightness of the light effect area on the ring is higher than the non-light effect area on the ring.
  • the light effect area on the ring points to the user, and when the ring rotates, the light effect area moves and points to the user.
  • the ring may also rotate with the user, so that the light effect area always points to the user's location.
  • the ring may change its pitch angle (i.e., rotate around the plane where the display screen of the electronic device is located), so that the ring can simulate the "head up” or “head down” action when seeing the user approaching.
  • the first interface is a screen-off interface and the first form displayed on the first interface is a breathing state
  • a third feature body is displayed on the first feature body, and the third feature body is used to indicate the user's orientation information and can move on the first feature body.
  • the first feature body is the above-mentioned ring, and the third feature body can be a light effect area on the ring.
  • Figure 7 is a schematic diagram of a digital assistant provided by an embodiment of the present application in a wake-up state.
  • the partial area on the ring close to the side of the screen is the light effect area, which indicates the user's orientation relative to the electronic device.
  • the ring rotates around the z-axis so that the light effect area always faces the user.
  • the distance between the user and the electronic device is less than a preset value, the user continues to approach the TV, and in addition to rotating around the z-axis, the ring will also produce a change in pitch angle around the y-axis. For example, after the ring rotates upward around the y-axis for a certain angle, it gradually stops following inertia, so that the ring can simulate the "head-up" action when seeing the user approaching.
  • the ring rotates around the y-axis on the xoy plane. After the rotation, the ring is located on the yoz plane and no longer blocks the sphere and the image. In addition, when the ring flips, it can drive the sphere and the image to flip together, so that the bottom or top of the sphere and the image is facing the screen.
  • Figure 8 is a schematic diagram of a digital assistant provided by an embodiment of the present application in a listening state. As shown in Figure 8, the ring in Figure 8 rotates to the yoz plane, and the sphere and the image body also rotate.
  • the ring and the image body are always in a rotating state, and the fulcrum around which the ring and the image body rotate can be the center of the sphere.
  • the trajectory of the rotation of the ring and the image body can be random or preset.
  • part of the area on the ring in order to indicate the rotation relationship of the ring, can also be set as a light effect area, and the light effect area moves with the movement of the ring, or the light effect area continuously flows and moves on the ring.
  • the method of setting the light effect area on the ring can refer to the description of the above-mentioned awakening state, which will not be repeated here.
  • Figure 9 is a schematic diagram of a digital assistant provided by an embodiment of the present application in a thinking state.
  • Figure 9 shows two states of the digital assistant during rotation, and in these two states, the ring and the image body are always rotating.
  • the ring of the digital assistant When the digital assistant is in the task execution state, the ring of the digital assistant may be converted into other shapes (such as a geometric array) and separated from the sphere and the image body, and displayed in other locations on the interface.
  • Figure 10 is a schematic diagram of a digital assistant provided in an embodiment of the present application when the form is in the task execution state. As shown in Figure 10, the ring of the digital assistant is transformed into a geometric array, and is separated from the sphere and the image body of the digital assistant, and is displayed in other locations.
  • Figure 11 is a schematic diagram of multiple forms of a digital assistant provided in an embodiment of the present application. Among them, Figure 11 shows multiple forms included in the digital assistant.
  • the first interface showing the first form of the virtual image may be a lock screen interface or any screen interface after unlocking.
  • displaying the virtual image in the first form on the first interface may specifically include: displaying the to-be-awakened state as the first form on the first interface.
  • the to-be-awakened state includes a first feature body, and the first feature body is displayed in a long strip as a navigation prompt line.
  • the navigation prompt line is used as a reference line for screen gesture navigation.
  • the to-be-awakened state is permanently displayed on the display interface.
  • the electronic device determines a response strategy based on the recognition of the user input and displays the response result on the screen.
  • the response result displayed on the screen by the electronic device may be that the interface remains unchanged or that the interface has changed.
  • the electronic device is a smart phone
  • the user moves the probe close to the smart phone, and the face recognition hardware on the smart phone recognizes the face
  • the smart phone detects the user input.
  • the smart phone detects the approach of a human hand through an infrared sensor, and it can also be considered that the smart phone detects the user input.
  • the screen is switched from the off-screen interface to the lock-screen interface (the lock-screen interface is essentially the bright-screen interface).
  • the face recognition verification on the lock-screen interface is passed, it is unlocked to enter the desktop or the running application interface.
  • the user directly double-clicks the off-screen interface, and the electronic device switches to the lock-screen interface.
  • the digital assistant When the electronic device switches from the off-screen interface to the bright-screen interface and the digital assistant is not awakened, the digital assistant is displayed in the waiting-to-be-awakened state.
  • the ring of the digital assistant closes inward and is displayed in the form of a gesture prompt line, which can be specifically a long strip.
  • the gesture prompt line is a global control and is displayed at the bottom of the screen in any interface. It is used to provide a reference for full-screen gestures. For example, swiping up from the bottom usually swipes up from the position of the gesture prompt line.
  • the digital assistant is always displayed in a location that the user can reach and in different forms.
  • the digital assistant in the lock screen interface after the screen is turned off, and in the desktop or in-application interface when the user does not wake up the digital assistant through a preset operation, the digital assistant can be displayed in a waiting-to-be-awakened state. That is, the ring of the digital assistant closes to form a long strip as a system gesture prompt line (usually displayed at the bottom of the screen), and as the ring closes, the image body and the sphere in the ring disappear (which may be accompanied by a relevant disappearance animation).
  • FIG. 12 is a schematic diagram of a digital assistant provided in an embodiment of the present application being displayed in a waiting state.
  • the digital assistant in the off-screen interface, the digital assistant is displayed in a breathing state and can move in the off-screen interface according to a preset rule.
  • FIG. 12 (b) when the user's input causes the electronic device to enter the lock screen interface from the off-screen interface, the digital assistant moves to the bottom, the ring shrinks inward, and the image body and the sphere are squeezed to be completely hidden (this process is reflected in animation, and the animation can also be hidden).
  • the digital assistant becomes a strip and is displayed at the bottom of the screen as a system gesture prompt line.
  • FIG. 12 (a) in the off-screen interface, the digital assistant is displayed in a breathing state and can move in the off-screen interface according to a preset rule.
  • FIG. 12 (b) when the user's input causes the electronic device to enter the lock screen interface from the off-screen interface, the digital assistant moves to the bottom,
  • the user has not woken up the digital assistant, so the digital assistant is always displayed in a waiting state and displayed on the interface in the form of a gesture prompt line.
  • the digital assistant switches from the first form to the second form, which can be specifically switching from the wake-up state to the listening state.
  • the first form is a waiting state
  • the second form displayed based on the signal instruction generated by the electronic device is a listening state.
  • the signal instruction that triggers the display of the virtual image in the second form can be generated after the user performs a preset gesture on the digital assistant in the first form.
  • the user performs a long press gesture or a double-click gesture on the digital assistant in the first form.
  • This embodiment does not specifically limit the preset gesture.
  • the plurality of features include a first feature and a second feature.
  • the first feature is in the main form of a ring
  • the second feature is a sphere
  • the ring is located on the equator of the sphere in a top-down perspective.
  • the above describes an embodiment in which the first state is the waiting state, and the following describes an embodiment in which the first state is the listening state.
  • the virtual image of the digital assistant is displayed on the first interface in a listening state as a first form, wherein the third user input is used to wake up the digital assistant.
  • the third user input may be, for example, a user's voice input, an input from an external input device (such as a remote control or a mouse), a touch input generated by a user's direct interaction with an electronic device screen, or an input from a user's interaction with a physical button, a back panel gesture, or other shortcut operations.
  • the listening state of the virtual image includes a first characteristic body (such as the aforementioned ring) and a second characteristic body (such as the aforementioned sphere), and the first characteristic body is displayed in the main state.
  • a first characteristic body such as the aforementioned ring
  • a second characteristic body such as the aforementioned sphere
  • Figure 13 is a schematic diagram of a digital assistant switching from a waiting state to a listening state provided in an embodiment of the present application.
  • the user interacts with the gesture prompt line where the digital assistant is located with gestures such as single-click, double-click, or long press to trigger the awakening of the digital assistant.
  • the digital assistant is displayed in the current interface in a listening state.
  • specific content may be displayed on the interface of the electronic device, and there may be less effective space on the interface to display the digital assistant.
  • the virtual image of the digital assistant can be integrated with the specific content on the interface.
  • the above-mentioned display of the virtual image in the first form on the first interface may specifically include: displaying the virtual image at a preset position on the first interface; or, overlaying and displaying the virtual image on a target content element in the first interface, wherein the target content element is a content element of a preset type. That is, the virtual image is overlaid on the target content element, so that the target content element is no longer displayed on the first interface.
  • the target content element is, for example, a control or icon of a preset type predefined by the system.
  • a logo may refer to a space or icon having a circular shape.
  • the gesture event interface of the target content element can be copied to the avatar, so that the gesture-corresponding function of the target content element can be realized by performing a gesture on the avatar.
  • the avatar inherits the original function of the target content element, so that when a specific gesture (such as single-click) is performed on the avatar, the gesture-corresponding function of the target content element can still be executed.
  • the target content element is a shutter control in the camera interface.
  • the virtual image is displayed overlaid on the shutter control of the camera interface, and clicking the virtual image can still realize the shooting of photos or videos.
  • the gesture can be used to implement the function of the original control.
  • the display area of the virtual image implements the function of the original content element when receiving the gesture input
  • the mapping relationship between the gesture and function of the target content element can be associated with the digital assistant, so that the digital assistant triggers the corresponding function after receiving the gesture input.
  • the digital assistant integrates the original function of the control, it only needs to synchronize the trigger interface of the touch instruction to the digital assistant, so that when the area where the digital assistant is located is touched, the corresponding touch instruction is generated to realize the function of the original control.
  • the function corresponding to the control is executed according to the gesture type, regardless of whether there are other layers displayed above the control.
  • the digital assistant function itself, it is only necessary to recognize the gesture corresponding to the digital assistant and then trigger the function corresponding to the gesture.
  • Figure 14 is a schematic diagram of a virtual image overlaid on a camera shutter control provided in an embodiment of the present application.
  • the digital assistant in one of the initial states, the digital assistant is not awakened, and is displayed on the interface in a state to be awakened as a navigation prompt line.
  • the digital assistant When the user interacts with the digital assistant in the state to be awakened (i.e., the navigation prompt line), such as long pressing the navigation prompt line or long pressing the power button to wake up the digital assistant, the digital assistant is now in a listening state.
  • the system displays the digital assistant in the listening state, it first checks the type of control in the current interface. Since the camera shutter control in the current interface is a preset type of control, the digital assistant is displayed in a fusion with the camera shutter control. After the fusion display, the digital assistant is displayed in a listening state overlaid on the position of the original camera shutter control.
  • the digital assistant in another initial state, is not displayed in any state on the current interface.
  • the digital assistant is called out by long pressing a control (such as the power button) or by voice, the digital assistant can also be displayed over a preset type of control or icon on the current interface.
  • clicking the display area of the digital assistant's avatar can also trigger taking a photo. This can be achieved by the digital assistant passing instructions to the camera application to control taking photos, or by the camera application directly recognizing gestures.
  • the navigation prompt line is permanently displayed in the system globally. Then, when the digital assistant is in the wake-up state, it can also be understood that the virtual image hides the sphere, changes the circle, and merges it with the navigation prompt line, so that the navigation prompt line not only has the function of a navigation prompt line, but also serves as the permanent state of the digital assistant virtual image.
  • the digital assistant can switch between different controls or icons.
  • the digital assistant can be cut and merged from the controls of the first displayed interface to the controls or icons in the latest interface in the order of display time.
  • Figure 15 is a schematic diagram of a virtual image overlaid on an assistant suggestion component provided by an embodiment of the present application.
  • the digital assistant is displayed in a state to be awakened. In this state, the digital assistant merges the navigation prompt line, and the deformed ring overlaps the navigation prompt line.
  • the main screen page contains an assistant suggestion widget, and the icon of the virtual image is displayed in the assistant suggestion widget, then the digital assistant can flow from the navigation prompt line to the virtual icon in the assistant suggestion widget.
  • the navigation prompt line when the navigation prompt line is globally resident, it is the scheme shown in (b) of Figure 15; when the navigation prompt line is only a deformed ring, then after the digital assistant flows from the navigation prompt line to the virtual icon in the assistant suggestion widget, the navigation prompt line is no longer displayed.
  • the virtual image of the digital assistant also has a task execution state, which includes a first feature body.
  • the first feature body in the task execution state generates at least one of a shape change and a position change.
  • a digital assistant when a digital assistant performs an automated task, a series of changes will occur on the display interface of the electronic device.
  • the electronic device When the digital assistant performs an automated task of calling a certain contact, the electronic device will first open the contact interface, then select a specific contact in the contact interface, and then enter the phone interface for calling the specific contact. Then, in the process interface of the digital assistant performing an automated task, the digital assistant is actually always in the task execution state, and the first feature body of the digital assistant in the task execution state can undergo various changes to show the user the automated process of the digital assistant performing the task, making the automated process of the digital assistant performing the task more intuitive and clear, and making it easier for the user to observe whether there are any errors in the process of the digital assistant performing the automated task.
  • the first feature body is displayed in the form of a geometric array, and the display position of the geometric array is adapted to the target control in the process interface.
  • the geometric array enters the next process interface by simulating the click event of the target control.
  • the digital assistant in the process of performing the automation task, can simulate the operation of the user clicking the control on the interface with the first feature body as the geometric array, thereby visualizing the entire process of the digital assistant simulating the user to perform the automation task.
  • the application icon is the execution target of this step.
  • the dot matrix that changes to a circle can be displayed on the application icon.
  • Figure 16 is a schematic diagram of the process of a digital assistant performing an automated task provided by an embodiment of the present application.
  • the digital assistant receives the user's voice command "Share this photo to WeChat Moments”.
  • the digital assistant displays the dot matrix on the "Send" control of the current interface to simulate the user's operation of clicking the "Send” control.
  • the interface changes, and the current interface changes to display applications that can send photos, and these applications include the "WeChat” application; in the interface after the change, the dot matrix is displayed on the icon of the "WeChat” application to simulate the user's operation of clicking the "WeChat” application icon.
  • the interface changes again, changing to an interface for editing a picture to be published in the "WeChat” application's Moments, and at this time the dot matrix is displayed on the "Publish” control to simulate the user's operation of clicking the "Publish” control.
  • both the listening state and the task execution state include a first feature body, and the first feature bodies in the listening state and the task execution state have different forms.
  • the first feature body includes: a geometric array in the interface, an outer frame of a digital assistant dialog card, an outer frame of an input box when calling a digital assistant input, or an outer frame of a task result page. That is, in different scenarios, the first feature body can be represented in different forms to better fit the content displayed on the current interface.
  • the first feature body has different morphological changes corresponding to the task progress of the automated task. That is, in the process of the digital assistant performing the automated task, at different task progresses, the first feature body may change into different forms, so that the form of the first feature body always closely fits the current interface.
  • the morphological change of the first feature body includes the following process: initially, the first feature body is displayed in the main form of a ring at an initial position relative to the second feature body, and the initial position is the same as or different from the position of the first feature body in the listening state; then, the first feature body is separated from the second feature body, and after being enlarged and deformed, it serves as the outer frame of the digital assistant dialogue card; secondly, the first feature body is changed from the outer frame of the dialogue card to a geometric array within the dialogue bubble of the dialogue card; finally, after the automated task is executed, the virtual image is displayed in the listening state.
  • the task execution interface refers to the interface when the digital assistant receives the task instruction and transmits the instruction to the local application for execution based on the analysis of the task instruction, or the execution result interface. Therefore, the task execution interface can be part of the interface or the entire interface of the execution application.
  • the morphological changes of the ring include: the ring concentric with the image body and the sphere disappears and transforms into a geometric array (such as a dot matrix) of the task execution interface on the screen, or an outer frame surrounding the task execution interface.
  • a geometric array such as a dot matrix
  • the process of transforming from a ring to a geometric array or an outer frame can be accompanied by animation.
  • the ring is decomposed into several points, and then laid on the task execution interface to form a dot matrix; the ring is deformed and elongated, transformed into a frame, and enlarged into the outer frame of the task execution interface.
  • Figure 17 is a schematic diagram of a digital assistant provided by an embodiment of the present application, wherein a circular ring is converted into a dot matrix and laid on a task execution interface.
  • (a) of Figure 17 corresponds to the listening state of the digital assistant, and the language instruction input by the user at this time is "help me order a cup of Xinba coffee."
  • Figure 17(b) corresponds to the thinking state of the digital assistant (where the thinking state is a dynamic image, the ring keeps rotating according to a certain rule, and Figure 17(b) is a certain instantaneous state).
  • the thinking state corresponds to the digital assistant's task parsing process.
  • the interface for users to interact with the digital assistant shows the process of the digital assistant parsing the task: "Recommendations are being made for you based on Xinba's historical orders.”
  • the task execution phase begins, and the digital assistant is in the task execution state, corresponding to Figure 17 (c).
  • the ring of the digital assistant icon at the bottom disappears, and the task card obtained by executing the task is displayed above (i.e., a dialogue bubble, but presented in the form of a card).
  • a number of dots are displayed on the task card, which is formed by the transformation of the original ring of the digital assistant.
  • the dot matrix can disappear after being displayed for a period of time.
  • the dot matrix of the upper task card has disappeared, and the lower task card shows an instantaneous state in the process of the dot matrix disappearing (the dot matrix and interface content are displayed at the same time, and both are displayed translucently).
  • the dot matrix disappears. That is, for the task execution interface in Figure 17, the display process is: only the dot matrix is displayed ⁇ the dot matrix gradually disappears, and the interface content is gradually displayed (the dot matrix and the interface exist at the same time) - only the interface content is displayed.
  • the task execution interface can be an input box or a pop-up card interface.
  • These cards can be components or subroutines provided by locally installed applications.
  • Figure 18 is a schematic diagram of using a digital assistant to input text on the desktop provided by an embodiment of the present application.
  • the digital assistant is enabled by long pressing the power button. At this time, the digital assistant is in a listening state, and the digital assistant calls the microphone to obtain the user's voice input.
  • the digital assistant receives the voice input, and fills the voice into the input box after converting it into text; at this time, the digital assistant is in a task execution state, the ring disappears from the outside of the sphere, and is enlarged and deformed and transferred to the outside of the search box to form an outer frame surrounding the search box.
  • the search widget is expanded into a search card, and the outer frame converted by the ring can be expanded and surround the search card; of course, the outer frame converted by the ring can also disappear after the search card is presented, or disappear after a preset time.
  • the ring in the initial state (undeformed) can be displayed outside the sphere.
  • Figure 19 is a schematic diagram of a ring of a digital assistant provided in an embodiment of the present application deforming and returning to its original state. As shown in (a) of Figure 19, after the digital assistant is initially called out, the digital assistant is displayed in a listening state, and the ring on the digital assistant is displayed outside the sphere.
  • the digital assistant is displayed in the task execution state.
  • the ring of the digital assistant deforms outward, overlaps with the border position of the digital assistant's dialogue card, and is displayed on the border or outside the border, or displayed on the upper layer of the border.
  • the ring of the digital assistant becomes the outer frame of the dialogue card.
  • the task execution interface is the application interface currently being operated.
  • a smart recognition result is usually generated, which is one or more executable objects obtained by recognition (such as smart removal of people/smart removal of objects in album editing), then the task execution interface can be the area where the executable object is located.
  • the dot matrix corresponding to the circle can also be adapted to the task execution process, that is, in the recognition process stage, the dot matrix is spread all over the interface, and after the executable object is recognized, the dot matrix is displayed only in the area of the executable object.
  • Figure 20 is a schematic diagram of the morphological changes of a digital assistant in a smart recognition scenario provided by an embodiment of the present application.
  • the user calls out the digital assistant and uses the smart recognition function.
  • a picture is displayed on the interface of the smartphone, and the picture is displayed in full screen. After that, the user long presses anywhere on the interface to trigger the smart recognition function.
  • the icon of the digital assistant is displayed on the current interface. When the digital assistant just appears, the digital assistant is in a listening state.
  • the digital assistant performs smart recognition on the picture.
  • the ring of the digital assistant disappears from the outside of the sphere and is converted into a dot matrix displayed on the interface, and the dot matrix fills the screen.
  • the dot matrix fills the screen.
  • the points of the dot matrix may not be a regular dot matrix, but can move dynamically in the display area where the executable object is located. The dot matrix becomes multiple irregular discrete points.
  • the system defines long pressing the power button as a shortcut operation for smart recognition
  • the operating system displays the digital assistant icon on the interface and directly displays the task execution status, that is, after long pressing the power button, it directly enters the display interface of (b) in Figure 20.
  • the image body and sphere of the digital assistant may not be displayed on the interface, but only the changed shape of the ring body may be displayed.
  • the dot matrix may be displayed without displaying the sphere and image body below.
  • displaying the virtual image in the first form on the first interface specifically includes: in the initial stage, only the first feature body is displayed, and the second feature body is not displayed, and the first feature body is initially displayed in a geometric array, covering the entire screen; then, the first feature body is changed from the geometric array to be displayed in the main form, and the second feature body is displayed.
  • the digital assistant when waking up the digital assistant, the digital assistant first appears in the form of a geometric array (such as a dot matrix) and gradually fills the screen; then, the digital assistant gradually transforms from the geometric array to the main form of the first feature body (i.e., a ring) and displays the second feature body (i.e., a sphere), thereby realizing the awakening process of the digital assistant in the form of an animation, increasing the intuitiveness of the interaction process, and helping users to identify the current working status of the digital assistant.
  • a geometric array such as a dot matrix
  • the geometric array may be, for example, a dot matrix, that is, an array formed by a plurality of dots arranged in a regular pattern.
  • the geometric array may also be an array formed by other geometric shapes (such as polygons such as triangles and rectangles) arranged in a regular pattern.
  • the first feature body is initially displayed in a geometric array, which may specifically include: first determining an initial display position of the geometric array; secondly, displaying a portion of the geometric pattern of the array at the initial display position, and then gradually spreading the geometric pattern to cover the entire screen. For example, first displaying a portion of the dot matrix from an initial display position, and then gradually spreading the dot matrix to cover the entire screen.
  • the initial display position of the geometric array is determined based on user input.
  • the user input refers to the input used to wake up the digital assistant.
  • the initial display position may be near the user input that triggers the waking up of the digital assistant. For example, when the user input is a long press of the power button of a smartphone, the initial display position of the geometric array may be on the screen near the power button.
  • the initial position is a default position, which is an area where the virtual image is displayed, such as a preset area at the bottom of the screen.
  • the initial display position of the geometric array is determined by user input, which can make the display process of the digital assistant more closely aligned with the user input, enhance the intuitiveness of the interaction between the user and the digital assistant, and improve the user experience.
  • the appearance of the dot matrix can have a preset pattern.
  • the initial display position of the digital assistant is determined based on the user's operation, and the dot matrix is expanded from the initial display position to the full image display.
  • the display position of the digital assistant's virtual image is used as the initial display position, and the dot matrix is expanded from the initial position to the full image display.
  • Figure 21 is a schematic diagram of the appearance of a digital assistant in a smart recognition scenario provided by an embodiment of the present application.
  • the digital assistant when the user wakes up the digital assistant by long pressing the power button on the right side of the smartphone, the digital assistant is initially displayed in the form of a dot matrix on the screen near the power button, and then gradually spreads to the entire screen.
  • the digital assistant When the user wakes up the digital assistant by long pressing the digital assistant at the bottom of the screen, the digital assistant is initially displayed in the form of a dot matrix near the area where the virtual image of the digital assistant is located at the bottom, and then gradually spreads to the entire screen.
  • the morphological changes of the digital assistant may also be related to the scene information, so that some information in the current usage scenario can be accurately represented through the digital assistant's virtual image.
  • the electronic device may obtain scene information and determine the first feature of the digital assistant based on the scene information.
  • the target form of the body is used to display the first feature body in the target form. That is, the target form is related to the scene information.
  • the electronic device can display the first feature body in the specific form.
  • the scene information may include external environment information and system event information.
  • External environment information is a signal obtained by a sensor to indicate the external environment, which may be obtained through a sensor on an electronic device or a network, such as temperature, humidity, weather and other external environment information.
  • System event information is an event obtained from the system of an electronic device and occurring within a preset time, such as music being played, an alarm notification within a preset time, searching for a file and other event information.
  • the ring can be deformed based on the external environment information. For example, on a rainy day, the ring shape changes into an umbrella shape.
  • Figure 22 is a schematic diagram of a digital assistant deformed based on scene information provided in an embodiment of the present application. As shown in Figure 22, after calling out the digital assistant, if the current interface is maintained, the digital assistant is always displayed in a listening state. After maintaining a certain state (such as the listening state in Figure 22) for a preset time, the ring of the digital assistant is deformed based on the scene information. In Figure 2, the current weather is light rain, so the ring becomes an umbrella shape.
  • the ring can also be deformed based on system event information. For example, when the screen is turned off to play music, the ring of the digital assistant changes to the shape of headphones; when there is an alarm notification within the preset time, the ring of the digital assistant changes to the shape of a physical alarm clock. For another example, when the user calls the digital assistant to search for a file, the ring of the digital assistant changes to a magnifying glass.
  • the state to be awakened includes at least a first feature body, and the first feature body is currently displayed in a long strip.
  • the electronic device In response to the electronic device detecting that the focus of the human eye falls on the area where the screen is located, the electronic device displays a second feature body.
  • the digital assistant when the digital assistant is in the wake-up state, only a long bar is displayed; when the digital assistant detects that the user's eyes fall on the screen, it can be considered that the user has an interactive tendency, so the second feature body is displayed, thereby indicating that the digital assistant detects that the user has an interactive tendency and is waiting for interaction, so as to improve the naturalness of human-computer interaction.
  • the second feature body in response to detecting a human voice, may move relative to the first feature body in a direction closer to the source of the human voice. Specifically, when the second feature body just appears, its position is relatively fixed, for example, located directly above or below the first feature body.
  • the electronic device may move the second feature body according to the direction of the human voice, so that the second feature body moves with the human voice to improve the naturalness of the interaction.
  • the in-vehicle terminal can also be resident on the display screen of the vehicle (such as the central control screen) and interact with the user visually through at least one deformation in a ring or a sphere, thereby keeping the user's attention on the digital assistant and enhancing the sense of conversation.
  • scene information mainly includes human information, system information and external environment information.
  • the human information may come from at least one of the driver and the passenger, including head orientation, eye contact, gestures, posture, and voice, and may be distinguished as the main driver or the front passenger (it is usually difficult for the back seat to interact with the central control screen).
  • System information may refer to changes in manual or adaptive adjustments of vehicle control functions, such as driving mode, air conditioning settings, window defogger, seat belt status changes, etc., as well as some vehicle condition change information, such as changes in vehicle speed, driving direction, vehicle bumps, special traffic conditions, etc.
  • vehicle condition change information such as changes in vehicle speed, driving direction, vehicle bumps, special traffic conditions, etc.
  • system information may also include details such as the type of the aforementioned running application.
  • External environmental information usually includes changes in weather, light, temperature, etc.
  • virtual images corresponding to different states can trigger different deformations of the virtual form, such as the deformation of the above-mentioned ring, or the deformation of the sphere.
  • the central control screen basically does not have an off state, so the digital assistant on the vehicle terminal is displayed in the wake-up state after it is turned on. In this way, in the wake-up state, the digital assistant has different deformation states according to the different scene conditions detected.
  • FIG. 23 is a schematic diagram of the changes of a digital assistant on a vehicle-mounted terminal provided by an embodiment of the present application.
  • the initial state of the digital assistant to be awakened is to display only the deformed ring.
  • the driver or co-driver is detected looking at the central control screen, a part of the sphere is displayed. If a human voice is further detected on this basis, more sphere area is displayed, such as a hemisphere.
  • the ring can be elongated, and the sphere is located on the side of the ring in the direction of the voice source.
  • the hemisphere when the driver's voice is detected, the hemisphere is located on the side of the elongated ring body close to the driver, that is, the hemisphere moves toward the driver; when the co-driver's voice is detected, the hemisphere is located on the side of the elongated ring body close to the co-driver, that is, the hemisphere moves toward the co-driver.
  • a digital assistant can be displayed on the screen of the electronic device based on the activation or alarm of the vehicle-mounted function, and the corresponding function or alarm information can be indicated by the deformation or movement of the first feature body.
  • a digital assistant can also be used to prompt vehicle condition information. For example, when it is detected that the vehicle is in a bumpy road environment, the awakened digital assistant is displayed, and the sphere vibrates up and down along the screen where the ring is located. Alternatively, the sphere and the ring vibrate up and down together, and the vibration frequency of the two can be consistent or inconsistent.
  • the awakened digital assistant is displayed, and the sphere is rendered in white, and then the ring uses the x-axis as the fixed axis to produce an up and down swing in the z-axis direction, and as it swings, the transparency of the white rendered on the sphere becomes lower until it is completely transparent and the ring stops swinging.
  • the awakened digital assistant is displayed, and after the ring body is tilted at a preset angle, a part of the ring body is broken in the middle and becomes an arc, and the arc gradually closes to form a ring.
  • the above mainly introduces the process of the digital assistant's form changing when the electronic device is a smartphone or a car terminal.
  • the following will introduce the process of the digital assistant's form changing when the electronic device is a smart TV.
  • Figure 24 is a flow chart of a display method of a digital assistant provided in an embodiment of the present application.
  • the display method of the digital assistant includes the following steps 2401-2403, and the display method of the digital assistant is applied to an electronic device equipped with a digital assistant.
  • Step 2401 obtain the user's location and display the first form of the digital assistant based on the user's location.
  • the electronic device configured with the digital assistant may be deployed with a sensor, such as an infrared sensor, an ultrasonic sensor, or an optical distance sensor, so that the electronic device can detect the user's position based on the deployed sensor.
  • a sensor such as an infrared sensor, an ultrasonic sensor, or an optical distance sensor
  • the electronic device emits infrared light through the infrared sensor and receives infrared light returned by the user's body, and then calculates the direction and distance of the user relative to the electronic device, and then determines the user's position.
  • the electronic device may also obtain the user's location in other ways, such as obtaining the user's location from other devices.
  • the electronic device when the electronic device is a smart TV or smart speaker in a smart home, the electronic device can connect to other smart devices (such as home cameras, sweeping robots, or smart routers) through the network, so that after the other smart devices determine the user's location through their own sensors or positioning modules, the electronic device can obtain the user's location from the other smart devices.
  • other smart devices such as home cameras, sweeping robots, or smart routers
  • the first form of the digital assistant can be displayed on the display screen of the electronic device based on the user's location, that is, the first form of the digital assistant displayed by the electronic device is related to the user's location.
  • the farther the user's location is from the electronic device the lower the user's tendency to interact with the electronic device, so the electronic device can display a digital assistant form with a lower degree of interaction intention; the closer the user's location is to the electronic device, the higher the user's tendency to interact with the electronic device, so the electronic device can display a digital assistant form with a higher degree of interaction intention.
  • Step 2402 obtaining the user's spatial interaction behavior, where the spatial interaction behavior is used to indicate the user's activities in the space.
  • the electronic device After determining the user's position and displaying the first form of the digital assistant, the electronic device can further capture the user's spatial interaction behavior, that is, the user's activities in the space, through sensors, and then determine the user's further interaction tendencies.
  • the user's spatial interaction behavior may include, but is not limited to, any of the following behaviors: approaching the electronic device, moving away from the electronic device, moving to the left, moving to the right, pointing a finger at the electronic device, pointing an arm at the electronic device, facing the face toward the electronic device, and a preset gesture.
  • the user's spatial interaction behavior can be used to indicate the interaction tendency between the user and the digital assistant of the electronic device. For example, when the user's spatial interaction behavior is approaching the electronic device, it means that the user expects to conduct further interaction with the digital assistant of the electronic device; when the user's spatial interaction behavior is moving away from the electronic device, it means that the user does not expect to conduct further interaction with the digital assistant of the electronic device.
  • Step 2403 in response to the spatial interaction behavior, display the second form of the digital assistant.
  • the electronic device switches from the first form of the digital assistant to displaying the second form of the digital assistant.
  • the first form of the digital assistant and the second form of the digital assistant are used to represent different states of the digital assistant and the user's interaction process, thereby facilitating the user to identify the current interaction state of the digital assistant and issue corresponding interaction instructions.
  • the second form of the digital assistant displayed by the electronic device may be related only to the spatial interaction behavior. That is, after the electronic device obtains the user's spatial interaction behavior, no matter what the first form of the digital assistant currently displayed is, it determines to display the corresponding form of the digital assistant (i.e., the second form mentioned above) according to the pre-established mapping relationship between the spatial interaction behavior and the form of the digital assistant.
  • the second form of the digital assistant displayed by the electronic device may also be related to the first form of the digital assistant and the spatial interaction behavior displayed by the electronic device.
  • the electronic device is actually based on the first form currently displayed and the user
  • the spatial interaction behavior of the digital assistant is used to determine the second form of the digital assistant that needs to be displayed.
  • the digital assistant can have multiple forms, and different forms correspond to different degrees of interaction intention between the user and the electronic device as judged by the electronic device.
  • the electronic device When the electronic device judges that the user expects the degree of interaction intention with the electronic device to increase based on the acquired spatial interaction behavior, the electronic device can determine to display a second form with a higher degree of interaction intention than the first form based on the first form of the digital assistant currently displayed, and display the second form; conversely, when the electronic device judges that the degree of interaction intention between the user and the electronic device decreases based on the acquired spatial interaction behavior, the electronic device can determine to display another second form with a lower degree of interaction intention than the first form based on the first form of the digital assistant currently displayed, and display the other second form.
  • the electronic device may divide the area near the electronic device into multiple areas based on the location of the electronic device, and different areas are not the same relative to the location of the electronic device.
  • the electronic device can be divided into three areas, namely the first area, the second area and the third area.
  • the electronic device is located in the first area
  • the second area is located outside the first area relative to the electronic device and surrounds the first area
  • the third area is located outside the second area relative to the electronic device and surrounds the second area.
  • the first area, the second area and the third area may be areas divided with the electronic device as the center, and the plane shape of the first area is a circle or an ellipse, the plane shape of the second area is a ring surrounded outside the first area, and the plane shape of the third area is a ring surrounded outside the second area.
  • the electronic device recognizes that the user's spatial interaction behavior is moving between different areas, it can accurately determine the user's interaction tendency with the electronic device, and then adjust the form of the displayed digital assistant.
  • the electronic device switches the first form of the digital assistant to the second form.
  • the first target area belongs to any one of the above-mentioned first area, second area and third area, and the second target area can be an area adjacent to the first target area.
  • the electronic device may display a second form of the digital assistant; wherein the second form is the form corresponding to the second target area.
  • each area divided around the electronic device corresponds to a form of the digital assistant, and when the user moves from one area to another, the form of the digital assistant is switched so that the currently displayed form is the form corresponding to the area where the user moves.
  • the second form is used to indicate that the digital assistant is in a state of waiting to receive instructions.
  • the first target area is outside the second target area
  • the user moves from the first area to the second area, which means that the user moves from an area farther away from the electronic device to an area closer to the electronic device, that is, the user is approaching the electronic device, so the electronic device can recognize that the user has a further tendency to interact, thereby switching the form of the digital assistant to show the user that the digital assistant has switched from a non-task processing state to a state of waiting to receive instructions.
  • the non-task processing state may be a state indicating that the digital assistant has no tasks to be processed and is in a dormant state;
  • the state of waiting to receive instructions may refer to a state in which the digital assistant has been awakened and is ready to receive instructions at any time.
  • the state of waiting to receive instructions of the digital assistant is a state with a higher degree of interaction intention than the state of non-task processing.
  • the second form is used to indicate that the digital assistant is in a state to be received.
  • the user moves from the first target area to the second target area, which means that the user has a further tendency to interact, thereby switching the form of the digital assistant to show the user that the digital assistant has switched from a state to be received to a state to be received.
  • the instruction receiving state may refer to a state in which the digital assistant has received instructions issued by the user through the corresponding hardware on the electronic device, that is, the hardware receiving the instructions is already in a working state, for example, the digital assistant receives voice instructions issued by the user through the microphone on the electronic device.
  • the instruction receiving state of the digital assistant is a state with a higher degree of interaction intention than the state to be received.
  • the second form is used to indicate that the digital assistant is in a state of waiting to receive instructions
  • the second target area is outside the first target area
  • the second form is used to indicate that the digital assistant is in a state of no task processing. It is understandable that when the second target area is outside the first target area, the user moves from the first target area to the second target area, which means that the user is moving away from the electronic device, so the electronic device can recognize that the user has no further interaction tendency, and thus switches from the state of waiting to receive instructions to the state of no task processing with a lower degree of interaction intention.
  • the user's interaction tendency with the digital assistant is identified by obtaining the user's movement between areas with different distances from the electronic device, and then the morphology of the digital assistant is adjusted. This can well fit the relationship between social status and distance between humans and improve the naturalness of the morphology change of the digital assistant.
  • the electronic device may also obtain the user's spatial activity behavior.
  • the spatial activity behavior is used to indicate the user's activities in the space.
  • the spatial activity behavior is the behavior performed by the user without the intention to interact with the digital assistant, that is, the spatial interaction behavior can be used to indicate that the user does not have the intention to interact with the digital assistant.
  • the electronic device displays the adjusted first form of the digital assistant; wherein the adjusted first form is obtained by adjusting at least one feature of the first form, and the adjusted first form and the first form are used to represent the same state in the process of interaction between the digital assistant and the user. That is, in the case where the user does not have an intention to interact, the spatial activity behavior performed by the user will change the features of the form of the digital assistant, but will not change the form of the digital assistant, that is, will not cause the interactive state of the digital assistant to change.
  • the features adjusted in the first form may include, but are not limited to, the following features of the feature body used to represent the digital assistant: the posture of the feature body, the movement law of the feature body, the light effect of the feature body, the material of the feature body, the color of the feature body, the brightness of the feature body, etc.
  • the feature body of the digital assistant such as a ring body
  • the feature body of the digital assistant may rotate with the user; for another example, when the user points his finger in different directions, the feature body of the digital assistant may rotate or rotate with the position pointed by the user's finger; for another example, when the user gradually approaches the display screen of the electronic device, the ring body of the digital assistant may rotate and change the pitch angle.
  • the first form and the second form of the digital assistant both include multiple features, and compared with the first form, the posture and/or motion law of at least one feature in the second form is changed. That is, the different forms of the digital assistant are all composed of the same multiple features, and the difference between the different forms is that the posture and/or motion law of at least one feature has changed.
  • different feature bodies can independently adjust posture and movement rules.
  • the digital assistant in different states can be represented, forming a unified style of feedback method that responds to spatial interactive behavior input, which helps users understand the feedback of the digital assistant.
  • the multiple feature bodies used to represent the image of the digital assistant include a torus, a sphere, and an image body.
  • the image body is wrapped in the sphere, and the torus surrounds the sphere. That is, in the image of the digital assistant, from the inside to the outside, they are the image body-the sphere-the torus.
  • the sphere may be transparent or transparent so that the user can see through the sphere to see the image body wrapped in the sphere.
  • the image body may be an object of any shape designed for the digital assistant, such as an object similar to the shape of the human body, an object of irregular shape, etc.
  • the first form or the second form when used to indicate that the digital assistant is in a non-task processing state, the first form or the second form is that multiple feature bodies move along a preset trajectory, and the sphere and the image body rotate during the movement.
  • the form of the digital assistant can be manifested as moving according to predetermined rules without any interactive intention, that is, multiple feature bodies move along preset trajectories and the sphere and the image body rotate during the movement.
  • the first form or the second form when used to indicate that the digital assistant is in a state of waiting to receive commands, the first form or the second form is that a partial area in the ring body has a light effect, and the partial area is facing the user's position.
  • the digital assistant When the digital assistant is in the state of waiting to receive instructions, it means that the user has the tendency to interact with the digital assistant, and the digital assistant is ready to receive the user's instructions at any time. Therefore, the digital assistant can be manifested as always paying attention to the user (that is, the area with light effects is facing the user's position) and waiting for the user to issue instructions.
  • the second form when the second form is used to indicate that the digital assistant is in a command receiving state, the second form is that the top or bottom of the multiple features are facing the user. That is, when the digital assistant is in a command receiving state, it means that the digital assistant is receiving the user's command, so the form of the digital assistant can be expressed as the state in which the digital assistant is concentrating on receiving the command (that is, the top or bottom of the multiple features are facing the user, indicating listening to the user's command).
  • the first form and the second form are both forms of a virtual human body, and the different forms of the virtual human body differ in any one or more of the following features: the expression, action, and size of the virtual human body.
  • the form of the digital assistant is also composed of multiple feature bodies, and the multiple feature bodies can be divided according to human body parts, that is, different feature bodies correspond to
  • the plurality of feature bodies can be divided into a head feature body, an eye feature body, a mouth feature body, a left hand feature body, a right hand feature body, etc.
  • Figure 25 is a schematic diagram of an application scenario of the display method of the digital assistant provided in an embodiment of the present application.
  • the electronic device 402 can be, for example, a smart TV in the user's home, and a digital assistant is deployed on the smart TV.
  • the user 401 can control the electronic device 402 through voice commands, so that the electronic device 402 automatically executes certain applications, such as automatically opening an application for playing movies and TV shows.
  • the electronic device 402 is configured with hardware such as a distance sensor and a microphone, wherein the distance sensor is used to detect the user's position and the microphone is used to receive the user's voice command.
  • the electronic device 402 is equipped with a digital assistant in software to process the user's voice command, thereby automatically executing certain applications according to the voice command.
  • a smart TV as an example of an electronic device that executes the display method of a digital assistant, and will describe in detail the process of a smart TV executing the display method of a digital assistant.
  • FIG. 26 is a schematic diagram of a digital assistant in a dormant state provided by an embodiment of the present application.
  • the smart TV when the smart TV is not used for a long time and there are no users around the smart TV, the smart TV enters a lock screen standby state and displays the lock screen content on the display screen.
  • the digital assistant is not displayed on the display screen of the smart TV, and the digital assistant runs in the background, waiting to be awakened.
  • the lock screen content displayed on the display screen may be content unrelated to the digital assistant.
  • the lock screen content displayed on the display screen may also be a certain form of the digital assistant, such as the breathing form of the digital assistant.
  • the smart TV when the smart TV detects a user through a sensor, it can first determine the user's location based on the sensor, and then determine the area where the user is located based on the user's location, and determine the initial display form of the digital assistant (i.e., the first form mentioned above) according to the area where the user is located.
  • the initial display form of the digital assistant i.e., the first form mentioned above
  • a preset logic may be set in the smart TV to divide the space around the smart TV into multiple areas.
  • different forms of digital assistants are displayed accordingly.
  • Figure 27 is a schematic diagram of a regional division provided in an embodiment of the present application.
  • the space around the smart TV is divided into four areas according to different radii, from the outside to the inside: non-perception area A1, presence perception area A2, natural behavior area A3 and typical task area A4.
  • three radii are defined: R1, R2 and R3, where R1>R2>R3.
  • the distance between the non-perception area A1 and the smart TV is greater than R1
  • the distance between the presence perception area A2 and the smart TV is between R1 and R2
  • the distance between the natural behavior area A3 and the smart TV is between R2 and R3
  • the distance between the typical task area A4 and the smart TV is less than R3.
  • the non-perception area corresponds to the breathing state
  • the presence perception area corresponds to the awakening state
  • the natural behavior area and the typical task area correspond to the listening state.
  • the smart TV When the digital assistant of the smart TV is activated, for example, the user activates the digital assistant through the wake-up word/remote control, the smart TV will determine an initial form of the digital assistant from the breathing state, the wake-up state and the thinking state according to the user's position, and display it on the screen.
  • the initial state corresponding to the non-sensing area is a breathing state
  • the smart TV activates the main program of the digital assistant and displays the breathing state of the digital assistant.
  • the awakening state of the digital assistant is displayed.
  • the smart TV After the smart TV displays the initial form of the digital assistant based on the user's location, the smart TV will also capture the user's activities in the space through the sensors deployed by itself, and parse the sensor data according to preset logic to identify whether there is spatial interaction behavior and the type of spatial interaction behavior from the captured user spatial activities.
  • a smart TV uses a distance sensor to sense the user's position and distance relative to the TV, and determines whether the user is walking towards the TV based on the changes in position and distance over a period of time. If it is detected that the distance between the user and the TV continues to decrease over a period of time, and the real-time distance between the user and the TV is less than a preset value, it is considered that a walking input is detected. Similarly, when the user moves left or right relative to the TV, the user's horizontal left/right movement input can be detected.
  • the smart TV After detecting the user's spatial interaction behavior, the smart TV can adjust at least one feature of the initial form of the digital assistant according to the type of spatial interaction behavior, thereby achieving form switching of the digital assistant.
  • Figure 28 is a schematic diagram of a switching display of the form of a digital assistant provided in an embodiment of the present application.
  • the digital assistant in the smart TV when the digital assistant in the smart TV is activated, the user is in the non-perception zone A1, and the breathing state of the digital assistant is displayed on the smart TV, that is, the ring body moves regularly up and down along the z-axis.
  • the smart TV When the user is walking in the non-perception zone A1, the smart TV will not perceive the spatial input of the non-perception zone A1, so the user's movement will not affect the form of the digital assistant, that is, the smart TV always displays the breathing state of the digital assistant.
  • the smart TV detects that the user's spatial interaction behavior approaches the smart TV, and the user is in the presence sensing area A2, so the smart TV switches the digital assistant to the awake state.
  • the digital assistant is in the awake state, the ring body is fixed on the equatorial plane of the sphere and rotates around the z-axis following the user's movement.
  • the light effect area on the ring body points to the user.
  • Figure 29 is a schematic diagram of the change of a digital assistant in the awake state provided by an embodiment of the present application.
  • the state of the digital assistant switches from the awake state to the listening state, and the position and movement law of the ring body change.
  • the ring body rotates 90 degrees around the y-axis from the equatorial plane of the sphere to the yoz plane.
  • the light effect can change, and the entire ring body is filled with light effects.
  • the movement law of the sphere and the image body changes. For example, after the sphere and the image body rotate 90 degrees around the y-axis once, they rotate continuously around the z-axis.
  • the digital assistant displayed on the smart TV switches from the awake state to the listening state, indicating that the digital assistant is receiving the voice command issued by the user.
  • the digital assistant When the digital assistant is in the listening state, when the user inputs a voice command, for example, the voice command is "What's the weather like today?", the digital assistant will parse the voice command and perform the corresponding operation.
  • the smart TV may display the thinking state of the digital assistant, and the display duration of the thinking state is positively correlated with the length of time the digital assistant takes to parse the voice command and perform the corresponding operation.
  • the digital assistant when the user is in the natural behavior area A3, the digital assistant is in a listening state.
  • the user When the user inputs a voice command to the digital assistant, the user often needs to first input a voice command as a wake-up command, such as "Hello, Xiaoyi” or "Xiaoyi Xiaoyi", so that the digital assistant can receive and parse the voice command after the wake-up command.
  • a voice command such as "Hello, Xiaoyi" or "Xiaoyi Xiaoyi”
  • the digital assistant when the user enters the typical task area A4, the digital assistant is still in a listening state, but the user can directly input a voice command to the digital assistant without having to input a voice command as a wake-up command.
  • the digital assistant can change its own display form according to the content of the user's voice command.
  • the digital assistant's form can be to dynamically change color or rotation rhythm to follow the rhythm of the music, that is, the digital assistant will also adjust its own display form accordingly when performing music playback operations.
  • the digital assistant's form can change in brightness, color, or material following the action of turning on the lights, turning off the lights, or the type of weather being reported.
  • the digital assistant when the digital assistant parses voice commands and performs corresponding tasks based on the content of the voice commands, the digital assistant can change its own display form according to the progress of its own task execution. For example, the digital assistant can gradually change the material of the ring body, change the area of the ring body filled with specific materials, etc. according to the progress of the task.
  • the smart TV can decide based on the user's spatial interaction behavior and the area where the user is located after moving. Display the state of the digital assistant, so as to realize the state switching of the digital assistant. For example, when the user moves from the natural behavior area A3 to the presence perception area A2, the digital assistant displayed on the smart TV switches from the listening state to the awake state; when the user moves from the presence perception area A2 to the non-perception area A1, the digital assistant displayed on the smart TV switches from the awake state to the breathing state.
  • the above describes the morphological change process of the digital assistant when the main body image of the digital assistant is composed of a torus, a sphere, and an image body.
  • the following describes the morphological change process of the digital assistant when the main body image of the digital assistant is composed of a virtual human body.
  • the main body image of the digital assistant can be a virtual human body, and the shape of the digital assistant is also composed of multiple feature bodies.
  • the multiple feature bodies can be divided according to human body parts, that is, different feature bodies correspond to different parts of the virtual human body.
  • the multiple feature bodies can be divided into head feature bodies, eye feature bodies, mouth feature bodies, left hand feature bodies, right hand feature bodies, etc.
  • the smart TV can also divide the space near the smart TV into multiple areas with the location of the smart TV as the center of the circle. In this way, after the digital assistant in the smart TV is activated, the smart TV can display the corresponding form of the digital assistant according to the area where the user is located in the space and the user's spatial interaction behavior.
  • the space near the smart TV can be divided into three areas, area 1 is an area 1.2-3.7 meters away from the smart TV, area 2 is an area 0.5-1.2 meters away from the smart TV, and area 3 is an area less than 0.5 meters away from the smart TV.
  • the smart TV When the smart TV recognizes that the user is in area 1, the smart TV can consider the distance between the user and the digital assistant to be an identification distance, and the spatial interaction behaviors of the user obtained by the smart TV include: the user faces the digital assistant, the user's gaze is on the digital assistant, and the user gazes at the digital assistant for more than 3 seconds, then the smart TV can display the form of the digital assistant as a form with expressions and actions.
  • the form of the digital assistant can be a virtual human body representing the digital assistant looking at the user, blinking and smiling at the user, and the body and head swaying slightly, indicating that the digital assistant is willing to interact with the user.
  • 1.2 meters can be used as the dividing distance to control the virtual human body representing the digital assistant to look in the direction of different users in turn.
  • the smart TV When the smart TV recognizes that the user is in area 2, the smart TV can consider the distance between the user and the digital assistant to be a greeting distance, and the spatial interaction behavior of the user obtained by the smart TV includes: the user faces the digital assistant, the user's gaze point is on the digital assistant, and the user gazes at the digital assistant for more than 3 seconds, then the smart TV can display the form of the digital assistant as a form with expressions and actions, and the smart TV outputs the corresponding voice of the digital assistant.
  • the form of the digital assistant can be a virtual human body representing the digital assistant waving to the user and greeting the user; in addition, the voice of the digital assistant output by the smart TV can be to inquire about the user's needs, for example, the voice of the digital assistant can be specifically "Excuse me, how can I help you?"
  • the smart TV When the smart TV recognizes that the user is in area 3, the smart TV can consider that the distance between the user and the digital assistant is a manual touch screen operation distance, and the spatial interaction behavior of the user obtained by the smart TV includes: the user faces the digital assistant, the user's gaze point is on the digital assistant, and the user gazes at the digital assistant for more than 3 seconds, then the smart TV can display the task information that requires touch screen operation on the display screen, and proportionally reduce the main image of the digital assistant.
  • H is the projected height of the main image of the digital assistant on the screen
  • the actual height of the main image of the digital assistant is the real height of the virtual human body representing the digital assistant.
  • the real height of the male virtual human body can be, for example, 174cm
  • the real height of the female virtual human body can be, for example, 161cm.
  • the specific value can be adjusted according to the actual situation, and this embodiment does not make specific limitations on this.
  • FIG. 31 is a schematic diagram of the structure of a display device for a digital assistant virtual image provided by an embodiment of the present application.
  • the display device for the digital assistant virtual image is applied to an electronic device, and the virtual image includes multiple feature bodies.
  • the device includes:
  • Display module 3101 used for displaying a virtual image in a first form on a first interface
  • the display module 3101 is further used to detect the signal instruction generated by the electronic device and display the virtual image in the second form;
  • the signal instruction is generated based on the first user input, the first form and the second form display at least one of the multiple feature bodies, and the change from the first form to the second form means that at least one feature body undergoes at least one of a shape change, a position change, a motion law change, and a display area change.
  • the plurality of feature bodies include a first feature body, and the shape change of the first feature body includes the following changes: At least one of: zoom in, zoom out, stretch, compress, rotate, and switch between main form, geometric array, or stripe.
  • the plurality of feature bodies further include a second feature body, wherein the main shape of the first feature body is a ring, the second feature body is a sphere, and the center of the ring and the center of the sphere are co-pointed;
  • the position change of the first feature body includes at least one of a change in orbital position after flipping around the sphere and a change in position caused by overall movement or rotation of the virtual image;
  • the motion law of the first characteristic body includes that the ring rotates around the sphere according to a preset trajectory with the center of the circle as the fulcrum;
  • the motion law of the second characteristic body includes that the sphere moves back and forth according to a preset law along a straight line passing through the center of the ring and perpendicular to the plane where the ring is located.
  • the display module 3101 is further configured to detect a second user input and display a third feature body on the first feature body, where the third feature body can move on the first feature body and indicate the user's position.
  • the virtual image has a breathing state
  • the virtual image is displayed in a first form on the first interface, including:
  • the first interface is a screen-off interface
  • the first form is a breathing form including a first feature body and a second feature body, and the first feature body is displayed in a main form.
  • the virtual image in a breathing state reciprocates on the screen according to a preset rule.
  • the display module 3101 is further used to:
  • a second user input is detected, and a third feature is displayed on the first feature, the third feature being movable on the first feature and indicating the user's position.
  • the virtual image has a waiting state and a listening state
  • the first interface is a lock screen interface or any screen interface after unlocking
  • Displaying a virtual image in a first form on a first interface includes:
  • the first state to be awakened is displayed on the first interface as the first form, the state to be awakened includes a first feature body, and the first feature body is displayed in a long strip as a navigation prompt line, and the navigation prompt line is used as a reference line for screen gesture navigation; wherein, when no input for waking up the digital assistant is received, the state to be awakened is permanently displayed on the first interface;
  • a third user input is detected and displayed on the first interface in a listening state as a first form.
  • the third user input is used to wake up the digital assistant.
  • the listening state includes a first feature body and a second feature body, and the first feature body is displayed in a main form.
  • the first interface is a lock screen interface or any screen interface after unlocking, and the first state is a listening state;
  • Displaying the virtual image in the first form on the first interface specifically includes:
  • the first feature is first displayed as a geometric array
  • the first feature body is then transformed from a geometric array into a main form, and a second feature body is displayed.
  • the first feature body is first displayed in a geometric array, specifically:
  • the initial display position is determined based on user input
  • the initial position is the area where the virtual image is displayed.
  • the virtual image is displayed in the first form on the first interface, specifically:
  • the virtual image is overlaid and displayed on a target content element in the first interface, wherein the target content element is a content element of a preset type.
  • the gesture event interface of the target content element is copied to the virtual image so that performing a gesture on the virtual image can realize the gesture corresponding function of the target content element.
  • the first interface is a camera interface
  • the target content element is a shutter control in the camera interface
  • the first interface is the lock screen interface or any screen interface after unlocking
  • the first form is the wake-up state
  • the second form is the listening state
  • the signal instruction is generated based on the user performing a preset gesture on the digital assistant in the first form.
  • the plurality of feature bodies further includes a second feature body
  • the first feature body is a main form of a ring
  • the second feature body is a sphere
  • the ring is located at the red center of the sphere when viewed from above. On the road.
  • the virtual image further has a task execution state, and the task execution state includes a first feature body
  • the first feature body in the task execution state corresponds to at least one of a shape change and a position change.
  • the first feature body is a geometric array
  • the display position of the geometric array matches the target control in the process interface, so as to simulate a click event of the target control to enter the next process interface.
  • the first state is a listening state
  • the second state is a task execution state.
  • Both the listening state and the task execution state include a first feature body, and the first feature bodies in the listening state and the task execution state have different forms respectively.
  • the shape of the first feature body includes: a geometric array within the interface, an outer frame of a dialog card, an outer frame of an input box when calling a digital assistant input, or an outer frame of a task result page.
  • the first feature body has different morphological changes at different task progresses of the automation task.
  • the first feature body is changed from a geometric array covering the entire screen to a geometric array displayed only on the recognized executable object.
  • the morphological change of the first feature body includes:
  • the first feature body is first displayed in the main form of a ring on the periphery of the second feature body;
  • the first feature body is then separated from the second feature body by the ring and is enlarged and displayed to serve as the outer frame of the conversation card;
  • the first feature body changes from the outer frame of the dialogue card to the geometric array inside the dialogue bubble;
  • the avatar is displayed in a listening state.
  • the device further includes:
  • the acquisition module 3102 is used to acquire scene information and display a first feature body among multiple feature bodies in a target form based on the scene information.
  • the virtual image has a waiting-to-be-awakened state
  • the waiting-to-be-awakened state includes at least a first feature body, and the first feature body is displayed in a long strip;
  • the display module 3101 is further configured to display a second feature body in response to detecting that the focus of the human eye falls on the area where the screen is located.
  • the display module 3101 is further configured to, in response to detecting a human voice, cause the second feature body to move relative to the first feature body toward a direction close to a source of the human voice.
  • the electronic device is a smart car
  • the display module 3101 is further used for:
  • the vehicle function or warning information is indicated by the deformation or movement of the first feature body among the multiple feature bodies.
  • FIG 32 is a schematic diagram of the structure of an execution device provided in an embodiment of the present application.
  • the execution device 3200 can be specifically manifested as a mobile phone, a tablet, a laptop computer, an intelligent wearable device, a server, etc., which is not limited here.
  • the execution device 3200 includes: a receiver 3201, a transmitter 3202, a processor 3203 and a memory 3204 (wherein the number of processors 3203 in the execution device 3200 can be one or more, and one processor is taken as an example in Figure 32), wherein the processor 3203 may include an application processor 32031 and a communication processor 32032.
  • the receiver 3201, the transmitter 3202, the processor 3203 and the memory 3204 may be connected via a bus or other means.
  • the memory 3204 may include a read-only memory and a random access memory, and provides instructions and data to the processor 3203. A portion of the memory 3204 may also include a non-volatile random access memory (NVRAM).
  • NVRAM non-volatile random access memory
  • the memory 3204 stores processor and operation instructions, executable modules or data structures, or subsets thereof, or extended sets thereof, wherein the operation instructions may include various operation instructions for implementing various operations.
  • the processor 3203 controls the operation of the execution device.
  • the various components of the execution device are coupled together through a bus system, wherein the bus system includes not only a data bus but also a power bus, a control bus, and a status signal bus, etc.
  • the bus system includes not only a data bus but also a power bus, a control bus, and a status signal bus, etc.
  • various buses are referred to as bus systems in the figure.
  • the method disclosed in the above-mentioned embodiment of the present application can be applied to the processor 3203, or implemented by the processor 3203.
  • the processor 3203 can be an integrated circuit chip with signal processing capabilities. In the implementation process, each step of the above-mentioned method can be completed by an integrated logic circuit of hardware in the processor 3203 or an instruction in the form of software.
  • the above-mentioned processor 3203 can be a general-purpose processor, a digital signal processor (digital signal processing, DSP), a microprocessor or a microcontroller, and can further include an application specific integrated circuit (ASIC), a field-programmable gate array (FPGA) or other Programmable logic devices, discrete gate or transistor logic devices, discrete hardware components.
  • DSP digital signal processing
  • ASIC application specific integrated circuit
  • FPGA field-programmable gate array
  • the processor 3203 can implement or execute the methods, steps and logic block diagrams disclosed in the embodiments of the present application.
  • the general processor can be a microprocessor or the processor can also be any conventional processor, etc.
  • the steps of the method disclosed in the embodiments of the present application can be directly embodied as a hardware decoding processor for execution, or a combination of hardware and software modules in the decoding processor for execution.
  • the software module can be located in a mature storage medium in the field such as a random access memory, a flash memory, a read-only memory, a programmable read-only memory or an electrically erasable programmable memory, a register, etc.
  • the storage medium is located in the memory 3204, and the processor 3203 reads the information in the memory 3204 and completes the steps of the above method in combination with its hardware.
  • the receiver 3201 can be used to receive input digital or character information and generate signal input related to the relevant settings and function control of the execution device.
  • the transmitter 3202 can be used to output digital or character information through the first interface; the transmitter 3202 can also be used to send instructions to the disk group through the first interface to modify the data in the disk group; the transmitter 3202 can also include a display device such as a display screen.
  • the electronic device provided in the embodiment of the present application may specifically be a chip, and the chip includes: a processing unit and a communication unit, the processing unit may be, for example, a processor, and the communication unit may be, for example, an input/output interface, a pin or a circuit, etc.
  • the processing unit may execute the computer execution instructions stored in the storage unit so that the chip in the execution device executes the method described in the above embodiment.
  • the storage unit is a storage unit in the chip, such as a register, a cache, etc.
  • the storage unit may also be a storage unit located outside the chip in the wireless access device end, such as a read-only memory (ROM) or other types of static storage devices that can store static information and instructions, a random access memory (RAM), etc.
  • ROM read-only memory
  • RAM random access memory
  • Figure 33 is a schematic diagram of the structure of a computer-readable storage medium provided in an embodiment of the present application.
  • the present application also provides a computer-readable storage medium.
  • the method disclosed in Figure 3 above can be implemented as computer program instructions encoded in a machine-readable format on a computer-readable storage medium or encoded on other non-transient media or products.
  • FIG. 33 schematically illustrates a conceptual partial view of an example computer-readable storage medium including a computer program for executing a computer process on a computing device, arranged in accordance with at least some embodiments presented herein.
  • computer readable storage medium 3300 is provided using signal bearing medium 3301.
  • Signal bearing medium 3301 may include one or more program instructions 3302, which when executed by one or more processors may provide the functionality or portions of the functionality described above with respect to FIG.
  • the signal bearing medium 3301 may include a computer readable medium 3303 such as, but not limited to, a hard drive, a compact disk (CD), a digital video disk (DVD), a digital tape, a memory, a ROM or RAM, and the like.
  • a computer readable medium 3303 such as, but not limited to, a hard drive, a compact disk (CD), a digital video disk (DVD), a digital tape, a memory, a ROM or RAM, and the like.
  • the signal bearing medium 3301 may include a computer recordable medium 3304, such as, but not limited to, a memory, a read/write (R/W) CD, a R/W DVD, etc.
  • the signal bearing medium 3301 may include a communication medium 3305, such as, but not limited to, a digital and/or analog communication medium (e.g., a fiber optic cable, a waveguide, a wired communication link, a wireless communication link, etc.).
  • a wireless form of the communication medium 3305 e.g., a wireless communication medium complying with the IEEE 802.X standard or other transmission protocol.
  • the one or more program instructions 3302 may be, for example, computer executable instructions or logic implementing instructions.
  • the computing device of the computing device may be configured to provide various operations, functions, or actions in response to the program instructions 3302 communicated to the computing device via one or more of the computer readable medium 3303, the computer recordable medium 3304, and/or the communication medium 3305.
  • the device embodiments described above are merely schematic, wherein the units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical units, that is, they may be located in one place, or they may be distributed over multiple network units. Some or all of the modules may be selected according to actual needs to achieve the purpose of the scheme of this embodiment.
  • the connection relationship between the modules indicates that there is a communication connection between them, which may be specifically implemented as one or more communication buses or signal lines.
  • the storage medium such as a computer floppy disk, USB flash drive, mobile hard disk, ROM, RAM, magnetic disk or optical disk, includes a number of instructions for enabling a computer device (which may be a personal computer, training device, or network device, etc.) to execute the methods of the various embodiments of the present application.
  • all or part of the embodiments may be implemented by software, hardware, firmware or any combination thereof.
  • all or part of the embodiments may be implemented in the form of a computer program product.
  • the computer program product includes one or more computer instructions.
  • the computer can be a general-purpose computer, a special-purpose computer, a computer network, or other programmable devices.
  • the computer instructions can be stored in a computer-readable storage medium, or transmitted from one computer-readable storage medium to another computer-readable storage medium.
  • the computer instructions can be transmitted from a website site, a computer, a training device or a data center by wired (e.g., coaxial cable, optical fiber, digital subscriber line) or wireless (e.g., infrared, wireless, microwave, etc.) mode to another website site, computer, training device or data center.
  • wired e.g., coaxial cable, optical fiber, digital subscriber line
  • wireless e.g., infrared, wireless, microwave, etc.
  • the computer-readable storage medium can be any available medium that a computer can store or a data storage device such as a training device, a data center, etc. that contains one or more available media integration.
  • Available media can be magnetic media, (e.g., floppy disk, hard disk, tape), optical media (e.g., DVD), or semiconductor media (e.g., solid-state hard disk (SSD)), etc.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Human Computer Interaction (AREA)
  • Software Systems (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Social Psychology (AREA)
  • Multimedia (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Databases & Information Systems (AREA)
  • Signal Processing (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • User Interface Of Digital Computer (AREA)

Abstract

一种数字助手虚拟形象的显示方法,能够提高用户与数字助手交互的自然性。在该方法中,通过设定数字助手的虚拟形象包括多个特征体,且在电子设备获取到用户输入后,电子设备产生相应的信号指令,从而触发界面上所显示的虚拟形象的形态发生改变,从而实现以不同的形态来表示数字助手所处的不同工作状态,以便于用户通过数字助手的形态变化情况来确认当前与数字助手交互的状态,提高用户与数字助手交互的自然性。

Description

一种数字助手虚拟形象的显示方法及相关装置
本申请要求于2023年6月29日提交国家知识产权局、申请号为202310786258.8、发明名称为“一种数字助手的显示方法及相关装置”的中国专利申请以及于2024年2月8日提交国家知识产权局、申请号为202410178141.6、发明名称为“一种数字助手虚拟形象的显示方法及相关装置”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。
技术领域
本申请涉及人机交互技术领域,尤其涉及一种数字助手虚拟形象的显示方法及相关装置。
背景技术
随着智能设备的快速发展,在智能设备上配置数字助手已经逐渐成为了主流。通过数字助手来替代用户手动操作进而自动完成智能设备的一系列操作,能够解放用户的双手,提升智能设备的使用体验。
目前,用户在使用数字助手时,通常要通过语音指令或者手动操作来唤醒数字助手,从而进入数字助手的使用界面。数字助手的使用界面会显示一个助手形象,该助手形象通常为静态或持续按一定规律运动的二维图案。
在数字助手的使用界面中,当用户进一步输入语音指令或手势指令后,通常需要等待数字助手输出识别结果后,才能够确认数字助手是否已接收到语音指令或手势指令。后,数字助手在识别或处理指令的过程中,数字助手的形态并不会发生改变。因此,用户在输入语音指令或手势指令后,通常需要等待数字助手输出识别结果后,才能够确认数字助手是否已接收到语音指令或手势指令。即,在数字助手识别或处理用户输入的指令的时,用户无法得到数字助手的反馈,难以辨认数字助手当前的工作状态,容易误认数字助手没有接收到指令,从而导致用户重复下发指令。
发明内容
本申请提供了一种数字助手虚拟形象的显示方法,能够提高用户与数字助手交互的自然性。
本申请第一方面提供一种数字助手虚拟形象的显示方法,应用于电子设备,该数字助手的虚拟形象包括多个特征体。在该方法中,先在第一界面上以第一形态显示虚拟形象;然后,检测到电子设备产生的信号指令,以第二形态显示虚拟形象。其中,信号指令为电子设备基于接收到的第一用户输入产生的,且用户输入可以在显示第一形态前,也可以在显示第一形态后。即,在电子设备接收到第一用户输入后,产生信号指令,从而触发虚拟形象从第一形态切换为第二形态,以使得电子设备的界面上所显示的虚拟形象的形态发生变化。
此外,第一形态和第二形态均是展示多个特征体中的至少一个,从第一形态变化为第二形态显示是,至少一个特征体发生形状变化、位置变化、运动规律变化、显示面积变化中的至少一个。
其中,虚拟形象的不同形态代表数字助手所处的不同工作状态,即数字助手是通过虚拟形象的形态来表示当前自身所处的工作状态。这样,当界面上所显示的虚拟形象的形态发生变化时,用户可以直观清楚地了解到数字助手的工作状态的变化,从而明确当前数字助手针对用户指令的响应情况。
本方案中,通过设定数字助手的虚拟形象包括多个特征体,且在电子设备获取到用户输入后,电子设备产生相应的信号指令,从而触发界面上所显示的虚拟形象的形态发生改变,从而实现以不同的形态来表示数字助手所处的不同工作状态,以便于用户通过数字助手的形态变化情况来确认当前与数字助手交互的状态,提高用户与数字助手交互的自然性。
在一种可能的实现方式中,多个特征体包括第一特征体,多个特征体包括第一特征体,第一特征体的形状变化包括以下变化中的至少一种:放大、缩小、拉伸、压缩、旋转,以及在主形态、几何阵列或长条形之间切换。即,第一特征体的形状包括:主形态、几何阵列、长条形。并且,第一特征体的形状变化包括在多种形状之间切换,以及各形状的放大、缩小、拉伸、压缩或旋转。即,第一特征体的形状变化除了在不同的形状之间切换之外,还可以是包括在同一种形状下的放大、缩小、拉伸、压缩或旋转。
其中,主形态是指第一特征体在常规情况下的形态,而几何阵列和长条形则是第一特征体在一些特定情况下所表现的形态。
在一种可能的实现方式中,多个特征体还包括第二特征体。其中,第一特征体的主形态为一圆环,第二特征体为一球体,圆环的圆心与球体的球心共点。第一特征体的位置变化包括,围绕球体翻转后的轨道位置变化、虚拟形象整体移动或旋转所导致的位置变化中的至少一种。第一特征体的运动规律包括圆环以圆心为支点,按照预设的轨迹环绕球体转动。第二特征体的运动规律包括球体沿着经过圆环的圆心、且垂直于圆环所在平面的直线按照预设的规律做往复运动。
在一种可能的实现方式中,在检测到第二用户输入时,在第一特征体上显示第三特征体,第三特征体能够在第一特征体上移动并指示用户的方位。
在一种可能的实现方式中,虚拟形象具有呼吸态,在第一界面上以第一形态显示虚拟形象,包括:若第一界面为熄屏界面,则第一形态为包括第一特征体和第二特征体的呼吸态,且第一特征体以主形态显示。
在一种可能的实现方式中,第一界面为熄屏界面,呼吸态的数字助手在屏幕上按照预设的规律做往复运动。
在一种可能的实现方式中,在第一界面为熄屏界面,第一形态为呼吸态时,响应于检测到第二用户输入,在第一特征体上显示第三特征体,第三特征体能够在第一特征体上移动并指示用户的方位。其中,第一特征体为圆环,第三特征体则可以为圆环上的光效区域。
在一种可能的实现方式中,虚拟形象具有待唤醒态和聆听态,第一界面为锁屏界面或解锁后的任意屏幕界面。在第一界面上以第一形态显示虚拟形象,具体包括:以待唤醒态作为第一形态显示在第一界面上,待唤醒态包括第一特征体,且第一特征体作为导航提示线以长条形显示,导航提示线用于作为屏幕手势导航的基准线;其中,在未接收到唤醒数字助手的输入时,待唤醒态在第一界面常驻显示。
或者,检测到第三用户输入,以聆听态作为第一形态显示在第一界面上,第三用户输入用于唤醒数字助手,聆听态包括第一特征体和第二特征体,且第一特征体以主形态显示。
在一种可能的实现方式中,第一界面为锁屏界面或解锁后的任意屏幕界面,第一形态为聆听态。那么,在第一界面上以第一形态显示虚拟形象,具体包括:第一特征体先以几何阵列显示;第一特征体再由几何阵列变化为主形态,并显示第二特征体。即,初始阶段仅显示第一特征体,不显示第二特征体,且第一特征体初始以几何阵列显示,铺满屏幕;然后第一特征体由几何阵列经形状变化为以主形态显示,并显示第二特征体。
也就是说,在唤醒数字助手的时候,数字助手先是以几何阵列(例如点阵)的形式出现,并逐渐铺满屏幕;然后,数字助手再逐渐从几何阵列转换为第一特征体的主形态(即圆环),并且显示第二特征体(即球体),从而实现以动画的形式展示数字助手的唤醒过程,增加了交互过程的直观性,有利于用户识别到数字助手当前所处的工作状态。
在一种可能的实现方式中,第一特征体先以几何阵列显示,具体为:在初始显示位置显示部分阵列的几何图形,再在整个屏幕显示几何图形。即,先确定几何阵列的初始显示位置;其次,在初始显示位置显示部分阵列的几何图形,再将几何图形逐渐铺满屏幕。例如,先从一个初始显示位置开始显示部分点阵,然后再逐渐将点阵扩散到铺满整个屏幕。其中,用户输入是指用于唤醒数字助手的输入。
在一种可能的实现方式中,初始显示位置具体可以是在触发唤醒数字助手的用户输入的附近。比如,在用户输入为长按智能手机的电源键时,几何阵列的初始显示位置可以是在电源键附近的屏幕位置上。
并且,在未检测到用户输入时,初始位置为默认位置,该默认位置为显示虚拟形象的区域,例如屏幕底部。
本方案中,通过用户输入来确定几何阵列的初始显示位置,能够使得数字助手的显示过程与用户输入更为贴合,增强用户与数字助手交互的直观性,提高用户体验。
在一种可能的实现方式中,第一界面为锁屏界面或解锁后的任意屏幕界面,第一形态为待唤醒态,第二形态为聆听态;信号指令是基于用户对第一形态的数字助手执行预设手势后产生。
在一种可能的实现方式中,多个特征体还包括第二特征体;在第二形态下,第一特征体为圆环的主形态,第二特征体为球体,且圆环位于球体的俯视视角的赤道上。
在一种可能的实现方式中,在第一界面上以第一形态显示虚拟形象,具体为:在第一界面上的预设位置显示虚拟形象;或者,将虚拟形象覆盖显示在第一界面内的目标内容元素上,其中,目标内容元素为预设类型的内容元素。
即,虚拟形象是覆盖在目标内容元素上的,使得第一界面上不再显示目标内容元素。
在一种可能的实现方式中,当虚拟形象覆盖显示在第一界面内的目标内容元素上时,将目标内容元素的手势事件接口复制到虚拟形象上,以使得对虚拟形象执行手势能够实现目标内容元素的手势对应功能。
也就是说,即便目标内容元素被虚拟形象覆盖了,但是虚拟形象继承了目标内容元素本来的功能,从而使得对虚拟形象执行特定手势(例如单击)时,仍然能够执行目标内容元素的手势对应功能。
在一种可能的实现方式中,第一界面为相机界面,目标内容元素为相机界面内的快门控件。
在一种可能的实现方式中,虚拟形象还具有任务执行态,任务执行态包括第一特征体;在数字助手执行自动化任务过程中的至少一个过程界面,任务执行态下的第一特征体对应产生形状变化、位置变化中的至少一种变化。
具体来说,在数字助手执行自动化任务过程中的过程界面中,数字助手实际上是一直处于任务执行态,且数字助手在任务执行态下的第一特征体可以发生各种变化,以向用户展示数字助手执行任务的自动化过程,使得数字助手执行任务的自动化过程更为直观、清晰,便于用户观察数字助手执行自动化任务的过程是否出现错误。
在一种可能的实现方式中,第一特征体为几何阵列,且几何阵列的显示位置与过程界面内的目标控件相匹配,通过模拟目标控件的点击事件进入下一过程界面。也就是说,在执行自动化任务的过程中,数字助手能够以作为几何阵列的第一特征体来模拟用户点击界面上控件的操作,从而将数字助手模拟用户执行自动化任务的整个过程可视化。
在一种可能的实现方式中,第一形态为聆听态,第二形态为任务执行态,聆听态和任务执行态均包括第一特征体,且聆听态和任务执行态下的第一特征体分别具有不同的形态;在任务执行态下,第一特征体的形态包括:界面内的几何阵列、数字助手对话卡片的外框、调用数字助手输入时输入框的外框或任务结果页的外框。
即,在不同的场景下,第一特征体可以是以不同的形态来表示,以便于更为贴合当前界面所显示的内容。
在一种可能的实现方式中,在自动化任务的不同任务进度下,第一特征体具有不同的形态变化。即,在数字助手执行自动化任务的过程中,在执行到不同的任务进度下,第一特征体可能会变化为不同的形态,以便于保证第一特征体的形态始终与当前的界面紧密贴合。
在一种可能的实现方式中,在自动化任务为智慧识别任务时,第一特征体由铺满屏幕的几何阵列变化为仅显示在识别到的可执行对象上的几何阵列。
在一种可能的实现方式中,对应自动化任务的执行过程,在任务执行态下,第一特征体的形态变化包括:初始时,第一特征体先以圆环的主形态显示在相对第二特征体的初始位置(例如第二特征体的外围),初始位置与聆听态时第一特征体的位置相同或不同;第一特征体由圆环脱离第二特征体,并放大显示,作为数字助手对话卡片的外框;第一特征体由对话卡片的外框变为数字助手对话气泡内的几何阵列;在自动化任务执行完成后,虚拟形象以聆听态显示。
在一种可能的实现方式中,该方法还可以包括:电子设备获取场景信息,并基于场景信息确定第一特征体的目标形态,将第一特征体以目标形态显示。即,目标形态是与场景信息相关的。在电子设备获取到特定场景信息时,电子设备可以是以特定形态来显示第一特征体。
在一种可能的实现方式中,虚拟形象具有待唤醒态,待唤醒态至少包括第一特征体,第一特征体以长条形显示;响应于检测到人眼视线焦点落在屏幕所在区域,电子设备显示第二特征体。
也就是说,在一些场景下,数字助手处于待唤醒态时,只显示一个长条形;等到数字助手检测到有用户视线落在屏幕上时,可以认为用户有交互倾向,因此显示第二特征体,从而表征数字助手检测到用户具有交互倾向且正在等待交互,以提高人机交互的自然性。
在一种可能的实现方式中,该方法还包括:响应于检测到人声,第二特征体相对第一特征体向靠近 人声声源的方向移动。
在一种可能的实现方式中,电子设备为智能汽车,该方法还包括:通过多个特征体中第一特征体的形变或运动来指示车载功能或告警信息。
本申请第二方面提供一种数字助手虚拟形象的显示装置,应用于电子设备,虚拟形象包括多个特征体。该装置包括:
显示模块,用于在第一界面上以第一形态显示虚拟形象;
显示模块,还用于检测到电子设备产生的信号指令,以第二形态显示虚拟形象;
其中,信号指令为基于第一用户输入产生的,第一形态和第二形态展示多个特征体中的至少一个,从第一形态变化为第二形态是,至少一个特征体发生形状变化、位置变化、运动规律变化、显示面积变化中的至少一个。
在一种可能的实现方式中,多个特征体包括第一特征体,第一特征体的形状变化包括以下变化中的至少一种:放大、缩小、拉伸、压缩、旋转,以及在主形态、几何阵列或长条形之间切换。
在一种可能的实现方式中,多个特征体还包括第二特征体,其中,第一特征体的主形态为一圆环,第二特征体为一球体,圆环的圆心与球体的球心共点;
第一特征体的位置变化包括,围绕球体翻转后的轨道位置变化、虚拟形象整体移动或旋转所导致的位置变化中的至少一种;
第一特征体的运动规律包括,圆环以圆心为支点,按照预设的轨迹环绕球体转动;
第二特征体的运动规律包括,球体沿着经过圆环的圆心、且垂直于圆环所在平面的直线按照预设的规律做往复运动。
在一种可能的实现方式中,显示模块,还用于检测到第二用户输入,在第一特征体上显示第三特征体,第三特征体能够在第一特征体上移动并指示用户的方位。
在一种可能的实现方式中,虚拟形象具有呼吸态,在第一界面上以第一形态显示虚拟形象,包括:
若第一界面为熄屏界面,则第一形态为包括第一特征体和第二特征体的呼吸态,且第一特征体以主形态显示。
在一种可能的实现方式中,在第一界面为熄屏界面时,呼吸态的虚拟形象在屏幕上按照预设的规律做往复运动。
在一种可能的实现方式中,在第一界面为熄屏界面,第一形态为呼吸态时,显示模块,还用于:
检测到第二用户输入,在第一特征体上显示第三特征体,第三特征体能够在第一特征体上移动并指示用户的方位。
在一种可能的实现方式中,虚拟形象具有待唤醒态和聆听态,第一界面为锁屏界面或解锁后的任意屏幕界面;
在第一界面上以第一形态显示虚拟形象,包括:
以待唤醒态作为第一形态显示在第一界面上,待唤醒态包括第一特征体,且第一特征体作为导航提示线以长条形显示,导航提示线用于作为屏幕手势导航的基准线;其中,在未接收到唤醒数字助手的输入时,待唤醒态在第一界面常驻显示;
或者,
检测到第三用户输入,以聆听态作为第一形态显示在第一界面上,第三用户输入用于唤醒数字助手,聆听态包括第一特征体和第二特征体,且第一特征体以主形态显示。
在一种可能的实现方式中,第一界面为锁屏界面或解锁后的任意屏幕界面,第一形态为聆听态;
在第一界面上以第一形态显示虚拟形象,具体包括:
第一特征体先以几何阵列显示;
第一特征体再由几何阵列变化为主形态,并显示第二特征体。
在一种可能的实现方式中,第一特征体先以几何阵列显示,具体为:
在初始显示位置显示部分阵列的几何图形,再在整个屏幕显示几何图形。
在一种可能的实现方式中,初始显示位置是基于用户输入确定的;
且在未检测到用户输入时,初始位置为显示虚拟形象的区域。
在一种可能的实现方式中,在第一界面上以第一形态显示虚拟形象,具体为:
在第一界面上的预设位置显示虚拟形象;或者,
将虚拟形象覆盖显示在第一界面内的目标内容元素上,其中,目标内容元素为预设类型的内容元素。
在一种可能的实现方式中,当虚拟形象覆盖显示在第一界面内的目标内容元素上时,将目标内容元素的手势事件接口复制到虚拟形象上,以使得对虚拟形象执行手势能够实现目标内容元素的手势对应功能。
在一种可能的实现方式中,第一界面为相机界面,目标内容元素为相机界面内的快门控件。
在一种可能的实现方式中,第一界面为锁屏界面或解锁后的任意屏幕界面,第一形态为待唤醒态,第二形态为聆听态;信号指令是基于用户对第一形态的数字助手执行预设手势后产生。
在一种可能的实现方式中,多个特征体还包括第二特征体;
在第二形态下,第一特征体为圆环的主形态,第二特征体为球体,且圆环位于球体的俯视视角的赤道上。
在一种可能的实现方式中,虚拟形象还具有任务执行态,任务执行态包括第一特征体;
在数字助手执行自动化任务过程中的至少一个过程界面,任务执行态下的第一特征体对应产生形状变化、位置变化中的至少一种。
在一种可能的实现方式中,第一特征体为几何阵列,且几何阵列的显示位置与过程界面内的目标控件相匹配,用于模拟目标控件的点击事件以进入下一过程界面。
在一种可能的实现方式中,第一形态为聆听态,第二形态为任务执行态,聆听态和任务执行态均包括第一特征体,且聆听态和任务执行态下的第一特征体分别具有不同的形态;
在任务执行态下,第一特征体的形态包括:界面内的几何阵列、对话卡片的外框、调用数字助手输入时输入框的外框或任务结果页的外框。
在一种可能的实现方式中,在自动化任务的不同任务进度下,第一特征体具有不同的形态变化。
在一种可能的实现方式中,在自动化任务为智慧识别任务时,第一特征体由铺满屏幕的几何阵列变化为仅显示在识别到的可执行对象上的几何阵列。
在一种可能的实现方式中,在任务执行态下,第一特征体的形态变化包括:
第一特征体先以圆环的主形态显示在第二特征体的外围;
第一特征体再由圆环脱离第二特征体,并放大显示,以作为对话卡片的外框;
第一特征体由对话卡片的外框变为对话气泡内的几何阵列;
在自动化任务执行完成后,虚拟形象以聆听态显示。
在一种可能的实现方式中,装置还包括:
获取模块,用于获取场景信息,并基于场景信息以目标形态显示多个特征体中的第一特征体。
在一种可能的实现方式中,虚拟形象具有待唤醒态,待唤醒态至少包括第一特征体,第一特征体以长条形显示;
显示模块,还用于响应于检测到人眼视线焦点落在屏幕所在区域,显示第二特征体。
在一种可能的实现方式中,显示模块,还用于响应于检测到人声,使得第二特征体相对第一特征体向靠近人声声源的方向移动。
在一种可能的实现方式中,电子设备为智能汽车,显示模块,还用于:
通过多个特征体中第一特征体的形变或运动来指示车载功能或告警信息。
本申请第三方面提供一种数字助手虚拟形象的显示装置,可以包括处理器,处理器和存储器耦合,存储器存储有程序指令,当存储器存储的程序指令被处理器执行时实现上述第一方面或第一方面任一实现方式的方法。对于处理器执行第一方面的各个可能实现方式中的步骤,具体均可以参阅第一方面,此处不再赘述。
本申请第四方面提供了一种计算机可读存储介质,计算机可读存储介质中存储有计算机程序,当其在计算机上运行时,使得计算机执行上述第一方面任一实现方式的方法。
本申请第五方面提供了一种电路系统,电路系统包括处理电路,处理电路配置为执行上述第一方面任一实现方式的方法。
本申请第六方面提供了一种计算机程序产品,当其在计算机上运行时,使得计算机执行上述第一方面任一实现方式的方法。
本申请第七方面提供了一种芯片系统,该芯片系统包括处理器,用于支持服务器或门限值获取装置实现上述第一方面任一实现方式中所涉及的功能,例如,发送或处理上述方法中所涉及的数据和/或信息。在一种可能的设计中,芯片系统还包括存储器,存储器,用于保存服务器或通信设备必要的程序指令和数据。该芯片系统,可以由芯片构成,也可以包括芯片和其他分立器件。
上述第二方面至第七方面的有益效果可以参考上述第一方面的介绍,在此不再赘述。
附图说明
图1为本申请实施例提供的一种应用场景的架构示意图;
图2为本申请实施例提供的一种电子设备101的结构示意图;
图3为本申请实施例提供的一种数字助手虚拟形象的显示方法的流程示意图;
图4为本申请实施例提供的一种数字助手的本体形象的示意图;
图5为本申请实施例提供的一种数字助手的形态为呼吸态时的运动规律的示意图;
图6为本申请实施例提供的一种数字助手从呼吸态切换到待唤醒态的示意图;
图7为本申请实施例提供的一种数字助手的形态为唤醒态时的示意图;
图8为本申请实施例提供的一种数字助手的形态为聆听态时的示意图;
图9为本申请实施例提供的一种数字助手的形态为思考态时的示意图;
图10为本申请实施例提供的一种数字助手的形态为任务执行态时的示意图;
图11为本申请实施例提供的一种数字助手的多种形态的示意图;
图12为本申请实施例提供的一种数字助手以待唤醒态显示的示意图;
图13为本申请实施例提供的一种数字助手从待唤醒态切换为聆听态的示意图;
图14为本申请实施例提供的一种虚拟形象覆盖在相机快门控件上的示意图;
图15为本申请实施例提供的一种虚拟形象覆盖在助手建议组件上的示意图;
图16为本申请实施例提供的一种数字助手执行自动化任务的过程示意图;
图17为本申请实施例提供的一种数字助手的圆环转化为点阵铺在任务执行界面的示意图;
图18为本申请实施例提供的在桌面使用数字助手进行文字输入的示意图;
图19为本申请实施例提供的一种数字助手的圆环发生形变且恢复原状的示意图;
图20为本申请实施例提供的一种智慧识别场景下数字助手的形态变化示意图;
图21为本申请实施例提供的一种智慧识别场景下数字助手出现的示意图;
图22为本申请实施例提供的一种数字助手基于场景信息产生形变的示意图;
图23为本申请实施例提供的一种车载终端上数字助手的变化示意图;
图24为本申请实施例提供的一种数字助手的显示方法的流程示意图;
图25为本申请实施例提供的数字助手的显示方法的应用场景示意图;
图26为本申请实施例提供的一种数字助手处于休眠状态的示意图;
图27为本申请实施例提供的一种区域划分的示意图;
图28为本申请实施例提供的一种切换显示数字助手的形态的示意图;
图29为本申请实施例提供的一种数字助手在唤醒态的变化示意图;
图30为本申请实施例提供的另一种数字助手在唤醒态的变化示意图;
图31为本申请实施例提供的一种数字助手虚拟形象的显示装置的结构示意图;
图32为本申请实施例提供的执行设备的一种结构示意图;
图33为本申请实施例提供的一种计算机可读存储介质的结构示意图。
具体实施方式
为了使本申请的目的、技术方案及优点更加清楚明白,下面结合附图,对本申请的实施例进行描述。显然,所描述的实施例仅仅是本申请一部分的实施例,而不是全部的实施例。本领域普通技术人员可知,随着新应用场景的出现,本申请实施例提供的技术方案对于类似的技术问题,同样适用。
本申请的说明书和权利要求书及上述附图中的术语“第一”、“第二”等是用于区别类似的对象,而不必用于描述特定的顺序或先后次序。应该理解这样使用的描述在适当情况下可以互换,以便使实施例能够以除了在本申请图示或描述的内容以外的顺序实施。此外,术语“包括”和“具有”以及他们的任何变形,意图在于覆盖不排他的包含,例如,包含了一系列步骤或模块的过程、方法、系统、产品或设备不必限于清楚地列出的那些步骤或模块,而是可包括没有清楚地列出的或对于这些过程、方法、产品或设备固有的其它步骤或模块。在本申请中出现的对步骤进行的命名或者编号,并不意味着必须按照命名或者编号所指示的时间/逻辑先后顺序执行方法流程中的步骤,已经命名或者编号的流程步骤可以根据要实现的技术目的变更执行顺序,只要能达到相同或者相类似的技术效果即可。
本申请中所出现的单元的划分,是一种逻辑上的划分,实际应用中实现时可以有另外的划分方式,例如多个单元可以结合成或集成在另一个系统中,或一些特征可以忽略,或不执行,另外,所显示的或讨论的相互之间的耦合或直接耦合或通信连接可以是通过一些接口,单元之间的间接耦合或通信连接可以是电性或其他类似的形式,本申请中均不作限定。并且,作为分离部件说明的单元或子单元可以是也可以不是物理上的分离,可以是也可以不是物理单元,或者可以分布到多个电路单元中,可以根据实际的需要选择其中的部分或全部单元来实现本申请方案的目的。
为便于理解,以下先介绍本申请实施例所涉及的一些技术术语。
(1)数字助手
数字助手是指通过接收用户的指令,来替代用户的手动操作,进而帮忙用户完成智能设备的一系列操作的虚拟助手。通常来说,目前大部分的数字助手均为语音助手,通过接收用户的语音指令来完成智能设备的操作。例如,在智能设备为智能手机或智能电视的情况下,数字助手能够接收用户的语音指令来打开某个应用程序或者是播放某一首歌曲。
(2)距离传感器
距离传感器,又叫做位移传感器,是传感器的一种,用于感应其与某物体间的距离以完成预设的某种功能,得到了相当广泛的应用。
距离传感器根据其工作原理的不同可分为光学距离传感器、红外距离传感器、超声波距离传感器等多种。手机上使用的距离传感器大多是红外距离传感器,其具有一个红外线发射管和一个红外线接收管,当发射管发出的红外线被接收管接收到时,表明距离较近,需要关闭屏幕以免出现误操作现象,而当接收管接收不到发射管发射的红外线时,表明距离较远,无需关闭屏幕。其它类型距离传感器的工作原理也大同小异,也是通过某种物质的发射与接受来判断其距离的远近,其发射的物质可以是超声波,光脉冲等等。
请参阅图1,图1为本申请实施例提供的一种应用场景的架构示意图。如图1所示,应用场景中包括用户和具有数字助手的电子设备。其中,用户可以通过语音或手势来向电子设备中的数字助手下发指令,进而使得数字助手协助用户自动地完成电子设备的一系列操作,例如自动地执行电子设备中的应用程序。
电子设备上配置有数字助手,且电子设备上可以配置有拾音器或摄像头等指令接收装置,能够接收用户的语音指令或手势指令,进而通过数字助手处理用户输入的指令来自动地完成一系列操作,并向用户反馈处理结果。电子设备上还具有显示屏幕,能够在显示屏幕上显示数字助手的形态。当电子设备获取到用户输入时,电子设备产生信号指令来触发数字助手的虚拟形象的变化,从而使得数字助手与用户交互的过程更为直观。例如,在电子设备与用户具有一定距离的情况下,电子设备通过感知用户的位置变化或姿态变化,来识别用户的空间交互行为,并在显示屏幕上显示数字助手响应于用户的空间交互行为而产生的形态变化。
以上介绍的是电子设备通过获取真实物理世界下的用户的空间交互行为来调整数字助手的形态。此外,电子设备还可以为虚拟现实(virtual reality,VR)设备或增强现实(augmented reality,AR)设备, 电子设备通过获取VR世界或AR世界等虚拟世界内用户的空间交互行为来调整数字助手的形态。其中,在虚拟世界内,与电子设备进行交互的用户可以是在虚拟世界内代表真实用户的虚拟形象。真实用户通过控制虚拟形象来执行空间交互行为,从而改变虚拟世界内的数字助手的形态。
示例性地,具有数字助手的电子设备例如可以是智能手机(mobile phone)、个人电脑(personal computer,PC)、笔记本电脑、平板电脑、智慧电视、移动互联网设备(mobile internet device,MID)、自动驾驶车辆中的显示装置、智能音箱、可穿戴设备,VR设备、AR设备、工业控制(industrial control)中的显示装置、远程手术(remote medical surgery)中的显示装置、智能电网(smart grid)中的显示装置、智慧城市(smart city)中的无线通信装置、智慧家庭(smart home)中的显示装置等。
请参阅图2,图2为本申请实施例提供的一种电子设备101的结构示意图。如图2所示,电子设备101包括处理器103,处理器103和系统总线105耦合。处理器103可以是一个或者多个处理器,其中每个处理器都可以包括一个或多个处理器核。显示适配器(video adapter)107,显示适配器可以驱动显示器109,显示器109和系统总线105耦合。系统总线105通过总线桥111和输入输出(I/O)总线耦合。I/O接口115和I/O总线耦合。I/O接口115和多种I/O设备进行通信,比如输入设备117(如:触摸屏等),外存储器121,(例如,硬盘、软盘、光盘或优盘),多媒体接口等)。收发器123(可以发送和/或接收无线电通信信号),摄像头155(可以捕捉静态和动态数字视频图像)和外部USB端口125。其中,可选地,和I/O接口115相连接的接口可以是USB接口。
其中,处理器103可以是任何传统处理器,包括精简指令集计算(reduced instruction set Computing,RISC)处理器、复杂指令集计算(complex instruction set computing,CISC)处理器或上述的组合。可选地,处理器可以是诸如ASIC的专用装置。
电子设备101可以通过网络接口129和软件部署服务器149通信。示例性的,网络接口129是硬件网络接口,比如,网卡。网络127可以是外部网络,比如因特网,也可以是内部网络,比如以太网或者虚拟私人网络(virtual private network,VPN)。可选地,网络127还可以是无线网络,比如WiFi网络,蜂窝网络等。
硬盘驱动器接口131和系统总线105耦合。硬件驱动接口和硬盘驱动器133相连接。内存储器135和系统总线105耦合。运行在内存储器135的数据可以包括电子设备101的操作系统(OS)137、应用程序143和调度表。
操作系统包括Shell 139和内核(kernel)141。Shell 139是介于使用者和操作系统的内核间的一个接口。shell是操作系统最外面的一层。shell管理使用者与操作系统之间的交互:等待使用者的输入,向操作系统解释使用者的输入,并且处理各种各样的操作系统的输出结果。
内核141由操作系统中用于管理存储器、文件、外设和系统资源的那些部分组成。内核141直接与硬件交互,操作系统内核通常运行进程,并提供进程间的通信,提供CPU时间片管理、中断、内存管理和IO管理等等。
请参阅图3,图3为本申请实施例提供的一种数字助手虚拟形象的显示方法的流程示意图。如图3所示,该数字助手虚拟形象的显示方法包括以下的步骤301-302,且该数字助手虚拟形象的显示方法应用于配置有数字助手的电子设备上。
步骤301,在第一界面上以第一形态显示数字助手的虚拟形象。
本实施例中,数字助手的虚拟形象可以包括多个特征体。当虚拟形象以不同的形态来显示时,虚拟形象所包括的多个特征体会发生变化。其中,第一形态是数字助手的虚拟形象的多个形态中的一个,显示在电子设备的第一界面上。
步骤302,基于电子设备产生的信号指令,以第二形态显示虚拟形象。
其中,信号指令为电子设备基于接收到的第一用户输入产生的。在电子设备接收到第一用户输入后,产生信号指令,从而触发虚拟形象从第一形态切换为第二形态,以使得电子设备的界面上所显示的虚拟形象的形态发生变化。
其中,第一形态和第二形态的虚拟形象均包括多个特征体中的至少一个。虚拟形象从第一形态改为第二形态具体可以是,至少一个特征体发生形状变化、位置变化、运动规律变化、显示面积变化中的至 少一个变化。
需要说明的是,电子设备接收到的第一用户输入可以是在电子设备显示虚拟形象的第一形态之前,也可以是在显示虚拟形象的第一形态之后。如果接收到的第一用户输入可以是在电子设备显示虚拟形象的第一形态之前,则代表第一用户输入会促使电子设备显示虚拟形象的第一形态,并且从第一形态变化为第二形态,即引起了至少两种形态变化。如果接收到的第一用户输入可以是在电子设备显示虚拟形象的第一形态之后,则代表第一用户输入会促使数字助手的虚拟形象从第一形态变换为第一形态,即至少引起一种形态变化。
在本实施例中,虚拟形象的不同形态代表数字助手所处的不同工作状态,即数字助手是通过虚拟形象的形态来表示当前自身所处的工作状态。这样,当界面上所显示的虚拟形象的形态发生变化时,用户可以直观清楚地了解到数字助手的工作状态的变化,从而明确当前数字助手针对用户指令的响应情况。
可选的,多个特征体包括第一特征体,第一特征体的形状包括:主形态、几何阵列、长条形。并且,第一特征体的形状变化包括在多种形状之间切换,以及各形状的放大、缩小、拉伸、压缩或旋转。即,第一特征体的形状变化除了在不同的形状之间切换之外,还可以是包括在同一种形状下的放大、缩小、拉伸、压缩或旋转。其中,主形态是指第一特征体在常规情况下的形态,而几何阵列和长条形则是第一特征体在一些特定情况下所表现的形态。
可选的,多个特征体还包括第二特征体。在多个特征体中,第一特征体的主形态为一圆环,第二特征体为一球体,圆环的圆心与球体的球心共点。第一特征体的位置变化包括,围绕球体翻转后的轨道位置变化、虚拟形象整体移动或旋转所导致的位置变化中的至少一种;第一特征体的运动规律包括圆环以圆心为支点,按照预设的轨迹环绕球体转动;第二特征体的运动规律包括球体沿着经过圆环的圆心、且垂直于圆环所在平面的直线按照预设的规律做往复运动。
示例性地,对应于任务的不同处理阶段,数字助手可以有以下六种不同的形态。
呼吸态:对应无任务处理时数字助手的形态,即用于表示数字助手处于无任务处理状态。其中,呼吸态是数字助手处于熄屏界面下的形态。
待唤醒态:对应于数字助手等待唤醒时的形态(本质上也是无任务处理时数字助手的形态),即用于表示数字助手处于等待唤醒指令的状态。其中,待唤醒态是数字助手处于亮屏界面下的形态,其中亮屏界面包括锁屏界面或屏幕解锁后的界面。
唤醒态:对应数字助手被唤醒,准备接收指令时的形态,即用于表示数字助手处于待接收指令状态。
聆听态:对应接收指令过程中数字助手的形态,即用于表示数字助手处于指令接收状态。
思考态:对应接收指令完成,对指令进行解析的过程中数字助手的形态,即用于表示数字助手处于指令解析状态。
任务执行态:对应数字助手执行解析指令所得到的任务时的形态,即用于表示数字助手处于任务执行状态。
从交互意向程度上来看,上述的多种形态按照交互意向程度从低到高进行排序依次为:呼吸态、待唤醒态、唤醒态、聆听态、思考态、任务执行态。此外,在熄屏界面下,呼吸态可以是转变为唤醒态。在亮屏界面下,待唤醒态则可以是转变为聆听态,而不会转变为唤醒态。在应用于不同的电子设备时,数字助手所具有的形态可能也是不一样的。例如,在应用于智能手机时,数字助手的形态包括上述的呼吸态、待唤醒态、聆听态、思考态和任务执行态,且形态之间能够按照顺序来回进行切换;在应用于智慧电视时,数字助手的形态包括上述的呼吸态、唤醒态、聆听态、思考态和任务执行态,且形态之间能够按照顺序来回进行切换。
上述介绍了数字助手的不同形态,以下将介绍关于数字助手的本体形象。
数字助手的本体形象包括至少两个特征体,并通过特征体的形状、位置、运动规律的改变,来区分数字助手的不同形态。具体到某一个形态的数字助手的本体形象,各个特征体可以是动态地呈现,以表示数字助手活动的生命体的概念。通过不同特征体的形状、位置和/或运动规律的变化,来区分不同形态的数字助手。其中,特征体的形状发生改变可以是指特征体保持当前位置不变,而调整特征体在显示屏幕上所呈现的部位(例如将显示特征体的正面调整为显示特征体的背面)、从预先设定的多种不同的形状之间切换(包括各形状的放大、缩小、拉伸、压缩或旋转)、调整特征体上某些部位的位置或动作(例 如将特征体上的手臂从静止调整为挥手动作)。特征体的位置发生改变可以是指特征体所显示的位置发生改变,例如从显示在屏幕底部变化为显示在屏幕上方或屏幕中央。特征体的运动规律发生改变可以是指整个特征体在显示屏幕上的运动情况发生改变,例如特征体从静止改变为沿着预设轨迹移动,又例如特征体从沿着正弦曲线轨迹移动改变为特征体绕某一点自转。
请参阅图4,图4为本申请实施例提供的一种数字助手的本体形象的示意图。如图4所示,数字助手的本体形象包括三个特征体:圆环、球体和形象本体。其中,形象本体包裹在球体内;圆环环绕球体,类似行星的星环。不同形态下的数字助手,圆环、球体、形象本体可以具有不同的姿态、不同的运动规律。并且,不同形态的数字助手之间,至少一个特征体的姿态和/或运动规律发生了变化。
此外,为便于描述每个特征体的运动状态变化,为数字助手的本体形象定义一个参考系:在呼吸态下,圆环所在的平面为xoy平面,且x轴指向电子设备的屏幕,竖直方向则为z轴。简单来说,电子设备的宽度方向为y轴,电子设备的高度方向为z轴。
以下将结合附图,详细介绍数字助手的每个形态下的特征体的姿态和运动规律。
1,呼吸态
在数字助手的形态为呼吸态时,圆环、球体和形象本体均沿着预设的轨迹,并按照预设的周期进行规律性移动。在移动的过程中,圆环、球体和形象本体之间的相对位置不变,即圆环、球体和形象本体的中心点始终重合。此外,形象本体和球体可以是围绕一个轴线或一个点自转,例如,围绕球体的球心自转。即,形象本体和球体在沿着预设的轨迹移动的同时,还发生自转(类似于地球在围绕太阳旋转的同时,本身也在自转)。
示例性地,在虚拟形象具有呼吸态的情况下,在第一界面上以第一形态显示虚拟形象,具体可以是包括:若第一界面为熄屏界面,则第一形态为呼吸态。其中,呼吸态包括第一特征体和第二特征体,第一特征体以主形态显示。
可选的,在第一界面为熄屏界面的情况下,呼吸态的数字助手在屏幕上按照预设的规律做往复运动。
可以参阅图5,图5为本申请实施例提供的一种数字助手的形态为呼吸态时的运动规律的示意图。如图5所示,图5中示出了t时刻的数字助手的呼吸态形象,圆环沿着z轴方向进行往复移动,且移动的距离与时间的函数符合正弦曲线。球体和形象体在圆环运动过程中,围绕球体的球心自转。此外,为了体现球体的自转效果,可以给球体添加对应的光影效果,以示意球体的旋转。
2,待唤醒态
在数字助手的形态为待唤醒态时,数字助手的圆环(即上述的第一特征体)向内合拢,以手势提示线的方式显示,该手势提示线具体可以为一长条形。
其中,手势提示线是为了配合全面屏手势所作出的设计。手势提示线为全局控件,可以是显示在屏幕底部,用于为全面屏手势提供一个基准参考,例如底部上滑通常是从手势提示线的位置上滑。
具体来说,数字助手只显示第一特征体,且第一特征体的形状为一个长条形。数字助手的其他特征体(例如数字助手的球体及形象本体),则被隐藏。一般地,手势提示线通常显示在屏幕的底部,且具有特定的功能。对应地,待唤醒态下的数字助手也具备了手势提示线的功能,例如左右滑动切换应用。
可以参阅图6,图6为本申请实施例提供的一种数字助手从呼吸态切换到待唤醒态的示意图。如图6的(a)所示,在熄屏界面下,当数字助手处于呼吸态时,数字助手的虚拟形象包括圆环、球体和形象本体。如图6的(b)所示,当用户操作触发电子设备从熄屏界面切换到亮屏界面时,数字助手从呼吸态转换为待唤醒态。在待唤醒态下,数字助手的圆环转变为一个处于屏幕底部的长条形,且数字助手的球体和形象本体则被隐藏了。
3,唤醒态
在数字助手的形态为唤醒态时,圆环固定在球体的赤道平面,并跟随用户的移动而围绕z轴旋转。并且,为了表示圆环的旋转关系,圆环上的部分区域被设置为光效区域。圆环上的光效区域的亮度要高于圆环上的非光效区域。并且,圆环上的光效区域指向用户,当圆环旋转时,光效区域则会发生移动并指向用户。或者,当用户移动时,圆环也可以是跟随用户移动而转动,从而使得光效区域始终指向用户所在的位置。又或者,当用户走近或远离电子设备时,圆环可以发生俯仰角度的变化(即绕着电子设备的显示屏幕所在的平面转动),从而使得圆环能够模拟看到用户靠近时的“抬头”或“低头”动作。
示例性地,在第一界面为熄屏界面,第一界面上所显示的第一形态为呼吸态时,如果检测到第二用户输入,则在第一特征体上显示第三特征体,该第三特征体用于指示用户的方位信息,并能够在第一特征体上移动。其中,第一特征体为上述的圆环,第三特征体则可以为圆环上的光效区域。
可以参阅图7,图7为本申请实施例提供的一种数字助手的形态为唤醒态时的示意图。如图7所示,圆环上靠近屏幕一侧的部分区域为光效区域,该光效区域指示了用户相对电子设备的方位。当用户的方位改变时,圆环围绕z轴旋转,使得光效区域始终面向用户。此外,当用户与电子设备的距离小于某一个预设值时,用户继续靠近电视,圆环除了围绕z轴旋转之外,还会围绕y轴产生俯仰角度的变化。例如,圆环围绕y轴向上旋转一定角度后跟随惯性逐渐静止,以使得圆环能够模拟看到用户靠近时的“抬头”动作。
4,聆听态
在数字助手的形态为聆听态时,圆环在xoy平面上绕y轴旋转,转动后,圆环位于yoz平面上,不再遮挡球体和形象本体。此外,圆环在翻转时,可以带动球体和形象本体一起翻转,使得原来球体和形象本体的底部或顶部朝向屏幕。
可以参阅图8,图8为本申请实施例提供的一种数字助手的形态为聆听态时的示意图。如图8所示,图8中圆环转动至yoz平面,球体和形象体也发生旋转。
5,思考态
在数字助手的形态为思考态时,圆环与形象本体始终处于旋转状态,且圆环与形象本体旋转所围绕的支点可以是球体的中心。此外,圆环和形象本体旋转的轨迹可以是随机的,也可以是预设的。此外,在思考态下,为了表示圆环的旋转关系,圆环上的部分区域也可以被设置为光效区域,光效区域随着圆环的运动而运动、或者,光效区域在圆环上不断流转运动。其中,在圆环上设置光效区域的方式可以参考上述唤醒态的描述,在此不再赘述。
可以参阅图9,图9为本申请实施例提供的一种数字助手的形态为思考态时的示意图。图9中展示了数字助手在旋转过程中的两个状态,并且在这两个状态中,圆环与形象体始终处于旋转中。
6,任务执行态
在数字助手的形态为任务执行态时,数字助手的圆环可能会转换为其他形状(例如几何阵列)并与球体和形象本体脱离,显示在界面上的其他位置。示例性地,可以参阅图10,图10为本申请实施例提供的一种数字助手的形态为任务执行态时的示意图。如图10所示,数字助手的圆环转变为一个几何阵列,并脱离了数字助手的球体和形象本体,而是显示在其他位置上。
总的来说,可以参阅图11,图11为本申请实施例提供的一种数字助手的多种形态的示意图。其中,图11展示了数字助手所包括的多个形态。
结合上述所介绍的多种形态,以下将介绍在各种情况下,本实施例所提及的虚拟形象的第一形态和第二形态。
可选的,在虚拟形象具有待唤醒态和聆听态的情况下,上述显示虚拟形象的第一形态的第一界面可以为锁屏界面或解锁后的任意屏幕界面。
那么,在第一界面上以第一形态显示虚拟形象,具体可以包括:以待唤醒态作为第一形态显示在第一界面上。其中,待唤醒态包括第一特征体,且第一特征体以长条形显示,作为导航提示线。导航提示线则用于作为屏幕手势导航的基准线。此外,在未接收到唤醒数字助手的输入时,待唤醒态在显示界面常驻显示。
具体来说,电子设备在检测到用户输入后,根据对用户输入的识别,来确定响应策略,并在屏幕上体现响应结果。电子设备在屏幕上体现的响应结果可以是界面维持不变,也可以是界面发生了改变。
例如,当电子设备为智能手机时,用户探头靠近智能手机,智能手机上的人脸识别硬件识别到人脸,则可以认为智能手机检测到用户输入。或者,智能手机通过红外传感器检测到人手靠近,也可以认为智能手机检测到用户输入。根据智能手机所检测到的用户输入,从熄屏界面切换到锁屏界面(锁屏界面本质上为亮屏界面)。进一步地,在锁屏界面人脸识别验证通过后,解锁进入桌面或者正在运行的应用界面。再例如,用户直接双击熄屏界面,电子设备切换到锁屏界面。
当电子设备从熄屏界面切换到亮屏界面,且数字助手未被唤醒时,数字助手以待唤醒形态显示。在待唤醒形态下,数字助手的圆环向内合拢,以手势提示线的方式显示,该手势提示线具体可以为一长条形。手势提示线为全局控件,在任何界面均显示在屏幕底部,用于为全面屏手势提供一个基准参考,例如底部上滑通常是从手势提示线的位置上滑。
也就是说,从熄屏到亮屏,再到用户主动唤醒数字助手,与数字助手交互,数字助手始终显示在用户可触达的位置,并以不同形态显示。
本实施例中,在解除熄屏后的锁屏界面,以及在用户未通过预设操作唤醒数字助手时的桌面或应用内界面,数字助手均可以是以待唤醒形态显示。即,数字助手的圆环合拢形成一长条形,作为系统手势提示线(通常显示在屏幕底部),伴随着圆环的合拢,圆环内的形象本体、球体消失(可以伴随相关的消失动画)。
示例性地,请参阅图12,图12为本申请实施例提供的一种数字助手以待唤醒态显示的示意图。如图12的(a)所示,在熄屏界面下,数字助手以呼吸态显示,并能够以预设的规律在熄屏界面内移动。如图12的(b)所示,当用户的输入导致电子设备从熄屏界面进入锁屏界面时,数字助手移动到底部,圆环向内收缩,将形象本体和球体挤压至完全隐藏(此过程以动画体现,也可以隐藏动画),此时数字助手成为条状,并显示在屏幕底部,作为系统手势提示线。如图12的(c)以及(d)所示,屏幕解锁后,用户点击图库应用图标(或者其他任意应用图标也可),进入图库界面。在上述的一系列操作的过程中,用户均为未唤醒数字助手,因而数字助手始终显示为待唤醒态,以手势提示线的方式显示在界面上。
在第一形态为待唤醒态的情况下,数字助手从第一形态切换为第二形态具体可以是待唤醒态切换到聆听态。
示例性地,在第一界面为锁屏界面或解锁后的任意屏幕界面的情况下,第一形态为待唤醒态,基于电子设备产生的信号指令所显示的第二形态为聆听态。其中,触发以第二形态显示所述虚拟形象的信号指令可以是基于用户对第一形态的数字助手执行预设手势后产生。例如,用户对第一形态的数字助手执行长按手势或者是双击手势,本实施例对预设手势并不做具体限定。
可选的,多个特征体包括第一特征体和第二特征体。在第二形态下,第一特征体为圆环的主形态,第二特征体为球体,且圆环位于球体的俯视视角的赤道上。
以上介绍了第一形态为待唤醒态的实施例,以下将介绍第一形态为聆听态的实施例。
可选的,在检测到第三用户输入时,数字助手的虚拟形象以聆听态作为第一形态显示在第一界面上,其中第三用户输入用于唤醒数字助手。示例性地,第三用户输入例如可以为用户的语音输入,外部输入设备(例如遥控器或鼠标)的输入,用户直接与电子设备屏幕交互所产生的触摸输入,或者用户与物理按键交互的输入、背板手势等快捷操作等。
其中,虚拟形象的聆听态包括第一特征体(例如上述的圆环)和第二特征体(例如上述的球体),且第一特征体以主形态显示。
具体来说,在亮屏界面上,点击手势提示线,即可唤醒数字助手,从而使得数字助手进入聆听态。请参阅图13,图13为本申请实施例提供的一种数字助手从待唤醒态切换为聆听态的示意图。如图13的(a)所示,用户以单击、双击或长按等手势与数字助手所在的手势提示线交互,触发唤醒数字助手。如图13的(b)所示,在数字助手被唤醒后,数字助手以聆听态显示在当前界面内。
在一些场景下,电子设备的界面上可能显示有特定内容,且界面上可能难以有更多的有效空间显示数字助手了,此时可以将数字助手的虚拟形象与界面上的特定内容进行融合。
示例性地,上述在第一界面上以第一形态显示虚拟形象,具体可以包括:在第一界面上的预设位置显示虚拟形象;或者,将虚拟形象覆盖显示在第一界面内的目标内容元素上,其中,目标内容元素为预设类型的内容元素。即,虚拟形象是覆盖在目标内容元素上的,使得第一界面上不再显示目标内容元素。这样,通过将虚拟形象覆盖显示在目标内容元素上,可以避免在原来的界面上额外增加一个展示虚拟形象的界面而遮挡原来界面上过多的内容(如图13右侧所示的用于展示数字助手的虚拟形象的悬浮窗遮挡了屏幕底部的部分内容),从而提高数字助手与界面的融合性。
其中,目标内容元素例如为由系统预先定义的预设类型的控件或图标。例如,预设类型的控件或图 标可以是指外形具有圆形的形状特征的空间或图标。
可选的,当虚拟形象覆盖显示在第一界面内的目标内容元素上时,可以将目标内容元素的手势事件接口复制到虚拟形象上,以使得对虚拟形象执行手势能够实现目标内容元素的手势对应功能。也就是说,即便目标内容元素被虚拟形象覆盖了,但是虚拟形象继承了目标内容元素本来的功能,从而使得对虚拟形象执行特定手势(例如单击)时,仍然能够执行目标内容元素的手势对应功能。
例如,在第一界面为相机界面的情况下,目标内容元素为相机界面内的快门控件。虚拟形象是覆盖显示在相机界面的快门控件上,且点击虚拟形象仍然能够实现照片或视频的拍摄。
也就是说,数字助手的虚拟形象覆盖显示在目标内容元素上之后,当数字助手的虚拟形象的显示区域接收到针对原来的目标内容元素的手势输入时,该手势可以用于实现原控件的功能。其中,虚拟形象的显示区域接收到手势输入时实现原内容元素的功能,可以通过将目标内容元素的手势与功能的映射关系关联到数字助手,使得数字助手在接收到手势输入之后,触发对应的功能。以目标内容元素是控件为例,通常控件在被交互时,将显示区域对应的坐标是否触碰,来作为该控件的触发,并产生对应的指令。因此,在数字助手集成控件原来的功能时,只需要将触控指令的触发接口同步到数字助手即可,使得数字助手所在区域被触碰时,产生对应的触控指令,即可实现原控件的功能。
或者,只要控件对应的显示区域被预设手势触碰,即按照手势类型执行该控件对应的功能,而不限制该控件上方是否有其他图层显示。此外,实现数字助手功能本身的功能只需要按照数字助手对应的手势识别,然后触发该手势对应的功能即可。
请参阅图14,图14为本申请实施例提供的一种虚拟形象覆盖在相机快门控件上的示意图。如图14中所示,在其中一种初始状态下,数字助手未被唤醒,以待唤醒态显示在界面上,作为导航提示线。当用户与待唤醒态的数字助手(即导航提示线)交互,如长按导航提示线或长按电源键,来唤醒数字助手后,此时数字助手为聆听态。此时,系统在显示聆听态的数字助手时,首先检查当前界面内的控件类型。由于当前界面内的相机快门控件为预设类型的控件,因此,将数字助手与相机快门控件融合显示,融合显示后,数字助手以聆听态覆盖显示在原相机快门控件的位置。
此外,在另一种初始状态下,数字助手在当前界面并未以任何状态显示,以长按控件(如电源键)或语音方式唤出数字助手后,数字助手同样可以是覆盖显示在当前界面的预设类型的控件或图标上。
在数字助手的虚拟形象覆盖在原相机快门控件上之后,点击数字助手的虚拟形象的显示区域(即原相机快门控件的显示区域),同样可以触发拍照。这可以通过数字助手传递指令给相机应用来控制拍照,也可以直接由相机应用识别手势来实现。
在一些情况下,导航提示线在系统全局常驻显示,那么当数字助手处于待唤醒态时,也可以理解为虚拟形象将球体隐藏、圆环形变后与导航提示线融合,使得导航提示线在具有导航提示线功能的同时,也作为数字助手虚拟形象的常驻状态。
此外,当界面内同时存在多个可供覆盖显示的控件或图标时,数字助手可以在不同的控件或图标之间切换。并且,数字助手可以是按显示时间顺序,从在先显示界面的控件中切融合到最新界面内的控件或图标上。
请参阅图15,图15为本申请实施例提供的一种虚拟形象覆盖在助手建议组件上的示意图。如图15的(a)所示,数字助手以待唤醒态显示。在此状态下,数字助手融合导航提示线,形变后的圆环与导航提示线重叠。如图15的(b)所示,当切换另一主屏页时,主屏页包含助手建议小组件,助手建议小组件内显示有虚拟形象的图标,则数字助手可以从导航提示线流转到助手建议小组件内的虚拟图标上。并且,当导航提示线是全局常驻时,则为图15的(b)所示的方案;当导航提示线仅为形变后的圆环,那么数字助手从导航提示线流转到助手建议小组件内的虚拟图标之后,则不再显示导航提示线。
以上详细介绍了虚拟形象的待唤起态和聆听态,以下继续介绍虚拟形象的任务执行态。
可选的,数字助手的虚拟形象还具有任务执行态,该任务执行态包括第一特征体。对应于数字助手执行自动化任务过程中的至少一个过程界面,任务执行态下的第一特征体对应产生形状变化、位置变化中的至少一种变化。
具体来说,在数字助手执行自动化任务时,电子设备的显示界面上会发生一系列的变化。例如,数 字助手在执行打电话给某个联系人的自动化任务时,电子设备会先打开联系人界面,然后选定联系人界面中的特定联系人,再进入到呼叫该特定联系人的电话界面。那么,在数字助手执行自动化任务过程中的过程界面中,数字助手实际上是一直处于任务执行态,且数字助手在任务执行态下的第一特征体可以发生各种变化,以向用户展示数字助手执行任务的自动化过程,使得数字助手执行任务的自动化过程更为直观、清晰,便于用户观察数字助手执行自动化任务的过程是否出现错误。
在一种可能的示例中,第一特征体以几何阵列的形态显示,且几何阵列的显示位置与过程界面内的目标控件相适应。几何阵列通过模拟目标控件的点击事件进入下一过程界面。也就是说,在执行自动化任务的过程中,数字助手能够以作为几何阵列的第一特征体来模拟用户点击界面上控件的操作,从而将数字助手模拟用户执行自动化任务的整个过程可视化。
简单来说,如果执行自动化任务需要打开某个应用,且打开应用这一自动化步骤中,需要点击桌面的应用图标,则应用图标即为该步骤的执行目标,打开应用时,可以将圆环形变的点阵显示在该应用图标上。
示例性地,请参阅图16,图16为本申请实施例提供的一种数字助手执行自动化任务的过程示意图。如图16所示,首先,数字助手接收到用户的语音指令“把这张照片分享给微信朋友圈”。然后,数字助手将点阵显示在当前界面的“发送”控件上,以模拟用户点击“发送”控件的操作。其次,响应于数字助手模拟用户点击“发送”控件的操作,界面发生变化,当前界面变化为展示能够发送照片的应用,且这些应用中包括“微信”应用;在发生变化后的界面中,点阵显示在“微信”应用的图标上,以模拟用户点击“微信”应用图标的操作。最后,界面再次发生变化,变化为在“微信”应用的朋友圈编辑需要发表的图片的界面,此时点阵显示在“发表”控件上,以模拟用户点击“发表”控件的操作。
可选的,在第一形态为聆听态,第二形态为任务执行态的情况下。聆听态和任务执行态均包括第一特征体,且聆听态和任务执行态下的第一特征体分别具有不同的形态。
具体地,在任务执行态下,第一特征体的形态包括:界面内的几何阵列、数字助手对话卡片的外框、调用数字助手输入时输入框的外框或任务结果页的外框。即,在不同的场景下,第一特征体可以是以不同的形态来表示,以便于更为贴合当前界面所显示的内容。
可选的,对应自动化任务的任务进度,第一特征体具有不同的形态变化。即,在数字助手执行自动化任务的过程中,在执行到不同的任务进度下,第一特征体可能会变化为不同的形态,以便于使得第一特征体的形态始终与当前的界面紧密贴合。
示例性地,对应自动化任务的执行过程,在任务执行态下,第一特征体的形态变化包括以下的过程:初始时,第一特征体以圆环的主形态显示在相对第二特征体的初始位置,初始位置与聆听态时第一特征体的位置相同或不同;然后,第一特征体脱离第二特征体,并放大、形变后,作为数字助手对话卡片的外框;其次,第一特征体再由对话卡片的外框变为对话卡片的对话气泡内的几何阵列;最后,在自动化任务执行完成后,虚拟形象以聆听态显示。
总的来说,在任务执行态下,针对主形态为圆环的第一特征体,圆环脱离球体,经过形态变化后,以变化后的特征态显示在任务执行界面。任务执行界面是指在数字助手接收到任务指令后,根据对任务指令的解析,将指令输送到本地应用程序进行执行时的界面,或者执行结果界面。因此,任务执行界面可以是执行应用的部分界面或者全部界面。
其中,圆环的形态变化包括:与形象本体、球体同心的圆环消失,转化为屏幕上的任务执行界面的几何阵列(例如点阵),或者环绕任务执行界面的外框。并且,从圆环转化为几何阵列、外框的过程可以伴随动画。例如,圆环分解为若干点,然后铺设到任务执行界面,形成点阵;由圆环产生形变、拉长,转变一个框,并放大为任务执行界面的外框。
以下将以数字助手的几个应用场景为例,来介绍数字助手在任务执行态下发生形态变化的过程。
请参阅图17,图17为本申请实施例提供的一种数字助手的圆环转化为点阵铺在任务执行界面的示意图。如图17所示,图17的(a)对应数字助手的聆听态,此时用户输入的语言指令为“帮我订一杯新吧咖啡…”。
图17的(b)对应数字助手的思考态(其中思考态是一个动态形象,圆环一直按照一定规律转动,图17的(b)为某一个瞬时的形态)。具体地,思考态对应的是数字助手执行任务解析过程。此时,用 户与数字助手交互的界面显示数字助手解析任务的过程“正在结合新吧历史订单为您推荐”。
在完成任务解析过程后,进入任务执行阶段,数字助手即体现为任务执行态,对应图17的(c)。在图17的(c)中,最下面的数字助手图标的圆环体消失,并在上方展示执行任务所得到的任务卡片(即对话气泡,但以卡片的形式呈现)。同时在任务卡片上显示有若干点阵,该点阵由数字助手原来的圆环转化形成。
可以理解的是,为了避免对任务执行界面的干扰,点阵可以在显示一段时间后消失。图17的(c)中的两个任务卡片,上方的任务卡片点阵已经消失,下方的任务卡片显示了点阵消失过程中的一个瞬间状态(同时显示点阵和界面内容,且均半透明化显示)。当卡片界面完全显示时,点阵消失。即,对于图17中的任务执行界面,显示过程为:仅显示点阵→点阵逐渐消失,界面内容逐渐显示(同时存在点阵和界面)——仅显示界面内容。
在另一个应用场景下,调用数字助手进行语音转文本输入时,则任务执行界面可以是输入框、或者弹出的卡片界面,这些卡片可以是安装在本地的应用程序提供的组件或者子程序。
请参阅图18,图18为本申请实施例提供的在桌面使用数字助手进行文字输入的示意图。如图18的(a)所示,用户单击桌面的搜索小组件,搜索小组件进入编辑模式。如图18的(b)所示,通过长按电源键启用数字助手,此时数字助手为聆听态,数字助手调用麦克风获取用户语音输入。如图18的(c)所示,数字助手接收语音输入,并将语音转为文字后填充到输入框;此时,数字助手的形态为任务执行态,圆环从球体外侧消失,并放大、变形后转移到搜索框外侧,形成一个环绕搜索框的外框。如图18的(d)所示,在完成输入后,搜索小组件展开为搜索卡片,圆环转化的外框可以扩大,并环绕搜索卡片;当然,圆环转化的外框也可以在呈现搜索卡片后消失,或保留预设时间后消失。
当圆环形变的形变体消失时,或者消失后预设时间内,球体外可以显示初始状态(未形变)的圆环。
示例性地,请参阅图19,图19为本申请实施例提供的一种数字助手的圆环发生形变且恢复原状的示意图。如图19的(a)所示,初始唤出数字助手后,数字助手以聆听态显示,且数字助手上的圆环显示在球体外侧。
如图19的(b)所示,用户输入完毕后,经过思考态(图19中未示出),数字助手以任务执行态显示。在任务执行态的初始阶段,数字助手的圆环向外形变,与数字助手的对话卡片的边框位置重叠,显示在边框上或边框的外侧,或者显示在边框的上层。在视觉上,数字助手的圆环变为了对话卡片的外框。
如图19的(c)所示,当卡片不再扩大时,圆环从卡片外框消失,并在球体外侧重新以初始大小的圆环体显示。
如图19的(d)所示,当以对话气泡呈现结果时,数字助手上的圆环则再次转化为点阵,显示在对话气泡上。可以理解的是,图19的(c)所示的中间态可以省略,圆环的状态变化可以直接由图19的(b)所示的卡片外框变换得到图19的(d)所示的点阵。
如图19的(e)所示,对话卡片的结果完全显示,点阵消失,圆环再次以初始大小的圆环体显示在球体外侧。
在又一个应用场景下,调用数字助手进行智慧识别或者其他自动任务时,任务执行界面为当前正在操作的应用界面。对于智慧识别来说,通常会产生一个智慧识别结果,该智慧识别结果为识别得到的一个或多个可执行对象(如相册编辑中的智慧去人/智慧去物),则任务执行界面可以是可执行对象所在区域。在执行智慧识别的过程中,是实时对整个界面进行识别,最后识别到可执行对象。对应这种任务执行过程,圆环对应的点阵也可以和任务执行过程相适应,即在识别过程阶段,将点阵铺满整个界面,识别出可执行对象后,仅在可执行对象的区域显示点阵。
示例性地,请参阅图20,图20为本申请实施例提供的一种智慧识别场景下数字助手的形态变化示意图。在图20中,用户唤出数字助手,使用智慧识别功能。如图20的(a)所示,智能手机的界面上显示一图片,该图片全屏显示,之后,用户长按界面任意位置,触发智慧识别功能。并且,当前界面上显示数字助手的图标,在数字助手刚出现时,数字助手为聆听态。如图20的(b)所示,数字助手对图片进行智慧识别,此时,数字助手的圆环从球体外侧消失,并转化为点阵显示在界面上,点阵铺满屏幕。如图20的(c)所示,图片识别完成后,识别到至少一个可执行对象,则点阵仅显示在可执行对象所在的显示区域。此时,点阵的点也可以不是规则的点阵,而是可以在可执行对象所在的显示区域动态运动, 使得点阵变为多个不规则的离散点。
需要说明的是,在上述的图19和图20中,为了完整呈现数字助手形态之间的切换,是按照既定步骤来示意的。实际在操作电子设备时,电子设备支持通过快捷操作来唤出数字助手执行预设功能,例如图20的智慧识别功能。此时,数字助手可以直接显示对应的状态,如直接显示任务执行态,或者在唤出的瞬间显示聆听态,然后立即切换为任务执行态。
以智慧识别功能为例,假设系统定义长按电源键为智慧识别的快捷操作,当用户长按电源键,操作系统在界面上显示数字助手图标,并直接显示任务执行态,即长按电源键后直接进入图20的(b)的显示界面。
在一些设计中,在任务执行态下,数字助手的形象本体和球体可以不显示在界面上,而仅显示圆环体变化后的形态,如图20的(c)中可以仅显示点阵,而不显示下方的球体和形象本体。
以上介绍了数字助手在任务执行态下的变化过程,在实际应用中,数字助手在初始出现在界面的时候,也可以是以一定的规律来出现。
可选的,在上述的第一界面为锁屏界面或解锁后的任意屏幕界面,且第一界面上所显示的第一形态为聆听态的情况下。那么,在第一界面上以第一形态显示虚拟形象,具体还包括:初始阶段仅显示第一特征体,不显示第二特征体,且第一特征体初始以几何阵列显示,铺满屏幕;然后,第一特征体由几何阵列经形状变化为以主形态显示,并显示第二特征体。
也就是说,在唤醒数字助手的时候,数字助手先是以几何阵列(例如点阵)的形式出现,并逐渐铺满屏幕;然后,数字助手再逐渐从几何阵列转换为第一特征体的主形态(即圆环),并且显示第二特征体(即球体),从而实现以动画的形式展示数字助手的唤醒过程,增加了交互过程的直观性,有利于用户识别到数字助手当前所处的工作状态。
其中,几何阵列例如可以为点阵,即由多个圆点通过规则排列所构成的阵列。几何阵列也可以是由其他的几何形状(例如三角形、矩形等多边形)通过规则排列所构成的阵列。
可选的,第一特征体初始以几何阵列显示,具体可以包括:先确定几何阵列的初始显示位置;其次,在初始显示位置显示部分阵列的几何图形,再将几何图形逐渐铺满屏幕。例如,先从一个初始显示位置开始显示部分点阵,然后再逐渐将点阵扩散到铺满整个屏幕。
可选的,几何阵列的初始显示位置是基于用户输入确定的。其中,用户输入是指用于唤醒数字助手的输入。其中,初始显示位置具体可以是在触发唤醒数字助手的用户输入的附近。比如,在用户输入为长按智能手机的电源键时,几何阵列的初始显示位置可以是在电源键附近的屏幕位置上。
并且,在未检测到用户输入时,初始位置为默认位置,该默认位置为显示虚拟形象的区域,例如屏幕底部的预设区域。
本方案中,通过用户输入来确定几何阵列的初始显示位置,能够使得数字助手的显示过程与用户输入更为贴合,增强用户与数字助手交互的直观性,提高用户体验。
总的来说,在唤出数字助手的过程中,点阵的出现可以具有预设的规律。当检测到用户的操作(触摸或者按压物理按键)时,基于用户的操作确定数字助手的初始显示位置,并从初始显示位置将点阵扩展到全图显示。当未检测到用户操作,则以数字助手的虚拟形象的显示位置作为初始显示位置,点阵从初始位置将点阵扩展到全图显示。
示例性地,请参阅图21,图21为本申请实施例提供的一种智慧识别场景下数字助手出现的示意图。如图21所示,当用户通过长按智能手机右侧的电源按键来唤起数字助手时,数字助手在初始阶段是先以点阵的形式显示在电源按键附近的屏幕上,然后再逐渐扩散到整个屏幕。当用户通过长按屏幕底部的数字助手来唤起数字助手时,数字助手在初始阶段是先以点阵的形式显示在底部数字助手的虚拟形象所在区域附近上,然后再逐渐扩散到整个屏幕。
在一些场景下,数字助手的形态变化还可以是与场景信息有关,从而实现通过数字助手的虚拟形象来贴切地表示当前使用场景下的一些信息。
可选的,在一些实施例中,电子设备可以获取场景信息,并基于场景信息确定数字助手的第一特征 体的目标形态,将第一特征体以目标形态显示。即,目标形态是与场景信息相关的。在电子设备获取到特定场景信息时,电子设备可以是以特定形态来显示第一特征体。
其中,场景信息可以包括外界环境信息和系统事件信息。外部环境信息为通过传感器获取的用于指示外部环境的信号,具体可以是通过电子设备上的传感器或者网络获取,如气温、湿度、天气等外界环境信息。系统事件信息为从电子设备的系统内部获取的、在预设时间内发生的事件,例如正在播放音乐、预设时间内有闹钟通知、正在查找文件等事件信息。
在第一特征体为圆环的情况下,圆环可以是基于外界环境信息产生形变。例如,在下雨天时,圆环形变为雨伞形状。请参阅图22,图22为本申请实施例提供的一种数字助手基于场景信息产生形变的示意图。如图22所示,在唤出数字助手后,若维持在当前界面,数字助手始终以聆听态显示。维持某一状态(如图22的聆听态)预设时间后,数字助手的圆环基于场景信息产生形变。在图2中,当前天气为小雨,因此圆环变为伞状。
此外,圆环还可以基于系统事件信息产生形变。例如,熄屏播放音乐时,数字助手的圆环形变为耳机形状;预设时间内有闹钟通知,数字助手的圆环形变为实体闹钟形状。再例如,用户调用数字助手查找文件时,数字助手的圆环形变为一个放大镜。
当然,除了第一特征体可以发生形变之外,数字助手的其他特征体也可以基于场景信息发生形变,此处不再赘述。
可选的,在虚拟形象具有待唤醒态的情况下,待唤醒态至少包括第一特征体,且当前第一特征体以长条形显示。响应于电子设备检测到人眼视线焦点落在屏幕所在区域,电子设备显示第二特征体。
也就是说,在一些场景下,数字助手处于待唤醒态时,只显示一个长条形;等到数字助手检测到有用户视线落在屏幕上时,可以认为用户有交互倾向,因此显示第二特征体,从而表征数字助手检测到用户具有交互倾向且正在等待交互,以提高人机交互的自然性。
可选的,响应于检测到人声,第二特征体可以是相对第一特征体向靠近人声声源的方向移动。具体来说,第二特征体刚刚出现的时候,位置是相对固定的,例如位于第一特征体的正上方或正下方。当电子设备检测到人声时,电子设备可以根据人声的方位来移动第二特征体,从而使得第二特征体跟随人声移动,以提高交互的自然性。
具体来说,在另一个常见的终端——车载终端中,数字助手作为主要的交互方式,也可以常驻与车机的显示屏(如中控屏)上,并通过圆环或球体中的至少一个形变,来在视觉上与用户进行交互,从而保持用户对数字助手的注意力,增强对话感。
在车载终端环境下,场景信息主要包括人的信息、系统信息以及外界环境信息。
人的信息可以是来源于驾驶员、乘客中的至少一个,包括头部朝向、眼神、手势、姿势、人声,并可以区分为是主驾或者副驾(后座通常难以和中控屏交互)。
系统信息可以是指车控功能人为或自适应调节的变化,比如驾驶模式、空调设置、车窗除雾、安全带状态变化等,以及一些车况的变化信息,如车辆速度变化、行驶方向变化、车辆产生颠簸、处于特殊交通状态等。此外,系统信息还可以包括前述正在运行的应用程序的类型等细节。
外界环境信息通常为天气、光线、温度等的变化。
基于这些场景信息,对应不同状态的虚拟形象,可以触发虚拟形态的不同形变,例如上述圆环的形变,或者球体的形变。
此外,车载终端与手持终端不同的一点在于,中控屏基本不存在熄屏状态,因此车载终端上的数字助手开机后就以待唤醒态显示。这样,在待唤醒态下,根据检测到的不同场景条件,数字助手有不同的形变状态。
示例性地,请参阅图23,图23为本申请实施例提供的一种车载终端上数字助手的变化示意图。在图23中,数字助手初始的待唤醒态为仅显示形变后的圆环,当检测到主驾或副驾的视线看向中控屏时,则显示部分球体,如果在此基础上,进一步检测到人声,则显示更多的球体面积,如显示半球体。
进一步地,对应主驾和副驾的不同人声,圆环可以拉长,且球体位于圆环的人声声源方向一侧。在图23中,在检测到主驾人声时,半球体则位于拉长后的圆环体的靠近主驾一侧,即半球体往主驾方向移动;检测到副驾人声时,半球体则位于拉长后的圆环体的靠近副驾一侧,即半球体往副驾方向移动。
可选的,在电子设备为智能汽车的情况下,基于车载功能的启动或告警,可以在电子设备的屏幕上显示数字助手,并通过第一特征体的形变或运动来指示对应功能或告警信息。
也就是说,在车机上,还可以用数字助手来提示车况信息。例如,在检测到车辆处于道路颠簸环境时,显示唤醒态的数字助手,并且球体沿着圆环所在屏幕上下震动。或者,球体与圆环一起上下震动,二者震动频率可以一致或不一致。又例如,在检测到车辆启动车窗除雾功能时,显示唤醒态的数字助手,并将球体渲染为白色,然后圆环以x轴为固定轴,产生z轴方向的上下摆动,且随着摆动,球体上渲染的白色透明度变低,直至完全透明,圆环停止摆动。再例如,在检测到安全带阀门未锁紧或锁的进度不够(即未佩戴安全带)时,显示唤醒态的数字助手,并将圆环体以预设角度倾斜显示后,圆环体中间断裂一部分,变为圆弧,且圆弧逐渐合拢形成圆环。
以上主要是举例介绍了电子设备为智能手机或车载终端的情况下,数字助手的形态发生变化的过程。以下将介绍电子设备为智慧电视的情况下,数字助手的形态发生变化的过程。
请参阅图24,图24为本申请实施例提供的一种数字助手的显示方法的流程示意图。如图24所示,该数字助手的显示方法包括以下的步骤2401-2403,且该数字助手的显示方法应用于配置有数字助手的电子设备上。
步骤2401,获取用户的位置,并基于用户的位置显示数字助手的第一形态。
本实施例中,配置有数字助手的电子设备上可以是部署有传感器,例如红外传感器、超声波传感器或光学距离传感器等距离传感器,因此电子设备能够基于所部署的传感器来检测得到用户的位置。例如,在电子设备部署有红外传感器的情况下,电子设备通过红外传感器发射红外光并接收由用户的身体所返回的红外光,进而计算得到用户相对于电子设备的所在方向以及距离,进而确定用户的位置。
可选的,电子设备也可以是通过其他的方式获取用户的位置,例如从其他的设备处获取到用户的位置。例如,在电子设备为智慧家庭中的智慧电视或智慧音箱的情况下,电子设备可以通过网络与其他的智能设备(例如家用摄像头、扫地机器人或智能路由器),这样在其他的智能设备通过自身的传感器或定位模块确定用户的位置后,电子设备可以从其他的智能设备处获取到用户的位置。
在电子设备获取到用户的位置的情况下,则可以基于用户的位置在电子设备的显示屏幕上显示数字助手的第一形态,即电子设备所显示的数字助手的第一形态是与用户的位置相关的。简单来说,在用户的位置距离电子设备越远时,代表用户与电子设备交互的倾向性越低,因此电子设备可以显示交互意向程度越低的数字助手形态;在用户的位置距离电子设备越近时,代表用户与电子设备交互的倾向性越高,因此电子设备可以显示交互意向程度越高的数字助手形态。
步骤2402,获取用户的空间交互行为,空间交互行为用于指示用户在空间中的活动情况。
在确定用户位置并显示数字助手的第一形态之后,电子设备可以通过传感器来进一步捕捉用户的空间交互行为,即用户在空间中的活动情况,进而判断用户进一步的交互倾向。
示例性地,用户的空间交互行为可以包括但不限于以下行为中的任意一种:走近电子设备、远离电子设备、向左移动、向右移动、手指指向电子设备、手臂指向电子设备、面部朝向电子设备以及预设手势。总的来说,用户的空间交互行为可以用于表示用户与电子设备的数字助手之间的交互倾向。例如,当用户的空间交互行为是走近电子设备时,代表用户期望与电子设备的数字助手开展进一步的交互;当用户的空间交互行为是远离电子设备时,代表用户并不期望与电子设备的数字助手开展进一步的交互。
步骤2403,响应于空间交互行为,显示数字助手的第二形态。
本实施例中,在获取到用户的空间交互行为后,电子设备则从数字助手的第一形态切换为显示数字助手的第二形态。其中,数字助手的第一形态和数字助手的第二形态分别用于表示数字助手与用户交互过程中的不同状态,进而便于用户识别数字助手当前的交互状态而下达相应的交互指令。
可选的,电子设备显示数字助手的第二形态可以是仅与空间交互行为有关。即,电子设备在获取到用户的空间交互行为后,无论当前所显示的数字助手的第一形态具体是什么形态,均按照预先建立的空间交互行为与数字助手的形态之间的映射关系,确定显示数字助手相应的形态(即上述的第二形态)。
在另一些实施例中,电子设备显示数字助手的第二形态也可以是同时与电子设备已显示的数字助手的第一形态以及空间交互行为有关。也就是说,电子设备实际上是基于当前已显示的第一形态以及用户 的空间交互行为,来确定需要显示的数字助手的第二形态。举例来说,数字助手可以具有多个形态,不同的形态对应电子设备判断的用户与电子设备的不同大小的交互意向程度。当电子设备基于获取到的空间交互行为判断用户期望与电子设备的交互意向程度提高时,电子设备可以基于当前已显示的数字助手的第一形态,确定显示交互意向程度比第一形态高的第二形态,并显示该第二形态;反之,当电子设备基于获取到的空间交互行为判断用户与电子设备的交互意向程度降低时,电子设备可以基于当前已显示的数字助手的第一形态,确定显示交互意向程度比第一形态低的另一第二形态,并显示该另一第二形态。
可选的,在一些实施例中,电子设备可以基于电子设备所在的位置将电子设备附近的区域划分为多个区域,且不同的区域相对于电子设备的位置不相同。比如,电子设备可以划分3个区域,分别为第一区域、第二区域和第三区域。其中,电子设备位于第一区域内,第二区域相对电子设备位于第一区域外侧并包围第一区域,第三区域相对电子设备位于第二区域外侧并包围第二区域。例如,第一区域、第二区域和第三区域可以是以电子设备为圆心所划分的区域,且第一区域的平面形状为圆或椭圆,第二区域的平面形状为包围在第一区域之外的圆环,第三区域的平面形状为包围在第二区域之外的圆环。这样,当电子设备识别到用户的空间交互行为是在不同的区域之间移动时,则可以准确地判定用户与电子设备的交互倾向,进而调整所显示的数字助手的形态。
示例性地,假设电子设备在上述步骤2401中所获取到的用户的位置处于第一目标区域,电子设备在上述步骤2402中所获取到的空间交互行为是用户从第一目标区域移动至第二目标区域,则电子设备将数字助手的第一形态切换为第二形态。其中,第一目标区域属于上述的第一区域、第二区域和第三区域中的任意一个,第二目标区域则可以是与第一目标区域相邻的一个区域。
可选的,响应于用户从第一目标区域移动至第二目标区域,电子设备可以显示数字助手的第二形态;其中,第二形态为第二目标区域对应的形态。也就是说,以电子设备为中心所划分的每个区域均对应于数字助手的一个形态,当用户从某一个区域移动至另一个区域之后,则切换显示数字助手的形态,以使得当前显示的形态为用户移动后所在的区域对应的形态。
具体地,在第一形态用于表示数字助手处于无任务处理状态,且第一目标区域位于第二目标区域之外时,第二形态用于表示数字助手处于待接收指令状态。具体来说,在第一目标区域位于第二目标区域之外时,用户从第一区域移动至第二区域,代表用户从离电子设备更远的区域移动到了离电子设备更近的区域,即用户是走近电子设备的,因此电子设备可以识别到用户具有进一步的交互倾向,从而切换数字助手的形态,以向用户展示数字助手从无任务处理状态切换到了待接收指令状态。其中,无任务处理状态可以是指示数字助手没有需要处理的任务而处于休眠的状态;待接收指令状态则可以是指数字助手已经被唤醒,随时准备接收指令的状态。数字助手的待接收指令状态是比无任务处理状态具有更高交互意向程度的状态。
或者,在第一形态用于表示数字助手处于待接收指令状态,且第一目标区域位于第二目标区域之外时,第二形态用于表示数字助手处于指令接收状态。类似地,在第一形态用于表示数字助手处于待接收指令状态的情况下,用户从第一目标区域移动至第二目标区域,代表用户具有进一步的交互倾向,从而切换数字助手的形态,以向用户展示数字助手从待接收指令状态切换到了指令接收状态。其中,指令接收状态可以是指数字助手已经通过电子设备上的相应硬件来接收用户下发的指令的状态,即接收指令的硬件已经处于工作状态,例如数字助手通过电子设备上的麦克风来接收用户下发的语音指令。数字助手的指令接收状态是比待接收指令状态具有更高交互意向程度的状态。
又或者,在第一形态用于表示数字助手处于待接收指令状态,且第二目标区域位于第一目标区域之外时,第二形态用于表示数字助手处于无任务处理状态。可以理解的是,在第二目标区域位于第一目标区域之外的情况下,用户从第一目标区域移动至第二目标区域,代表用户正在远离电子设备,因此电子设备可以识别到用户不具有进一步的交互倾向,从而是从待接收指令状态切换到交互意向程度更低的无任务处理状态。
本方案中,通过获取用户在与电子设备具有不同距离的区域之间的移动情况,来识别用户针对于数字助手的交互倾向,进而实现数字助手的形态调整,能够很好地贴合人类之间的社交状态与距离之间的关系,提高数字助手形态变化的自然性。
可选的,在将数字助手的第一形态切换为第二形态之前,电子设备还可以获取用户的空间活动行为, 空间活动行为用于指示用户在空间中的活动情况。其中,该空间活动行为是用户不具有与数字助手进行交互的意向下所做出的行为,即空间交互行为能够用于表示用户并不具有与数字助手进行交互的意向。
然后,响应于空间活动行为,电子设备显示数字助手的调整后的第一形态;其中,调整后的第一形态是对第一形态的至少一个特征进行调整后得到的,且调整后的第一形态和第一形态用于表示数字助手与用户交互过程中的相同状态。即,在用户不具有交互意向的情况下的情况下,用户所做出的空间活动行为会改变数字助手的形态的特征,而不会改变数字助手的形态,即不会使得数字助手的交互状态发生改变。
第一形态中被调整的特征例如可以包括但不限于用于表示数字助手的特征体的以下特征:特征体的姿态、特征体的运动规律、特征体的光效、特征体的材质、特征体的颜色、特征体的亮度等特征。例如,用户在已划分的某个区域内走动而不是移动至另一个区域时,数字助手的特征体(比如圆环体)可以是跟随用户转动;又例如,用户通过手指指向不同的方向时,数字助手的特征体可以是跟随用户手指所指向的位置来转动或旋转;又例如,用户逐渐走近电子设备的显示屏幕时,数字助手的圆环体可以旋转而发生俯仰角度的变化。
以上介绍了基于用户的空间交互行为,切换数字助手的形态的过程,以下将详细介绍数字助手的具体形态。
示例性地,数字助手的第一形态和第二形态均包括多个特征体,且相较于第一形态,第二形态中的至少一个特征体的姿态和/或运动规律发生改变。即,数字助手的不同形态均是由相同的多个特征体构成的,且不同形态之间的区别在于至少一个特征体的姿态和/或运动规律发生改变。
总的来说,通过使用多个特征体来构建数字助手的形象,不同的特征体能够独立地调整姿态和运动规律,这样通过调整特征体的改变,来表示不同状态的数字助手,形成风格统一的、响应空间交互行为输入的反馈方式,有助于用户理解数字助手的反馈。
在一种可能的实现方式中,用于表示数字助手的形象的多个特征体包括圆环体、球体和形象本体。其中,形象本体包裹于球体内,圆环体环绕球体。即,在数字助手的形象中,从内到外依次为形象本体--球--圆环体。此外,球体可以是透明或透明的,以便于用户能够透过球体看到包裹于球体内的形象本体。形象本体可以是为数字助手所设计的任意形状的物体,例如类似人体形状的物体、不规则形状的物体等。
本方案中,通过基于圆环体、球体和形象本体这几个特征体来构建数字助手的形象,可以使得在通过调整特征体的姿态或运动规律来切换数字助手的形态时,能够实现无缝、流畅的转化,使得数字助手的形态变化过程更为自然且符合人类认知,便于用户了解当前与数字助手的交互状态。
示例性地,在第一形态或第二形态用于表示数字助手处于无任务处理状态时,第一形态或第二形态为多个特征体沿着预设轨迹移动,且球体和形象本体在移动过程中自转。
简单来说,当数字助手处于无任务处理状态时,代表用户对于数字助手并无交互倾向,因此数字助手的形态则可以表现为按照预定的规则运动而并无交互意向的形态,即多个特征体沿着预设轨迹移动且球体和形象本体在移动过程中自转。
示例性地,在第一形态或第二形态用于表示数字助手处于待接收指令状态时,第一形态或第二形态为圆环体中的部分区域具有光效,且部分区域朝向用户的位置。
当数字助手处于待接收指令状态时,代表用户已经具有与数字助手交互的倾向,且数字助手也已经准备好随时接收用户的指令,因此数字助手的形态则可以表现为时刻关注用户(即具有光效的区域朝向用户的位置),并等待用户下发指令。
示例性地,在第二形态用于表示数字助手处于指令接收状态时,第二形态为多个特征体的顶部或底部朝向用户。即,当数字助手处于指令接收状态时,代表数字助手正在接收用户的指令,因此数字助手的形态则可以表现为数字助手正在全神贯注地接收指令的状态(即多个特征体的顶部或底部朝向用户,表示对用户指令的聆听)。
在另一种可能的实现方式中,第一形态和第二形态均为虚拟人体的形态,且虚拟人体的不同形态之间是在以下的任意一种或多种特征中存在差异:虚拟人体的表情、动作以及大小。简单来说,数字助手的形态也是由多个特征体构成的,且多个特征体可以是按照人体部位进行划分的,即不同的特征体对应 于虚拟人体中的不同部位。例如,多个特征体可以划分为头部特征体、眼部特征体、嘴部特征体、左手特征体、右手特征体等。
本方案中,通过将数字助手的形态设置为虚拟人体的形态,可以使得在响应于用户的空间交互输入来切换数字助手的形态时实现仿人化的形态变化,使得数字助手的形态变化更为自然,便于用户了解当前与数字助手的交互状态。
为了便于理解,以下将结合具体例子详细介绍本申请实施例提供的数字助手的显示方法的执行过程。
请参阅图25,图25为本申请实施例提供的数字助手的显示方法的应用场景示意图。在图25所示的应用场景中,包括用户401和电子设备402。其中,电子设备402例如可以为用户家庭中的智慧电视,且智慧电视上部署有数字助手。
在电子设备402的启动期间,用户401可以通过语音指令来控制电子设备402,从而使得电子设备402自动地执行某些应用程序,例如自动地打开用于播放影视的应用程序。
电子设备402在硬件上配置有距离传感器和麦克风等硬件,其中距离传感器用于检测用户的位置,麦克风用于接收用户的语音指令。电子设备402在软件上则搭载有数字助手,用于处理用户的语音指令,从而根据语音指令自动地执行某些应用程序。
为了便于叙述,以下将以执行数字助手的显示方法的电子设备为智慧电视为例,详细介绍智慧电视执行数字助手的显示方法的过程。
请参阅图26,图26为本申请实施例提供的一种数字助手处于休眠状态的示意图。如图26所示,在智慧电视处于长时间无人使用且智慧电视周围无用户的情况下,智慧电视进入锁屏待机状态,并在显示屏幕上显示锁屏内容。此时,在智慧电视进入锁屏待机状态的时候,智慧电视的显示屏幕上不显示数字助手,数字助手在后台运行,等待被唤醒。
可选的,显示屏幕上所显示的锁屏内容可以是与数字助手无关的内容。或者,显示屏幕上所显示的锁屏内容也可以是数字助手的某一个形态,例如数字助手的呼吸态。
在智慧电视的启动期间,当智慧电视通过传感器检测到用户时,可以先基于传感器确定用户的位置,然后再基于用户的位置确定用户所处的区域,并根据用户所处的区域来确定数字助手的初始显示形态(即上述的第一形态)。
在实际应用中,智慧电视内可以设置有预设的逻辑,将智慧电视周围的空间划分为多个区域,用户在不同的区域时,对应显示不同的数字助手的形态。请参阅图27,图27为本申请实施例提供的一种区域划分的示意图。如图27所示,本实施例中,以智慧电视为中心,按照不同的半径将智慧电视周围的空间划分成四个区域,由外到内依次为:无感知区A1、存在感知区A2、自然行为区A3和典型任务区A4。具体来说,以智慧电视为中心,定义了三个半径:R1、R2和R3,其中R1>R2>R3。那么,无感知区A1与智慧电视之间的距离大于R1、存在感知区A2与智慧电视之间的距离位于R1与R2之间、自然行为区A3与智慧电视之间的距离位于R2与R3之间,典型任务区A4与智慧电视之间的距离小于R3。其中,无感知区对应呼吸态、存在感知区对应唤醒态、自然行为区和典型任务区对应聆听态。
当智慧电视的数字助手激活时,例如用户通过唤醒词/遥控器激活数字助手,智慧电视会根据用户的位置,从呼吸态、唤醒态以及思考态中确定一个数字助手的初始形态,并显示在屏幕上。
举例来说,当用户在无感知区使用遥控器上的语音指令激活智慧电视的数字助手,无感知区对应的初始形态为呼吸态,则智慧电视将数字助手主程序激活后,显示数字助手的呼吸态。当用户在存在感知区使用遥控器上的语音指令激活智慧电视的数字助手,,则显示数字助手的唤醒态。
在智慧电视根据用户的位置显示数字助手的初始形态后,智慧电视还会通过本身所部署的传感器捕捉用户在空间中的活动情况,并根据预置的逻辑解析传感器数据,从捕捉到的用户空间活动中识别出是否存在空间交互行为以及空间交互行为的类型。
举例来说,智慧电视通过距离传感器来感知用户的相对电视的方位和距离,并基于一段时间内的方位、距离变化,来判断用户是否在走向电视。如果检测到一段时间在用户与电视的距离在持续缩小,且用户与电视的实时距离小于预设值,则认为检测到走向输入。同理,当用户相对电视左右移动时,可以检测到用户的横向向左/向右移动输入。
智慧电视在检测到用户的空间交互行为后,可以根据空间交互行为的类型来调整数字助手的初始形态的至少一个特征,从而实现数字助手的形态切换。
请参阅图28,图28为本申请实施例提供的一种切换显示数字助手的形态的示意图。如图28中的(a)所示,在智慧电视中的数字助手被激活时,用户在无感知区A1,此时智慧电视上显示数字助手的呼吸态,即圆环体沿着z轴上下呈现规律性移动。当用户一直在无感知区A1内走动时,智慧电视因为不会感知无感知区A1的空间输入,因此,用户的走动不会对数字助手的形态产生影响,即智慧电视始终显示数字助手的呼吸态。
如图28中的(b)所示,随着用户从无感知区A1移动到存在感知区A2,智慧电视检测到用户的空间交互行为走近智慧电视,且用户所在位置处于存在感知区A2,因此智慧电视将数字助手切换为唤醒态。在数字助手的形态为唤醒态时,圆环体固定在球体的赤道平面,并跟随用户的移动而围绕z轴旋转。并且,圆环体上的光效区域指向用户。
此外,当用户在存在感知区A2内移动从而调整相对电视机的方位时,数字助手中的圆环体跟随人体的旋转而旋转,并将光效区域始终朝向用户。请参阅图29,图29为本申请实施例提供的一种数字助手在唤醒态的变化示意图。如图29中的(a)所示,当用户位于存在感知区A2内时,数字助手处于唤醒态,且数字助手中的圆环体上的光效区域指向用户;如图29中的(b)所示,当用户在存在感知区A2内走动,并从左边移动至右边时,数字助手仍处于唤醒态,且数字助手中的圆环体跟随用户移动并使得圆环体上的光效区域始终指向用户,即圆环体上的光效区域也从左边移动至右边。并且,当用户持续性的靠近智慧电视时,智慧电视检测到用户的走向输入,圆环体还可以绕y轴旋转,产生俯仰方向的变化。当用户始终在存在感知区A2内移动时,球体和形象本体的运动规律可以不发生变化。
当用户从存在感知区A2进入到自然行为区A3以内(包括自然行为区A3和典型任务区A4),数字助手的形态从唤醒态切换到聆听态,圆环体的位置和运动规律发生变化。首先,圆环体从球体的赤道平面绕y轴旋转90度,转动至yoz平面上,同时可以发生光效的改变,整个圆环体都填充光效。球体和形象体的运动规律变化,例如,球体和形象体绕y轴旋转90度一次后,再绕z轴不停旋转。如图30所示,当用户从存在感知区A2进入到自然行为区A3之后,智慧电视上所显示的数字助手从唤醒态切换到聆听态,表示数字助手正在接收用户所下发的语音指令。
在数字助手处于聆听态的情况下,当用户输入语音指令时,例如语音指令为“今天天气怎么样?”,此时数字助手会对语音指令进行解析和执行相应的操作。在数字助手对语音指令进行解析和执行相应的操作的过程中,智慧电视上可以是显示数字助手的思考态,思考态的显示时长与数字助手对语音指令进行解析和执行相应的操作的时间长度呈正相关。
此外,在用户位于自然行为区A3的情况下,数字助手处于聆听态,用户在向数字助手输入语音指令时,往往还需要先输入作为唤醒指令的语音指令,例如“你好,小艺”或“小艺小艺”,以便于数字助手接收并解析唤醒指令后的语音指令。但是,当用户进入典型任务区A4后,数字助手仍然处于聆听态,但是用户可以直接向数字助手输入语音指令,而不再需要输入作为唤醒指令的语音指令。
需要说明的是,在数字助手接收并解析得到用户的语音指令后,数字助手可以根据用户的语音指令的内容来改变本身的形态显示。例如,在数字助手解析得到语音指令的内容为播放音乐时,数字助手的形态可以是跟随音乐的节奏发生颜色上的动态变化或者是发生转动节奏上的动态变化,即数字助手在执行音乐播放的操作时也会相应调整自身的形态显示。又例如,在数字助手解析得到语音指令的内容为开灯、关灯或播报天气时,数字助手的形态可以跟随开灯动作、关灯动作或播报的天气类型来发生亮度、颜色或材质上的变化。
此外,在数字助手解析语音指令并基于语音指令的内容执行相应任务的时候,数字助手可以根据自身执行任务的进度情况来改变本身的形态显示,比如数字助手根据任务的进度情况来逐渐改变圆环体的材质、改变圆环体被特定材质填充的面积等。
以上介绍了用户从距离智慧电视较远的区域移动至距离智慧电视较近的区域时,智慧电视上所显示的数字助手的形态变化情况。实际上,用户也可能会从距离智慧电视较近的区域移动至距离智慧电视较远的区域,在这种情况下,智慧电视可以根据用户的空间交互行为以及用户移动后所位于的区域来决定 显示数字助手的形态,从而实现数字助手的形态切换。例如,当用户从自然行为区A3移动至存在感知区A2时,智慧电视上所显示的数字助手从聆听态切换到唤醒态;当用户从存在感知区A2移动至无感知区A1时,智慧电视上所显示的数字助手从唤醒态切换到呼吸态。
以上介绍了数字助手的本体形象是由圆环体、球体和形象本体构成的情况下,数字助手的形态变化过程。以下将介绍数字助手的本体形象是由虚拟人体构成的情况下,数字助手的形态变化过程。
具体来说,数字助手的本体形象可以为一个虚拟人体,且数字助手的形态也是由多个特征体构成的。多个特征体可以是按照人体部位进行划分的,即不同的特征体对应于虚拟人体中的不同部位。例如,多个特征体可以划分为头部特征体、眼部特征体、嘴部特征体、左手特征体、右手特征体等。
在数字助手的本体形象为虚拟人体的情况下,智慧电视同样可以以智慧电视所在位置为圆心,将智慧电视附近的空间划分为多个区域。这样,在智慧电视中的数字助手被激活后,智慧电视可以根据用户在空间中所处的区域以及用户的空间交互行为,来显示数字助手相应的形态。
具体地,本实施例中可以将智慧电视附近的空间划分为3个区域,区域1为距离智慧电视1.2-3.7米的区域,区域2为距离智慧电视0.5-1.2米的区域,区域3为距离智慧电视小于0.5米的区域。
当智慧电视识别到用户处于区域1时,智慧电视可以认为用户与数字助手之间的距离为一个识别距离,且智慧电视获取到的用户的空间交互行为包括:用户面对数字助手、用户的注视点在数字助手上且用户注视数字助手的时长大于3秒,则智慧电视可以显示数字助手的形态为具有表情和动作的形态。具体地,数字助手的形态可以为代表数字助手的虚拟人体看向用户,对用户眨眼微笑,且身体和头部微微摆动,表明数字助手有意愿与用户交互。此外,如果存在多个用户位于区域1,且多个用户均具有以上的空间交互行为,则可以是以1.2米作为划分距离,控制代表数字助手的虚拟人体朝不同的用户方向轮流投去目光。
当智慧电视识别到用户处于区域2时,智慧电视可以认为用户与数字助手之间的距离为一个打招呼距离,且智慧电视获取到的用户的空间交互行为包括:用户面对数字助手、用户的注视点在数字助手上且用户注视数字助手的时长大于3秒,则智慧电视可以显示数字助手的形态为具有表情和动作的形态,并且智慧电视输出数字助手对应的语音。具体地,数字助手的形态可以为代表数字助手的虚拟人体向用户挥手并招呼迎接用户;此外,智慧电视所输出的数字助手的语音可以为询问用户的需求,例如数字助手的语音具体可以为“请问有什么可以为您效劳的呢”。
当智慧电视识别到用户处于区域3时,智慧电视可以认为用户与数字助手之间的距离为一个手动触屏操作距离,且智慧电视获取到的用户的空间交互行为包括:用户面对数字助手、用户的注视点在数字助手上且用户注视数字助手的时长大于3秒,则智慧电视可以在显示屏幕上显示需要触屏操作的任务信息,并且将数字助手的本体形象等比例缩小。具体地,数字助手的本体形象在显示屏幕上的投影身高可以为H=本体形象的实际身高*0.5/1.2。其中,H是数字助手的本体形象在屏幕上的投影身高,数字助手的本体形象的实际身高是代表数字助手的虚拟人体的真实身高,男性虚拟人体的真实身高例如可以为174cm,女性虚拟人体的真实身高例如可以为161cm,具体取值可根据实际情况调整,本实施例对此并不做具体限定。
以上详细介绍了本申请实施例提供的方法,接下来将介绍本申请实施例提供的用于执行上述方法的设备。
请参阅图31,图31为本申请实施例提供的一种数字助手虚拟形象的显示装置的结构示意图。如图31所示,该数字助手虚拟形象的显示装置,应用于电子设备,虚拟形象包括多个特征体。该装置包括:
显示模块3101,用于在第一界面上以第一形态显示虚拟形象;
显示模块3101,还用于检测到电子设备产生的信号指令,以第二形态显示虚拟形象;
其中,信号指令为基于第一用户输入产生的,第一形态和第二形态展示多个特征体中的至少一个,从第一形态变化为第二形态是,至少一个特征体发生形状变化、位置变化、运动规律变化、显示面积变化中的至少一个。
在一种可能的实现方式中,多个特征体包括第一特征体,第一特征体的形状变化包括以下变化中的 至少一种:放大、缩小、拉伸、压缩、旋转,以及在主形态、几何阵列或长条形之间切换。
在一种可能的实现方式中,多个特征体还包括第二特征体,其中,第一特征体的主形态为一圆环,第二特征体为一球体,圆环的圆心与球体的球心共点;
第一特征体的位置变化包括,围绕球体翻转后的轨道位置变化、虚拟形象整体移动或旋转所导致的位置变化中的至少一种;
第一特征体的运动规律包括,圆环以圆心为支点,按照预设的轨迹环绕球体转动;
第二特征体的运动规律包括,球体沿着经过圆环的圆心、且垂直于圆环所在平面的直线按照预设的规律做往复运动。
在一种可能的实现方式中,显示模块3101,还用于检测到第二用户输入,在第一特征体上显示第三特征体,第三特征体能够在第一特征体上移动并指示用户的方位。
在一种可能的实现方式中,虚拟形象具有呼吸态,在第一界面上以第一形态显示虚拟形象,包括:
若第一界面为熄屏界面,则第一形态为包括第一特征体和第二特征体的呼吸态,且第一特征体以主形态显示。
在一种可能的实现方式中,在第一界面为熄屏界面时,呼吸态的虚拟形象在屏幕上按照预设的规律做往复运动。
在一种可能的实现方式中,在第一界面为熄屏界面,第一形态为呼吸态时,显示模块3101,还用于:
检测到第二用户输入,在第一特征体上显示第三特征体,第三特征体能够在第一特征体上移动并指示用户的方位。
在一种可能的实现方式中,虚拟形象具有待唤醒态和聆听态,第一界面为锁屏界面或解锁后的任意屏幕界面;
在第一界面上以第一形态显示虚拟形象,包括:
以待唤醒态作为第一形态显示在第一界面上,待唤醒态包括第一特征体,且第一特征体作为导航提示线以长条形显示,导航提示线用于作为屏幕手势导航的基准线;其中,在未接收到唤醒数字助手的输入时,待唤醒态在第一界面常驻显示;
或者,
检测到第三用户输入,以聆听态作为第一形态显示在第一界面上,第三用户输入用于唤醒数字助手,聆听态包括第一特征体和第二特征体,且第一特征体以主形态显示。
在一种可能的实现方式中,第一界面为锁屏界面或解锁后的任意屏幕界面,第一形态为聆听态;
在第一界面上以第一形态显示虚拟形象,具体包括:
第一特征体先以几何阵列显示;
第一特征体再由几何阵列变化为主形态,并显示第二特征体。
在一种可能的实现方式中,第一特征体先以几何阵列显示,具体为:
在初始显示位置显示部分阵列的几何图形,再在整个屏幕显示几何图形。
在一种可能的实现方式中,初始显示位置是基于用户输入确定的;
且在未检测到用户输入时,初始位置为显示虚拟形象的区域。
在一种可能的实现方式中,在第一界面上以第一形态显示虚拟形象,具体为:
在第一界面上的预设位置显示虚拟形象;或者,
将虚拟形象覆盖显示在第一界面内的目标内容元素上,其中,目标内容元素为预设类型的内容元素。
在一种可能的实现方式中,当虚拟形象覆盖显示在第一界面内的目标内容元素上时,将目标内容元素的手势事件接口复制到虚拟形象上,以使得对虚拟形象执行手势能够实现目标内容元素的手势对应功能。
在一种可能的实现方式中,第一界面为相机界面,目标内容元素为相机界面内的快门控件。
在一种可能的实现方式中,第一界面为锁屏界面或解锁后的任意屏幕界面,第一形态为待唤醒态,第二形态为聆听态;信号指令是基于用户对第一形态的数字助手执行预设手势后产生。
在一种可能的实现方式中,多个特征体还包括第二特征体;
在第二形态下,第一特征体为圆环的主形态,第二特征体为球体,且圆环位于球体的俯视视角的赤 道上。
在一种可能的实现方式中,虚拟形象还具有任务执行态,任务执行态包括第一特征体;
在数字助手执行自动化任务过程中的至少一个过程界面,任务执行态下的第一特征体对应产生形状变化、位置变化中的至少一种。
在一种可能的实现方式中,第一特征体为几何阵列,且几何阵列的显示位置与过程界面内的目标控件相匹配,用于模拟目标控件的点击事件以进入下一过程界面。
在一种可能的实现方式中,第一形态为聆听态,第二形态为任务执行态,聆听态和任务执行态均包括第一特征体,且聆听态和任务执行态下的第一特征体分别具有不同的形态;
在任务执行态下,第一特征体的形态包括:界面内的几何阵列、对话卡片的外框、调用数字助手输入时输入框的外框或任务结果页的外框。
在一种可能的实现方式中,在自动化任务的不同任务进度下,第一特征体具有不同的形态变化。
在一种可能的实现方式中,在自动化任务为智慧识别任务时,第一特征体由铺满屏幕的几何阵列变化为仅显示在识别到的可执行对象上的几何阵列。
在一种可能的实现方式中,在任务执行态下,第一特征体的形态变化包括:
第一特征体先以圆环的主形态显示在第二特征体的外围;
第一特征体再由圆环脱离第二特征体,并放大显示,以作为对话卡片的外框;
第一特征体由对话卡片的外框变为对话气泡内的几何阵列;
在自动化任务执行完成后,虚拟形象以聆听态显示。
在一种可能的实现方式中,装置还包括:
获取模块3102,用于获取场景信息,并基于场景信息以目标形态显示多个特征体中的第一特征体。
在一种可能的实现方式中,虚拟形象具有待唤醒态,待唤醒态至少包括第一特征体,第一特征体以长条形显示;
显示模块3101,还用于响应于检测到人眼视线焦点落在屏幕所在区域,显示第二特征体。
在一种可能的实现方式中,显示模块3101,还用于响应于检测到人声,使得第二特征体相对第一特征体向靠近人声声源的方向移动。
在一种可能的实现方式中,电子设备为智能汽车,显示模块3101,还用于:
通过多个特征体中第一特征体的形变或运动来指示车载功能或告警信息。
请参阅图32,图32为本申请实施例提供的执行设备的一种结构示意图,执行设备3200具体可以表现为手机、平板、笔记本电脑、智能穿戴设备、服务器等,此处不做限定。具体的,执行设备3200包括:接收器3201、发射器3202、处理器3203和存储器3204(其中执行设备3200中的处理器3203的数量可以一个或多个,图32中以一个处理器为例),其中,处理器3203可以包括应用处理器32031和通信处理器32032。在本申请的一些实施例中,接收器3201、发射器3202、处理器3203和存储器3204可通过总线或其它方式连接。
存储器3204可以包括只读存储器和随机存取存储器,并向处理器3203提供指令和数据。存储器3204的一部分还可以包括非易失性随机存取存储器(non-volatile random access memory,NVRAM)。存储器3204存储有处理器和操作指令、可执行模块或者数据结构,或者它们的子集,或者它们的扩展集,其中,操作指令可包括各种操作指令,用于实现各种操作。
处理器3203控制执行设备的操作。具体的应用中,执行设备的各个组件通过总线系统耦合在一起,其中总线系统除包括数据总线之外,还可以包括电源总线、控制总线和状态信号总线等。但是为了清楚说明起见,在图中将各种总线都称为总线系统。
上述本申请实施例揭示的方法可以应用于处理器3203中,或者由处理器3203实现。处理器3203可以是一种集成电路芯片,具有信号的处理能力。在实现过程中,上述方法的各步骤可以通过处理器3203中的硬件的集成逻辑电路或者软件形式的指令完成。上述的处理器3203可以是通用处理器、数字信号处理器(digital signal processing,DSP)、微处理器或微控制器,还可进一步包括专用集成电路(application specific integrated circuit,ASIC)、现场可编程门阵列(field-programmable gate array,FPGA)或者其他 可编程逻辑器件、分立门或者晶体管逻辑器件、分立硬件组件。
该处理器3203可以实现或者执行本申请实施例中的公开的各方法、步骤及逻辑框图。通用处理器可以是微处理器或者该处理器也可以是任何常规的处理器等。结合本申请实施例所公开的方法的步骤可以直接体现为硬件译码处理器执行完成,或者用译码处理器中的硬件及软件模块组合执行完成。软件模块可以位于随机存储器,闪存、只读存储器,可编程只读存储器或者电可擦写可编程存储器、寄存器等本领域成熟的存储介质中。该存储介质位于存储器3204,处理器3203读取存储器3204中的信息,结合其硬件完成上述方法的步骤。
接收器3201可用于接收输入的数字或字符信息,以及产生与执行设备的相关设置以及功能控制有关的信号输入。发射器3202可用于通过第一接口输出数字或字符信息;发射器3202还可用于通过第一接口向磁盘组发送指令,以修改磁盘组中的数据;发射器3202还可以包括显示屏等显示设备。
本申请实施例提供的电子设备具体可以为芯片,芯片包括:处理单元和通信单元,处理单元例如可以是处理器,通信单元例如可以是输入/输出接口、管脚或电路等。该处理单元可执行存储单元存储的计算机执行指令,以使执行设备内的芯片执行上述实施例描述的方法。可选地,存储单元为芯片内的存储单元,如寄存器、缓存等,存储单元还可以是无线接入设备端内的位于芯片外部的存储单元,如只读存储器(read-only memory,ROM)或可存储静态信息和指令的其他类型的静态存储设备,随机存取存储器(random access memory,RAM)等。
可以参阅图33,图33为本申请实施例提供的一种计算机可读存储介质的结构示意图。本申请还提供了一种计算机可读存储介质,在一些实施例中,上述图3所公开的方法可以实施为以机器可读格式被编码在计算机可读存储介质上或者被编码在其它非瞬时性介质或者制品上的计算机程序指令。
图33示意性地示出根据这里展示的至少一些实施例而布置的示例计算机可读存储介质的概念性局部视图,示例计算机可读存储介质包括用于在计算设备上执行计算机进程的计算机程序。
在一个实施例中,计算机可读存储介质3300是使用信号承载介质3301来提供的。信号承载介质3301可以包括一个或多个程序指令3302,其当被一个或多个处理器运行时可以提供以上针对图3描述的功能或者部分功能。
在一些示例中,信号承载介质3301可以包含计算机可读介质3303,诸如但不限于,硬盘驱动器、紧密盘(CD)、数字视频光盘(DVD)、数字磁带、存储器、ROM或RAM等等。
在一些实施方式中,信号承载介质3301可以包含计算机可记录介质3304,诸如但不限于,存储器、读/写(R/W)CD、R/W DVD、等等。在一些实施方式中,信号承载介质3301可以包含通信介质3305,诸如但不限于,数字和/或模拟通信介质(例如,光纤电缆、波导、有线通信链路、无线通信链路、等等)。因此,例如,信号承载介质3301可以由无线形式的通信介质3305(例如,遵守IEEE 802.X标准或者其它传输协议的无线通信介质)来传达。
一个或多个程序指令3302可以是,例如,计算机可执行指令或者逻辑实施指令。在一些示例中,计算设备的计算设备可以被配置为,响应于通过计算机可读介质3303、计算机可记录介质3304、和/或通信介质3305中的一个或多个传达到计算设备的程序指令3302,提供各种操作、功能、或者动作。
另外需说明的是,以上所描述的装置实施例仅仅是示意性的,其中作为分离部件说明的单元可以是或者也可以不是物理上分开的,作为单元显示的部件可以是或者也可以不是物理单元,即可以位于一个地方,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部模块来实现本实施例方案的目的。另外,本申请提供的装置实施例附图中,模块之间的连接关系表示它们之间具有通信连接,具体可以实现为一条或多条通信总线或信号线。
通过以上的实施方式的描述,所属领域的技术人员可以清楚地了解到本申请可借助软件加必需的通用硬件的方式来实现,当然也可以通过专用硬件包括专用集成电路、专用CPU、专用存储器、专用元器件等来实现。一般情况下,凡由计算机程序完成的功能都可以很容易地用相应的硬件来实现,而且,用来实现同一功能的具体硬件结构也可以是多种多样的,例如模拟电路、数字电路或专用电路等。但是,对本申请而言更多情况下软件程序实现是更佳的实施方式。基于这样的理解,本申请的技术方案本质上或者说对现有技术做出贡献的部分可以以软件产品的形式体现出来,该计算机软件产品存储在可读取的 存储介质中,如计算机的软盘、U盘、移动硬盘、ROM、RAM、磁碟或者光盘等,包括若干指令用以使得一台计算机设备(可以是个人计算机,训练设备,或者网络设备等)执行本申请各个实施例的方法。
在上述实施例中,可以全部或部分地通过软件、硬件、固件或者其任意组合来实现。当使用软件实现时,可以全部或部分地以计算机程序产品的形式实现。
计算机程序产品包括一个或多个计算机指令。在计算机上加载和执行计算机程序指令时,全部或部分地产生按照本申请实施例的流程或功能。计算机可以是通用计算机、专用计算机、计算机网络、或者其他可编程装置。计算机指令可以存储在计算机可读存储介质中,或者从一个计算机可读存储介质向另一计算机可读存储介质传输,例如,计算机指令可以从一个网站站点、计算机、训练设备或数据中心通过有线(例如同轴电缆、光纤、数字用户线)或无线(例如红外、无线、微波等)方式向另一个网站站点、计算机、训练设备或数据中心进行传输。计算机可读存储介质可以是计算机能够存储的任何可用介质或者是包含一个或多个可用介质集成的训练设备、数据中心等数据存储设备。可用介质可以是磁性介质,(例如,软盘、硬盘、磁带)、光介质(例如,DVD)、或者半导体介质(例如固态硬盘(Solid State Disk,SSD))等。

Claims (55)

  1. 一种数字助手虚拟形象的显示方法,应用于电子设备,其特征在于,所述虚拟形象包括多个特征体,所述方法包括:
    在第一界面上以第一形态显示所述虚拟形象;
    检测到所述电子设备产生的信号指令,以第二形态显示所述虚拟形象;
    其中,所述信号指令为基于第一用户输入产生的,所述第一形态和所述第二形态展示所述多个特征体中的至少一个,从所述第一形态变化为所述第二形态是,至少一个特征体发生形状变化、位置变化、运动规律变化、显示面积变化中的至少一个。
  2. 根据权利要求1所述的方法,其特征在于,所述多个特征体包括第一特征体,所述第一特征体的形状变化包括以下变化中的至少一种:放大、缩小、拉伸、压缩、旋转,以及在主形态、几何阵列或长条形之间切换。
  3. 根据权利要求2所述的方法,其特征在于,所述多个特征体还包括第二特征体,其中,所述第一特征体的主形态为一圆环,所述第二特征体为一球体,所述圆环的圆心与所述球体的球心共点;
    所述第一特征体的位置变化包括,围绕所述球体翻转后的轨道位置变化、所述虚拟形象整体移动或旋转所导致的位置变化中的至少一种;
    所述第一特征体的运动规律包括,所述圆环以圆心为支点,按照预设的轨迹环绕所述球体转动;
    所述第二特征体的运动规律包括,所述球体沿着经过所述圆环的圆心、且垂直于所述圆环所在平面的直线按照预设的规律做往复运动。
  4. 根据权利要求3所述的方法,其特征在于,所述方法还包括:
    检测到第二用户输入,在所述第一特征体上显示第三特征体,所述第三特征体能够在所述第一特征体上移动并指示用户的方位。
  5. 根据权利要求1-3任意一项所述的方法,其特征在于,所述虚拟形象具有呼吸态,所述在第一界面上以第一形态显示所述虚拟形象,包括:
    若所述第一界面为熄屏界面,则所述第一形态为包括第一特征体和第二特征体的呼吸态,且所述第一特征体以主形态显示。
  6. 根据权利要求5所述的方法,其特征在于,在所述第一界面为熄屏界面时,所述呼吸态的虚拟形象在屏幕上按照预设的规律做往复运动。
  7. 根据权利要求5所述的方法,其特征在于,在所述第一界面为熄屏界面,所述第一形态为呼吸态时,所述方法还包括:
    检测到第二用户输入,在所述第一特征体上显示第三特征体,所述第三特征体能够在所述第一特征体上移动并指示用户的方位。
  8. 根据权利要求1-4任意一项所述的方法,其特征在于,所述虚拟形象具有待唤醒态和聆听态,所述第一界面为锁屏界面或解锁后的任意屏幕界面;
    所述在第一界面上以第一形态显示所述虚拟形象,包括:
    以待唤醒态作为第一形态显示在所述第一界面上,所述待唤醒态包括第一特征体,且所述第一特征体作为导航提示线以长条形显示,所述导航提示线用于作为屏幕手势导航的基准线;其中,在未接收到唤醒数字助手的输入时,所述待唤醒态在所述第一界面常驻显示;
    或者,
    检测到第三用户输入,以所述聆听态作为所述第一形态显示在所述第一界面上,所述第三用户输入用于唤醒所述数字助手,所述聆听态包括第一特征体和第二特征体,且所述第一特征体以主形态显示。
  9. 根据权利要求8所述的方法,其特征在于,所述第一界面为锁屏界面或解锁后的任意屏幕界面,所述第一形态为聆听态;
    所述在第一界面上以第一形态显示所述虚拟形象,具体包括:
    所述第一特征体先以几何阵列显示;
    所述第一特征体再由几何阵列变化为主形态,并显示所述第二特征体。
  10. 根据权利要求9所述的方法,其特征在于,所述第一特征体先以几何阵列显示,具体为:
    在初始显示位置显示部分阵列的几何图形,再在整个屏幕显示所述几何图形。
  11. 根据权利要求10所述的方法,其特征在于,
    所述初始显示位置是基于用户输入确定的;
    且在未检测到用户输入时,所述初始位置为显示所述虚拟形象的区域。
  12. 根据权利要求1-11任意一项所述的方法,其特征在于,所述在第一界面上以第一形态显示所述虚拟形象,具体为:
    在所述第一界面上的预设位置显示所述虚拟形象;或者,
    将所述虚拟形象覆盖显示在所述第一界面内的目标内容元素上,其中,所述目标内容元素为预设类型的内容元素。
  13. 根据权利要求12所述的方法,其特征在于,当所述虚拟形象覆盖显示在所述第一界面内的目标内容元素上时,将所述目标内容元素的手势事件接口复制到所述虚拟形象上,以使得对所述虚拟形象执行手势能够实现所述目标内容元素的手势对应功能。
  14. 根据权利要求13所述的方法,其特征在于,所述第一界面为相机界面,所述目标内容元素为相机界面内的快门控件。
  15. 根据权利要求8所述的方法,其特征在于,所述第一界面为锁屏界面或解锁后的任意屏幕界面,所述第一形态为待唤醒态,所述第二形态为聆听态;所述信号指令是基于用户对所述第一形态的数字助手执行预设手势后产生。
  16. 根据权利要求15所述的方法,其特征在于,所述多个特征体还包括第二特征体;
    在所述第二形态下,所述第一特征体为圆环的主形态,所述第二特征体为球体,且所述圆环位于所述球体的俯视视角的赤道上。
  17. 根据权利要求8所述的方法,其特征在于,所述虚拟形象还具有任务执行态,所述任务执行态包括所述第一特征体;
    在所述数字助手执行自动化任务过程中的至少一个过程界面,所述任务执行态下的所述第一特征体对应产生形状变化、位置变化中的至少一种。
  18. 根据权利要求17所述的方法,其特征在于,所述第一特征体为几何阵列,且所述几何阵列的显示位置与过程界面内的目标控件相匹配,用于模拟所述目标控件的点击事件以进入下一过程界面。
  19. 根据权利要求17所述的方法,其特征在于,所述第一形态为聆听态,所述第二形态为任务执行态,所述聆听态和所述任务执行态均包括所述第一特征体,且所述聆听态和所述任务执行态下的第一特征体分别具有不同的形态;
    在所述任务执行态下,所述第一特征体的形态包括:界面内的几何阵列、对话卡片的外框、调用数字助手输入时输入框的外框或任务结果页的外框。
  20. 根据权利要求19所述的方法,其特征在于,在自动化任务的不同任务进度下,所述第一特征体具有不同的形态变化。
  21. 根据权利要求19所述的方法,其特征在于,在所述自动化任务为智慧识别任务时,所述第一特征体由铺满屏幕的几何阵列变化为仅显示在识别到的可执行对象上的几何阵列。
  22. 根据权利要求19或20所述的方法,其特征在于,在所述任务执行态下,所述第一特征体的形态变化包括:
    所述第一特征体先以圆环的主形态显示在第二特征体的外围;
    所述第一特征体再由圆环脱离所述第二特征体,并放大显示,以作为对话卡片的外框;
    所述第一特征体由所述对话卡片的外框变为对话气泡内的几何阵列;
    在自动化任务执行完成后,所述虚拟形象以聆听态显示。
  23. 根据权利要求1-4所述的方法,其特征在于,所述方法还包括:
    获取场景信息,并基于所述场景信息以目标形态显示所述多个特征体中的第一特征体。
  24. 根据权利要求1-4所述的方法,其特征在于,所述虚拟形象具有待唤醒态,所述待唤醒态至少包括第一特征体,所述第一特征体以长条形显示;
    响应于检测到人眼视线焦点落在屏幕所在区域,显示第二特征体。
  25. 根据权利要求24所述的方法,其特征在于,所述方法还包括:
    响应于检测到人声,所述第二特征体相对所述第一特征体向靠近人声声源的方向移动。
  26. 根据权利要求1-4所述的方法,其特征在于,所述电子设备为智能汽车,所述方法还包括:
    通过所述多个特征体中第一特征体的形变或运动来指示车载功能或告警信息。
  27. 一种数字助手虚拟形象的显示装置,应用于电子设备,其特征在于,所述虚拟形象包括多个特征体;所述装置包括:
    显示模块,用于在第一界面上以第一形态显示所述虚拟形象;
    所述显示模块,还用于检测到所述电子设备产生的信号指令,以第二形态显示所述虚拟形象;
    其中,所述信号指令为基于第一用户输入产生的,所述第一形态和所述第二形态展示所述多个特征体中的至少一个,从所述第一形态变化为所述第二形态是,至少一个特征体发生形状变化、位置变化、运动规律变化、显示面积变化中的至少一个。
  28. 根据权利要求27所述的装置,其特征在于,所述多个特征体包括第一特征体,所述第一特征体的形状变化包括以下变化中的至少一种:放大、缩小、拉伸、压缩、旋转,以及在主形态、几何阵列或长条形之间切换。
  29. 根据权利要求28所述的装置,其特征在于,所述多个特征体还包括第二特征体,其中,所述第一特征体的主形态为一圆环,所述第二特征体为一球体,所述圆环的圆心与所述球体的球心共点;
    所述第一特征体的位置变化包括,围绕所述球体翻转后的轨道位置变化、所述虚拟形象整体移动或旋转所导致的位置变化中的至少一种;
    所述第一特征体的运动规律包括,所述圆环以圆心为支点,按照预设的轨迹环绕所述球体转动;
    所述第二特征体的运动规律包括,所述球体沿着经过所述圆环的圆心、且垂直于所述圆环所在平面的直线按照预设的规律做往复运动。
  30. 根据权利要求29所述的装置,其特征在于,
    所述显示模块,还用于检测到第二用户输入,在所述第一特征体上显示第三特征体,所述第三特征体能够在所述第一特征体上移动并指示用户的方位。
  31. 根据权利要求27-29任意一项所述的装置,其特征在于,所述虚拟形象具有呼吸态,所述在第一界面上以第一形态显示所述虚拟形象,包括:
    若所述第一界面为熄屏界面,则所述第一形态为包括第一特征体和第二特征体的呼吸态,且所述第一特征体以主形态显示。
  32. 根据权利要求31所述的装置,其特征在于,在所述第一界面为熄屏界面时,所述呼吸态的虚拟形象在屏幕上按照预设的规律做往复运动。
  33. 根据权利要求31所述的装置,其特征在于,在所述第一界面为熄屏界面,所述第一形态为呼吸态时,所述显示模块,还用于:
    检测到第二用户输入,在所述第一特征体上显示第三特征体,所述第三特征体能够在所述第一特征体上移动并指示用户的方位。
  34. 根据权利要求27-30任意一项所述的装置,其特征在于,所述虚拟形象具有待唤醒态和聆听态,所述第一界面为锁屏界面或解锁后的任意屏幕界面;
    所述在第一界面上以第一形态显示所述虚拟形象,包括:
    以待唤醒态作为第一形态显示在所述第一界面上,所述待唤醒态包括第一特征体,且所述第一特征体作为导航提示线以长条形显示,所述导航提示线用于作为屏幕手势导航的基准线;其中,在未接收到唤醒数字助手的输入时,所述待唤醒态在所述第一界面常驻显示;
    或者,
    检测到第三用户输入,以所述聆听态作为所述第一形态显示在所述第一界面上,所述第三用户输入用于唤醒所述数字助手,所述聆听态包括第一特征体和第二特征体,且所述第一特征体以主形态显示。
  35. 根据权利要求34所述的装置,其特征在于,所述第一界面为锁屏界面或解锁后的任意屏幕界面, 所述第一形态为聆听态;
    所述在第一界面上以第一形态显示所述虚拟形象,具体包括:
    所述第一特征体先以几何阵列显示;
    所述第一特征体再由几何阵列变化为主形态,并显示所述第二特征体。
  36. 根据权利要求35所述的装置,其特征在于,所述第一特征体先以几何阵列显示,具体为:
    在初始显示位置显示部分阵列的几何图形,再在整个屏幕显示所述几何图形。
  37. 根据权利要求36所述的装置,其特征在于,
    所述初始显示位置是基于用户输入确定的;
    且在未检测到用户输入时,所述初始位置为显示所述虚拟形象的区域。
  38. 根据权利要求27-37任意一项所述的装置,其特征在于,所述在第一界面上以第一形态显示所述虚拟形象,具体为:
    在所述第一界面上的预设位置显示所述虚拟形象;或者,
    将所述虚拟形象覆盖显示在所述第一界面内的目标内容元素上,其中,所述目标内容元素为预设类型的内容元素。
  39. 根据权利要求38所述的装置,其特征在于,当所述虚拟形象覆盖显示在所述第一界面内的目标内容元素上时,将所述目标内容元素的手势事件接口复制到所述虚拟形象上,以使得对所述虚拟形象执行手势能够实现所述目标内容元素的手势对应功能。
  40. 根据权利要求39所述的装置,其特征在于,所述第一界面为相机界面,所述目标内容元素为相机界面内的快门控件。
  41. 根据权利要求34所述的装置,其特征在于,所述第一界面为锁屏界面或解锁后的任意屏幕界面,所述第一形态为待唤醒态,所述第二形态为聆听态;所述信号指令是基于用户对所述第一形态的数字助手执行预设手势后产生。
  42. 根据权利要求39所述的装置,其特征在于,所述多个特征体还包括第二特征体;
    在所述第二形态下,所述第一特征体为圆环的主形态,所述第二特征体为球体,且所述圆环位于所述球体的俯视视角的赤道上。
  43. 根据权利要求34所述的装置,其特征在于,所述虚拟形象还具有任务执行态,所述任务执行态包括所述第一特征体;
    在所述数字助手执行自动化任务过程中的至少一个过程界面,所述任务执行态下的所述第一特征体对应产生形状变化、位置变化中的至少一种。
  44. 根据权利要求43所述的装置,其特征在于,所述第一特征体为几何阵列,且所述几何阵列的显示位置与过程界面内的目标控件相匹配,用于模拟所述目标控件的点击事件以进入下一过程界面。
  45. 根据权利要求43所述的装置,其特征在于,所述第一形态为聆听态,所述第二形态为任务执行态,所述聆听态和所述任务执行态均包括所述第一特征体,且所述聆听态和所述任务执行态下的第一特征体分别具有不同的形态;
    在所述任务执行态下,所述第一特征体的形态包括:界面内的几何阵列、对话卡片的外框、调用数字助手输入时输入框的外框或任务结果页的外框。
  46. 根据权利要求45所述的装置,其特征在于,在自动化任务的不同任务进度下,所述第一特征体具有不同的形态变化。
  47. 根据权利要求46所述的装置,其特征在于,在所述自动化任务为智慧识别任务时,所述第一特征体由铺满屏幕的几何阵列变化为仅显示在识别到的可执行对象上的几何阵列。
  48. 根据权利要求45或46所述的装置,其特征在于,在所述任务执行态下,所述第一特征体的形态变化包括:
    所述第一特征体先以圆环的主形态显示在第二特征体的外围;
    所述第一特征体再由圆环脱离所述第二特征体,并放大显示,以作为对话卡片的外框;
    所述第一特征体由所述对话卡片的外框变为对话气泡内的几何阵列;
    在自动化任务执行完成后,所述虚拟形象以聆听态显示。
  49. 根据权利要求27-30所述的装置,其特征在于,所述装置还包括:
    获取模块,用于获取场景信息,并基于所述场景信息以目标形态显示所述多个特征体中的第一特征体。
  50. 根据权利要求27-30所述的装置,其特征在于,所述虚拟形象具有待唤醒态,所述待唤醒态至少包括第一特征体,所述第一特征体以长条形显示;
    所述显示模块,还用于响应于检测到人眼视线焦点落在屏幕所在区域,显示第二特征体。
  51. 根据权利要求50所述的装置,其特征在于,
    所述显示模块,还用于响应于检测到人声,使得所述第二特征体相对所述第一特征体向靠近人声声源的方向移动。
  52. 根据权利要求27-30所述的装置,其特征在于,所述电子设备为智能汽车,所述显示模块,还用于:
    通过所述多个特征体中第一特征体的形变或运动来指示车载功能或告警信息。
  53. 一种数字助手虚拟形象的显示装置,其特征在于,包括存储器和处理器;所述存储器存储有代码,所述处理器被配置为执行所述代码,当所述代码被执行时,所述装置执行如权利要求1至26任意一项所述的方法。
  54. 一种计算机存储介质,其特征在于,所述计算机存储介质存储有指令,所述指令在由计算机执行时使得所述计算机实施权利要求1至26任意一项所述的方法。
  55. 一种计算机程序产品,其特征在于,所述计算机程序产品存储有指令,所述指令在由计算机执行时使得所述计算机实施权利要求1至26任意一项所述的方法。
PCT/CN2024/097229 2023-06-29 2024-06-04 一种数字助手虚拟形象的显示方法及相关装置 Ceased WO2025001769A1 (zh)

Priority Applications (1)

Application Number Priority Date Filing Date Title
EP24830416.4A EP4645047A4 (en) 2023-06-29 2024-06-04 METHOD FOR DISPLAYING A DIGITAL ASSISTANT AVATAR AND ASSOCIATED DEVICE

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
CN202310786258.8 2023-06-29
CN202310786258.8A CN117008762A (zh) 2023-06-29 2023-06-29 一种数字助手的显示方法及相关装置
CN202410178141.6 2024-02-08
CN202410178141.6A CN119225568A (zh) 2023-06-29 2024-02-08 一种数字助手虚拟形象的显示方法及相关装置

Related Child Applications (1)

Application Number Title Priority Date Filing Date
US19/433,973 Continuation US20260127793A1 (en) 2023-06-29 2025-12-29 Method for displaying virtual image of digital assistant and related apparatus

Publications (1)

Publication Number Publication Date
WO2025001769A1 true WO2025001769A1 (zh) 2025-01-02

Family

ID=93937318

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2024/097229 Ceased WO2025001769A1 (zh) 2023-06-29 2024-06-04 一种数字助手虚拟形象的显示方法及相关装置

Country Status (2)

Country Link
EP (1) EP4645047A4 (zh)
WO (1) WO2025001769A1 (zh)

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109358923A (zh) * 2018-08-29 2019-02-19 华为技术有限公司 一种虚拟机器人形象的呈现方法及装置
CN110825469A (zh) * 2019-09-18 2020-02-21 华为技术有限公司 语音助手显示方法及装置
US20200210142A1 (en) * 2018-12-29 2020-07-02 Baidu Online Network Technology (Beijing) Co., Ltd. Method and apparatus for controlling virtual speech assistant, user device and storage medium
CN116168701A (zh) * 2023-02-24 2023-05-26 上海传英信息技术有限公司 交互方法、智能终端及存储介质
CN116204253A (zh) * 2021-11-30 2023-06-02 华为技术有限公司 一种语音助手显示方法及相关装置
CN117008762A (zh) * 2023-06-29 2023-11-07 华为技术有限公司 一种数字助手的显示方法及相关装置

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9338493B2 (en) * 2014-06-30 2016-05-10 Apple Inc. Intelligent automated assistant for TV user interactions
US12045437B2 (en) * 2020-05-22 2024-07-23 Apple Inc. Digital assistant user interfaces and response modes
US11955137B2 (en) * 2021-03-11 2024-04-09 Apple Inc. Continuous dialog with a digital assistant

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109358923A (zh) * 2018-08-29 2019-02-19 华为技术有限公司 一种虚拟机器人形象的呈现方法及装置
US20200210142A1 (en) * 2018-12-29 2020-07-02 Baidu Online Network Technology (Beijing) Co., Ltd. Method and apparatus for controlling virtual speech assistant, user device and storage medium
CN110825469A (zh) * 2019-09-18 2020-02-21 华为技术有限公司 语音助手显示方法及装置
CN116204253A (zh) * 2021-11-30 2023-06-02 华为技术有限公司 一种语音助手显示方法及相关装置
CN116168701A (zh) * 2023-02-24 2023-05-26 上海传英信息技术有限公司 交互方法、智能终端及存储介质
CN117008762A (zh) * 2023-06-29 2023-11-07 华为技术有限公司 一种数字助手的显示方法及相关装置

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See also references of EP4645047A1

Also Published As

Publication number Publication date
EP4645047A1 (en) 2025-11-05
EP4645047A4 (en) 2026-04-22

Similar Documents

Publication Publication Date Title
US11500510B2 (en) Information processing apparatus and non-transitory computer readable medium
CN107801413B (zh) 对电子设备进行控制的终端及其处理方法
CN109739361B (zh) 基于眼睛跟踪的可见度提高方法和电子装置
US12039161B2 (en) Electronic device comprising a plurality of touch screen displays and screen division method
WO2021057830A1 (zh) 一种信息处理方法及电子设备
WO2021244443A1 (zh) 分屏显示方法、电子设备及计算机可读存储介质
US11886673B2 (en) Trackpad on back portion of a device
US20180274936A1 (en) Method and apparatus for providing augmented reality function in electronic device
US20250265755A1 (en) Hand-tracked text selection and modification
TWI659333B (zh) 用以處理與移動相關的資料之計算裝置及方法
CN112181207B (zh) 一种显示设备及几何图形识别方法
US10936184B2 (en) Display apparatus and controlling method thereof
CN119225568A (zh) 一种数字助手虚拟形象的显示方法及相关装置
EP4589977A1 (en) Gesture sensing method, device and storage medium
US20250199621A1 (en) Gesture-based application invocation
WO2023235672A1 (en) Ar-based virtual keyboard
US20220350997A1 (en) Pointer-based content recognition using a head-mounted device
CN111240483A (zh) 操作控制方法、头戴式设备及介质
US12483776B2 (en) Photographing method and electronic device
WO2025001769A1 (zh) 一种数字助手虚拟形象的显示方法及相关装置
CN114282544A (zh) 显示设备和控件识别方法
US12436610B2 (en) Display method and electronic device
US20260127793A1 (en) Method for displaying virtual image of digital assistant and related apparatus
CN117472262A (zh) 交互方法和电子设备
CN119013650A (zh) 控制多个窗口的显示的电子设备及其操作方法和存储介质

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 24830416

Country of ref document: EP

Kind code of ref document: A1

WWE Wipo information: entry into national phase

Ref document number: 2024830416

Country of ref document: EP

ENP Entry into the national phase

Ref document number: 2024830416

Country of ref document: EP

Effective date: 20250728

ENP Entry into the national phase

Ref document number: 2024830416

Country of ref document: EP

Effective date: 20250728

WWP Wipo information: published in national office

Ref document number: 2024830416

Country of ref document: EP

NENP Non-entry into the national phase

Ref country code: DE