WO2022214101A1 - Video generation method, apparatus, electronic device, and storage medium - Google Patents
- Publication number
- WO2022214101A1 (PCT/CN2022/086090)
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- video
- interest
- original
- preset template
- segment
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Classifications
-
- G—PHYSICS
- G11—INFORMATION STORAGE
- G11B—INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
- G11B27/00—Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
- G11B27/02—Editing, e.g. varying the order of information signals recorded on, or reproduced from, record carriers
- G11B27/031—Electronic editing of digitised analogue information signals, e.g. audio or video signals
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N5/00—Details of television systems
- H04N5/222—Studio circuitry; Studio devices; Studio equipment
- H04N5/262—Studio circuits, e.g. for mixing, switching-over, change of character of image, other special effects ; Cameras specially adapted for the electronic generation of special effects
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/40—Scenes; Scene-specific elements in video content
- G06V20/41—Higher-level, semantic clustering, classification or understanding of video scenes, e.g. detection, labelling or Markovian modelling of sport events or news items
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/40—Scenes; Scene-specific elements in video content
- G06V20/46—Extracting features or characteristics from the video content, e.g. video fingerprints, representative shots or key frames
-
- G—PHYSICS
- G11—INFORMATION STORAGE
- G11B—INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
- G11B27/00—Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
- G11B27/10—Indexing; Addressing; Timing or synchronising; Measuring tape travel
- G11B27/34—Indicating arrangements
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/41—Structure of client; Structure of client peripherals
- H04N21/422—Input-only peripherals, i.e. input devices connected to specially adapted client devices, e.g. global positioning system [GPS]
- H04N21/4223—Cameras
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/44—Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs
- H04N21/44008—Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs involving operations for analysing video streams, e.g. detecting features or characteristics in the video stream
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/47—End-user applications
- H04N21/472—End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content
- H04N21/47205—End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content for manipulating displayed content, e.g. interacting with MPEG-4 objects, editing locally
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/80—Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
- H04N21/85—Assembly of content; Generation of multimedia applications
- H04N21/854—Content authoring
- H04N21/8549—Creating video summaries, e.g. movie trailer
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N23/00—Cameras or camera modules comprising electronic image sensors; Control thereof
- H04N23/60—Control of cameras or camera modules
- H04N23/64—Computer-aided capture of images, e.g. transfer from script file into camera, check of taken image quality, advice or proposal for image composition or decision on when to take image
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N5/00—Details of television systems
- H04N5/222—Studio circuitry; Studio devices; Studio equipment
- H04N5/262—Studio circuits, e.g. for mixing, switching-over, change of character of image, other special effects ; Cameras specially adapted for the electronic generation of special effects
- H04N5/2624—Studio circuits, e.g. for mixing, switching-over, change of character of image, other special effects ; Cameras specially adapted for the electronic generation of special effects for obtaining an image which is composed of whole input images, e.g. splitscreen
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N5/00—Details of television systems
- H04N5/222—Studio circuitry; Studio devices; Studio equipment
- H04N5/262—Studio circuits, e.g. for mixing, switching-over, change of character of image, other special effects ; Cameras specially adapted for the electronic generation of special effects
- H04N5/265—Mixing
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N23/00—Cameras or camera modules comprising electronic image sensors; Control thereof
- H04N23/60—Control of cameras or camera modules
- H04N23/62—Control of parameters via user interfaces
Definitions
- the embodiments of the present disclosure relate to the field of computer technologies, for example, to a video generation method, apparatus, electronic device, and storage medium.
- Embodiments of the present disclosure provide a video generation method, device, electronic device, and storage medium, which realize automatic editing and synthesis of videos, and improve the processing effect of videos.
- an embodiment of the present disclosure provides a video generation method, the method comprising:
- an embodiment of the present disclosure further provides a video generation device, the device comprising:
- a shooting module configured to receive a trigger operation acting on the video shooting page, and to shoot the original video in response to the trigger operation;
- a determining module configured to determine the video segment of interest in the original video
- the processing module is configured to perform video synthesis processing based on the video segment of interest and the original video to obtain a target video.
- an embodiment of the present disclosure further provides a device, the device comprising:
- storage means arranged to store at least one program
- at least one processor; when the at least one program is executed by the at least one processor, the at least one processor implements the video generation method according to any one of the embodiments of the present disclosure.
- an embodiment of the present disclosure further provides a storage medium containing computer-executable instructions which, when executed by a computer processor, perform the video generation method according to any one of the embodiments of the present disclosure.
- FIG. 1 is a schematic flowchart of a video generation method according to Embodiment 1 of the present disclosure
- FIG. 2 is a schematic interface diagram of video shooting when a user performs a task setting according to Embodiment 1 of the present disclosure
- FIG. 3 is a schematic flowchart of a method for generating a video according to Embodiment 2 of the present disclosure
- FIG. 4 is a schematic flowchart of a video generation method provided in Embodiment 3 of the present disclosure.
- FIG. 5 is a schematic diagram of a title animation image interface provided by Embodiment 3 of the present disclosure.
- FIGS. 6a-6e are schematic diagrams of in-film animation image interfaces according to Embodiment 3 of the present disclosure.
- FIG. 7 is a schematic diagram of an end animation image interface provided by Embodiment 3 of the present disclosure.
- FIG. 8 is a schematic structural diagram of a video generating apparatus according to Embodiment 4 of the present disclosure.
- FIG. 9 is a schematic structural diagram of an electronic device according to Embodiment 5 of the present disclosure.
- the term “including” and variations thereof are open-ended inclusions, i.e., “including but not limited to”.
- the term “based on” means “based at least in part on”.
- the term “one embodiment” means “at least one embodiment”; the term “another embodiment” means “at least one additional embodiment”; the term “some embodiments” means “at least some embodiments”. Relevant definitions of other terms will be given in the description below.
- FIG. 1 is a schematic flowchart of a video generation method provided in Embodiment 1 of the present disclosure. The method can be applied to automatically edit and synthesize an original video shot by a user, so as to obtain a target video that carries richer information, is more complete, and is more engaging.
- the entire video generation process is completed automatically, without manual operation by the user, which improves video processing effect and efficiency, helps improve the user experience, and enhances the user stickiness of the application product.
- the video generation method may be performed by a video generation apparatus, which may be implemented in the form of software and/or hardware.
- the video generation method provided by this embodiment includes the following steps:
- Step 110 Receive a trigger operation acting on the video shooting page, and shoot an original video in response to the trigger operation.
- for example, a trigger operation acting on a target shooting control on the video shooting page may be received, and the original video is shot in response to that trigger operation. When the user clicks the target shooting control, the camera is started and captures video within its shooting range; when the user clicks the target shooting control again, shooting ends.
- the original video may be a video obtained by shooting a user, or a video obtained by shooting a scene or an object.
- the original video includes a video obtained by filming a user performing a set task.
- the set task may be any form of task, for example, the user or a friend of the user imitating a funny video, the user singing a song, or the user performing a popular dance.
- the set task may also include a tongue twister challenge game, and/or a quiz game, and/or a video imitation game, etc.
- This embodiment does not limit the content and execution method of the set task.
- for example, in a tongue twister challenge game, the user is required to repeat a given tongue twister clearly and fluently within a limited time, and video is recorded while the user repeats it. While recording the user's real-time performance, it is also possible to analyze, based on the original video, whether the user's speech is clear and accurate, whether the user took less time than other users, and so on, which makes the game more fun and entertaining.
- the set task may include at least one subtask.
- prompt information of the set task may be displayed on the video shooting page to guide the user to perform the set task.
- if the set task includes multiple subtasks, prompt information of the multiple subtasks may be displayed sequentially in the non-shooting area of the current interface according to the difficulty level of the subtasks.
- for example, the set task is a tongue twister challenge that includes two subtasks, a first tongue twister and a second tongue twister, where the second is more difficult than the first. The prompt information of the first tongue twister is displayed on the video shooting page before that of the second, thereby increasing user stickiness.
- when the prompt information of the first tongue twister is displayed on the video shooting page, the user is guided to perform the first tongue twister task, and the user performing the first tongue twister task is filmed at the same time; afterwards, the prompt information of the second tongue twister is displayed on the video shooting page.
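The difficulty-ordered display of subtask prompts can be illustrated with a minimal sketch. The `Subtask` class, its `difficulty` field, and the integer difficulty scale are assumptions made for illustration; the disclosure does not specify how difficulty is represented.

```python
from dataclasses import dataclass


@dataclass
class Subtask:
    prompt: str      # prompt text shown in the non-shooting area
    difficulty: int  # hypothetical scale: a larger value means a harder subtask


def prompt_order(subtasks):
    """Return the prompts sorted from easiest to hardest (stable for ties)."""
    return [s.prompt for s in sorted(subtasks, key=lambda s: s.difficulty)]


tasks = [
    Subtask("second tongue twister", difficulty=2),
    Subtask("first tongue twister", difficulty=1),
]
print(prompt_order(tasks))
```

Sorting once up front keeps the display loop simple: the page shows the next prompt in the list each time the current subtask ends.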
- the prompt information may include the name, introduction, and/or countdown indicator of the set task.
- the content details of the tongue twister, "Red Phoenix Pink Phoenix, Red Pink Phoenix Flower Phoenix" 210, and the countdown indicator "2s" 220 are displayed in the non-shooting area of the current interface, where reference number 230 denotes the shooting area.
- the content details and countdown indicator of the next, more difficult tongue twister are then displayed automatically, for example, "Niuniu pulls Niuniu, Niuniu pulls Niuniu".
- portrait (vertical-screen) mode is used to film the user performing the set task, so as to obtain a portrait original video.
- Step 120 Determine the video segment of interest in the original video.
- the video segment of interest in the original video may be a video clip that includes a preset expression.
- the preset expression may be a laughing or crying expression.
- correspondingly, the video segment of interest may be a video clip that includes a laughing or crying expression.
- the preset expression may also be an exaggerated laughing expression or a frustrated, crying expression; correspondingly, the video segment of interest may also be a video clip with an exaggerated expression (laughing out loud, crying in frustration).
- an expression recognition model can be used to perform expression recognition on each frame of the original video and to mark the image frames that include the set expression, so that the video segments of interest are obtained based on the marked image frames. For example, a video segment composed of the 20 image frames before and after a marked image frame is extracted as a video segment of interest.
- the set expression is, for example, a laughing expression or a crying expression.
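As a rough sketch of turning marked frames into clips, the function below takes the indices of marked frames and returns inclusive frame ranges. It assumes that "20 image frames before and after" means 20 frames on each side of the marked frame, and it merges overlapping windows into one clip; both choices are assumptions, not details stated in the disclosure.

```python
def clip_ranges(marked, total_frames, margin=20):
    """Return merged, inclusive (start, end) frame ranges around marked frames."""
    ranges = []
    for idx in sorted(marked):
        # Clamp the window to the bounds of the original video.
        start = max(0, idx - margin)
        end = min(total_frames - 1, idx + margin)
        if ranges and start <= ranges[-1][1] + 1:
            # Overlapping or adjacent windows are merged into a single clip.
            ranges[-1] = (ranges[-1][0], max(ranges[-1][1], end))
        else:
            ranges.append((start, end))
    return ranges


print(clip_ranges([100, 110, 300], total_frames=310))
```

Merging avoids emitting two nearly identical clips when the set expression persists over many consecutive frames.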
- Step 130 Perform video synthesis processing based on the video segment of interest and the original video to obtain a target video.
- the video segment of interest may be used to create highlight footage that serves as the title or end credits, and the original video may be used to generate in-film video with animation special effects.
- the original video may be played in the middle of a set template, with animation effects added at other positions of the template. For example, if the user recites the current tongue twister clearly and fluently, the animation special effect "You are amazing" can be displayed; if the recitation is not very clear or fluent, the animation special effect "Keep it up" can be displayed; an animated "microphone" and the like can also be displayed. Finally, the title, in-film, and end-credits segments obtained through this processing are spliced and synthesized to obtain the target video.
- the target video can be generated as a landscape video.
- in the technical solution of this embodiment, a trigger operation acting on the video shooting page is received, and an original video is shot in response to the trigger operation; the video segment of interest in the original video is determined; and video synthesis processing is performed based on the video segment of interest and the original video to obtain the target video. These technical means realize automatic editing and synthesis of the video and improve the video processing effect.
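The overall flow of Embodiment 1 (determine the segment of interest, then synthesize) can be sketched as a small pipeline. The function name and the two callables are hypothetical, and a video is modeled as a plain list of frames purely for illustration.

```python
def generate_target_video(original, find_clip, synthesize):
    """Run steps 120 and 130 on an already-shot original video."""
    clip = find_clip(original)         # step 120: determine the segment of interest
    return synthesize(clip, original)  # step 130: video synthesis processing


# Toy stand-ins: frames are integers, the "clip of interest" is frames 3-4,
# and synthesis places the clip before and after the original video.
frames = list(range(10))
target = generate_target_video(
    frames,
    find_clip=lambda video: video[3:5],
    synthesize=lambda clip, video: clip + video + clip,
)
print(target)
```

Passing the two steps in as callables keeps the sketch independent of any particular recognition model or editing library.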
- FIG. 3 is a schematic flowchart of a video generation method according to Embodiment 2 of the present disclosure.
- this embodiment refines the foregoing step 120 of "determining the video segment of interest in the original video", and provides an optional implementation manner for determining the video segment of interest.
- the content that is the same as or similar to the above-mentioned embodiment will not be repeated in this embodiment, and reference may be made to the explanation of the above-mentioned embodiment.
- the method includes:
- Step 310 Receive a trigger operation acting on the video shooting page, and shoot an original video in response to the trigger operation.
- Step 320 Determine the video segment of interest in the original video based on image recognition.
- facial expression recognition is performed on the image frames of the original video based on a facial expression recognition model, and the timestamp of each first image frame that includes the set facial expression and the expression score corresponding to at least one of the first image frames are recorded;
- each first image frame whose expression score reaches the set threshold is determined as a second image frame, and the video segment of interest is acquired according to the timestamp of the second image frame.
- the expression recognition model may be an algorithm built on a neural network that uses image recognition to recognize expressions in images.
- each image frame of the original video is input to the expression recognition model in sequence, and the model outputs a recognition result indicating whether the set expression is included, together with the corresponding expression score. For example, a recognition result of "1" means that the current image frame includes the set expression.
- the expression score is a quantity that characterizes the degree of expression change, e.g., a smiling expression scores lower than a laughing expression.
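The selection of second image frames from the model output can be sketched as a two-stage filter. The `FrameResult` record and the score range are assumptions for illustration; the actual output format of the expression recognition model is not specified in the text.

```python
from typing import NamedTuple


class FrameResult(NamedTuple):
    timestamp: float      # seconds from the start of the original video
    has_expression: bool  # recognition result (the "1" case in the text)
    score: float          # expression score; higher means a stronger expression


def select_second_frames(results, threshold):
    """Return timestamps of frames that include the set expression and
    whose expression score reaches the threshold."""
    first_frames = [r for r in results if r.has_expression]
    return [r.timestamp for r in first_frames if r.score >= threshold]


results = [
    FrameResult(1.0, False, 0.0),
    FrameResult(2.0, True, 0.4),   # e.g. a smile: included but below threshold
    FrameResult(3.0, True, 0.9),   # e.g. a laugh: reaches the threshold
]
print(select_second_frames(results, threshold=0.8))
```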
- the acquiring the video segment of interest according to the timestamp of the second image frame includes:
- a video of a set duration is extracted within the duration time interval of the task corresponding to the second image frame as the video segment of interest.
- for example, the subtask is the tongue twister "Red Phoenix Pink Phoenix, Red Pink Phoenix Flower Phoenix", and the default time for the user to repeat the tongue twister is 5s. Assuming the user starts to repeat the tongue twister at the 1st second, the duration time interval of the subtask is from the 1st second to the 5th second.
- if the timestamp of the second image frame is the 3rd second and the duration of the video segment of interest is 1s, the 3rd second is taken as the reference point, and the image frames within 0.5s before and after the reference point constitute the video segment of interest; that is, the image frames whose timestamps fall within 2.5s-3.5s are determined as the image frames of the video segment of interest.
- if the timestamp of the second image frame is 4.7s, extending 0.5s past it (to 5.2s) would exceed the duration time interval of the subtask (1s to 5s); in this case, the image frames whose timestamps fall within 4s-5s are taken as the image frames of the video segment of interest.
- that is, with the timestamp of the second image frame as a reference time point, a set number of image frames close to the second image frame within the duration time interval of the task corresponding to the second image frame are determined as the video segment of interest.
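The worked example above amounts to centring a fixed-length window on the reference timestamp and shifting it back inside the subtask interval when it would overrun. The sketch below reproduces both cases from the text; the function name is hypothetical, and the symmetric handling of an overrun at the start of the interval is an assumption, since the text only describes an overrun at the end.

```python
def clip_window(ref, task_start, task_end, clip_len):
    """Return (start, end) in seconds of a clip of length clip_len, centred on
    ref where possible and shifted to stay inside [task_start, task_end]."""
    # A clip cannot be longer than the subtask interval itself.
    clip_len = min(clip_len, task_end - task_start)
    start = ref - clip_len / 2
    end = ref + clip_len / 2
    if end > task_end:
        # Overrun at the end: take the last clip_len seconds of the interval.
        start, end = task_end - clip_len, task_end
    elif start < task_start:
        # Assumed symmetric case: take the first clip_len seconds.
        start, end = task_start, task_start + clip_len
    return start, end


print(clip_window(3.0, 1.0, 5.0, 1.0))  # reference point at the 3rd second
print(clip_window(4.7, 1.0, 5.0, 1.0))  # reference point at 4.7s
```

Shifting rather than truncating keeps every clip the same length, which matches the 4s-5s result given for the 4.7s case.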
- if the set task includes multiple subtasks, a video segment of interest can also be determined for each subtask, and these segments can then be combined into the final video segment of interest for the set task. Each subtask's video segment of interest is determined in a manner similar to the above, which is not repeated here.
- Step 330 Perform video synthesis processing based on the video segment of interest and the original video to obtain a target video.
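- in the technical solution of this embodiment, expression recognition is performed on the image frames of the original video based on the expression recognition model, and the timestamp of each first image frame that includes the set expression and the expression score corresponding to at least one of the first image frames are recorded; each first image frame whose expression score reaches the set threshold is determined as a second image frame; and, with the timestamp of the second image frame as a reference time point, a set number of image frames close to the second image frame within the duration time interval of the corresponding task are determined as the video segment of interest. This realizes accurate determination of the video segment of interest and provides a data basis for obtaining the target video.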
- FIG. 4 is a schematic flowchart of a video generation method according to Embodiment 3 of the present disclosure.
- this embodiment refines the above step 130 of “performing video synthesis processing based on the video clip of interest and the original video to obtain the target video”, and provides an optional implementation of video clipping and synthesis processing.
- content that is the same as or similar to the above-mentioned embodiments will not be repeated in this embodiment, and reference may be made to the explanation of the above-mentioned embodiments.
- the method includes the following steps:
- Step 410 Receive a trigger operation acting on the video shooting page, and shoot an original video in response to the trigger operation.
- Step 420 Determine the video segment of interest in the original video.
- Step 430 Generate title video data and/or end-credits video data based on the video segment of interest, and generate in-film video data based on the original video.
- the generating the title video data based on the video segment of interest includes:
- the title video data is generated based on the video segment of interest and the first preset template.
- the video segment of interest is added to the first set position of the first preset template, and the identification information of the set task (e.g., serial number, name, introduction) and/or the user's identification (e.g., nickname) is displayed at the second set position of the first preset template, to obtain the title video data.
- as shown in FIG. 5, the video segment of interest is a short video of about 1s, and the introduction information of the set task (e.g., the tongue twister challenge) and/or the user's nickname (e.g., "challenger: forest") is output at the second set position 520 of the first preset template.
- generating the in-film video data based on the original video includes:
- the in-film video data is generated based on the original video and the second preset template.
- the original video is added to the third set position of the second preset template, so that the original video is played at the third set position;
- a matching animation is displayed at the fourth set position of the second preset template according to how the user performs the set task, and/or associated information of the set task is displayed at the fifth set position of the second preset template according to the content of the set task, thereby generating the in-film video data.
- if the set task includes multiple subtasks, the original video includes multiple partial videos, each corresponding to one subtask; the partial video in which the user performs a single set task (i.e., a subtask) may be determined from the original video.
- for example, the original video is a video of the user performing a tongue twister challenge.
- the user performs four tongue twister challenges. Ordered by difficulty, the user first challenges the simpler tongue twisters and then the more difficult ones.
- the user first challenges the tongue twister "Red Phoenix Pink Phoenix, Red Pink Phoenix Flower Phoenix". When the user finishes repeating the current tongue twister, the next, more difficult tongue twister, such as "Niuniu pulls Niuniu, Niuniu pulls Niuniu", is automatically displayed; then comes a third tongue twister challenge, such as "Li Xiaoli's family raised red carp, green carp and a donkey"; and finally a fourth, such as "Blue coach is a female coach, Lu coach is a male coach".
- the video of the user performing each tongue twister challenge is determined as the partial video for that subtask.
- for example, the video of the user repeating the tongue twister "Red Phoenix Pink Phoenix, Red Pink Phoenix Flower Phoenix" is one partial video, and the video of the user repeating the tongue twister "Niuniu pulls Niuniu, Niuniu pulls Niuniu" is another partial video.
- the plurality of partial videos are respectively added to the third set position of the corresponding second preset template, so that each partial video is played at the third set position of its second preset template, where each partial video corresponds to an independent second preset template; a matching animation is displayed at the fourth set position of the second preset template according to how the user performs the set task; and the associated information of the corresponding subtask is displayed at the fifth set position of each second preset template, to obtain the in-film video data.
- for example, the plurality of partial videos are respectively added to the third set position 610 of the corresponding second preset template (e.g., the middle position of the second preset template), so that each partial video is played at the third set position 610 of the second preset template.
- the associated information of the corresponding subtask is displayed, and the associated information includes at least one of the following: content detail information of the subtask (for example, the "red" in Fig.
- the content information, the microphone, the countdown reminder logo and the game category can all be added in the form of information stickers in the setting position 620 of the second preset template, for example, the positions on the left and right sides of the second preset template.
- special effects can also be added according to the content of the information. For example, when the content is "Grandma Liu likes to drink durian milk", stickers with rendering effects can be added, such as a "milk" picture sticker.
- Each tongue twister has a system-set completion time, and a countdown stopwatch can be displayed accordingly.
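The per-subtask filling of the second preset template can be sketched as one record per partial video. The `InFilmSegment` class and its field names are hypothetical; they only mirror the third, fourth, and fifth set positions described above, and file names stand in for actual video data.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class InFilmSegment:
    video: str                # third set position: the partial video
    animation: Optional[str]  # fourth set position: matching animation, if any
    info: str                 # fifth set position: subtask information sticker


def build_in_film(partial_videos, infos, animations=None):
    """Pair each partial video with its own independent template instance."""
    animations = animations or [None] * len(partial_videos)
    if not (len(partial_videos) == len(infos) == len(animations)):
        raise ValueError("need one info and one animation slot per partial video")
    return [
        InFilmSegment(video, animation, info)
        for video, animation, info in zip(partial_videos, animations, infos)
    ]


segments = build_in_film(
    ["part1.mp4", "part2.mp4"],
    ["first tongue twister", "second tongue twister"],
)
print(segments)
```

Keeping one record per partial video reflects the statement that each partial video corresponds to an independent second preset template.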
- a matching animation is displayed at the fourth setting position of the second preset template according to the situation that the user performs the setting task, including at least one of the following:
- an animation matching the preset word is displayed at the fourth set position.
- for example, the animation effect of "black face" is displayed at the fourth set position of the second preset template. The fourth set position may be the position where the user's face image is displayed, that is, the face turns black, to enhance the animation effect and make the video more fun.
- an animation matching the set action is displayed at the fourth set position.
- the special effect of the big head is displayed in the fourth setting position to realize the effect of magnifying the expression.
- the fourth set position may be a position where the face image of the user is displayed, that is, adding a special effect of a big head to the face and amplifying the expression of the user, so as to enhance the animation effect and improve the interest.
- an animation matching the accuracy is displayed at the fourth set position.
- for example, the accuracy and completeness of the user's recitation are determined through speech recognition, and an evaluation based on that accuracy and completeness, such as "perfect", "excellent", "average", or "Come on", is given in the form of animation.
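Choosing an evaluation animation from the recitation accuracy can be sketched as a threshold lookup. The four labels come from the text; the accuracy scale of 0 to 1 and the threshold values are assumptions, since the disclosure does not state them.

```python
def rating_for(accuracy):
    """Map a recitation accuracy in [0, 1] to an evaluation label."""
    if not 0.0 <= accuracy <= 1.0:
        raise ValueError("accuracy must be between 0 and 1")
    # Threshold values are hypothetical; the text names only the labels.
    if accuracy >= 0.95:
        return "perfect"
    if accuracy >= 0.8:
        return "excellent"
    if accuracy >= 0.6:
        return "average"
    return "Come on"


for value in (1.0, 0.85, 0.6, 0.1):
    print(value, rating_for(value))
```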
- generating the end-credits video data based on the video segment of interest includes:
- the end-credits video data is generated based on the video segment of interest and the third preset template.
- generating the end-credits video data based on the video segment of interest and the third preset template includes:
- the matching content is displayed in the seventh setting position of the third preset template according to the situation that the user performs the setting task.
- the matching content includes at least one of the following: honorary title information and praise information that match how the user performed the set task.
- FIG. 7 is a schematic diagram of an end-credits video data image: a face image is displayed at the sixth set position of the third preset template, and the honorary title information "Little Achievement" and praise information such as "Like", "Come on", and "Aoli give!!!" are displayed at the seventh set position.
- Step 440 Generate a target video by splicing at least one of the title video data and the end-credits video data with the in-film video data.
- the title video data may be generated based on the video segment of interest and then spliced and synthesized with the original video to obtain the target video; the end-credits video data may instead be generated based on the video segment of interest and then spliced and synthesized with the original video to obtain the target video; or the title video data and the end-credits video data may both be generated based on the video segment of interest, and the title video data, the original video, and the end-credits video data are then spliced and synthesized to obtain the target video.
- alternatively, the title video data and the end-credits video data can each be generated based on the video segment of interest, the in-film video data can be generated based on the original video, and the title video data, the in-film video data, and the end-credits video data are then spliced and synthesized to obtain the target video.
- in the technical solution of this embodiment, the title video data is generated based on the video segment of interest and the first preset template, for example by adding funny expressions such as laughing or exaggerated expressions to the first set position of the first preset template and displaying relevant game prop introduction information, the challenging user's nickname, and other information at the second set position; in-film video data is generated based on the original video and the second preset template; and end-credits video data is generated based on the video segment of interest and the third preset template.
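The splicing step can be sketched as ordering whichever segments are available, with the in-film segment always in the middle. The function name is hypothetical, and segments are represented by placeholder strings; a real implementation would hand the ordered list to a video editing library.

```python
def splice(in_film, title=None, end_credits=None):
    """Return the ordered list of segments that make up the target video.

    At least one of title and end_credits must be provided, mirroring the
    "at least one of" wording of the splicing step.
    """
    if title is None and end_credits is None:
        raise ValueError("at least one of title or end_credits is required")
    segments = [title, in_film, end_credits]
    return [segment for segment in segments if segment is not None]


print(splice("in_film", title="title"))
print(splice("in_film", end_credits="end_credits"))
print(splice("in_film", title="title", end_credits="end_credits"))
```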
- FIG. 8 shows a video generation apparatus provided by Embodiment 4 of the present disclosure.
- the apparatus includes a shooting module 810, a determination module 820, and a processing module 830.
- the shooting module 810 is configured to receive a trigger operation acting on the video shooting page, and to shoot the original video in response to the trigger operation;
- the determination module 820 is configured to determine the video segment of interest in the original video;
- the processing module 830 is configured to perform video synthesis processing based on the video segment of interest and the original video to obtain a target video.
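How the three modules hand data to one another can be sketched as below. The class name, the callable signatures, and the use of strings as stand-in frames are all assumptions made for illustration; the disclosure only states what each module is configured to do.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple

Frame = str             # stand-in for decoded image data (assumption)
Clip = Tuple[int, int]  # (start_index, end_index) into the original video


@dataclass
class VideoGenerationApparatus:
    """Hypothetical wiring of modules 810, 820 and 830."""
    shoot: Callable[[], List[Frame]]                     # shooting module 810
    determine: Callable[[List[Frame]], Clip]             # determination module 820
    process: Callable[[List[Frame], Clip], List[Frame]]  # processing module 830

    def on_trigger(self) -> List[Frame]:
        """Run the pipeline once in response to a trigger operation."""
        original = self.shoot()              # shoot the original video
        clip = self.determine(original)      # locate the segment of interest
        return self.process(original, clip)  # synthesize the target video
```

Keeping each module behind a plain callable means a stub can replace the camera or the recognition model when the pipeline is exercised in isolation.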
- the original video includes a video obtained by shooting a picture of a user performing a set task; correspondingly, the device further includes:
- the display module is configured to display prompt information on the video shooting page in response to the trigger operation, so as to guide the user to perform the setting task.
- the display module is set to:
- prompt information of each setting task is displayed on the video shooting page in sequence.
- the determining module 820 is set to:
- Video segments of interest in the original video are determined based on image recognition.
- the determining module 820 includes:
- a recognition recording unit configured to perform expression recognition on the image frames of the original video based on an expression recognition model, and to record the timestamp of each first image frame that includes the set expression together with the expression score corresponding to that first image frame;
- an obtaining unit configured to determine a first image frame whose expression score reaches the set threshold as the second image frame, and to obtain the video segment of interest according to the timestamp of the second image frame.
- the acquisition unit is set to:
- a video of a set duration is intercepted within the duration time interval of the currently set task as the video segment of interest.
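The threshold test and the interception of a fixed-length segment can be sketched as follows. Several details are assumptions rather than statements from the disclosure: when more than one frame reaches the threshold the sketch keeps the highest-scoring one, and the window is centered on the reference timestamp and then shifted so that it stays inside the task's time interval. Both function names are hypothetical.

```python
from typing import List, Optional, Tuple


def pick_second_frame(scored_frames: List[Tuple[float, float]],
                      threshold: float) -> Optional[float]:
    """Pick a reference timestamp from (timestamp_s, expression_score) pairs.

    Returns the timestamp of the highest-scoring frame whose score reaches
    `threshold`, or None when no frame qualifies.
    """
    qualifying = [(score, ts) for ts, score in scored_frames if score >= threshold]
    if not qualifying:
        return None
    _, best_ts = max(qualifying)
    return best_ts


def clip_window(ref_ts: float, task_start: float, task_end: float,
                clip_len: float) -> Tuple[float, float]:
    """Return (start, end) of a clip of `clip_len` seconds around `ref_ts`.

    The window never leaves [task_start, task_end]; if the task is shorter
    than `clip_len`, the whole task interval is returned.
    """
    task_len = task_end - task_start
    if task_len <= 0:
        raise ValueError("task interval must have positive length")
    clip_len = min(clip_len, task_len)
    start = ref_ts - clip_len / 2
    # Shift the window so that both ends stay inside the task interval.
    start = max(task_start, min(start, task_end - clip_len))
    return (start, start + clip_len)
```

Clamping rather than truncating keeps every clip at the set duration whenever the task is long enough, which is one reasonable reading of intercepting "a video of a set duration".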
- the processing module 830 includes:
- a first generating unit configured to generate title video data and/or ending video data based on the video segment of interest;
- a second generating unit configured to generate in-film video data based on the original video;
- a splicing unit configured to splice at least one of the title video data and the ending video data with the in-film video data to generate a target video.
- the first generating unit includes:
- a first generating subunit configured to generate title video data based on the video segment of interest and the first preset template;
- a second generating subunit configured to generate in-film video data based on the original video and the second preset template;
- a third generating subunit configured to generate ending video data based on the video segment of interest and the third preset template.
- the first generation subunit is set to:
- the introduction information of the set task and/or the user's identification is displayed at the second set position of the first preset template, so as to obtain the title video data.
- the second generation subunit is set to:
- a matching animation is displayed at the fourth set position of the second preset template according to how the user performs the set task, and/or the associated information of the set task is displayed at the fifth set position of the second preset template according to the content of the set task, thereby generating the in-film video data.
- the associated information includes at least one of the following: content detail information of a single set task, a microphone, a countdown reminder identifier, and a game category to which the set task belongs.
- a matching animation is displayed at the fourth setting position of the second preset template according to the situation that the user performs the setting task, including at least one of the following:
- an animation matching the accuracy is displayed in the fourth setting position of the second preset template.
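Selecting an animation from the accuracy can be sketched as a banded lookup. The band boundaries and animation names below are purely hypothetical; the disclosure states only that the animation matches the accuracy.

```python
from bisect import bisect_right

# Hypothetical accuracy bands and animation names; the source names neither.
_BAND_STARTS = [0.0, 0.6, 0.9]
_ANIMATIONS = ["keep_trying", "thumbs_up", "fireworks"]


def animation_for_accuracy(accuracy: float) -> str:
    """Return the animation whose band contains `accuracy` (0.0 to 1.0)."""
    if not 0.0 <= accuracy <= 1.0:
        raise ValueError("accuracy must be within [0, 1]")
    # bisect_right gives the number of band starts <= accuracy.
    return _ANIMATIONS[bisect_right(_BAND_STARTS, accuracy) - 1]
```

A table-driven lookup keeps the bands in data, so they can be tuned per task without touching the selection logic.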
- the third generation subunit is set to:
- the matching content is displayed in the seventh setting position of the third preset template according to the situation that the user performs the setting task.
- the matching content includes at least one of the following: title information and praise information that match the situation in which the user performs the set task.
- the set task includes a tongue twister challenge game and/or a quiz game.
- the original video includes a portrait video
- the target video includes a landscape video
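Placing a portrait original inside a landscape target comes down to a scale-and-center calculation, sketched below. The function name is hypothetical, and centering is an assumption: in the disclosure the playback position is whatever set position the preset template defines.

```python
from typing import Tuple


def fit_portrait_in_landscape(src_w: int, src_h: int,
                              canvas_w: int, canvas_h: int) -> Tuple[int, int, int, int]:
    """Return (x, y, width, height) of the scaled source on the canvas.

    The source keeps its aspect ratio, is scaled to the largest size that
    fits inside the canvas, and is centered (centering is an assumption).
    """
    if min(src_w, src_h, canvas_w, canvas_h) <= 0:
        raise ValueError("all dimensions must be positive")
    scale = min(canvas_w / src_w, canvas_h / src_h)
    out_w = round(src_w * scale)
    out_h = round(src_h * scale)
    x = (canvas_w - out_w) // 2
    y = (canvas_h - out_h) // 2
    return (x, y, out_w, out_h)
```

For a 720x1280 portrait source on a 1280x720 landscape canvas the scale factor is 0.5625, so the video occupies a 405x720 region and the remaining width is available for template elements such as animations and task information.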
- an original video is captured in response to the trigger operation; an interesting video segment in the original video is determined;
- the original video is processed by video synthesis to obtain the target video, which realizes the automatic editing and synthesis of the video, and improves the processing effect of the video.
- the video generation apparatus provided by the embodiment of the present disclosure can execute the video generation method provided by any embodiment of the present disclosure, and has functional modules corresponding to the execution method.
- Referring to FIG. 9, it shows a schematic structural diagram of an electronic device 400 (e.g., the terminal device or server in FIG. 9) suitable for implementing an embodiment of the present disclosure.
- Terminal devices in the embodiments of the present disclosure may include, but are not limited to, mobile terminals such as mobile phones, notebook computers, digital broadcast receivers, personal digital assistants (PDAs), tablet computers (PADs), portable multimedia players (PMPs), and in-vehicle terminals (e.g., in-vehicle navigation terminals), as well as fixed terminals such as digital televisions (TVs) and desktop computers.
- the electronic device 400 may include a processing device (such as a central processing unit, a graphics processor, etc.) 401, which may perform various appropriate actions and processes according to a program stored in a read-only memory (ROM) 402 or a program loaded from a storage device 406 into a random access memory (RAM) 403.
- the processing device 401, the ROM 402, and the RAM 403 are connected to each other through a bus 404.
- An Input/Output (I/O) interface 405 is also connected to the bus 404 .
- the following devices can be connected to the I/O interface 405: input devices 406 including, for example, a touch screen, touch pad, keyboard, mouse, camera, microphone, accelerometer, gyroscope, etc.; output devices 407 including, for example, a liquid crystal display (LCD), speaker, vibrator, etc.; storage devices 406 including, for example, magnetic tape, hard disk, etc.; and a communication device 409.
- Communication means 409 may allow electronic device 400 to communicate wirelessly or by wire with other devices to exchange data.
- Although FIG. 9 shows the electronic device 400 having various devices, it should be understood that not all of the illustrated devices are required to be implemented or provided. More or fewer devices may alternatively be implemented or provided.
- embodiments of the present disclosure include a computer program product comprising a computer program carried on a non-transitory computer readable medium, the computer program containing program code for performing the method illustrated in the flowchart.
- the computer program may be downloaded and installed from the network via the communication device 409, or from the storage device 406, or from the ROM 402.
- When the computer program is executed by the processing device 401, the above-mentioned functions defined in the methods of the embodiments of the present disclosure are executed.
- the terminal provided by the embodiments of the present disclosure and the video generation method provided by the above embodiments belong to the same inventive concept, and the technical details not described in detail in the embodiments of the present disclosure may refer to the above embodiments.
- Embodiments of the present disclosure provide a computer storage medium on which a computer program is stored, and when the program is executed by a processor, implements the video generation method provided by the foregoing embodiments.
- the computer-readable medium mentioned above in the present disclosure may be a computer-readable signal medium or a computer-readable storage medium, or any combination of the above two.
- the computer-readable storage medium can be, for example, but not limited to, an electrical, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus or device, or a combination of any of the above.
- Computer-readable storage media may include, but are not limited to, electrical connections having at least one wire, portable computer disks, hard disks, random access memory (RAM), read-only memory (ROM), erasable programmable read-only memory (EPROM or flash memory), optical fiber, portable compact disc read-only memory (CD-ROM), optical storage devices, magnetic storage devices, or any suitable combination of the above.
- a computer-readable storage medium may be any tangible medium that contains or stores a program that can be used by or in conjunction with an instruction execution system, apparatus, or device.
- a computer-readable signal medium may include a data signal propagated in baseband or as part of a carrier wave with computer-readable program code embodied thereon. Such propagated data signals may take a variety of forms, including but not limited to electromagnetic signals, optical signals, or any suitable combination of the foregoing.
- a computer-readable signal medium can also be any computer-readable medium other than a computer-readable storage medium that can transmit, propagate, or transport the program for use by or in connection with the instruction execution system, apparatus, or device .
- the program code embodied on the computer-readable medium may be transmitted by any suitable medium, including but not limited to: electric wire, optical fiber cable, radio frequency (RF), etc., or any suitable combination of the above.
- the client and server can communicate using any currently known or future developed network protocol, such as HTTP (HyperText Transfer Protocol), and can be interconnected with digital data communication in any form or medium (e.g., a communication network).
- Examples of communication networks include local area networks (LANs), wide area networks (WANs), internetworks (e.g., the Internet), and peer-to-peer networks (e.g., ad hoc peer-to-peer networks), as well as any currently known or future developed networks.
- the above-mentioned computer-readable medium may be included in the above-mentioned electronic device; or may exist alone without being assembled into the electronic device.
- the above-mentioned computer-readable medium carries at least one program, and when the above-mentioned at least one program is executed by the electronic device, causes the electronic device to:
- Computer program code for performing the operations of the present disclosure may be written in at least one programming language, including but not limited to object-oriented programming languages such as Java, Smalltalk, and C++, as well as conventional procedural programming languages such as the "C" language or similar programming languages.
- the program code may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer, or entirely on the remote computer or server.
- the remote computer may be connected to the user's computer through any kind of network, including a local area network (LAN) or a wide area network (WAN), or may be connected to an external computer (e.g., through the Internet using an Internet service provider).
- each block in the flowchart or block diagrams may represent a module, segment, or portion of code that contains at least one executable instruction for implementing the specified logical function.
- the functions noted in the blocks may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved.
- each block of the block diagrams and/or flowchart illustrations, and combinations of blocks in the block diagrams and/or flowchart illustrations, can be implemented in dedicated hardware-based systems that perform the specified functions or operations, or can be implemented in a combination of dedicated hardware and computer instructions.
- the units involved in the embodiments of the present disclosure may be implemented in a software manner, and may also be implemented in a hardware manner. Wherein, the name of the unit does not constitute a limitation of the unit itself under certain circumstances, for example, the editable content display unit may also be described as an "editing unit".
- exemplary types of hardware logic components include: Field Programmable Gate Arrays (FPGAs), Application Specific Integrated Circuits (ASICs), Application Specific Standard Products (ASSPs), Systems on Chip (SOCs), Complex Programmable Logic Devices (CPLDs), and so on.
- a machine-readable medium may be a tangible medium that may contain or store a program for use by or in connection with the instruction execution system, apparatus or device.
- the machine-readable medium may be a machine-readable signal medium or a machine-readable storage medium.
- Machine-readable media may include, but are not limited to, electronic, magnetic, optical, electromagnetic, infrared, or semiconductor systems, devices, or devices, or any suitable combination of the foregoing.
- machine-readable storage media would include at least one wire-based electrical connection, portable computer disks, hard disks, random access memory (RAM), read-only memory (ROM), erasable programmable read-only memory (EPROM or flash memory), optical fiber, compact disc read-only memory (CD-ROM), optical storage devices, magnetic storage devices, or any suitable combination of the foregoing.
- Example 1 provides a video generation method, the method includes:
- Example 2 provides a method for generating a video.
- the original video includes a video obtained by shooting a picture of a user performing a set task;
- the method also includes:
- prompt information is displayed on the video shooting page to guide the user to perform the setting task.
- Example 3 provides a video generation method.
- the determining the video segment of interest in the original video includes:
- Video segments of interest in the original video are determined based on image recognition.
- Example 4 provides a video generation method.
- the determining an interesting video segment in the original video based on image recognition includes:
- the video segment of interest is acquired according to the timestamp of the second image frame.
- Example 5 provides a video generation method.
- the acquiring the video segment of interest according to the timestamp of the second image frame includes:
- a video of a set duration is intercepted within the duration time interval of the task corresponding to the second image frame as the video segment of interest.
- Example 6 provides a video generation method.
- performing video synthesis processing based on the video segment of interest and the original video to obtain a target video includes:
- a target video is generated by splicing at least one of the title video data and the ending video data with the in-film video data.
- Example 7 provides a video generation method, optionally,
- the generating of the title video data based on the video segment of interest includes:
- the generating of in-film video data based on the original video includes:
- the generating of ending video data based on the video segment of interest includes:
- ending video data is generated based on the video segment of interest and the third preset template.
- Example 8 provides a video generation method, optionally,
- the generating of title video data based on the video segment of interest and the first preset template includes:
- the title video data is thereby generated.
- Example 9 provides a video generation method, optionally, the generating of in-film video data based on the original video and a second preset template includes:
- a matching animation is displayed at the fourth set position of the second preset template according to how the user performs the set task, and/or the associated information of the set task is displayed at the fifth set position of the second preset template according to the content of the set task;
- Example 10 provides a video generation method.
- the displaying of a matching animation at the fourth set position includes at least one of the following:
- an animation matching the accuracy is displayed at the fourth set position.
- Example 11 provides a video generation method.
- the generation of end-credit video data based on the video segment of interest and a third preset template includes:
- the end-credit video data is thereby generated.
- Example 12 provides a video generation method.
- the original video includes a portrait video
- the target video includes a landscape video.
- Example 13 provides a video generation apparatus, the apparatus comprising:
- a shooting module configured to receive a trigger operation acting on the video shooting page, and to shoot the original video in response to the trigger operation;
- a determining module configured to determine the video segment of interest in the original video
- the processing module is configured to perform video synthesis processing based on the video segment of interest and the original video to obtain a target video.
- Example 14 provides an electronic device, the electronic device comprising:
- storage means arranged to store at least one program
- When the at least one program is executed by the at least one processor, the at least one processor implements the following video generation method:
- Example 15 provides a storage medium containing computer-executable instructions, when executed by a computer processor, the computer-executable instructions are used to perform the following video generation method:
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Databases & Information Systems (AREA)
- Computational Linguistics (AREA)
- Software Systems (AREA)
- Computer Security & Cryptography (AREA)
- Human Computer Interaction (AREA)
- Television Signal Processing For Recording (AREA)
Abstract
Description
Claims (15)
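- A video generation method, comprising: receiving a trigger operation acting on a video shooting page, and shooting an original video in response to the trigger operation; determining a video segment of interest in the original video; and performing video synthesis processing based on the video segment of interest and the original video to obtain a target video.
- The method according to claim 1, wherein the original video comprises a video obtained by shooting a picture of a user performing a set task; the method further comprises: in response to shooting the original video, displaying prompt information on the video shooting page to guide the user to perform the set task.
- The method according to claim 2, wherein the determining of the video segment of interest in the original video comprises: determining the video segment of interest in the original video based on image recognition.
- The method according to claim 3, wherein the determining of the video segment of interest in the original video based on image recognition comprises: performing expression recognition on the image frames of the original video based on an expression recognition model, and recording the timestamps of first image frames that include a set expression and the corresponding expression scores; determining a first image frame whose expression score reaches a set threshold as a second image frame; and acquiring the video segment of interest according to the timestamp of the second image frame.
- The method according to claim 4, wherein the acquiring of the video segment of interest according to the timestamp of the second image frame comprises: taking the timestamp of the second image frame as a reference time point, intercepting a video of a set duration within the duration interval of the task corresponding to the second image frame as the video segment of interest.
- The method according to claim 2, wherein the performing of video synthesis processing based on the video segment of interest and the original video to obtain a target video comprises: generating at least one of title video data and ending video data based on the video segment of interest; generating in-film video data based on the original video; and splicing at least one of the title video data and the ending video data with the in-film video data to generate the target video.
- The method according to claim 6, wherein the video generation method satisfies at least one of the following: the generating of title video data based on the video segment of interest comprises: generating the title video data based on the video segment of interest and a first preset template; the generating of in-film video data based on the original video comprises: generating the in-film video data based on the original video and a second preset template; the generating of ending video data based on the video segment of interest comprises: generating the ending video data based on the video segment of interest and a third preset template.
- The method according to claim 7, wherein the generating of title video data based on the video segment of interest and the first preset template comprises: adding the video segment of interest to a first set position of the first preset template, so as to play the video segment of interest at the first set position; displaying at least one of identification information of the task and identification information of the user at a second set position of the first preset template; thereby generating the title video data.
- The method according to claim 7, wherein the generating of in-film video data based on the original video and the second preset template comprises: adding the original video to a third set position of the second preset template, so as to play the original video at the third set position; performing at least one of the following: displaying a matching animation at a fourth set position of the second preset template according to how the user performs the set task, and displaying associated information of the set task at a fifth set position of the second preset template according to the content of the set task; thereby generating the in-film video data.
- The method according to claim 9, wherein the displaying of a matching animation at the fourth set position of the second preset template according to how the user performs the set task comprises at least one of the following: when the user speaks a preset word, displaying an animation matching the preset word at the fourth set position; when the user makes a set action, displaying an animation matching the set action at the fourth set position; displaying, at the fourth set position, an animation matching the accuracy with which the user performs the set task.
- The method according to claim 7, wherein the generating of ending video data based on the video segment of interest and the third preset template comprises: capturing a target image based on the video segment of interest; adding the target image to a sixth set position of the third preset template, so as to display the target image at the sixth set position; displaying, at a seventh set position of the third preset template, matching content corresponding to the user's task completion degree; thereby generating the ending video data.
- The method according to any one of claims 1-11, wherein the original video comprises a portrait video, and the target video comprises a landscape video.
- A video generation apparatus, comprising: a shooting module configured to receive a trigger operation acting on a video shooting page, and to shoot an original video in response to the trigger operation; a determination module configured to determine a video segment of interest in the original video; and a processing module configured to perform video synthesis processing based on the video segment of interest and the original video to obtain a target video.
- An electronic device, comprising: at least one processor; and a storage apparatus configured to store at least one program, wherein when the at least one program is executed by the at least one processor, the at least one processor implements the video generation method according to any one of claims 1-12.
- A storage medium containing computer-executable instructions, wherein the computer-executable instructions, when executed by a computer processor, are used to perform the video generation method according to any one of claims 1-12.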
Priority Applications (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| EP22784167.3A EP4322521A4 (en) | 2021-01-27 | 2022-04-11 | VIDEO GENERATING METHOD AND APPARATUS, ELECTRONIC DEVICE AND STORAGE MEDIUM |
| US18/483,289 US12592260B2 (en) | 2021-01-27 | 2023-10-09 | Video generation method and apparatus, electronic device, and storage medium |
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202110112638 | 2021-01-27 | ||
| CN202110384712.8 | 2021-04-09 | ||
| CN202110384712.8A CN113099129A (zh) | 2021-01-27 | 2021-04-09 | Video generation method and apparatus, electronic device, and storage medium |
Related Child Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US18/483,289 Continuation-In-Part US12592260B2 (en) | 2021-01-27 | 2023-10-09 | Video generation method and apparatus, electronic device, and storage medium |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2022214101A1 (zh) | 2022-10-13 |
Family
ID=76675987
Family Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| PCT/CN2022/086090 Ceased WO2022214101A1 (zh) | 2021-01-27 | 2022-04-11 | Video generation method and apparatus, electronic device, and storage medium |
Country Status (4)
| Country | Link |
|---|---|
| US (1) | US12592260B2 (zh) |
| EP (1) | EP4322521A4 (zh) |
| CN (1) | CN113099129A (zh) |
| WO (1) | WO2022214101A1 (zh) |
-
2021
- 2021-04-09 CN CN202110384712.8A patent/CN113099129A/zh active Pending
-
2022
- 2022-04-11 EP EP22784167.3A patent/EP4322521A4/en active Pending
- 2022-04-11 WO PCT/CN2022/086090 patent/WO2022214101A1/zh not_active Ceased
-
2023
- 2023-10-09 US US18/483,289 patent/US12592260B2/en active Active
Also Published As
| Publication number | Publication date |
|---|---|
| US20240038273A1 (en) | 2024-02-01 |
| EP4322521A1 (en) | 2024-02-14 |
| EP4322521A4 (en) | 2024-08-14 |
| US12592260B2 (en) | 2026-03-31 |
| CN113099129A (zh) | 2021-07-09 |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| | 121 | Ep: the epo has been informed by wipo that ep was designated in this application | Ref document number: 22784167; Country of ref document: EP; Kind code of ref document: A1 |
| | WWE | Wipo information: entry into national phase | Ref document number: 2022784167; Country of ref document: EP |
| | WWE | Wipo information: entry into national phase | Ref document number: 2022784167; Country of ref document: EP |
| | NENP | Non-entry into the national phase | Ref country code: DE |
| | ENP | Entry into the national phase | Ref document number: 2022784167; Country of ref document: EP; Effective date: 20231109 |
| | ENP | Entry into the national phase | Ref document number: 2022784167; Country of ref document: EP; Effective date: 20231108 |