WO2024221941A1 - Video generation method, apparatus, device, and storage medium - Google Patents

Video generation method, apparatus, device, and storage medium

Info

Publication number
WO2024221941A1
Authority
WO
WIPO (PCT)
Prior art keywords
video
text information
multimedia material
video editing
editing template
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/CN2023/136857
Other languages
English (en)
French (fr)
Inventor
唐艾妮
张天奇
张琪智
周慧敏
郑涵奇
钟浩华
张浩然
李�根
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Zitiao Network Technology Co Ltd
Original Assignee
Beijing Zitiao Network Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Zitiao Network Technology Co Ltd
Priority to JP2024520785A (JP7803636B2, ja)
Priority to EP23866709.1A (EP4478723A4, en)
Priority to US18/622,479 (US12524940B2, en)
Publication of WO2024221941A1 (zh)

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43 Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/44 Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs
    • H04N21/44016 Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs involving splicing one content stream with another content stream, e.g. for substituting a video clip
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43 Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/439 Processing of audio elementary streams
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43 Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/439 Processing of audio elementary streams
    • H04N21/4398 Processing of audio elementary streams involving reformatting operations of audio signals
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43 Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/44 Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N5/00 Details of television systems
    • H04N5/222 Studio circuitry; Studio devices; Studio equipment
    • H04N5/262 Studio circuits, e.g. for mixing, switching-over, change of character of image, other special effects; Cameras specially adapted for the electronic generation of special effects

Definitions

  • the present disclosure relates to the field of data processing, and in particular to a video generation method, device, equipment and storage medium.
  • the present disclosure provides a video generation method, device, equipment and storage medium, which can enrich the video generation method and improve the user experience.
  • the present disclosure provides a video generation method, the method comprising:
  • a target video is generated based on the first text information and the at least one multimedia material; wherein the target video presents the at least one multimedia material, the video effect of the target video meets the video effect requirements described in the first text information, and the target video is used to present a combination of at least one video clip, wherein the at least one video clip is formed based on each image material in the at least one multimedia material, and each image material includes a video material and/or an image material.
  • generating a target video based on the first text information and the at least one multimedia material includes:
  • the video editing draft includes the at least one multimedia material and editing information
  • the editing information is used to indicate an editing operation for the at least one multimedia material
  • the editing operation is at least used to edit each image material in the at least one multimedia material into the at least one video clip
  • the video editing effect corresponding to the editing operation and/or the at least one multimedia material meets the video effect requirements described in the first text information
  • a target video is generated according to the video editing draft.
  • generating a video editing draft based on the first text information and the at least one multimedia material includes:
  • determining at least one video editing template based on the first text information and the at least one multimedia material includes:
  • At least one video editing template is obtained by matching the feature tags of the first text information and the at least one multimedia material with the available video editing templates, wherein the at least one video editing template includes a first video editing template that matches the feature tags of the first text information and a second video editing template that matches the feature tags of the at least one multimedia material.
  • the acquiring at least one multimedia material includes:
  • before obtaining the first text information, the method further includes:
  • the obtaining of the first text information includes:
  • before receiving the first text information based on the text input box, the method further includes:
  • receiving the first text information based on the text input box includes:
  • first text information is acquired.
  • the method further includes:
  • a fourth video editing template is selected from the at least one video editing template, and the fourth video editing template is used to replace the third video editing template presented on the preview page, so that the preview page is used to preview the video effect obtained by importing the at least one multimedia material into the fourth video editing template.
  • the method further includes:
  • the fifth video editing template displayed on the preview page is replaced by the sixth video editing template in the second video editing template set.
  • before determining the second video editing template set based on the adjusted text information and the at least one multimedia material, the method further includes:
  • determining a second video editing template set based on the adjusted text information and the at least one multimedia material includes:
  • a second video editing template set is determined.
  • the present disclosure provides a video generation device, the device comprising:
  • a first acquisition module is used to acquire first text information; wherein the first text information is used to describe video effect requirements;
  • a second acquisition module used to acquire at least one multimedia material
  • a generation module is used to generate a target video based on the first text information and the at least one multimedia material; wherein the target video presents the at least one multimedia material, the video effect of the target video meets the video effect requirements described in the first text information, and the target video is used to present a combination of at least one video clip, wherein the at least one video clip is formed based on each image material in the at least one multimedia material, and the each image material includes a video material and/or an image material.
  • the present disclosure provides a computer-readable storage medium, wherein the computer-readable storage medium stores instructions, and when the instructions are executed on a terminal device, the terminal device implements the above method.
  • the present disclosure provides a video generation device, comprising: a memory, a processor, and a computer program stored in the memory and executable on the processor. When the processor executes the computer program, the above method is implemented.
  • the present disclosure provides a computer program product, wherein the computer program product comprises a computer program/instructions, and the computer program/instructions implement the above method when executed by a processor.
  • the disclosed embodiments provide a method for generating a video. Specifically, first text information for describing video effect requirements and at least one multimedia material are obtained; then, a target video is generated based on the first text information and the at least one multimedia material; wherein the target video presents the at least one multimedia material, the video effect of the target video meets the video effect requirements described in the first text information, and the target video is used to present a combination of at least one video clip, wherein the at least one video clip is formed based on each image material in the at least one multimedia material, and the each image material includes a video material and/or an image material. It can be seen that the disclosed embodiments can generate a target video that meets the video effect requirements described in the first text information based on the acquisition of the first text information and the multimedia material, thereby enriching the video generation method and improving the user experience.
  • FIG. 1 is a flow chart of a video generation method provided by an embodiment of the present disclosure.
  • FIG. 2 is a schematic diagram of a material selection page provided by an embodiment of the present disclosure.
  • FIG. 3 is a schematic diagram of another material selection page provided in an embodiment of the present disclosure.
  • FIG. 4 is a flow chart of another video generation method provided by an embodiment of the present disclosure.
  • FIG. 5 is a flow chart of another video generation method provided by an embodiment of the present disclosure.
  • FIG. 6 is a schematic diagram of a preview page provided by an embodiment of the present disclosure.
  • FIG. 7 is a schematic diagram of the structure of a video generating device provided by an embodiment of the present disclosure.
  • FIG. 8 is a schematic diagram of the structure of a video generating device provided by an embodiment of the present disclosure.
  • the disclosed embodiment provides a video generation method, which can generate a target video that meets the video effect requirements by analyzing text information describing video effect requirements, enriching the video generation method and thus improving the user experience.
  • first text information for describing the video effect requirements is obtained, and at least one multimedia material is obtained; then, based on the first text information and the at least one multimedia material, a target video is generated.
  • the at least one multimedia material is presented in the target video, and the video effect of the target video meets the video effect requirements described in the first text information.
  • the target video is used to present a combination of at least one video clip, and the at least one video clip is formed based on the image material in the at least one multimedia material obtained, and the image material may include video material and/or image material.
  • an embodiment of the present disclosure provides a video generation method.
  • FIG. 1 is a flow chart of a video generation method provided by an embodiment of the present disclosure. The method includes:
  • S101 Acquire first text information; wherein the first text information is used to describe the video effect requirements.
  • the first text information may be text information input by a user.
  • the method of inputting the text information is not limited.
  • the first text information may be input by voice input, or by keyboard input, or by importing text information, etc.
  • the first text information is text information that can describe the video effect requirement.
  • the video effect requirement described in the first text information may be a requirement for the video style type; for example, the first text information may be "comic style".
  • the video effect requirement described in the first text information may also be a requirement for the video display content; for example, the first text information may be "warm summer, warm afternoon".
  • the present disclosed embodiment does not specifically limit the video effect requirement described in the first text information.
  • S102 Acquire at least one multimedia material.
  • the embodiment of the present disclosure needs to obtain multimedia materials, which may include pictures, videos, audios, etc.
  • multimedia materials can be obtained by user import.
  • FIG. 2 is a schematic diagram of a material selection page provided in an embodiment of the present disclosure.
  • the material selection page displays the multimedia materials in the user material set.
  • when an import operation for at least one multimedia material is received, the at least one imported multimedia material is obtained.
  • display of a text input box can also be triggered.
  • as shown in FIG. 2, after the multimedia material 201 is imported, a text input box 202 is displayed on the material selection page.
  • the first text information can be entered in the text input box 202, thereby obtaining the first text information.
  • At least one video tag can also be displayed.
  • as shown in FIG. 3, multiple video tags, such as the video tag "comic style", are displayed below the text input box 301.
  • when a target video tag is selected, the target video tag is added to the text input box 301, and the first text information is thereby obtained.
  • the target video tag added to the text input box 301 may include one or more video tags displayed on the material selection page.
  • the first text information may include only the target video tag, or only the text information input by the user, or may include the target video tag and the text information input by the user.
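  • as a non-limiting illustration, the three cases above (tags only, typed text only, or both) can be sketched as follows. The function name `compose_first_text`, the comma separator, and the removal of repeated tags are assumptions made for this sketch and are not specified by the disclosure.

```python
from typing import Iterable


def compose_first_text(selected_tags: Iterable[str], typed_text: str = "") -> str:
    """Combine selected video tags and user-typed text into one string.

    Hypothetical helper: the disclosure only states that the first text
    information may contain tags, typed text, or both.
    """
    parts: list[str] = []
    for tag in selected_tags:
        tag = tag.strip()
        # Skip empty tags and tags already added to the text input box.
        if tag and tag not in parts:
            parts.append(tag)
    typed_text = typed_text.strip()
    if typed_text:
        parts.append(typed_text)
    # The ", " separator is an assumption; no separator is specified.
    return ", ".join(parts)
```

  • calling `compose_first_text(["comic style"], "warm summer, warm afternoon")` yields a single string that carries both the tag and the typed description.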
  • At least one multimedia material can be obtained based on the analysis of the first text information.
  • at least one multimedia material can be matched from a user material set based on the analysis result of the first text information.
  • the first text information is semantically analyzed by a natural language analysis algorithm, and at least one multimedia material is matched from a user material set based on the analysis result, and is subsequently used to generate a target video.
  • At least one multimedia material can be generated based on the analysis results of the first text information.
  • the first text information is semantically analyzed through a natural language analysis algorithm, and multimedia materials such as pictures, video clips, and audio are generated based on the analysis results, which are subsequently used to generate the target video.
  • the multimedia material matched from the user material set and the generated multimedia material all meet the video effect requirement described in the first text information.
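  • as a non-limiting illustration, matching materials from a user material set against the first text information might look like the sketch below. The disclosure does not name a specific natural language analysis algorithm; plain keyword overlap is substituted here purely for demonstration, and `Material`, `extract_keywords`, and `match_materials` are hypothetical names.

```python
import re
from dataclasses import dataclass, field


@dataclass
class Material:
    """A user material; `tags` stands in for whatever features a real system extracts."""
    name: str
    kind: str  # "picture", "video" or "audio"
    tags: set[str] = field(default_factory=set)


def extract_keywords(text: str) -> set[str]:
    # Naive stand-in for semantic analysis: lowercase alphabetic tokens.
    return set(re.findall(r"[a-z]+", text.lower()))


def match_materials(first_text: str, material_set: list[Material]) -> list[Material]:
    """Return materials whose tags overlap the text keywords, best match first."""
    keywords = extract_keywords(first_text)
    scored = [(len(m.tags & keywords), m) for m in material_set]
    # Keep only materials sharing at least one keyword. sorted() is stable,
    # so equally scored materials keep their order in the material set.
    matched = sorted((s for s in scored if s[0] > 0), key=lambda s: s[0], reverse=True)
    return [m for _, m in matched]
```

  • a production system would presumably replace the keyword overlap with a semantic model, but the shape of the step stays the same: analyze the text, score each material, and keep those that fit the described video effect.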
  • S103 Generate a target video based on the first text information and the at least one multimedia material.
  • the target video presents the at least one multimedia material
  • the video effect of the target video meets the video effect requirements described in the first text information
  • the target video is used to present a combination of at least one video clip
  • the at least one video clip is formed based on each image material in the at least one multimedia material
  • the image material includes video material and/or image material.
  • the target video is generated using the first text information and the at least one multimedia material.
  • the specific video generation method is specifically introduced in the subsequent embodiments and will not be repeated here.
  • first text information for describing the video effect requirements is obtained, and at least one multimedia material is obtained; then, based on the first text information and the at least one multimedia material, a target video is generated.
  • the target video presents the at least one multimedia material, and the video effect of the target video meets the video effect requirements described in the first text information.
  • the target video is used to present a combination of at least one video clip, and the at least one video clip is formed based on the image material in the at least one multimedia material obtained, and the image material may include video material and/or image material.
  • the embodiment of the present disclosure further provides a video generation method.
  • FIG. 4 is a flow chart of another video generation method provided by the embodiment of the present disclosure. The video generation method specifically includes:
  • S401 Acquire first text information; wherein the first text information is used to describe video effect requirements.
  • S402 Acquire at least one multimedia material.
  • the manner of obtaining the first text information and at least one multimedia material can be understood with reference to the above embodiment, and will not be described in detail here.
  • S403 Generate a video editing draft based on the first text information and the at least one multimedia material.
  • the video editing draft includes the at least one multimedia material and editing information
  • the editing information is used to indicate the editing operation on the at least one multimedia material
  • the editing operation is at least used to edit each image material in the at least one multimedia material into the at least one video clip respectively
  • the video editing effect corresponding to the editing operation and/or the at least one multimedia material meets the video effect requirements described in the first text information.
  • a video editing draft is generated based on an analysis of the first text information, or a comprehensive analysis of the first text information and the at least one multimedia material.
  • the video editing draft includes the acquired at least one multimedia material and editing information, wherein the editing information is used to indicate an editing operation for the at least one multimedia material, and the editing operation is at least used to edit each image material in the at least one multimedia material into one or more video clips.
  • a video clip may include one image material or a combination of multiple image materials.
  • the video editing effect corresponding to the editing operation referred to by the editing information meets the video effect requirements described in the first text information, and the multimedia materials in the video editing draft also meet the video effect requirements described in the first text information.
  • the editing information included in the video editing draft can be used to indicate an editing operation determined based on an analysis of the first text information. For example, if the first text information is "Warm summer, tender afternoon", then through analysis of the first text information, it can be determined that the editing information is used to indicate an editing operation including adding an A filter to one or more video clips.
  • the editing information included in the video editing draft may be used to indicate an editing operation determined based on an analysis of the multimedia material imported by the user. For example, if the multimedia material imported by the user includes summer vacation pictures, video clips, etc., then the editing operation indicated by the editing information may include adding a B filter to one or more video clips, etc., through analysis of the multimedia material imported by the user.
  • the editing information included in the video editing draft can be used to indicate editing operations determined based on an analysis of both the first text information and the multimedia materials imported by the user. The specific method is described in the above two implementations and will not be repeated here.
  • the editing information included in the video editing draft can be used to indicate editing operations, including editing operations indicated by a target video editing template, wherein the editing operations indicated by the target video editing template are used to edit the acquired multimedia material.
  • the target video editing template can be a video editing template selected by the user, or a video editing template determined based on an analysis of the first text information, and the contents related to the target video editing template are described in detail in subsequent embodiments.
  • a video editing draft is generated based on the acquired first text information and multimedia material
  • further editing operations may be performed on the video editing draft, such as adjusting all or part of the editing information in the video editing draft.
  • a video editing draft is displayed on a preview page; in response to an export operation for the video editing draft, a target video is generated based on the video editing draft, wherein the video effect of the exported target video meets the video effect requirement described in the first text information.
  • a video editing draft is generated based on the first text information and the multimedia material. Then, a target video that meets the video effect requirements described in the first text information is generated based on the video editing draft. It can be seen that the embodiment of the present disclosure generates a video editing draft based on the first text information and the multimedia material, and then generates a target video that meets the video effect requirements described in the first text information based on the video editing draft, which enriches the video generation method and thereby improves the user experience.
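  • as a non-limiting illustration, a video editing draft that holds the materials together with editing information could be modeled as below. The class names, the operation kinds `make_clip` and `add_filter`, and the `FILTER_RULES` lookup are all assumptions for this sketch; the disclosure only refers to "an A filter" and does not state how a filter is chosen from the text.

```python
from dataclasses import dataclass, field


@dataclass
class EditOperation:
    """One entry of the editing information, e.g. 'add filter A to clip 0'."""
    kind: str          # "make_clip" or "add_filter" in this sketch
    clip_index: int
    params: dict = field(default_factory=dict)


@dataclass
class VideoEditingDraft:
    materials: list[str]  # the acquired multimedia materials
    editing_info: list[EditOperation] = field(default_factory=list)


# Hypothetical mapping from words in the text to filter names.
FILTER_RULES = {"summer": "A", "comic": "comic"}


def build_draft(first_text: str, image_materials: list[str]) -> VideoEditingDraft:
    draft = VideoEditingDraft(materials=list(image_materials))
    text = first_text.lower()
    chosen = [name for word, name in FILTER_RULES.items() if word in text]
    for index, material in enumerate(image_materials):
        # Each image material is edited into its own video clip.
        draft.editing_info.append(EditOperation("make_clip", index, {"source": material}))
        for filter_name in chosen:
            draft.editing_info.append(EditOperation("add_filter", index, {"filter": filter_name}))
    return draft
```

  • keeping the draft as data rather than rendered video means its editing information can still be adjusted before the export operation produces the target video.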
  • the present disclosure further provides a video generation method.
  • FIG. 5 is a flow chart of another video generation method provided by an embodiment of the present disclosure. The video generation method includes:
  • S501 Acquire first text information; wherein the first text information is used to describe video effect requirements;
  • S502 Acquire at least one multimedia material.
  • the method of obtaining the first text information and at least one multimedia material can still be understood by referring to the above embodiment, and will not be described in detail here.
  • S503 Determine at least one video editing template based on the first text information and the at least one multimedia material.
  • the editing effect of the at least one video editing template meets the video effect requirement described in the first text information.
  • the feature tags of the first text information and of the multimedia material are extracted respectively; then, based on the feature tags of the first text information and the at least one multimedia material, the available video editing templates are matched to obtain at least one video editing template, wherein the at least one video editing template includes a first video editing template that matches the feature tags of the first text information and a second video editing template that matches the feature tags of the at least one multimedia material.
  • the feature tags corresponding to the first text information and the multimedia material are matched with the available video editing templates in the template library to obtain at least one successfully matched video editing template, wherein the editing effect of the successfully matched video editing template meets the video effect requirements described in the first text information.
  • the video editing templates matched to the first text information and those matched to the multimedia material are mixed, and the mixed video editing templates can be displayed on the preview page.
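  • as a non-limiting illustration, the feature-tag matching and mixing described above might be sketched as follows. Representing each template as a set of tags, matching on any shared tag, and interleaving the two result lists are assumptions of this sketch; the disclosure does not fix a matching rule or a mixing order.

```python
from itertools import zip_longest


def match_templates(tags: set[str], templates: dict[str, set[str]]) -> list[str]:
    """Return names of templates sharing at least one feature tag with `tags`."""
    return [name for name, template_tags in templates.items() if tags & template_tags]


def mix_templates(text_matches: list[str], material_matches: list[str]) -> list[str]:
    """Interleave the two match lists, dropping duplicates.

    Interleaving is an assumption; the disclosure only says the templates are mixed.
    """
    mixed: list[str] = []
    for pair in zip_longest(text_matches, material_matches):
        for name in pair:
            if name is not None and name not in mixed:
                mixed.append(name)
    return mixed
```

  • interleaving gives text-matched and material-matched templates alternating positions, so neither source dominates the first templates shown on the preview page.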
  • FIG. 6 is a schematic diagram of a preview page provided by an embodiment of the present disclosure, wherein a preset number of video editing templates are displayed in the lower area 601 of the preview page, and the preset number of video editing templates are determined based on the acquired first text information and at least one multimedia material.
  • the user can trigger the display of more video editing templates on the preview page by sliding horizontally in the lower area 601, for example, by sliding horizontally to the left to trigger the display of more video editing templates from the right side of the preview page.
  • the video editing template pulled out for display is also determined based on the acquired first text information and at least one multimedia material.
  • a third video editing template is selected from at least one acquired video editing template and presented on a preview page of the video editing effect, so that the preview page is used to preview the video effect obtained by importing at least one acquired multimedia material into the third video editing template, and an update recommendation control is provided on the preview page; wherein the third video editing template can be any selected video editing template displayed on the preview page.
  • a fourth video editing template is selected from the at least one video editing template, and the third video editing template presented on the preview page is replaced with the fourth video editing template, so that the preview page is used to preview the video effect obtained by importing the at least one multimedia material into the fourth video editing template.
  • the fourth video editing template used to replace the third video editing template may belong to at least one video editing template determined based on the first text information and the at least one multimedia material obtained.
  • the current user can trigger an update display of the video editing template displayed on the preview page by triggering a "change batch" control 602 set on the preview page.
  • the first video editing template in the first video editing template set is displayed on the preview page.
  • the first video editing template set is composed of at least one video editing template determined based on the first text information and at least one multimedia material.
  • a preset number of video editing templates in the first video editing template set are displayed on the preview page.
  • the lower area 601 of the preview page as shown in FIG6 displays a preset number of video editing templates, including the first video editing template.
  • the first video editing template can be any video editing template displayed on the preview page.
  • the first video editing template displayed on the preview page is replaced with the second video editing template in the first video editing template set.
  • the video editing template currently displayed on the preview page is replaced with the preset number of video editing templates in the first video editing template set, so as to realize the updating of the video editing template, so that the user can generate the target video based on the updated video editing template displayed on the preview page.
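  • as a non-limiting illustration, replacing the displayed templates with another preset number of templates from the first video editing template set could be sketched as below. The class name `TemplateCarousel` and the wrap-around to the first batch once the set is exhausted are assumptions of this sketch.

```python
class TemplateCarousel:
    """Shows `batch_size` templates at a time and cycles through the set."""

    def __init__(self, template_set: list[str], batch_size: int) -> None:
        if batch_size <= 0:
            raise ValueError("batch_size must be positive")
        self.template_set = list(template_set)
        self.batch_size = batch_size
        self.start = 0

    def current_batch(self) -> list[str]:
        return self.template_set[self.start:self.start + self.batch_size]

    def change_batch(self) -> list[str]:
        """Replace the displayed templates with the next batch.

        Wrapping around to the first batch is an assumption of this sketch.
        """
        self.start += self.batch_size
        if self.start >= len(self.template_set):
            self.start = 0
        return self.current_batch()
```

  • each call to `change_batch()` corresponds to one trigger of the "change batch" control 602; the last batch may contain fewer templates than the preset number.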
  • S504 Apply the editing operation indicated by the target video editing template in the at least one video editing template to the at least one multimedia material to generate a video editing draft.
  • the editing operation indicated by the target video editing template is applied to the acquired multimedia material to generate a video editing draft.
  • users can also trigger switching operations for video editing templates.
  • a preview effect of a video editing draft applying any video editing template (such as video editing template A) is displayed in the preview window 603 on the preview page.
  • Users can trigger a preview effect of a video editing draft applying video editing template B in the preview window 603 by selecting other video editing templates (such as video editing template B).
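  • as a non-limiting illustration, applying a template to the materials and switching to another template can be sketched as below. The `Template` fields (`filter_name`, `clip_seconds`) and the dictionary layout of the draft are hypothetical; an actual template would presumably carry far richer editing operations.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Template:
    name: str
    filter_name: str
    clip_seconds: float


def apply_template(template: Template, materials: list[str]) -> dict:
    """Build a draft by applying the template's editing operations to each material."""
    clips = [
        {"source": m, "duration": template.clip_seconds, "filter": template.filter_name}
        for m in materials
    ]
    return {"template": template.name, "materials": list(materials), "clips": clips}
```

  • switching from video editing template A to video editing template B then amounts to calling `apply_template` again with the same materials, which produces a new draft for the preview window without altering the materials themselves.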
  • the user can also adjust the first text information according to the preview effect of the video editing draft so as to generate a video editing draft that meets the user's video effect requirements.
  • a video editing template determined based on the initial first text information and the multimedia material is displayed on the preview page, and after receiving the text adjustment operation for the initial first text information, the adjusted text information is obtained, and then, based on the adjusted text information and the multimedia material, a video editing template that meets the video effect requirements described in the adjusted text information is re-determined.
  • the user can generate a video editing draft that meets the video effect requirements described in the adjusted text information based on the re-determined video editing template.
  • a fifth video editing template in the at least one video editing template is displayed on the preview page; wherein the fifth video editing template is any video editing template determined based on the first text information and the multimedia material.
  • the adjusted text information is obtained; then, based on the adjusted text information and the multimedia material, the second video editing template set is re-determined; and the fifth video editing template displayed on the preview page is replaced with the sixth video editing template in the second video editing template set.
  • the sixth video editing template is any video editing template re-determined based on the adjusted text information and the multimedia material.
  • the user can adjust not only the first text information but also the multimedia material according to the preview effect of the video editing draft, so as to generate a video editing draft that meets the user's video effect requirements.
  • a material adjustment operation for an initial multimedia material is received to obtain an adjusted multimedia material; wherein the adjusted multimedia material may include all or part of the multimedia material in the initial multimedia material, and the material adjustment operation may include operations such as adding, deleting, and replacing the material for the initial multimedia material.
  • a second video editing template set is determined. The video editing templates in the second video editing template set are re-determined based on the first text information (or the adjusted text information) and the adjusted multimedia material.
  • S505 Generate a target video according to the video editing draft.
  • after the video editing draft is generated, the target video can be generated by triggering an export operation for the video editing draft.
  • the generated target video can be saved locally or to the cloud, or a publishing operation can be triggered for the target video.
  • a video editing template that meets the video effect requirements is determined based on text information and multimedia materials that describe the video effect requirements, and then a target video is generated based on the video editing template, thereby enriching the video generation method and improving the user experience.
  • the present disclosure further provides a video generating device.
  • a schematic diagram of the structure of a video generating device provided by an embodiment of the present disclosure is shown.
  • the device includes:
  • the first acquisition module 701 is used to acquire first text information; wherein the first text information is used to describe the video effect requirements;
  • the second acquisition module 702 is used to acquire at least one multimedia material
  • the generation module 703 is used to generate a target video based on the first text information and the at least one multimedia material; wherein the target video presents the at least one multimedia material, the video effect of the target video meets the video effect requirements described in the first text information, the target video is used to present a combination of at least one video clip, the at least one video clip is formed based on each visual material in the at least one multimedia material, and each visual material includes video material and/or image material.
  • the generating module includes:
  • a first generating submodule is configured to generate a video editing draft based on the first text information and the at least one multimedia material; wherein the video editing draft includes the at least one multimedia material and editing information, the editing information is used to indicate an editing operation for the at least one multimedia material, the editing operation is at least used to edit each visual material in the at least one multimedia material into the at least one video clip, and the video editing effect corresponding to the editing operation and/or the at least one multimedia material meets the video effect requirement described in the first text information;
  • the second generating submodule is used to generate a target video according to the video editing draft.
  • the second generation submodule includes:
  • the first determining submodule is used to determine at least one video editing template based on the first text information and the at least one multimedia material; wherein the editing effect of the at least one video editing template meets the video effect requirement described in the first text information;
  • the third generating submodule is used to apply the editing operation indicated by the target video editing template in the at least one video editing template to the at least one multimedia material to generate a video editing draft.
  • the first determining submodule includes:
  • An extraction submodule used to extract feature tags of the first text information and the at least one multimedia material respectively;
  • a first matching submodule is used to obtain at least one video editing template by matching available video editing templates based on the first text information and the feature tags of the at least one multimedia material, wherein the at least one video editing template includes a first video editing template that matches the feature tags of the first text information and a second video editing template that matches the feature tags of the at least one multimedia material.
  • the second acquisition module includes:
  • a second matching submodule configured to match a first multimedia material from at least one multimedia material in a user material set based on an analysis result of the first text information
  • the fourth generation submodule is used to generate a second multimedia material in at least one multimedia material based on the analysis result of the first text information; wherein the at least one multimedia material meets the video effect requirement described in the first text information.
  • the device further includes:
  • a first display module configured to display a text input box in response to an import operation for at least one multimedia material
  • the first acquisition module is specifically used for receiving the first text information based on the text input box.
  • the device further includes:
  • a second display module is used to display at least one video tag; wherein the video tag is used to represent the video effect;
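  • the first acquisition module is specifically used for acquiring the first text information based on an operation of adding a target video tag among the at least one video tag to the text input box.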
  • the device further includes:
  • a third display module is used to select a third video editing template from the at least one video editing template and present it on a preview page of the video editing effect, so that the preview page is used to preview the video effect obtained by importing the at least one multimedia material into the third video editing template, and an update recommendation control is set on the preview page;
  • the first replacement module is used to select a fourth video editing template from the at least one video editing template in response to a trigger operation on the update recommendation control, and use the fourth video editing template to replace the third video editing template presented on the preview page, so that the preview page is used to preview the video effect obtained by importing the at least one multimedia material into the fourth video editing template.
  • the device further includes:
  • a fourth display module configured to display a fifth video editing template among the at least one video editing template on a preview page
  • a first adjustment module configured to obtain adjusted text information in response to a text adjustment operation on the first text information on the preview page
  • a first determining module determining a second video editing template set based on the adjusted text information and the at least one multimedia material
  • the second replacement module is used to replace the fifth video editing template displayed on the preview page with the sixth video editing template in the second video editing template set.
  • the device further includes:
  • a second adjustment module configured to receive a material adjustment operation for the at least one multimedia material, and obtain an adjusted multimedia material
  • the first determining module is specifically used for determining the second video editing template set based on the adjusted text information and the adjusted multimedia material.
  • the video generation apparatus obtains first text information for describing video effect requirements and obtains at least one multimedia material; then, based on the first text information and the at least one multimedia material, it generates a target video; wherein the target video presents the at least one multimedia material, the video effect of the target video meets the video effect requirements described in the first text information, the target video is used to present a combination of at least one video clip, the at least one video clip is formed based on each visual material in the at least one multimedia material, and each visual material includes a video material and/or an image material. It can be seen that the embodiment of the present disclosure can generate a target video that meets the video effect requirements described in the first text information based on the obtained first text information and multimedia material, thereby enriching the video generation method and improving the user experience.
  • the embodiments of the present disclosure further provide a computer-readable storage medium, in which instructions are stored.
  • when the instructions are run on a terminal device, the terminal device implements the video generation method described in the embodiments of the present disclosure.
  • the embodiment of the present disclosure further provides a computer program product, which includes a computer program/instructions.
  • the embodiment of the present disclosure further provides a video generation device, as shown in FIG. 8, which may include a processor 801, a memory 802, an input device 803 and an output device 804.
  • the number of processors 801 in the video generation device can be one or more, and one processor is taken as an example in FIG. 8.
  • the processor 801, memory 802, input device 803 and output device 804 can be connected via a bus or other means, wherein FIG. 8 takes the connection via a bus as an example.
  • the memory 802 can be used to store software programs and modules.
  • the processor 801 executes various functional applications and data processing of the video generation device by running the software programs and modules stored in the memory 802.
  • the memory 802 can mainly include a program storage area and a data storage area.
  • the program storage area can store an operating system, an application required for at least one function, etc.
  • the memory 802 can include a high-speed random access memory, and can also include a non-volatile memory.
  • the input device 803 may be used to receive input digital or character information and generate signal input related to user settings and function control of the video generation device.
  • the processor 801 will load the executable files corresponding to the processes of one or more applications into the memory 802 according to the following instructions, and the processor 801 will run the applications stored in the memory 802, thereby realizing various functions of the above-mentioned video generation device.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
  • Television Signal Processing For Recording (AREA)

Abstract

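The present disclosure provides a video generation method, apparatus, device and storage medium. The method includes: acquiring first text information used to describe a video effect requirement, and acquiring at least one multimedia material; and generating a target video based on the first text information and the at least one multimedia material. The target video presents the at least one multimedia material, and the video effect of the target video meets the video effect requirement described by the first text information. The target video is used to present a combination of at least one video clip, the at least one video clip is respectively formed based on each visual material among the at least one multimedia material, and each visual material includes a video material and/or an image material. Thus, based on the acquired first text information and multimedia material, embodiments of the present disclosure can generate a target video that meets the video effect requirement described by the first text information, which enriches the ways of generating video and improves the user experience.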

Description

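Video generation method, apparatus, device and storage medium
This application claims priority to Chinese invention patent application No. 202310446304.X, filed on April 23, 2023 and entitled "Video generation method, apparatus, device and storage medium", the entire contents of which are incorporated herein by reference.
Technical Field
The present disclosure relates to the field of data processing, and in particular to a video generation method, apparatus, device and storage medium.
Background
With the continuous development of video processing technology, users' requirements for ways of generating video are becoming increasingly diverse. How to enrich the ways of generating video, so as to meet these diverse requirements and improve the user experience, is therefore a technical problem that urgently needs to be solved.
Summary
To solve the above technical problem, the present disclosure provides a video generation method, apparatus, device and storage medium that can enrich the ways of generating video and improve the user experience.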
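In a first aspect, the present disclosure provides a video generation method, the method including:
acquiring first text information, wherein the first text information is used to describe a video effect requirement;
acquiring at least one multimedia material; and
generating a target video based on the first text information and the at least one multimedia material, wherein the target video presents the at least one multimedia material, the video effect of the target video meets the video effect requirement described by the first text information, the target video is used to present a combination of at least one video clip, the at least one video clip is respectively formed based on each visual material among the at least one multimedia material, and each visual material includes a video material and/or an image material.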
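In an optional implementation, generating the target video based on the first text information and the at least one multimedia material includes:
generating a video editing draft based on the first text information and the at least one multimedia material, wherein the video editing draft includes the at least one multimedia material and editing information, the editing information is used to indicate editing operations for the at least one multimedia material, the editing operations are at least used to edit each visual material among the at least one multimedia material into the at least one video clip respectively, and the video editing effect corresponding to the editing operations and/or the at least one multimedia material meets the video effect requirement described by the first text information; and
generating the target video according to the video editing draft.
In an optional implementation, generating the video editing draft based on the first text information and the at least one multimedia material includes:
determining at least one video editing template based on the first text information and the at least one multimedia material, wherein the editing effect of the at least one video editing template meets the video effect requirement described by the first text information; and
applying the editing operations indicated by a target video editing template among the at least one video editing template to the at least one multimedia material to generate the video editing draft.
In an optional implementation, determining the at least one video editing template based on the first text information and the at least one multimedia material includes:
extracting feature tags of the first text information and of the at least one multimedia material respectively; and
matching the feature tags of the first text information and of the at least one multimedia material against available video editing templates to obtain the at least one video editing template, wherein the at least one video editing template includes a first video editing template that matches the feature tags of the first text information and a second video editing template that matches the feature tags of the at least one multimedia material.
In an optional implementation, acquiring the at least one multimedia material includes:
matching, from a user material set and based on an analysis result of the first text information, a first multimedia material among the at least one multimedia material;
and/or generating, based on the analysis result of the first text information, a second multimedia material among the at least one multimedia material, wherein the at least one multimedia material meets the video effect requirement described by the first text information.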
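In an optional implementation, before acquiring the first text information, the method further includes:
displaying a text input box in response to an import operation for at least one multimedia material;
correspondingly, acquiring the first text information includes:
receiving the first text information based on the text input box.
In an optional implementation, before receiving the first text information based on the text input box, the method further includes:
displaying at least one video tag, wherein the video tag is used to represent a video effect;
correspondingly, receiving the first text information based on the text input box includes:
acquiring the first text information based on an operation of adding a target video tag among the at least one video tag to the text input box.
In an optional implementation, after determining the at least one video editing template based on the first text information and the at least one multimedia material, the method further includes:
selecting a third video editing template from the at least one video editing template and presenting it on a preview page of video editing effects, so that the preview page is used to preview the video effect obtained by importing the at least one multimedia material into the third video editing template, wherein an update recommendation control is provided on the preview page; and
in response to a trigger operation on the update recommendation control, selecting a fourth video editing template from the at least one video editing template and replacing the third video editing template presented on the preview page with the fourth video editing template, so that the preview page is used to preview the video effect obtained by importing the at least one multimedia material into the fourth video editing template.
In an optional implementation, after determining the at least one video editing template based on the first text information and the at least one multimedia material, the method further includes:
displaying a fifth video editing template among the at least one video editing template on a preview page;
obtaining adjusted text information in response to a text adjustment operation on the first text information on the preview page;
determining a second video editing template set based on the adjusted text information and the at least one multimedia material; and
replacing the fifth video editing template displayed on the preview page with a sixth video editing template in the second video editing template set.
In an optional implementation, before determining the second video editing template set based on the adjusted text information and the at least one multimedia material, the method further includes:
receiving a material adjustment operation for the at least one multimedia material to obtain adjusted multimedia material;
correspondingly, determining the second video editing template set based on the adjusted text information and the at least one multimedia material includes:
determining the second video editing template set based on the adjusted text information and the adjusted multimedia material.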
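In a second aspect, the present disclosure provides a video generation apparatus, the apparatus including:
a first acquisition module, configured to acquire first text information, wherein the first text information is used to describe a video effect requirement;
a second acquisition module, configured to acquire at least one multimedia material; and
a generation module, configured to generate a target video based on the first text information and the at least one multimedia material, wherein the target video presents the at least one multimedia material, the video effect of the target video meets the video effect requirement described by the first text information, the target video is used to present a combination of at least one video clip, the at least one video clip is respectively formed based on each visual material among the at least one multimedia material, and each visual material includes a video material and/or an image material.
In a third aspect, the present disclosure provides a computer-readable storage medium storing instructions which, when run on a terminal device, cause the terminal device to implement the above method.
In a fourth aspect, the present disclosure provides a video generation device, including a memory, a processor, and a computer program stored in the memory and executable on the processor, wherein the processor implements the above method when executing the computer program.
In a fifth aspect, the present disclosure provides a computer program product including a computer program/instructions which, when executed by a processor, implement the above method.
Compared with the prior art, the technical solutions provided by the embodiments of the present disclosure have at least the following advantages:
An embodiment of the present disclosure provides a video generation method. Specifically, first text information used to describe a video effect requirement is acquired, and at least one multimedia material is acquired; a target video is then generated based on the first text information and the at least one multimedia material. The target video presents the at least one multimedia material, the video effect of the target video meets the video effect requirement described by the first text information, the target video is used to present a combination of at least one video clip, the at least one video clip is respectively formed based on each visual material among the at least one multimedia material, and each visual material includes a video material and/or an image material. Thus, based on the acquired first text information and multimedia material, embodiments of the present disclosure can generate a target video that meets the video effect requirement described by the first text information, which enriches the ways of generating video and improves the user experience.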
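Brief Description of the Drawings
The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate embodiments consistent with the present disclosure and, together with the specification, serve to explain the principles of the present disclosure.
To explain the technical solutions in the embodiments of the present disclosure or in the prior art more clearly, the drawings needed for describing the embodiments or the prior art are briefly introduced below. Obviously, a person of ordinary skill in the art can obtain other drawings from these drawings without creative effort.
FIG. 1 is a flowchart of a video generation method provided by an embodiment of the present disclosure;
FIG. 2 is a schematic diagram of a material selection page provided by an embodiment of the present disclosure;
FIG. 3 is a schematic diagram of another material selection page provided by an embodiment of the present disclosure;
FIG. 4 is a flowchart of another video generation method provided by an embodiment of the present disclosure;
FIG. 5 is a flowchart of another video generation method provided by an embodiment of the present disclosure;
FIG. 6 is a schematic diagram of a preview page provided by an embodiment of the present disclosure;
FIG. 7 is a schematic structural diagram of a video generation apparatus provided by an embodiment of the present disclosure;
FIG. 8 is a schematic structural diagram of a video generation device provided by an embodiment of the present disclosure.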
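Detailed Description
In order that the above objects, features and advantages of the present disclosure may be understood more clearly, the solutions of the present disclosure are further described below. It should be noted that, provided there is no conflict, the embodiments of the present disclosure and the features in the embodiments may be combined with each other.
Many specific details are set forth in the following description to facilitate a full understanding of the present disclosure, but the present disclosure may also be implemented in ways other than those described herein. Obviously, the embodiments in the specification are only some, not all, of the embodiments of the present disclosure.
With the continuous development of video processing technology, users' requirements for ways of generating video are becoming increasingly diverse. To this end, an embodiment of the present disclosure provides a video generation method that, by analyzing text information describing a video effect requirement, can generate a target video meeting that requirement, which enriches the ways of generating video and improves the user experience.
Specifically, in the video generation method provided by the embodiments of the present disclosure, first text information used to describe a video effect requirement is acquired, and at least one multimedia material is acquired; a target video is then generated based on the first text information and the at least one multimedia material. The target video presents the at least one multimedia material, and the video effect of the target video meets the video effect requirement described by the first text information. In addition, the target video is used to present a combination of at least one video clip, the at least one video clip is respectively formed based on the visual materials among the acquired at least one multimedia material, and a visual material may include a video material and/or an image material. Thus, based on the acquired first text information and multimedia material, embodiments of the present disclosure can generate a target video that meets the video effect requirement described by the first text information, which enriches the ways of generating video and improves the user experience.
On this basis, an embodiment of the present disclosure provides a video generation method. FIG. 1 is a flowchart of a video generation method provided by an embodiment of the present disclosure. The method includes:
S101: Acquire first text information.
The first text information is used to describe a video effect requirement.
In the embodiments of the present disclosure, the first text information may be text information input by a user. The input manner is not limited: for example, the first text information may be input by voice, by keyboard, or by importing text information.
The first text information is text information capable of describing a video effect requirement. Optionally, the requirement may concern the style of the video, in which case the first text information may be, for example, "comic style". The requirement may also concern the content shown in the video, in which case the first text information may be, for example, "warm summer day, tender afternoon". The embodiments of the present disclosure do not specifically limit the video effect requirement described by the first text information.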
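S102: Acquire at least one multimedia material.
In the embodiments of the present disclosure, multimedia material also needs to be acquired before the target video is generated. The multimedia material may include pictures, video, audio and the like.
In an optional implementation, the multimedia material may be acquired through user import. FIG. 2 is a schematic diagram of a material selection page provided by an embodiment of the present disclosure. The material selection page displays the multimedia materials in a user material set, and when an import operation for at least one multimedia material is received, the imported at least one multimedia material is acquired. In addition, receiving the import operation may also trigger the display of a text input box: as shown in FIG. 2, after multimedia material 201 is imported, a text input box 202 is displayed on the material selection page. The first text information can be entered in the text input box 202, whereby the first text information is acquired.
In addition, at least one video tag may be displayed on the material selection page together with the text input box. As shown in FIG. 3, multiple video tags, such as the video tag "comic style", are displayed below the text input box 301. Selecting a target video tag from the displayed video tags triggers adding the target video tag to the text input box 301, yielding the first text information. The target video tag added to the text input box 301 may include one or more of the video tags displayed on the material selection page.
Specifically, the first text information may include only the target video tag, only text information input by the user, or both the target video tag and text information input by the user.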
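The way the first text information is assembled from selected video tags and typed text can be sketched as follows. This is a minimal illustration only: the class `TextInputBox`, its fields, and the comma-joined output format are assumptions made for the example and are not specified by the disclosure.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class TextInputBox:
    """Hypothetical model of the text input box (202 / 301)."""
    typed_text: str = ""
    selected_tags: List[str] = field(default_factory=list)

    def add_tag(self, tag: str) -> None:
        # Selecting a displayed video tag adds it to the box (only once).
        if tag not in self.selected_tags:
            self.selected_tags.append(tag)

    def first_text_information(self) -> str:
        # Tags only, typed text only, or both are all acceptable inputs.
        parts = [*self.selected_tags, self.typed_text.strip()]
        return ", ".join(p for p in parts if p)


box = TextInputBox()
box.add_tag("comic style")
box.typed_text = "warm summer day"
print(box.first_text_information())
```

Joining tags and free text into one string is just one possible convention; an implementation could equally keep them as separate fields for later analysis.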
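In another optional implementation, the at least one multimedia material may be acquired based on an analysis of the first text information. Optionally, at least one multimedia material may be matched from the user material set based on the analysis result of the first text information. Specifically, a natural language analysis algorithm performs semantic analysis on the first text information, and at least one multimedia material is matched from the user material set based on the analysis result for subsequent use in generating the target video.
Alternatively, at least one multimedia material may be generated based on the analysis result of the first text information. Specifically, a natural language analysis algorithm performs semantic analysis on the first text information, and multimedia material such as pictures, video clips and audio is generated based on the analysis result for subsequent use in generating the target video.
In the embodiments of the present disclosure, both the multimedia material matched from the user material set and the multimedia material generated on the basis of the analysis result of the first text information meet the video effect requirement described by the first text information.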
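As a rough sketch of matching materials from the user material set, the example below replaces the natural language analysis algorithm with simple keyword overlap. The disclosure does not name a specific algorithm, so `analyze_text`, `match_materials`, and the `Material` type with pre-assigned tags are all hypothetical stand-ins.

```python
from dataclasses import dataclass
from typing import FrozenSet, List, Set


@dataclass(frozen=True)
class Material:
    """Hypothetical multimedia material with pre-assigned descriptive tags."""
    name: str
    kind: str  # "image", "video" or "audio"
    tags: FrozenSet[str]


def analyze_text(first_text: str) -> Set[str]:
    # Stand-in for semantic analysis: a lowercase keyword set.
    cleaned = first_text.lower().replace(",", " ")
    return set(cleaned.split())


def match_materials(first_text: str, user_materials: List[Material]) -> List[Material]:
    # Keep every material that shares at least one keyword with the text.
    keywords = analyze_text(first_text)
    return [m for m in user_materials if keywords & m.tags]


user_material_set = [
    Material("beach.jpg", "image", frozenset({"summer", "beach"})),
    Material("snow.mp4", "video", frozenset({"winter", "snow"})),
]
matched = match_materials("Warm summer, tender afternoon", user_material_set)
print([m.name for m in matched])
```

A real system would likely use semantic similarity rather than exact keyword overlap, which misses synonyms and paraphrases.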
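S103: Generate a target video based on the first text information and the at least one multimedia material.
The target video presents the at least one multimedia material, the video effect of the target video meets the video effect requirement described by the first text information, the target video is used to present a combination of at least one video clip, the at least one video clip is respectively formed based on each visual material among the at least one multimedia material, and the visual material includes a video material and/or an image material.
In the embodiments of the present disclosure, after the first text information and the at least one multimedia material are acquired, the target video is generated using the first text information and the at least one multimedia material. The specific ways of generating the video are described in detail in subsequent embodiments and are not repeated here.
In the video generation method provided by the embodiments of the present disclosure, first text information used to describe a video effect requirement is acquired, and at least one multimedia material is acquired; a target video is then generated based on the first text information and the at least one multimedia material. The target video presents the at least one multimedia material, and the video effect of the target video meets the video effect requirement described by the first text information. In addition, the target video is used to present a combination of at least one video clip, the at least one video clip is respectively formed based on the visual materials among the acquired at least one multimedia material, and a visual material may include a video material and/or an image material. Thus, based on the acquired first text information and multimedia material, embodiments of the present disclosure can generate a target video that meets the video effect requirement described by the first text information, which enriches the ways of generating video and improves the user experience.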
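On the basis of the above embodiment, an embodiment of the present disclosure further provides a video generation method. FIG. 4 is a flowchart of another video generation method provided by an embodiment of the present disclosure. The video generation method specifically includes:
S401: Acquire first text information, wherein the first text information is used to describe a video effect requirement.
S402: Acquire at least one multimedia material.
In the embodiments of the present disclosure, the ways of acquiring the first text information and the at least one multimedia material can be understood with reference to the above embodiment and are not repeated here.
S403: Generate a video editing draft based on the first text information and the at least one multimedia material.
The video editing draft includes the at least one multimedia material and editing information, the editing information is used to indicate editing operations for the at least one multimedia material, the editing operations are at least used to edit each visual material among the at least one multimedia material into the at least one video clip respectively, and the video editing effect corresponding to the editing operations and/or the at least one multimedia material meets the video effect requirement described by the first text information.
In the embodiments of the present disclosure, after the first text information and the at least one multimedia material are acquired, the video editing draft is generated based on an analysis of the first text information, or on a combined analysis of the first text information and the at least one multimedia material.
The video editing draft includes the acquired at least one multimedia material and editing information. The editing information is used to indicate editing operations for the at least one multimedia material, and the editing operations are at least used to edit the visual materials among the at least one multimedia material into one or more video clips, where one video clip may include one visual material or a combination of several visual materials. The video editing effect corresponding to the editing operations indicated by the editing information meets the video effect requirement described by the first text information, and the multimedia material in the video editing draft also meets that requirement.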
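One way to picture a video editing draft is as a record that holds the materials together with a list of editing operations. The sketch below is a hypothetical data model: the names `MediaItem`, `EditOperation`, `VideoEditingDraft` and `build_draft`, and the action strings `make_clip` and `add_filter`, are assumptions for illustration and do not come from the disclosure.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class MediaItem:
    name: str
    kind: str  # "video", "image" or "audio"


@dataclass
class EditOperation:
    action: str          # hypothetical action names, e.g. "make_clip"
    targets: List[str]   # names of the materials the operation applies to
    params: Dict[str, str] = field(default_factory=dict)


@dataclass
class VideoEditingDraft:
    materials: List[MediaItem]         # the acquired multimedia materials
    editing_info: List[EditOperation]  # editing operations to apply to them


def build_draft(materials: List[MediaItem], filter_name: str) -> VideoEditingDraft:
    ops: List[EditOperation] = []
    for item in materials:
        # Only visual materials (video or image) are edited into video clips.
        if item.kind in ("video", "image"):
            ops.append(EditOperation("make_clip", [item.name]))
    clip_sources = [op.targets[0] for op in ops]
    if clip_sources:
        # One effect-level operation applied across all clips.
        ops.append(EditOperation("add_filter", clip_sources, {"filter": filter_name}))
    return VideoEditingDraft(list(materials), ops)


draft = build_draft(
    [MediaItem("a.mp4", "video"), MediaItem("b.jpg", "image"), MediaItem("c.mp3", "audio")],
    filter_name="A",
)
print([op.action for op in draft.editing_info])
```

Keeping the draft as data rather than rendered video is what allows the editing information to be adjusted before export, as described for S404.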
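In an optional implementation, the editing information included in the video editing draft may indicate editing operations determined by analyzing the first text information. For example, if the first text information is "warm summer day, tender afternoon", analysis of the first text information may determine that the editing operations indicated by the editing information include adding filter A to one or more video clips.
In another optional implementation, the editing information included in the video editing draft may indicate editing operations determined by analyzing the multimedia material imported by the user. For example, if the imported multimedia material includes summer holiday pictures and video clips, analysis of the imported material may determine that the editing operations indicated by the editing information include adding filter B to one or more video clips.
Combining the two implementations above, the editing operations indicated by the editing information may include editing operations determined by analyzing both the first text information and the multimedia material imported by the user. For details, refer to the descriptions of the two implementations above, which are not repeated here.
In yet another optional implementation, the editing operations indicated by the editing information include the editing operations indicated by a target video editing template, which are used to edit the acquired multimedia material. The target video editing template may be a video editing template selected by the user, or a video editing template determined by analyzing the first text information. Details of the target video editing template are described in subsequent embodiments.
S404: Generate a target video according to the video editing draft.
In the embodiments of the present disclosure, after the video editing draft is generated based on the acquired first text information and multimedia material, further editing operations may be performed on the video editing draft, for example adjusting all or part of the editing information in the draft.
In an optional implementation, the video editing draft is displayed on a preview page and, in response to an export operation for the video editing draft, the target video is generated based on the video editing draft. The video effect of the exported target video meets the video effect requirement described by the first text information.
In the video generation method provided by the embodiments of the present disclosure, after the first text information describing the video effect requirement and the multimedia material are acquired, a video editing draft is generated based on the first text information and the multimedia material. A target video that meets the video effect requirement described by the first text information is then generated according to the video editing draft. Thus, the embodiments of the present disclosure generate a video editing draft based on the first text information and the multimedia material, and then generate from that draft a target video meeting the described video effect requirement, which enriches the ways of generating video and improves the user experience.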
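Based on the above embodiments, an embodiment of the present disclosure further provides a video generation method. FIG. 5 is a flowchart of another video generation method provided by an embodiment of the present disclosure. The video generation method includes:
S501: Acquire first text information, wherein the first text information is used to describe a video effect requirement.
S502: Acquire at least one multimedia material.
In the embodiments of the present disclosure, the ways of acquiring the first text information and the at least one multimedia material can again be understood with reference to the above embodiments and are not repeated here.
S503: Determine at least one video editing template based on the first text information and the at least one multimedia material.
The editing effect of the at least one video editing template meets the video effect requirement described by the first text information.
In an optional implementation, after the first text information and the multimedia material are acquired, feature tags of the first text information and of the multimedia material are extracted respectively. The feature tags of the first text information and of the at least one multimedia material are then matched against available video editing templates to obtain at least one video editing template, which includes a first video editing template that matches the feature tags of the first text information and a second video editing template that matches the feature tags of the at least one multimedia material.
In an optional implementation, the feature tags corresponding respectively to the first text information and to the multimedia material are matched against the available video editing templates in a template library, and at least one successfully matched video editing template is obtained. The editing effect of a successfully matched video editing template meets the video effect requirement described by the first text information.
Further, after the video editing templates matching the feature tags of the first text information and the video editing templates matching the feature tags of the multimedia material are obtained, the two groups of video editing templates are mixed together, and the mixed video editing templates may be displayed on the preview page.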
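The tag matching and mixing described above can be sketched as follows. The disclosure does not say how the two groups of templates are mixed, so the alternating interleave with de-duplication shown here is only one plausible reading; `Template`, `match` and `interleave` are hypothetical names, and the feature tags are assumed to be already extracted.

```python
from dataclasses import dataclass
from itertools import zip_longest
from typing import FrozenSet, List, Set


@dataclass(frozen=True)
class Template:
    name: str
    tags: FrozenSet[str]


def match(library: List[Template], feature_tags: Set[str]) -> List[Template]:
    # A template matches when it shares at least one feature tag.
    return [t for t in library if t.tags & feature_tags]


def interleave(text_matched: List[Template], material_matched: List[Template]) -> List[Template]:
    # Alternate between the two groups, skipping templates already listed.
    mixed: List[Template] = []
    seen: Set[str] = set()
    for pair in zip_longest(text_matched, material_matched):
        for template in pair:
            if template is not None and template.name not in seen:
                seen.add(template.name)
                mixed.append(template)
    return mixed


library = [
    Template("T1", frozenset({"comic"})),
    Template("T2", frozenset({"beach"})),
    Template("T3", frozenset({"comic", "beach"})),
    Template("T4", frozenset({"night"})),
]
text_tags = {"comic"}       # assumed feature tags of the first text information
material_tags = {"beach"}   # assumed feature tags of the multimedia material
mixed = interleave(match(library, text_tags), match(library, material_tags))
print([t.name for t in mixed])
```

Alternating the two groups keeps templates driven by the text and templates driven by the materials both visible near the front of the list, instead of showing one group entirely before the other.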
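FIG. 6 is a schematic diagram of a preview page provided by an embodiment of the present disclosure. A preset number of video editing templates are displayed in the lower area 601 of the preview page, and these templates are determined based on the acquired first text information and at least one multimedia material. The user can trigger the display of more video editing templates on the preview page through a horizontal swipe operation acting on the lower area 601; for example, a leftward swipe pulls more video editing templates into view from the right side of the preview page. The templates pulled into view are likewise determined based on the acquired first text information and at least one multimedia material.
In an optional implementation, a third video editing template is selected from the acquired at least one video editing template and presented on the preview page of video editing effects, so that the preview page is used to preview the video effect obtained by importing the acquired at least one multimedia material into the third video editing template. An update recommendation control is provided on the preview page. The third video editing template may be any selected video editing template displayed on the preview page.
In response to a trigger operation on the update recommendation control, a fourth video editing template is selected from the at least one video editing template, and the third video editing template presented on the preview page is replaced with the fourth video editing template, so that the preview page is used to preview the video effect obtained by importing the at least one multimedia material into the fourth video editing template. The fourth video editing template used to replace the third video editing template may belong to the at least one video editing template determined based on the first text information and the acquired at least one multimedia material.
In an optional implementation, if the video editing templates displayed on the preview page do not meet the current user's needs, the current user can trigger an update of the displayed video editing templates through a trigger operation on the "Change Batch" control 602 provided on the preview page.
Specifically, a first video editing template in a first video editing template set is displayed on the preview page, where the first video editing template set consists of the at least one video editing template determined based on the first text information and the at least one multimedia material. A preset number of video editing templates from the first video editing template set are displayed on the preview page. As shown in FIG. 6, the lower area 601 of the preview page displays a preset number of video editing templates including the first video editing template, which may be any video editing template displayed on the preview page. In response to a trigger operation on the update recommendation control on the preview page (the "Change Batch" control 602 shown in FIG. 6), the first video editing template displayed on the preview page is replaced with a second video editing template in the first video editing template set. In other words, in response to the trigger operation on the update recommendation control, the video editing templates currently displayed on the preview page are replaced with a preset number of video editing templates from the first video editing template set. This updates the displayed video editing templates, so that the user can generate the target video based on the updated templates displayed on the preview page.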
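The behavior of the update recommendation control can be sketched as paging through the first video editing template set a preset number of templates at a time. The class `TemplateRecommender` and its wrap-around behavior at the end of the set are assumptions for illustration; the disclosure does not specify what happens once every template has been shown.

```python
from typing import List, Sequence


class TemplateRecommender:
    """Hypothetical pager over the first video editing template set."""

    def __init__(self, template_set: Sequence[str], page_size: int) -> None:
        if page_size <= 0:
            raise ValueError("page_size must be positive")
        self._templates = list(template_set)
        self._page_size = page_size  # the preset number shown in area 601
        self._start = 0

    def displayed(self) -> List[str]:
        # Templates currently shown in the lower area of the preview page.
        n = len(self._templates)
        count = min(self._page_size, n)
        return [self._templates[(self._start + i) % n] for i in range(count)]

    def refresh(self) -> List[str]:
        # Triggered by the update recommendation control: advance one page,
        # wrapping around to the beginning of the set (an assumption).
        if self._templates:
            self._start = (self._start + self._page_size) % len(self._templates)
        return self.displayed()


recommender = TemplateRecommender(["A", "B", "C", "D", "E"], page_size=2)
print(recommender.displayed())
print(recommender.refresh())
```

Because the pager only reorders a set that was already determined from the text and materials, every template it shows still meets the described video effect requirement.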
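S504: Apply the editing operations indicated by a target video editing template among the at least one video editing template to the at least one multimedia material to generate a video editing draft.
In the embodiments of the present disclosure, in response to a selection operation on a target video editing template among the at least one video editing template displayed on the preview page, the editing operations indicated by the target video editing template are applied to the acquired multimedia material to generate a video editing draft.
In practice, the user can also trigger a switching operation between video editing templates. Specifically, the preview window 603 on the preview page displays the preview effect of a video editing draft to which some video editing template (for example, video editing template A) has been applied. By selecting another video editing template (for example, video editing template B), the user can trigger the preview window 603 to display the preview effect of a video editing draft to which video editing template B has been applied.
In another optional implementation, on the preview page the user can also adjust the first text information according to the preview effect of the video editing draft, so that a video editing draft meeting the user's video effect requirement can be generated.
Specifically, a video editing template determined based on the initial first text information and the multimedia material is displayed on the preview page. After a text adjustment operation on the initial first text information is received, adjusted text information is obtained; then, based on the adjusted text information and the multimedia material, a video editing template meeting the video effect requirement described by the adjusted text information is re-determined. Based on the re-determined video editing template, the user can generate a video editing draft that meets the video effect requirement described by the adjusted text information.
In an optional implementation, a fifth video editing template among the at least one video editing template is displayed on the preview page, where the fifth video editing template is any video editing template determined based on the first text information and the multimedia material. In response to a text adjustment operation on the first text information on the preview page, adjusted text information is obtained; a second video editing template set is then re-determined based on the adjusted text information and the multimedia material, and the fifth video editing template displayed on the preview page is replaced with a sixth video editing template in the second video editing template set. The sixth video editing template is any video editing template re-determined based on the adjusted text information and the multimedia material.
On this basis, according to the preview effect of the video editing draft, the user can adjust not only the first text information but also the multimedia material on the preview page, so that a video editing draft meeting the user's video effect requirement can be generated.
In an optional implementation, a material adjustment operation for the initial multimedia material is received to obtain adjusted multimedia material. The adjusted multimedia material may include all or part of the initial multimedia material, and the material adjustment operation may include adding, deleting and replacing material in the initial multimedia material. The second video editing template set is determined based on the adjusted text information and the adjusted multimedia material. The video editing templates in the second video editing template set are re-determined based on the first text information (or the adjusted text information) and the adjusted multimedia material.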
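The adjustment loop described above, in which changing the text or the materials re-determines the template set and replaces the template shown on the preview page, might be sketched like this. `PreviewPage` is a hypothetical class, and `toy_determine` is a deliberately trivial stand-in for whatever template determination logic is actually used; neither comes from the disclosure.

```python
from dataclasses import dataclass
from typing import Callable, List, Sequence


@dataclass
class PreviewPage:
    """Hypothetical preview page state."""
    determine: Callable[[str, List[str]], List[str]]  # returns a template set
    text: str
    materials: List[str]
    shown: str = ""  # the template currently displayed

    def __post_init__(self) -> None:
        self._redetermine()

    def _redetermine(self) -> None:
        # Re-determine the template set and replace the displayed template.
        template_set = self.determine(self.text, self.materials)
        self.shown = template_set[0] if template_set else ""

    def adjust_text(self, new_text: str) -> None:
        self.text = new_text
        self._redetermine()

    def adjust_materials(self, add: Sequence[str] = (), remove: Sequence[str] = ()) -> None:
        # Add, delete, or (by doing both) replace materials.
        kept = [m for m in self.materials if m not in remove]
        self.materials = kept + [m for m in add if m not in kept]
        self._redetermine()


def toy_determine(text: str, materials: List[str]) -> List[str]:
    # Illustrative only: derives a template name from the inputs.
    return [f"{text}|{len(materials)}"]


page = PreviewPage(toy_determine, "comic", ["a.jpg"])
page.adjust_text("retro")
page.adjust_materials(add=["b.mp4"])
print(page.shown)
```

Funnelling both kinds of adjustment through one re-determination step keeps the displayed template consistent with the current text and materials.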
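S505: Generate a target video according to the video editing draft.
In the embodiments of the present disclosure, after the video editing draft is generated, the target video can be generated by triggering an export operation for the video editing draft. In addition, the generated target video can be saved locally or to the cloud, or a publishing operation can be triggered for the target video.
In the video generation method provided by the embodiments of the present disclosure, a video editing template meeting the video effect requirement is determined based on the text information describing that requirement and on the multimedia material, and a target video is then generated based on the video editing template, which enriches the ways of generating video and improves the user experience.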
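Based on the above method embodiments, the present disclosure further provides a video generation apparatus. FIG. 7 is a schematic structural diagram of a video generation apparatus provided by an embodiment of the present disclosure. The apparatus includes:
a first acquisition module 701, configured to acquire first text information, wherein the first text information is used to describe a video effect requirement;
a second acquisition module 702, configured to acquire at least one multimedia material; and
a generation module 703, configured to generate a target video based on the first text information and the at least one multimedia material, wherein the target video presents the at least one multimedia material, the video effect of the target video meets the video effect requirement described by the first text information, the target video is used to present a combination of at least one video clip, the at least one video clip is respectively formed based on each visual material among the at least one multimedia material, and each visual material includes a video material and/or an image material.
In an optional implementation, the generation module includes:
a first generation submodule, configured to generate a video editing draft based on the first text information and the at least one multimedia material, wherein the video editing draft includes the at least one multimedia material and editing information, the editing information is used to indicate editing operations for the at least one multimedia material, the editing operations are at least used to edit each visual material among the at least one multimedia material into the at least one video clip respectively, and the video editing effect corresponding to the editing operations and/or the at least one multimedia material meets the video effect requirement described by the first text information; and
a second generation submodule, configured to generate the target video according to the video editing draft.
In an optional implementation, the second generation submodule includes:
a first determination submodule, configured to determine at least one video editing template based on the first text information and the at least one multimedia material, wherein the editing effect of the at least one video editing template meets the video effect requirement described by the first text information; and
a third generation submodule, configured to apply the editing operations indicated by a target video editing template among the at least one video editing template to the at least one multimedia material to generate a video editing draft.
In an optional implementation, the first determination submodule includes:
an extraction submodule, configured to extract feature tags of the first text information and of the at least one multimedia material respectively; and
a first matching submodule, configured to match the feature tags of the first text information and of the at least one multimedia material against available video editing templates to obtain at least one video editing template, wherein the at least one video editing template includes a first video editing template that matches the feature tags of the first text information and a second video editing template that matches the feature tags of the at least one multimedia material.
In an optional implementation, the second acquisition module includes:
a second matching submodule, configured to match, from a user material set and based on an analysis result of the first text information, a first multimedia material among the at least one multimedia material;
and/or
a fourth generation submodule, configured to generate, based on the analysis result of the first text information, a second multimedia material among the at least one multimedia material, wherein the at least one multimedia material meets the video effect requirement described by the first text information.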
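In an optional implementation, the apparatus further includes:
a first display module, configured to display a text input box in response to an import operation for at least one multimedia material;
correspondingly, the first acquisition module is specifically configured to:
receive the first text information based on the text input box.
In an optional implementation, the apparatus further includes:
a second display module, configured to display at least one video tag, wherein the video tag is used to represent a video effect;
correspondingly, the first acquisition module is specifically configured to:
acquire the first text information based on an operation of adding a target video tag among the at least one video tag to the text input box.
In an optional implementation, the apparatus further includes:
a third display module, configured to select a third video editing template from the at least one video editing template and present it on a preview page of video editing effects, so that the preview page is used to preview the video effect obtained by importing the at least one multimedia material into the third video editing template, wherein an update recommendation control is provided on the preview page; and
a first replacement module, configured to, in response to a trigger operation on the update recommendation control, select a fourth video editing template from the at least one video editing template and replace the third video editing template presented on the preview page with the fourth video editing template, so that the preview page is used to preview the video effect obtained by importing the at least one multimedia material into the fourth video editing template.
In an optional implementation, the apparatus further includes:
a fourth display module, configured to display a fifth video editing template among the at least one video editing template on a preview page;
a first adjustment module, configured to obtain adjusted text information in response to a text adjustment operation on the first text information on the preview page;
a first determination module, configured to determine a second video editing template set based on the adjusted text information and the at least one multimedia material; and
a second replacement module, configured to replace the fifth video editing template displayed on the preview page with a sixth video editing template in the second video editing template set.
In an optional implementation, the apparatus further includes:
a second adjustment module, configured to receive a material adjustment operation for the at least one multimedia material to obtain adjusted multimedia material;
correspondingly, the first determination module is specifically configured to:
determine the second video editing template set based on the adjusted text information and the adjusted multimedia material.
The video generation apparatus provided by the embodiments of the present disclosure acquires first text information used to describe a video effect requirement and acquires at least one multimedia material, and then generates a target video based on the first text information and the at least one multimedia material. The target video presents the at least one multimedia material, the video effect of the target video meets the video effect requirement described by the first text information, the target video is used to present a combination of at least one video clip, the at least one video clip is respectively formed based on each visual material among the at least one multimedia material, and each visual material includes a video material and/or an image material. Thus, based on the acquired first text information and multimedia material, embodiments of the present disclosure can generate a target video that meets the video effect requirement described by the first text information, which enriches the ways of generating video and improves the user experience.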
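In addition to the above method and apparatus, an embodiment of the present disclosure further provides a computer-readable storage medium storing instructions which, when run on a terminal device, cause the terminal device to implement the video generation method described in the embodiments of the present disclosure.
An embodiment of the present disclosure further provides a computer program product including a computer program/instructions which, when executed by a processor, implement the video generation method described in the embodiments of the present disclosure.
In addition, an embodiment of the present disclosure further provides a video generation device which, as shown in FIG. 8, may include:
a processor 801, a memory 802, an input device 803 and an output device 804. The number of processors 801 in the video generation device may be one or more; one processor is taken as an example in FIG. 8. In some embodiments of the present disclosure, the processor 801, the memory 802, the input device 803 and the output device 804 may be connected by a bus or in other ways; connection by a bus is taken as an example in FIG. 8.
The memory 802 may be used to store software programs and modules, and the processor 801 executes the various functional applications and data processing of the video generation device by running the software programs and modules stored in the memory 802. The memory 802 may mainly include a program storage area and a data storage area, where the program storage area may store an operating system, an application program required for at least one function, and the like. In addition, the memory 802 may include a high-speed random access memory, and may also include a non-volatile memory, such as at least one magnetic disk storage device, a flash memory device, or another volatile solid-state storage device. The input device 803 may be used to receive input numeric or character information and to generate signal input related to user settings and function control of the video generation device.
Specifically, in this embodiment, the processor 801 loads the executable files corresponding to the processes of one or more application programs into the memory 802 according to the following instructions, and the processor 801 runs the application programs stored in the memory 802, thereby implementing the various functions of the above video generation device.
It should be noted that, herein, relational terms such as "first" and "second" are used only to distinguish one entity or operation from another, and do not necessarily require or imply any actual relationship or order between these entities or operations. Moreover, the terms "include", "comprise" and any other variants thereof are intended to cover non-exclusive inclusion, so that a process, method, article or device that includes a series of elements includes not only those elements but also other elements not explicitly listed, or elements inherent to such a process, method, article or device. Without further limitation, an element defined by the phrase "including a ..." does not exclude the presence of additional identical elements in the process, method, article or device that includes the element.
The above are only specific implementations of the present disclosure, provided to enable those skilled in the art to understand or implement the present disclosure. Various modifications to these embodiments will be apparent to those skilled in the art, and the general principles defined herein may be implemented in other embodiments without departing from the spirit or scope of the present disclosure. Therefore, the present disclosure is not to be limited to the embodiments described herein, but is to be accorded the widest scope consistent with the principles and novel features disclosed herein.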

Claims (13)

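  1. A video generation method, characterized in that the method comprises:
    acquiring first text information, wherein the first text information is used to describe a video effect requirement;
    acquiring at least one multimedia material; and
    generating a target video based on the first text information and the at least one multimedia material, wherein the target video presents the at least one multimedia material, the video effect of the target video meets the video effect requirement described by the first text information, the target video is used to present a combination of at least one video clip, the at least one video clip is respectively formed based on each visual material among the at least one multimedia material, and each visual material comprises a video material and/or an image material.
  2. The method according to claim 1, characterized in that generating the target video based on the first text information and the at least one multimedia material comprises:
    generating a video editing draft based on the first text information and the at least one multimedia material, wherein the video editing draft comprises the at least one multimedia material and editing information, the editing information is used to indicate editing operations for the at least one multimedia material, and the editing operations are at least used to edit each visual material among the at least one multimedia material into the at least one video clip respectively; and the video editing effect corresponding to the editing operations and/or the at least one multimedia material meets the video effect requirement described by the first text information; and
    generating the target video according to the video editing draft.
  3. The method according to claim 1, characterized in that generating the video editing draft based on the first text information and the at least one multimedia material comprises:
    determining at least one video editing template based on the first text information and the at least one multimedia material, wherein the editing effect of the at least one video editing template meets the video effect requirement described by the first text information; and
    applying the editing operations indicated by a target video editing template among the at least one video editing template to the at least one multimedia material to generate the video editing draft.
  4. The method according to claim 3, characterized in that determining the at least one video editing template based on the first text information and the at least one multimedia material comprises:
    extracting feature tags of the first text information and of the at least one multimedia material respectively; and
    matching the feature tags of the first text information and of the at least one multimedia material against available video editing templates to obtain the at least one video editing template, wherein the at least one video editing template comprises a first video editing template that matches the feature tags of the first text information and a second video editing template that matches the feature tags of the at least one multimedia material.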
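  5. The method according to claim 1, characterized in that acquiring the at least one multimedia material comprises:
    matching, from a user material set and based on an analysis result of the first text information, a first multimedia material among the at least one multimedia material;
    and/or
    generating, based on the analysis result of the first text information, a second multimedia material among the at least one multimedia material, wherein the at least one multimedia material meets the video effect requirement described by the first text information.
  6. The method according to claim 1, characterized in that, before acquiring the first text information, the method further comprises:
    displaying a text input box in response to an import operation for at least one multimedia material;
    correspondingly, acquiring the first text information comprises:
    receiving the first text information based on the text input box.
  7. The method according to claim 6, characterized in that, before receiving the first text information based on the text input box, the method further comprises:
    displaying at least one video tag, wherein the video tag is used to represent a video effect;
    correspondingly, receiving the first text information based on the text input box comprises:
    acquiring the first text information based on an operation of adding a target video tag among the at least one video tag to the text input box.
  8. The method according to claim 3, characterized in that, after determining the at least one video editing template based on the first text information and the at least one multimedia material, the method further comprises:
    selecting a third video editing template from the at least one video editing template and presenting it on a preview page of video editing effects, so that the preview page is used to preview the video effect obtained by importing the at least one multimedia material into the third video editing template, wherein an update recommendation control is provided on the preview page; and
    in response to a trigger operation on the update recommendation control, selecting a fourth video editing template from the at least one video editing template and replacing the third video editing template presented on the preview page with the fourth video editing template, so that the preview page is used to preview the video effect obtained by importing the at least one multimedia material into the fourth video editing template.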
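  9. The method according to claim 3, characterized in that, after determining the at least one video editing template based on the first text information and the at least one multimedia material, the method further comprises:
    displaying a fifth video editing template among the at least one video editing template on a preview page;
    obtaining adjusted text information in response to a text adjustment operation on the first text information on the preview page;
    determining a second video editing template set based on the adjusted text information and the at least one multimedia material; and
    replacing the fifth video editing template displayed on the preview page with a sixth video editing template in the second video editing template set.
  10. The method according to claim 9, characterized in that, before determining the second video editing template set based on the adjusted text information and the at least one multimedia material, the method further comprises:
    receiving a material adjustment operation for the at least one multimedia material to obtain adjusted multimedia material;
    correspondingly, determining the second video editing template set based on the adjusted text information and the at least one multimedia material comprises:
    determining the second video editing template set based on the adjusted text information and the adjusted multimedia material.
  11. A video generation apparatus, characterized in that the apparatus comprises:
    a first acquisition module, configured to acquire first text information, wherein the first text information is used to describe a video effect requirement;
    a second acquisition module, configured to acquire at least one multimedia material; and
    a generation module, configured to generate a target video based on the first text information and the at least one multimedia material, wherein the target video presents the at least one multimedia material, the video effect of the target video meets the video effect requirement described by the first text information, the target video is used to present a combination of at least one video clip, the at least one video clip is respectively formed based on each visual material among the at least one multimedia material, and each visual material comprises a video material and/or an image material.
  12. A computer-readable storage medium, characterized in that the computer-readable storage medium stores instructions which, when run on a terminal device, cause the terminal device to implement the method according to any one of claims 1-10.
  13. A video processing device, characterized by comprising a memory, a processor, and a computer program stored in the memory and executable on the processor, wherein the processor implements the method according to any one of claims 1-10 when executing the computer program.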
PCT/CN2023/136857 2023-04-23 2023-12-06 Video generation method, apparatus, device and storage medium Ceased WO2024221941A1 (zh)

Priority Applications (3)

Application Number Priority Date Filing Date Title
JP2024520785A JP7803636B2 (ja) 2023-04-23 2023-12-06 Video generation method, apparatus, device and storage medium
EP23866709.1A EP4478723A4 (en) 2023-04-23 2023-12-06 VIDEO GENERATING METHOD AND APPARATUS, DEVICE AND STORAGE MEDIUM
US18/622,479 US12524940B2 (en) 2023-04-23 2024-03-29 Method, apparatus, device and storage medium for video generation

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202310446304.XA CN118842959A (zh) 2023-04-23 2023-04-23 Video generation method, apparatus, device and storage medium
CN202310446304.X 2023-04-23

Related Child Applications (1)

Application Number Title Priority Date Filing Date
US18/622,479 Continuation US12524940B2 (en) 2023-04-23 2024-03-29 Method, apparatus, device and storage medium for video generation

Publications (1)

Publication Number Publication Date
WO2024221941A1 (zh) 2024-10-31

Family

ID=90718831

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2023/136857 Ceased WO2024221941A1 (zh) 2023-04-23 2023-12-06 Video generation method, apparatus, device and storage medium

Country Status (2)

Country Link
CN (1) CN118842959A (zh)
WO (1) WO2024221941A1 (zh)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN120897092A (zh) * 2025-07-18 2025-11-04 Beijing Zitiao Network Technology Co Ltd Interaction method, apparatus, electronic device and storage medium

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070083851A1 (en) * 2005-10-06 2007-04-12 Moda Co., Ltd. Template-based multimedia editor and editing method thereof
US20180096708A1 (en) * 2016-09-30 2018-04-05 Jocoos Co., Ltd. Video editing system and method
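CN110996017A (zh) * 2019-10-08 2020-04-10 Tsinghua University Method and apparatus for generating an edited video
CN113518160A (zh) * 2021-01-12 2021-10-19 Tencent Technology (Shenzhen) Co Ltd Video generation method, apparatus, device and storage medium
WO2022088783A1 (zh) * 2020-10-28 2022-05-05 Beijing Dajia Internet Information Technology Co Ltd Video production method and apparatus
CN115442539A (zh) * 2021-06-04 2022-12-06 Beijing Zitiao Network Technology Co Ltd Video editing method, apparatus, device and storage medium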


Also Published As

Publication number Publication date
CN118842959A (zh) 2024-10-25


Legal Events

Date Code Title Description
WWE Wipo information: entry into national phase

Ref document number: 2024520785

Country of ref document: JP

ENP Entry into the national phase

Ref document number: 2023866709

Country of ref document: EP

Effective date: 20240329

121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 23866709

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE