WO2022152064A1 - 视频生成方法、装置、电子设备和存储介质 - Google Patents

视频生成方法、装置、电子设备和存储介质 Download PDF

Info

Publication number
WO2022152064A1
WO2022152064A1 PCT/CN2022/070919 CN2022070919W WO2022152064A1 WO 2022152064 A1 WO2022152064 A1 WO 2022152064A1 CN 2022070919 W CN2022070919 W CN 2022070919W WO 2022152064 A1 WO2022152064 A1 WO 2022152064A1
Authority
WO
WIPO (PCT)
Prior art keywords
target
video
music
template
user
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/CN2022/070919
Other languages
English (en)
French (fr)
Inventor
莫唯书
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Zitiao Network Technology Co Ltd
Original Assignee
Beijing Zitiao Network Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Zitiao Network Technology Co Ltd filed Critical Beijing Zitiao Network Technology Co Ltd
Priority to JP2023542890A priority Critical patent/JP7632816B2/ja
Priority to EP22738945.9A priority patent/EP4262221B1/en
Publication of WO2022152064A1 publication Critical patent/WO2022152064A1/zh
Priority to US18/352,643 priority patent/US12033671B2/en
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B27/00Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
    • G11B27/02Editing, e.g. varying the order of information signals recorded on, or reproduced from, record carriers
    • G11B27/031Electronic editing of digitised analogue information signals, e.g. audio or video signals
    • G11B27/036Insert-editing
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B27/00Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
    • G11B27/02Editing, e.g. varying the order of information signals recorded on, or reproduced from, record carriers
    • G11B27/031Electronic editing of digitised analogue information signals, e.g. audio or video signals
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/44Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs
    • H04N21/44008Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs involving operations for analysing video streams, e.g. detecting features or characteristics in the video stream
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/60Information retrieval; Database structures therefor; File system structures therefor of audio data
    • G06F16/68Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/683Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B27/00Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
    • G11B27/10Indexing; Addressing; Timing or synchronising; Measuring tape travel
    • G11B27/34Indicating arrangements 
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/435Processing of additional data, e.g. decrypting of additional data, reconstructing software from modules extracted from the transport stream
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/439Processing of audio elementary streams
    • H04N21/4394Processing of audio elementary streams involving operations for analysing the audio stream, e.g. detecting features or characteristics in audio streams
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/44Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs
    • H04N21/44016Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs involving splicing one content stream with another content stream, e.g. for substituting a video clip
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47End-user applications
    • H04N21/472End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content
    • H04N21/47205End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content for manipulating displayed content, e.g. interacting with MPEG-4 objects, editing locally

Definitions

  • the present disclosure relates to the field of computer technology, for example, to a video generation method, apparatus, electronic device, and storage medium.
  • Short video application software generally provides users with the function of making and publishing short videos.
  • the creation threshold of short videos is relatively high, and it takes a long time to complete the production, resulting in a poor creation experience for users.
  • the present disclosure provides a video generation method, device, electronic device, and storage medium, so as to lower the threshold for creating short videos and shorten the time required to produce short videos.
  • the present disclosure provides a video generation method, including:
  • the present disclosure also provides a video generation device, comprising:
  • a first receiving module configured to receive a first trigger operation that uses target music to generate a video
  • the video generation module is configured to, in response to the first trigger operation, select a candidate material matching the target music from the user's candidate material set based on the first music feature information of the target music as the target candidate material, and based on the first music feature information of the target music
  • the target candidate material generates a target video, wherein the target video uses the target music as background music.
  • the present disclosure also provides an electronic device, comprising:
  • processors one or more processors
  • memory arranged to store one or more programs
  • the one or more processors When the one or more programs are executed by the one or more processors, the one or more processors implement the above-mentioned video generation method.
  • the present disclosure also provides a computer-readable storage medium on which a computer program is stored, and when the program is executed by a processor, implements the above-mentioned video generation method.
  • FIG. 1 is a schematic flowchart of a video generation method according to an embodiment of the present disclosure
  • FIG. 2 is a schematic diagram of a music details page provided by an embodiment of the present disclosure
  • FIG. 3 is a schematic diagram of a material notification window according to an embodiment of the present disclosure.
  • FIG. 4 is a schematic diagram of a permission opening window provided by an embodiment of the present disclosure.
  • FIG. 5 is a schematic flowchart of another video generation method provided by an embodiment of the present disclosure.
  • FIG. 6 is a schematic diagram of a loading page according to an embodiment of the present disclosure.
  • FIG. 7 is a schematic diagram of a preview page provided by an embodiment of the present disclosure.
  • FIG. 8 is a schematic diagram of a template window provided by an embodiment of the present disclosure.
  • FIG. 9 is a schematic diagram of a material window provided by an embodiment of the present disclosure.
  • FIG. 10 is a schematic diagram of a material editing window according to an embodiment of the present disclosure.
  • FIG. 11 is a schematic diagram of a video editing page provided by an embodiment of the present disclosure.
  • FIG. 12 is a structural block diagram of a video generation apparatus according to an embodiment of the present disclosure.
  • FIG. 13 is a schematic structural diagram of an electronic device according to an embodiment of the present disclosure.
  • method embodiments of the present disclosure may be performed in different orders and/or in parallel. Furthermore, method embodiments may include additional steps and/or omit performing the illustrated steps. The scope of the present disclosure is not limited in this regard.
  • the term “including” and variations thereof are open-ended inclusions, ie, "including but not limited to”.
  • the term “based on” is “based at least in part on.”
  • the term “one embodiment” means “at least one embodiment”; the term “another embodiment” means “at least one additional embodiment”; the term “some embodiments” means “at least some embodiments”. Relevant definitions of other terms will be given in the description below.
  • FIG. 1 is a schematic flowchart of a video generation method according to an embodiment of the present disclosure.
  • the method may be performed by a video generating apparatus, wherein the apparatus may be implemented by software and/or hardware, and may be configured in an electronic device, for example, the apparatus may be configured in a mobile phone or a tablet computer.
  • the video generation method provided by the embodiment of the present disclosure is suitable for a scene of automatically generating a video, and especially suitable for a scene of automatically generating a music video.
  • the video generation method provided by this embodiment may include:
  • S101 Receive a first trigger operation for generating a video by using target music.
  • the first trigger operation can be understood as a trigger operation used to instruct the electronic device to automatically generate a video by using a piece of music, such as a trigger operation of clicking a video generation control, and the video generation control can be displayed on the music list page, music details page, music playback page and / or displayed on pages such as the video playback page.
  • the target music can be the music that the user wants to use as the background music to make a video, such as the music in the selected state in the music list page when the user performs the first trigger operation, the music corresponding to the music details page, and the current music in the music playback page.
  • the following description will be given by taking the first trigger operation as a trigger operation acting on the video generation control in the music details page as an example.
  • the music details page of a piece of music can display the music information of the music (such as the music cover 20 of the music, the music name, the user name of the publisher, the number of times of playing, etc.) and the video Generate control 21, so that the user can instruct the electronic device to automatically generate a video with this music as background music by triggering the video generation control 21, such as generating a music video with this music as background music; can also be provided with play/pause controls 22, The favorite control 23 and/or the same style control 24, so that the user can instruct the electronic device to play/pause the music by clicking on the play/pause control 22, favorite the music by clicking on the favorite control 23, and/or, by clicking on the favorite control 23
  • the control 24 instructs the electronic device to switch the currently displayed page from the music details page of the music to the video shooting page and turn on the camera, so that the user can shoot the video with the music as the background music.
  • the music details page may also display prompt information 25 for the video generation control 21, such as animations, pictures and/or texts for prompting, so that the user can specify the video Generates the role of control 21.
  • the electronic device displays a music details page of music; when the user wants to instruct the electronic device to use the music to automatically generate a video, the user can click on the video generation control displayed on the music details page; When the user clicks on the video generation control on the music details page, the music corresponding to the music details page can be determined as the target music, and it is determined that the first trigger operation for generating a video by using the target music is received.
  • the electronic device may also display a corresponding window to inform the user that the user's material will be read and/or prompt the user to enable access to the material. permission.
  • the first trigger operation may also be an operation of triggering the corresponding control in the displayed window.
  • the electronic device displays a music details page of music, and can determine whether the user clicks the video generation control 21 for the first time when monitoring that the user clicks the video generation control 21 in the music details page, if the user clicks the video generation control 21 for the first time Click for the first time, the material notification window 30 is displayed, as shown in FIG.
  • the first music feature information may be music feature information of the target music, such as lyric information, rhythm information, and/or music style information of the target music.
  • the target video can be understood as a video generated by an electronic device using target music.
  • the candidate material can be understood as the material for the electronic device to generate the target video, which can be all the materials in the user's local album and/or cloud album, or part of the material in the user's local album and/or cloud album, such as High-quality materials captured in the user's local album and/or cloud album, the materials may include pictures and/or videos, etc.
  • the materials include pictures and videos, which can be captured, downloaded, imported by the user in advance , screenshot or record.
  • the electronic device When the electronic device receives the first trigger operation, it acquires the user's candidate material set, and based on the first music feature information of the target music, selects the candidate material matching the target music from the candidate material set according to the preset selection rule as the target candidate. materials, and synthesize all the selected target candidate materials into the target video, so that the user only needs to select the target music to automatically generate the target video.
  • the selection rule of the target candidate material can be set by the developer as needed, for example, according to the first music feature information of the target music, a candidate material with a picture style similar to the first music feature information can be selected as the target candidate material; or, The video template that matches the second music feature information of the background music with the first music feature information of the target music is used as the target video template, and the candidate material whose material theme is similar to the template body of the target video template is selected as the target candidate material.
  • the embodiment does not limit this.
  • the candidate materials in the candidate material set may be materials with better quality captured in the user's local album and/or cloud album, for example, in the candidate materials from the user
  • the method further includes: acquiring materials in the user's album, the album including local albums and/or associated cloud albums; Filter multiple materials in the album to form the user's candidate material set.
  • the associated cloud album can be understood as a cloud album associated with the account logged in by the user.
  • the album from which the user obtains materials may include the user's local album and/or the user's associated cloud album.
  • the album may only include the local album, that is, the electronic device Only the materials in the user's local album can be obtained; when the user is associated with a cloud album and the user has enabled the permission to access the cloud album, the album can include the local album and the associated cloud album, that is, the electronic device can obtain the information in the user's local album.
  • the electronic device may first acquire all the materials in the user's album, and perform aesthetic evaluation on all the materials according to preset aesthetic evaluation rules or using a pre-trained aesthetic evaluation model to obtain aesthetic scores of all the materials; then, select the aesthetics.
  • Materials with scores greater than the preset score threshold are selected as candidate materials, or, in descending order of aesthetic scores, a preset number (such as 30 or 50, etc.) of materials are selected as candidate materials to form the user's candidate material set .
  • the timing at which the electronic device acquires the materials in the user's album and constructs the candidate material set according to the aesthetic scores of all the materials can be flexibly set as required.
  • the electronic device can perform the above operation when receiving the first trigger operation ; It can also be executed at a preset time (such as 10s) after the application software to which the target music belongs is completed, and the application software to which the target music belongs is completed, or when it is detected that the user turns on the camera of the electronic device through the application software to which the target music belongs. operate.
  • a first trigger operation for generating a video by using target music is received, and in response to the first trigger operation, based on the first music feature information of the target music, a set of candidate materials of the user that matches the target music is selected.
  • the candidate material is used as the target candidate material, and according to the target candidate material, the target video with the target music as the background music is generated.
  • FIG. 5 is a schematic flowchart of another video generation method provided by an embodiment of the present disclosure.
  • the solution in this embodiment can be combined with one or more optional solutions in the above-mentioned embodiments.
  • selecting the candidate material matching the target music from the user's candidate material set based on the first music feature information of the target music as the target candidate material including: obtaining the background music of each video template. the second music feature information, and select a video template that matches the second music feature information and the first music feature information of the target music as the target video template; The correlation between the template topics of the target video template determines the target candidate material.
  • the target video template includes a plurality of material slots for filling materials
  • the correlation between the material theme of each candidate material in the user's candidate material set and the template theme of the target video template is After the target candidate material is determined, the method further includes: sequentially acquiring target candidate materials that meet the filling conditions of each material slot of the target video template, as the material to be filled in the material slot, and storing the material to be filled Filling the material slot; the generating a target video according to the target candidate material includes: generating a target video according to a target video template after filling the material to be filled.
  • the method further includes: displaying a preview page, and displaying the material to be synthesized in the target video template on the preview page, where The material to be synthesized includes the material to be filled.
  • the method further includes: receiving a second trigger operation acting on a template control in the preview page; in response to the second trigger operation, displaying a template window, in the template window.
  • the video generation method provided by this embodiment includes:
  • S201 Receive a first trigger operation for generating a video by using target music.
  • S202 in response to the first trigger operation, obtain second music feature information of the background music of each video template, and select a video template whose second music feature information matches the first music feature information of the target music, as target video template.
  • a video template can be understood as a template for making a video.
  • the second music feature information may be music feature information of background music that needs to be added to the video template, such as rhythm information and/or music style information of the background music.
  • the electronic device when it receives the first trigger operation, it can switch the currently displayed page from the music details page to the loading page, as shown in FIG. 6 , obtain the music feature information of the background music of each video template, and select The video template in which the music feature information of the background music matches the music feature information of the target music is used as the target video template.
  • the loading page can display the current loading progress and the cancel control 60, so that the user can instruct the electronic device to stop performing subsequent operations by triggering the cancel control 60, and the current display page is displayed. Switch back to the music details page of the target music from the loading page.
  • the determination method of the target video template can be set as required. For example, when the electronic device determines the target video template, it can directly use the preset matching degree calculation rule or use the pre-trained matching degree calculation model to calculate each video template in the template library.
  • the degree of matching between the second music feature information of the background music and the first music feature information of the target music, and select the video template with the highest degree of matching as the target video template also can first judge the user and/or the user who performs the first trigger operation Or among the videos posted by other users, whether there is a video with the target music as the background music and the video template is used when generating the video, if there is a video with the target music as the background music and the video template is used when generating the video, then Obtain the video template used by this video as the target video template, if there is no video with the target music as the background music and the video template is used when generating the video, then calculate the second of the background music of each video template in the template library.
  • the matching degree between the music feature information and the first music feature information of the target music, and the video template with the highest matching degree is selected as the target video template.
  • a video template can be randomly selected as the target video
  • the personalized information of the user such as the video style of the video published and/or played by the user, etc., select the video template that is most consistent with the user's personalized information as the target video template.
  • a preset number (such as 5) can also be selected according to the matching degree between the music feature information of the video using the target music as the background music or the background music of each template and the music feature information of the target music.
  • the video template most consistent with the user's personalized information is selected as the target video template. Therefore, the generated target video is more in line with the user's personalized needs, and the user's experience is improved.
  • S203 Determine the target candidate material according to the correlation between the material theme of each candidate material in the user's candidate material set and the template theme of the target video template.
  • the material theme can be understood as the theme of the material, such as the theme of the candidate material in the candidate material set, which can be determined according to the screen content of the material;
  • the template theme can be understood as the theme of the video template, such as the theme of the target video template, a material/video template
  • the theme can be plants, buildings, landscapes, people, travel or food, and so on.
  • a material whose material theme in the candidate material set is similar to the template theme of the target video template (for example, the matching degree is greater than the preset matching degree threshold) can be selected as the target candidate material;
  • the candidate material set is divided into multiple sub-candidate material sets with different material themes, so that after the target video template is determined, the sub-candidate material set with the material theme similar to the template theme of the target video template can be selected.
  • the candidate material in the candidate material set is used as the target candidate material.
  • target candidate materials all candidate materials with a theme similar to the template theme of the target video template may be determined as target candidate materials; you may also select a preset with a theme similar to the template theme of the target video template
  • the number of candidate materials (such as 20 or 50, etc.) is used as the target candidate material, which is not limited in this embodiment.
  • the filling condition may include the aspect ratio of the material to be filled (eg, 16:9, etc.), the type of the material (eg, video or picture, etc.), and/or the maximum duration of the material, and the like.
  • the electronic device can select a material slot of the target video template as the current material operation, such as selecting the first material slot of the target video template as the current material.
  • Material slot select a target candidate material that meets the aspect ratio requirements of the current material slot from the target candidate materials that have not yet been determined as the material to be filled, as the material to be filled in the current material slot, and the material to be filled. Fill into the current material slot, determine the next material slot of the target candidate material as the current material slot, and return to perform the operation from the target candidate material that has not been determined as the material to be filled until there is no next material until the slot.
  • the target candidate material when there is only one target candidate material that meets the filling conditions of the current material slot, the target candidate material can be determined as the material to be filled in the material slot; when there are multiple target candidate materials that meet the current material slot
  • a target candidate material of a video type may be randomly selected from the plurality of target candidate materials as the material to be filled in the current material slot, so as to improve the quality of the generated target video.
  • a target candidate material may be randomly selected as the material to be filled in the current material slot.
  • the material to be filled is a target candidate material of a video type whose duration is longer than the maximum duration of the material that can be filled in the current material slot
  • the The key video whose duration matches the maximum duration is captured from the middle, and the captured key video is filled into the current material slot.
  • the electronic device can stop loading, switch the currently displayed page from the loading page to the music details page of the target music, and prompt the user to load Failure, such as prompting the user "Failed to complete the video, please try again later" or "Insufficient material, please retry after replenishing", etc., so that the user can clarify the reason for the failure of video generation.
  • the material to be synthesized can be understood as the material synthesized by the user into the target video in the target video template (such as video or picture), which may include the material to be filled filled in each material slot in the target video template, and may also include transition material.
  • the transition material can be carried in the target video template, or generated by the electronic device after filling the material to be filled into the target video template according to the adjacent material to be filled and filled between the adjacent materials to be filled.
  • the currently displayed page can be switched from the loading page to the preview page, as shown in FIG.
  • Display the materials to be synthesized in the target video template for example, display all the materials to be synthesized in sequence according to the order of all the materials to be synthesized in the target video template, or switch the materials to be synthesized currently displayed in the preview page according to the user's switching operation. , so that the user can specify the material used to generate the target video and the arrangement order of all the materials.
  • the second trigger operation may be an operation for triggering the template control displayed in the preview page, such as a click operation acting on the template control.
  • the template information of the video template may be related information of the video template, such as the template cover and/or the template name of the video template.
  • the template window can be used to display a template list, which can include all video templates in the template library; it can also only include the second music feature information of the background music and the first music feature information of the target music.
  • the matching degree is greater than the set value.
  • a template for determining a matching degree threshold which is not limited in this embodiment.
  • the preview page may be provided with a template control 70 for the user to instruct the electronic device to display the template window; thus, when the user wants to change the target video template used when generating the target video, the user can Click the template control 70; correspondingly, when the electronic device detects that the user clicks the template control 70, it may determine that a second trigger operation is received, and in response to the second trigger operation, the template window 80 is displayed, and the template window 80 is displayed in the template window 80.
  • the template information of each video template is displayed in the template window, for example, the template cover and/or template name of each video template are displayed in the template window, as shown in FIG. 8 .
  • each video template may be displayed in the template window 80 randomly or in descending order of the matching degree between the second music feature information of the background music and the first music feature information of the target music.
  • the third trigger operation can be understood as an operation acting on the display area of the template information of a video template in the template window, such as an operation of clicking the template cover of a video template displayed in the template window.
  • the electronic device detects that the user clicks on the video template displayed in the video template.
  • a template cover of a video template it can be determined that a third trigger operation is received, and the target video template is switched to the video template, according to the difference between the material theme of each candidate material in the user's candidate material set and the template theme of the video template.
  • the correlation determines the target candidate template of the video template, sequentially obtains the target candidate material of the filler that matches each material slot of the video template, and uses it as the material to be filled in the material slot.
  • the material is filled into the corresponding material slot, and the material to be synthesized displayed on the preview page is updated to the material to be synthesized in the video template, so that the user can view the updated material to be synthesized.
  • the electronic device receives the third trigger operation, in response to the trigger operation, it can also display the download prompt window 81 to remind the user that the template is currently being downloaded; at this time, correspondingly, the updated template is displayed on the preview page.
  • the prompt window 81 may be stopped to avoid hindering the user from viewing the updated material to be synthesized.
  • the first save control 82 and the first cancel control 83 can be displayed in the preview page, so that when it is detected that the user clicks the first save control 82, the first save control 82 can be saved.
  • the template window 80 can be closed, as shown in FIG. 7 ; when it is detected that the user clicks the first cancel control 83, the target video template after the switch and the material to be synthesized can be switched. The previous target video template and the material to be synthesized are returned, and the template window 80 can be closed, as shown in FIG. 7 .
  • a material control 71 may also be displayed on the preview page, and introduction information 72 for the function of the material control 71 may be displayed when the user enters the preview page for the first time.
  • the user wants to replace the material to be filled in the target video template, he can click the material control 71 .
  • the electronic device detects that the user clicks on the material control 71, it can display the material window 90, and display the material information of the user's material, candidate material or target candidate material in the material window 90, such as the material cover, as shown in FIG. 9 .
  • the second save control 91 and the second cancel control 92 may also be displayed in the preview page.
  • the electronic device detects that the user clicks on a material displayed in the material window 90
  • the cover of the material if the material is a picture, the material displayed on the preview page at the current moment can be replaced with the picture; if the material is a video, the video material editing window 100 can be displayed, as shown in FIG. 10 , for the user to select A segment in the video is used as the material, and when it is detected that the user clicks on the determination control 102 in the video material editing window 100, the material displayed in the preview page at the current moment is replaced with the segment selected by the user, and when the user clicks on the click is detected.
  • the control 101 in the video material editing window 100 is canceled, the segment selected by the user is canceled, and when it is detected that the user clicks the replacement material control 103 in the video material editing window 100, the video material editing window 100 is closed and the material window 90 is displayed. , so that users can change the material.
  • the user wants to save the modification to the material he can click the second save control 91; correspondingly, when the electronic device detects that the user clicks the second save control 91, the modified material to be filled can be saved, and the material window 90 can be closed. .
  • the user wants to cancel the modification of the material he can click the second cancel control 92; correspondingly, when the electronic device detects that the user clicks the second cancel control 92, the modified material to be synthesized can be switched back to the previous material to be synthesized. , and the material window 90 can be closed, as shown in FIG. 7 .
  • the material window 90 may also display a text editing control 93 for the user to edit the text carried in the target template.
  • the electronic device can synchronously fill the text carried in the target video template and need to be displayed in the video into the corresponding picture or video frame, and monitor the user's click.
  • a text editing window is displayed for the user to edit the text added in the picture or video frame and carried in the target video template.
  • the electronic device can automatically generate the target video according to the filled target video template.
  • the generating the target video according to the target video template after filling the material to be filled includes: when a fourth trigger operation acting on the video synthesis control in the preview page is received, using the target music as a background For music, according to the sequence of all the materials to be synthesized in the target video template, all the materials to be synthesized are synthesized into the target video.
  • the electronic device displays a video synthesis control 73 on the preview page; the user can click the video synthesis control 73 when synthesizing the material to be synthesized in the target video template into the target video.
  • the electronic device detects that the user clicks on the video synthesis control 73, it can determine that the fourth trigger operation is received, and in response to the fourth trigger operation, the target music is used as the background music, and all the materials to be synthesized are in the target video template according to the target music.
  • All the materials to be synthesized are synthesized into the target video, and the currently displayed page can be switched from the preview page to the video editing page, as shown in Figure 11, so that the user can edit the synthesized target video, such as changing the target video.
  • the background music of the video, adding text, stickers, special effects and/or filters to the target video, or performing image enhancement on the target video, etc., and the target video can be released when it is detected that the user clicks the daily control 110 in the video editing page.
  • the target video is set to be visible only within a preset number of days (such as 1 day or 1 week); when it is detected that the user clicks the next control 111 in the video editing page, the currently displayed page is switched from the video editing page to the publishing page to For users to edit the title, posting position and visible range of the target video, and publish the target video or save the target video to the draft box.
  • a material to be synthesized is a video type material, during synthesis, the audio data of the material to be synthesized can be deleted or the material to be synthesized can be muted, so as to avoid the sound other than the target music from appearing in the obtained target video A condition that appears or disappears suddenly.
  • the preview page can also display a first return control 74; thus, when the user wants to return to the music details page of the target music, he can click the first return control 74; correspondingly, the electronic device is in the When it is detected that the user clicks on the first return control 74, a first save prompt window (not shown in FIG. 7 ) can be displayed to prompt the user that the current effect will be lost after returning, and when it is detected that the user clicks the first save prompt window When the first confirmation control is displayed, the currently displayed page is switched from the preview page to the music details page of the target music. As shown in FIG.
  • a second return control 112 may also be displayed on the video editing page; thus, when the user wants to return to the preview page, he may click the second return control 112; correspondingly, the electronic device detects that the user clicks the first return control Second, when returning to the control 112, a second save prompt window (not shown in FIG. 11) can be displayed to remind the user that the content edited in the video editing page will not be saved, and when it is detected that the user clicks the second save When the second confirmation control in the window is prompted, the currently displayed page is switched from the video editing page to the preview page.
  • a target video template is selected according to the music characteristics of the target video
  • target candidate materials are determined according to the theme of the target video template
  • the materials filled in the target video template are synchronously updated , not only can automatically generate a video according to the music selected by the user, but also can improve the quality of the generated target video, and make the generated target video more in line with the user's wishes, thereby improving the user's use experience.
  • FIG. 12 is a structural block diagram of a video generating apparatus according to an embodiment of the present disclosure.
  • the apparatus may be implemented by software and/or hardware, and may be configured in an electronic device.
  • the apparatus may be configured in a mobile phone or a tablet computer, and a video may be generated by executing a video generation method.
  • the video generating apparatus provided in this embodiment may include: a first receiving module 1201 and a video generating module 1202, wherein:
  • the first receiving module 1201 is configured to receive a first trigger operation for generating a video using target music; the video generation module 1202 is configured to respond to the first trigger operation, based on the first music feature information of the target music, from the user.
  • the candidate material set of select the candidate material that matches the target music as the target candidate material, and generate the target video according to the target candidate material, and the target video uses the target music as the background music.
  • the first receiving module 1201 receives a first trigger operation for generating a video by using the target music
  • the video generation module 1202 responds to the first trigger operation based on the first music feature information of the target music
  • a candidate material matching the target music is selected from the user's candidate material set as the target candidate material
  • a target video with the target music as the background music is generated according to the target candidate material.
  • the video generation module 1202 may include: a template acquisition unit, configured to acquire the second music feature information of the background music of each video template, and select the second music feature information and the target music.
  • the video template matched with the first music feature information is used as the target video template;
  • the material determination unit is set to determine the target according to the correlation between the material theme of each candidate material in the user's candidate material set and the template theme of the target video template candidate material.
  • the target video template may include a plurality of material slots for filling materials
  • the video generation module 1202 may further include: a material filling unit, which is set to each of the user-based candidate material sets The correlation between the material theme of the candidate material and the template theme of the target video template After determining the target candidate material, sequentially acquiring the target candidate material that meets the filling conditions of each material slot of the target video template, as the target candidate material The material to be filled in the material slot, and the to-be-filled material is filled into the material slot; the video generation unit is configured to generate a target video according to the target video template filled with the to-be-filled material.
  • the video generation module 1202 may further include: a material display unit, configured to display a preview page after filling the material to be filled into the material slot, and display a preview page on the preview page
  • a material display unit configured to display a preview page after filling the material to be filled into the material slot, and display a preview page on the preview page
  • the to-be-synthesized material in the target video template is displayed in the target video template, and the to-be-combined material includes the plurality of to-be-filled materials.
  • the video generation module 1202 may further include: a first receiving unit, configured to receive a second trigger operation acting on the template control in the preview page after the preview page is displayed; the template display unit is configured to display a template window in response to the second trigger operation, in which template information of a plurality of video templates is displayed; the second receiving unit is configured to receive template information acting on the template window displayed in the template window the third trigger operation; the material update unit is configured to, in response to the third trigger operation, switch the target video template to the video template to which the template information corresponding to the third trigger operation belongs, and update the target video template The material to be composited in the video template.
  • a first receiving unit configured to receive a second trigger operation acting on the template control in the preview page after the preview page is displayed
  • the template display unit is configured to display a template window in response to the second trigger operation, in which template information of a plurality of video templates is displayed
  • the second receiving unit is configured to receive template information acting on the template window displayed in the template window the third trigger operation
  • the material update unit
  • the video generation unit may be set to: when receiving the fourth trigger operation acting on the video synthesis control in the preview page, use the target music as the background music, according to the target video template In the sequence of all the materials to be synthesized, the all the materials to be synthesized are synthesized into the target video.
  • the video generation apparatus may further include: a material acquisition module, configured to acquire a material in the user's album before selecting the candidate material matching the target music from the user's candidate material set as the target candidate material materials, the albums include local albums and/or associated cloud albums; the material screening module is configured to filter multiple materials from the albums according to the aesthetic scores of the materials to form a user's candidate material set.
  • a material acquisition module configured to acquire a material in the user's album before selecting the candidate material matching the target music from the user's candidate material set as the target candidate material materials, the albums include local albums and/or associated cloud albums;
  • the material screening module is configured to filter multiple materials from the albums according to the aesthetic scores of the materials to form a user's candidate material set.
  • the video generation apparatus provided by the embodiment of the present disclosure can execute the video generation method provided by any embodiment of the present disclosure, and has corresponding functional modules and effects for executing the video generation method.
  • the video generation method provided by any embodiment of the present disclosure can execute the video generation method provided by any embodiment of the present disclosure, and has corresponding functional modules and effects for executing the video generation method.
  • Terminal devices in the embodiments of the present disclosure may include, but are not limited to, such as mobile phones, notebook computers, digital broadcast receivers, personal digital assistants (Personal Digital Assistants, PDAs), tablet computers (PADs), and portable multimedia players (Portable Media Players). , PMP), in-vehicle terminals (eg, in-vehicle navigation terminals), etc., and stationary terminals such as digital (Television, TV), desktop computers, and the like.
  • PMP Personal Digital Assistants
  • PDAs Personal Digital Assistants
  • PADs tablet computers
  • portable multimedia players Portable Media Players
  • PMP Personal Digital Assistants
  • in-vehicle terminals eg, in-vehicle navigation terminals
  • stationary terminals such as digital (Television, TV), desktop computers, and the like.
  • the electronic device shown in FIG. 13 is only an example, and should not impose any limitation on the function and scope of use of the embodiments of the present disclosure.
  • the electronic device 1300 may include a processing device (eg, a central processing unit, a graphics processor, etc.) 1301, which may be based on a program stored in a read-only memory (Read-Only Memory, ROM) 1302 or from a storage device 1308 programs loaded into Random Access Memory (RAM) 1303 to perform various appropriate actions and processes.
  • a processing device eg, a central processing unit, a graphics processor, etc.
  • RAM Random Access Memory
  • various programs and data required for the operation of the electronic device 1300 are also stored.
  • the processing device 1301, the ROM 1302, and the RAM 1303 are connected to each other through a bus 1304.
  • An Input/Output (I/O) interface 1305 is also connected to the bus 1304 .
  • I/O interface 1305 the following devices can be connected to the I/O interface 1305: input devices 1306 including, for example, a touch screen, touch pad, keyboard, mouse, camera, microphone, accelerometer, gyroscope, etc.; including, for example, a Liquid Crystal Display (LCD) Output devices 1307, speakers, vibrators, etc.; storage devices 1308, including, for example, tapes, hard disks, etc.; and communication devices 1309. Communication means 1309 may allow electronic device 1300 to communicate wirelessly or by wire with other devices to exchange data.
  • FIG. 13 shows an electronic device 1300 having various means, it is not required to implement or have all of the illustrated means. More or fewer devices may alternatively be implemented or provided.
  • embodiments of the present disclosure include a computer program product comprising a computer program carried on a non-transitory computer readable medium, the computer program containing program code for performing the method illustrated in the flowchart.
  • the computer program may be downloaded and installed from the network via the communication device 1309, or from the storage device 1308, or from the ROM 1302.
  • the processing device 1301 the above-mentioned functions defined in the methods of the embodiments of the present disclosure are executed.
  • the computer-readable medium described above in the present disclosure may be a computer-readable signal medium or a computer-readable storage medium, or any combination of the two.
  • the computer-readable storage medium can be, for example, but not limited to, an electrical, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus or device, or a combination of any of the above.
  • Examples of computer-readable storage media may include, but are not limited to, electrical connections with one or more wires, portable computer disks, hard disks, RAM, ROM, Erasable Programmable Read-Only Memory (EPROM) or flash memory), optical fiber, portable compact disk read-only memory (Compact Disc Read-Only Memory, CD-ROM), optical storage device, magnetic storage device, or any suitable combination of the above.
  • a computer-readable storage medium may be any tangible medium that contains or stores a program that can be used by or in conjunction with an instruction execution system, apparatus, or device.
  • a computer-readable signal medium may include a data signal propagated in baseband or as part of a carrier wave with computer-readable program code embodied thereon. Such propagated data signals may take a variety of forms, including but not limited to electromagnetic signals, optical signals, or any suitable combination of the foregoing.
  • a computer-readable signal medium can also be any computer-readable medium other than a computer-readable storage medium that can transmit, propagate, or transport the program for use by or in connection with the instruction execution system, apparatus, or device .
  • the program code embodied on the computer-readable medium may be transmitted by any suitable medium, including but not limited to: electric wire, optical fiber cable, radio frequency (RF), etc., or any suitable combination of the above.
  • clients and servers can communicate using any currently known or future developed network protocol, such as HyperText Transfer Protocol (HTTP), and can communicate with digital data in any form or medium.
  • Communication eg, a communication network
  • Examples of communication networks include Local Area Networks (LANs), Wide Area Networks (WANs), the Internet (eg, the Internet), and peer-to-peer networks (eg, ad hoc peer-to-peer networks), as well as any currently Known or future developed networks.
  • LANs Local Area Networks
  • WANs Wide Area Networks
  • the Internet eg, the Internet
  • peer-to-peer networks eg, ad hoc peer-to-peer networks
  • the above-mentioned computer-readable medium may be included in the above-mentioned electronic device; or may exist alone without being assembled into the electronic device.
  • the above-mentioned computer-readable medium carries one or more programs, and when the above-mentioned one or more programs are executed by the electronic device, the electronic device: receives a first trigger operation for generating a video using target music; in response to the first Triggering operation, based on the first music feature information of the target music, select a candidate material matching the target music from the user's candidate material set as the target candidate material, and generate a target video according to the target candidate material, the target The video has the target music as background music.
  • Computer program code for performing operations of the present disclosure may be written in one or more programming languages, including but not limited to object-oriented programming languages—such as Java, Smalltalk, C++, and This includes conventional procedural programming languages - such as the "C" language or similar programming languages.
  • the program code may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer, or entirely on the remote computer or server.
  • the remote computer may be connected to the user's computer through any kind of network, including a LAN or WAN, or may be connected to an external computer (eg, using an Internet service provider to connect through the Internet).
  • each block in the flowchart or block diagrams may represent a module, segment, or portion of code that contains one or more logical functions for implementing the specified functions executable instructions.
  • the functions noted in the blocks may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved.
  • each block of the block diagrams and/or flowchart illustrations, and combinations of blocks in the block diagrams and/or flowchart illustrations can be implemented in dedicated hardware-based systems that perform the specified functions or operations , or can be implemented in a combination of dedicated hardware and computer instructions.
  • the units involved in the embodiments of the present disclosure may be implemented in a software manner, and may also be implemented in a hardware manner.
  • the name of the module does not constitute a limitation on the unit itself.
  • exemplary types of hardware logic components include: Field Programmable Gate Arrays (FPGAs), Application Specific Integrated Circuits (ASICs), Application Specific Standard Products (Application Specific Standard Products) Standard Parts, ASSP), system on chip (System on Chip, SOC), complex programmable logic device (Complex Programmable Logic Device, CPLD) and so on.
  • FPGAs Field Programmable Gate Arrays
  • ASICs Application Specific Integrated Circuits
  • ASSP Application Specific Standard Products
  • SOC System on Chip
  • complex programmable logic device Complex Programmable Logic Device, CPLD
  • a machine-readable medium may be a tangible medium that may contain or store a program for use by or in connection with the instruction execution system, apparatus or device.
  • the machine-readable medium may be a machine-readable signal medium or a machine-readable storage medium.
  • Machine-readable media may include, but are not limited to, electronic, magnetic, optical, electromagnetic, infrared, or semiconductor systems, devices, or devices, or any suitable combination of the foregoing. Examples of machine-readable storage media would include one or more wire-based electrical connections, portable computer disks, hard disks, RAM, ROM, EPROM or flash memory, optical fibers, CD-ROMs, optical storage devices, magnetic storage devices, or Any suitable combination of the above.
  • Example 1 provides a video generation method, comprising:
  • the target video uses the target music as background music.
  • Example 2 According to the method described in Example 1, based on the first music feature information of the target music, the candidate material matching the target music is selected from the user's candidate material set The material is used as the target candidate material, including:
  • the target candidate material is determined according to the correlation between the material topic of each candidate material in the user's candidate material set and the template topic of the target video template.
  • Example 3 According to the method of Example 2, the target video template includes a plurality of material slots for filling materials, and each of the candidate material sets according to the user is each After the target candidate material is determined by the correlation between the material subject of each candidate material and the template subject of the target video template, the method further includes:
  • the generating a target video according to the target candidate material includes:
  • the target video is generated according to the target video template after filling the material to be filled.
  • Example 4 in Example 4, according to the method of Example 3, after the material to be filled is filled into the material slot, the method further includes:
  • a preview page is displayed, and the material to be synthesized in the target video template is displayed on the preview page, and the material to be synthesized includes the material to be filled.
  • Example 5 according to the method of Example 4, after the displaying the preview page, further includes:
  • a template window is displayed, and template information of a plurality of video templates is displayed in the template window;
  • the target video template is switched to the video template to which the template information corresponding to the third trigger operation belongs, and the material to be synthesized in the target video template is updated.
  • Example 6 According to the method of Example 4 or 5, the generating the target video according to the target video template after filling the material to be filled includes:
  • the target music is used as the background music, and according to the sequence of all the materials to be synthesized in the target video template, the The material is synthesized into the target video.
  • Example 7 before the candidate material matching the target music is selected from the user's candidate material set as the target candidate material, also includes:
  • the album including local albums and/or associated cloud albums;
  • a plurality of materials are selected from the album according to the aesthetic scores of the materials to form a candidate material set of the user.
  • Example 8 provides a video generation apparatus, comprising:
  • a first receiving module configured to receive a first trigger operation that uses target music to generate a video
  • the video generation module is configured to, in response to the first trigger operation, select a candidate material that matches the target music from the user's candidate material set based on the first music feature information of the target music as the target candidate material, and based on the first music feature information of the target music
  • the target candidate material generates a target video, and the target video uses the target music as background music.
  • Example 9 provides an electronic device, comprising:
  • processors one or more processors
  • memory arranged to store one or more programs
  • the one or more programs when executed by the one or more processors, cause the one or more processors to implement the video generation method as described in any of Examples 1-7.
  • Example 10 provides a computer-readable storage medium having stored thereon a computer program that, when executed by a processor, implements the video as described in any of Examples 1-7 generate method.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Theoretical Computer Science (AREA)
  • Library & Information Science (AREA)
  • Databases & Information Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Human Computer Interaction (AREA)
  • Studio Circuits (AREA)
  • Television Signal Processing For Recording (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)

Abstract

本文公开了一种视频生成方法、装置、电子设备和存储介质。视频生成方法包括:接收采用目标音乐生成视频的第一触发操作(S101);响应于第一触发操作,基于目标音乐的第一音乐特征信息,自用户的候选素材集中选取与目标音乐匹配的候选素材作为目标候选素材,并依据目标候选素材生成目标视频,目标视频以目标音乐作为背景音乐(S102)。

Description

视频生成方法、装置、电子设备和存储介质
本申请要求在2021年01月15日提交中国专利局、申请号为202110057581.2的中国专利申请的优先权,该申请的全部内容通过引用结合在本申请中。
技术领域
本公开涉及计算机技术领域,例如涉及一种视频生成方法、装置、电子设备和存储介质。
背景技术
短视频应用软件一般会为用户提供制作与发布短视频的功能。
用户在制作视频时,通常需要首先手动拍摄或选取视频素材,设置多个视频素材的排列顺序,并为所选择的视频素材设置合适的背景音乐,然后通过短视频应用软件将所选择的素材和背景音乐合成为视频。
但是,在相关技术中,短视频的创作门槛较高,且需要耗费较长的时间才能制作完成,导致用户的创作体验不佳。
发明内容
本公开提供一种视频生成方法、装置、电子设备和存储介质,以降低短视频的创作门槛,并缩短制作短视频需要花费的时间。
本公开提供了一种视频生成方法,包括:
接收采用目标音乐生成视频的第一触发操作;
响应于所述第一触发操作,基于所述目标音乐的第一音乐特征信息,自用户的候选素材集中选取与所述目标音乐匹配的候选素材作为目标候选素材,并依据所述目标候选素材生成目标视频,其中,所述目标视频以所述目标音乐作为背景音乐。
本公开还提供了一种视频生成装置,包括:
第一接收模块,设置为接收采用目标音乐生成视频的第一触发操作;
视频生成模块,设置为响应于所述第一触发操作,基于所述目标音乐的第一音乐特征信息,自用户的候选素材集中选取与所述目标音乐匹配的候选素材作为目标候选素材,并依据所述目标候选素材生成目标视频,其中,所述目标视频以所述目标音乐作为背景音乐。
本公开还提供了一种电子设备,包括:
一个或多个处理器;
存储器,设置为存储一个或多个程序;
当所述一个或多个程序被所述一个或多个处理器执行,使得所述一个或多个处理器实现上述的视频生成方法。
本公开还提供了一种计算机可读存储介质,其上存储有计算机程序,该程序被处理器执行时实现上述的视频生成方法。
附图说明
图1为本公开实施例提供的一种视频生成方法的流程示意图;
图2为本公开实施例提供的一种音乐详情页面示意图;
图3为本公开实施例提供的一种素材通知窗口示意图;
图4为本公开实施例提供的一种权限开启窗口示意图;
图5为本公开实施例提供的另一种视频生成方法的流程示意图;
图6为本公开实施例提供的一种加载页面示意图;
图7为本公开实施例提供的一种预览页面示意图;
图8为本公开实施例提供的一种模板窗口示意图;
图9为本公开实施例提供的一种素材窗口示意图;
图10为本公开实施例提供的一种素材编辑窗口示意图;
图11为本公开实施例提供的一种视频编辑页面示意图;
图12为本公开实施例提供的一种视频生成装置的结构框图;
图13为本公开实施例提供的一种电子设备的结构示意图。
具体实施方式
下面将参照附图描述本公开的实施例。虽然附图中显示了本公开的一些实施例,然而,本公开可以通过多种形式来实现,而且不应该被解释为限于这里阐述的实施例,提供这些实施例是为了更加透彻和完整地理解本公开。本公开的附图及实施例仅用于示例性作用。
本公开的方法实施方式中记载的多个步骤可以按照不同的顺序执行,和/或并行执行。此外,方法实施方式可以包括附加的步骤和/或省略执行示出的步骤。 本公开的范围在此方面不受限制。
本文使用的术语“包括”及其变形是开放性包括,即“包括但不限于”。术语“基于”是“至少部分地基于”。术语“一个实施例”表示“至少一个实施例”;术语“另一实施例”表示“至少一个另外的实施例”;术语“一些实施例”表示“至少一些实施例”。其他术语的相关定义将在下文描述中给出。
本公开中提及的“第一”、“第二”等概念仅用于对不同的装置、模块或单元进行区分,并非用于限定这些装置、模块或单元所执行的功能的顺序或者相互依存关系。
本公开中提及的“一个”、“多个”的修饰是示意性而非限制性的,本领域技术人员应当理解,除非在上下文另有指出,否则应该理解为“一个或多个”。
本公开实施方式中的多个装置之间所交互的消息或者信息的名称仅用于说明性的目的,而并不是用于对这些消息或信息的范围进行限制。
图1为本公开实施例提供的一种视频生成方法的流程示意图。该方法可以由视频生成装置执行,其中,该装置可以由软件和/或硬件实现,可配置于电子设备中,例如,该装置可以配置在手机或平板电脑中。本公开实施例提供的视频生成方法适用于自动生成视频的场景,尤其适用于自动生成音乐视频的场景。如图1所示,本实施例提供的视频生成方法可以包括:
S101、接收采用目标音乐生成视频的第一触发操作。
第一触发操作可以理解为用于指示电子设备自动采用一音乐生成视频的触发操作,如点击视频生成控件的触发操作等,该视频生成控件可以在音乐列表页面、音乐详情页面、音乐播放页面和/或视频播放页面等页面中进行显示。相应的,目标音乐可以为用户欲以其为背景音乐制作视频的音乐,如用户执行第一触发操作时音乐列表页面中处于选中状态的音乐、音乐详情页面所对应的音乐、音乐播放页面中当前所播放的音乐或者视频播放页面中当前播放的视频所采用的背景音乐等。以下以第一触发操作为作用于音乐详情页面中的视频生成控件的触发操作为例进行说明。
在本实施例中,如图2所示,一音乐的音乐详情页面中可以显示有该音乐的音乐信息(如该音乐的音乐封面20、音乐名称、发布者用户名、播放次数等)和视频生成控件21,以便于用户通过触发该视频生成控件21指示电子设备自动生成以该音乐为背景音乐的视频,如生成以该音乐为背景音乐的音乐视频;还可以设置有播放/暂停控件22、收藏控件23和/或拍同款控件24,从而,用户可以通过点击播放/暂停控件22指示电子设备播放/暂停播放该音乐,通过点击收 藏控件23收藏该音乐,和/或,通过点击拍同款控件24指示电子设备将当前显示页面由该音乐的音乐详情页面切换为视频拍摄页面并开启摄像头,以便于用户拍摄以该音乐为背景音乐的视频。此外,当用户首次进入音乐详情页面时,音乐详情页面中还可以显示有针对视频生成控件21的提示信息25,如用于进行提示的动画、图片和/或文字等,以使得用户能够明确视频生成控件21的作用。
示例性的,电子设备展示一音乐的音乐详情页面;用户在欲指示电子设备采用该音乐自动生成视频时,可以点击该音乐详情页面中所显示的视频生成控件;相应的,电子设备在监测到用户点击音乐详情页面中的视频生成控件时,可以将该音乐详情页面所对应的音乐确定为目标音乐,并确定接收到采用目标音乐生成视频的第一触发操作。
当用户首次点击视频生成控件时和/或当用户未开启访问用户的素材的权限时,电子设备还可以显示相应的窗口,以告知用户会读取用户的素材和/或提示用户开启访问其素材的权限。此时,相应的,第一触发操作还可以为触发所展示的窗口中的对应控件的操作。
例如,如图2所示,电子设备显示一音乐的音乐详情页面,并可以在监测到用户点击该音乐详情页面中的视频生成控件21时,判断该用户是否首次点击视频生成控件21,若用户首次点击,则显示素材通知窗口30,如图3所示,以通过该素材通知窗口30告知用户自动生成视频时会读取用户的素材,在监测到用户点击素材通知窗口30中的关闭控件31时,停止显示该素材通知窗口30,并在监测到用户点击素材通知窗口30中的开始体验控件32时,判断用户是否开启了访问其素材的权限;相应的,若用户非首次点击,则可以直接判断用户是否开启了访问其素材的权限。从而,当判定用户已开启访问其素材的权限时,可以确定接收到第一触发操作;当判定用户未开启访问其素材的权限时,可以显示权限开启窗口40,如图4所示,以提示用户开启访问其素材的权限,并在监测到用户点击该权限开启窗口40内的允许控件41时,开启访问用户素材的权限,并确定接收到第一触发操作,在监测到用户点击该权限开启窗口40内的不允许控42件时,停止显示权限提示窗口40。
S102、响应于所述第一触发操作,基于所述目标音乐的第一音乐特征信息,自用户的候选素材集中选取与所述目标音乐匹配的候选素材作为目标候选素材,并依据所述目标候选素材生成目标视频,所述目标视频以所述目标音乐作为背景音乐。
第一音乐特征信息可以为目标音乐的音乐特征信息,如目标音乐的歌词信息、节奏信息和/或音乐风格信息等。目标视频可以理解为电子设备采用目标音乐生成的视频。候选素材可以理解为可供电子设备生成目标视频的素材,其可 以为用户本地相册和/或云相册中的全部素材,也可以为用户本地相册和/或云相册中的部分素材,如可以为用户本地相册和/或云相册中拍摄质量较高的素材,该素材可以包括图片和/或视频等,一实施方式中,该素材包括图片和视频,其可以由用户预先通过拍摄、下载、导入、截屏或录制得到。
电子设备在接收到第一触发操作时,获取用户的候选素材集,基于目标音乐的第一音乐特征信息,依据预先设置的选取规则自该候选素材集中选取与目标音乐匹配的候选素材作为目标候选素材,并将所选取的所有目标候选素材合成为目标视频,从而,用户仅需选取目标音乐即可自动生成目标视频,例如,当用户在观看一视频时,如果比较喜欢该视频中的背景音乐,只需要将当前显示页面切换为该背景音乐的视频详情页面并点击该视频详情页面内的视频生成控件即可生成以该背景音乐作为背景音乐的视频,无需用户挑选视频模板与视频素材,进而能够降低制作视频的难度,并缩短用户制作视频所花费的时间。在此,目标候选素材的选取规则可以由开发人员根据需要设置,如可以依据目标音乐的第一音乐特征信息,选取画面风格与该第一音乐特征信息相似的候选素材作为目标候选素材;或者,将背景音乐的第二音乐特征信息与目标音乐的第一音乐特征信息匹配的视频模板,作为目标视频模板,并选取素材主题与该目标视频模板的模板主体相似的候选素材作为目标候选素材,本实施例不对此进行限制。
在一个实施方式中,为了提高所生成的目标视频的质量,候选素材集中的候选素材可以为用户本地相册和/或云相册中拍摄质量较好的素材,例如,在所述自用户的候选素材集中选取与所述目标音乐匹配的候选素材作为目标候选素材之前,还包括:获取用户的相册中的素材,所述相册包括本地相册和/或关联的云相册;依据素材的美学评分自所述相册中筛选多个素材,形成用户的候选素材集。
关联的云相册可以理解为与用户所登陆的账号关联的云相册。用户获取素材的相册可以包括用户的本地相册和/或用户关联的云相册,当用户不具有关联的云相册或者用户未开启访问云相册的权限时,该相册可以仅包括本地相册,即电子设备可以仅获取用户本地相册中的素材;当用户关联有云相册且用户开启了访问云相册的权限时,该相册可以包括本地相册和关联的云相册,即电子设备可以获取用户的本地相册中的素材以及用户关联的云相册中的素材。
示例性的,电子设备可以首先获取用户相册中的所有素材,并依据预先设置的美学评估规则或者采用预先训练的美学评估模型对所有素材进行美学评估,得到所有素材的美学评分;然后,选取美学评分大于预设分数阈值的素材作为候选素材,或者,按照美学评分由大到小的顺序,选取预设数量(如30个 或50个等)的素材作为候选素材,以构成用户的候选素材集。
在上述实施方式中,电子设备获取用户相册中的素材并依据所有素材的美学评分构建候选素材集的时机可以根据需要灵活设置,例如,电子设备可以在接收到第一触发操作时,执行上述操作;也可以在目标音乐所属的应用软件启动完成、目标音乐所属的应用软件启动完成后的预设时间(如10s)或者在监测到用户通过目标音乐所属的应用软件开启电子设备的摄像头时执行上述操作。
本实施例提供的视频生成方法,接收采用目标音乐生成视频的第一触发操作,响应于该第一触发操作,基于目标音乐的第一音乐特征信息,自用户的候选素材集中选取与目标音乐匹配的候选素材作为目标候选素材,并依据该目标候选素材生成以目标音乐作为背景音乐的目标视频。本实施例通过采用上述技术方案,可以自动根据用户所选择的音乐,利用与该音乐匹配的素材生成视频,能够降低视频的制作门槛,缩短用户制作视频所耗费的时间,并能够提高所生成的视频与用户所选择的音乐的匹配度,提升所生成视频的质量。
图5为本公开实施例提供的另一种视频生成方法的流程示意图。本实施例中的方案可以与上述实施例中的一个或多个可选方案组合。可选的,所述基于所述目标音乐的第一音乐特征信息,自用户的候选素材集中选取与所述目标音乐匹配的候选素材作为目标候选素材,包括:获取每个视频模板的背景音乐的第二音乐特征信息,并选取第二音乐特征信息与所述目标音乐的第一音乐特征信息匹配的视频模板,作为目标视频模板;根据用户的候选素材集中每个候选素材的素材主题与所述目标视频模板的模板主题之间的相关性确定目标候选素材。
可选的,所述目标视频模板包含多个用于填充素材的素材槽位,在所述根据用户的候选素材集中每个候选素材的素材主题与所述目标视频模板的模板主题之间的相关性确定目标候选素材之后,还包括:依次获取符合所述目标视频模板的每个素材槽位的填充条件的目标候选素材,作为所述素材槽位的待填充素材,并将所述待填充素材填充至所述素材槽位中;所述依据所述目标候选素材生成目标视频,包括:依据填充所述待填充素材后的目标视频模板生成目标视频。
可选的,在所述将所述待填充素材填充至所述素材槽位中之后,还包括:显示预览页面,并在所述预览页面中展示所述目标视频模板中的待合成素材,所述待合成素材包括所述待填充素材。可选的,在所述显示预览页面之后,还包括:接收作用于所述预览页面中的模板控件的第二触发操作;响应于所述第二触发操作,显示模板窗口,所述模板窗口中显示有多个视频模板的模板信息; 接收作用于所述模板窗口内显示的模板信息的第三触发操作;响应于所述第三触发操作,将所述目标视频模板切换为与所述第三触发操作对应的模板信息所属的视频模板,并更新所述目标视频模板中的待合成素材。
相应的,如图5所示,本实施例提供的视频生成方法包括:
S201、接收采用目标音乐生成视频的第一触发操作。
S202、响应于所述第一触发操作,获取每个视频模板的背景音乐的第二音乐特征信息,并选取第二音乐特征信息与所述目标音乐的第一音乐特征信息匹配的视频模板,作为目标视频模板。
视频模板可以为理解为用于制作视频的模板。第二音乐特征信息可以为视频模板所需添加的背景音乐的音乐特征信息,如背景音乐的节奏信息和/或音乐风格信息等。
示例性的,电子设备在接收到第一触发操作时,可以将当前显示页面由音乐详情页面切换为加载页面,如图6所示,获取每个视频模板的背景音乐的音乐特征信息,并选取背景音乐的音乐特征信息与目标音乐的音乐特征信息匹配的视频模板,作为目标视频模板。
在本实施例中,如图6所示,加载页面中可以显示有当前的加载进度以及取消控件60,从而,用户可以通过触发该取消控件60指示电子设备停止执行后续操作,并将当前显示页面由加载页面切换回目标音乐的音乐详情页面。目标视频模板的确定方式可以根据需要设置,如电子设备在确定目标视频模板时,可以直接采用预先设置的匹配度计算规则或者采用预先训练的匹配度计算模型,计算模板库中的每个视频模板的背景音乐的第二音乐特征信息与目标音乐的第一音乐特征信息之间的匹配度,并选取匹配度最高的视频模板作为目标视频模板;也可以首先判断执行第一触发操作的用户和/或其他用户已发布的视频中,是否存在以目标音乐作为背景音乐且在生成视频时使用了视频模板的视频,若存在以目标音乐作为背景音乐且在生成视频时使用了视频模板的视频,则获取该视频所使用的视频模板作为目标视频模板,若不存在以目标音乐作为背景音乐且在生成视频时使用了视频模板的视频,则计算模板库中的每个视频模板的背景音乐的第二音乐特征信息与目标音乐的第一音乐特征信息之间的匹配度,并选取匹配度最高的视频模板作为目标视频模板。
此外,当存在多个以目标背景音乐作为背景音乐的视频所使用的模板时,或者,当存在多个匹配度最高的视频模板时,可以随机选取一个视频模板作为目标视频模板,也可以依据用户的个性化信息,如用户发布和/或播放的视频的视频风格等,选取与用户的个性化信息最为相符的视频模板作为目标视频模板。 或者,在确定目标视频模板时,也可以依据采用目标音乐作为背景音乐的视频或者每个模板的背景音乐的音乐特征信息与目标音乐的音乐特征信息之间的匹配度选取预设数量(如5个或10个等)的视频模板,并自该预设数量的视频模板中选取与用户的个性化信息最为相符的视频模板作为目标视频模板。从而,使得所生成的目标视频更为符合用户的个性化需求,提高用户的体验。
S203、根据用户的候选素材集中每个候选素材的素材主题与所述目标视频模板的模板主题之间的相关性确定目标候选素材。
素材主题可以理解为素材的主题,如候选素材集中的候选素材的主题,其可以根据素材的画面内容确定;模板主题可以理解为视频模板的主题,如目标视频模板的主题,一素材/视频模板的主题例如可以为植物、建筑、风景、人物、旅游或美食,等等。
在本实施例中,可以依据目标视频模板的模板主题,选取候选素材集中素材主题与目标视频模板的模板主题相似(如匹配度大于预设匹配度阈值))的素材作为目标候选素材;也可以预先依据每个候选素材的素材主题,将候选素材集划分为多个素材主题不同的子候选素材集,从而,在确定目标视频模板后,可以选取素材主题与目标视频模板的模板主题相似的子候选素材集中的候选素材作为目标候选素材。在此,在选取目标候选素材时,可以将素材主题与目标视频模板的模板主题相似的所有候选素材均确定为目标候选素材;也可以选取将素材主题与目标视频模板的模板主题相似的预设数量(如20个或50个等)的候选素材作为目标候选素材,本实施例不对此进行限制。
S204、依次获取符合所述目标视频模板的每个素材槽位的填充条件的目标候选素材,作为所述素材槽位的待填充素材,并将所述待填充素材填充至所述素材槽位中,其中,所述目标视频模板包含多个用于填充素材的素材槽位。
该填充条件可以包括所需填充素材的长宽比(如16:9等)、素材类型(如视频或图片等)和/或素材的最大时长等。
以填充条件为素材的长宽比为例,电子设备在确定目标候选素材后,可以选取目标视频模板的一个素材槽位作为当前素材操作,如选取目标视频模板的第一个素材槽位作为当前素材槽位,自尚未被确定为待填充素材的目标候选素材中,选取符合当前素材槽位的长宽比要求的一个目标候选素材,作为当前素材槽位的待填充素材,将该待填充素材填充到当前素材槽位中,将目标候选素材的下一素材槽位确定为当前素材槽位,并返回执行自尚未被确定为待填充素材的目标候选素材中的操作,直至不存在下一素材槽位为止。
在本步骤中,当仅存在一个符合当前素材槽位的填充条件的目标候选素材 时,可以将该目标候选素材确定为该素材槽位的待填充素材;当存在多个符合当前素材槽位的填充条件的目标候选素材时,可以自该多个目标候选素材中随机选取一个视频类型的目标候选素材作为当前素材槽位的待填充素材,以提高所生成的目标视频的质量,当该多个目标候选素材中不存在视频类型的目标候选素材时,可以随机选取一个目标候选素材作为当前素材槽位的待填充素材。此外,若待填充素材为时长大于当前素材槽位所能够填充的素材的最大时长的视频类型的目标候选素材,在将该待填充素材填充至当前素材槽位中时,可以自该待填充素材中截取时长符合该最大时长的关键视频,并将所截取的该关键视频填充到当前素材槽位中。
当加载失败时,如当由于网络原因导致目标视频模板无法正常加载,或者,由于素材主题与目标视频模板的模板主题相似的素材的数量过少(如少于目标视频模板中所包含的素材槽位的数量)或者不存在符合一个或多个素材槽位的填充条件的目标候选素材时,电子设备可以停止加载,将当前显示页面由加载页面切换回目标音乐的音乐详情页面,并提示用户加载失败,如提示用户“成片失败,请稍后重试”或者“素材不足,请补充后重试”等,以便于用户明确视频生成失败的原因。
S205、显示预览页面,并在所述预览页面中展示所述目标视频模板中的待合成素材,所述待合成素材包括所述待填充素材。
待合成素材可以理解为目标视频模板中用户合成为目标视频的素材,(如视频或图片),其可以包括目标视频模板中每个素材槽位所填充的待填充素材,还可以包括转场素材,该转场素材可以携带于目标视频模板中,也可以由电子设备在将待填充素材填充至目标视频模板中之后根据相邻的待填充素材生成并填充于此相邻待填充素材之间。
示例性的,在目标视频模板以及目标视频模板的每个素材槽位对应的素材均加载完成后,可以将当前显示页面由加载页面切换为预览页面,如图7所示,并在该预览页面中展示目标视频模板中的待合成素材,如按照所有待合成素材在目标视频模板中的顺序,依次展示所有待合成素材,或者依据用户的切换操作,切换预览页面中当前所展示的待合成素材,以便于用户明确被用于生成目标视频的素材以及所有素材的排列顺序。
S206、接收作用于所述预览页面中的模板控件的第二触发操作。
S207、响应于所述第二触发操作,显示模板窗口,所述模板窗口中显示有多个视频模板的模板信息。
第二触发操作可以为用于触发预览页面中显示的模板控件的操作,如作用 于该模板控件的点击操作等。视频模板的模板信息可以为该视频模板的相关信息,如视频模板的模板封面和/或模板名称等。模板窗口可用于显示模板列表,该模板列表中可以包含模板库中的所有视频模板;也可以仅包含背景音乐的第二音乐特征信息与目标音乐的第一音乐特征信息之间的匹配度大于设定匹配度阈值的模板,本实施例不对此进行限制。
在本实施例中,如图7所示,预览页面中可以设置有供用户指示电子设备显示模板窗口的模板控件70;从而,用户在欲更改生成目标视频时所使用的目标视频模板时,可以点击该模板控件70;相应的,电子设备在监测到用户点击该模板控件70时,可以确定接收到第二触发操作,响应于该第二触发操作,显示模板窗口80,并在该模板窗口80中显示每个视频模板的模板信息,如在模板窗口内显示每个视频模板的模板封面和/或模板名称,如图8所示。在此,每个视频模板可以随机或按照背景音乐的第二音乐特征信息与目标音乐的第一音乐特征信息之间的匹配度由大到小的顺序在模板窗口80中进行展示。
S208、接收作用于所述模板窗口内显示的模板信息的第三触发操作。
S209、响应于所述第三触发操作,将所述目标视频模板切换为与所述第三触发操作对应的模板信息所属的视频模板,并更新所述目标视频模板中的待合成素材。
第三触发操作可以理解为作用于模板窗口内一视频模板的模板信息的显示区域内的操作,如点击模板窗口内显示的一视频模板的模板封面的操作等。
如图8所示,用户在欲将目标视频模板切换为其他视频模板时,可以点击模板窗口80内所显示的该视频模板的封面;相应的,电子设备在监测到用户点击视频模板内显示的一视频模板的模板封面时,可以确定接收到第三触发操作,将目标视频模板切换为该视频模板,根据用户的候选素材集中每个候选素材的素材主题与该视频模板的模板主题之间的相关性确定该视频模板的目标候选模板,依次获取符合该视频模板的每个素材槽位的填充件的目标候选素材,作为该素材槽位的待填充素材,将每个素材槽位的待填充素材填充到对应的素材槽位中,并将预览页面中所展示的待合成素材更新为该视频模板中的待合成素材,以便于用户查看更新后的待合成素材。并且,电子设备在接收到第三触发操作时,响应于该触发操作,还可以显示下载提示窗口81,以提示用户当前正在下载模板;此时,相应的,在在预览页面中展示更新后的待合成素材时,可以停止显示该提示窗口81,以避免阻碍用户查看更新后的待合成素材。
此外,请继续参考图8,当切换目标视频模板后,可以在预览页面中显示第一保存控件82和第一取消控件83,从而,当监测到用户点击该第一保存控件82时,可以保存切换后的目标视频模板和待合成素材,并可以关闭模板窗口80, 如图7所示;当监测到用户点击该第一取消控件83时,可以将切换后的目标视频模板和待合成素材切换回之前的目标视频模板和待合成素材,并可以关闭模板窗口80,如图7所示。
在一个实施方式中,请参考图7,预览页面中还可以显示有素材控件71,并可以在用户首次进入预览页面时显示对素材控件71的功能的介绍信息72。用户在欲更换目标视频模板中所填充的待填充素材时,可以点击该素材控件71。相应的,电子设备在监测到用户点击素材控件71时,可以显示素材窗口90,并在该素材窗口90中显示用户的素材、候选素材或目标候选素材的素材信息,如素材封面,如图9所示;还可以在预览页面中显示第二保存控件91和第二取消控件92。
从而,用户在欲将预览页面中当前显示的素材替换为其他素材时,可以点击素材窗口90中所显示的该其他素材的封面;电子设备在监测到用户点击素材窗口90中显示的一素材的素材封面时,若该素材为图片,则可以将当前时刻预览页面显示的素材替换为该图片,若该素材为视频,则可以显示视频素材编辑窗口100,如图10所示,以供用户选择该视频中的一片段作为素材,并在监测到用户点击视频素材编辑窗口100中的确定控件102时,将当前时刻在预览页面中显示的素材替换为用户所选择的片段,在监测到用户点击视频素材编辑窗口100中的取消控件101时,取消用户所选择的片段,以及,在监测到用户点击视频素材编辑窗口100中的更换素材控件103时,关闭视频素材编辑窗口100并显示素材窗口90,以供用户更换素材。
用户在欲保存对素材的修改时,可以点击第二保存控件91;相应的,电子设备在监测到用户点击第二保存控件91时,可以保存修改后的待填充素材,并可以关闭素材窗口90。用户在欲取消对素材的修改时,可以点击第二取消控件92;相应的,电子设备在监测到用户点击第二取消控件92时,可以将修改后的待合成素材切换回之前的待合成素材,并可以关闭素材窗口90,如图7所示。
此外,素材窗口90中还可以显示有供用户对目标模板中所携带的文字进行编辑的文字编辑控件93。电子设备可以在将待填充素材填充至目标视频模板中时,同步将目标视频模板中所携带的、需要在视频中进行显示的文字填充到相应的图片或视频帧中,并在监测到用户点击素材窗口90中的文字编辑控件93时,显示文字编辑窗口,以供用户对图片或视频帧中所添加的、目标视频模板中所携带的文字进行编辑。
S210、依据填充所述待填充素材后的目标视频模板生成目标视频。
示例性的,电子设备可以在目标视频模板的每个素材槽位的待填充素材填充完成后,自动根据填充后的目标视频模板生成目标视频,如按照所有待填充 素材/待合成素材在目标视频模板中的顺序,将所有待填充素材/待合成素材合成为目标视频;也可以在目标视频模板的所有素材槽位的待填充素材均填充完成后,显示预览页面,以供用户查看与修改目标视频模板以及目标视频模板中的待填充素材/待合成素材,并在监测到用户触发预览页面中的视频合成控件时,将目标视频模板中的所有待填充素材/待合成素材合成为目标视频,例如,所述依据填充所述待填充素材后的目标视频模板生成目标视频,包括:当接收到作用于所述预览页面中的视频合成控件的第四触发操作时,采用所述目标音乐作为背景音乐,依据所述目标视频模板中所有待合成素材的顺序,将所述所有待合成素材合成为目标视频。
示例性的,请继续参考图7,电子设备在预览页面中显示视频合成控件73;用户在欲将目标视频模板中的待合成素材合成为目标视频时,可以点击该视频合成控件73。相应的,电子设备在监测到用户点击视频合成控件73时,可以确定接收到第四触发操作,并响应于该第四触发操作,以目标音乐作为背景音乐,按照所有待合成素材在目标视频模板中的顺序,将所有待合成素材合成为目标视频,并可以将当前显示页面由预览页面切换为视频编辑页面,如图11所示,以供用户对所合成的目标视频进行编辑,如更换目标视频的背景音乐、为目标视频添加文字、贴纸、特效和/或滤镜或者对目标视频进行画面增强等,并可以在监测到用户点击视频编辑页面中的发日常控件110时发布目标视频并将目标视频设置为仅在预设天数(如1天或1周)内可见;在监测到用户点击视频编辑页面中的下一步控件111时,将当前显示页面由视频编辑页面切换为发布页面,以供用户编辑目标视频的标题、发布位置和可见范围,并发布目标视频或将目标视频保存至草稿箱内。其中,若一待合成素材为视频类型的素材,则在合成时,可以删除该待合成素材的音频数据或者将该待合成素材静音,以避免所得到的目标视频出现除目标音乐之外的声音突然出现或消失的情况。
此外,如图7所示,预览页面中还可以显示有第一返回控件74;从而,用户在欲返回目标音乐的音乐详情页面时,可以点击该第一返回控件74;相应的,电子设备在监测到用户点击第一返回控件74时,可以显示第一保存提示窗口(图7中未示出),以提示用户返回后将丢失当前的效果,并在监测到用户点击第一保存提示窗口内的第一确认控件时,将当前显示页面由预览页面切换为目标音乐的音乐详情页面。如图11所示,视频编辑页面中还可以显示有第二返回控件112;从而,用户在欲返回预览页面时,可以点击该第二返回控件112;相应的,电子设备在监测到用户点击第二返回控件112时,可以显示第二保存提示窗口(图11中未示出),以提示用户其在视频编辑页面内所编辑的内容将不会被保存,并在监测到用户点击第二保存提示窗口内的第二确认控件时,将当前显示页面由视频编辑页面切换为预览页面。
本实施例提供的视频生成方法,依据目标视频的音乐特征选取目标视频模板,依据目标视频模板的主题确定目标候选素材,并在用户更改目标视频模板后,同步更新目标视频模板内所填充的素材,不仅能够依据用户所选择的音乐自动生成视频,还能够提高所生成的目标视频的质量,并使得所生成的目标视频更加符合用户的意愿,从而提高用户的使用体验。
图12为本公开实施例提供的一种视频生成装置的结构框图。该装置可以由软件和/或硬件实现,可配置于电子设备中,例如,该装置可以配置在手机或平板电脑中,可通过执行视频生成方法生成视频。如图12所示,本实施例提供的视频生成装置可以包括:第一接收模块1201和视频生成模块1202,其中,
第一接收模块1201,设置为接收采用目标音乐生成视频的第一触发操作;视频生成模块1202,设置为响应于所述第一触发操作,基于所述目标音乐的第一音乐特征信息,自用户的候选素材集中选取与所述目标音乐匹配的候选素材作为目标候选素材,并依据所述目标候选素材生成目标视频,所述目标视频以所述目标音乐作为背景音乐。
本实施例提供的视频生成装置,通过第一接收模块1201接收采用目标音乐生成视频的第一触发操作,通过视频生成模块1202响应于该第一触发操作,基于目标音乐的第一音乐特征信息,自用户的候选素材集中选取与目标音乐匹配的候选素材作为目标候选素材,并依据该目标候选素材生成以目标音乐作为背景音乐的目标视频。本实施例通过采用上述技术方案,可以自动根据用户所选择的音乐,利用与该音乐匹配的素材生成视频,能够降低视频的制作门槛,缩短用户制作视频所耗费的时间,并能够提高所生成的视频与用户所选择的音乐的匹配度,提升所生成视频的质量。
在上述方案中,所述视频生成模块1202可以包括:模板获取单元,设置为获取每个视频模板的背景音乐的第二音乐特征信息,并选取所述第二音乐特征信息与所述目标音乐的第一音乐特征信息匹配的视频模板,作为目标视频模板;素材确定单元,设置为根据用户的候选素材集中每个候选素材的素材主题与所述目标视频模板的模板主题之间的相关性确定目标候选素材。
在上述方案中,所述目标视频模板可以包含多个用于填充素材的素材槽位,所述视频生成模块1202还可以包括:素材填充单元,设置为在所述根据用户的候选素材集中每个候选素材的素材主题与所述目标视频模板的模板主题之间的相关性确定目标候选素材之后,依次获取符合所述目标视频模板的每个素材槽位的填充条件的目标候选素材,作为所述素材槽位的待填充素材,并将所述待填充素材填充至所述素材槽位中;视频生成单元,设置为依据填充所述待填充 素材后的目标视频模板生成目标视频。
在上述方案中,所述视频生成模块1202还可以包括:素材展示单元,设置为在所述将所述待填充素材填充至所述素材槽位中之后,显示预览页面,并在所述预览页面中展示所述目标视频模板中的待合成素材,所述待合成素材包括所述多个待填充素材。
在上述方案中,所述视频生成模块1202还可以包括:第一接收单元,设置为在所述显示预览页面之后,接收作用于所述预览页面中的模板控件的第二触发操作;模板显示单元,设置为响应于所述第二触发操作,显示模板窗口,所述模板窗口中显示有多个视频模板的模板信息;第二接收单元,设置为接收作用于所述模板窗口内显示的模板信息的第三触发操作;素材更新单元,设置为响应于所述第三触发操作,将所述目标视频模板切换为与所述第三触发操作对应的模板信息所属的视频模板,并更新所述目标视频模板中的待合成素材。
在上述方案中,所述视频生成单元可以设置为:当接收到作用于所述预览页面中的视频合成控件的第四触发操作时,采用所述目标音乐作为背景音乐,依据所述目标视频模板中所有待合成素材的顺序,将所述所有待合成素材合成为目标视频。
本实施例提供的视频生成装置还可以包括:素材获取模块,设置为,在所述自用户的候选素材集中选取与所述目标音乐匹配的候选素材作为目标候选素材之前,获取用户的相册中的素材,所述相册包括本地相册和/或关联的云相册;素材筛选模块,设置为依据素材的美学评分自所述相册中筛选多个素材,形成用户的候选素材集。
本公开实施例提供的视频生成装置可执行本公开任意实施例提供的视频生成方法,具备执行视频生成方法相应的功能模块和效果。未在本实施例中详尽描述的技术细节,可参见本公开任意实施例所提供的视频生成方法。
下面参考图13,其示出了适于用来实现本公开实施例的电子设备(例如终端设备)1300的结构示意图。本公开实施例中的终端设备可以包括但不限于诸如移动电话、笔记本电脑、数字广播接收器、个人数字助理(Personal Digital Assistant,PDA)、平板电脑(PAD)、便携式多媒体播放器(Portable Media Player,PMP)、车载终端(例如车载导航终端)等等的移动终端以及诸如数字(Television,TV)、台式计算机等等的固定终端。图13示出的电子设备仅仅是一个示例,不应对本公开实施例的功能和使用范围带来任何限制。
如图13所示,电子设备1300可以包括处理装置(例如中央处理器、图形 处理器等)1301,其可以根据存储在只读存储器(Read-Only Memory,ROM)1302中的程序或者从存储装置1308加载到随机访问存储器(Random Access Memory,RAM)1303中的程序而执行多种适当的动作和处理。在RAM 1303中,还存储有电子设备1300操作所需的多种程序和数据。处理装置1301、ROM 1302以及RAM 1303通过总线1304彼此相连。输入/输出(Input/Output,I/O)接口1305也连接至总线1304。
通常,以下装置可以连接至I/O接口1305:包括例如触摸屏、触摸板、键盘、鼠标、摄像头、麦克风、加速度计、陀螺仪等的输入装置1306;包括例如液晶显示器(Liquid Crystal Display,LCD)、扬声器、振动器等的输出装置1307;包括例如磁带、硬盘等的存储装置1308;以及通信装置1309。通信装置1309可以允许电子设备1300与其他设备进行无线或有线通信以交换数据。虽然图13示出了具有多种装置的电子设备1300,并不要求实施或具备所有示出的装置。可以替代地实施或具备更多或更少的装置。
根据本公开的实施例,上文参考流程图描述的过程可以被实现为计算机软件程序。例如,本公开的实施例包括一种计算机程序产品,其包括承载在非暂态计算机可读介质上的计算机程序,该计算机程序包含用于执行流程图所示的方法的程序代码。在这样的实施例中,该计算机程序可以通过通信装置1309从网络上被下载和安装,或者从存储装置1308被安装,或者从ROM 1302被安装。在该计算机程序被处理装置1301执行时,执行本公开实施例的方法中限定的上述功能。
本公开上述的计算机可读介质可以是计算机可读信号介质或者计算机可读存储介质或者是上述两者的任意组合。计算机可读存储介质例如可以是——但不限于——电、磁、光、电磁、红外线、或半导体的系统、装置或器件,或者任意以上的组合。计算机可读存储介质的例子可以包括但不限于:具有一个或多个导线的电连接、便携式计算机磁盘、硬盘、RAM、ROM、可擦式可编程只读存储器(Erasable Programmable Read-Only Memory,EPROM或闪存)、光纤、便携式紧凑磁盘只读存储器(Compact Disc Read-Only Memory,CD-ROM)、光存储器件、磁存储器件、或者上述的任意合适的组合。在本公开中,计算机可读存储介质可以是任何包含或存储程序的有形介质,该程序可以被指令执行系统、装置或者器件使用或者与其结合使用。而在本公开中,计算机可读信号介质可以包括在基带中或者作为载波一部分传播的数据信号,其中承载了计算机可读的程序代码。这种传播的数据信号可以采用多种形式,包括但不限于电磁信号、光信号或上述的任意合适的组合。计算机可读信号介质还可以是计算机可读存储介质以外的任何计算机可读介质,该计算机可读信号介质可以发送、传播或者传输用于由指令执行系统、装置或者器件使用或者与其结合使用的程 序。计算机可读介质上包含的程序代码可以用任何适当的介质传输,包括但不限于:电线、光缆、射频(Radio Frequency,RF)等等,或者上述的任意合适的组合。
在一些实施方式中,客户端、服务器可以利用诸如超文本传输协议(HyperText Transfer Protocol,HTTP)之类的任何当前已知或未来研发的网络协议进行通信,并且可以与任意形式或介质的数字数据通信(例如,通信网络)互连。通信网络的示例包括局域网(Local Area Network,LAN),广域网(Wide Area Network,WAN),网际网(例如,互联网)以及端对端网络(例如,ad hoc端对端网络),以及任何当前已知或未来研发的网络。
上述计算机可读介质可以是上述电子设备中所包含的;也可以是单独存在,而未装配入该电子设备中。
上述计算机可读介质承载有一个或者多个程序,当上述一个或者多个程序被该电子设备执行时,使得该电子设备:接收采用目标音乐生成视频的第一触发操作;响应于所述第一触发操作,基于所述目标音乐的第一音乐特征信息,自用户的候选素材集中选取与所述目标音乐匹配的候选素材作为目标候选素材,并依据所述目标候选素材生成目标视频,所述目标视频以所述目标音乐作为背景音乐。
可以以一种或多种程序设计语言或其组合来编写用于执行本公开的操作的计算机程序代码,上述程序设计语言包括但不限于面向对象的程序设计语言—诸如Java、Smalltalk、C++,还包括常规的过程式程序设计语言—诸如“C”语言或类似的程序设计语言。程序代码可以完全地在用户计算机上执行、部分地在用户计算机上执行、作为一个独立的软件包执行、部分在用户计算机上部分在远程计算机上执行、或者完全在远程计算机或服务器上执行。在涉及远程计算机的情形中,远程计算机可以通过任意种类的网络——包括LAN或WAN—连接到用户计算机,或者,可以连接到外部计算机(例如利用因特网服务提供商来通过因特网连接)。
附图中的流程图和框图,图示了按照本公开多种实施例的系统、方法和计算机程序产品的可能实现的体系架构、功能和操作。在这点上,流程图或框图中的每个方框可以代表一个模块、程序段、或代码的一部分,该模块、程序段、或代码的一部分包含一个或多个用于实现规定的逻辑功能的可执行指令。也应当注意,在有些作为替换的实现中,方框中所标注的功能也可以以不同于附图中所标注的顺序发生。例如,两个接连地表示的方框实际上可以基本并行地执行,它们有时也可以按相反的顺序执行,这依所涉及的功能而定。也要注意的是,框图和/或流程图中的每个方框、以及框图和/或流程图中的方框的组合,可 以用执行规定的功能或操作的专用的基于硬件的系统来实现,或者可以用专用硬件与计算机指令的组合来实现。
描述于本公开实施例中所涉及到的单元可以通过软件的方式实现,也可以通过硬件的方式来实现。其中,模块的名称在一种情况下并不构成对该单元本身的限定。
本文中以上描述的功能可以至少部分地由一个或多个硬件逻辑部件来执行。例如,非限制性地,可以使用的示范类型的硬件逻辑部件包括:现场可编程门阵列(Field Programmable Gate Array,FPGA)、专用集成电路(Application Specific Integrated Circuit,ASIC)、专用标准产品(Application Specific Standard Parts,ASSP)、片上系统(System on Chip,SOC)、复杂可编程逻辑设备(Complex Programmable Logic Device,CPLD)等等。
在本公开的上下文中,机器可读介质可以是有形的介质,其可以包含或存储以供指令执行系统、装置或设备使用或与指令执行系统、装置或设备结合地使用的程序。机器可读介质可以是机器可读信号介质或机器可读储存介质。机器可读介质可以包括但不限于电子的、磁性的、光学的、电磁的、红外的、或半导体系统、装置或设备,或者上述内容的任何合适组合。机器可读存储介质的示例会包括基于一个或多个线的电气连接、便携式计算机盘、硬盘、RAM、ROM、EPROM或快闪存储器、光纤、CD-ROM、光学储存设备、磁储存设备、或上述内容的任何合适组合。
根据本公开的一个或多个实施例,示例1提供了一种视频生成方法,包括:
接收采用目标音乐生成视频的第一触发操作;
响应于所述第一触发操作,基于所述目标音乐的第一音乐特征信息,自用户的候选素材集中选取与所述目标音乐匹配的候选素材作为目标候选素材,并依据所述目标候选素材生成目标视频,所述目标视频以所述目标音乐作为背景音乐。
根据本公开的一个或多个实施例,示例2根据示例1所述的方法,所述基于所述目标音乐的第一音乐特征信息,自用户的候选素材集中选取与所述目标音乐匹配的候选素材作为目标候选素材,包括:
获取每个视频模板的背景音乐的第二音乐特征信息,并选取第二音乐特征信息与所述目标音乐的第一音乐特征信息匹配的视频模板,作为目标视频模板;
根据所述用户的候选素材集中每个候选素材的素材主题与所述目标视频模板的模板主题之间的相关性确定目标候选素材。
根据本公开的一个或多个实施例,示例3根据示例2所述的方法,所述目标视频模板包含多个用于填充素材的素材槽位,在所述根据所述用户的候选素材集中每个候选素材的素材主题与所述目标视频模板的模板主题之间的相关性确定目标候选素材之后,还包括:
依次获取符合所述目标视频模板的每个素材槽位的填充条件的目标候选素材,作为所述素材槽位的待填充素材,并将所述待填充素材填充至所述素材槽位中;
所述依据所述目标候选素材生成目标视频,包括:
依据填充所述待填充素材后的目标视频模板生成所述目标视频。
根据本公开的一个或多个实施例,示例4根据示例3所述的方法,在所述将所述待填充素材填充至所述素材槽位中之后,还包括:
显示预览页面,并在所述预览页面中展示所述目标视频模板中的待合成素材,所述待合成素材包括所述待填充素材。
根据本公开的一个或多个实施例,示例5根据示例4所述的方法,在所述显示预览页面之后,还包括:
接收作用于所述预览页面中的模板控件的第二触发操作;
响应于所述第二触发操作,显示模板窗口,所述模板窗口中显示有多个视频模板的模板信息;
接收作用于所述模板窗口内显示的模板信息的第三触发操作;
响应于所述第三触发操作,将所述目标视频模板切换为与所述第三触发操作对应的模板信息所属的视频模板,并更新所述目标视频模板中的待合成素材。
根据本公开的一个或多个实施例,示例6根据示例4或5所述的方法,所述依据填充所述待填充素材后的目标视频模板生成所述目标视频,包括:
当接收到作用于所述预览页面中的视频合成控件的第四触发操作时,采用所述目标音乐作为背景音乐,依据所述目标视频模板中所有待合成素材的顺序,将所述所有待合成素材合成为所述目标视频。
根据本公开的一个或多个实施例,示例7根据示例1-5任一所述的方法,在所述自用户的候选素材集中选取与所述目标音乐匹配的候选素材作为目标候选素材之前,还包括:
获取用户的相册中的素材,所述相册包括本地相册和/或关联的云相册;
依据素材的美学评分自所述相册中筛选多个素材,形成所述用户的候选素 材集。
根据本公开的一个或多个实施例,示例8提供了一种视频生成装置,包括:
第一接收模块,设置为接收采用目标音乐生成视频的第一触发操作;
视频生成模块,设置为响应于所述第一触发操作,基于所述目标音乐的第一音乐特征信息,自用户的候选素材集中选取与所述目标音乐匹配的候选素材作为目标候选素材,并依据所述目标候选素材生成目标视频,所述目标视频以所述目标音乐作为背景音乐。
根据本公开的一个或多个实施例,示例9提供了一种电子设备,包括:
一个或多个处理器;
存储器,设置为存储一个或多个程序;
当所述一个或多个程序被所述一个或多个处理器执行,使得所述一个或多个处理器实现如示例1-7中任一所述的视频生成方法。
根据本公开的一个或多个实施例,示例10提供了一种计算机可读存储介质,其上存储有计算机程序,该程序被处理器执行时实现如示例1-7中任一所述的视频生成方法。
此外,虽然采用特定次序描绘了多个操作,但是这不应当理解为要求这些操作以所示出的特定次序或以顺序次序执行来执行。在一定环境下,多任务和并行处理可能是有利的。同样地,虽然在上面论述中包含了多个实现细节,但是这些不应当被解释为对本公开的范围的限制。在单独的实施例的上下文中描述的一些特征还可以组合地实现在单个实施例中。相反地,在单个实施例的上下文中描述的多种特征也可以单独地或以任何合适的子组合的方式实现在多个实施例中。

Claims (10)

  1. 一种视频生成方法,包括:
    接收采用目标音乐生成视频的第一触发操作;
    响应于所述第一触发操作,基于所述目标音乐的第一音乐特征信息,自用户的候选素材集中选取与所述目标音乐匹配的候选素材作为目标候选素材,并依据所述目标候选素材生成目标视频,其中,所述目标视频以所述目标音乐作为背景音乐。
  2. 根据权利要求1所述的方法,其中,所述基于所述目标音乐的第一音乐特征信息,自用户的候选素材集中选取与所述目标音乐匹配的候选素材作为目标候选素材,包括:
    获取每个视频模板的背景音乐的第二音乐特征信息,并选取第二音乐特征信息与所述目标音乐的第一音乐特征信息匹配的视频模板,作为目标视频模板;
    根据所述用户的候选素材集中每个候选素材的素材主题与所述目标视频模板的模板主题之间的相关性确定目标候选素材。
  3. 根据权利要求2所述的方法,其中,所述目标视频模板包含多个用于填充素材的素材槽位,在所述根据所述用户的候选素材集中每个候选素材的素材主题与所述目标视频模板的模板主题之间的相关性确定目标候选素材之后,还包括:
    依次获取符合所述目标视频模板的每个素材槽位的填充条件的目标候选素材,作为所述素材槽位的待填充素材,并将所述待填充素材填充至所述素材槽位中;
    所述依据所述目标候选素材生成目标视频,包括:
    依据填充所述待填充素材后的目标视频模板生成所述目标视频。
  4. 根据权利要求3所述的方法,在所述将所述待填充素材填充至所述素材槽位中之后,还包括:
    显示预览页面,并在所述预览页面中展示所述目标视频模板中的待合成素材,其中,所述待合成素材包括所述待填充素材。
  5. 根据权利要求4所述的方法,在所述显示预览页面之后,还包括:
    接收作用于所述预览页面中的模板控件的第二触发操作;
    响应于所述第二触发操作,显示模板窗口,其中,所述模板窗口中显示有多个视频模板的模板信息;
    接收作用于所述模板窗口内显示的模板信息的第三触发操作;
    响应于所述第三触发操作,将所述目标视频模板切换为与所述第三触发操作对应的模板信息所属的视频模板,并更新所述目标视频模板中的待合成素材。
  6. 根据权利要求4或5所述的方法,其中,所述依据填充所述待填充素材后的目标视频模板生成所述目标视频,包括:
    在接收到作用于所述预览页面中的视频合成控件的第四触发操作的情况下,采用所述目标音乐作为背景音乐,依据所述目标视频模板中所有待合成素材的顺序,将所述所有待合成素材合成为所述目标视频。
  7. 根据权利要求1-5中任一项所述的方法,在所述自用户的候选素材集中选取与所述目标音乐匹配的候选素材作为目标候选素材之前,还包括:
    获取用户的相册中的素材,其中,所述相册包括本地相册和关联的云相册中的至少之一;
    依据素材的美学评分自所述相册中筛选多个素材,形成所述用户的候选素材集。
  8. 一种视频生成装置,包括:
    第一接收模块,设置为接收采用目标音乐生成视频的第一触发操作;
    视频生成模块,设置为响应于所述第一触发操作,基于所述目标音乐的第一音乐特征信息,自用户的候选素材集中选取与所述目标音乐匹配的候选素材作为目标候选素材,并依据所述目标候选素材生成目标视频,其中,所述目标视频以所述目标音乐作为背景音乐。
  9. 一种电子设备,其中,包括:
    至少一个处理器;
    存储器,设置为存储至少一个程序;
    当所述至少一个程序被所述至少一个处理器执行,使得所述至少一个处理器实现如权利要求1-7中任一项所述的视频生成方法。
  10. 一种计算机可读存储介质,存储有计算机程序,其中,所述程序被处理器执行时实现如权利要求1-7中任一项所述的视频生成方法。
PCT/CN2022/070919 2021-01-15 2022-01-10 视频生成方法、装置、电子设备和存储介质 Ceased WO2022152064A1 (zh)

Priority Applications (3)

Application Number Priority Date Filing Date Title
JP2023542890A JP7632816B2 (ja) 2021-01-15 2022-01-10 ビデオ生成方法、装置、電子機器および記憶媒体
EP22738945.9A EP4262221B1 (en) 2021-01-15 2022-01-10 Video generation method and apparatus, electronic device, and storage medium
US18/352,643 US12033671B2 (en) 2021-01-15 2023-07-14 Video generation method and apparatus, electronic device, and storage medium

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202110057581.2 2021-01-15
CN202110057581.2A CN112911379B (zh) 2021-01-15 2021-01-15 视频生成方法、装置、电子设备和存储介质

Related Child Applications (1)

Application Number Title Priority Date Filing Date
US18/352,643 Continuation US12033671B2 (en) 2021-01-15 2023-07-14 Video generation method and apparatus, electronic device, and storage medium

Publications (1)

Publication Number Publication Date
WO2022152064A1 true WO2022152064A1 (zh) 2022-07-21

Family

ID=76113861

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/070919 Ceased WO2022152064A1 (zh) 2021-01-15 2022-01-10 视频生成方法、装置、电子设备和存储介质

Country Status (5)

Country Link
US (1) US12033671B2 (zh)
EP (1) EP4262221B1 (zh)
JP (1) JP7632816B2 (zh)
CN (1) CN112911379B (zh)
WO (1) WO2022152064A1 (zh)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115730101A (zh) * 2022-11-23 2023-03-03 北京字跳网络技术有限公司 背景音乐确定方法、装置、计算机设备以及存储介质

Families Citing this family (26)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112911379B (zh) 2021-01-15 2023-06-27 北京字跳网络技术有限公司 视频生成方法、装置、电子设备和存储介质
CN115580749A (zh) 2021-06-17 2023-01-06 北京字跳网络技术有限公司 展示方法、装置及可读存储介质
CN113395588A (zh) 2021-06-23 2021-09-14 北京字跳网络技术有限公司 一种视频处理方法、装置、设备及存储介质
CN113542902B (zh) * 2021-07-13 2023-02-24 北京字跳网络技术有限公司 一种视频处理方法、装置、电子设备和存储介质
CN113590972A (zh) * 2021-08-16 2021-11-02 北京字节跳动网络技术有限公司 信息展示的方法、装置、计算机设备及存储介质
CN115883751A (zh) 2021-09-28 2023-03-31 北京字跳网络技术有限公司 一种视频生成方法、装置、设备及存储介质
CN114329223A (zh) * 2022-01-04 2022-04-12 北京字节跳动网络技术有限公司 媒体内容搜索方法、装置、设备及介质
CN116939323A (zh) * 2022-04-01 2023-10-24 腾讯科技(深圳)有限公司 音乐匹配方法、装置、电子设备和计算机可读存储介质
CN114896001A (zh) * 2022-04-08 2022-08-12 北京达佳互联信息技术有限公司 一种组件展示方法、装置、电子设备、介质及程序产品
CN114817631A (zh) 2022-04-24 2022-07-29 北京字跳网络技术有限公司 媒体内容的发布方法、装置、设备、存储介质和程序产品
CN117076707A (zh) * 2022-05-10 2023-11-17 北京字跳网络技术有限公司 视频剪辑模板搜索方法、装置、电子设备及存储介质
CN114979495B (zh) * 2022-06-28 2024-04-12 北京字跳网络技术有限公司 用于内容拍摄的方法、装置、设备和存储介质
CN115103232B (zh) * 2022-07-07 2023-12-08 北京字跳网络技术有限公司 一种视频播放方法、装置、设备和存储介质
CN115190367B (zh) * 2022-07-07 2023-12-01 北京字跳网络技术有限公司 一种视频播放方法、装置、设备和存储介质
CN115309317B (zh) * 2022-08-08 2024-09-06 北京字跳网络技术有限公司 媒体内容获取方法、装置、设备、可读存储介质及产品
CN117812459A (zh) * 2022-09-23 2024-04-02 北京字跳网络技术有限公司 视频处理方法、装置、设备及介质
CN115776578A (zh) * 2022-11-14 2023-03-10 杭州网易云音乐科技有限公司 一种视频生成方法及装置、一种音频播放方法及装置
CN116489466A (zh) * 2023-05-10 2023-07-25 北京达佳互联信息技术有限公司 视频编辑方法、装置、电子设备及介质
CN116708946A (zh) * 2023-06-08 2023-09-05 北京字跳网络技术有限公司 一种音乐视频生成方法、装置、电子设备及存储介质
CN119545100A (zh) * 2023-08-31 2025-02-28 北京字跳网络技术有限公司 视频编辑方法、装置、电子设备和存储介质
CN119854594A (zh) * 2023-10-16 2025-04-18 北京字跳网络技术有限公司 一种视频生成方法、装置、设备及存储介质
CN118075537A (zh) * 2024-02-19 2024-05-24 北京字跳网络技术有限公司 一种媒体数据的编辑方法、装置、电子设备及存储介质
CN120692428A (zh) * 2024-03-21 2025-09-23 北京字跳网络技术有限公司 视频处理方法、装置及终端设备
CN119364131A (zh) * 2024-10-29 2025-01-24 北京字跳网络技术有限公司 一种视频生成方法、装置、设备、介质及程序产品
CN120378674A (zh) * 2025-04-28 2025-07-25 北京字跳网络技术有限公司 用于交互的方法、装置、设备、介质及程序产品
CN120378673A (zh) * 2025-04-28 2025-07-25 北京字跳网络技术有限公司 用于内容处理的方法、装置、设备、介质及程序产品

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP3038108A1 (en) * 2014-12-22 2016-06-29 Thomson Licensing Method and system for generating a video album
US20170092331A1 (en) * 2015-09-30 2017-03-30 Apple Inc. Synchronizing Audio and Video Components of an Automatically Generated Audio/Video Presentation
CN107483843A (zh) * 2017-08-16 2017-12-15 成都品果科技有限公司 音视频匹配剪辑方法及装置
CN108900768A (zh) * 2018-07-12 2018-11-27 北京微播视界科技有限公司 视频拍摄方法、装置、终端、服务器和存储介质
CN110933487A (zh) * 2019-12-18 2020-03-27 北京百度网讯科技有限公司 卡点视频的生成方法、装置、设备及存储介质
CN111526427A (zh) * 2020-04-30 2020-08-11 维沃移动通信有限公司 视频生成方法、装置及电子设备
CN112911379A (zh) * 2021-01-15 2021-06-04 北京字跳网络技术有限公司 视频生成方法、装置、电子设备和存储介质

Family Cites Families (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7352952B2 (en) * 2003-10-16 2008-04-01 Magix Ag System and method for improved video editing
CN102256030A (zh) * 2010-05-20 2011-11-23 Tcl集团股份有限公司 可匹配背景音乐的相册演示系统及其背景音乐匹配方法
US9098579B2 (en) * 2011-06-07 2015-08-04 Kodak Alaris Inc. Automatically selecting thematically representative music
CN103793446B (zh) * 2012-10-29 2019-03-01 汤晓鸥 音乐视频的生成方法和系统
US10122983B1 (en) * 2013-03-05 2018-11-06 Google Llc Creating a video for an audio file
KR20140126556A (ko) * 2013-04-23 2014-10-31 주식회사 엘지유플러스 감성 기반 멀티미디어 재생을 위한 장치, 서버, 단말, 방법, 및 기록 매체
US20180330756A1 (en) * 2016-11-19 2018-11-15 James MacDonald Method and apparatus for creating and automating new video works
US20180295427A1 (en) * 2017-04-07 2018-10-11 David Leiberman Systems and methods for creating composite videos
KR102194281B1 (ko) * 2019-01-14 2020-12-22 박준희 시대별로 음원과 영상이 결합된 음악 방송 콘텐츠 제작 시스템 및 방법
CN111625685B (zh) * 2019-02-27 2023-04-07 阿里巴巴集团控股有限公司 音乐电台呈现方法与装置、视频数据分类方法与装置
CN110519638B (zh) * 2019-09-06 2023-05-16 Oppo广东移动通信有限公司 处理方法、处理装置、电子装置和存储介质
CN111274415B (zh) * 2020-01-14 2024-05-24 广州酷狗计算机科技有限公司 确定替补视频素材的方法、装置及计算机存储介质
BR112022017094A2 (pt) * 2020-03-02 2022-11-16 Visual Supply Company Sistemas e métodos para automatizar edição de vídeo
CN111683209B (zh) * 2020-06-10 2023-04-18 北京奇艺世纪科技有限公司 混剪视频的生成方法、装置、电子设备及计算机可读存储介质
CN111866585B (zh) * 2020-06-22 2023-03-24 北京美摄网络科技有限公司 一种视频处理方法及装置
CN111935537A (zh) * 2020-06-30 2020-11-13 百度在线网络技术(北京)有限公司 音乐短片视频生成方法、装置、电子设备和存储介质
CN111835986B (zh) * 2020-07-09 2021-08-24 腾讯科技(深圳)有限公司 视频编辑处理方法、装置及电子设备
CN111930994A (zh) * 2020-07-14 2020-11-13 腾讯科技(深圳)有限公司 视频编辑的处理方法、装置、电子设备及存储介质
CN112203140B (zh) * 2020-09-10 2022-04-01 北京达佳互联信息技术有限公司 一种视频剪辑方法、装置、电子设备及存储介质

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP3038108A1 (en) * 2014-12-22 2016-06-29 Thomson Licensing Method and system for generating a video album
US20170092331A1 (en) * 2015-09-30 2017-03-30 Apple Inc. Synchronizing Audio and Video Components of an Automatically Generated Audio/Video Presentation
CN107483843A (zh) * 2017-08-16 2017-12-15 成都品果科技有限公司 音视频匹配剪辑方法及装置
CN108900768A (zh) * 2018-07-12 2018-11-27 北京微播视界科技有限公司 视频拍摄方法、装置、终端、服务器和存储介质
CN110933487A (zh) * 2019-12-18 2020-03-27 北京百度网讯科技有限公司 卡点视频的生成方法、装置、设备及存储介质
CN111526427A (zh) * 2020-04-30 2020-08-11 维沃移动通信有限公司 视频生成方法、装置及电子设备
CN112911379A (zh) * 2021-01-15 2021-06-04 北京字跳网络技术有限公司 视频生成方法、装置、电子设备和存储介质

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See also references of EP4262221A4

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115730101A (zh) * 2022-11-23 2023-03-03 北京字跳网络技术有限公司 背景音乐确定方法、装置、计算机设备以及存储介质

Also Published As

Publication number Publication date
CN112911379A (zh) 2021-06-04
JP2024502664A (ja) 2024-01-22
JP7632816B2 (ja) 2025-02-19
EP4262221A1 (en) 2023-10-18
US20230368817A1 (en) 2023-11-16
US12033671B2 (en) 2024-07-09
CN112911379B (zh) 2023-06-27
EP4262221A4 (en) 2024-06-05
EP4262221B1 (en) 2026-04-22

Similar Documents

Publication Publication Date Title
WO2022152064A1 (zh) 视频生成方法、装置、电子设备和存储介质
US12483746B2 (en) Display method, apparatus, device and storage medium
WO2022143924A1 (zh) 视频生成方法、装置、电子设备和存储介质
WO2022048478A1 (zh) 多媒体数据的处理方法、生成方法及相关设备
CN113852767B (zh) 视频编辑方法、装置、设备及介质
WO2022198811A1 (zh) 一种音乐分享方法、装置、电子设备及存储介质
WO2022121558A1 (zh) 一种直播演唱方法、装置、设备和介质
WO2020077855A1 (zh) 视频拍摄方法、装置、电子设备及计算机可读存储介质
WO2020062683A1 (zh) 视频获取方法、装置、终端和介质
CN113365134A (zh) 音频分享方法、装置、设备及介质
WO2020207106A1 (zh) 关注用户的信息展示方法、装置、设备及存储介质
EP4124052B1 (en) Video production method and apparatus, and device and storage medium
CN110012339A (zh) 视频播放显示方法、装置、设备及存储介质
CN115599273B (zh) 媒体内容处理方法、装置、设备、可读存储介质及产品
CN115633206A (zh) 媒体内容展示方法、装置、设备及存储介质
WO2022052838A1 (zh) 视频文件的处理方法、装置、电子设备及计算机存储介质
WO2020259130A1 (zh) 精选片段处理方法、装置、电子设备及可读介质
US11272136B2 (en) Method and device for processing multimedia information, electronic equipment and computer-readable storage medium
CN115981769A (zh) 页面显示方法、装置、设备、计算机可读存储介质及产品
JP2024529251A (ja) メディアファイル処理方法、装置、デバイス、可読記憶媒体および製品
KR101489211B1 (ko) 사진을 이용한 동영상 제작 방법 및 그 장치
US12301896B2 (en) Audio bullet screen processing method and device
WO2024104333A1 (zh) 演播画面的处理方法、装置、电子设备及存储介质
CN115079908A (zh) 媒体内容收藏方法、装置、电子设备及计算机可读存储介质

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 22738945

Country of ref document: EP

Kind code of ref document: A1

WWE Wipo information: entry into national phase

Ref document number: 202327047553

Country of ref document: IN

Ref document number: 2023542890

Country of ref document: JP

ENP Entry into the national phase

Ref document number: 2022738945

Country of ref document: EP

Effective date: 20230714

REG Reference to national code

Ref country code: BR

Ref legal event code: B01A

Ref document number: 112023014078

Country of ref document: BR

NENP Non-entry into the national phase

Ref country code: DE

WWE Wipo information: entry into national phase

Ref document number: 11202305412T

Country of ref document: SG

ENP Entry into the national phase

Ref document number: 112023014078

Country of ref document: BR

Kind code of ref document: A2

Effective date: 20230713

WWG Wipo information: grant in national office

Ref document number: 2022738945

Country of ref document: EP