WO2023098531A1 - Video processing method, video processing device and computer-readable storage medium - Google Patents

Video processing method, video processing device and computer-readable storage medium

Info

Publication number
WO2023098531A1
Authority
WO
WIPO (PCT)
Prior art keywords
video
subtitle
translator
subtitles
processing method
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/CN2022/133682
Other languages
English (en)
French (fr)
Inventor
戚薇薇
唐艺丹
张益翔
高瑞娥
谈斯奇
林晔
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing ByteDance Network Technology Co Ltd
Original Assignee
Beijing ByteDance Network Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing ByteDance Network Technology Co Ltd
Priority to KR1020247020364A (patent KR102862444B1, ko)
Priority to EP22900336.3A (patent EP4429258A4, en)
Priority to AU2022399687A (patent AU2022399687A1, en)
Priority to JP2024532477A (patent JP2024543970A, ja)
Publication of WO2023098531A1 (zh)
Anticipated expiration
Priority to AU2026200035A (patent AU2026200035A1, en)
Legal status: Ceased (current)

Classifications

    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/40 Processing or translation of natural language
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43 Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/431 Generation of visual interfaces for content selection or interaction; Content or additional data rendering
    • H04N21/4312 Generation of visual interfaces for content selection or interaction; Content or additional data rendering involving specific graphical features, e.g. screen layout, special fonts or colors, blinking icons, highlights or animations
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00 Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01 Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/048 Interaction techniques based on graphical user interfaces [GUI]
    • G06F3/0484 Interaction techniques based on graphical user interfaces [GUI] for the control of specific functions or operations, e.g. selecting or manipulating an object, an image or a displayed text element, setting a parameter value or selecting a range
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/10 Text processing
    • G06F40/166 Editing, e.g. inserting or deleting
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/40 Processing or translation of natural language
    • G06F40/51 Translation evaluation
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/40 Processing or translation of natural language
    • G06F40/58 Use of machine translation, e.g. for multi-lingual retrieval, for server-side translation for client devices or for real-time translation
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06Q INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00 Administration; Management
    • G06Q10/40 Business processes related to social networking or social networking services
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06Q INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00 Administration; Management
    • G06Q10/40 Business processes related to social networking or social networking services
    • G06Q10/42 Determination of affinities or common interests between users
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43 Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/431 Generation of visual interfaces for content selection or interaction; Content or additional data rendering
    • H04N21/4312 Generation of visual interfaces for content selection or interaction; Content or additional data rendering involving specific graphical features, e.g. screen layout, special fonts or colors, blinking icons, highlights or animations
    • H04N21/4314 Generation of visual interfaces for content selection or interaction; Content or additional data rendering involving specific graphical features, e.g. screen layout, special fonts or colors, blinking icons, highlights or animations for fitting data in a restricted space on the screen, e.g. EPG data in a rectangular grid
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43 Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/442 Monitoring of processes or resources, e.g. detecting the failure of a recording device, monitoring the downstream bandwidth, the number of times a movie has been viewed, the storage space available from the internal hard disk
    • H04N21/44213 Monitoring of end-user related data
    • H04N21/44222 Analytics of user selections, e.g. selection of programmes or purchase activity
    • H04N21/44224 Monitoring of user activity on external systems, e.g. Internet browsing
    • H04N21/44226 Monitoring of user activity on external systems, e.g. Internet browsing on social networks
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47 End-user applications
    • H04N21/472 End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content
    • H04N21/47205 End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content for manipulating displayed content, e.g. interacting with MPEG-4 objects, editing locally
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47 End-user applications
    • H04N21/478 Supplemental services, e.g. displaying phone caller identification, shopping application
    • H04N21/4788 Supplemental services, e.g. displaying phone caller identification, shopping application communicating with other users, e.g. chatting
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47 End-user applications
    • H04N21/485 End-user interface for client configuration
    • H04N21/4856 End-user interface for client configuration for language selection, e.g. for the menu or subtitles
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47 End-user applications
    • H04N21/488 Data services, e.g. news ticker
    • H04N21/4884 Data services, e.g. news ticker for displaying subtitles
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80 Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/85 Assembly of content; Generation of multimedia applications
    • H04N21/854 Content authoring

Definitions

  • the present disclosure relates to the field of computer technology, and in particular to a video processing method, a video processing device and a computer-readable storage medium.
  • a social network may provide various services, such as photo or video sharing, messaging, etc., based on user input, to facilitate social interaction among users.
  • Digital media may include images, video, audio, text, and more.
  • users can post videos they create to social networks.
  • Other users on social networks can interact with video creators by browsing, liking, commenting, etc.
  • a video processing method including:
  • receiving original subtitle text added by the creator of a video for the video to be translated
  • a video processing device including:
  • a processor configured to receive original subtitle text added by the creator of a video for the video to be translated
  • a display configured to display approved subtitle translations within the video.
  • a video processing device including:
  • a processor coupled to the memory, the processor configured to execute one or more steps in the video processing method of any embodiment described in the present disclosure based on instructions stored in the memory.
  • a computer-readable storage medium on which a computer program is stored, and when the program is executed by a processor, one or more steps of the video processing method of any embodiment described in the present disclosure are executed.
  • FIG. 1 shows a flowchart of a video processing method according to some embodiments of the present disclosure
  • FIGS. 2A-2G illustrate schematic diagrams of an interactive interface for subtitle management according to some embodiments of the present disclosure
  • FIGS. 3A-3C show schematic diagrams of an interactive interface for translating original subtitles in a video according to some embodiments of the present disclosure
  • Fig. 3D shows a schematic diagram showing the guideline to register as a community translator according to some embodiments of the present disclosure
  • Fig. 4A shows a schematic diagram of a subtitle translation viewing page according to some embodiments of the present disclosure
  • FIG. 4B shows a schematic diagram of a translator information page according to some embodiments of the present disclosure
  • Fig. 5A shows a schematic diagram of a video playback page according to some embodiments of the present disclosure
  • FIG. 5B shows a schematic diagram of a subtitle language management page in some embodiments of the present disclosure
  • Fig. 5C shows a schematic diagram of a subtitle translation viewing page according to other embodiments of the present disclosure
  • Fig. 6A shows a schematic diagram of a video playing page according to other embodiments of the present disclosure
  • Fig. 6B shows a schematic diagram of an interactive interface where translators interact with audiences according to some embodiments of the present disclosure
  • FIG. 7 shows a block diagram of a video processing device according to some embodiments of the present disclosure.
  • Fig. 8 shows a block diagram of a video processing device according to other embodiments of the present disclosure.
  • FIG. 9 illustrates a block diagram of an electronic device according to some embodiments of the present disclosure.
  • the term "including" and its variants used in the present disclosure are open-ended terms that include at least the following elements/features but do not exclude other elements/features, i.e. "including but not limited to".
  • the term "comprising" and its variants used in the present disclosure are likewise open-ended terms that include at least the following elements/features but do not exclude other elements/features, namely "comprising but not limited to". Thus, "including" is synonymous with "comprising".
  • the term “based on” means “based at least in part on”.
  • references throughout this specification to "one embodiment," "some embodiments," or "an embodiment" mean that a particular feature, structure, or characteristic described in connection with the embodiment is included in at least one embodiment of the present invention.
  • the term “one embodiment” means “at least one embodiment”; the term “another embodiment” means “at least one additional embodiment”; the term “some embodiments” means “at least some embodiments.”
  • appearances of the phrases "in one embodiment," "in some embodiments," or "in an embodiment" in various places throughout the specification do not necessarily all refer to the same embodiment, but may refer to the same embodiment.
  • the present disclosure does not limit how to obtain the image or video to be applied/processed.
  • it can be obtained from a storage device, such as an internal memory or an external storage device.
  • a camera component can also be invoked to capture the image or video.
  • the image or video type is not specifically limited.
  • the image or video may be a raw image or video obtained by a camera device, or an image or video obtained by applying specific processing to the raw image or video, such as preliminary filtering, anti-aliasing, color adjustment, contrast adjustment, normalization, etc.
  • the preprocessing operation may also include other types of preprocessing operations known in the art, which will not be described in detail here.
  • Fig. 1 shows a flowchart of a video processing method according to some embodiments of the present disclosure.
  • the video processing method comprises: step S1, receiving original subtitle text added by the creator of a video for the video to be translated; step S3, selecting a translator based on the original subtitle text and the languages used by users on the social network; step S5, providing the selected translator with the video to be translated, and providing the translator, within the video, with an interactive interface for translating the original subtitle text of the video; step S7, receiving the subtitle translation returned by the translator from the interactive interface; and step S9, displaying the approved subtitle translation in the video.
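  • The flow of steps S1 to S9 can be pictured with a minimal Python sketch. It is not the disclosed implementation: the `Video` class and the `select_translator`, `request_translation` and `review` callbacks are hypothetical names introduced only to show the order of the steps and the fact that only an approved translation ends up displayed.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Optional


@dataclass
class Video:
    video_id: str
    original_language: str
    original_subtitles: List[str] = field(default_factory=list)
    # language code -> approved translation lines (hypothetical storage layout)
    approved_translations: Dict[str, List[str]] = field(default_factory=dict)


def process_video(
    video: Video,
    subtitles: List[str],
    target_language: str,
    select_translator: Callable[[Video, str], Optional[str]],
    request_translation: Callable[[str, Video, str], List[str]],
    review: Callable[[List[str], List[str]], bool],
) -> Optional[List[str]]:
    """Run steps S1-S9 once for one target language; all callbacks are assumptions."""
    # S1: receive the original subtitle text added by the creator.
    video.original_subtitles = list(subtitles)
    # S3: select a translator based on the subtitles and the users' languages.
    translator = select_translator(video, target_language)
    if translator is None:
        return None
    # S5 and S7: offer the video to the translator and receive the translation.
    translation = request_translation(translator, video, target_language)
    # S9: only a translation that passes review is stored for display.
    if not review(video.original_subtitles, translation):
        return None
    video.approved_translations[target_language] = translation
    return translation
```

  • In this sketch a rejected translation simply returns `None`; how a real platform handles rejection is not described in the disclosure.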
  • In step S1, the original subtitle text added by the creator of the video for the video to be translated may be received by providing the creator with an interactive interface for subtitle management.
  • FIGS. 2A-2G illustrate schematic diagrams of an interactive interface for subtitle management according to some embodiments of the present disclosure.
  • As shown in FIG. 2A, when the creator of the video plays the video to be translated, a "Generate Subtitles" option is displayed in the video, for example, in the form of a banner.
  • FIG. 2A can also show the play count and sharing status of the video to be translated.
  • FIG. 2A may also show a prompt to generate subtitles above the "Generate Subtitles" option.
  • the prompt to generate subtitles disappears after the creator of the video clicks the close button. It also disappears automatically after the creator has successfully added subtitles.
  • FIG. 2B can also show the progress of uploading subtitles, for example, 20% has been uploaded.
  • FIG. 2B also provides a cancel option. If the creator of the video wants to cancel the upload of subtitles, they can click the cancel button.
  • Creators who are not familiar with subtitle management, such as those using it for the first time, can open the operation guide provided by the platform before adding subtitles to learn how to add them. For example, the operation guide informs the creator of the video that the original subtitles can be generated first, that the original subtitles can then be automatically translated by translators (such as viewers from all over the world), and that community translators can help optimize the automatically translated subtitles.
  • a prompt can also be displayed telling them that the subtitles will be automatically translated.
  • After the creator of the video uploads the original subtitle text, they can enter the preview page, as shown in FIG. 2C. Multiple subtitles are displayed on the preview page and are highlighted one by one (e.g. sentence by sentence). In response to a play request for the video, the video may be played with voice and subtitles synchronized. The preview page in FIG. 2C also provides options such as edit, cancel and save, and the creator of the video can click the corresponding button as needed.
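  • One way to highlight subtitles sentence by sentence in sync with playback is to look up the subtitle whose time range contains the current playback position. The sketch below assumes that each subtitle carries start and end timestamps in seconds and that subtitles are sorted and do not overlap; the disclosure does not specify a timing format, and `Cue` and `active_cue_index` are hypothetical names.

```python
from bisect import bisect_right
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Cue:
    start: float  # seconds from the start of the video (assumed format)
    end: float
    text: str


def active_cue_index(cues: List[Cue], t: float) -> Optional[int]:
    """Return the index of the subtitle to highlight at playback time t, if any.

    Assumes cues are sorted by start time and do not overlap.
    """
    starts = [cue.start for cue in cues]
    # Last cue whose start time is <= t.
    i = bisect_right(starts, t) - 1
    if i >= 0 and t < cues[i].end:
        return i
    return None
```

  • A player could call `active_cue_index` on every time update and highlight the returned line on the preview page; between two subtitles the function returns `None`, so nothing is highlighted.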
  • the subtitles added by the creator will be displayed on the corresponding video playback page, as shown in Figure 2D.
  • as shown in FIG. 2D, the creator may also be prompted to adjust the position of the subtitles.
  • the creator of the video can adjust the position of the subtitle on the video playback page by, for example, dragging and dropping.
  • FIG. 2F illustrates the "Creator Tools" page. Clicking "Manage subtitles" opens the subtitle management page, as shown in FIG. 2G.
  • the subtitle management page of FIG. 2G displays the subtitle status of the video.
  • the subtitle status of the video includes subtitles to be generated and subtitles added.
  • videos awaiting subtitles are sorted by play count.
  • the video play count is displayed on the video cover on the subtitle management page, for example in a language other than the creator's language.
  • the subtitle management page can also display the type of subtitle translation, for example, whether it is an automatic translation or a community translation.
  • the number of automatic translations and the number of community translations can also be displayed on the subtitle management page.
  • videos are arranged by update time. For example, updates can be made when subtitle translations are added; when subtitles are translated by a translator; when the creator deletes a translator's subtitle translations.
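  • The ordering and counts on the subtitle management page can be sketched as follows. The sort directions (highest play count first, most recently updated first) and the assumption that update-time ordering applies to videos that already have subtitles are not stated in the text; `ManagedVideo` and the helper functions are hypothetical names.

```python
from collections import Counter
from dataclasses import dataclass
from datetime import datetime
from typing import Dict, Iterable, List


@dataclass
class ManagedVideo:
    video_id: str
    play_count: int
    has_subtitles: bool
    updated_at: datetime


def pending_videos(videos: Iterable[ManagedVideo]) -> List[ManagedVideo]:
    """Videos still waiting for subtitles, highest play count first (assumed order)."""
    pending = [v for v in videos if not v.has_subtitles]
    return sorted(pending, key=lambda v: v.play_count, reverse=True)


def subtitled_videos(videos: Iterable[ManagedVideo]) -> List[ManagedVideo]:
    """Videos with subtitles added, most recently updated first (assumed order)."""
    done = [v for v in videos if v.has_subtitles]
    return sorted(done, key=lambda v: v.updated_at, reverse=True)


def translation_counts(kinds: Iterable[str]) -> Dict[str, int]:
    """Count automatic and community translations for display on the page."""
    counts = Counter(kinds)
    return {"automatic": counts["automatic"], "community": counts["community"]}
```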
  • a path is provided for entering the "Add Subtitles" page from the "Generate Subtitles" notification bar in the video being played.
  • the creator of the video can also start directly from the "Creator Tools" page shown in FIG. 2F and click "Manage subtitles" to enter the subtitle management page. From the subtitle management page, the creator can click the "Generate Subtitles" button on the video to be translated to add subtitles, or click the cover of the video to be translated to view the video and then click the "Generate Subtitles" button in the video to add subtitles.
  • the language used by the user may include at least one of the following: a system language used by the user, a video language browsed by the user, and a language input by the user.
  • For example, if the original subtitle text of a video is in English and is to be translated into Chinese or Japanese, users whose system language, browsed-video languages, and input languages include both English and Chinese or Japanese can be selected as translators. Selecting translators whose languages match both the original subtitle text and the translation can more effectively improve the quality of the translation and the efficiency of translation review.
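  • The language-matching rule in this example can be sketched as a simple set test: a user qualifies when the union of their system language, browsed-video languages and input languages contains the source language and at least one target language. `UserLanguages` and `select_translators` are hypothetical names, and the language codes are illustrative.

```python
from dataclasses import dataclass, field
from typing import Iterable, List, Set


@dataclass
class UserLanguages:
    user_id: str
    system_language: str
    browsed_video_languages: Set[str] = field(default_factory=set)
    input_languages: Set[str] = field(default_factory=set)

    def all_languages(self) -> Set[str]:
        """Union of every language signal the platform has for this user."""
        return {self.system_language} | self.browsed_video_languages | self.input_languages


def select_translators(
    users: Iterable[UserLanguages],
    source_language: str,
    target_languages: Iterable[str],
) -> List[str]:
    """Return ids of users who use the source language and at least one target language."""
    targets = set(target_languages)
    selected = []
    for user in users:
        languages = user.all_languages()
        if source_language in languages and languages & targets:
            selected.append(user.user_id)
    return selected
```

  • For English subtitles to be translated into Chinese or Japanese, the call would be `select_translators(users, "en", ["zh", "ja"])`.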
  • Translators can be registered community translators or unregistered translators. For unregistered translators, guidelines for registering as community translators can also be provided to translators. Selecting registered translators to provide translations can further improve the quality of translations and the efficiency of translation review.
  • a translator can also be selected according to the type of the video to be translated and the types of video that the user is interested in. For example, the user's expected interest value for different types of videos is determined from the user's historical behavior data; the types of video that the user is interested in are determined from those expected interest values; and a user whose video types of interest match the type of the video to be translated is selected as a translator. Providing translators with videos of the types they like can increase their enthusiasm for translating, improve the quality of the translation, and reduce the cost of translation review.
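  • The disclosure does not give a formula for the expected interest value. As one possible illustration only, the sketch below scores each video type by a weighted count of the user's past actions and normalizes the scores to shares; the action weights, the 0.3 threshold and all function names are assumptions.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, Tuple

# Hypothetical weights for each kind of historical behavior.
ACTION_WEIGHTS = {"view": 1.0, "like": 3.0, "comment": 4.0, "share": 5.0}


def expected_interest(history: Iterable[Tuple[str, str]]) -> Dict[str, float]:
    """Map each video type to its share of the user's weighted activity.

    history is a sequence of (video_type, action) pairs.
    """
    scores: Dict[str, float] = defaultdict(float)
    for video_type, action in history:
        scores[video_type] += ACTION_WEIGHTS.get(action, 0.0)
    total = sum(scores.values())
    if total == 0:
        return {}
    return {video_type: score / total for video_type, score in scores.items()}


def interested_types(history: Iterable[Tuple[str, str]], threshold: float = 0.3) -> List[str]:
    """Video types whose expected interest value reaches the (assumed) threshold."""
    interest = expected_interest(history)
    return sorted(t for t, value in interest.items() if value >= threshold)


def matches_video(history: Iterable[Tuple[str, str]], video_type: str, threshold: float = 0.3) -> bool:
    """True when the video's type is one the user is interested in."""
    return video_type in interested_types(history, threshold)
```

  • A user for whom `matches_video` returns `True` would then be a candidate translator for that video, in addition to passing the language check.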
  • Step S5 will be described below in conjunction with FIGS. 3A-3D .
  • FIGS. 3A-3C show schematic diagrams of an interactive interface for translating original subtitles in a video according to some embodiments of the present disclosure.
  • FIG. 3D shows a schematic diagram illustrating the guideline for registering as a community translator according to some embodiments of the present disclosure.
  • In step S5, the video to be translated is provided only to the selected translator, and an interactive interface for translating the original subtitle text of the video is provided to the translator within the video, as shown in FIG. 3A.
  • the translator is shown a translation of the original subtitle text of the video to be translated, in the form of an information feed in the video.
  • Fig. 3A may also show the original text of subtitles. Except for the selected translator, no interactive interface is provided for other users to translate the original subtitles in the video.
  • In response to the translator's request to edit the translation, the proofreading page is entered, as shown in FIG. 3B.
  • the proofreading page displays the original subtitle text item by item (e.g. sentence by sentence), with the translation below each item, and the editing position is placed at the translation of the subtitle selected by the translator.
  • Translations can be machine translations or translations of subtitles provided by other translators.
  • the editing interface may include a pop-up keyboard, as shown in FIG. 3C.
  • Translators can clear the translation and enter their own subtitle translation. After completing the subtitle translation, the translator can submit it to the platform and wait for review. Once the translation passes review, the translator receives a notification from the platform. That is, the video processing method further includes: sending a notification to the translator after the translator's subtitle translation is approved. The translator can also enter the corresponding task management interface to check the status of the subtitle translation, such as under review, approved, or failed.
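  • The review statuses and the approval notification can be modeled as a small state machine. This is a sketch under assumptions: `ReviewStatus`, `Submission` and `finish_review` are hypothetical names, `notify` stands in for whatever notification channel the platform uses, and the text only states that a notification is sent on approval.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Callable, List


class ReviewStatus(Enum):
    UNDER_REVIEW = "under review"
    APPROVED = "approved"
    FAILED = "failed"


@dataclass
class Submission:
    translator_id: str
    lines: List[str]
    status: ReviewStatus = ReviewStatus.UNDER_REVIEW


def finish_review(
    submission: Submission,
    approved: bool,
    notify: Callable[[str, str], None],
) -> None:
    """Record the review outcome and notify the translator on approval."""
    if submission.status is not ReviewStatus.UNDER_REVIEW:
        raise ValueError("submission has already been reviewed")
    submission.status = ReviewStatus.APPROVED if approved else ReviewStatus.FAILED
    if approved:
        # The disclosure only mentions a notification after approval.
        notify(submission.translator_id, "Your subtitle translation was approved.")
```

  • A task management interface could read `submission.status` directly to show whether a translation is under review, approved, or failed.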
  • providing the translator in the video with an interactive interface for translating the original text of subtitles in the video further includes providing the translator with a guide to register as a community translator, as shown in FIG. 3D .
  • FIG. 4A shows a schematic diagram of a subtitle translation viewing page according to some embodiments of the present disclosure.
  • FIG. 4B shows a schematic diagram of a translator information page according to some embodiments of the present disclosure.
  • Fig. 5A shows a schematic diagram of a video playing page according to some embodiments of the present disclosure.
  • FIG. 5B shows a schematic diagram of a subtitle language management page in some embodiments of the present disclosure.
  • Fig. 5C shows a schematic diagram of a subtitle translation viewing page according to other embodiments of the present disclosure.
  • In step S7, the subtitle translation returned by the translator from the interactive interface is received and reviewed.
  • the creator of the video may also be notified that a subtitle translation has been approved.
  • the creator of the video can view the subtitle translation according to the path provided by the notification, such as the subtitle translation viewing page shown in Figure 4A.
  • the creator of the video can also click the "View Translator" button shown in Figure 4A to enter the translator's information page to view information about the translator, as shown in Figure 4B.
  • Figure 4B shows the identity of the translator, such as whether he belongs to the community translator, the language he is good at, and can also show thumbnails of other translation works of the translator (such as video covers).
  • the creator of the video can also jump to the video playback page by clicking on the video, as shown in Figure 5A.
  • the video playback page displays the original text of the subtitle by default, but provides language options.
  • when a language option is selected, the subtitle language management page is entered, as shown in FIG. 5B.
  • FIG. 5B shows the language of the original subtitle text and the languages of the various versions of the subtitle translation.
  • the creator of the video can select a subtitle translation in a corresponding language to view the corresponding subtitle translation, as shown in FIG. 5C . Similar to FIG. 4A, FIG. 5C shows the subtitle translation and displays a "View Translator" button.
  • the creator of the video can also click the "View Translator” button shown in Figure 5C to enter the translator's information page to view and understand the relevant information of the translator, as shown in Figure 4B.
  • the creator of the video can also remove the translator's subtitle translation through the "Remove Translator's Editing" button shown in FIG. 5C . Subtitle translations adopted by the video's creator will be published.
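  • The subtitle language management described above can be sketched as a registry that holds the original text plus one translation per language, lets the creator adopt or remove a translator's version, and falls back to the original text by default. All class and method names are hypothetical, and the one-translation-per-language layout is an assumption; the disclosure does not say what is shown after a translation is removed.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass
class SubtitleTrack:
    language: str
    lines: List[str]
    translator_id: Optional[str] = None  # None for the creator's original text
    published: bool = False


class SubtitleLanguages:
    """Original subtitles plus one translation per language (assumed layout)."""

    def __init__(self, original: SubtitleTrack) -> None:
        self.original = original
        self._translations: Dict[str, SubtitleTrack] = {}

    def add_translation(self, track: SubtitleTrack) -> None:
        self._translations[track.language] = track

    def languages(self) -> List[str]:
        """Languages listed on the management page, original first."""
        return [self.original.language] + sorted(self._translations)

    def adopt(self, language: str) -> None:
        """Creator adopts a translation, which publishes it."""
        self._translations[language].published = True

    def remove_translator_edit(self, language: str) -> None:
        """Creator removes the translator's version for this language."""
        self._translations.pop(language, None)

    def track_for(self, language: Optional[str] = None) -> SubtitleTrack:
        """Published translation for the language, else the original text."""
        track = self._translations.get(language) if language is not None else None
        if track is not None and track.published:
            return track
        return self.original
```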
  • FIG. 6A shows a schematic diagram of a video playing page according to other embodiments of the present disclosure.
  • Fig. 6B shows a schematic diagram of an interactive interface where a translator interacts with an audience according to some embodiments of the present disclosure.
  • In step S9, the approved subtitle translation is displayed in the video.
  • the original subtitle text of the video and the published subtitle translation will be synchronized with the audio of the video.
  • the video processing method further includes: providing an interactive interface for the translator to interact with the audience in the video, as shown in FIG. 6B .
  • while the approved subtitle translation is displayed in the video, the signature of the translator corresponding to the subtitle translation, such as the translator's username, is also displayed.
  • Viewers can interact with the translator as shown in FIG. 6B, for example by giving likes.
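  • A minimal sketch of showing the translator's signature with the subtitle and recording viewer likes is given below. The caption layout ("Translated by @username") and the `DisplayedTranslation` class are illustrative assumptions, not the layout shown in the figures.

```python
from dataclasses import dataclass


@dataclass
class DisplayedTranslation:
    text: str
    translator_username: str
    likes: int = 0

    def caption(self) -> str:
        """Subtitle text followed by the translator's signature (assumed layout)."""
        return f"{self.text}\nTranslated by @{self.translator_username}"

    def like(self) -> int:
        """Record one viewer like and return the new total."""
        self.likes += 1
        return self.likes
```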
  • FIG. 7 shows a block diagram of a video processing device according to some embodiments of the present disclosure.
  • the video processing device 7 includes a processor 71 and a display 72 .
  • the processor 71 is configured to: receive the original subtitle text added by the creator of the video for the video to be translated; select a translator based on the original subtitle text and the languages used by users on the social network; provide the selected translator with the video to be translated, and provide the translator with an interactive interface for translating the original subtitle text in the video; and receive the subtitle translation returned by the translator from the interactive interface.
  • the processor 71 is configured to perform steps S1-S7.
  • the display 72 is configured to display the approved subtitle translation in the video, for example, execute step S9.
  • the apparatus may also include a memory that can store various information generated in operation by the video processing device, each unit included in the video processing device, programs and data for operations, and the like.
  • the memory can be volatile memory and/or non-volatile memory.
  • memory may include, but is not limited to, random access memory (RAM), dynamic random access memory (DRAM), static random access memory (SRAM), read only memory (ROM), flash memory.
  • the memory may also be located outside the video processing device.
  • Fig. 8 shows a block diagram of a video processing device according to other embodiments of the present disclosure.
  • the video processing device 8 can be any of various types of equipment, including but not limited to mobile terminals such as mobile phones, notebook computers, digital broadcast receivers, PDAs (personal digital assistants), PADs (tablet computers), PMPs (portable multimedia players), and vehicle-mounted terminals (such as vehicle-mounted navigation terminals), as well as stationary terminals such as digital TVs and desktop computers.
  • the video processing device 8 includes: a memory 81 and a processor 82 coupled to the memory 81 . It should be noted that the components of the video processing device 8 shown in FIG. 8 are exemplary rather than limiting, and the video processing device 8 may also have other components according to actual application requirements. Processor 82 may control other components in video processing device 8 to perform desired functions.
  • memory 81 is used to store one or more computer readable instructions.
  • the processor 82 is used to execute the computer-readable instructions; when executed by the processor 82, the computer-readable instructions implement the method according to any one of the above-mentioned embodiments.
  • for the specific implementation of each step of the method and the related explanations, reference may be made to the above-mentioned embodiments; repeated details are omitted here.
  • processor 82 and the memory 81 may directly or indirectly communicate with each other.
  • processor 82 and memory 81 may communicate via a network.
  • a network may include a wireless network, a wired network, and/or any combination of a wireless network and a wired network.
  • the processor 82 and the memory 81 may also communicate with each other through the system bus, which is not limited in the present disclosure.
  • the processor 82 can be embodied as any suitable processor or processing device, such as a central processing unit (CPU), a graphics processing unit (GPU), or a network processor (NP); it can also be a digital signal processor (DSP), an application-specific integrated circuit (ASIC), a field-programmable gate array (FPGA) or other programmable logic device, a discrete gate or transistor logic device, or a discrete hardware component.
  • the central processing unit (CPU) may be an X86 or ARM architecture or the like.
  • memory 81 may include any combination of various forms of computer-readable storage media, such as volatile memory and/or non-volatile memory.
  • the memory 81 may include, for example, a system memory, and the system memory stores, for example, an operating system, an application program, a boot loader (Boot Loader), a database, and other programs. Various application programs, various data, and the like can also be stored in the storage medium.
  • a central processing unit (CPU) 901 executes various processes according to programs stored in a read only memory (ROM) 902 or programs loaded from a storage section 908 to a random access memory (RAM) 903 .
  • in the RAM 903, data required when the CPU 901 executes the various processing is also stored as necessary.
  • the central processing unit is only exemplary, and it may also be other types of processors, such as the various processors mentioned above.
  • ROM 902, RAM 903, and storage section 908 may be various forms of computer-readable storage media, as described below. It should be noted that although the ROM 902, the RAM 903 and the storage section 908 are shown separately in FIG. 9, one or more of them may be combined or located in the same or different memory or storage modules.
  • the CPU 901, ROM 902, and RAM 903 are connected to each other via a bus 904.
  • An input/output interface 905 is also connected to the bus 904 .
  • the following components are connected to the input/output interface 905: an input section 906, such as a touch screen, touch pad, keyboard, mouse, image sensor, microphone, accelerometer, gyroscope, etc.; an output section 907, including a display, such as a cathode ray tube (CRT) or a liquid crystal display (LCD), a speaker, a vibrator, etc.; a storage section 908 including a hard disk, a magnetic tape, etc.; and a communication section 909 including a network interface card such as a LAN card, a modem, and the like.
  • the communication section 909 allows communication processing to be performed via a network such as the Internet. It is easy to understand that although FIG. 9 shows each device or module in the electronic device 900 communicating through the bus 904, they may also communicate through a network or other methods, wherein the network may include a wireless network, a wired network, and/or any combination of wireless and wired networks.
  • a drive 910 is also connected to the input/output interface 905 as needed.
  • a removable medium 911 such as a magnetic disk, an optical disk, a magneto-optical disk, a semiconductor memory, or the like is mounted on the drive 910 as necessary, so that a computer program read therefrom is installed into the storage section 908 as necessary.
  • programs constituting the software can be installed from a network such as the Internet or a storage medium such as the removable medium 911 .
  • embodiments of the present disclosure include a computer program product, which includes a computer program carried on a computer-readable medium, where the computer program includes program codes for executing the methods shown in the flowcharts.
  • the computer program may be downloaded and installed from the network via the communication section 909, or installed from the storage section 908, or installed from the ROM 902.
  • when the computer program is executed by the CPU 901, the above-mentioned functions defined in the methods of the embodiments of the present disclosure are performed.
  • a computer-readable medium may be a tangible medium that may contain or store a program for use by or in conjunction with an instruction execution system, apparatus, or device.
  • a computer readable medium may be a computer readable signal medium or a computer readable storage medium or any combination of the two.
  • a computer-readable storage medium may be, for example, but not limited to: an electrical, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any combination thereof.
  • more specific examples of computer-readable storage media may include, but are not limited to, an electrical connection with one or more wires, a portable computer diskette, a hard disk, random access memory (RAM), read-only memory (ROM), erasable programmable read-only memory (EPROM or flash memory), an optical fiber, portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the above.
  • a computer-readable storage medium may be any tangible medium that contains or stores a program that can be used by or in conjunction with an instruction execution system, apparatus, or device.
  • a computer-readable signal medium may include a data signal propagated in baseband or as part of a carrier wave carrying computer-readable program code therein.
  • Such propagated data signals may take many forms, including but not limited to electromagnetic signals, optical signals, or any suitable combination of the foregoing.
  • a computer-readable signal medium may also be any computer-readable medium other than a computer-readable storage medium that can send, propagate, or transport a program for use by or in conjunction with an instruction execution system, apparatus, or device.
  • Program code embodied on a computer readable medium may be transmitted by any appropriate medium, including but not limited to wires, optical cables, RF (radio frequency), etc., or any suitable combination of the above.
  • the above-mentioned computer-readable medium may be included in the above-mentioned electronic device, or may exist independently without being incorporated into the electronic device.
  • a computer program is also provided, including instructions which, when executed by a processor, cause the processor to execute the method of any one of the above embodiments.
  • instructions may be embodied as computer program code.
  • the computer program code for performing the operations of the present disclosure may be written in one or more programming languages or a combination thereof, including but not limited to object-oriented programming languages such as Java, Smalltalk, and C++, as well as conventional procedural programming languages such as the "C" language or similar programming languages.
  • the program code may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer or entirely on the remote computer or server.
  • the remote computer can be connected to the user's computer through any kind of network, including a local area network (LAN) or a wide area network (WAN), or it can be connected to an external computer (for example, through the Internet using an Internet service provider).
  • each block in a flowchart or block diagram may represent a module, program segment, or portion of code that contains one or more executable instructions for implementing the specified logical functions.
  • the functions noted in the block may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or they may sometimes be executed in the reverse order, depending upon the functionality involved.
  • each block of the block diagrams and/or flowchart illustrations, and combinations of blocks in the block diagrams and/or flowchart illustrations, can be implemented by a dedicated hardware-based system that performs the specified functions or operations, or may be implemented by a combination of dedicated hardware and computer instructions.
  • modules, components or units involved in the embodiments described in the present disclosure may be implemented by software or by hardware. Wherein, the name of a module, component or unit does not constitute a limitation on the module, component or unit itself under certain circumstances.
  • exemplary hardware logic components include: Field Programmable Gate Array (FPGA), Application Specific Integrated Circuit (ASIC), Application Specific Standard Product (ASSP), System on Chip (SOC), Complex Programmable Logic Device (CPLD), and so on.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • General Engineering & Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Business, Economics & Management (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • Human Computer Interaction (AREA)
  • Quality & Reliability (AREA)
  • Human Resources & Organizations (AREA)
  • General Business, Economics & Management (AREA)
  • Tourism & Hospitality (AREA)
  • Strategic Management (AREA)
  • Operations Research (AREA)
  • Marketing (AREA)
  • Economics (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Databases & Information Systems (AREA)
  • Social Psychology (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Computer Security & Cryptography (AREA)
  • Machine Translation (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
  • Computing Systems (AREA)

Abstract

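The present disclosure relates to a video processing method, a video processing apparatus, and a computer-readable storage medium. The video processing method includes: receiving original subtitle text added by the creator of a video for the video to be translated; selecting a translator based on the original subtitle text and the languages used by users on a social network; providing the video to be translated to the selected translator, and providing the translator, within the video, with an interactive interface for translating the original subtitle text in the video; receiving the subtitle translation returned by the translator from the interactive interface; and displaying, in the video, the subtitle translation that has passed review.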

Description

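Video Processing Method, Video Processing Apparatus, and Computer-Readable Storage Medium
Cross-Reference to Related Applications
This application is based on and claims priority to Chinese Patent Application No. 202111447076.5, filed on November 30, 2021 and entitled "Video Processing Method, Video Processing Apparatus, and Computer-Readable Storage Medium", the disclosure of which is incorporated herein by reference in its entirety.
Technical Field
The present disclosure relates to the field of computer technology, and in particular to a video processing method, a video processing apparatus, and a computer-readable storage medium.
Background
A social network can provide various services based on user input, such as photo or video sharing and messaging, to facilitate social interaction among users.
By interacting with the social network, users can upload digital media to the system for others to browse. Digital media may include images, video, audio, text, and the like. For example, a user can post a self-made video to the social network. Other users on the social network can interact with the video creator by browsing, liking, commenting, and so on.
As users rely more heavily on social networks, they also expect a better experience from them, and in particular place increasingly high demands on the quality of subtitle translations in videos.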
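Summary
According to some embodiments of the present disclosure, a video processing method is provided, including:
receiving original subtitle text added by the creator of a video for the video to be translated;
selecting a translator based on the original subtitle text and the languages used by users on a social network;
providing the video to be translated to the selected translator, and providing the translator, within the video, with an interactive interface for translating the original subtitle text in the video;
receiving the subtitle translation returned by the translator from the interactive interface; and
displaying, in the video, the subtitle translation that has passed review.
According to other embodiments of the present disclosure, a video processing apparatus is provided, including:
a processor configured to:
receive original subtitle text added by the creator of a video for the video to be translated;
select a translator based on the original subtitle text and the languages used by users on a social network;
provide the video to be translated to the selected translator, and provide the translator, within the video, with an interactive interface for translating the original subtitle text in the video; and
receive the subtitle translation returned by the translator from the interactive interface; and
a display configured to display, in the video, the subtitle translation that has passed review.
According to still other embodiments of the present disclosure, a video processing apparatus is provided, including:
a memory; and
a processor coupled to the memory, the processor being configured to perform, based on instructions stored in the memory, one or more steps of the video processing method of any embodiment described in the present disclosure.
According to yet other embodiments of the present disclosure, a computer-readable storage medium is provided, storing a computer program that, when executed by a processor, performs one or more steps of the video processing method of any embodiment described in the present disclosure.
This Summary is provided to introduce, in a simplified form, concepts that are described in detail in the Detailed Description below. This Summary is not intended to identify key features or essential features of the claimed technical solution, nor is it intended to limit the scope of the claimed technical solution.
Other features, aspects, and advantages of the present disclosure will become apparent from the following detailed description of exemplary embodiments of the present disclosure with reference to the accompanying drawings.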
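Brief Description of the Drawings
Preferred embodiments of the present disclosure are described below with reference to the accompanying drawings. The drawings described here provide a further understanding of the present disclosure; together with the detailed description below, they are incorporated in and form part of this specification and serve to explain the present disclosure. It should be understood that the drawings described below relate only to some embodiments of the present disclosure and do not limit it. In the drawings:
FIG. 1 shows a flowchart of a video processing method according to some embodiments of the present disclosure;
FIGS. 2A-2G show schematic diagrams of interactive interfaces for subtitle management according to some embodiments of the present disclosure;
FIGS. 3A-3C show schematic diagrams of interactive interfaces for translating the original subtitle text in a video according to some embodiments of the present disclosure;
FIG. 3D shows a schematic diagram of guidance for registering as a community translator according to some embodiments of the present disclosure;
FIG. 4A shows a schematic diagram of a subtitle translation viewing page according to some embodiments of the present disclosure;
FIG. 4B shows a schematic diagram of a translator information page according to some embodiments of the present disclosure;
FIG. 5A shows a schematic diagram of a video playback page according to some embodiments of the present disclosure;
FIG. 5B shows a schematic diagram of a subtitle language management page according to some embodiments of the present disclosure;
FIG. 5C shows a schematic diagram of a subtitle translation viewing page according to other embodiments of the present disclosure;
FIG. 6A shows a schematic diagram of a video playback page according to other embodiments of the present disclosure;
FIG. 6B shows a schematic diagram of an interactive interface for interaction between translators and viewers according to some embodiments of the present disclosure;
FIG. 7 shows a block diagram of a video processing apparatus according to some embodiments of the present disclosure;
FIG. 8 shows a block diagram of a video processing apparatus according to other embodiments of the present disclosure;
FIG. 9 shows a block diagram of an electronic device according to some embodiments of the present disclosure.
It should be understood that, for ease of description, the dimensions of the parts shown in the drawings are not necessarily drawn to scale. The same or similar reference numerals are used throughout the drawings to denote the same or similar components. Therefore, once an item is defined in one drawing, it may not be discussed further in subsequent drawings.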
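Detailed Description
The technical solutions in the embodiments of the present disclosure are described clearly and completely below with reference to the accompanying drawings. Obviously, the described embodiments are only some, not all, of the embodiments of the present disclosure. The following description of the embodiments is merely illustrative and in no way limits the present disclosure or its application or uses. It should be understood that the present disclosure can be implemented in various forms and should not be construed as limited to the embodiments set forth here.
It should be understood that the steps recited in the method embodiments of the present disclosure may be performed in a different order and/or in parallel. In addition, method embodiments may include additional steps and/or omit some of the illustrated steps. The scope of the present disclosure is not limited in this respect. Unless specifically stated otherwise, the relative arrangement of components and steps, the numerical expressions, and the numerical values set forth in these embodiments should be interpreted as merely exemplary and do not limit the scope of the present disclosure.
The term "including" and its variants as used in the present disclosure are open-ended terms meaning that at least the following elements/features are included, without excluding other elements/features, that is, "including but not limited to". Likewise, the term "comprising" and its variants are open-ended terms meaning that at least the following elements/features are comprised, without excluding other elements/features, that is, "comprising but not limited to". Thus, "including" and "comprising" are synonymous. The term "based on" means "based at least in part on".
Reference throughout this specification to "one embodiment", "some embodiments", or "an embodiment" means that a particular feature, structure, or characteristic described in connection with the embodiment is included in at least one embodiment of the present invention. For example, the term "one embodiment" means "at least one embodiment"; the term "another embodiment" means "at least one additional embodiment"; and the term "some embodiments" means "at least some embodiments". Moreover, appearances of the phrases "in one embodiment", "in some embodiments", or "in an embodiment" in various places throughout this specification do not necessarily all refer to the same embodiment, although they may.
Note that concepts such as "first" and "second" mentioned in the present disclosure are used only to distinguish different apparatuses, modules, or units, and are not intended to limit the order of, or the interdependence between, the functions performed by these apparatuses, modules, or units. Unless otherwise specified, such concepts are not intended to imply that the objects so described must be in a given order, whether temporally, spatially, in ranking, or in any other manner.
Note that the modifiers "a" and "a plurality of" mentioned in the present disclosure are illustrative rather than restrictive; those skilled in the art should understand that, unless the context clearly indicates otherwise, they should be understood as "one or more".
The names of messages or information exchanged between multiple apparatuses in the embodiments of the present disclosure are for illustrative purposes only and are not intended to limit the scope of these messages or information.
Embodiments of the present disclosure are described in detail below with reference to the accompanying drawings, but the present disclosure is not limited to these specific embodiments. The specific embodiments below may be combined with one another, and the same or similar concepts or processes may not be repeated in some embodiments. Furthermore, in one or more embodiments, particular features, structures, or characteristics may be combined in any suitable manner that will be apparent to those of ordinary skill in the art from the present disclosure.
It should be understood that the present disclosure does not limit how the image or video to be applied or processed is obtained. In one embodiment of the present disclosure, it may be obtained from a storage device, such as an internal memory or an external storage device; in another embodiment, a camera component may be invoked to capture it. It should be noted that, in the context of this specification, the type of image or video is not specifically limited. In addition, the image or video may be an original image or video obtained by a camera device, or an image or video on which specific processing has already been performed, such as preliminary filtering, anti-aliasing, color adjustment, contrast adjustment, normalization, and so on. It should be noted that the preprocessing operations may also include other types of preprocessing operations known in the art, which are not described in detail here.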
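As users rely more heavily on social networks, they also expect a better experience from them, and in particular place increasingly high demands on the quality of subtitle translations in videos. In the related art, only the publisher of a video can edit and publish its subtitle information. To further improve the quality of subtitle translations in videos, crowdsourced translation is used to provide them.
FIG. 1 shows a flowchart of a video processing method according to some embodiments of the present disclosure.
As shown in FIG. 1, the video processing method includes: step S1, receiving original subtitle text added by the creator of a video for the video to be translated; step S3, selecting a translator based on the original subtitle text and the languages used by users on a social network; step S5, providing the video to be translated to the selected translator, and providing the translator, within the video, with an interactive interface for translating the original subtitle text in the video; step S7, receiving the subtitle translation returned by the translator from the interactive interface; and step S9, displaying, in the video, the subtitle translation that has passed review.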
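In step S1, the original subtitle text added by the creator of the video for the video to be translated may be received by providing the creator with an interactive interface for subtitle management.
FIGS. 2A-2G show schematic diagrams of interactive interfaces for subtitle management according to some embodiments of the present disclosure.
As shown in FIG. 2A, when the creator of a video plays the video to be translated, a "Generate subtitles" option is displayed in the video, for example as a banner. FIG. 2A may also show the play count and sharing statistics of the video to be translated. FIG. 2A may also show a prompt for generating subtitles above the "Generate subtitles" option. The prompt disappears after the creator taps the close button. It also disappears automatically once the creator has successfully added subtitles.
Tapping the "Generate subtitles" button opens the "Add subtitles" page. On the "Add subtitles" page, the creator can upload the original subtitle text, as shown in FIG. 2B. FIG. 2B may also show the upload progress, for example 20% uploaded. FIG. 2B also provides a cancel option: a creator who wishes to cancel the upload can tap the cancel button.
Creators who are unfamiliar with subtitle management, such as first-time users, can open the operating guide provided by the platform before adding subtitles to learn how to add them. For example, the guide informs the creator that the original subtitles can be generated first, that the original subtitles can then be automatically translated via translators (such as viewers from around the world), and that community translators can help improve the automatically translated subtitles.
After the creator finishes adding subtitles, a prompt may also be shown informing the creator that the subtitles will be translated automatically.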
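After the creator finishes uploading the original subtitle text, a preview page can be opened, as shown in FIG. 2C. The preview page displays multiple subtitle entries and highlights them one at a time (for example, sentence by sentence). In response to a request to play the video, the video can be played with speech and subtitles synchronized. The preview page in FIG. 2C also provides options such as Edit, Cancel, and Save. The creator can tap the corresponding button to edit, cancel, or save as needed.
The subtitles added by the creator are displayed on the corresponding video playback page, as shown in FIG. 2D. FIG. 2D may also prompt the creator to adjust the subtitle position. The creator can adjust the position of the subtitles on the video playback page, for example by dragging.
Once the subtitle position has been adjusted, the subtitles have been added; the page displays a "Subtitles added" prompt and offers the creator an entry to the subtitle management page via the "Creator tools" option in "Settings", as shown in FIG. 2E.
FIG. 2F shows the "Creator tools" page. Tapping "Manage subtitles" there opens the subtitle management page, as shown in FIG. 2G. The subtitle management page in FIG. 2G displays the subtitle status of each video. The subtitle status includes "subtitles to be generated" and "subtitles added".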
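Videos whose subtitles are still to be generated are sorted by play count. In some embodiments, the play count is displayed on the video cover on the subtitle management page, for example the play count in languages other than the creator's language.
For videos with subtitles added, the subtitle management page may also display the type of subtitle translation, for example whether it is an automatic translation or a community translation. In some embodiments, for videos with subtitles added, the subtitle management page may also display the number of automatic translations and the number of community translations separately.
In some embodiments, videos on the subtitle management page are arranged by update time. For example, an update may occur when: a subtitle translation is added; the subtitles are translated by a translator; or the creator deletes a translator's subtitle translation.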
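The two orderings described for the subtitle management page can be sketched as follows. This is only an illustration: the `VideoEntry` record, its field names, and the choice of descending order (most played first, most recently updated first) are assumptions, since the disclosure states the sort keys but not the direction or any data model.

```python
from dataclasses import dataclass
from datetime import datetime


@dataclass
class VideoEntry:
    # Hypothetical record for one row of the subtitle management page.
    video_id: str
    play_count: int
    has_subtitles: bool
    updated_at: datetime


def pending_by_play_count(videos):
    """Videos still waiting for subtitles, most played first (assumed order)."""
    pending = [v for v in videos if not v.has_subtitles]
    return sorted(pending, key=lambda v: v.play_count, reverse=True)


def all_by_update_time(videos):
    """All videos, most recently updated first (assumed order)."""
    return sorted(videos, key=lambda v: v.updated_at, reverse=True)
```

In such a sketch, `updated_at` would be refreshed whenever one of the listed events occurs, for instance when a translation is added or removed.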
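The above embodiments provide a path into the "Add subtitles" page from the "Generate subtitles" notification bar in the playing video. The creator can also go directly to the "Creator tools" page shown in FIG. 2F and tap "Manage subtitles" there to open the subtitle management page. From the subtitle management page, the creator can add subtitles by tapping the "Generate subtitles" button on the video to be translated, or by tapping the cover of the video to be translated to view the video and then tapping the "Generate subtitles" button on the video.
The creator of a video can still edit the subtitles again through the subtitle management page even after the added subtitles have been submitted.
In step S3, the languages used by a user may include at least one of the following: the system language used by the user, the language of videos the user browses, and the language the user types in. For example, if the original subtitle text of a video is in English and is to be translated into Chinese or Japanese, users whose system language, browsed-video languages, and input languages include English and also include Chinese or Japanese can be selected as translators. Selecting translators whose languages match the source and target subtitle languages can more effectively improve translation quality and the efficiency of translation review.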
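The language-matching rule in step S3 can be illustrated with a short sketch. The `UserLanguages` record, its field names, and the use of language codes such as "en" are assumptions made for the example; the disclosure only names the three language signals and the matching condition.

```python
from dataclasses import dataclass, field


@dataclass
class UserLanguages:
    # Hypothetical per-user language signals named in step S3.
    user_id: str
    system: set = field(default_factory=set)   # system language(s)
    viewed: set = field(default_factory=set)   # languages of browsed videos
    typed: set = field(default_factory=set)    # languages the user types in

    def all(self):
        """Union of every language signal observed for this user."""
        return self.system | self.viewed | self.typed


def select_translators(users, source_lang, target_langs):
    """Return ids of users who use the source language and at least one target language."""
    targets = set(target_langs)
    selected = []
    for user in users:
        langs = user.all()
        if source_lang in langs and langs & targets:
            selected.append(user.user_id)
    return selected
```

For the example in the text, `select_translators(users, "en", ["zh", "ja"])` would keep users whose combined signals include English and either Chinese or Japanese.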
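A translator may be a registered community translator or an unregistered translator. Unregistered translators may also be given guidance on registering as community translators. Selecting registered translators to provide translations can further improve translation quality and the efficiency of translation review.
In some embodiments, translators may also be selected according to the type of the video to be translated and the video types that users are interested in. For example: determine a user's expected interest value for each video type from the user's historical behavior data; determine the video types the user is interested in from these expected interest values; and select, as translators, users whose video types of interest match the type of the video to be translated. Offering translators videos of the types they like can increase their motivation to translate, helps improve translation quality, and lowers the cost of translation review.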
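The disclosure does not say how the expected interest value is computed. The sketch below assumes one simple possibility, a share of weighted interactions per video type compared against a threshold; the action weights, the threshold of 0.3, and all function names are hypothetical.

```python
from collections import defaultdict

# Hypothetical weights: stronger interactions count for more.
ACTION_WEIGHTS = {"view": 1.0, "like": 3.0, "comment": 4.0, "share": 5.0}


def expected_interest(history):
    """Map each video type to its share of the user's weighted activity.

    history is an iterable of (video_type, action) pairs.
    """
    totals = defaultdict(float)
    for video_type, action in history:
        totals[video_type] += ACTION_WEIGHTS.get(action, 0.0)
    grand_total = sum(totals.values())
    if grand_total == 0:
        return {}
    return {t: w / grand_total for t, w in totals.items()}


def interested_types(history, threshold=0.3):
    """Video types whose expected interest value reaches the threshold."""
    return {t for t, s in expected_interest(history).items() if s >= threshold}


def select_by_interest(histories, video_type, threshold=0.3):
    """Ids of users whose types of interest include the video's type."""
    return [
        user_id
        for user_id, history in histories.items()
        if video_type in interested_types(history, threshold)
    ]
```

A production system would more likely use a learned recommendation model; the point here is only the three-stage shape: score per type, derive types of interest, then match against the video's type.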
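Step S5 is described below with reference to FIGS. 3A-3D. FIGS. 3A-3C show schematic diagrams of interactive interfaces for translating the original subtitle text in a video according to some embodiments of the present disclosure. FIG. 3D shows a schematic diagram of guidance for registering as a community translator according to some embodiments of the present disclosure.
In step S5, the video to be translated is provided only to the selected translator, and an interactive interface for translating the original subtitle text in the video is provided to the translator within the video, as shown in FIG. 3A. In FIG. 3A, a translation of the original subtitle text is presented to the translator in the video in the manner of an information feed. FIG. 3A may also show the original subtitle text. The interactive interface for translating the original subtitle text is not provided to users other than the selected translator.
In response to the translator's request to edit the translation, a proofreading page opens, as shown in FIG. 3B. In FIG. 3B, the proofreading page displays the original subtitle text entry by entry (for example, sentence by sentence), displays the translation below each entry, and places the editing position at the translation of the subtitle the translator selected. The translation may be a machine translation or a subtitle translation provided by another translator.
The editing interface may include a pop-up keyboard, as shown in FIG. 3C. The translator can clear the existing translation and enter their own. After completing the subtitle translation, the translator can submit it to the platform and wait for review. Once it passes review, the translator receives a notification from the platform. That is, the video processing method further includes: sending a notification to the translator after the translator's subtitle translation passes review. The translator can also open the corresponding task management interface to check the status of the subtitle translation, such as under review, approved, or rejected.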
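The review states mentioned above (under review, approved, rejected) and the approval notification can be modeled as a small state machine. This is a minimal sketch: the `TranslationTask` class, the `notify` callback, and the message text are assumptions for illustration and are not taken from the disclosure.

```python
from enum import Enum


class ReviewStatus(Enum):
    UNDER_REVIEW = "under review"
    APPROVED = "approved"
    REJECTED = "rejected"


class TranslationTask:
    """A submitted subtitle translation awaiting platform review (hypothetical model)."""

    def __init__(self, translator_id, text):
        self.translator_id = translator_id
        self.text = text
        self.status = ReviewStatus.UNDER_REVIEW

    def review(self, approved, notify):
        """Record the review outcome; notify the translator only on approval."""
        if self.status is not ReviewStatus.UNDER_REVIEW:
            raise ValueError("translation has already been reviewed")
        self.status = ReviewStatus.APPROVED if approved else ReviewStatus.REJECTED
        if approved:
            notify(self.translator_id, "Your subtitle translation was approved.")
        return self.status
```

A task management view could then simply list each task's `status.value` for the translator.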
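In some embodiments, providing the translator, within the video, with the interactive interface for translating the original subtitle text further includes providing the translator with guidance on registering as a community translator, as shown in FIG. 3D.
Step S7 is described below with reference to FIGS. 4A-5C. FIG. 4A shows a schematic diagram of a subtitle translation viewing page according to some embodiments of the present disclosure. FIG. 4B shows a schematic diagram of a translator information page according to some embodiments of the present disclosure. FIG. 5A shows a schematic diagram of a video playback page according to some embodiments of the present disclosure. FIG. 5B shows a schematic diagram of a subtitle language management page according to some embodiments of the present disclosure. FIG. 5C shows a schematic diagram of a subtitle translation viewing page according to other embodiments of the present disclosure.
In step S7, the subtitle translation returned by the translator from the interactive interface is received and reviewed. In some embodiments, the creator of the video may also be notified that a subtitle translation has passed review. The creator can follow the path provided in the notification to view the subtitle translation, for example on the subtitle translation viewing page shown in FIG. 4A. The creator can also tap the "View translator" button shown in FIG. 4A to open the translator information page and learn about the translator, as shown in FIG. 4B. FIG. 4B shows the translator's identity, for example whether the translator is a community translator and which languages the translator is proficient in, and may also show thumbnails (such as video covers) of the translator's other translated works.
The creator can also tap the video to jump to the video playback page, as shown in FIG. 5A. In FIG. 5A, the video playback page displays the original subtitle text by default but provides a language option. Tapping the language option opens the subtitle language management page, as shown in FIG. 5B, which lists the language of the original subtitle text and the languages of the available versions of the subtitle translation. The creator can select a language to view the corresponding subtitle translation, as shown in FIG. 5C. Like FIG. 4A, FIG. 5C shows the subtitle translation and a "View translator" button. As described above, the creator can also tap the "View translator" button shown in FIG. 5C to open the translator information page and learn about the translator, as shown in FIG. 4B. The creator can also remove a translator's subtitle translation using the "Remove translator's edits" button shown in FIG. 5C. Subtitle translations adopted by the creator are published.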
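The language option described for FIGS. 5A-5C amounts to a lookup over per-language subtitle tracks with the original as the default. The sketch below is one way to model that; the `SubtitleTracks` class, its method names, and the fallback to the original text when a language is unavailable are assumptions, not details from the disclosure.

```python
class SubtitleTracks:
    """Original subtitles plus published translations, keyed by language code."""

    def __init__(self, source_lang, source_text):
        self.source_lang = source_lang
        self._tracks = {source_lang: source_text}

    def publish(self, lang, text):
        """Add or replace a translation the creator has adopted."""
        self._tracks[lang] = text

    def remove(self, lang):
        """Remove a translator's version; the original track cannot be removed."""
        if lang == self.source_lang:
            raise ValueError("cannot remove the original subtitles")
        self._tracks.pop(lang, None)

    def languages(self):
        """Original language first, then translations in the order published."""
        return list(self._tracks)

    def text_for(self, lang=None):
        """Return the requested track, falling back to the original (assumed behavior)."""
        return self._tracks.get(lang, self._tracks[self.source_lang])
```

Here `remove` corresponds to the creator's "Remove translator's edits" action, and `languages()` to the list shown on the subtitle language management page.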
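Step S9 is described below with reference to FIGS. 6A-6B. FIG. 6A shows a schematic diagram of a video playback page according to other embodiments of the present disclosure. FIG. 6B shows a schematic diagram of an interactive interface for interaction between translators and viewers according to some embodiments of the present disclosure.
In step S9, the subtitle translation that has passed review is displayed in the video. As shown in FIG. 6A, the original subtitle text and the published subtitle translation are synchronized with the audio of the video.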
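Keeping subtitles synchronized with the audio usually comes down to finding the cue whose time range contains the current playback position. A minimal sketch follows, assuming cues are stored as `(start, end, text)` tuples in seconds, sorted by start time and non-overlapping; this representation is an assumption, as the disclosure does not specify a subtitle format.

```python
from bisect import bisect_right


def active_cue(cues, t):
    """Return the text of the cue active at playback time t, or None.

    cues: list of (start, end, text) tuples sorted by start, non-overlapping.
    A cue is active for start <= t < end.
    """
    starts = [cue[0] for cue in cues]
    i = bisect_right(starts, t) - 1  # last cue that starts at or before t
    if i >= 0 and t < cues[i][1]:
        return cues[i][2]
    return None
```

Because the original text and each translation share the same cue timings, the same lookup can serve any selected language track. For long videos the `starts` list would be precomputed once rather than rebuilt on every call.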
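In some embodiments, the video processing method further includes: providing, in the video, an interactive interface for interaction between the translator and viewers, as shown in FIG. 6B. In FIG. 6B, the subtitle translation that has passed review is displayed in the video together with a credit for the corresponding translator, for example the translator's username. Viewers can interact with the translator shown in FIG. 6B, for example by liking.
The above embodiments provide a complete solution for crowdsourced translation of videos, which can more effectively improve translation quality and the efficiency of translation review.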
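FIG. 7 shows a block diagram of a video processing apparatus according to some embodiments of the present disclosure. As shown in FIG. 7, the video processing apparatus 7 includes a processor 71 and a display 72.
The processor 71 is configured to: receive original subtitle text added by the creator of a video for the video to be translated; select a translator based on the original subtitle text and the languages used by users on a social network; provide the video to be translated to the selected translator, and provide the translator, within the video, with an interactive interface for translating the original subtitle text in the video; and receive the subtitle translation returned by the translator from the interactive interface. For example, the processor 71 is configured to perform steps S1-S7.
The display 72 is configured to display, in the video, the subtitle translation that has passed review, for example by performing step S9.
In addition, although not shown, the apparatus may also include a memory, which can store various information generated during operation by the video processing apparatus and the units it contains, as well as programs and data used for operation. The memory may be volatile memory and/or non-volatile memory. For example, the memory may include, but is not limited to, random access memory (RAM), dynamic random access memory (DRAM), static random access memory (SRAM), read-only memory (ROM), and flash memory. Of course, the memory may also be located outside the video processing apparatus.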
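FIG. 8 shows a block diagram of a video processing apparatus according to other embodiments of the present disclosure.
In some embodiments, the video processing apparatus 8 may be any of various types of devices, including but not limited to mobile terminals such as mobile phones, notebook computers, digital broadcast receivers, PDAs (personal digital assistants), PADs (tablet computers), PMPs (portable multimedia players), and vehicle-mounted terminals (such as vehicle-mounted navigation terminals), as well as stationary terminals such as digital TVs and desktop computers.
As shown in FIG. 8, the video processing apparatus 8 includes a memory 81 and a processor 82 coupled to the memory 81. It should be noted that the components of the video processing apparatus 8 shown in FIG. 8 are exemplary rather than limiting, and the video processing apparatus 8 may also have other components according to actual application needs. The processor 82 can control other components in the video processing apparatus 8 to perform desired functions.
In some embodiments, the memory 81 is used to store one or more computer-readable instructions. The processor 82 is used to run the computer-readable instructions, which, when run by the processor 82, implement the method according to any of the above embodiments. For the specific implementation of each step of the method and the related explanations, reference may be made to the above embodiments; repeated details are omitted here.
For example, the processor 82 and the memory 81 may communicate with each other directly or indirectly. For example, the processor 82 and the memory 81 may communicate over a network. The network may include a wireless network, a wired network, and/or any combination of wireless and wired networks. The processor 82 and the memory 81 may also communicate with each other over a system bus, which is not limited by the present disclosure.
For example, the processor 82 may be embodied as any suitable processor or processing device, such as a central processing unit (CPU), a graphics processing unit (GPU), or a network processor (NP); it may also be a digital signal processor (DSP), an application-specific integrated circuit (ASIC), a field-programmable gate array (FPGA) or other programmable logic device, a discrete gate or transistor logic device, or a discrete hardware component. The central processing unit (CPU) may be of X86 or ARM architecture, among others. For example, the memory 81 may include any combination of various forms of computer-readable storage media, such as volatile memory and/or non-volatile memory. The memory 81 may include, for example, a system memory, which stores, for example, an operating system, application programs, a boot loader, a database, and other programs. Various application programs and various data may also be stored in the storage medium.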
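In addition, according to some embodiments of the present disclosure, when the various operations/processing according to the present disclosure are implemented in software and/or firmware, the programs constituting the software may be installed from a storage medium or a network onto a computer system having a dedicated hardware structure, such as the computer system of the electronic device 900 shown in FIG. 9. With the various programs installed, the computer system can perform various functions, including those described above.
In FIG. 9, a central processing unit (CPU) 901 performs various processing according to programs stored in a read-only memory (ROM) 902 or programs loaded from a storage section 908 into a random access memory (RAM) 903. The RAM 903 also stores, as needed, data required when the CPU 901 performs the various processing. The central processing unit is merely exemplary; it may also be another type of processor, such as the processors described above. The ROM 902, the RAM 903, and the storage section 908 may be various forms of computer-readable storage media, as described below. Note that although the ROM 902, the RAM 903, and the storage section 908 are shown separately in FIG. 9, one or more of them may be combined or located in the same or different memories or storage modules.
The CPU 901, the ROM 902, and the RAM 903 are connected to one another via a bus 904. An input/output interface 905 is also connected to the bus 904.
The following components are connected to the input/output interface 905: an input section 906, such as a touch screen, touchpad, keyboard, mouse, image sensor, microphone, accelerometer, or gyroscope; an output section 907, including a display such as a cathode ray tube (CRT) or liquid crystal display (LCD), a speaker, a vibrator, and the like; a storage section 908, including a hard disk, magnetic tape, and the like; and a communication section 909, including a network interface card such as a LAN card, a modem, and the like. The communication section 909 allows communication processing to be performed via a network such as the Internet. It is easy to understand that although FIG. 9 shows the devices or modules in the electronic device 900 communicating over the bus 904, they may also communicate over a network or by other means, where the network may include a wireless network, a wired network, and/or any combination of wireless and wired networks.
A drive 910 is also connected to the input/output interface 905 as needed. A removable medium 911, such as a magnetic disk, optical disc, magneto-optical disk, or semiconductor memory, is mounted on the drive 910 as needed, so that a computer program read from it is installed into the storage section 908 as needed.
Where the above series of processing is implemented in software, the programs constituting the software may be installed from a network such as the Internet or from a storage medium such as the removable medium 911.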
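According to embodiments of the present disclosure, the processes described above with reference to the flowcharts may be implemented as computer software programs. For example, embodiments of the present disclosure include a computer program product comprising a computer program carried on a computer-readable medium, the computer program containing program code for performing the methods shown in the flowcharts. In such embodiments, the computer program may be downloaded and installed from a network via the communication section 909, installed from the storage section 908, or installed from the ROM 902. When the computer program is executed by the CPU 901, the above functions defined in the methods of the embodiments of the present disclosure are performed.
It should be noted that, in the context of the present disclosure, a computer-readable medium may be a tangible medium that can contain or store a program for use by or in conjunction with an instruction execution system, apparatus, or device. A computer-readable medium may be a computer-readable signal medium, a computer-readable storage medium, or any combination of the two. A computer-readable storage medium may be, for example, but is not limited to, an electrical, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any combination of the above. More specific examples of computer-readable storage media may include, but are not limited to: an electrical connection with one or more wires, a portable computer diskette, a hard disk, random access memory (RAM), read-only memory (ROM), erasable programmable read-only memory (EPROM or flash memory), an optical fiber, portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the above. In the present disclosure, a computer-readable storage medium may be any tangible medium that contains or stores a program that can be used by or in conjunction with an instruction execution system, apparatus, or device. Also in the present disclosure, a computer-readable signal medium may include a data signal propagated in baseband or as part of a carrier wave, carrying computer-readable program code. Such a propagated data signal may take many forms, including but not limited to an electromagnetic signal, an optical signal, or any suitable combination of the above. A computer-readable signal medium may also be any computer-readable medium other than a computer-readable storage medium that can send, propagate, or transport a program for use by or in conjunction with an instruction execution system, apparatus, or device. Program code contained on a computer-readable medium may be transmitted over any appropriate medium, including but not limited to electrical wires, optical cables, RF (radio frequency), and the like, or any suitable combination of the above.
The above computer-readable medium may be included in the above electronic device, or it may exist separately without being assembled into the electronic device.
In some embodiments, a computer program is also provided, including instructions that, when executed by a processor, cause the processor to perform the method of any of the above embodiments. For example, the instructions may be embodied as computer program code.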
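In embodiments of the present disclosure, computer program code for performing the operations of the present disclosure may be written in one or more programming languages or a combination thereof, including but not limited to object-oriented programming languages such as Java, Smalltalk, and C++, as well as conventional procedural programming languages such as the "C" language or similar programming languages. The program code may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer, or entirely on a remote computer or server. Where a remote computer is involved, the remote computer may be connected to the user's computer through any kind of network, including a local area network (LAN) or a wide area network (WAN), or may be connected to an external computer (for example, through the Internet using an Internet service provider).
The flowcharts and block diagrams in the drawings illustrate the architecture, functionality, and operation of possible implementations of systems, methods, and computer program products according to various embodiments of the present disclosure. In this regard, each block in a flowchart or block diagram may represent a module, program segment, or portion of code that contains one or more executable instructions for implementing the specified logical functions. It should also be noted that, in some alternative implementations, the functions noted in the blocks may occur in an order different from that noted in the drawings. For example, two blocks shown in succession may in fact be executed substantially in parallel, or they may sometimes be executed in the reverse order, depending on the functionality involved. It should also be noted that each block of the block diagrams and/or flowcharts, and combinations of blocks in the block diagrams and/or flowcharts, can be implemented by a dedicated hardware-based system that performs the specified functions or operations, or by a combination of dedicated hardware and computer instructions.
The modules, components, or units described in the embodiments of the present disclosure may be implemented in software or in hardware. In some cases, the name of a module, component, or unit does not limit the module, component, or unit itself.
The functions described above may be performed, at least in part, by one or more hardware logic components. For example, and without limitation, exemplary hardware logic components that can be used include: field-programmable gate arrays (FPGAs), application-specific integrated circuits (ASICs), application-specific standard products (ASSPs), systems on chip (SOCs), complex programmable logic devices (CPLDs), and so on.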
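The above description covers only some embodiments of the present disclosure and explains the technical principles applied. Those skilled in the art should understand that the scope of disclosure involved in the present disclosure is not limited to technical solutions formed by the specific combination of the above technical features, and should also cover other technical solutions formed by any combination of the above technical features or their equivalents without departing from the above disclosed concept, for example, technical solutions formed by replacing the above features with technical features having similar functions disclosed in (but not limited to) the present disclosure.
In the description provided here, numerous specific details are set forth. It is understood, however, that embodiments of the invention may be practiced without these specific details. In other instances, well-known methods, structures, and techniques have not been shown in detail so as not to obscure the understanding of this description.
Furthermore, although operations are depicted in a particular order, this should not be understood as requiring that the operations be performed in the particular order shown or in sequential order. In certain circumstances, multitasking and parallel processing may be advantageous. Likewise, although several specific implementation details are included in the above discussion, these should not be construed as limiting the scope of the present disclosure. Certain features described in the context of separate embodiments can also be implemented in combination in a single embodiment. Conversely, various features described in the context of a single embodiment can also be implemented in multiple embodiments separately or in any suitable sub-combination.
Although some specific embodiments of the present disclosure have been described in detail by way of example, those skilled in the art should understand that the above examples are for illustration only and are not intended to limit the scope of the present disclosure. Those skilled in the art should understand that the above embodiments may be modified without departing from the scope and spirit of the present disclosure. The scope of the present disclosure is defined by the appended claims.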

Claims (16)

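  1. A video processing method, comprising:
    receiving original subtitle text added by the creator of a video for the video to be translated;
    selecting a translator based on the original subtitle text and the languages used by users on a social network;
    providing the video to be translated to the selected translator, and providing the translator, within the video, with an interactive interface for translating the original subtitle text in the video;
    receiving the subtitle translation returned by the translator from the interactive interface; and
    displaying, in the video, the subtitle translation that has passed review.
  2. The video processing method according to claim 1, wherein the languages used by a user include at least one of the following: the system language used by the user, the language of videos browsed by the user, and the language input by the user.
  3. The video processing method according to claim 1 or 2, wherein the translator includes a registered community translator.
  4. The video processing method according to any one of claims 1 to 3, wherein selecting a translator further comprises:
    selecting a translator according to the type of the video to be translated and the video types that users are interested in.
  5. The video processing method according to claim 4, wherein selecting a translator according to the type of the video to be translated and the video types that users are interested in comprises:
    determining, according to historical behavior data of a user, the user's expected interest values for different types of videos;
    determining, according to the user's expected interest values for different types of videos, the video types the user is interested in; and
    selecting, as a translator, a user whose video types of interest match the type of the video to be translated.
  6. The video processing method according to any one of claims 1 to 5, wherein providing the translator, within the video, with an interactive interface for translating the original subtitle text in the video comprises:
    providing the translator, in the video and in the manner of an information feed, with a translation of the original subtitle text in the video to be translated; and
    entering a proofreading page in response to the translator's request to edit the translation.
  7. The video processing method according to any one of claims 1 to 6, wherein providing the translator, within the video, with an interactive interface for translating the original subtitle text in the video comprises:
    providing the translator with guidance on registering as a community translator.
  8. The video processing method according to any one of claims 1 to 7, wherein displaying, in the video, the subtitle translation that has passed review comprises:
    displaying, in the video, a credit for the translator corresponding to the subtitle translation.
  9. The video processing method according to any one of claims 1 to 8, further comprising:
    sending a notification to the translator after the translator's subtitle translation passes review; and/or
    providing, in the video, an interactive interface for interaction between the translator and viewers.
  10. The video processing method according to any one of claims 1 to 9, wherein the original subtitle text added by the creator of the video for the video to be translated is received by providing the creator of the video with an interactive interface for subtitle management.
  11. The video processing method according to claim 10, wherein, after the creator of the video adds the original subtitle text, a subtitle management page is entered, the subtitle management page displays the subtitle status of the video, and the subtitle status of the video includes subtitles to be generated and subtitles added.
  12. The video processing method according to claim 11, wherein:
    for a video whose subtitles are to be generated, the play count of the video is also displayed on the subtitle management page; and
    for a video with subtitles added, the type of the subtitle translation is also displayed on the subtitle management page.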
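  13. A video processing apparatus, comprising:
    a processor configured to:
    receive original subtitle text added by the creator of a video for the video to be translated;
    select a translator based on the original subtitle text and the languages used by users on a social network;
    provide the video to be translated to the selected translator, and provide the translator, within the video, with an interactive interface for translating the original subtitle text in the video; and
    receive the subtitle translation returned by the translator from the interactive interface; and
    a display configured to display, in the video, the subtitle translation that has passed review.
  14. A video processing apparatus, comprising:
    a memory; and
    a processor coupled to the memory, the processor being configured to perform, based on instructions stored in the memory, one or more steps of the video processing method according to any one of claims 1-12.
  15. A computer-readable storage medium having a computer program stored thereon, wherein the program, when executed by a processor, implements the video processing method according to any one of claims 1-12.
  16. A computer program that, when executed by a processor, implements the video processing method according to any one of claims 1-12.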
PCT/CN2022/133682 2021-11-30 2022-11-23 视频处理方法、视频处理装置和计算机可读存储介质 Ceased WO2023098531A1 (zh)

Priority Applications (5)

Application Number Priority Date Filing Date Title
KR1020247020364A KR102862444B1 (ko) 2021-11-30 2022-11-23 비디오 처리 방법, 비디오 처리 장치 및 컴퓨터-판독가능 저장 매체
EP22900336.3A EP4429258A4 (en) 2021-11-30 2022-11-23 VIDEO PROCESSING METHOD, VIDEO PROCESSING APPARATUS, AND COMPUTER READABLE STORAGE MEDIUM
AU2022399687A AU2022399687A1 (en) 2021-11-30 2022-11-23 Video processing method, video processing apparatus, and computer-readable storage medium
JP2024532477A JP2024543970A (ja) 2021-11-30 2022-11-23 ビデオ処理方法、ビデオ処理装置及びコンピュータ可読記憶媒体
AU2026200035A AU2026200035A1 (en) 2021-11-30 2026-01-05 Video processing method, video processing apparatus, and computer-readable storage medium

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202111447076.5 2021-11-30
CN202111447076.5A CN114143592B (zh) 2021-11-30 2021-11-30 视频处理方法、视频处理装置和计算机可读存储介质

Publications (1)

Publication Number Publication Date
WO2023098531A1 true WO2023098531A1 (zh) 2023-06-08

Family

ID=80386155

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/133682 Ceased WO2023098531A1 (zh) 2021-11-30 2022-11-23 视频处理方法、视频处理装置和计算机可读存储介质

Country Status (7)

Country Link
US (1) US12271708B2 (zh)
EP (1) EP4429258A4 (zh)
JP (1) JP2024543970A (zh)
KR (1) KR102862444B1 (zh)
CN (1) CN114143592B (zh)
AU (2) AU2022399687A1 (zh)
WO (1) WO2023098531A1 (zh)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114143592B (zh) * 2021-11-30 2023-10-27 抖音视界有限公司 视频处理方法、视频处理装置和计算机可读存储介质
CN114143593B (zh) * 2021-11-30 2024-07-19 抖音视界有限公司 视频处理方法、视频处理装置和计算机可读存储介质
CN115455213A (zh) * 2022-08-31 2022-12-09 北京字跳网络技术有限公司 一种多媒体作品的展示方法、装置、设备及存储介质

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103167360A (zh) * 2013-02-21 2013-06-19 中国对外翻译出版有限公司 实现多语字幕翻译的方法
US20140142918A1 (en) * 2012-10-17 2014-05-22 Proz.Com Method and apparatus to facilitate high-quality translation of texts by multiple translators
US20140303956A1 (en) * 2013-04-09 2014-10-09 International Business Machines Corporation Translating a language in a crowdsourced environment
US20160342587A1 (en) * 2015-05-21 2016-11-24 International Business Machines Corporation Social network crowd translation
US20170371869A1 (en) * 2016-06-24 2017-12-28 Facebook, Inc. Crowd-matching translators
CN110276349A (zh) * 2019-06-24 2019-09-24 腾讯科技(深圳)有限公司 视频处理方法、装置、电子设备及存储介质
CN111898388A (zh) * 2020-07-20 2020-11-06 北京字节跳动网络技术有限公司 视频字幕翻译编辑方法、装置、电子设备及存储介质
CN114143592A (zh) * 2021-11-30 2022-03-04 北京字节跳动网络技术有限公司 视频处理方法、视频处理装置和计算机可读存储介质

Family Cites Families (47)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20060227240A1 (en) * 2005-03-30 2006-10-12 Inventec Corporation Caption translation system and method using the same
US20070106516A1 (en) * 2005-11-10 2007-05-10 International Business Machines Corporation Creating alternative audio via closed caption data
US20070211169A1 (en) * 2006-03-06 2007-09-13 Dotsub Llc Systems and methods for rendering text onto moving image content
WO2008055273A2 (en) * 2006-11-05 2008-05-08 Sean Joseph Leonard System and methods for rapid subtitling
US7992183B1 (en) * 2007-11-09 2011-08-02 Google Inc. Enabling users to create, to edit and/or to rate online video captions over the web
JP2009289091A (ja) * 2008-05-30 2009-12-10 Brother Ind Ltd 翻訳者選択方法、翻訳者選択サーバ、および翻訳者選択プログラム
CN102265271B (zh) * 2008-10-29 2016-07-13 谷歌公司 用于翻译web视频中的定时文本的系统和方法
US8913188B2 (en) * 2008-11-12 2014-12-16 Cisco Technology, Inc. Closed caption translation apparatus and method of translating closed captioning
JP2010157961A (ja) 2009-01-05 2010-07-15 Kadokawa Marketing Co Ltd 字幕作成システム及びプログラム
US8645134B1 (en) * 2009-11-18 2014-02-04 Google Inc. Generation of timed text using speech-to-text technology and applications thereof
WO2012030965A2 (en) * 2010-08-31 2012-03-08 Fox Entertainment Group, Inc. Localized media content editing
US8756050B1 (en) * 2010-09-14 2014-06-17 Amazon Technologies, Inc. Techniques for translating content
US8620139B2 (en) * 2011-04-29 2013-12-31 Microsoft Corporation Utilizing subtitles in multiple languages to facilitate second-language learning
US8914276B2 (en) * 2011-06-08 2014-12-16 Microsoft Corporation Dynamic video caption translation player
JP5451696B2 (ja) * 2011-07-13 2014-03-26 Necビッグローブ株式会社 字幕付加装置、コンテンツデータ、字幕付加方法及びプログラム
US9595020B2 (en) * 2012-03-15 2017-03-14 International Business Machines Corporation Dynamic media captions in a social network environment
US9418060B1 (en) * 2012-03-19 2016-08-16 Amazon Technologies, Inc. Sample translation reviews
US20140143218A1 (en) * 2012-11-20 2014-05-22 Apple Inc. Method for Crowd Sourced Multimedia Captioning for Video Content
EP2946279B1 (en) * 2013-01-15 2019-10-16 Viki, Inc. System and method for captioning media
US10025776B1 (en) * 2013-04-12 2018-07-17 Amazon Technologies, Inc. Language translation mediation system
KR102129536B1 (ko) * 2013-08-06 2020-07-03 삼성전자주식회사 모바일 단말기 및 그의 제어 방법
US9600474B2 (en) * 2013-11-08 2017-03-21 Google Inc. User interface for realtime language translation
US20180034961A1 (en) * 2014-02-28 2018-02-01 Ultratec, Inc. Semiautomated Relay Method and Apparatus
CN104378692A (zh) 2014-11-17 2015-02-25 天脉聚源(北京)传媒科技有限公司 一种处理视频字幕的方法及装置
JP6259804B2 (ja) 2014-11-26 2018-01-10 ネイバー コーポレーションNAVER Corporation コンテンツ参加翻訳装置、及びそれを利用したコンテンツ参加翻訳方法
KR20160081032A (ko) * 2014-12-30 2016-07-08 송광식 외국어 번역 서비스 관리 시스템
RU2604984C1 (ru) * 2015-05-25 2016-12-20 Общество с ограниченной ответственностью "Аби Девелопмент" Переводческий сервис на базе электронного сообщества
US9779372B2 (en) 2015-06-25 2017-10-03 One Hour Translation, Ltd. System and method for ensuring the quality of a human translation of content through real-time quality checks of reviewers
US20170139904A1 (en) * 2015-11-16 2017-05-18 Comcast Cable Communications, Llc Systems and methods for cloud captioning digital content
KR20180006728A (ko) * 2016-07-11 2018-01-19 손솔 동영상 자막 표시 제어를 위한 전자 장치
US9703775B1 (en) 2016-08-16 2017-07-11 Facebook, Inc. Crowdsourcing translations on online social networks
US20180143956A1 (en) * 2016-11-18 2018-05-24 Microsoft Technology Licensing, Llc Real-time caption correction by audience
US20180144747A1 (en) * 2016-11-18 2018-05-24 Microsoft Technology Licensing, Llc Real-time caption correction by moderator
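KR20180066398A (ko) * 2016-12-08 2018-06-19 김미경 Method for matching video audio source and subtitles in a subtitle production, evaluation, and dissemination system
CN107688792B (zh) 2017-09-05 2020-06-05 语联网(武汉)信息技术有限公司 Video translation method and system thereof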
US11856315B2 (en) 2017-09-29 2023-12-26 Apple Inc. Media editing application with anchored timeline for captions and subtitles
US10728623B2 (en) * 2018-06-06 2020-07-28 Home Box Office, Inc. Editing timed-text elements
KR102226765B1 (ko) * 2018-06-14 2021-03-11 이동준 System and method for providing a video subtitle translation service
JP7434762B2 (ja) * 2019-09-10 2024-02-21 富士フイルムビジネスイノベーション株式会社 Information processing device and program
US11373048B2 (en) * 2019-09-11 2022-06-28 International Business Machines Corporation Translation of multi-format embedded files
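CN112584252B (zh) * 2019-09-29 2022-02-22 深圳市万普拉斯科技有限公司 Instant translation display method and device, mobile terminal, and computer storage medium
CN112437337B (zh) * 2020-02-12 2023-07-07 上海哔哩哔哩科技有限公司 Method, system, and device for implementing real-time subtitles for live streaming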
US11776578B2 (en) * 2020-06-02 2023-10-03 Trapelo Corp. Automatic modification of values of content elements in a video
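CN112104896B (zh) 2020-08-31 2023-04-07 火星语盟(深圳)科技有限公司 Subtitle editing method, terminal, server, system, and storage medium
CN112954434B (zh) 2021-02-26 2023-07-04 北京奇艺世纪科技有限公司 Subtitle processing method and system, electronic device, and storage medium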
US20220383228A1 (en) * 2021-05-25 2022-12-01 Glocalize Inc. Multilingual subtitle service system and method for controlling server thereof
CN114143593B (zh) * 2021-11-30 2024-07-19 抖音视界有限公司 Video processing method, video processing apparatus, and computer-readable storage medium

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140142918A1 (en) * 2012-10-17 2014-05-22 Proz.Com Method and apparatus to facilitate high-quality translation of texts by multiple translators
CN103167360A (zh) * 2013-02-21 2013-06-19 中国对外翻译出版有限公司 Method for implementing multilingual subtitle translation
US20140303956A1 (en) * 2013-04-09 2014-10-09 International Business Machines Corporation Translating a language in a crowdsourced environment
US20160342587A1 (en) * 2015-05-21 2016-11-24 International Business Machines Corporation Social network crowd translation
US20170371869A1 (en) * 2016-06-24 2017-12-28 Facebook, Inc. Crowd-matching translators
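CN110276349A (zh) * 2019-06-24 2019-09-24 腾讯科技(深圳)有限公司 Video processing method and apparatus, electronic device, and storage medium
CN111898388A (zh) * 2020-07-20 2020-11-06 北京字节跳动网络技术有限公司 Video subtitle translation and editing method and apparatus, electronic device, and storage medium
CN114143592A (zh) * 2021-11-30 2022-03-04 北京字节跳动网络技术有限公司 Video processing method, video processing apparatus, and computer-readable storage medium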

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See also references of EP4429258A4 *

Also Published As

Publication number Publication date
JP2024543970A (ja) 2024-11-26
KR20240110043A (ko) 2024-07-12
CN114143592A (zh) 2022-03-04
US12271708B2 (en) 2025-04-08
AU2026200035A1 (en) 2026-01-22
EP4429258A4 (en) 2025-03-05
CN114143592B (zh) 2023-10-27
KR102862444B1 (ko) 2025-09-18
EP4429258A1 (en) 2024-09-11
AU2022399687A1 (en) 2024-06-27
US20230169275A1 (en) 2023-06-01

Similar Documents

Publication Publication Date Title
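WO2023098531A1 (zh) Video processing method, video processing apparatus, and computer-readable storage medium
WO2023098533A1 (zh) Video processing method, video processing apparatus, and computer-readable storage medium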
US20160300594A1 (en) Video creation, editing, and sharing for social media
JP2023539815A (ja) Meeting minutes interaction method, apparatus, device, and medium
US9402050B1 (en) Media content creation application
CN108781311B (zh) Video player framework for a media distribution and management platform
US20170294212A1 (en) Video creation, editing, and sharing for social media
WO2025103333A1 (zh) Media content generation method, display method, and related device
US12586610B2 (en) Method, apparatus, device, storage medium and program product for video generation
WO2020220776A1 (zh) Method, apparatus, device, and medium for displaying picture-type comment data
US12401844B2 (en) Video processing method, video processing apparatus, and non-transitory computer-readable storage medium
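WO2025113546A1 (zh) Multimedia content display method and apparatus, electronic device, and storage medium
CN116339560A (zh) Information display method, information sending method, apparatus, device, and storage medium
WO2023116479A1 (zh) Video publishing method and apparatus, electronic device, storage medium, and program product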
US12574614B2 (en) Method, apparatus, device, medium and program product for obtaining text material
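CN117376636A (zh) Video processing method, apparatus, device, storage medium, and program
JP7684446B2 (ja) Video generation method, apparatus, device, storage medium, and program product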
US9940645B1 (en) Application installation using in-video programming
CN115269920A (zh) Interaction method and apparatus, electronic device, and storage medium
US20120331507A1 (en) Systems and methods for providing local event listings with video
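CN118708827A (zh) Information display method, apparatus, device, storage medium, and product
KR101396020B1 (ko) Method for providing a multimedia content editing service using a content editing tool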
Wang Short Video Applet Based on Wechat
WO2023279982A1 (zh) Product evaluation method, apparatus, and client

Legal Events

Date Code Title Description
121 EP: the EPO has been informed by WIPO that EP was designated in this application; Ref document number: 22900336; Country of ref document: EP; Kind code of ref document: A1
ENP Entry into the national phase; Ref document number: 2024532477; Country of ref document: JP; Kind code of ref document: A
WWE WIPO information: entry into national phase; Ref document number: 2022900336; Country of ref document: EP
REG Reference to national code; Ref country code: BR; Ref legal event code: B01A; Ref document number: 112024010863; Country of ref document: BR
WWE WIPO information: entry into national phase; Ref document number: 2022399687; Country of ref document: AU; Ref document number: AU2022399687; Country of ref document: AU
WWE WIPO information: entry into national phase; Ref document number: 202427045848; Country of ref document: IN
ENP Entry into the national phase; Ref document number: 2022900336; Country of ref document: EP; Effective date: 20240607
ENP Entry into the national phase; Ref document number: 20247020364; Country of ref document: KR; Kind code of ref document: A
WWE WIPO information: entry into national phase; Ref document number: 1020247020364; Country of ref document: KR
ENP Entry into the national phase; Ref document number: 2022399687; Country of ref document: AU; Date of ref document: 20221123; Kind code of ref document: A
NENP Non-entry into the national phase; Ref country code: DE
WWE WIPO information: entry into national phase; Ref document number: 11202403725Y; Country of ref document: SG
ENP Entry into the national phase; Ref document number: 112024010863; Country of ref document: BR; Kind code of ref document: A2; Effective date: 20240529