WO2025218295A1 - Procédé et appareil de lecture vidéo panoramique, dispositif et support de stockage - Google Patents
Procédé et appareil de lecture vidéo panoramique, dispositif et support de stockageInfo
- Publication number
- WO2025218295A1 WO2025218295A1 PCT/CN2025/072781 CN2025072781W WO2025218295A1 WO 2025218295 A1 WO2025218295 A1 WO 2025218295A1 CN 2025072781 W CN2025072781 W CN 2025072781W WO 2025218295 A1 WO2025218295 A1 WO 2025218295A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- texture
- target
- video frame
- resolution
- target area
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N23/00—Cameras or camera modules comprising electronic image sensors; Control thereof
- H04N23/60—Control of cameras or camera modules
- H04N23/698—Control of cameras or camera modules for achieving an enlarged field of view, e.g. panoramic image capture
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T15/00—Three-dimensional [3D] image rendering
- G06T15/04—Texture mapping
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T19/00—Manipulating three-dimensional [3D] models or images for computer graphics
- G06T19/20—Editing of three-dimensional [3D] images, e.g. changing shapes or colours, aligning objects or positioning parts
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T5/00—Image enhancement or restoration
- G06T5/80—Geometric correction
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N23/00—Cameras or camera modules comprising electronic image sensors; Control thereof
- H04N23/95—Computational photography systems, e.g. light-field imaging systems
Definitions
- the embodiments of the present disclosure relate to a method, apparatus, device, and storage medium for playing panoramic videos.
- Panoramic video also known as virtual reality video (VR video) is a video that can display a three-dimensional space. VR video can present 360-degree panoramic images to viewers, enhancing the viewing experience.
- VR video virtual reality video
- low-quality VR videos suffer from low clarity, which impacts the playback experience.
- low-quality VR videos are often processed using super-resolution processing.
- super-resolution processing consumes significant performance resources on VR devices, making them prone to issues such as video freezes, slow playback, and frame drops.
- the present disclosure provides a method, apparatus, device, and storage medium for playing panoramic videos, which can improve the playback experience and reduce performance resource overhead.
- an embodiment of the present disclosure provides a method for playing a panoramic video, including:
- the target texture is super-resolution processed to obtain a super-resolution texture
- the super-resolution texture and the original texture are fused to obtain a target video frame
- the target video frame is played, wherein the original texture represents the texture in the original video frame that does not belong to the target area.
- an embodiment of the present disclosure further provides a device for playing panoramic videos, the device comprising:
- An information acquisition module is used to obtain the shooting device posture information corresponding to the original video frame in the target panoramic video;
- a texture acquisition module configured to determine a target area of the original video frame according to the posture information of the shooting device, and acquire a target texture on a mapping model corresponding to the original video frame according to the target area, wherein a surface of the mapping model has the texture corresponding to the original video frame;
- the texture fusion module is used to perform super-resolution processing on the target texture to obtain a super-resolution texture, fuse the super-resolution texture with the original texture to obtain a target video frame, and play the target video frame, wherein the original texture represents the texture in the original video frame that does not belong to the target area.
- an embodiment of the present disclosure further provides an electronic device, the electronic device comprising:
- processors one or more processors
- a storage device for storing one or more programs
- the one or more processors implement the method for playing panoramic video as described in any embodiment of the present disclosure.
- an embodiment of the present disclosure further provides a storage medium comprising computer-executable instructions, which, when executed by a computer processor, are used to execute the method for playing a panoramic video as described in any embodiment of the present disclosure.
- FIG1 is a flow chart of a method for playing a panoramic video provided by an embodiment of the present disclosure
- FIG2 is a schematic diagram of a mapping model provided by an embodiment of the present disclosure.
- FIG3 is a flow chart of another method for playing panoramic videos provided by an embodiment of the present disclosure.
- FIG4 is a schematic diagram of the structure of a device for playing panoramic videos provided by an embodiment of the present disclosure.
- FIG5 is a schematic structural diagram of an electronic device provided by an embodiment of the present disclosure.
- the term “including” and its variations are open-ended, i.e., “including but not limited to.”
- the term “based on” means “based, at least in part, on.”
- the term “one embodiment” means “at least one embodiment,” the term “another embodiment” means “at least one additional embodiment,” and the term “some embodiments” means “at least some embodiments.”
- Other terms are defined in the following description.
- a prompt message is sent to the user to clearly inform the user that the operation requested will require the acquisition and use of the user's personal information. This allows the user to independently choose whether to provide personal information to the electronic device, application, server, storage medium, or other software or hardware that performs the operations of the disclosed technical solution based on the prompt message.
- the prompt information in response to receiving a user's active request, may be sent to the user in the form of a pop-up window, in which the prompt information may be presented in text form.
- the pop-up window may also contain a selection control for the user to select "agree” or “disagree” to provide personal information to the electronic device.
- Figure 1 is a flow chart illustrating a method for playing panoramic videos provided by an embodiment of the present disclosure.
- This embodiment of the present disclosure is applicable to scenarios where VR videos are played, such as on a virtual reality device such as VR glasses or a VR helmet.
- VR videos can be played on a smartphone.
- the method can be performed by a device for playing panoramic videos, which can be implemented in software and/or hardware.
- the method can be implemented by an electronic device such as a smartphone or a virtual reality device.
- the method includes:
- S110 Obtaining shooting device posture information corresponding to original video frames in the target panoramic video.
- the target panoramic video represents the VR video to be played on the electronic device, and the VR video appearing below may refer to the target panoramic video.
- the VR video can be played through the client configured on the electronic device.
- Multiple candidate panoramic videos can be displayed on the interactive page of the client for the user to select the target panoramic video from.
- a search control can be displayed on the interactive page of the client for the user to search for the target panoramic video based on the search control.
- VR videos may include panoramic videos.
- Panoramic videos can convert static panoramic images into dynamic video images. Panoramic videos can be viewed from any angle of 360 degrees, providing an immersive experience.
- the target panoramic video can also be a low-resolution gear corresponding to the high-resolution VR video selected by the user.
- high-resolution VR videos such as VR videos with a resolution of 8K or 4K
- the bit rate of high-resolution VR videos is also high, which increases the consumption of bandwidth resources. Therefore, after the user selects a high-resolution gear for the panoramic video, a lower-resolution target panoramic video can be selected at the start of playback.
- the shooting device here is a virtual shooting device, and the shooting device posture information represents the posture of the virtual shooting device. Different shooting device postures correspond to different visual ranges.
- the shooting device posture information includes the shooting device's front vector.
- the shooting device's front vector is obtained in the target function of the target engine.
- the shooting device's front vector of the current shooting device posture information is obtained in the target engine's frame function update.
- the target engine can be used to generate panoramic videos and perform super-resolution processing on panoramic videos.
- obtaining the shooting device posture information corresponding to the original video frame in the target panoramic video includes: parsing the objective function to obtain the front vector of the shooting device of the original video frame in the target panoramic video, wherein the objective function is used to determine the shooting device posture corresponding to the target panoramic video.
- VR videos provide viewers with a virtual reality scene (i.e., a VR scene)
- a VR scene if VR is enabled in Unity, all cameras within the VR scene can be rendered directly to the head-mounted display.
- the human eye is only sensitive to the content directly in front of the camera. Therefore, the front vector of the current camera's perspective can be obtained from the frame function update to determine the area directly in front of the camera based on the front vector of the camera. It should be noted that the camera posture corresponding to the panoramic video is updated in real time in the frame function update.
- S120 Determine a target area of the original video frame according to the shooting device posture information, and obtain a target texture on a mapping model corresponding to the original video frame according to the target area.
- the surface of the mapping model has a texture corresponding to the original video frame.
- the mapping model may include an equirectangular projection (ERP) mapping model and an equiangular cubemap (EAC) mapping model.
- the target area represents the mapping model of each original video frame of the target panoramic video and the area directly in front of the camera. Since the human eye is generally only sensitive to content in the area directly in front of the camera, the target area can be determined based on the camera's front vector from the current camera's perspective.
- the mapping model can represent a model that carries the panoramic content corresponding to the original video frame.
- the texture corresponding to the original video frame can be projected onto the surface of the mapping model.
- the projection represents the process of unfolding a real scene with a full physical field of view onto a 2D image, which can be restored to a VR device for immersive viewing.
- projection methods may include cylindrical projection and regular octahedron projection.
- Cylindrical projection directly projects a sphere onto a plane without the need for an intermediate projection geometry.
- the ERP mapping model is used to convert spherical content to a plane.
- a cubic projection format is used in which the spherical content is projected onto a cubic model, the faces are unfolded, and then the faces are spliced into a rectangle.
- Cubic projection uses perspective to map the sphere to the faces of the cube.
- the EAC mapping model is used to first deform a spherical film into a cube, and then flatten the cube into a plane consisting of 6 squares. During the projection process, the angle corresponding to each segment is adjusted so that each projected segment has a close pixel density.
- the target texture represents the texture corresponding to the target area, and the texture corresponding to the target area on the mapping model can be intercepted to obtain the target texture.
- the target area of the original video frame is determined according to the posture information of the shooting device, and the target texture on the mapping model corresponding to the original video frame is obtained according to the target area, including: determining the polar angle and azimuth angle of the intersection of the front vector of the shooting device and the mapping model; determining the intersection texture coordinates corresponding to the intersection according to the polar angle and azimuth angle; determining the target area according to the size setting information and the intersection texture coordinates of the target area, and obtaining the texture on the mapping model according to the target area as the target texture.
- a mapping model is used to project each frame of a VR video onto a plane for playback on a virtual reality device.
- the camera's forward vector from each camera's perspective intersects the mapping model.
- the mapping model is an ERP mapping model, for a given camera's perspective, the polar angle and azimuth of the intersection in the spherical coordinate system can be determined based on the three-dimensional coordinates of the intersection of the camera's forward vector from the current camera's perspective and the spherical model.
- the polar angle and azimuth represent the latitude and longitude of the intersection.
- a predetermined formula is used to calculate the intersection texture coordinates (u, v) of the intersection based on the polar angle and azimuth.
- Users can set the size of the target area through an interactive interface to obtain size setting information. For example, users can set a w*h rectangular area through the interactive interface, where w and h represent the size setting information for the target area.
- the target area is determined by combining the target area's size setting information and the intersection texture coordinates.
- a corresponding texture area in the mapping model is searched for based on the target area.
- render-to-texture technology the texture of the mapping model surface is cropped based on the texture area to obtain the target texture.
- render-to-texture technology allows developers to create a texture map based on a portion of the scene's image, and save it for later deferred shading, multi-channel rendering, or more advanced rendering effects.
- the target area is determined based on the size setting information and intersection texture coordinates of the target area, including: taking the intersection as the center of the target area, and determining the upper left corner coordinates of the target area based on the intersection texture coordinates and the length setting information and width setting information of the target area; determining the target area based on the upper left corner coordinates, length setting information and width setting information.
- the intersection is used as the center of the target area, and the intersection is used as the basis to diffuse to the surrounding areas to form the target area.
- the target area can be expressed as (u′, v′, w, h).
- the texture of the mapped model surface is cropped based on the target area (u′, v′, w, h) to obtain the target texture.
- the target area of the original video frame is determined according to the posture information of the shooting device, and the target texture on the mapping model corresponding to the original video frame is obtained according to the target area, including: dividing each surface of the mapping model into a set number of sub-surfaces, and determining the texture area of each sub-surface; for each sub-surface of the mapping model, determining the target vector according to the set coordinate points on the sub-surface; determining the target sub-surface set according to the angle between the front vector of the shooting device and the target vector corresponding to each sub-surface, and forming a target area according to the target sub-surface set; obtaining the texture corresponding to the texture area of each target sub-surface in the target sub-surface set as the target texture.
- each surface of the mapping model is divided into a set number of sub-surfaces.
- Figure 2 is a schematic diagram of a mapping model provided by an embodiment of the present disclosure. As shown in Figure 2, since the projection method of the EAC mapping model is to convert the texture of the surface of the spherical model 210 to the surface of the cube model 220. Each face of the cube model 220 is divided into 4 sub-surfaces, and an index is set. The texture area corresponding to each sub-surface is calculated, and the index and texture area of the same sub-surface are associated and stored. Optionally, the texture area can be represented by the upper left corner texture coordinate, the texture area length and the texture area width.
- the texture area length can be determined based on the length of the sub-surface, and the width of the texture area can be determined based on the width of the sub-surface.
- a target vector OA' is formed based on the center point A' of the current sub-surface and the origin O of the spherical model 210.
- the front vector of the shooting device is calculated as the angle between OV and the target vector OA'.
- the same method can be used to calculate the angle OV between the target vector corresponding to each sub-surface and the vector directly in front of the shooting device.
- the sub-surface indices are sorted in ascending order according to the angle, and a set number of indices that are sorted first are selected. If the sub-surface indices are sorted in descending order according to the angle, a set number of indices that are sorted last are selected.
- the sub-surfaces corresponding to the selected indices constitute a target sub-surface set. According to the index corresponding to each target sub-surface in the target sub-surface set, and the association between the index and the texture area, the texture area of each sub-surface is obtained. Combined with the render-to-texture technology, the texture of the mapped model surface is cropped based on the texture area to obtain the target texture.
- each surface of the EAC mapping model can be divided into more sub-surfaces.
- certain functions, components, models and other existing technologies in the industry may be mentioned. They should be considered as exemplary. Their purpose is only to illustrate the feasibility of implementing the technical solution of the present disclosure, but it does not mean that the applicant has or will necessarily use them in the present disclosure.
- the original texture represents the texture in the original video frame that does not belong to the target area.
- Super-resolution processing is the process of using optics and related optical knowledge to restore image details and other data based on known image information. Super-resolution is used to increase image resolution to prevent image quality degradation. In video playback, super-resolution processing can significantly improve video quality while maintaining the same resolution.
- the super-resolution texture represents a new texture with increased resolution obtained by performing super-resolution processing on the target texture.
- the pixels of the target area are super-resolved from 4K to 8K through super-resolution technology.
- the super-resolution result is represented by the target parameter of the screen-up function of the target engine. If the super-resolution result is successful, the super-resolution texture and the original texture are fused and screen-up. If the super-resolution result is unsuccessful, the original texture is used for screen-up.
- the screen-up function can represent a function for displaying VR video. Screen-up can be an operation for displaying VR video.
- fusing the super-resolution texture and the original texture to obtain a target video frame includes: obtaining the texture coordinates of the pixel to be rendered in the original video frame; if the texture coordinates belong to the target area, obtaining the super-resolution texture corresponding to the texture coordinates as the texture of the pixel to be rendered; if the texture coordinates do not belong to the target area, obtaining the original texture corresponding to the texture coordinates as the texture of the pixel to be rendered; and determining the target video frame based on the texture of each pixel to be rendered in the original video frame.
- the three-dimensional coordinates of the pixels to be rendered in the original video frame are converted into texture coordinates through the vertex shader, and the texture coordinates are passed to the fragment shader.
- the fragment shader determines whether the texture coordinates belong to the target area. If so, the super-resolution texture is sampled based on the texture coordinates. Otherwise, the original texture is sampled based on the texture coordinates to determine the texture of the pixel to be rendered.
- the pixel points are colored based on the texture of each pixel to be rendered in the original video frame to obtain the target video frame, which is then displayed on the screen for playback.
- the technical solution of the disclosed embodiments determines the target area of the original video frame using the camera's posture information. Based on the target area, the target texture on the mapping model corresponding to the original video frame is obtained and super-resolution processing is performed on the target texture. During playback, the super-resolution texture corresponding to each original video frame is fused with the original texture to obtain the target video frame, which is then played. Because the target area represents the viewer's area of interest, super-resolution processing is performed only on the target area, reducing performance resource overhead and alleviating issues such as lag, slow start-up, and frame drop. It also allows viewers to watch higher-quality videos, enhancing the playback experience.
- Figure 3 is a flowchart of another method for playing panoramic videos provided by an embodiment of the present disclosure. Based on the above embodiments, the present disclosure embodiment additionally defines a downscaling and super-resolution scheme. Downscaling refers to reducing the resolution level of the VR video selected by the user.
- the method includes:
- the VR client's interactive interface can display multiple resolutions for the user to choose from when the user requests the VR video.
- the user's selection of a VR video triggers a VR video selection event.
- the VR client determines the target resolution of the selected VR video.
- S320 Determine whether the target resolution meets a preset super-resolution condition. If so, execute S330; otherwise, execute S340.
- the preset super-resolution condition is used to determine whether a VR video can be downscaled and super-resolved.
- Downscaled and super-resolved may involve obtaining an original VR video with a resolution lower than the resolution of the VR video selected by the user, and performing super-resolution processing on a local area of the original VR video based on the camera posture information during playback of the original VR video, so that the local area of the original VR video achieves high-resolution image quality.
- high-resolution VR videos are prone to playback issues such as lag, slow start-up, and high memory consumption. Furthermore, high-resolution VR videos have a higher bitrate, increasing bandwidth consumption.
- low-resolution VR videos can be re-encoded to achieve high-resolution image quality. This allows for a smoother playback experience while maintaining the same image quality while using a lower bitrate video stream. This also reduces bandwidth consumption and lowers bandwidth costs.
- the VR video selected by the user has multiple resolutions and the user selects a higher resolution, it can be determined that the preset super-resolution condition is met.
- a target identifier can be used to indicate whether the VR video supports downscaling and super-resolution. After the user selects a VR video, the corresponding target identifier is used to determine whether the preset super-resolution condition is met.
- a preset button can be used to control whether the downscaling and super-resolution function is enabled for the target panoramic video.
- S330 Acquire the target panoramic video having a resolution lower than the target resolution according to the identification information of the selected panoramic video.
- the 4K source can be used as the original VR video.
- the VR client plays the 4K VR video and, during playback, performs local super-resolution based on the camera's posture information, ensuring that the local area of each frame in the VR video achieves high-resolution quality.
- a player for playing VR videos includes a video-on-demand SDK, a VR client, and a target engine. If super-resolution is turned on, the camera posture information is obtained in the frame function update of the target engine.
- the camera posture information includes the front vector of the camera, and the intersection texture coordinates of the intersection with the spherical model are calculated based on the front vector, and the target area is determined based on the intersection texture coordinates.
- the target area can be a rectangle, and the target area is represented by the upper left corner texture coordinates (u, v) and the length w and width h of the rectangle. Wherein, u, v, w, h ⁇ (0,1).
- the target texture in the target area is cropped by combining the rendering to texture technology, and the target texture is super-resolution processed to obtain a super-resolution texture, and the super-resolution result is stored in the screen function. If super-resolution is not turned on, the original texture is used for screen.
- the target engine returns the camera posture information to the VR client, and the VR client transmits the camera posture information to the GPU and triggers a render texture event.
- the video-on-demand SDK executes the fragment shader code through the rendering thread. Get the super-resolution result.
- the texture coordinates in the target area are sampled from the super-resolution texture, and the texture coordinates outside the target area are sampled from the original texture, so that the super-resolution texture and the original texture are fused and displayed on the screen. If the super-resolution fails, the original texture is used for the display.
- S340 The selected panoramic video is used as the target panoramic video.
- S350 Obtaining shooting device posture information corresponding to the original video frame in the target panoramic video.
- S360 Determine a target area of the original video frame according to the shooting device posture information, and obtain a target texture on a mapping model corresponding to the original video frame according to the target area.
- S370 Perform super-resolution processing on the target texture to obtain a super-resolution texture, fuse the super-resolution texture with the original texture to obtain a target video frame, and play the target video frame.
- the technical solution of the disclosed embodiment obtains a target VR video with a resolution lower than the target resolution corresponding to the selected gear of the VR video when the VR video starts playing.
- a local area of the original video frame in the target VR video is determined based on the posture information of the shooting device of the original video frame, and the target texture in the local area is cropped and super-resolution processed to obtain a super-resolution texture.
- the super-resolution texture and the original texture are then fused and displayed on the screen, thereby reducing the freeze rate, the non-startup rate, the playback bandwidth cost, and the memory consumption.
- Figure 4 is a schematic diagram of the structure of a device for playing panoramic videos provided by an embodiment of the present disclosure.
- the device can be implemented in the form of software and/or hardware.
- it can be implemented by an electronic device, which can be a smart phone or a virtual reality device.
- the apparatus includes: an information acquisition module 410 , a texture acquisition module 420 , and a texture fusion module 430 .
- An information acquisition module 410 is used to acquire the camera posture information corresponding to the original video frame in the target panoramic video;
- a texture acquisition module 420 configured to determine a target region of the original video frame based on the camera posture information, and acquire a target texture on a mapping model corresponding to the original video frame based on the target region, wherein a surface of the mapping model has the texture corresponding to the original video frame;
- the texture fusion module 430 is configured to perform super-resolution processing on the target texture to obtain a super-resolution texture, fuse the super-resolution texture with the original texture to obtain a target video frame, and play the target video frame, wherein the original texture represents the texture in the original video frame that does not belong to the target area.
- the information acquisition module 410 is specifically configured to:
- the objective function is parsed to obtain a front vector of the shooting device of the original video frame in the target panoramic video, wherein the objective function is used to determine a posture of the shooting device corresponding to the target panoramic video.
- the texture acquisition module 420 is specifically configured to:
- the target area is determined according to the size setting information of the target area and the texture coordinates of the intersection point, and the texture on the mapping model is acquired according to the target area as the target texture.
- determining the target area according to the size setting information of the target area and the texture coordinates of the intersection point includes:
- the target area is determined according to the upper left corner coordinates, the length setting information, and the width setting information.
- the texture acquisition module 420 is further specifically configured to:
- mapping model For each sub-surface of the mapping model, determining a target vector according to a set coordinate point on the sub-surface;
- the texture corresponding to the texture area of each target sub-face in the target sub-face set is obtained as the target texture.
- the texture fusion module 430 is specifically configured to:
- the texture coordinates belong to the target area, obtaining the super-resolution texture corresponding to the texture coordinates as the texture of the pixel to be rendered;
- a target video frame is determined based on the texture of each pixel to be rendered in the original video frame.
- the device further comprises:
- the video acquisition module is used to obtain the target resolution of the selected panoramic video if a panoramic video selection event is detected; if the target resolution meets the preset super-resolution condition, the target panoramic video with a resolution lower than the target resolution is obtained based on the identification information of the selected panoramic video.
- the device for playing panoramic videos provided by the embodiments of the present disclosure can execute the method for playing panoramic videos provided by any embodiment of the present disclosure, and has functional modules and beneficial effects corresponding to the execution method.
- Figure 5 is a schematic diagram of the structure of an electronic device provided by an embodiment of the present disclosure. Referring to Figure 5 below, it shows a schematic diagram of the structure of an electronic device (such as a smartphone or virtual reality device in Figure 5) 500 suitable for implementing an embodiment of the present disclosure.
- the terminal device in the embodiment of the present disclosure may include but is not limited to mobile terminals such as smartphones, laptops, VR glasses, VR helmets, etc. and fixed terminals such as digital TVs, desktop computers, etc.
- the electronic device shown in Figure 5 is only an example and should not bring any limitations to the functions and scope of use of the embodiments of the present disclosure.
- electronic device 500 may include a processing device (e.g., a central processing unit, a graphics processing unit, etc.) 501, which can perform various appropriate actions and processes according to a program stored in a read-only memory (ROM) 502 or a program loaded from a storage device 508 into a random access memory (RAM) 503.
- ROM read-only memory
- RAM random access memory
- Various programs and data required for the operation of electronic device 500 are also stored in RAM 503.
- Processing device 501, ROM 502, and RAM 503 are connected to each other via a bus 504.
- An input/output (I/O) interface 505 is also connected to bus 504.
- the following devices may be connected to the I/O interface 505: an input device 506 including, for example, a touch screen, a touchpad, a keyboard, a mouse, a camera, a microphone, an accelerometer, a gyroscope, etc.; an output device 507 including, for example, a liquid crystal display (LCD), a speaker, a vibrator, etc.; a storage device 508 including, for example, a magnetic tape, a hard disk, etc.; and a communication device 509.
- the communication device 509 may allow the electronic device 500 to communicate with other devices wirelessly or by wire to exchange data.
- FIG5 shows the electronic device 500 with various devices, it should be understood that not all of the devices shown are required to be implemented or present. More or fewer devices may alternatively be implemented or present.
- an embodiment of the present disclosure includes a computer program product, which includes a computer program carried on a non-transitory computer-readable medium, and the computer program includes program code for executing the method shown in the flowchart.
- the computer program can be downloaded and installed from the network through the communication device 509, or installed from the storage device 508, or installed from the ROM 502.
- the processing device 501 When the computer program is executed by the processing device 501, the above-mentioned functions defined in the method of the embodiment of the present disclosure are performed.
- the electronic device provided by the embodiment of the present disclosure and the method for playing panoramic video provided by the above embodiment belong to the same inventive concept.
- this embodiment please refer to the above embodiment, and this embodiment has the same beneficial effects as the above embodiment.
- An embodiment of the present disclosure provides a computer storage medium having a computer program stored thereon.
- the program is executed by a processor, the method for playing a panoramic video provided in the above embodiment is implemented.
- the computer-readable medium mentioned above in the present disclosure may be a computer-readable signal medium or a computer-readable storage medium, or any combination of the two.
- a computer-readable storage medium may be, for example, but not limited to, an electrical, magnetic, optical, electromagnetic, infrared, or semiconductor system, device, or component, or any combination of the above.
- Computer-readable storage media may include, but are not limited to: an electrical connection with one or more wires, a portable computer disk, a hard disk, a random access memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber, a portable compact disk read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the above.
- a computer-readable storage medium may be any tangible medium that contains or stores a program that can be used by or in conjunction with an instruction execution system, device, or component.
- a computer-readable signal medium may include a data signal propagated in baseband or as part of a carrier wave, which carries computer-readable program code. Such a propagated data signal may take a variety of forms, including but not limited to electromagnetic signals, optical signals, or any suitable combination of the above.
- a computer-readable signal medium may also be any computer-readable medium other than a computer-readable storage medium that can transmit, propagate, or transport a program for use by or in conjunction with an instruction execution system, apparatus, or device.
- the program code contained on the computer-readable medium may be transmitted using any suitable medium, including but not limited to wires, optical cables, RF (radio frequency), etc., or any suitable combination thereof.
- the client and server can communicate using any currently known or later developed network protocol, such as HTTP (HyperText Transfer Protocol), and can be interconnected with any form or medium of digital data communication (e.g., a communication network).
- HTTP HyperText Transfer Protocol
- Examples of communication networks include a local area network ("LAN”), a wide area network ("WAN”), an internet (e.g., the Internet), and a peer-to-peer network (e.g., an ad hoc peer-to-peer network), as well as any currently known or later developed network.
- the computer-readable medium may be included in the electronic device, or may exist independently without being incorporated into the electronic device.
- the computer-readable medium carries one or more programs.
- the electronic device When the one or more programs are executed by the electronic device, the electronic device:
- the target texture is super-resolution processed to obtain a super-resolution texture
- the super-resolution texture and the original texture are fused to obtain a target video frame
- the target video frame is played, wherein the original texture represents the texture in the original video frame that does not belong to the target area.
- Computer program code for performing the operations of the present disclosure may be written in one or more programming languages, or a combination thereof, including, but not limited to, object-oriented programming languages such as Java, Smalltalk, C++, and conventional procedural programming languages such as "C" or similar programming languages.
- the program code may be executed entirely on the user's computer, partially on the user's computer, as a stand-alone software package, partially on the user's computer and partially on a remote computer, or entirely on the remote computer or server.
- the remote computer may be connected to the user's computer through any type of network, including a local area network (LAN) or a wide area network (WAN), or may be connected to an external computer (e.g., through the Internet using an Internet service provider).
- LAN local area network
- WAN wide area network
- Internet service provider e.g., AT&T, MCI, Sprint, EarthLink, MSN, GTE, etc.
- each box in the flowchart or block diagram can represent a module, program segment, or a part of code, and the module, program segment, or a part of code contains one or more executable instructions for realizing the specified logical function.
- the functions marked in the box can also occur in a different order than that marked in the accompanying drawings. For example, two boxes represented in succession can actually be executed substantially in parallel, and they can sometimes be executed in the opposite order, depending on the functions involved.
- each box in the block diagram and/or flowchart, and the combination of the boxes in the block diagram and/or flowchart can be implemented with a dedicated hardware-based system that performs the specified function or operation, or can be implemented with a combination of dedicated hardware and computer instructions.
- the units involved in the embodiments described in this disclosure may be implemented in software or hardware, wherein the name of a unit does not necessarily limit the unit itself.
- exemplary types of hardware logic components include: field programmable gate arrays (FPGAs), application specific integrated circuits (ASICs), application specific standard products (ASSPs), systems on chip (SOCs), complex programmable logic devices (CPLDs), and the like.
- FPGAs field programmable gate arrays
- ASICs application specific integrated circuits
- ASSPs application specific standard products
- SOCs systems on chip
- CPLDs complex programmable logic devices
- a machine-readable medium can be a tangible medium that can contain or store a program for use by or in conjunction with an instruction execution system, device or equipment.
- a machine-readable medium can be a machine-readable signal medium or a machine-readable storage medium.
- a machine-readable medium can include, but is not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, device or equipment, or any suitable combination of the foregoing.
- a more specific example of a machine-readable storage medium can include an electrical connection based on one or more lines, a portable computer disk, a hard disk, a random access memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber, a portable compact disk read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing.
- RAM random access memory
- ROM read-only memory
- EPROM or flash memory erasable programmable read-only memory
- CD-ROM portable compact disk read-only memory
- CD-ROM compact disk read-only memory
- magnetic storage device or any suitable combination of the foregoing.
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Physics & Mathematics (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Computer Graphics (AREA)
- Computer Hardware Design (AREA)
- Software Systems (AREA)
- Computing Systems (AREA)
- General Engineering & Computer Science (AREA)
- Architecture (AREA)
- Processing Or Creating Images (AREA)
- Television Signal Processing For Recording (AREA)
Abstract
Les modes de réalisation de la présente divulgation concernent un procédé et un appareil de lecture vidéo panoramique, un dispositif et un support de stockage. Le procédé de lecture de vidéo panoramique comprend les étapes consistant à : acquérir des informations d'orientation de dispositif de photographie correspondant à une trame vidéo d'origine dans une vidéo panoramique cible ; déterminer une zone cible de la trame vidéo d'origine sur la base des informations d'orientation de dispositif de photographie, et sur la base de la zone cible, acquérir une texture cible sur un modèle de correspondance correspondant à la trame vidéo d'origine ; et effectuer un traitement de super-résolution sur la texture cible pour obtenir une texture de super-résolution, fusionner la texture de super-résolution et une texture d'origine pour obtenir une trame vidéo cible, et lire la trame vidéo cible. Étant donné qu'une zone cible représente une zone de focalisation d'audience, un traitement de super-résolution n'est effectué que sur la zone cible, ce qui permet de réduire le surdébit de ressources de performance, de réduire les problèmes de saccade, de démarrage de lecture lente, de pertes de trame, etc., et de permettre à l'audience de regarder des vidéos ayant une qualité d'image élevée, améliorant ainsi l'expérience de lecture.
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202410452487.0 | 2024-04-15 | ||
| CN202410452487.0A CN118413745A (zh) | 2024-04-15 | 2024-04-15 | 一种播放全景视频的方法、装置、设备及存储介质 |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2025218295A1 true WO2025218295A1 (fr) | 2025-10-23 |
Family
ID=92032086
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/CN2025/072781 Pending WO2025218295A1 (fr) | 2024-04-15 | 2025-01-16 | Procédé et appareil de lecture vidéo panoramique, dispositif et support de stockage |
Country Status (2)
| Country | Link |
|---|---|
| CN (1) | CN118413745A (fr) |
| WO (1) | WO2025218295A1 (fr) |
Families Citing this family (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN118413745A (zh) * | 2024-04-15 | 2024-07-30 | 北京字跳网络技术有限公司 | 一种播放全景视频的方法、装置、设备及存储介质 |
Citations (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN103400409A (zh) * | 2013-08-27 | 2013-11-20 | 华中师范大学 | 一种基于摄像头姿态快速估计的覆盖范围3d可视化方法 |
| US20140210824A1 (en) * | 2013-01-28 | 2014-07-31 | The Boeing Company | Panoptic Visualization Of A Three-Dimensional Representation Of A Complex System |
| CN107197135A (zh) * | 2016-03-21 | 2017-09-22 | 成都理想境界科技有限公司 | 一种视频生成方法、播放方法及视频生成装置、播放装置 |
| CN110324640A (zh) * | 2019-07-12 | 2019-10-11 | 青岛一舍科技有限公司 | 一种智能全景视频播放方法及装置 |
| CN112465939A (zh) * | 2020-11-25 | 2021-03-09 | 上海哔哩哔哩科技有限公司 | 全景视频渲染方法及系统 |
| CN118413745A (zh) * | 2024-04-15 | 2024-07-30 | 北京字跳网络技术有限公司 | 一种播放全景视频的方法、装置、设备及存储介质 |
-
2024
- 2024-04-15 CN CN202410452487.0A patent/CN118413745A/zh active Pending
-
2025
- 2025-01-16 WO PCT/CN2025/072781 patent/WO2025218295A1/fr active Pending
Patent Citations (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20140210824A1 (en) * | 2013-01-28 | 2014-07-31 | The Boeing Company | Panoptic Visualization Of A Three-Dimensional Representation Of A Complex System |
| CN103400409A (zh) * | 2013-08-27 | 2013-11-20 | 华中师范大学 | 一种基于摄像头姿态快速估计的覆盖范围3d可视化方法 |
| CN107197135A (zh) * | 2016-03-21 | 2017-09-22 | 成都理想境界科技有限公司 | 一种视频生成方法、播放方法及视频生成装置、播放装置 |
| CN110324640A (zh) * | 2019-07-12 | 2019-10-11 | 青岛一舍科技有限公司 | 一种智能全景视频播放方法及装置 |
| CN112465939A (zh) * | 2020-11-25 | 2021-03-09 | 上海哔哩哔哩科技有限公司 | 全景视频渲染方法及系统 |
| CN118413745A (zh) * | 2024-04-15 | 2024-07-30 | 北京字跳网络技术有限公司 | 一种播放全景视频的方法、装置、设备及存储介质 |
Also Published As
| Publication number | Publication date |
|---|---|
| CN118413745A (zh) | 2024-07-30 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US20200286279A1 (en) | Method, apparatus, medium, and device for processing multi-angle free-perspective image data | |
| CN109743626B (zh) | 一种图像显示方法、图像处理方法和相关设备 | |
| WO2024104248A1 (fr) | Procédé et appareil de rendu pour panorama virtuel, dispositif, et support de stockage | |
| CN114531553B (zh) | 生成特效视频的方法、装置、电子设备及存储介质 | |
| US9996311B2 (en) | Efficient communication interface for casting interactively controlled visual content | |
| US20240276068A1 (en) | Subtitle rendering method and device for virtual reality space, equipment and medium | |
| US20220159197A1 (en) | Image special effect processing method and apparatus, and electronic device and computer readable storage medium | |
| US20250061665A1 (en) | Image display method, electronic device and storage medium | |
| CN114419213A (zh) | 图像处理方法、装置、设备和存储介质 | |
| US20250292444A1 (en) | Video synthesis method, apparatus, device, medium and product | |
| US20250024088A1 (en) | Video generation method and apparatus, and device and storage medium | |
| CN114900621B (zh) | 特效视频确定方法、装置、电子设备及存储介质 | |
| WO2025213828A1 (fr) | Procédé et appareil de diffusion en direct, dispositif électronique et support de stockage | |
| CN116302268A (zh) | 媒体内容的展示方法、装置、电子设备和存储介质 | |
| WO2025218295A1 (fr) | Procédé et appareil de lecture vidéo panoramique, dispositif et support de stockage | |
| WO2024193511A1 (fr) | Procédé et appareil d'interaction, et dispositif électronique et support lisible par ordinateur | |
| US12041379B2 (en) | Image special effect processing method, apparatus, and electronic device, and computer-readable storage medium | |
| CN115761090B (zh) | 特效渲染方法、装置、设备、计算机可读存储介质及产品 | |
| CN117115267A (zh) | 免标定的图像处理方法、装置、电子设备和存储介质 | |
| CN111818265B (zh) | 基于增强现实模型的交互方法、装置、电子设备及介质 | |
| CN117830810A (zh) | 图像处理方法、装置、设备及介质 | |
| US20250193359A1 (en) | Method, apparatus, electronic device, and storage medium for image display | |
| CN118694910B (zh) | 用于三维场景的视频融合方法、装置、系统、设备及介质 | |
| US20240269553A1 (en) | Method, apparatus, electronic device and storage medium for extending reality display | |
| CN111277886A (zh) | 全景视频的视场控制方法、装置、电子设备及存储介质 |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 25789169 Country of ref document: EP Kind code of ref document: A1 |