WO2023005579A1 - 视频编码、视频解码方法、装置、电子设备和存储介质 - Google Patents
视频编码、视频解码方法、装置、电子设备和存储介质 Download PDFInfo
- Publication number
- WO2023005579A1 WO2023005579A1 PCT/CN2022/102406 CN2022102406W WO2023005579A1 WO 2023005579 A1 WO2023005579 A1 WO 2023005579A1 CN 2022102406 W CN2022102406 W CN 2022102406W WO 2023005579 A1 WO2023005579 A1 WO 2023005579A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- image
- video
- weighted prediction
- parameters
- weighted
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/503—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
- H04N19/51—Motion estimation or motion compensation
- H04N19/573—Motion compensation with multiple frame prediction using two or more reference frames in a given prediction direction
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/124—Quantisation
- H04N19/126—Details of normalisation or weighting functions, e.g. normalisation matrices or variable uniform quantisers
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/103—Selection of coding mode or of prediction mode
- H04N19/105—Selection of the reference unit for prediction within a chosen coding or prediction mode, e.g. adaptive choice of position and number of pixels used for prediction
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/134—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
- H04N19/136—Incoming video signal characteristics or properties
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/134—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
- H04N19/136—Incoming video signal characteristics or properties
- H04N19/14—Coding unit complexity, e.g. amount of activity or edge presence estimation
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/17—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
- H04N19/172—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a picture, frame or field
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/17—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
- H04N19/174—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a slice, e.g. a line of blocks or a group of blocks
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/42—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/46—Embedding additional information in the video signal during the compression process
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/60—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
- H04N19/61—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding in combination with predictive coding
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/70—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
Definitions
- the present application relates to the technical field of image processing, and in particular to a video encoding and decoding method, device, electronic equipment and storage medium.
- inter-frame prediction technology can effectively eliminate data redundancy in the time domain and greatly reduce the video transmission bit rate.
- conventional inter-frame motion estimation domain motion compensation will be difficult to achieve the ideal data compression effect, even in actual coding
- the decision result of the optimization model is usually to use intra-frame prediction coding, which greatly reduces the video coding efficiency. Therefore, in order to improve the coding effect in the content brightness change scene, weighted prediction technology can be used in video coding.
- the coding end needs to determine the brightness change weight and offset of the current image relative to the reference image, and The corresponding weighted predicted frame is generated by the brightness compensation operation.
- weighted prediction technology has been proposed, and there are two modes of weighted prediction in application, which are explicit weighted prediction and implicit weighted prediction.
- implicit weighted prediction the model parameters are fixed, that is, the codec end agrees to use the same weighted prediction parameters, and the parameters do not need to be transmitted by the encoder end, which reduces the pressure of code stream transmission and improves transmission efficiency.
- the weighted prediction parameters in implicit mode are fixed, when it is applied to inter-frame unidirectional prediction, the varying distance between the current frame and the reference frame will lead to unsatisfactory prediction performance with fixed weights.
- weighted prediction involves three prediction parameters, namely weight (weight), offset (offset), log weight denominator (logarithmic weight denominator).
- weight weight
- offset offset
- log weight denominator logarithmic weight denominator
- a series of parameter information for weighted prediction can be included in the Picture header or Slice header. It is worth noting that each luma or chrominance component of the reference image has independent weighted prediction parameters.
- each luma or chrominance component of the reference image has independent weighted prediction parameters.
- each reference image is only equipped with a set of weighted prediction parameters (that is, a set of weights and offsets that cooperate with each other).
- a set of weighted prediction parameters that is, a set of weights and offsets that cooperate with each other.
- the current entire image has a complete and consistent brightness variation form, only A good prediction effect can be achieved by selecting the nearest forward reference image and its weighted prediction parameters.
- multiple different reference images can be selected according to existing standards, and artificially configure the weights of adaptive reference images for different areas of the current image. and offset, as shown in Figure 1.
- the buffer at the decoding end can only store a small amount of decoded reference images, especially for media content with a large amount of data such as ultra-high-definition video and panoramic video
- the solution shown in Figure 1 can only be applied to partial brightness changes in practical applications. Scenes.
- the main purpose of the embodiment of the present application is to propose a video coding method, device, electronic equipment and storage medium, aiming at realizing flexible video coding in complex graphics brightness changing scenes, improving video coding efficiency, and reducing the impact of graphics brightness changes on coding efficiency. Influence.
- An embodiment of the present application provides a video encoding method, wherein the method includes the following steps: acquiring a video image, wherein the video image is at least one frame image of the video; performing weighted predictive encoding on the video image to generate an image code stream, wherein the weighted predictive encoding uses at least one set of weighted predictive identification information and parameters.
- An embodiment of the present application provides a video decoding method, wherein the method includes the following steps: obtaining an image code stream, and parsing the weighted prediction identification information and parameters in the image code stream; The image code stream is decoded to generate a reconstructed image.
- the embodiment of the present application also provides a video encoding device, wherein the device includes the following modules: an image acquisition module, configured to acquire a video image, wherein the video image is at least one frame image of a video; a video encoding module, configured to performing weighted predictive encoding on the video image to generate an image code stream, wherein the weighted predictive encoding uses at least one set of weighted predictive identification information and parameters.
- the embodiment of the present application also provides a video decoding device, wherein the device includes the following modules: a code stream acquisition module, configured to obtain an image code stream, and analyze the weighted prediction identification information and parameters in the image code stream; A reconstruction module, configured to decode the image code stream according to the weighted prediction identification information and parameters to generate a reconstructed image.
- a code stream acquisition module configured to obtain an image code stream, and analyze the weighted prediction identification information and parameters in the image code stream
- a reconstruction module configured to decode the image code stream according to the weighted prediction identification information and parameters to generate a reconstructed image.
- An embodiment of the present application also provides an electronic device, wherein the electronic device includes: one or more processors; a memory for storing one or more programs; when the one or more programs are executed by the one or more Multiple processors are executed, so that the one or multiple processors implement the method described in any one of the embodiments of the present application.
- the embodiment of the present application also provides a computer-readable storage medium, on which a computer program is stored, and when the computer program is executed by a processor, the method described in any one of the embodiments of the present application is implemented.
- the video image is at least one frame image in the video, and performing weighted predictive coding on the video image to generate an image code stream, wherein at least one set of weighted predictive identification information is used in the weighted predictive coding process and parameters, the flexible coding of video images is realized, the video coding efficiency can be improved, and the influence of video image brightness changes on coding efficiency can be reduced.
- Figure 1 is an example diagram of region-adaptive weighted prediction for multiple reference images in some technical solutions
- FIG. 2 is a flow chart of a video encoding method provided by an embodiment of the present application.
- FIG. 3 is a flow chart of another video coding method provided by an embodiment of the present application.
- FIG. 4 is a flow chart of another video encoding method provided by an embodiment of the present application.
- FIG. 5 is an example diagram of a video coding method provided by an embodiment of the present application.
- FIG. 6 is an example diagram of another video coding method provided by the embodiment of the present application.
- FIG. 7 is a flowchart of a video decoding method provided by an embodiment of the present application.
- FIG. 8 is a flow chart of another video decoding method provided by an embodiment of the present application.
- FIG. 9 is an example diagram of a video decoding method provided by an embodiment of the present application.
- FIG. 10 is an example diagram of another video decoding method provided by an embodiment of the present application.
- FIG. 11 is a schematic structural diagram of a video encoding device provided by an embodiment of the present application.
- FIG. 12 is a schematic structural diagram of a video decoding device provided by an embodiment of the present application.
- FIG. 13 is a schematic structural diagram of an encoder provided in an embodiment of the present application.
- FIG. 14 is a schematic structural diagram of a decoder provided in an embodiment of the present application.
- FIG. 15 is a schematic structural diagram of an electronic device provided by an embodiment of the present application.
- Fig. 2 is a flow chart of a video coding method provided by the embodiment of the present application.
- the embodiment of the present application is applicable to video coding in a brightness change scene, and the method can be executed by a video coding device, which can be implemented by software and/or Or hardware implementation, and generally integrated in the terminal equipment, referring to Figure 2, the method provided by the embodiment of the present application specifically includes the following steps:
- Step 110 Acquire a video image, where the video image is at least one frame image of the video.
- the video image may need to be transmitted video data, and the video image may be a certain frame of data in the video data sequence or a corresponding frame of data at a certain moment.
- video data may be processed, and one or more frames of image data may be extracted as video data for video encoding.
- Step 120 Perform weighted predictive encoding on the video image to generate an image code stream, wherein the weighted predictive encoding uses at least one set of weighted predictive identification information and parameters.
- the current frame can be equivalent to multiplying a weight by the previous frame as a whole, plus an offset to perform video encoding based on the previous frame.
- This process of video encoding through weight and offset can be It is called weighted predictive coding.
- three prediction parameters are mainly involved, which are weight, offset, and log weight denominator.
- the log weight denominator It can avoid the floating-point operation in the encoding process and amplify the weight.
- the weighted prediction identification information may be identification information of parameters used for weighted prediction, and the parameters may be specific parameters of the weighted prediction identification information, which may include at least one of weight, offset, and logarithmic weight denominator.
- the image code stream may be data generated after video image encoding, and the image code stream may be used for transmission between terminal devices.
- weighted predictive encoding can be performed on video images, and one or more sets of weighted predictive identification information and parameters can be used in the process of weighted predictive encoding, for example, different weights can be used for video images of different frames
- the prediction identification information and parameters, or different weighted prediction identification information and parameters may be used for video images in different regions within the same frame. It can be understood that, in the process of performing weighted predictive coding on video images, different weighted predictive identification information and parameters can be selected for video coding according to brightness changes of video images.
- the video image is at least one frame image in the video, and performing weighted predictive coding on the video image to generate an image code stream, wherein at least one set of weighted predictive identification information is used in the weighted predictive coding process and parameters, the flexible coding of video images is realized, the video coding efficiency can be improved, and the influence of video image brightness changes on coding efficiency can be reduced.
- Fig. 3 is a flowchart of another video encoding method provided by the embodiment of the present application.
- the embodiment of the present application is based on the embodiment of the above application. See Fig. 3.
- the method provided by the embodiment of the present application specifically includes the following step:
- Step 210 Acquire a video image, where the video image is at least one frame image of the video.
- Step 220 determine the brightness change according to the comparison result between the video image and the reference image.
- the reference image can be the book data located before or after the currently processed video image in the video data sequence, the reference image can be used for the motion estimation operation of the currently processed video image, and the number of reference images can be one or more frames .
- the change in brightness may be the change in brightness of the video image compared to the reference image, and the change in brightness may specifically be determined by the change in pixel values between the video image and the reference image.
- the video image can be compared with the reference image
- the comparison method can include calculating the difference between the pixel values of the corresponding positions or calculating the difference between the pixel average value of the video image and the reference image, etc.
- the video can be
- the comparison result of the image and the reference image determines the change of brightness, which may include gradually brightening, gradually darkening, no change, random change, and the like.
- the brightness change situation includes at least one of the following: an average value of image brightness change, and a pixel point brightness change value.
- the brightness change between the video image and the reference image can be determined by the average value of the image brightness change and/or the brightness change value of the pixel point, wherein the average value of the image brightness change can refer to the average value change of the brightness value of the current image and the reference image, and the pixel point
- the luminance change value may be a change between the luminance value of each pixel in the video image and the luminance value of the corresponding pixel in the reference image. It can be understood that the luminance change may also be a change of other luminance value statistical properties, for example, luminance variance, luminance mean square deviation, and the like.
- Step 230 Perform weighted predictive encoding on the video image according to the brightness variation of the video image, wherein the weighted predictive encoding uses at least one set of weighted predictive identification information and parameters.
- multiple sets of weighted prediction identification information and parameters can be preset, and the corresponding prediction identification information and parameters can be selected according to the brightness change to perform weighted prediction encoding on video images.
- the predictive weighted encoding can be performed on the video image according to the specific content of the brightness change.
- different weighted prediction identification information and parameters can be selected for the weighted predictive encoding according to the brightness change of the video image of different frames. Different regions in the video image select different weighted prediction identification information and parameters for weighted prediction coding.
- Step 240 write weighted prediction identification information and parameters into the image code stream.
- the weighted prediction identification information and parameters used in the encoding process can be written into the image code stream to facilitate the video decoding process in the subsequent process.
- the image code stream realizes flexible encoding of video images, which can improve video encoding efficiency and reduce the impact of video image brightness changes on encoding efficiency.
- the weighted predictive coding of the video image according to the brightness change includes at least one of the following:
- weighted predictive encoding is performed on the video image
- the change in brightness is that the brightness of the partitioned images is consistent, it is determined to perform weighted predictive coding on the partitioned images in the video image respectively.
- the uniform brightness of the entire frame of the image may mean that the brightness changes of the entire image frame of the video image are the same.
- the uniform brightness of the partition image may mean that there are multiple regions in the video image, and the brightness changes of each region are different.
- weighted predictive coding can be performed on the entire frame of video image; In various ways, weighted predictive encoding can be performed for each image area. It can be understood that when the brightness changes of each image area are different, the weighted predictive identification information and parameters used in the weighted predictive encoding process can be different.
- the video image has at least one set of the weighted prediction identification information and parameters for the reference image or a partition image of the reference image.
- the weighted prediction identification information and parameters used in the weighted prediction coding process of the video image are equivalent to the information determined by the reference image, and the video image has one or more sets of weighted prediction identification information and parameters based on the reference image , when the reference image of the video image is multiple frames, the video image can have one or more sets of weighted prediction identification information and parameters for each frame of reference image, and each set of weighted prediction identification information and parameters can be related to the corresponding video image and/or Or there is an associated relationship with the reference image.
- Fig. 4 is a flowchart of another video coding method provided by the embodiment of the present application.
- the embodiment of the present application is based on the embodiment of the above application. Referring to Fig. 4, the method provided by the embodiment of the present application specifically includes the following steps :
- Step 310 Acquire a video image, where the video image is at least one frame image of the video.
- Step 320 perform weighted predictive coding on the video image according to the pre-trained neural network model.
- the neural network model can perform weighted predictive coding processing on the video image, and can determine the weighted prediction identification information and parameters used in the video image, and the neural network model can be generated by training with image samples marked with the weighted prediction identification information and parameters.
- the neural network model can determine the weighted prediction identification information and parameters of the video image or directly determine the image code stream of the video image.
- the video image may be directly or indirectly input into the neural network model, and the weighted predictive coding of the video image may be implemented through the neural network model.
- the input layer of the neural network model can accept video images or features of video images, and the neural network model can generate weighted prediction identification information and parameters used in weighted predictive coding of video images, or directly perform weighted prediction on video images coding.
- Step 330 write weighted prediction identification information and parameters into the image code stream.
- the video by acquiring a video image, using a pre-trained neural network model to perform weighted predictive encoding on the video image, and writing the weighted predictive identification information and parameters used in the weighted predictive encoding into the image code stream generated by the image video encoding, the video
- the flexible encoding of images can improve video encoding efficiency and reduce the impact of video image brightness changes on encoding efficiency.
- the weighted prediction identification information further includes a neural network model structure and neural network model parameters.
- the weighted prediction identification information may also include the neural network model structure and neural network model parameters, wherein the neural network model result may be information reflecting the neural network result, for example, the function used by the fully connected layer, the activation function, the loss function etc., the neural network model parameters may be specific values of the parameters of the neural network model, for example, the network weight value, the number of hidden layers, and the like.
- the set of weighted prediction identification information and parameters used in the weighted predictive encoding corresponds to one frame of the video image or at least one partition image of the video image.
- one or more sets of weighted prediction identification information and parameters can be used in the weighted prediction coding process, and each set of machine prediction identification information and parameters corresponds to a frame of video image or a video image in the weighted prediction coding process.
- a partitioned image which may be a part of a video image, for example, a sliced image or a sub-image.
- the specification of the partition image includes at least one of the following: Slice, Tile, Subpicture, Coding Tree Unit, Coding Unit.
- the partitioned image can be one of Slice, Tile, Subpicture, Coding Tree Unit, Coding Unit, or Various.
- the weighted prediction identification information and the parameters are included in at least one of the following parameter sets: sequence layer parameter set, image layer parameter set, slice layer parameter set, supplementary enhancement information, video availability information, image header information, slice header information, network abstraction layer unit header information, coding tree unit, coding unit.
- the weighted prediction identification information and parameters can be written into the image code stream, and the identification information and parameters are included in all or part of the following parameter sets: sequence layer parameter set, image layer parameter set, slice layer parameter set, supplementary enhancement information , video usability information, image header information, slice header information, network abstraction layer unit header information, or as a new information unit, may also be included in the coding tree unit or the coding unit.
- the weighted prediction identification information and the parameters include at least one of the following information: reference image index information, weighted prediction enable control information, and region-adaptive weighted prediction enable control information , Weighted prediction parameters.
- the weighted prediction identification information may be reference image index information, which is used to determine the reference image used for brightness changes, and weighted prediction enabling control information, which is used to determine whether to perform weighted predictive coding, region adaptive Weighted prediction enabling control information, this information is used to determine whether to perform area weighted predictive coding on the image video, weighted prediction parameters can be parameters used in the weighted predictive coding process, and can include weight, offset, logarithmic weight denominator, etc.
- the image code stream includes a transport stream or a media file.
- the image code stream may be a transport stream or a media file.
- FIG. 5 is an example diagram of a video encoding method provided in the embodiment of the present application.
- the input of the encoding process is the image contained in the video
- the output is the image code stream or the video code stream containing the image code stream.
- the process of video encoding may include the following steps:
- Step 101 Read an image, where the image may be a frame of data in a video sequence, or a frame of data corresponding to a certain moment.
- Step 102 Detect brightness changes of the image relative to a reference image.
- the reference image may refer to an image before the image on the timeline in the video sequence, or an image after the current image on the timeline, and the reference image is used to perform a motion estimation operation on the current image.
- the reference image can have one frame or multiple frames;
- Brightness change can refer to the mean value change of the brightness value of the current image and the reference image, or the brightness value of each pixel of the current image and the brightness value of the pixel corresponding to the reference image, or the change of other statistical characteristics of the brightness value. ;
- the current image needs to detect brightness changes for each frame of the reference image.
- Step 103 According to the brightness change detection result in step 102, it is judged whether a weighted prediction operation needs to be adopted.
- the basis for judging includes a trade-off between improving coding efficiency and increasing coding complexity in a weighted prediction operation for a specific luminance change situation.
- Step 104 When weighted prediction operation is required, perform weighted prediction encoding, and record the index information of the aforementioned reference image, one or more sets of weighted prediction parameters, and/or weighted prediction parameter index information.
- the encoder records the regional characteristics of the luminance change.
- the regional characteristics of the luminance change include that the luminance change keeps a consistent change in the entire frame image area, or the luminance change keeps a consistent change in a partial area of the image.
- the partial area may be one or more slices (Slice), or one or more tiles (Tile), or one or more subpictures (Subpicture), or one or more coding tree units (Coding Tree Unit, CTU), or one or more coding units (Coding Unit, CU);
- the brightness change is consistent in the entire frame image area, it can be considered that the brightness change of the current image corresponding to the reference image is uniform, otherwise it is not uniform;
- weighted predictive coding If the brightness change of the current image is uniform, perform weighted predictive coding, and record the aforementioned reference image index information and a set of weighted prediction parameters, wherein the weighted prediction parameters include weight and offset;
- one image region may correspond to a set of weighted prediction parameters, or multiple image regions may correspond to a set of weighted prediction parameters.
- Step 105 Write weighted prediction identification information and parameters into the code stream, and the identification information and parameters are included in all or part of the following parameter sets: sequence layer parameter set, image layer parameter set, slice layer parameter set, supplementary enhancement information , video usability information, image header information, slice header information, network abstraction layer unit header information, or as a new information unit, may also be included in the coding tree unit or the coding unit.
- Step 106 When no weighted prediction operation is needed, directly perform image coding based on traditional methods;
- Step 107 Output the image code stream or the transport stream or media file containing the image code stream.
- FIG. 6 is an example diagram of a video encoding method provided by the embodiment of the present application. Referring to FIG. 6, this embodiment applies deep learning technology to region-adaptive weighted prediction operations, and the process of video encoding May include the following steps:
- Step 201 Read an image, where the image may be a frame of data in a video sequence, or a frame of data corresponding to a certain moment.
- Step 202 Determine whether to use a weighted prediction scheme based on deep learning technology.
- Step 203 When using deep learning technology, use the neural network model and parameters generated by training to perform image weighted predictive coding.
- weighted prediction parameters may include weights and offsets; wherein, a single image may use all or part of the weights and offsets in a set of values.
- Step 204 When performing image weighted prediction encoding based on the neural network model, record the necessary weighted prediction parameters of the deep learning scheme, including but not limited to the structure and parameters of the neural network model, a set of extracted weighted prediction parameter values and the parameters used in each image area Index information.
- Step 205 When the deep learning technology is not used, perform region-adaptive weighted predictive coding based on a traditional computing scheme, for example, based on the operation process in Embodiment 1.
- Step 206 Write weighted prediction identification information and parameters into the code stream, and the identification information and parameters are included in all or part of the following parameter sets: sequence layer parameter set, image layer parameter set, slice layer parameter set, supplementary enhancement information , video usability information, image header information, slice header information, network abstraction layer unit header information, or as a new information unit, may also be included in the coding tree unit or the coding unit.
- Step 207 Output the image code stream or the transport stream or media file containing the image code stream.
- Fig. 7 is a flow chart of a video decoding method provided by the embodiment of the present application.
- the embodiment of the present application is applicable to video decoding in a brightness change scene, and the method can be executed by a video decoding device, which can be implemented by software and/or Or hardware implementation, and generally integrated in the terminal device, see Figure 7, the method provided by the embodiment of the present application specifically includes the following steps:
- Step 410 acquire the image code stream, and analyze the weighted prediction identification information and parameters in the image code stream.
- the transmitted image code stream may be received, and weighted prediction identification information and parameters may be extracted from the image code stream, wherein the image code stream may include one or more sets of weighted prediction identification information and parameters.
- Step 420 decode the image code stream according to the weighted prediction identification information and parameters to generate a reconstructed image.
- weighted predictive decoding can be performed on the image code stream by using the obtained weighted prediction identification information and parameters, and the image code stream can be processed into a reconstructed image, wherein the reconstructed image can be an image generated according to the transmission code stream.
- the image code by obtaining the image code stream, and obtaining the weighted prediction identification information and parameters in the image code stream, and processing the image code stream according to the obtained weighted prediction identification information and parameters to generate a new image, the image code is realized.
- Dynamic decoding of streams can improve video decoding efficiency and reduce the impact of image brightness changes on video decoding efficiency.
- Fig. 8 is a flowchart of another video decoding method provided by the embodiment of the present application.
- the embodiment of the present application is based on the embodiment of the above application. See Fig. 8.
- the method provided by the embodiment of the present application specifically includes the following steps :
- Step 510 acquire the image code stream, and analyze the weighted prediction identification information and parameters in the image code stream.
- Step 520 Perform weighted predictive decoding on the image code stream by using the weighted predictive identification information and parameters according to the pre-trained neural network model to generate a reconstructed image.
- the neural network model may be a deep learning model for image code stream decoding
- the deep network model may be generated by training sample code streams and sample images
- the neural network model may perform weighted predictive decoding on the image code stream.
- the image code stream and weighted prediction identification information and parameters can be input into the pre-trained neural network model, and the neural network model performs weighted predictive decoding processing on the image code stream, and processes the image code stream into a reconstructed image .
- the realization of The dynamic decoding of the image code stream can improve the video decoding efficiency and reduce the impact of image brightness changes on the video decoding efficiency.
- the number of weighted prediction identification information and parameters in the image code stream is at least one set.
- the image code stream may be the information generated by weighted predictive coding of video images. Based on the different methods of weighted predictive coding, there may be one or more sets of weighted predictive identification information and parameters in the image code stream. For example, encoding If the terminal performs weighted predictive coding on different regions in the video image, multiple sets of weighted predictive identification information and parameters may exist in the image code stream.
- the weighted prediction identification information and the parameters are included in at least one of the following parameter sets: sequence layer parameter set, image layer parameter set, slice layer parameter set, supplementary enhancement information, video availability information, image header information, slice header information, network abstraction layer unit header information, coding tree unit, coding unit.
- the weighted prediction identification information and the parameters include at least one of the following information: reference image index information, weighted prediction enable control information, and region-adaptive weighted prediction enable control information , Weighted prediction parameters.
- the image code stream includes a transport stream or a media file.
- the weighted prediction identification information and parameters also include neural network model structure and neural network model parameters.
- FIG. 9 is an example diagram of a video decoding method provided in the embodiment of the present application.
- the input of the decoding process is an image code stream or a transport data stream or a media file containing an image code stream
- the output It is an image that constitutes a video
- the decoding process of a video image may include:
- Step 301 Read code stream.
- Step 302 Parse the code stream to obtain weighted prediction identification information.
- the decoder parses the sequence layer parameter set, the image layer parameter set and/or the slice layer parameter set to obtain weighted prediction identification information.
- the sequence layer parameter set includes the sequence parameter set (Sequence Parameter Set, SPS)
- the image layer parameter set includes the picture parameter set (Picture Parameter Set, PPS)
- the adaptation parameter set Adaptation Parameter Set, APS
- the slice layer parameter Set includes APS.
- the weighted prediction identification information in the sequence layer parameter set can be referenced by the image layer parameter set and the slice layer parameter set, and the weighted prediction identification information in the image layer parameter set can be referenced by the slice layer parameter set.
- the weighted prediction identification information includes but is not limited to whether the sequence and/or image indicated by the current parameter set adopts unidirectional weighted prediction, bidirectional weighted prediction, and/or region-adaptive multi-weighted weighted prediction;
- distinguishing whether to adopt the region-adaptive multi-weight weighted prediction method includes but is not limited to parsing binary identifiers, or the number of weighted prediction parameter sets (whether it is equal to or greater than 1).
- Step 303 According to the weighted prediction identification information, determine whether the current image adopts weighted prediction decoding.
- Step 304 When it is determined that the current image adopts weighted prediction decoding, obtain weighted prediction parameter information.
- the decoder acquires weighted prediction parameter information from the parameter set and/or data header information according to the indication of the identification information.
- the parameter set includes SPS, PPS, APS
- the data header information includes image header PH, slice header (Slice Header, SH).
- the weighted prediction parameter information includes, but is not limited to, whether each reference image in the reference image list is configured with weighted prediction parameters (weight and offset), the number of sets of weighted prediction parameters configured for each reference image, and the specific value of each set of weighted prediction parameters. value etc.
- Step 305 Perform weighted prediction decoding on the current image according to the weighted prediction identification information and parameters.
- the decoder can perform unified weighted prediction decoding on the current complete image, or perform differentiated weighted prediction decoding on each partial content in the image.
- Step 306 When it is determined that the current image does not use weighted predictive decoding, directly perform image decoding based on traditional methods.
- Step 307 Generate a reconstructed image.
- the reconstructed image can be used for display or saved directly.
- FIG. 10 is an example diagram of another video decoding method provided by the embodiment of the present application.
- this embodiment applies deep learning technology to region-adaptive weighted prediction operations, and the video decoding method Specifically include the following steps:
- Step 401 Read code stream.
- Step 402 Parse the code stream to obtain identification information of weighted prediction based on deep learning.
- the decoder parses the sequence layer parameter set, image layer parameter set and/or slice layer parameter set to obtain identification information of weighted prediction based on deep learning.
- the sequence layer parameter set includes the sequence parameter set (Sequence Parameter Set, SPS)
- the image layer parameter set includes the picture parameter set (Picture Parameter Set, PPS)
- the adaptation parameter set Adaptation Parameter Set, APS
- the slice layer parameter Set includes APS.
- the weighted prediction identification information in the sequence layer parameter set can be referenced by the image layer parameter set and the slice layer parameter set
- the weighted prediction identification information in the image layer parameter set can be referenced by the slice layer parameter set.
- the identification information of the weighted prediction based on deep learning includes but not limited to whether the sequence and/or image indicated by the current parameter set adopts the weighted prediction based on deep learning.
- Step 403 According to the identification information of weighted prediction based on deep learning, determine whether the current image is decoded by weighted prediction based on deep learning.
- Step 404 When it is determined that the current image adopts weighted prediction decoding based on deep learning, obtain necessary weighted prediction parameters of the deep learning scheme.
- the decoder acquires weighted prediction parameter information from the parameter set and/or data header information according to the indication of the identification information.
- the parameter set includes SPS, PPS, APS
- the data header information includes image header PH, slice header (Slice Header, SH).
- the weighted prediction parameter information includes, but is not limited to, reference image index information, neural network model structure and parameters, all or part of the weighted prediction parameter values, and weighted prediction parameter index information used in each region of the current image.
- Step 405 Perform weighted prediction decoding based on deep learning on the current image according to the weighted prediction identification information and parameters.
- Step 406 When it is determined that the current image does not adopt weighted predictive decoding based on deep learning, directly perform image decoding based on traditional methods.
- Step 407 Generate a reconstructed image.
- the reconstructed image can be used for display or saved directly.
- the embodiment provides identification information of region-adaptive weighted prediction parameters included in the sequence layer parameter set SPS in the code stream.
- the identification information in SPS can be referenced by PPS and APS.
- the syntax and semantics in Table 1 are defined as follows:
- sps_weighted_pred_flag is the enabling control information for the sequence layer to apply weighted prediction techniques to unidirectional prediction slices (P slices).
- P slices unidirectional prediction slices
- sps_weighted_bipred_flag is the enabling control information for the sequence layer to apply the weighted prediction technique to biprediction slices (B slices).
- B slices biprediction slices
- sps_wp_multi_weights_flag is the enabling control information for the sequence layer to have multiple sets of weighted prediction parameters for a single reference picture.
- sps_wp_multi_weights_flag is equal to 1, it indicates that the single reference picture of the picture indicated by the SPS can have multiple sets of weighted prediction parameters; conversely, when sps_wp_multi_weights_flag is equal to 0, it indicates that the single reference picture of the picture indicated by the SPS has only a single set of weighted prediction parameters.
- this embodiment provides identification information of region-adaptive weighted prediction parameters included in the picture layer parameter set PPS in the code stream, and the identification information in the PPS can be referenced by the APS.
- pps_weighted_pred_flag is the enabling control information for the image layer to apply weighted prediction technology to unidirectional prediction slices (P slices).
- pps_weighted_pred_flag is the enabling control information for the image layer to apply weighted prediction technology to unidirectional prediction slices (P slices).
- pps_weighted_pred_flag is the enabling control information for the image layer to apply weighted prediction technology to unidirectional prediction slices (P slices).
- pps_weighted_bipred_flag is the enabling control information for the image layer to apply the weighted prediction technique to bipredictive slices (B slices).
- B slices bipredictive slices
- pps_wp_multi_weights_flag is the enabling control information for the picture layer to have multiple sets of weighted prediction parameters with respect to a single reference picture.
- pps_wp_multi_weights_flag is equal to 1
- sps_wp_multi_weights_flag is equal to 0, the value of pps_wp_multi_weights_flag should be equal to 0.
- pps_no_pic_partition_flag When pps_no_pic_partition_flag is equal to 1, it indicates that no image segmentation is applied to each image indicated by the PPS; when pps_no_pic_partition_flag is equal to 0, it indicates that each image indicated by the PPS may be divided into multiple tiles Tile or Slice.
- pps_rpl_info_in_ph_flag When pps_rpl_info_in_ph_flag is equal to 1, it indicates that the reference picture list (Reference Picture List, RPL) information exists in the picture header (Picture Header, PH) syntax structure, and does not exist in the slice header that does not contain the PH syntax structure indicated by the PPS.
- pps_rpl_info_in_ph_flag is equal to 0, it indicates that the RPL information does not exist in the PH syntax structure, and may exist in the slice header indicated by the PPS.
- pps_wp_info_in_ph_flag When pps_wp_info_in_ph_flag is equal to 1, it indicates that the weighted prediction information may exist in the PH syntax structure, and does not exist in the slice header that does not include the PH syntax structure indicated by the PPS. When pps_wp_info_in_ph_flag is equal to 0, it indicates that the weighted prediction information does not exist in the PH syntax structure, and may exist in the slice header indicated by the PPS.
- the embodiment provides identification information of region-adaptive weighted prediction parameters included in the picture header (PH) in the code stream.
- the weighted prediction parameters included in the PH may be referenced by the current picture, slices in the current picture, and/or CTUs or CUs in the current picture.
- ph_inter_slice_allowed_flag When ph_inter_slice_allowed_flag is equal to 0, it indicates that all coded slices of the image are intra prediction type (I slice). When ph_inter_slice_allowed_flag is equal to 1, it indicates that the image is allowed to contain one or more unidirectional or bidirectional inter prediction type slices (P slice or B slice).
- multi_pred_weights_table() is a numerical table containing weighted prediction parameters, where a single reference image can have multiple sets of weighted prediction parameters (weight + offset).
- weighted prediction parameters can be obtained from the table.
- pred_weight_table() is a numerical table containing weighted prediction parameters, where a single reference image has only a single set of weighted prediction parameters (weight + offset).
- the embodiment provides identification information of region-adaptive weighted prediction parameters included in the slice header SH in the code stream.
- the weighted prediction parameters contained in the SH can be referenced by the current slice and/or the CTU or CU in the current slice.
- sh_picture_header_in_slice_header_flag is equal to 1, indicating that the PH syntax structure exists in the slice header.
- sh_picture_header_in_slice_header_flag is equal to 0, the PH syntax structure does not exist in the slice header, that is, the slice layer may not completely inherit the identification information of the image layer, and the encoding tool can be flexibly selected.
- sh_slice_type indicates the coding type of the slice, which can be intra coding type (I slice), unidirectional inter coding type (P slice), and bidirectional inter coding type (B slice).
- sh_wp_multi_weights_flag is the enable control information for the slice layer to have multiple sets of weighted prediction parameters with respect to a single reference picture.
- sh_wp_multi_weights_flag is the enable control information for the slice layer to have multiple sets of weighted prediction parameters with respect to a single reference picture.
- weighted prediction parameters can be obtained from the value table multi_pred_weights_table(); if pps_wp_multi_weights_flag is equal to 0, the weighted prediction parameters can be obtained from the value Obtained in the table pred_weight_table().
- the embodiment provides the syntax and semantics of the weighted prediction parameter value table, pred_weight_table() and multi_pred_weights_table() are both value tables containing weighted prediction parameters, the difference is that the former defines a single reference image only has a single set of weighted prediction parameters, while the latter defines that a single reference picture can have multiple sets of weighted prediction parameters.
- the syntax and semantics of pred_weight_table() can refer to the document description of the international standard H.266/VVCversion1; the syntax and semantics of multi_pred_weights_table() are given in Table 5 and its description. in,
- luma_log2_weight_denom and delta_chroma_log2_weight_denom are magnification factors for luma and chrominance weighting factors, respectively, to avoid floating-point operations at the encoding end.
- num_l0_weights indicates that when pps_wp_info_in_ph_flag is equal to 1, the number of weighting factors that need to be indicated for many entries (reference pictures) in the reference picture list 0 (RPL0).
- the value range of num_l0_weights is [0,Min(15,num_ref_entries[0][RplsIdx[0]])], where num_ref_entries[listIdx][rplsIdx] indicates the entries in the reference image list syntax structure ref_pic_list_struct(listIdx,rplsIdx) number.
- the variable NumWeightsL0 is set to num_l0_weights; otherwise, when pps_wp_info_in_ph_flag is equal to 0, the variable NumWeightsL0 is set to NumRefIdxActive[0].
- the value of NumRefIdxActive[i]-1 indicates the maximum reference index that may be used to decode the slice in the reference picture list i (RPLi). When the value of NumRefIdxActive[i] is 0, it indicates that there is no reference index in RPLi for decoding slices.
- luma_weight_l0_flag[i] When luma_weight_l0_flag[i] is equal to 1, it indicates that the luma component for unidirectional prediction using the i-th entry in the reference list 0 (RefPicList[0][i]) has a weighting factor (weight+offset). When luma_weight_l0_flag[i] is equal to 0, it indicates that the above weighting factor does not exist.
- chroma_weight_l0_flag[i] When chroma_weight_l0_flag[i] is equal to 1, it indicates that the chroma prediction value using the i-th entry in the reference list 0 (RefPicList[0][i]) for unidirectional prediction has a weighting factor (weight + offset). When chroma_weight_l0_flag[i] is equal to 0, it indicates that the above weighting factors do not exist (by default).
- num_l0_luma_pred_weights[i] indicates that when luma_weight_l0_flag[i] is equal to 1, the number of weighting factors that need to be indicated for the brightness component of entry i (reference image i) in reference image list 0 (RPL0), that is, the brightness of a single reference image i in list 0 The number of weighted prediction parameters that a component can carry.
- delta_luma_weight_l0[i][k] and luma_offset_l0[i][k] respectively indicate the k-th weight factor and offset value of the i-th reference image luminance component in reference image list 0.
- num_l0_chroma_pred_weights[i][j] indicates that when chroma_weight_l0_flag[i] is equal to 1, the number of weighting factors that need to be indicated for the jth chrominance component of entry i (reference image i) in reference image list 0 (RPL0), namely The number of weighted prediction parameters that can be carried by the jth chrominance component of a single reference image i in list 0.
- delta_chroma_weight_l0[i][j][k] and delta_chroma_offset_l0[i][j][k] respectively indicate the kth weight factor and offset of the jth chrominance component of the ith reference image in the reference image list 0 Quantity value.
- this embodiment provides the syntax and semantics of another numerical table of weighted prediction parameters.
- Both pred_weight_table() and multi_pred_weights_table() are numerical tables containing weighted prediction parameters. The difference is that the former defines that a single reference image has only one set of weighted prediction parameters, while the latter defines that a single reference image can have multiple sets of weighted prediction parameters.
- the syntax and semantics of pred_weight_table() can be found in the document description of the international standard H.266/VVC version 1; the syntax and semantics of multi_pred_weights_table() are given in Table 6 and its description.
- multi_pred_weights_table() involves many entries in the reference image list, that is, multiple specified reference images need to determine whether there are weighting factors, and a single reference image with weighting factors may also have multiple sets of weighted prediction parameters (including weights and offsets).
- this embodiment is a special case of the eleventh embodiment, where only one reference image in each reference image list is considered, that is, when determining to apply the weighted prediction technique, directly specify the multiple sets of weighted predictions that the reference image has parameter.
- the meaning of each field in Table 6 is the same as the corresponding semantic interpretation of each field in Table 5.
- the identification information and parameters of the region-adaptive weighted prediction are given in the coding tree unit CTU.
- the weighted prediction parameter information in the CTU can be independently identified, and can also refer to other parameter sets (such as sequence layer parameter set SPS, picture layer parameter set PPS) or header information (such as picture header PH, slice header SH) , you can also record the weighted prediction parameter difference, or record the weighted prediction parameter index information and the difference.
- the weighted prediction parameter information in the CTU is the record difference, or the record index information and the difference, it refers to obtaining a weighted value from other parameter sets or header information, plus a difference in the CTU, so that The weighted prediction parameters finally applied to the CTU or CU can be obtained.
- the weighted prediction parameter information is included in the unit of CTU, a refined brightness gradient effect in the unit of CTU, such as circle gradient and radiation gradient, can be realized.
- the specific code stream organization method can be shown in Table 7.
- the weighted prediction parameter finally applied to the current CTU is the specific weighted prediction parameter of the reference image plus the weighted prediction parameter difference defined in coding_tree_unit().
- the weighted prediction parameter difference includes, but not limited to, the weighted prediction parameter difference (ctu_delta_luma_weight_l0 and ctu_delta_luma_offset_l0) for the luma component in RPL0, and the weighted prediction parameter difference for the chroma component in RPL0 (ctu_delta_chroma_weight_l0[i] and ctu_delta_chroma_offset_l0[ i]), the weighted prediction parameter difference for the luma component in RPL1 (ctu_delta_luma_weight_l1 and ctu_delta_luma_offset_l1), and the weighted prediction parameter difference for the chroma component in RPL1 (ctu_delta_chroma_weight_l1[i] and ctu_delta_chroma_offset_l1[i]).
- the weighted prediction parameter index information includes but not limited to the weighted prediction parameter index number (ctu_index_l0_luma_pred_weights) for the luma component in RPL0, the weighted prediction parameter index number (ctu_index_l0_chroma_pred_weights[i]) for the chroma component in RPL0, The weighted prediction parameter index number (ctu_index_l1_luma_pred_weights) of the luma component, and the weighted prediction parameter index number (ctu_index_l1_chroma_pred_weights[i]) for the chrominance component in RPL1.
- the weighted prediction parameter finally applied to the current CTU is the specific weighted prediction parameter of the reference image plus the weighted prediction parameter difference defined in coding_tree_unit().
- the identification information and parameters of the region-adaptive weighted prediction are given in the coding unit CU.
- the weighted prediction parameter information in the CU can be independently identified, or it can refer to other parameter sets (such as sequence layer parameter set SPS, picture layer parameter set PPS) or header information (such as picture header PH, slice header SH) or encoding
- the difference value of the weighted prediction parameter may also be recorded, or the index information and the difference value of the weighted prediction parameter may be recorded.
- the weighted prediction parameter information in the CU is the record difference, or the record index information and the difference, it refers to obtaining a weighted value from other parameter sets or header information, plus a difference in the CU, so that The weighted prediction parameters finally applied to the CU can be obtained.
- the weighted prediction parameter information is included in units of CUs, refined brightness gradient effects in units of CUs, such as circle gradients and radiation gradients, can be achieved.
- the specific code stream organization method may be as shown in Table 8.
- cu_pred_weights_adjust_flag indicates whether to adjust the weighted prediction parameter value indexed by the current CU.
- cu_pred_weights_adjust_flag is equal to 1, it indicates that the current CU needs to adjust the weighted prediction parameter value, that is, the weighted prediction parameter value finally applied to the current CU is the sum of the difference between the weighted prediction parameter value at the CTU level and the CU level identification; when cu_pred_weights_adjust_flag is equal to 1 , indicating that the current CU directly uses the weighted prediction parameter value determined by the CTU level.
- the weighted prediction parameter differences identified at the CU level include the weighted prediction parameter differences (cu_delta_luma_weight_l0 and cu_delta_luma_offset_l0) for the luma component in RPL0.
- the weighted prediction parameter difference identified at the CU level also includes the weighted prediction parameter difference for each chroma component; if the current CU is bidirectionally predictive, the weighted prediction parameter difference identified at the CU level It also includes weighted prediction parameter differences for the luma component and/or each chrominance component in RPL1.
- the weighted prediction parameters finally applied to the current CU are the specific weighted prediction parameters carried by the upper layer data (such as CTU level, Slice level, and Subpicture level) in the codec structure plus the weighted prediction parameter difference defined in coding_unit().
- the identification information and parameters of the region-adaptive weighted prediction are given in supplemental enhancement information (SupplementalEnhancementInformation, SEI).
- SEI SupplementalEnhancementInformation
- the NALunittype in the network abstraction layer unit header information nal_unit_header() is set to 23, indicating the pre-SEI information.
- sei_rbsp() contains related code stream sei_message(), and sei_message() contains valid data information. It only needs to set the value of payloadType to be different from other SEI information in the current H.266/VVCversion1 (for example, the possible value is 100), then payload_size_byte contains code stream information related to region-adaptive weighted prediction.
- the specific code stream organization method is shown in Table 7.
- multi_pred_weights_cancel_flag 1, the SEI information related to the previous image will be canceled, and the image does not use the relevant SEI function; if multi_pred_weights_cancel_flag is 0, the previous SEI information will be used (during the decoding process, if the current image does not carry SEI information, the previous The SEI information of an image will continue to be used in the decoding process of the current image), and the relevant SEI function is enabled for the image; if multi_pred_weights_persistence_flag is 1, the SEI information is applied to the current image and images after the current layer; if multi_pred_weights_persistence_flag is 0, the SEI information is only applied to current image.
- the meanings of other fields in Table 9 are the same as the semantic explanations corresponding to each field in Table 5.
- the identification information and parameters of the region-adaptive weighted prediction are given in the media description information.
- the media description information includes, but is not limited to, the Media Presentation Description (MPD) information in the HTTP-based Dynamic Adaptive Streaming over HTTP (DASH) protocol, MPEG Media Transport (MPEG Media Transport, MMT) Asset Descriptor information in the protocol.
- MPD Media Presentation Description
- MMT MPEG Media Transport
- Table 10 The syntax and field meanings in Table 10 are the same as the semantic explanations corresponding to the fields in Table 5.
- Fig. 11 is a schematic structural diagram of a video coding device provided by an embodiment of the present application; the video coding method provided by any embodiment of the present application can be executed, and it has corresponding functional modules and beneficial effects for executing the method, and the device can be implemented by software and/or
- the hardware implementation specifically includes: an image acquisition module 610 and a video encoding module 620 .
- the image acquiring module 610 is configured to acquire a video image, wherein the video image is at least one frame of video.
- the video encoding module 620 is configured to perform weighted predictive encoding on the video image to generate an image code stream, wherein the weighted predictive encoding uses at least one set of weighted predictive identification information and parameters.
- the video image is acquired by the image acquisition module, and the video image is at least one frame image in the video, and the video encoding module performs weighted predictive encoding on the video image to generate an image code stream, wherein at least A set of weighted prediction identification information and parameters realizes flexible encoding of video images, improves video encoding efficiency, and reduces the impact of video image brightness changes on encoding efficiency.
- the device further includes:
- the change determination module is used to determine the brightness change according to the comparison result between the video image and the reference image.
- the brightness change situation includes at least one of the following: an average value of image brightness change, and a pixel point brightness change value.
- the video encoding module 620 includes:
- An encoding processing unit configured to perform weighted predictive encoding on the video image according to the brightness change of the video image.
- the encoding processing unit is specifically configured to: if the brightness change is that the brightness of the entire frame image is consistent, perform weighted predictive encoding on the video image; if the brightness change If the image brightness of the partitions is the same, it is determined to perform weighted predictive coding on each of the partition images in the video image.
- the set of weighted prediction identification information and parameters used in the weighted prediction encoding in the device corresponds to one frame of the video image or at least one partition image of the video image.
- the specification of the partitioned image in the device includes at least one of the following: Slice, Tile, Subpicture, Coding Tree Unit, Coding Unit Unit.
- the device further includes:
- a code stream writing module configured to write the weighted prediction identification information and parameters into the image code stream.
- the weighted prediction identification information and the parameters in the device are included in at least one of the following parameter sets: a sequence layer parameter set, an image layer parameter set, a slice layer parameter set, Supplementary enhancement information, video usability information, image header information, slice header information, network abstraction layer unit header information, coding tree unit, coding unit.
- the weighted prediction identification information and the parameters in the device include at least one of the following information: reference image index information, weighted prediction enabling control information, and region-adaptive weighted prediction enabling Control information, weighted prediction parameters.
- the image code stream in the device includes a transport stream or a media file.
- the video coding module 620 also includes:
- the deep learning unit is used to perform weighted predictive coding on the video image according to the pre-trained neural network model.
- the weighted prediction identification information in the device further includes a neural network model structure and neural network model parameters.
- Fig. 12 is a schematic structural diagram of a video decoding device provided in an embodiment of the present application, which can execute the video decoding method provided in any embodiment of the present application, and has corresponding functional modules and beneficial effects for executing the method.
- the device can be implemented by software and/or
- the hardware implementation specifically includes: a code stream acquisition module 710 and an image reconstruction module 720 .
- the code stream obtaining module 710 is configured to obtain an image code stream, and parse the weighted prediction identification information and parameters in the image code stream.
- An image reconstruction module 720 configured to decode the image code stream according to the weighted prediction identification information and parameters to generate a reconstructed image.
- the image code stream is obtained by the code stream acquisition module, and the weighted prediction identification information and parameters in the image code stream are obtained, and the image reconstruction module processes the image code stream according to the obtained weighted prediction identification information and parameters to generate Re-image realizes dynamic decoding of image code stream, which can improve video decoding efficiency and reduce the impact of image brightness changes on video decoding efficiency.
- the number of weighted prediction identification information and parameters in the image code stream in the device is at least one set.
- the weighted prediction identification information and the parameters in the device are included in at least one of the following parameter sets: a sequence layer parameter set, an image layer parameter set, a slice layer parameter set, Supplementary enhancement information, video usability information, image header information, slice header information, network abstraction layer unit header information, coding tree unit, coding unit.
- the weighted prediction identification information and the parameters in the device include at least one of the following information: reference image index information, weighted prediction enabling control information, and region-adaptive weighted prediction enabling Control information, weighted prediction parameters.
- the image code stream in the device includes a transport stream or a media file.
- the image reconstruction module 720 includes:
- the deep learning decoding unit is configured to use the weighted prediction identification information and parameters to perform weighted prediction decoding on the image code stream according to the pre-trained neural network model to generate a reconstructed image.
- the weighted prediction identification information and parameters in the device also include neural network model structure and neural network model parameters.
- FIG. 13 is a schematic structural diagram of an encoder provided in an embodiment of the present application.
- the encoder shown in FIG. 13 is applied to an apparatus for encoding video.
- the input of the device is the image included in the video, and the output is the image code stream or the transport stream or media file containing the image code stream.
- the encoder is used for: Step 501: Input an image.
- the specific operation process may be the same as the video encoding method provided in any of the foregoing embodiments.
- Step 503 Output code stream.
- FIG. 14 is a schematic structural diagram of a decoder provided in an embodiment of the present application.
- the decoder shown in FIG. 14 is applied to an apparatus for decoding video.
- the input of the device is an image code stream or a transport stream or a media file containing the image code stream, and the output is an image constituting a video.
- the decoder is used for: Step 601: Input an image.
- An example of a specific operation process is the video decoding method provided in any of the foregoing embodiments.
- Step 604 the player plays the image.
- Figure 15 is a schematic structural diagram of an electronic device provided by an embodiment of the present application, the electronic device includes a processor 70, a memory 71, an input device 72, and an output device 73; the number of processors 70 in the electronic device can be one or more
- a processor 70 is taken as an example; the processor 70, memory 71, input device 72 and output device 73 in the electronic device can be connected by bus or other methods.
- the connection by bus is taken as an example.
- the memory 71 can be used to store software programs, computer-executable programs and modules, such as modules corresponding to the video encoding device or video decoding device in the embodiment of the present application (the image acquisition module 610 and the video encoding module 620, or, code stream acquisition module 710 and image reconstruction module 720).
- the processor 70 executes various functional applications and data processing of the electronic device by running software programs, instructions and modules stored in the memory 71 , that is, implements the above-mentioned method.
- the memory 71 may mainly include a program storage area and a data storage area, wherein the program storage area may store an operating system and at least one application required by a function; the data storage area may store data created according to the use of the electronic device, and the like.
- the memory 71 may include a high-speed random access memory, and may also include a non-volatile memory, such as at least one magnetic disk storage device, flash memory device, or other non-volatile solid-state storage devices.
- the memory 71 may further include a memory that is remotely located relative to the processor 70, and these remote memories may be connected to the electronic device through a network. Examples of the aforementioned networks include, but are not limited to, the Internet, intranets, local area networks, mobile communication networks, and combinations thereof.
- the input device 72 can be used to receive input numbers or character information, and generate key signal input related to user settings and function control of the electronic device.
- the output device 73 may include a display device such as a display screen.
- the embodiment of the present application also provides a storage medium containing computer-executable instructions, the computer-executable instructions are used to execute a video encoding method when executed by a computer processor, the method comprising:
- the computer-executable instructions are used to perform a video decoding method when executed by a computer processor, the method comprising:
- the present application can be realized by means of software and necessary general-purpose hardware, and of course it can also be realized by hardware, but in many cases the former is a better implementation .
- the essence of the technical solution of this application or the part that contributes to the prior art can be embodied in the form of a software product, and the computer software product can be stored in a computer-readable storage medium, such as a computer floppy disk , read-only memory (Read-Only Memory, ROM), random access memory (Random Access Memory, RAM), flash memory (FLASH), hard disk or optical disc, etc., including several instructions to make a computer device (which can be a personal computer) , server, or network device, etc.) execute the method described in each embodiment of the present application.
- a computer-readable storage medium such as a computer floppy disk , read-only memory (Read-Only Memory, ROM), random access memory (Random Access Memory, RAM), flash memory (FLASH), hard disk or optical disc, etc.
- the division between functional modules/units mentioned in the above description does not necessarily correspond to the division of physical components; for example, one physical component may have multiple functions, or one function or step may be composed of several physical components. Components cooperate to execute.
- Some or all of the physical components may be implemented as software executed by a processor, such as a central processing unit, digital signal processor, or microprocessor, or as hardware, or as an integrated circuit, such as an application-specific integrated circuit .
- a processor such as a central processing unit, digital signal processor, or microprocessor
- Such software may be distributed on computer readable media, which may include computer storage media (or non-transitory media) and communication media (or transitory media).
- computer storage media includes both volatile and nonvolatile media implemented in any method or technology for storage of information, such as computer readable instructions, data structures, program modules, or other data. permanent, removable and non-removable media.
- Computer storage media includes, but is not limited to, RAM, ROM, EEPROM, flash memory or other memory technology, CD-ROM, digital versatile disk (DVD) or other optical disk storage, magnetic cartridges, tape, magnetic disk storage or other magnetic storage devices, or can Any other medium used to store desired information and which can be accessed by a computer.
- communication media typically embodies computer readable instructions, data structures, program modules, or other data in a modulated data signal such as a carrier wave or other transport mechanism, and may include any information delivery media .
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- General Health & Medical Sciences (AREA)
- Biomedical Technology (AREA)
- Biophysics (AREA)
- Computational Linguistics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Evolutionary Computation (AREA)
- Artificial Intelligence (AREA)
- Molecular Biology (AREA)
- Computing Systems (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Mathematical Physics (AREA)
- Software Systems (AREA)
- Health & Medical Sciences (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
Abstract
Description
Claims (25)
- 一种视频编码方法,包括:获取视频图像,其中,所述视频图像为视频的至少一帧图像;对所述视频图像进行加权预测编码以生成图像码流,其中,所述加权预测编码使用至少一套加权预测标识信息和参数。
- 根据权利要求1所述的方法,还包括:根据所述视频图像与参考图像的比较结果确定亮度变化情况。
- 根据权利要求2所述的方法,其中,所述亮度变化情况包括以下至少之一:图像亮度变化均值、像素点亮度变化值。
- 根据权利要求1所述的方法,其中,所述对所述视频图像进行加权预测编码以生成图像码流,包括:根据所述视频图像的亮度变化情况对所述视频图像进行加权预测编码。
- 根据权利要求4所述的方法,其中,所述根据所述亮度变化情况对所述视频图像进行加权预测编码,包括以下至少之一:若所述亮度变化情况为整帧图像亮度一致,则对所述视频图像进行加权预测编码;若所述亮度变化情况为分区图像亮度一致,则确定分别对所述视频图像内各所述分区图像进行加权预测编码。
- 根据权利要求1所述的方法,其中,所述视频图像针对所述参考图像或所述参考图像的分区图像存在至少一套所述加权预测标识信息和参数。
- 根据权利要求1所述的方法,其中,所述加权预测编码使用的一套所述加权预测标识信息和参数对应一帧所述视频图像或者所述视频图像的至少一个分区图像。
- 根据权利要求7所述的方法,其中,所述分区图像的规格包括以下至少之一:分片Slice、瓦片Tile、子图像Subpicture、编码树单元Coding Tree Unit、编码单元Coding Unit。
- 根据权利要求1所述的方法,还包括:将所述加权预测标识信息和参数写入所述图像码流。
- 根据权利要求9所述的方法,其中,所述加权预测标识信息和所述参数包含在下述至少一种参数集合中:序列层参数集、图像层参数集、分片层参数集、补充增强信息、视频可用性信息、图像头信息、分片头信息、网络抽象层单元头信息、编码树单元、编码单元。
- 根据权利要求1所述的方法,其中,所述加权预测标识信息和所述参数包括以下信息中至少之一:参考图像索引信息、加权预测启用控制信息、区域自适应的加权预测启用控制信息、加权预测参数。
- 根据权利要求1所述的方法,其中,所述图像码流包括传输流或媒体文件。
- 根据权利要求1所述的方法,其中,所述对所述视频图像进行加权预测编码以生成图像码流,包括根据预先训练的神经网络模型对所述视频图像进行加权预测编码。
- 根据权利要求13所述的方法,其中,所述加权预测标识信息还包括神经网络模型结构和神经网络模型参数。
- 一种视频解码方法,包括:获取图像码流,并解析所述图像码流中的加权预测标识信息和参数;根据所述加权预测标识信息和参数解码所述图像码流以生成重建图像。
- 根据权利要求15所述的方法,其中,所述图像码流中的加权预测标识信息和参数的数量为至少一套。
- 根据权利要求15所述的方法,其中,所述加权预测标识信息和所述参数包含在下述至少一种参数集合中:序列层参数集、图像层参数集、分片层参数集、补充增强信息、视频可用性信息、图像头信息、分片头信息、网络抽象层单元头信息、编码树单元、编码单元。
- 根据权利要求15所述的方法,其中,所述加权预测标识信息和所述参数包括以下信息中至少之一:参考图像索引信息、加权预测启用控制信息、区域自适应的加权预测启用控制信息、加权预测参数。
- 根据权利要求15所述的方法,其中,所述图像码流包括传输流或媒体文件。
- 根据权利要求15所述的方法,其中,所述根据所述加权预测标识信息和参数解码所述图像码流以生成重建图像,包括:根据预先训练的神经网络模型使用所述加权预测标识信息和参数对所述图像码流进行加权预测解码以生成重建图像。
- 根据权利要求20所述的方法,其中,所述加权预测标识信息和参数还包括神经网络模型结构和 神经网络模型参数。
- 一种视频编码装置,包括:图像获取模块,用于获取视频图像,其中,所述视频图像为视频的至少一帧图像;视频编码模块,用于对所述视频图像进行加权预测编码以生成图像码流,其中,所述加权预测编码使用至少一套加权预测标识信息和参数。
- 一种视频解码装置,包括:码流获取模块,用于获取图像码流,并解析所述图像码流中的加权预测标识信息和参数;图像重建模块,用于根据所述加权预测标识信息和参数解码所述图像码流以生成重建图像。
- 一种电子设备,包括:一个或多个处理器;存储器,用于存储一个或多个程序;其中,当所述一个或多个程序被所述一个或多个处理器执行,使得所述一个或多个处理器实现如权利要求1-14或15-21中任一所述的方法。
- 一种计算机可读存储介质,存储有一个或者多个程序,其中,所述一个或者多个程序可被一个或者多个处理器执行,以实现根据权利要求1-14或15-21中任一所述的方法。
Priority Applications (4)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP2023578781A JP7698746B2 (ja) | 2021-07-30 | 2022-06-29 | ビデオ符号化、ビデオ復号化の方法、装置、電子機器及び記憶媒体 |
| US18/577,790 US12457351B2 (en) | 2021-07-30 | 2022-06-29 | Video encoding method and apparatus, video decoding method and apparatus, and electronic device and storage medium |
| EP22848184.2A EP4380156A4 (en) | 2021-07-30 | 2022-06-29 | VIDEO ENCODING METHOD AND APPARATUS, VIDEO DECODING METHOD AND APPARATUS, ELECTRONIC DEVICE, AND STORAGE MEDIUM |
| KR1020247002467A KR20240024975A (ko) | 2021-07-30 | 2022-06-29 | 비디오 인코딩, 비디오 디코딩 방법, 장치, 전자 기기 및 저장 매체 |
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202110875430.8A CN115695812A (zh) | 2021-07-30 | 2021-07-30 | 视频编码、视频解码方法、装置、电子设备和存储介质 |
| CN202110875430.8 | 2021-07-30 |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2023005579A1 true WO2023005579A1 (zh) | 2023-02-02 |
Family
ID=85059734
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/CN2022/102406 Ceased WO2023005579A1 (zh) | 2021-07-30 | 2022-06-29 | 视频编码、视频解码方法、装置、电子设备和存储介质 |
Country Status (6)
| Country | Link |
|---|---|
| US (1) | US12457351B2 (zh) |
| EP (1) | EP4380156A4 (zh) |
| JP (1) | JP7698746B2 (zh) |
| KR (1) | KR20240024975A (zh) |
| CN (1) | CN115695812A (zh) |
| WO (1) | WO2023005579A1 (zh) |
Families Citing this family (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN119520491B (zh) * | 2023-08-23 | 2026-02-17 | 上海交通大学 | 一种基于http服务器的媒体资源的发送方法、接收方法和交互方法 |
| CN120151515A (zh) * | 2023-12-12 | 2025-06-13 | 腾讯科技(深圳)有限公司 | 视频编解码方法、装置、计算机可读介质及电子设备 |
| US20260113463A1 (en) * | 2024-10-18 | 2026-04-23 | Nvidia Corporation | Hardware-assisted weighted prediction encoding with software-generated weight prediction parameters |
Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| EP2302933A1 (en) * | 2009-09-17 | 2011-03-30 | Mitsubishi Electric R&D Centre Europe B.V. | Weighted motion compensation of video |
| CN103458240A (zh) * | 2012-05-29 | 2013-12-18 | 韩国科亚电子股份有限公司 | 利用自适应加权预测的影像处理方法 |
| CN106358041A (zh) * | 2016-08-30 | 2017-01-25 | 北京奇艺世纪科技有限公司 | 一种帧间预测编码方法及装置 |
| CN109417619A (zh) * | 2016-04-29 | 2019-03-01 | 英迪股份有限公司 | 用于编码/解码视频信号的方法和设备 |
| CN112261409A (zh) * | 2019-07-22 | 2021-01-22 | 中兴通讯股份有限公司 | 残差编码、解码方法及装置、存储介质及电子装置 |
Family Cites Families (19)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| RU2699253C2 (ru) | 2010-09-03 | 2019-09-04 | Гуандун Оппо Мобайл Телекоммьюникейшнз Корп., Лтд. | Способ и система для компенсации освещенности и перехода при кодировании и обработке видеосигнала |
| JP2012244353A (ja) * | 2011-05-18 | 2012-12-10 | Sony Corp | 画像処理装置および方法 |
| AU2011379258C1 (en) | 2011-10-17 | 2015-11-26 | Kabushiki Kaisha Toshiba | Encoding method and decoding method |
| KR101974261B1 (ko) * | 2016-06-24 | 2019-04-30 | 한국과학기술원 | Cnn 기반 인루프 필터를 포함하는 부호화 방법과 장치 및 복호화 방법과 장치 |
| WO2018037919A1 (ja) | 2016-08-26 | 2018-03-01 | シャープ株式会社 | 画像復号装置、画像符号化装置、画像復号方法、および画像符号化方法 |
| WO2019110125A1 (en) * | 2017-12-08 | 2019-06-13 | Huawei Technologies Co., Ltd. | Polynomial fitting for motion compensation and luminance reconstruction in texture synthesis |
| EP3562162A1 (en) | 2018-04-27 | 2019-10-30 | InterDigital VC Holdings, Inc. | Method and apparatus for video encoding and decoding based on neural network implementation of cabac |
| WO2019205117A1 (en) * | 2018-04-28 | 2019-10-31 | Intel Corporation | Weighted prediction mechanism |
| JP7185467B2 (ja) | 2018-09-28 | 2022-12-07 | Kddi株式会社 | 画像復号装置、画像符号化装置、画像処理システム及びプログラム |
| KR20210145754A (ko) * | 2019-04-12 | 2021-12-02 | 베이징 바이트댄스 네트워크 테크놀로지 컴퍼니, 리미티드 | 행렬 기반 인트라 예측에서의 산출 |
| CN114051735B (zh) * | 2019-05-31 | 2024-07-05 | 北京字节跳动网络技术有限公司 | 基于矩阵的帧内预测中的一步下采样过程 |
| JP2022534320A (ja) * | 2019-06-05 | 2022-07-28 | 北京字節跳動網絡技術有限公司 | マトリクスベースイントラ予測のためのコンテキスト決定 |
| CN120302048A (zh) | 2019-08-22 | 2025-07-11 | Lg电子株式会社 | 图像解码方法、图像编码方法和比特流发送方法 |
| US11677987B2 (en) * | 2020-10-15 | 2023-06-13 | Qualcomm Incorporated | Joint termination of bidirectional data blocks for parallel coding |
| US20230407239A1 (en) * | 2020-11-13 | 2023-12-21 | Teewinot Life Sciences Corporation | Tetrahydrocannabinolic acid (thca) synthase variants, and manufacture and use thereof |
| US12015785B2 (en) * | 2020-12-04 | 2024-06-18 | Ofinno, Llc | No reference image quality assessment based decoder side inter prediction |
| US11729424B2 (en) * | 2020-12-04 | 2023-08-15 | Ofinno, Llc | Visual quality assessment-based affine transformation |
| US11599748B2 (en) * | 2020-12-18 | 2023-03-07 | Tiliter Pty Ltd. | Methods and apparatus for recognizing produce category, organic type, and bag type in an image using a concurrent neural network model |
| US11825090B1 (en) * | 2022-07-12 | 2023-11-21 | Qualcomm Incorporated | Bit-rate estimation for video coding with machine learning enhancement |
-
2021
- 2021-07-30 CN CN202110875430.8A patent/CN115695812A/zh active Pending
-
2022
- 2022-06-29 EP EP22848184.2A patent/EP4380156A4/en active Pending
- 2022-06-29 WO PCT/CN2022/102406 patent/WO2023005579A1/zh not_active Ceased
- 2022-06-29 JP JP2023578781A patent/JP7698746B2/ja active Active
- 2022-06-29 US US18/577,790 patent/US12457351B2/en active Active
- 2022-06-29 KR KR1020247002467A patent/KR20240024975A/ko active Pending
Patent Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| EP2302933A1 (en) * | 2009-09-17 | 2011-03-30 | Mitsubishi Electric R&D Centre Europe B.V. | Weighted motion compensation of video |
| CN103458240A (zh) * | 2012-05-29 | 2013-12-18 | 韩国科亚电子股份有限公司 | 利用自适应加权预测的影像处理方法 |
| CN109417619A (zh) * | 2016-04-29 | 2019-03-01 | 英迪股份有限公司 | 用于编码/解码视频信号的方法和设备 |
| CN106358041A (zh) * | 2016-08-30 | 2017-01-25 | 北京奇艺世纪科技有限公司 | 一种帧间预测编码方法及装置 |
| CN112261409A (zh) * | 2019-07-22 | 2021-01-22 | 中兴通讯股份有限公司 | 残差编码、解码方法及装置、存储介质及电子装置 |
Non-Patent Citations (1)
| Title |
|---|
| See also references of EP4380156A4 |
Also Published As
| Publication number | Publication date |
|---|---|
| CN115695812A (zh) | 2023-02-03 |
| US12457351B2 (en) | 2025-10-28 |
| EP4380156A4 (en) | 2025-07-09 |
| EP4380156A1 (en) | 2024-06-05 |
| US20240380911A1 (en) | 2024-11-14 |
| JP2024521528A (ja) | 2024-05-31 |
| JP7698746B2 (ja) | 2025-06-25 |
| KR20240024975A (ko) | 2024-02-26 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| WO2023005579A1 (zh) | 视频编码、视频解码方法、装置、电子设备和存储介质 | |
| US8428144B2 (en) | Method and apparatus for decoding/encoding of a video signal | |
| CN101658038A (zh) | 针对可缩放视频编码的视频可用信息的方法和设备 | |
| US11336965B2 (en) | Method and apparatus for processing video bitstream, network device, and readable storage medium | |
| CN118435608A (zh) | 用于转换胶片颗粒元数据的方法和装置 | |
| CN104754358A (zh) | 码流的生成和处理方法、装置及系统 | |
| US20050114887A1 (en) | Quality of video | |
| KR20090099547A (ko) | 멀티뷰 코딩 비디오에서 비디오 에러 정정을 위한 방법 및 장치 | |
| CN115668936A (zh) | 时间运动向量预测、层间参考和时间子层指示的视频编码方面 | |
| JP2025036622A (ja) | 画像復号装置、画像復号方法、画像符号化装置、および画像符号化方法 | |
| US9565454B2 (en) | Picture referencing control for video decoding using a graphics processor | |
| TW202545183A (zh) | 編碼/解碼之方法及設備 | |
| CN120266479A (zh) | 用于胶片颗粒合成的sei信息 | |
| EP4654589A1 (en) | Decoding method, coding method, electronic device, and storage medium | |
| WO2021114305A1 (zh) | 视频处理方法、装置及计算机可读存储介质 | |
| CN115988171A (zh) | 一种视频会议系统及其沉浸式布局方法和装置 | |
| US12452429B2 (en) | Picture encoding method and apparatus, picture decoding method and apparatus, electronic device and storage medium | |
| JP7302144B2 (ja) | ビットストリームの処理方法及び装置 | |
| CN121753332A (zh) | 多层比特流一致性检查以及对应装置 | |
| WO2026040641A1 (zh) | 视频码流的解码方法、编码方法、电子设备及存储介质 | |
| WO2026091948A1 (zh) | 图像编码方法、图像解码方法、装置、设备、存储介质及程序产品 | |
| WO2025148442A1 (zh) | 胶片颗粒区域处理方法、设备及存储介质 | |
| CN120345235A (zh) | 发信号通知表示视频的对准轴线的封装的数据 | |
| HK40056285B (zh) | 视频码流解码方法和解码器、编码器以及电子设备 |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 22848184 Country of ref document: EP Kind code of ref document: A1 |
|
| ENP | Entry into the national phase |
Ref document number: 2023578781 Country of ref document: JP Kind code of ref document: A |
|
| WWE | Wipo information: entry into national phase |
Ref document number: 18577790 Country of ref document: US |
|
| ENP | Entry into the national phase |
Ref document number: 20247002467 Country of ref document: KR Kind code of ref document: A |
|
| WWE | Wipo information: entry into national phase |
Ref document number: 1020247002467 Country of ref document: KR |
|
| WWE | Wipo information: entry into national phase |
Ref document number: 2022848184 Country of ref document: EP |
|
| NENP | Non-entry into the national phase |
Ref country code: DE |
|
| ENP | Entry into the national phase |
Ref document number: 2022848184 Country of ref document: EP Effective date: 20240229 |
|
| WWG | Wipo information: grant in national office |
Ref document number: 18577790 Country of ref document: US |
















