WO2022121786A1 - 视频预测编码的方法及装置 - Google Patents
视频预测编码的方法及装置 Download PDFInfo
- Publication number
- WO2022121786A1 WO2022121786A1 PCT/CN2021/135246 CN2021135246W WO2022121786A1 WO 2022121786 A1 WO2022121786 A1 WO 2022121786A1 CN 2021135246 W CN2021135246 W CN 2021135246W WO 2022121786 A1 WO2022121786 A1 WO 2022121786A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- coding
- block
- mode
- information
- current
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/503—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
- H04N19/51—Motion estimation or motion compensation
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/103—Selection of coding mode or of prediction mode
- H04N19/105—Selection of the reference unit for prediction within a chosen coding or prediction mode, e.g. adaptive choice of position and number of pixels used for prediction
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/103—Selection of coding mode or of prediction mode
- H04N19/107—Selection of coding mode or of prediction mode between spatial and temporal predictive coding, e.g. picture refresh
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/103—Selection of coding mode or of prediction mode
- H04N19/109—Selection of coding mode or of prediction mode among a plurality of temporal predictive coding modes
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/119—Adaptive subdivision aspects, e.g. subdivision of a picture into rectangular or non-rectangular coding blocks
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/134—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
- H04N19/146—Data rate or code amount at the encoder output
- H04N19/147—Data rate or code amount at the encoder output according to rate distortion criteria
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/134—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
- H04N19/157—Assigned coding mode, i.e. the coding mode being predefined or preselected to be further used for selection of another element or parameter
- H04N19/159—Prediction type, e.g. intra-frame, inter-frame or bidirectional frame prediction
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/17—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
- H04N19/176—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/182—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a pixel
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/189—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the adaptation method, adaptation tool or adaptation type used for the adaptive coding
- H04N19/192—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the adaptation method, adaptation tool or adaptation type used for the adaptive coding the adaptation method, adaptation tool or adaptation type being iterative or recursive
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/503—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
- H04N19/51—Motion estimation or motion compensation
- H04N19/513—Processing of motion vectors
- H04N19/517—Processing of motion vectors by encoding
- H04N19/52—Processing of motion vectors by encoding by predictive encoding
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/503—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
- H04N19/51—Motion estimation or motion compensation
- H04N19/557—Motion estimation characterised by stopping computation or iteration based on certain criteria, e.g. error magnitude being too large or early exit
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/503—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
- H04N19/51—Motion estimation or motion compensation
- H04N19/567—Motion estimation based on rate distortion criteria
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/503—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
- H04N19/51—Motion estimation or motion compensation
- H04N19/577—Motion compensation with bidirectional frame interpolation, i.e. using B-pictures
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/59—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving spatial sub-sampling or interpolation, e.g. alteration of picture size or resolution
Definitions
- the embodiments of the present application relate to video coding technologies, for example, to a method and apparatus for video predictive coding.
- predictive coding is one of the core technologies of video coding, and predictive coding is further divided into intra-frame prediction and inter-frame prediction.
- Intra prediction is based on the spatial correlation of the video image, using the encoded adjacent pixels in the image to predict the current pixel.
- Inter-frame coding uses the coded images to predict the images to be coded according to the temporal correlation of the video images.
- the encoder can eliminate the spatiotemporal correlation of the video, transform, quantize, and entropy encode the predicted residual instead of the original pixel value, thereby greatly improving the encoding efficiency.
- the main video coding standard inter-frame prediction part adopts block-based motion compensation (Motion Compensation, MC) technology.
- the main principle is to perform motion estimation (Motion Estimation, ME) in the reference frame through the prediction unit (PU), search for the matching block, and use the motion compensation technology to generate the prediction block, and then obtain the residual block, and then use the residual block.
- the difference block is used as the input of the subsequent encoding module to perform the next encoding process.
- the motion of the object is not all whole pixel motion, it may only move half a pixel or 1/4 pixel, so it is also necessary to perform sub-pixel ME, that is, to generate sub-pixels by interpolation on the basis of integer pixels. Data, and then search and compensate for matching blocks in sub-pixels.
- High Efficiency Video Coding HEVC
- HEVC High Efficiency Video Coding
- the inter-frame coding PU in HEVC includes at least 7 kinds of 2Nx2N/BIDIR, 2NxN/Nx2N/2NxnD/2NxnU/nRx2N/nLx2N (RECT/AMP), and each PU has to go through the above-mentioned inter-frame coding process, and the computational overhead is very large.
- the present application provides a video predictive coding method and apparatus, to avoid the situation in the related art that the PU of each mode needs to perform an inter-frame coding process including motion compensation during the inter-frame coding process, resulting in high coding computational overhead.
- an embodiment of the present application provides a method for video predictive coding, including:
- the decision information related to the current coding block is obtained, where the decision information includes one of the following: pre-analysis information determined by the encoder when performing pre-analysis for lightweight video coding, the current coding block is in The encoding information of the multiple sub-blocks obtained after recursive encoding, and the executed mode information determined according to the executed mode;
- an embodiment of the present application also provides a device for video predictive coding, including:
- the decision information obtaining module is configured to obtain decision information related to the current coding block during inter-frame predictive coding, where the decision information includes one of the following: pre-analysis information determined by the encoder when performing pre-analysis of lightweight video coding , the encoding information of the multiple sub-blocks obtained after recursive encoding of the current encoding block, and the executed mode information determined according to the executed mode;
- the ME skipping decision module is configured to judge, according to the decision information, whether to skip the motion estimation ME coding of the current coding block.
- an embodiment of the present application further provides an electronic device, including a memory, a processor, and a computer program stored in the memory and running on the processor, where the processor implements the above method when executing the program .
- an embodiment of the present application further provides a computer-readable storage medium on which a computer program is stored, and when the program is executed by a processor, the foregoing method is implemented.
- FIG. 1 is a schematic diagram of a PU division mode provided by an embodiment of the present application
- FIG. 2 is a schematic diagram of a decision process for an optimal mode of an inter-frame predictive coding process of a single PU provided by an embodiment of the present application;
- FIG. 3 is a schematic diagram of an encoding process of a single inter-frame mode provided by an embodiment of the present application
- FIG. 4 is a flowchart of an embodiment of a method for video predictive coding provided by an embodiment of the present application
- FIG. 5 is a flow chart of performing ME fast skip judgment according to pre-analysis information provided by an embodiment of the present application
- FIG. 6 is a schematic diagram of the relationship between recursive division of coding blocks and down-sampling blocks according to an embodiment of the present application
- FIG. 7 is a flow chart of performing ME fast skip decision according to coding information of sub-blocks provided by an embodiment of the present application.
- FIG. 8 is a schematic diagram of recursive division of coding blocks provided by an embodiment of the present application.
- FIG. 9 is a flowchart of a ME fast skip decision of a non-square PU according to the executed mode information of the executed mode provided by an embodiment of the present application.
- FIG. 10 is a structural block diagram of an embodiment of an apparatus for video predictive coding provided by an embodiment of the present application.
- FIG. 11 is a schematic structural diagram of an electronic device provided by an embodiment of the present application.
- Predictive coding refers to using one or several coded sample values to predict the current sample value according to a certain model or method, and to compare the real value of the sample and the value of the sample. The difference between the predicted values is encoded, and the encoding efficiency is greatly improved by transforming, quantizing, and entropy encoding the predicted residual instead of the original pixel value.
- Predictive coding may include intra-prediction Intra mode and inter-prediction Inter mode.
- the new video compression standard HEVC defines a new set of syntax units for image division, including Coding Unit (CU), Prediction Unit (PU) and Transform Unit (TU).
- CU is the basic unit for prediction, transformation, quantization and entropy coding
- PU is the basic unit for predictive coding, including intra-frame prediction or inter-frame prediction
- TU is the basic unit for transformation and quantization.
- the PU specifies all the prediction modes of the CU, and all prediction-related information can be defined in the PU. For example, the direction of intra-frame prediction, the division method of inter-frame prediction, motion vector prediction, and the reference image index number of inter-frame prediction are all available. It belongs to the category of PU.
- the PU division modes included in a 2Nx2N CU may include: for a 2Nx2N CU mode, there are two optional modes for intra-frame prediction PU, 2Nx2N and NxN; frame There are 8 optional modes for inter-prediction PU: 4 symmetric modes (2Nx2N, 2NxN, Nx2N, NxN) and 4 asymmetric modes (2NxnD, 2NxnU, nRx2N, nLx2N), among which, according to the shape of the PU, 2Nx2N can be It is called square PU with NxN, and 2NxN, Nx2N, 2NxnD, 2NxnU, nRx2N, and nLx2N are called non-square PU; skip mode is a kind of inter-frame prediction.
- the motion information to be encoded is only the motion parameter set index
- the encoding residual When encoding residual when en
- the decision-making process may include the following processes (the NxN mode is not used in FIG. 2):
- non-square modes such as 2NxN, Nx2N, 2NxnU, 2NxnD, nLx2N, and nRx2N contain two PUs. Each PU needs to perform MERGE/SKIP mode, forward ME, backward ME, and two-way ME separately, and compare the optimal model.
- the motion of the object since the motion of the object is not all whole pixel motion, it may only move half a pixel or 1/4 pixel, so it is also necessary to perform sub-pixel ME, that is, to generate sub-pixel data by interpolation on the basis of integer pixels. , and then search and compensate for matching blocks in sub-pixels.
- the highest resolution of HEVC luminance component sub-pixel is 1/4 pixel, and the highest resolution of chrominance component is 1/8 pixel.
- FIG. 3 the encoding process of a single inter-frame mode in the above-mentioned various modes of FIG. 2 is shown: for a PU, it is first judged whether to perform sub-pixel ME, and based on the judgment result of performing sub-pixel ME, the motion estimation of sub-pixel is performed. And motion compensation, based on the judgment result of not performing sub-pixel ME, perform motion estimation and motion supplementation of the whole pixel, and then perform Rate Distortion Optimation (RDO) encoding process on the obtained residual block.
- each inter-frame inter PU has to go through the inter-frame coding process in Figure 3, and the overhead is very large. If unnecessary PU ME can be reduced, the resource overhead of the encoder can be greatly reduced, thereby improving the coding. speed and reduce coding costs.
- any encoder that complies with the HEVC coding standard can be applied.
- AVS2, AVS3, AV1, VVC and other standard encoders can also be used directly after adjustment.
- the encoder mentioned in this embodiment can be applied not only to the transcoding service, but also to the real-time encoding service of the mobile terminal.
- FIG. 4 is a flowchart of an embodiment of a video predictive coding method provided by an embodiment of the present application. This embodiment may include the following steps:
- Step 410 During the inter-frame predictive coding, obtain decision information related to the current coding block, where the decision information includes: pre-analysis information determined by the encoder when performing pre-analysis for lightweight video coding, or the current coding Encoding information of multiple sub-blocks obtained after the block is recursively encoded, or executed mode information determined according to the executed mode.
- Step 420 determine whether to skip the motion estimation ME coding of the current coding block.
- skipping the ME process can be divided into two cases, one is to skip all MEs in the above decision-making process; the other is to skip some MEs in the above-mentioned decision-making process, for example, only execute 2Nx2N ME, but after skipping 2Nx2N of other modes of ME.
- this embodiment may use different methods to determine whether to skip the ME process according to different obtained decision information.
- the decision to quickly skip ME coding may be made based on pre-analysis information, and the decision information may include: pre-analysis information determined by the encoder when performing pre-analysis of lightweight video coding, as shown in FIG. 5 .
- step 420 may include the following sub-steps:
- Sub-step S11 according to the size of the current coding block, determine the number of corresponding down-sampling blocks, and index the pre-analysis information of each down-sampling block.
- the encoder may use a pre-analysis module to perform a pre-analysis operation on the video before actually encoding the input video.
- the pre-analysis operation may be: the pre-analysis module is performing a pre-analysis operation on the input video.
- lightweight video encoding is performed to obtain the frame structure, frame type, and encoding quantization parameters (Quantization Parameter, QP) when encoding is actually performed.
- QP Quantization Parameter
- the pre-analyzed coding unit is an 8x8 downsampling block.
- the 8x8 downsampling block corresponds to 16x16 That is, if the method of sub-step S11 is adopted, the size of the coding block in this embodiment is at least 16 ⁇ 16, including 16 ⁇ 16, 32 ⁇ 32, and 64 ⁇ 64.
- the 32x32 coding block can be divided into four 16x16 coding sub-blocks, and each 16x16 coding sub-block corresponds to a down-sampling block ( Lowres shown in Figure 6), therefore, the number of downsampling blocks that can be collected from the 32x32 coding block is 4. Similarly, if it is a 64x64 coding block, it can be divided into 16 16x16 coding sub-blocks, corresponding to 16 down-sampling blocks.
- the ME fast skip decision of the current block can be made according to the pre-analysis information.
- the corresponding down-sampling block may be indexed according to the coordinate information of the current coding block.
- Figure 6 for a 32x32 coding block, it can be divided into 4 16x16 coding sub-blocks, and the coordinate information of the coding block is known, so the coordinate information of each 16x16 coding sub-block can be obtained, and then find the The down-sampling block within the coordinate range of each coding sub-block is used as the down-sampling block corresponding to the coding sub-block. After finding the down-sampling block corresponding to each coding block, the pre-analysis information of the down-sampling block can be obtained.
- the pre-analysis information of the down-sampling block may include, but is not limited to: inter-frame coding cost, intra-frame coding cost, and an optimal mode for prediction of the down-sampling block.
- Sub-step S12 summarizing the pre-analysis information of the down-sampling block, and determining whether to skip ME coding of the current coding block according to the summarizing result.
- the pre-analysis information of all the down-sampling blocks corresponding to the current coding block can be aggregated, and whether to skip the ME process of the inter prediction mode is determined according to the aggregated result.
- sub-step S12 may include the following sub-steps:
- Sub-step S121 Calculate the sum of the inter-frame coding costs of all down-sampling blocks corresponding to the current coding block, and obtain the estimated inter-frame coding cost of the current coding block.
- 4 down-sampling blocks can be indexed. After obtaining the inter-frame coding cost of each down-sampling block, the sum of the inter-frame coding costs of the 4 down-sampling blocks can be calculated as The estimated inter-coding cost InterCost of the 32x32 coding block.
- Sub-step S122 Calculate the sum of the intra-frame coding costs of all down-sampling blocks corresponding to the current coding block to obtain the estimated intra-frame coding cost of the current coding block.
- the sum of the intra-coding costs of the 4 down-sampling blocks is calculated as the estimated intra-frame cost of the 32x32 coding block.
- Encoding cost IntraCost is calculated as the estimated intra-frame cost of the 32x32 coding block.
- Sub-step S123 Calculate the number of down-sampling blocks whose optimal mode is the intra-frame prediction mode in all down-sampling blocks corresponding to the current coding block, and determine that the optimal mode of the current coding block is the intra-frame prediction mode according to the quantity the first ratio.
- the optimal modes of the 4 down-sampling blocks can be calculated as the number of intra-frame prediction Intra modes and the first ratio IntraRatio, For example, assuming that the optimal mode of 2 downsampling blocks is Intra mode, IntraRatio is 2/4; for another example, assuming that the optimal mode of 3 downsampling blocks is Intra mode, then IntraRatio is 3/4.
- Sub-step S124 if the first ratio is greater than the first preset ratio threshold, and the product of the estimated intra-frame coding cost and the first adjustment factor is less than the estimated inter-frame coding cost and the second adjustment factor product, the ME coding of the current coding block is skipped, wherein the first adjustment factor is smaller than the second adjustment factor.
- a is a first preset ratio threshold
- b is a first adjustment factor
- c is a second adjustment factor
- skipME is skip ME.
- a, b, and c can be adjusted according to requirements.
- the encoding speed can be linearly increased or decreased to meet different business needs.
- b ⁇ c can be set according to business needs.
- the default values of a, b, and c can be set to 0.5, 1, and 4.
- a decision to quickly skip ME may be made based on the coded information of the sub-block, and the decision information may include: multiple sub-blocks obtained after recursive coding of the coding block Encoding information for the block.
- the coding information of the sub-block may at least include: the optimal mode of the sub-block and whether the sub-block is further recursively divided.
- step 420 may include the following sub-steps:
- Sub-step S21 according to the optimal mode of each sub-block of the current coding block, determine that the optimal mode of the current coding block is the second ratio of the intra prediction mode, and the optimal mode of the current coding block is the third ratio of the SKIP mode. ratio.
- the second ratio at which the optimal mode of the current coding block is the intra-frame prediction mode may be the number of sub-blocks whose optimal mode is the intra-frame prediction Intra mode in the current coding block to all sub-blocks of the current coding block
- the ratio of the total number; and the third ratio of the optimal mode of the current coding block is the SKIP mode may be the number of sub-blocks whose optimal mode is the intra-frame prediction SKIP mode in the current coding block and the total number of all sub-blocks of the current coding block ratio.
- the number of sub-blocks whose optimal mode is Intra mode is 1, and the optimal mode is SKIP
- Sub-step S22 according to whether each sub-block of the current coding block is further recursively divided, determine the sub-block recursion ratio of the current coding block.
- the sub-block recursion ratio is used to reflect the ratio of sub-blocks that can be further divided in the current coding block.
- Sub-step S23 determine whether to skip ME coding of the current coding block.
- sub-step S23 may include the following sub-steps:
- SubSplitRatio is the sub-block recursion ratio
- SubIntraRatio is the second ratio
- threshold_A is the preset recursion ratio threshold
- threshold_B is the second preset ratio threshold
- threshold_A and threshold_B can be set according to actual service requirements.
- sub-step S23 may include the following sub-steps:
- the third ratio is greater than a third preset ratio threshold, and the sub-block recursion ratio is greater than a fourth preset ratio threshold, ME coding of the current coding block is skipped.
- SubSplitRatio is the sub-block recursion ratio
- SubSkipRatio is the third ratio
- threshold_C is the third preset ratio threshold
- threshold_D is the fourth preset ratio threshold
- threshold_C and threshold_D can be set according to actual service requirements.
- the decision information may include according to the executed mode.
- Determined executed mode information may at least include: a stage optimal mode SATD cost and an intra-frame prediction SATD cost; step 420 may include the following sub-steps:
- the SATD cost of the phase-optimized mode is the SATD cost of the phase-optimized mode obtained by performing mode judgment after this mode is executed.
- the SATD cost of intra-frame prediction may be the SATD cost of the optimal intra-frame mode determined through SATD rough selection during intra-frame prediction.
- IntraSATDCost is the intra-frame prediction SATD cost
- InterSATDCost is the stage optimal mode SATD cost
- ⁇ is the third adjustment factor
- ⁇ is the fourth adjustment factor.
- both ⁇ and ⁇ can be adjusted according to actual business requirements. For example, the default value of ⁇ can be 5, and the default value of ⁇ can be 3.
- IntraSATDCost* ⁇ is less than ⁇ *InterSATDCost, it means that the spatial correlation of the current block is stronger than the temporal correlation, so the ME coding process of the current coding block can be skipped.
- each PU needs to perform MERGE separately mode, forward ME, backward ME, and bidirectional ME, and compare the optimal mode.
- the overall ME process is highly complex.
- the ME mode that has been executed can be used to quickly jump out of the ME of the non-square PU.
- the decision information may include the executed mode information determined according to the executed mode, wherein, the executed mode may include at least the 2Nx2N mode and the BIDIR mode in FIG. 2, then the executed mode information may include: The MERGE mode SATD cost of the MERGE mode, and the phase-optimal mode SATD cost.
- step 420 may include the following sub-steps:
- Sub-step S31 if the current prediction unit is a non-square prediction unit, determine whether the SATD cost of the MERGE mode is less than the SATD cost of the stage optimal mode.
- Sub-step S32 if the SATD cost of the MERGE mode is less than the SATD cost of the optimal mode of the stage, execute the MERGE mode on the non-square prediction unit, and skip forward ME coding and backward ME coding of the non-square prediction unit.
- ME coding and bidirectional ME coding if the SATD cost of the MERGE mode is less than the SATD cost of the optimal mode of the stage, execute the MERGE mode on the non-square prediction unit, and skip forward ME coding and backward ME coding of the non-square prediction unit.
- the MERGE mode SATD cost of the MERGE mode executed in the 2Nx2N mode is less than the stage-optimal mode SATD cost, only the current non-square PU is executed in the MERGE mode, and the forward ME coding and the backward ME are no longer performed. Encoding and bidirectional ME encoding process.
- the optimal value of the current coding block may also be considered.
- the mode is the ratio of the SKIP mode, that is, in the coding sub-blocks of the current coding block, the optimal mode is the ratio of the sub-blocks of the SKIP mode, and the calculation method can refer to the above-mentioned third ratio.
- Step 420 may include the following sub-steps:
- the SATD cost of the MERGE mode is less than the SATD cost of the optimal mode of the stage, and the ratio of the optimal mode of the current coding block to the SKIP mode is greater than a preset ratio threshold, execute the MERGE mode on the non-square prediction unit , and skip forward ME coding, backward ME coding and bidirectional ME coding of the non-square prediction unit.
- SubSkipRatio is the ratio of the optimal mode of the current coding block to the SKIP mode
- MrgCost is the MERGE mode SATD cost of the MERGE mode executed in the 2Nx2N mode
- InterSATDCost is the stage optimal mode SATD cost
- ⁇ , a1, b1 are adjustment factors, respectively , which can be set according to actual business requirements. For example, the default value of ⁇ can be set to 0.8, and the default values of a1 and b1 are both 1.
- skipMELeft means that RECT/AMP only executes the MERGE mode, and no longer performs forward ME, backward ME, and bidirectional ME.
- the inter-frame predictive coding it can be determined whether to skip the motion estimation ME coding of the current coding block according to the obtained decision information related to the current coding block.
- the decision information may include that the encoder is performing light.
- the pre-analysis information determined during the pre-analysis of the video coding of the magnitude of information different fast decision schemes can be used to make ME skip decision to reduce unnecessary PU ME, which can greatly reduce the coding complexity, reduce the resource overhead of the encoder, and improve the coding compression of the encoder.
- the throughput rate of the transcoding service can be improved, thereby saving server resources and reducing the cost of transcoding.
- the coding speed can be increased by 18% under the condition that the compression rate of the encoder is lost by 0.2%. From the perspective of online transcoding applications, it can save 18% of server computing resources under the condition of minor image quality loss, thereby reducing transcoding costs.
- FIG. 10 is a structural block diagram of an apparatus embodiment of a video predictive coding apparatus provided by an embodiment of the present application, which may include the following modules:
- the decision information obtaining module 1001 is configured to obtain decision information related to the current coding block during inter-frame predictive coding, where the decision information includes: pre-analysis information determined when the encoder performs pre-analysis of lightweight video coding, or , the encoding information of the multiple sub-blocks obtained after recursive encoding of the current encoding block, or, the executed mode information determined according to the executed mode;
- the skip ME judgment module 1002 is configured to judge whether to skip the motion estimation ME coding of the current coding block according to the judgment information.
- the skip ME decision module 1002 may include the following sub-modules:
- the pre-analysis information index submodule is set to determine the number of corresponding down-sampling blocks according to the size of the current coding block, and index the pre-analysis information of each down-sampling block;
- the pre-analysis information decision sub-module is configured to summarize the pre-analysis information of the down-sampling block, and judge whether to skip ME coding of the current coding block according to the summary result.
- the pre-analysis information of the down-sampling block includes an inter-coding cost, an intra-coding cost, and an optimal mode for prediction of the down-sampling block;
- the pre-analysis information decision sub-module is set to:
- the product of the estimated intra-coding cost and the first adjustment factor is less than the product of the estimated inter-coding cost and the second adjustment factor, skip through the ME encoding of the current encoding block, wherein the first adjustment factor is smaller than the second adjustment factor.
- the encoding information of the sub-blocks includes: the optimal mode of the sub-block and the Whether the sub-block is further recursively divided;
- the skip ME decision module 1002 may include the following sub-modules:
- the ratio calculation submodule is set to determine the optimal mode of the current coding block as the second ratio of the intra-frame prediction mode according to the optimal mode of each sub-block of the current coding block, and the optimal mode of the current coding block is the SKIP mode and, according to whether each sub-block of the current coding block is further recursively divided, determine the sub-block recursion ratio of the current coding block;
- the ratio decision submodule is configured to determine whether to skip ME coding of the current coding block according to the second ratio, the third ratio and/or the sub-block recursion ratio.
- the ratio decision sub-module is set to:
- sub-block recursion ratio is greater than a preset recursion ratio threshold, and the second ratio is greater than a second preset ratio threshold, skip ME coding of the current coding block;
- the third ratio is greater than a third preset ratio threshold, and the sub-block recursion ratio is greater than a fourth preset ratio threshold, ME coding of the current coding block is skipped.
- the executed mode information when the decision information is executed mode information determined according to the executed mode, includes: a stage optimal mode SATD cost and an intra-frame prediction SATD cost;
- the skip ME decision module 1002 may include the following sub-modules:
- the SATD cost judgment sub-module is set to if the current encoding block is not recursively encoded, and the product of the intra-frame prediction SATD cost and the third adjustment factor is less than the product of the optimal mode SATD cost and the fourth adjustment factor, Then, the ME coding of the current coding block is skipped, wherein the third adjustment factor is greater than the fourth adjustment factor.
- the executed mode when the decision information is executed mode information determined according to the executed mode, the executed mode includes at least a 2Nx2N mode and a BIDIR mode, and the executed mode information includes: 2Nx2N mode The MERGE mode SATD cost of the MERGE mode executed in , and the phase-optimal mode SATD cost;
- the skip ME decision module 1002 may include the following sub-modules:
- the first part of ME skips the judgment sub-module, and is set to judge whether the SATD cost of the MERGE mode is less than the SATD cost of the optimal mode if the current prediction unit is a non-square prediction unit; if the SATD cost of the MERGE mode is less than the If the SATD cost of the stage-optimal mode is satisfied, the MERGE mode is performed on the non-square PU, and forward ME coding, backward ME coding and bidirectional ME coding of the non-square PU are skipped.
- the executed mode information further includes the ratio at which the optimal mode of the current coding block is the SKIP mode
- the skip ME decision module 1002 may include the following sub-modules:
- the first part of the ME skip decision sub-module is set to if the SATD cost of the MERGE mode is less than the SATD cost of the optimal mode of the stage, and the ratio of the optimal mode of the current coding block to the SKIP mode is greater than a preset ratio threshold, Then the MERGE mode is performed on the non-square PU, and forward ME coding, backward ME coding and bidirectional ME coding of the non-square PU are skipped.
- the apparatus for video predictive coding provided by the embodiments of the present application can execute the video predictive coding methods provided by the embodiments of the present application, and has functional modules and beneficial effects corresponding to the execution methods.
- FIG. 11 is a schematic structural diagram of an electronic device provided by an embodiment of the application.
- the electronic device includes a processor 1110, a memory 1120, an input device 1130, and an output device 1140; the number of processors 1110 in the electronic device There may be one or more, and a processor 1110 is taken as an example in FIG. 11; the processor 1110, the memory 1120, the input device 1130 and the output device 1140 in the electronic device can be connected through a bus or other means, and in FIG. Connect as an example.
- the memory 1120 may be configured to store software programs, computer-executable programs, and modules, such as program instructions/modules corresponding to the methods in the embodiments of the present application.
- the processor 1110 executes various functional applications and data processing of the electronic device by running the software programs, instructions and modules stored in the memory 1120, ie, implements the above-mentioned method.
- the memory 1120 may mainly include a stored program area and a stored data area, wherein the stored program area
- memory 1120 may include high-speed random access memory, and may also include non-volatile memory, such as at least one magnetic disk storage device, flash memory device, or other non-volatile solid-state storage device.
- memory 1120 may include memory located remotely from processor 1110, which may be connected to the electronic device through a network. Examples of such networks include, but are not limited to, the Internet, intranets, local area networks, mobile communication networks, and combinations thereof.
- the input device 1130 may be configured to receive input numerical or character information, and generate key signal input related to user settings and function control of the electronic device.
- the output device 1140 may include a display device such as a display screen.
- the embodiments of the present application further provide a storage medium containing computer-executable instructions, where the computer-executable instructions, when executed by a processor of a server, are configured to execute the method in any of the embodiments of the present application.
- the present application can be implemented by means of software and necessary general-purpose hardware, and certainly can also be implemented by hardware. Based on such understanding, the technical solutions of the present application can be embodied in the form of software products in essence or the parts that make contributions to related technologies, and the computer software products can be stored in a computer-readable storage medium, such as a computer floppy disk, Read-Only Memory (ROM), Random Access Memory (RAM), flash memory (FLASH), hard disk or optical disk, etc., including several instructions to make a computer device (which can be a personal computer, A server, or a network device, etc.) executes the methods described in the various embodiments of the present application.
- the computer-readable storage medium may be a non-transitory computer-readable storage medium.
- the multiple units and modules included are only divided according to functional logic, but are not limited to the above-mentioned division, as long as the corresponding functions can be realized;
- the specific names of the functional units are only for the convenience of distinguishing from each other, and are not used to limit the protection scope of the present application.
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Computing Systems (AREA)
- Theoretical Computer Science (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
Abstract
Description
Claims (11)
- 一种视频预测编码的方法,包括:在帧间预测编码时,获取与当前编码块相关的判决信息,所述判决信息包括以下之一:编码器在进行轻量级视频编码预分析时确定的预分析信息,所述当前编码块在进行递归编码后得到的多个子块的编码信息,以及根据已执行的模式确定的已执行模式信息;根据所述判决信息,判断是否跳过所述当前编码块的运动估计ME编码。
- 根据权利要求1所述的方法,其中,响应于确定所述判决信息为编码器在进行轻量级视频编码预分析时确定的预分析信息,所述根据所述判决信息,判断是否跳过所述当前编码块的运动估计ME编码,包括:根据所述当前编码块的大小,确定对应的下采样块的数量,以及索引每个下采样块的预分析信息,其中,所述下采样块的数量为多个;汇总所述下采样块的预分析信息,并根据所述汇总的结果判断是否跳过所述当前编码块的ME编码。
- 根据权利要求2所述的方法,其中,所述下采样块的预分析信息包括帧间编码代价、帧内编码代价、以及所述下采样块预测的最优模式;所述汇总所述下采样块的预分析信息,并根据所述汇总的结果判断是否跳过所述当前编码块的ME编码,包括:计算所述当前编码块对应的所有下采样块的帧间编码代价的总和,得到所述当前编码块的预估帧间编码代价;计算所述当前编码块对应的所有下采样块的帧内编码代价的总和,得到所述当前编码块的预估帧内编码代价;计算所述当前编码块对应的所有下采样块中最优模式为帧内预测模式的下采样块的数量,并根据所述数量确定所述当前编码块的最优模式为帧内预测模式的第一比率;响应于确定所述第一比率大于第一预设比率阈值,且所述预估帧内编码代价与第一调节因子的乘积小于所述预估帧间编码代价与第二调节因子的乘积,跳过所述当前编码块的ME编码,其中,所述第一调节因子小于所述第二调节因子。
- 根据权利要求1所述的方法,其中,响应于确定所述判决信息为所述当前编码块在进行递归编码后得到的多个子块的编码信息,所述子块的编码信息包括:每个子块的最优模式以及所述每个子块是否递归划分;所述根据所述判决信息,判断是否跳过所述当前编码块的运动估计ME编码,包括:根据所述当前编码块的所述每个子块的最优模式,确定所述当前编码块的最优模式为帧内预测模式的第二比率,以及所述当前编码块的最优模式为SKIP 模式的第三比率;根据所述当前编码块的所述每个子块是否递归划分,确定所述当前编码块的子块递归比率;根据所述第二比率、所述第三比率和所述子块递归比率中的至少之一,判断是否跳过所述当前编码块的ME编码。
- 根据权利要求4所述的方法,其中,所述根据所述第二比率、所述第三比率和所述子块递归比率中的至少之一,判断是否跳过所述当前编码块的ME编码,包括:响应于确定所述子块递归比率大于预设递归比率阈值,且所述第二比率大于第二预设比率阈值,跳过所述当前编码块的ME编码;或者,响应于确定所述第三比率大于第三预设比率阈值,且所述子块递归比率大于第四预设比率阈值,跳过所述当前编码块的ME编码。
- 根据权利要求1所述的方法,其中,响应于确定所述判决信息为根据已执行的模式确定的已执行模式信息,所述已执行模式信息包括:阶段最优模式SATD代价以及帧内预测SATD代价;所述根据所述判决信息,判断是否跳过所述当前编码块的运动估计ME编码,包括:响应于确定所述当前编码块未进行递归编码,且所述帧内预测SATD代价与第三调节因子的乘积小于所述阶段最优模式SATD代价与第四调节因子的乘积,跳过所述当前编码块的ME编码,其中,所述第三调节因子大于所述第四调节因子。
- 根据权利要求1所述的方法,其中,响应于确定所述判决信息为根据已执行的模式确定的已执行模式信息,所述已执行的模式至少包括2Nx2N模式以及BIDIR模式,所述已执行模式信息包括:2Nx2N模式中执行的MERGE模式的MERGE模式SATD代价,以及阶段最优模式SATD代价,N为正整数;所述根据所述判决信息,判断是否跳过所述当前编码块的运动估计ME编码,包括:响应于确定所述当前预测单元为非正方形预测单元,判断所述MERGE模式SATD代价是否小于所述阶段最优模式SATD代价;响应于确定所述MERGE模式SATD代价小于所述阶段最优模式SATD代价,对所述非正方形预测单元执行MERGE模式,并跳过所述非正方形预测单元的前向ME编码、后向ME编码以及双向ME编码。
- 根据权利要求7所述的方法,其中,响应于确定所述当前编码块进行了递归编码,所述已执行模式信息还包括所述当前编码块的最优模式为SKIP模式的比率;所述根据所述判决信息,判断是否跳过所述当前编码块的运动估计ME编码,还包括:响应于确定所述MERGE模式SATD代价小于所述阶段最优模式SATD代价,且所述当前编码块的最优模式为SKIP模式的比率大于预设比率阈值,对所述非正方形预测单元执行MERGE模式,并跳过所述非正方形预测单元的前向ME编码、后向ME编码以及双向ME编码。
- 一种视频预测编码的装置,包括:判决信息获取模块,设置为在帧间预测编码时,获取与当前编码块相关的判决信息;所述判决信息包括以下之一:编码器在进行轻量级视频编码预分析时确定的预分析信息,所述当前编码块在进行递归编码后得到的多个子块的编码信息,以及根据已执行的模式确定的已执行模式信息;跳过ME判决模块,设置为根据所述判决信息,判断是否跳过所述当前编码块的运动估计ME编码。
- 一种电子设备,包括存储器、处理器及存储在存储器上并可在处理器上运行的计算机程序,所述处理器执行所述程序时实现如权利要求1-8中任一所述的方法。
- 一种计算机可读存储介质,其上存储有计算机程序,所述计算机程序被处理器执行时实现如权利要求1-8任一所述的方法。
Priority Applications (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| EP21902493.2A EP4262205A4 (en) | 2020-12-11 | 2021-12-03 | METHOD AND APPARATUS FOR PREDICTIVE VIDEO CODING |
| US18/256,894 US12581057B2 (en) | 2020-12-11 | 2021-12-03 | Video predictive coding method and apparatus |
| JP2023535576A JP7589929B2 (ja) | 2020-12-11 | 2021-12-03 | 動画予測符号化の方法及び装置 |
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202011459744.1 | 2020-12-11 | ||
| CN202011459744.1A CN112637592B (zh) | 2020-12-11 | 2020-12-11 | 一种视频预测编码的方法及装置 |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2022121786A1 true WO2022121786A1 (zh) | 2022-06-16 |
Family
ID=75312249
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/CN2021/135246 Ceased WO2022121786A1 (zh) | 2020-12-11 | 2021-12-03 | 视频预测编码的方法及装置 |
Country Status (5)
| Country | Link |
|---|---|
| US (1) | US12581057B2 (zh) |
| EP (1) | EP4262205A4 (zh) |
| JP (1) | JP7589929B2 (zh) |
| CN (1) | CN112637592B (zh) |
| WO (1) | WO2022121786A1 (zh) |
Cited By (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN116389763A (zh) * | 2023-06-05 | 2023-07-04 | 瀚博半导体(上海)有限公司 | 基于多种编码器的视频编码方法和装置 |
| CN116996679A (zh) * | 2023-06-16 | 2023-11-03 | 有半岛(北京)信息科技有限公司 | 基于预编码信息拆分的编码加速方法及系统 |
Families Citing this family (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN112637592B (zh) * | 2020-12-11 | 2024-07-05 | 百果园技术(新加坡)有限公司 | 一种视频预测编码的方法及装置 |
| CN115314716A (zh) * | 2021-05-06 | 2022-11-08 | 腾讯科技(深圳)有限公司 | 视频编码方法、装置、计算机可读介质及电子设备 |
| CN113645465B (zh) * | 2021-08-06 | 2023-10-13 | 广州安思创信息技术有限公司 | 一种基于hevc的帧间cu深度估计方法及装置 |
| CN114584768A (zh) * | 2022-02-17 | 2022-06-03 | 百果园技术(新加坡)有限公司 | 视频编码控制方法、装置、设备和存储介质 |
| CN114845103B (zh) * | 2022-06-08 | 2025-08-29 | 百果园技术(新加坡)有限公司 | 基于肤色检测的视频编码控制方法及装置 |
Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20080019448A1 (en) * | 2006-07-24 | 2008-01-24 | Samsung Electronics Co., Ltd. | Motion estimation apparatus and method and image encoding apparatus and method employing the same |
| CN104918047A (zh) * | 2015-05-29 | 2015-09-16 | 北京奇艺世纪科技有限公司 | 一种双向运动估计的排除方法和装置 |
| CN107396102A (zh) * | 2017-08-30 | 2017-11-24 | 中南大学 | 一种基于Merge技术运动矢量的帧间模式快速选择方法及装置 |
| CN108848381A (zh) * | 2018-06-20 | 2018-11-20 | 腾讯科技(深圳)有限公司 | 视频编码方法、解码方法、装置、计算机设备及存储介质 |
| CN112637592A (zh) * | 2020-12-11 | 2021-04-09 | 百果园技术(新加坡)有限公司 | 一种视频预测编码的方法及装置 |
Family Cites Families (15)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| BRPI0411765A (pt) | 2003-06-25 | 2006-08-08 | Thomson Licensing | codificação de decisão modal rápida para inter-quadros |
| US8848799B2 (en) * | 2009-09-02 | 2014-09-30 | Sony Computer Entertainment Inc. | Utilizing thresholds and early termination to achieve fast motion estimation in a video encoder |
| KR20110069740A (ko) * | 2009-12-17 | 2011-06-23 | 에스케이 텔레콤주식회사 | 영상 부호화/복호화 방법 및 장치 |
| WO2011129672A2 (ko) * | 2010-04-16 | 2011-10-20 | 에스케이텔레콤 주식회사 | 영상 부호화/복호화 장치 및 방법 |
| JP5598199B2 (ja) | 2010-09-16 | 2014-10-01 | 富士通株式会社 | 動画像符号化装置 |
| JP5655480B2 (ja) | 2010-10-04 | 2015-01-21 | 富士通株式会社 | 動画像符号化装置 |
| KR101337345B1 (ko) * | 2012-01-27 | 2013-12-06 | 한양대학교 산학협력단 | 부호화기 및 서브펠 움직임 추정 생략 방법 |
| US9667996B2 (en) * | 2013-09-26 | 2017-05-30 | Qualcomm Incorporated | Sub-prediction unit (PU) based temporal motion vector prediction in HEVC and sub-PU design in 3D-HEVC |
| US10187655B2 (en) * | 2015-09-30 | 2019-01-22 | Apple Inc. | Memory-to-memory low resolution motion estimation systems and methods |
| US10652570B2 (en) | 2016-01-15 | 2020-05-12 | Nec Corporation | Moving image encoding device, moving image encoding method, and recording medium for recording moving image encoding program |
| CN105828084B (zh) | 2016-03-30 | 2021-04-13 | 腾讯科技(深圳)有限公司 | Hevc帧间编码处理方法和装置 |
| CN110351552B (zh) * | 2018-04-03 | 2021-06-25 | 杭州微帧信息科技有限公司 | 视频编码中一种快速编码方法 |
| CN109729351B (zh) | 2018-12-18 | 2020-09-29 | 中南大学 | 一种低复杂度配置下的hevc快速模式选择方法 |
| EP3893505A4 (en) * | 2018-12-27 | 2022-05-11 | Huawei Technologies Co., Ltd. | METHOD AND DEVICE FOR DETERMINING PREDICTION MODE, ENCODING DEVICE AND DECODING DEVICE |
| CN110896481B (zh) | 2019-12-31 | 2022-03-08 | 杭州当虹科技股份有限公司 | 一种适用于hevc的快速帧内模式编码方法 |
-
2020
- 2020-12-11 CN CN202011459744.1A patent/CN112637592B/zh active Active
-
2021
- 2021-12-03 WO PCT/CN2021/135246 patent/WO2022121786A1/zh not_active Ceased
- 2021-12-03 US US18/256,894 patent/US12581057B2/en active Active
- 2021-12-03 EP EP21902493.2A patent/EP4262205A4/en active Pending
- 2021-12-03 JP JP2023535576A patent/JP7589929B2/ja active Active
Patent Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20080019448A1 (en) * | 2006-07-24 | 2008-01-24 | Samsung Electronics Co., Ltd. | Motion estimation apparatus and method and image encoding apparatus and method employing the same |
| CN104918047A (zh) * | 2015-05-29 | 2015-09-16 | 北京奇艺世纪科技有限公司 | 一种双向运动估计的排除方法和装置 |
| CN107396102A (zh) * | 2017-08-30 | 2017-11-24 | 中南大学 | 一种基于Merge技术运动矢量的帧间模式快速选择方法及装置 |
| CN108848381A (zh) * | 2018-06-20 | 2018-11-20 | 腾讯科技(深圳)有限公司 | 视频编码方法、解码方法、装置、计算机设备及存储介质 |
| CN112637592A (zh) * | 2020-12-11 | 2021-04-09 | 百果园技术(新加坡)有限公司 | 一种视频预测编码的方法及装置 |
Non-Patent Citations (1)
| Title |
|---|
| See also references of EP4262205A4 * |
Cited By (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN116389763A (zh) * | 2023-06-05 | 2023-07-04 | 瀚博半导体(上海)有限公司 | 基于多种编码器的视频编码方法和装置 |
| CN116389763B (zh) * | 2023-06-05 | 2023-08-11 | 瀚博半导体(上海)有限公司 | 基于多种编码器的视频编码方法和装置 |
| CN116996679A (zh) * | 2023-06-16 | 2023-11-03 | 有半岛(北京)信息科技有限公司 | 基于预编码信息拆分的编码加速方法及系统 |
| CN116996679B (zh) * | 2023-06-16 | 2026-02-06 | 有半岛(北京)信息科技有限公司 | 基于预编码信息拆分的编码加速方法及系统 |
Also Published As
| Publication number | Publication date |
|---|---|
| US20240098248A1 (en) | 2024-03-21 |
| CN112637592A (zh) | 2021-04-09 |
| JP2023553628A (ja) | 2023-12-25 |
| US12581057B2 (en) | 2026-03-17 |
| EP4262205A4 (en) | 2024-06-19 |
| CN112637592B (zh) | 2024-07-05 |
| EP4262205A1 (en) | 2023-10-18 |
| JP7589929B2 (ja) | 2024-11-26 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN112637592B (zh) | 一种视频预测编码的方法及装置 | |
| JP7610233B2 (ja) | 動画予測符号化の方法および装置 | |
| CN110351552B (zh) | 视频编码中一种快速编码方法 | |
| US10798414B2 (en) | Method and device for selective multi-sample intra-prediction | |
| CN112075077B (zh) | 图像预测方法、装置、设备、系统及存储介质 | |
| CN105141954A (zh) | 一种hevc帧间编码快速模式选择方法 | |
| CN111355951A (zh) | 视频解码方法、装置及解码设备 | |
| CN113491132B (zh) | 视频图像解码、编码方法、装置及可读存储介质 | |
| CN111385572A (zh) | 预测模式确定方法、装置及编码设备和解码设备 | |
| US11962803B2 (en) | Method and device for intra-prediction | |
| JP2021536689A (ja) | ピクチャパーティショニング方法及び機器 | |
| CN111355959A (zh) | 一种图像块划分方法及装置 | |
| CN110139099B (zh) | 基于预编码和编码satd值加权的帧间预测模式选择方法 | |
| WO2020143684A1 (zh) | 图像预测方法、装置、设备、系统及存储介质 | |
| WO2020224476A1 (zh) | 一种图像划分方法、装置及设备 | |
| CN105872564A (zh) | H.264/avc到h.265/hevc转码的帧内预测模式决定方法及系统 | |
| WO2023123736A1 (zh) | 预测方法、装置、设备、系统、及存储介质 | |
| WO2020134817A1 (zh) | 预测模式确定方法、装置及编码设备和解码设备 | |
| WO2020043136A1 (zh) | 图像划分方法及装置 | |
| CN112135128A (zh) | 图像预测方法、编码树节点划分方法及其装置 | |
| CN112135148B (zh) | 非可分离变换方法以及设备 | |
| RU2837746C1 (ru) | Способ и устройство для прогнозного видеокодирования | |
| Wang et al. | Optimizing Technology in Video Coding and Decoding | |
| KR20180121340A (ko) | 압축 영역에서 야간 영상의 장면 전환 검출 방법 및 장치 | |
| WO2025145288A1 (zh) | 编解码方法、码流、编码器、解码器以及存储介质 |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 21902493 Country of ref document: EP Kind code of ref document: A1 |
|
| WWE | Wipo information: entry into national phase |
Ref document number: 18256894 Country of ref document: US Ref document number: 2023535576 Country of ref document: JP |
|
| NENP | Non-entry into the national phase |
Ref country code: DE |
|
| ENP | Entry into the national phase |
Ref document number: 2021902493 Country of ref document: EP Effective date: 20230711 |
|
| WWG | Wipo information: grant in national office |
Ref document number: 2023116573 Country of ref document: RU |
|
| WWG | Wipo information: grant in national office |
Ref document number: 18256894 Country of ref document: US |