WO2012081949A2 - 인터 예측 방법 및 그 장치 - Google Patents
인터 예측 방법 및 그 장치 Download PDFInfo
- Publication number
- WO2012081949A2 WO2012081949A2 PCT/KR2011/009772 KR2011009772W WO2012081949A2 WO 2012081949 A2 WO2012081949 A2 WO 2012081949A2 KR 2011009772 W KR2011009772 W KR 2011009772W WO 2012081949 A2 WO2012081949 A2 WO 2012081949A2
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- motion information
- motion
- motion vector
- reference picture
- information
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/503—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
- H04N19/51—Motion estimation or motion compensation
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/503—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
- H04N19/51—Motion estimation or motion compensation
- H04N19/513—Processing of motion vectors
- H04N19/517—Processing of motion vectors by encoding
- H04N19/52—Processing of motion vectors by encoding by predictive encoding
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/70—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
Definitions
- the present invention relates to image processing, and more particularly, to an inter prediction method and apparatus.
- an inter prediction technique for predicting a pixel value included in a current picture from a previous and / or subsequent picture in time, and for predicting a pixel value included in a current picture using pixel information in the current picture.
- An intra prediction technique an entropy encoding technique of allocating a short code to a symbol with a high frequency of appearance and a long code to a symbol with a low frequency of appearance may be used.
- An object of the present invention is to provide an image encoding method and apparatus for improving image encoding / decoding efficiency.
- Another object of the present invention is to provide an image decoding method and apparatus for improving image encoding / decoding efficiency.
- Another object of the present invention is to provide an inter prediction method and apparatus for improving image encoding / decoding efficiency.
- Another object of the present invention is to provide a method and apparatus for encoding temporal motion information that can increase image encoding / decoding efficiency.
- Another technical problem of the present invention is to provide a method and apparatus for decoding temporal motion information that can increase image encoding / decoding efficiency.
- An embodiment of the present invention is an inter prediction method.
- the method includes deriving reference motion information for a decoding target unit in a current picture, and performing motion compensation on the decoding target unit using the derived reference motion information.
- Reference motion information is motion information included in a reference picture for the current picture, and includes a reference picture list, a reference picture index, a motion vector, and a prediction direction. And a motion vector predictor.
- deriving of the reference motion information may further include extracting the reference motion information from the reference picture.
- the extracting the reference motion information comprises: counting the number of occurrences of each of the plurality of motion information in the reference picture, obtaining count information, and based on the obtained count information, the reference
- the method may further include selecting the reference motion information from among the plurality of motion information in the picture.
- the extracting of the reference motion information comprises: performing a median operation on motion information in the reference picture to derive a motion information median value and converting the motion information median value to the reference motion.
- the method may further include extracting the information.
- the extracting of the reference motion information may further include performing sub-sampling on the motion information in the reference picture.
- the method of claim 5, wherein the performing of the subsampling includes selecting a block having a predetermined position among a plurality of second sized blocks included in the first sized block in the reference picture, and moving the corresponding block.
- the method may further include extracting the information as the reference motion information, and each motion information in the reference picture may be stored in units of the second size block.
- the extracting of the reference motion information comprises: grouping motion information in the reference picture into a plurality of groups and based on a frequency of occurrence of motion information in each of the plurality of groups. And selecting the number of pieces of motion information as the reference motion information, wherein in the grouping step, a depth value of a unit included in the reference picture, a size of a unit included in the reference picture, and The grouping may be performed based on at least one characteristic of a partition form of a unit included in the reference picture.
- the extracting of the reference motion information comprises: dividing the reference picture into a plurality of areas, and in each of the plurality of areas, a predetermined number of pieces of motion information are based on a frequency of occurrence of motion information.
- the method may further include selecting reference motion information.
- the step of extracting the reference motion information may include selecting a predetermined number of motion information based on a frequency of occurrence of motion information in each of the reference pictures. It may further include.
- step of extracting the reference motion information, for each of the reference picture, the step of deriving a temporal distance from the current picture and based on the derived temporal distance, the selected motion information Scaling may be further included.
- performing the motion compensation comprises: receiving and decoding a motion vector difference with respect to the decoding target unit, deriving a predicted motion vector with respect to the decoding target unit Deriving a motion vector for the decoding target unit using the decoded motion vector difference and the derived predicted motion vector, and using the derived motion vector, motion compensation for the decoding target unit. It may further comprise the step of performing.
- the predicting motion vector deriving step comprises: generating a motion vector candidate for the decoding target unit using the reference motion information and using the motion vector candidate, The method may further include deriving a predicted motion vector.
- the performing of motion compensation comprises: receiving and decoding a merge index, generating a merge candidate list using the reference motion information, and the merge candidate
- the method may further include selecting motion information indicated by the merge index from a merge candidate included in a list, and performing motion compensation on the decoding target unit by using the selected motion information.
- the step of performing motion compensation includes: receiving and decoding an encoded motion information index; among the reference motion information, a motion indicated by the motion information index
- the method may further include selecting information and performing motion compensation on the decoding target unit by using the selected motion information.
- the deriving of the reference motion information may further include receiving encoded reference motion information and decoding the received reference motion information.
- the received reference motion information may be decoded using differential pulse code modulation (DPCM).
- DPCM differential pulse code modulation
- image encoding / decoding efficiency can be improved.
- the image decoding method According to the image decoding method according to the present invention, the image encoding / decoding efficiency can be improved.
- image encoding / decoding efficiency can be improved.
- image encoding / decoding efficiency can be improved.
- FIG. 1 is a block diagram illustrating a configuration of an image encoding apparatus according to an embodiment of the present invention.
- FIG. 2 is a block diagram illustrating a configuration of an image decoding apparatus according to an embodiment of the present invention.
- 3 is a conceptual diagram schematically illustrating an embodiment in which one unit is divided into a plurality of sub-units.
- FIG. 4 is a flowchart schematically illustrating an embodiment of an inter prediction method in an encoder.
- FIG. 5 shows an embodiment of a reference picture division method.
- FIG. 6 illustrates an embodiment of a reference picture used for inter prediction and / or motion compensation for an encoding target unit.
- FIG. 7 is a flowchart schematically illustrating an embodiment of an inter prediction method in a decoder.
- first and second may be used to describe various components, but the components should not be limited by the terms. The terms are used only for the purpose of distinguishing one component from another.
- the first component may be referred to as the second component, and similarly, the second component may also be referred to as the first component.
- each component shown in the embodiments of the present invention are shown independently to represent different characteristic functions, and do not mean that each component is made of separate hardware or one software component unit.
- each component is included in each component for convenience of description, and at least two of the components may be combined into one component, or one component may be divided into a plurality of components to perform a function.
- Integrated and separate embodiments of the components are also included within the scope of the present invention without departing from the spirit of the invention.
- the components may not be essential components for performing essential functions in the present invention, but may be optional components for improving performance.
- the present invention can be implemented including only the components essential for implementing the essentials of the present invention except for the components used for improving performance, and the structure including only the essential components except for the optional components used for improving performance. Also included in the scope of the present invention.
- FIG. 1 is a block diagram illustrating a configuration of an image encoding apparatus according to an embodiment of the present invention.
- the image encoding apparatus 100 may include a motion predictor 111, a motion compensator 112, an intra predictor 120, a switch 115, a subtractor 125, and a converter 130. And a quantization unit 140, an entropy encoding unit 150, an inverse quantization unit 160, an inverse transform unit 170, an adder 175, a filter unit 180, and a reference picture buffer 190.
- the image encoding apparatus 100 may encode an input image in an intra mode or an inter mode and output a bitstream.
- Intra prediction means intra prediction and inter prediction means inter prediction.
- the switch 115 may be switched to intra, and in the inter mode, the switch 115 may be switched to inter.
- the image encoding apparatus 100 may generate a prediction block for an input block of an input image and then encode a residual between the input block and the prediction block.
- the intra predictor 120 may generate a prediction block by performing spatial prediction using pixel values of blocks that are already encoded around the current block.
- the motion predictor 111 may obtain a motion vector by searching for a region that best matches an input block in the reference image stored in the reference picture buffer 190 during the motion prediction process.
- the motion compensator 112 may generate a prediction block by performing motion compensation using the motion vector.
- the motion vector is a two-dimensional vector used for inter prediction, and may indicate an offset between the current encoding / decoding target picture and the reference picture.
- the subtractor 125 may generate a residual block by the difference between the input block and the generated prediction block.
- the transform unit 130 may output a transform coefficient by performing a transform on the residual block.
- the quantization unit 140 may output the quantized coefficient by quantizing the input transform coefficient according to the quantization parameter.
- the entropy encoder 150 may output a bit stream by performing entropy encoding based on the values calculated by the quantizer 140 or the encoding parameter values calculated in the encoding process.
- the entropy encoder 150 may use an encoding method such as exponential golomb, context-adaptive variable length coding (CAVLC), or context-adaptive binary arithmetic coding (CABAC) for entropy encoding.
- CAVLC context-adaptive variable length coding
- CABAC context-adaptive binary arithmetic coding
- the image encoding apparatus Since the image encoding apparatus according to the embodiment of FIG. 1 performs inter prediction encoding, that is, inter prediction encoding, the currently encoded image needs to be decoded and stored to be used as a reference image. Accordingly, the quantized coefficients are inversely quantized by the inverse quantizer 160 and inversely transformed by the inverse transformer 170. The inverse quantized and inverse transformed coefficients are added to the prediction block by the adder 175 and a reconstruction block is generated.
- the reconstruction block passes through the filter unit 180, and the filter unit 180 applies at least one or more of a deblocking filter, a sample adaptive offset (SAO), and an adaptive loop filter (ALF) to the reconstruction block or the reconstruction picture. can do.
- the filter unit 180 may be referred to as an adaptive in-loop filter.
- the deblocking filter can remove block distortion generated at the boundary between blocks.
- SAO can add an appropriate offset to the pixel value to compensate for coding errors.
- the ALF may perform filtering based on a value obtained by comparing the reconstructed image with the original image.
- the reconstructed block that has passed through the filter unit 180 may be stored in the reference picture buffer 190.
- FIG. 2 is a block diagram illustrating a configuration of an image decoding apparatus according to an embodiment of the present invention.
- the image decoding apparatus 200 may include an entropy decoder 210, an inverse quantizer 220, an inverse transformer 230, an intra predictor 240, a motion compensator 250, and an adder ( 255, a filter unit 260, and a reference picture buffer 270.
- the image decoding apparatus 200 may receive a bitstream output from the encoder and perform decoding in an intra mode or an inter mode, and output a reconstructed image, that is, a reconstructed image.
- the switch In the intra mode, the switch may be switched to intra, and in the inter mode, the switch may be switched to inter.
- the image decoding apparatus 200 may obtain a residual block from the input bitstream, generate a prediction block, and then add the residual block and the prediction block to generate a reconstructed block, that is, a reconstruction block.
- the entropy decoder 210 may entropy decode the input bitstream according to a probability distribution to generate symbols including symbols in the form of quantized coefficients.
- the entropy decoding method is similar to the entropy coding method described above.
- the entropy decoding method When the entropy decoding method is applied, a small number of bits are allocated to a symbol having a high probability of occurrence and a large number of bits are allocated to a symbol having a low probability of occurrence, whereby the size of the bit string for each symbol is increased. Can be reduced. Therefore, the compression performance of image decoding can be improved through an entropy decoding method.
- the quantized coefficient is inversely quantized by the inverse quantizer 220 and inversely transformed by the inverse transformer 230, and as a result of the inverse quantization / inverse transformation of the quantized coefficient, a residual block may be generated.
- the intra predictor 240 may generate a prediction block by performing spatial prediction using pixel values of blocks that are already encoded around the current block.
- the motion compensator 250 may generate a predictive block by performing motion compensation using the reference image stored in the motion vector and the reference picture buffer 270.
- the residual block and the prediction block may be added through the adder 255, and the added block may pass through the filter unit 260.
- the filter unit 260 may apply at least one or more of the deblocking filter, SAO, and ALF to the reconstructed block or the reconstructed picture.
- the filter unit 260 may output a reconstructed image, that is, a reconstructed image.
- the reconstructed picture may be stored in the reference picture buffer 270 and used for inter prediction.
- a unit means a unit of image encoding and decoding.
- a coding or decoding unit refers to a divided unit when an image is divided and encoded or decoded.
- a block, a coding unit (CU), an encoding block, and a prediction unit (PU) are used.
- One unit may be further divided into smaller sub-units.
- the prediction unit refers to a basic unit that is a unit of performing prediction and / or motion compensation.
- the prediction unit may be divided into a plurality of partitions, and each partition may be called a prediction unit partition.
- each of the plurality of partitions may be a basic unit that is a unit of performing prediction and / or motion compensation.
- each partition in which the prediction unit is divided may also be called a prediction unit.
- the encoder and the decoder may perform inter prediction and / or motion compensation on the encoding / decoding target unit.
- the encoding / decoding target unit may mean a prediction unit and / or a prediction unit partition.
- the encoder and the decoder may improve the encoding / decoding efficiency by using a motion vector of a reconstructed neighbor unit and / or a collocated unit.
- the reconstructed neighboring unit is a neighboring unit adjacent to the encoding / decoding target unit or located at a corner of the encoding / decoding target unit, and may mean a unit that has already been encoded or decoded.
- the same position unit may mean a unit that exists at the same spatial position as the encoding / decoding target unit in the reconstructed reference picture.
- the motion vector of the unit included in the reference picture is referred to as a temporal motion vector.
- the motion vector of the same location unit may be called a temporal motion vector.
- the encoder and the decoder may use the reconstructed motion vector and / or temporal motion vector as the motion vector of the encoding / decoding target unit.
- the encoder since the motion vector and the temporal motion vector of the reconstructed neighboring unit are used in the encoding / decoding target unit, the encoder may not encode the motion vector for the encoding / decoding target unit. Therefore, the amount of bits transmitted to the decoder can be reduced, and the coding efficiency can be improved.
- the inter prediction mode may include a skip mode and / or a direct mode.
- the encoder may use an identifier and / or an index indicating whether one of the reconstructed neighboring units uses a motion vector.
- the inter prediction mode in which the identifier and / or index is used may be called a merge mode.
- the encoder and the decoder may perform prediction and / or compensation by using the motion vector of the encoding / decoding target unit and then predict the encoding / decoding target unit when encoding the motion vector of the encoding / decoding target unit.
- a predicted motion vector can be used.
- the predicted motion vector may be a motion vector or a temporal motion vector of the reconstructed neighboring unit. That is, the encoder and the decoder can efficiently encode the motion vector of the encoding / decoding target unit by using the reconstructed motion vector or temporal motion vector as the predictive motion vector.
- the encoder may generate a motion vector difference by the difference between the motion vector of the encoding target unit and the predictive motion vector.
- the motion vector difference may mean a difference value between the motion vector of the encoding target unit and the predictive motion vector.
- the encoder may encode the generated motion vector difference and transmit the encoded motion vector difference to the decoder.
- the decoder may decode the motion vector difference and derive the motion vector of the decoding target unit through the sum of the decoded motion vector difference and the predictive motion vector.
- Such an inter prediction method may be referred to as a motion vector prediction (MVP).
- MVP motion vector prediction
- the encoder may use an identifier and / or an index indicating which unit of the reconstructed neighboring unit is used.
- MVP in which the identifier and / or index is additionally used, may be called Advanced Motion Vector Prediction (AMVP).
- AMVP Advanced Motion Vector Prediction
- the motion information in the reference picture may be used for prediction and / or motion compensation of the current encoding / decoding target unit.
- Motion information in the reference picture used for prediction and / or motion compensation of the current encoding / decoding target unit may be referred to as temporal motion information.
- the temporal motion information may include, for example, a temporal motion vector.
- motion information refers to coding parameters used for inter prediction and motion compensation.
- the encoding parameter may include information that may be inferred in the encoding or decoding process as well as information encoded by the encoder and transmitted to the decoder, such as a syntax element, and refers to information required when encoding or decoding an image. do.
- the motion information may include at least one of a reference picture list, a reference picture index, a motion vector, a prediction direction, and a motion vector predictor. have.
- the reference picture list is a list consisting of a plurality of reference pictures used for inter prediction.
- Two reference picture lists may be used for inter prediction, one may be referred to as reference picture list 0 and the other may be referred to as reference picture list 1.
- the prediction direction included in the motion information may be information indicating which reference picture list is used for inter prediction. That is, the prediction direction may indicate whether reference picture list 0 is used, reference picture list 1 is used, or whether both reference picture list 0 and reference picture list 1 are used.
- the reference picture index is an index indicating a reference picture used for inter prediction of the encoding / decoding target unit among the reference pictures included in the reference picture list.
- the motion vector predictor may mean a motion vector of a unit that is a prediction candidate and / or a unit that is a prediction candidate when the encoder and the decoder predict the motion vector.
- the above-described encoding parameter may include not only motion information but also values and / or statistics such as an inter prediction mode, a coded block pattern (CBP), a block size, block partition information, and the like.
- the block division information may include information about a depth of the unit. The depth information may indicate the number and / or degree of division of the unit.
- 3 is a conceptual diagram schematically illustrating an embodiment in which one unit is divided into a plurality of sub-units.
- One unit may be hierarchically divided with depth information based on a tree structure.
- Each divided subunit may have depth information. Since the depth information indicates the number and / or degree of division of the unit, the depth information may include information about the size of the sub-unit.
- the highest node may be called a root node and may have the smallest depth value. At this time, the highest node may have a depth of level 0 and may represent the first unit that is not divided.
- a lower node having a depth of level 1 may indicate a unit in which the first unit is divided once, and a lower node having a depth of level 2 may indicate a unit in which the first unit is divided twice.
- unit 320 corresponding to node a in 320 of FIG. 3 may be a unit divided once in an initial unit and may have a depth of level 1.
- a leaf node of level 3 may indicate a unit in which the first unit is divided three times.
- the unit d corresponding to the node d in 320 of FIG. 3 may be a unit divided three times in the first unit and may have a depth of level 3.
- FIG. the leaf node at level 3, which is the lowest node, may have the deepest depth.
- temporal motion information (for example, temporal motion vector) Can be used.
- the inter prediction method using temporal motion information has a disadvantage in that a reference picture for a picture to be encoded / decoded must be stored in a memory.
- motion information eg, a temporal motion vector
- an error may propagate as the encoding / decoding process proceeds. Therefore, the inter prediction method using temporal motion information may also have disadvantages in terms of error resiliency. Therefore, there is a need for an inter prediction method capable of efficiently encoding / decoding motion information in a reference picture and improving error robustness.
- FIG. 4 is a flowchart schematically illustrating an embodiment of an inter prediction method in an encoder.
- the encoder may extract motion information of a reference picture with respect to the current picture (S410).
- the encoder may extract N pieces of motion information among the motion information included in the reference picture.
- the extracted motion information may be used for inter prediction and / or motion compensation of a coding target unit in a current picture.
- N represents a positive integer, and in the embodiments described below, N means a positive integer.
- the encoder may extract motion information of the reference picture by using at least one of the motion information extraction methods described below.
- the encoder may extract the motion information according to the frequency of occurrence of the motion information in the reference picture. At this time, for example, the encoder may select and extract N pieces of motion information in order of occurrence frequency among the motion information in the reference picture.
- Table 1 below shows an embodiment of a method for extracting motion information according to the frequency of occurrence of motion information.
- the encoder may count the number of motion vector occurrences in the reference picture to obtain count information.
- the encoder may select N motion vectors in order of occurrence frequency.
- the extracted motion vector may be [0,0], [1,0], [0, -1].
- the encoder may obtain count information for each component of the motion vector, not the motion vector itself, in the encoding process.
- each component of the motion vector may mean an x component and a y component.
- the encoder may select N components in order of occurrence frequency for each motion vector component.
- the encoder may extract N motion vectors using the selected motion vector component.
- the encoder may quantize the motion information of the reference picture and then count the frequency of occurrence of the quantized motion information to obtain count information.
- the encoder may quantize a motion vector of 1/4 pixel unit into a motion vector of integer pixel unit and count the frequency of occurrence of the quantized motion vector in the encoding process.
- the encoder may select and / or extract N motion vectors from the quantized motion vectors in order of occurrence frequency.
- the encoder may perform quantization on the motion information according to the quantization step size.
- the information about the quantization step size may be stored in the encoder and the decoder in the same manner.
- the encoder since the decoder may know the quantization step size used in the encoder, the encoder may not transmit information about the quantization step size to the decoder. If the decoder does not have information about the quantization step size, the encoder may encode the information about the quantization step size and transmit it to the decoder through a bitstream. The decoder can decode the transmitted quantization step size information and use it for quantization of the motion information.
- the encoder may extract temporal motion information by performing a predetermined process on the plurality of motion information included in the reference picture.
- the encoder may extract temporal motion information by performing a median operation on the plurality of motion information included in the reference picture. For example, assume that there are three motion vectors [0,0], [-3,5], and [-4,2] in the reference picture. In this case, the encoder may extract one motion vector [-3, 2] by performing an intermediate value operation on each component of the motion vector.
- the encoder may extract N pieces of motion information by performing sub-sampling on a plurality of pieces of motion information included in a reference picture. For example, assume that motion vectors in a reference picture are arranged in a two-dimensional form as shown in Table 2 below.
- the encoder may extract motion vectors existing in odd-numbered rows and odd-numbered columns in a two-dimensional motion vector array through subsampling.
- four motion vectors may be extracted, such as [-2, 4], [0, -1], [5, -1], and [-2, 1].
- the encoder may extract a motion vector corresponding to a specific position in a two-dimensional motion vector array.
- the encoder may extract a motion vector corresponding to a specific position in the two-dimensional motion vector array.
- the motion vectors may be stored in a two-dimensional motion vector array in units of 4 ⁇ 4 blocks.
- the encoder may select a block corresponding to a predetermined position from among 4x4 size blocks included in the 16x16 size block, and extract a motion vector corresponding to the selected block.
- the predetermined position may be, for example, the leftmost upper position in the block of 16 ⁇ 16 size.
- the predetermined position may be a position corresponding to the motion vector of [-2, 4].
- the encoder may extract a motion vector of [-2, 4].
- the encoder may classify and / or group motion information in the reference picture into a plurality of groups based on characteristics of a unit included in the reference picture.
- the characteristics of the unit may include the depth of the unit, the size of the unit, and / or the partition form of the unit.
- the encoder may extract M motion information for each group according to a frequency of occurrence (hereinafter, M is a positive integer) and extract a total of N motion information.
- the encoder may convert motion information in the reference picture into P groups based on the depth value of the unit. Can be classified. In this case, the encoder may obtain count information by counting the number of times the motion vector is generated for each group in the encoding process. That is, the encoder may obtain count information of the motion vector for each depth of the unit. The encoder may extract N motion vectors by selecting M motion vectors in order of occurrence frequency for each group based on the count information.
- N may be M * P.
- the encoder may classify motion information in the reference picture into P groups based on the size of the unit.
- the encoder may obtain count information by counting the number of times the motion vector is generated for each group in the encoding process. That is, the encoder can obtain count information of the motion vector for each unit size.
- the encoder may extract N motion vectors by selecting M motion vectors in order of occurrence frequency for each group based on the count information.
- N may be M * P.
- the encoder may classify motion information in the reference picture into P groups based on the partition type.
- the encoder may obtain count information by counting the number of times the motion vector is generated for each group in the encoding process. That is, the encoder can obtain count information of the motion vector for each partition type.
- the encoder may extract N motion vectors by selecting M motion vectors in order of occurrence frequency for each group based on the count information.
- N may be N * P.
- the partition of the unit may mean a basic unit used for inter prediction and motion compensation, and may have a size of L * K (L and K are positive integers).
- the encoder may split the reference picture into a plurality of regions.
- each of the plurality of regions may be a region divided by a slice, and the plurality of regions may have different motion vector generation distributions.
- the encoder may extract M (hereinafter, M is a positive integer) motion information for each region according to a frequency of occurrence, and extract a total of N temporal motion information.
- FIG. 5 shows an embodiment of a reference picture division method.
- a reference picture may be divided into a first region 510 and a second region 520.
- Table 3 shows an embodiment of a method for extracting motion information according to the frequency of occurrence of motion information when the reference picture is divided into a first area 510 and a second area 520.
- the encoder may obtain count information by counting the number of motion vector generation for each region in the encoding process. That is, the encoder may obtain count information by counting the number of motion vector occurrences for each of the first region 510 and the second region 520. At this time, the encoder may extract N motion vectors by selecting M motion vectors in order of occurrence frequency for each region based on the count information. For example, when the reference picture is divided into two regions and M is 3, the number of extracted motion vectors may be six in total.
- the encoder when a plurality of reference pictures are used for a coding target unit and / or a current picture, the encoder extracts M motion information for each reference picture according to a frequency of occurrence. A total of N temporal motion information can be extracted.
- FIG. 6 illustrates an embodiment of a reference picture used for inter prediction and / or motion compensation for an encoding target unit.
- the current picture 630 may include an encoding target unit.
- the first reference picture 610 and the second reference picture 620 may be used for inter prediction of the encoding target unit.
- Table 4 below shows an embodiment of a method for extracting motion information according to the frequency of occurrence of motion information when a plurality of reference pictures are used.
- the encoder may count the number of motion vector occurrences for each reference picture to obtain count information. That is, the encoder may obtain count information by counting the number of motion vector occurrences for each of the first reference picture 610 and the second reference picture 620. In this case, the encoder may extract M total motion vectors by selecting M motion vectors in order of occurrence frequency, for each reference picture, based on the count information. For example, if two reference pictures are used and M is 3, the number of extracted motion vectors may be six in total.
- the encoder when a plurality of reference pictures are used for the encoding target unit and / or the current picture, the encoder selects M pieces of motion information for each reference picture and then scales the selected motion information. N pieces of temporal motion information may be extracted by scaling.
- the encoder may calculate a temporal distance from the current picture for each reference picture, and perform scaling using the calculated temporal distance.
- the temporal distance may be a distance determined based on the display order when the current picture and the plurality of reference pictures are listed in a display order.
- the encoder may acquire count information by counting the number of motion vector occurrences for each reference picture in the encoding process. That is, the encoder may count the number of motion vector occurrences for each reference picture to obtain count information. In this case, the encoder may select N motion vectors in total in order of occurrence frequency, for each reference picture, based on the count information, and select a total of N motion vectors.
- the encoder may calculate, for each reference picture, a temporal distance from the current picture.
- the encoder may perform scaling on the selected motion vector using the calculated temporal distance, and extract the scaled motion vector as a temporal motion vector for the current picture and / or the current coding unit. In this case, when the plurality of motion vectors are the same among the scaled motion vectors, the encoder may extract only one motion vector from the same motion vector as the temporal motion vector.
- each of the reference pictures is a first reference picture and a second reference picture.
- first temporal distance between the current picture and the first reference picture is 2 and the second temporal distance between the current picture and the second reference picture is 4.
- the encoder may scale the motion vector selected from the first reference picture and / or the motion vector selected from the second reference picture based on the first temporal distance and the second temporal distance. For example, when the motion vectors [0, 4], [2, 4] selected in the second reference picture are scaled, the scaled motion vectors may be [0, 2], [1, 2]. At this time, since [0, 2] of the scaled motion vectors is the same as the motion vector selected from the first reference picture, the temporal motion vectors finally extracted are ⁇ [0, 2], [1, 1], [1, 2] ⁇ .
- the decoder may extract motion information of the reference picture by using the same method as the motion information extraction method used in the encoder.
- the encoder may not transmit the extracted motion information to the decoder.
- the above-described motion information extraction process may be performed only in the encoder, not the decoder.
- the encoder may encode the extracted motion information and transmit the encoded motion information to the decoder.
- the encoder may perform inter prediction and / or motion compensation on the encoding target unit in the current picture by using the extracted temporal motion information (S420).
- the encoder may use the extracted temporal motion information in performing a motion vector prediction (MVP) and / or an advanced motion vector prediction (AMVP) for the encoding target unit.
- MVP motion vector prediction
- AMVP advanced motion vector prediction
- the encoder may use the temporal motion vector extracted from the reference picture as one of motion vector candidates.
- the encoder may perform inter prediction and / or motion compensation using a block matching algorithm, a skip mode, or a direct mode.
- the block matching algorithm may mean an algorithm for determining a reference unit for the encoding target unit among the reconstructed units in the reference picture.
- the encoder may determine the reference unit for the encoding target unit from the reconstructed units in the reference picture by using the motion vector of the encoding target unit. In this case, the encoder may perform inter prediction and / or motion compensation on the encoding target unit by using the determined reference unit.
- the motion vector and temporal motion vector of the reconstructed peripheral unit may be used as the motion vector of the encoding target unit, and the reference picture index of the reconstructed peripheral unit may be used as the reference picture index of the encoding target unit.
- a residual signal for the current encoding target unit may be encoded and transmitted to the decoder. However, since the residual signal may not exist in the skip mode, the encoder may not encode the residual signal.
- the encoder may perform inter prediction and / or motion compensation using a merge mode.
- the encoder may perform inter prediction and / or motion compensation by using at least one of a motion vector and a temporal motion vector of the reconstructed neighboring unit as the motion vector of the encoding target unit.
- the encoder may use the extracted temporal motion vector to derive the motion vector of the encoding target unit.
- the encoder may use the extracted temporal motion vector as one of merge candidates included in a merge candidate list. That is, the encoder may generate a merge candidate list using the extracted temporal motion vector.
- the encoder may encode the merge index and transmit the encoded index to the decoder.
- the merge index may be an index indicating which candidate among merge candidates included in the merge candidate list is used for inter prediction and motion compensation of the encoding target unit.
- the decoder may receive and decode the merge index, and generate a merge candidate list in the same manner as the encoder. In this case, the decoder may derive motion information used for inter prediction and motion compensation of the decoding target unit by using the generated merge candidate list and the decoded merge index.
- the residual signal for the encoding target unit may not exist in the merge mode.
- the encoder may not encode the residual signal, and such an encoding mode may be called a merge skip mode.
- the temporal motion information extracted from the reference picture may be two or more.
- the encoder may select one of the extracted plurality of temporal motion information and use the selected temporal motion information for inter prediction and / or motion compensation for the encoding target unit.
- the encoder may select the optimal temporal motion information by a rate-distortion optimization (RDO) method.
- RDO rate-distortion optimization
- the rate-distortion optimization method may mean a method of selecting an optimal coding scheme in terms of distortion and distortion.
- the encoder may calculate a rate-distortion cost when encoding is performed for each of a plurality of temporal motion vectors.
- the encoder may select one temporal motion vector having a minimum rate-distortion cost value.
- the encoder may use the selected temporal motion vector in performing inter prediction and / or motion compensation.
- the encoder may encode a motion vector index for the selected temporal motion vector, and the encoded motion vector index may be included in a bitstream and transmitted to the decoder.
- the encoder may use temporal motion information extracted from the reference picture to encode motion information of the encoding target unit.
- the temporal motion information extracted from the reference picture may be used as a prediction value for the motion information of the encoding target unit.
- the predicted value for the motion information of the encoding target unit is called predictive motion information
- the predicted value for the motion vector of the encoding target unit is called a predicted motion vector.
- the encoder may use the temporal motion vector extracted from the reference picture as a prediction motion vector for the encoding target unit.
- the encoder may obtain a motion vector difference based on the difference between the motion vector of the encoding target unit and the predicted motion vector.
- the encoder may encode the obtained motion vector difference, and the encoded motion vector difference may be included in a bitstream and transmitted to the decoder. Equation 1 below shows an embodiment of a motion vector difference calculation method.
- motion_vector_difference motion_vector-extracted_motion_vector
- motion_vector_difference may indicate a motion vector difference.
- motion_vector may represent a motion vector of the encoding target unit
- extracted_motion_vector may represent a temporal motion vector extracted from the reference picture.
- the encoder may encode motion information extracted from a reference picture and / or information related to the motion information (S430).
- the encoded information may be included in a sequence parameter set (SPS), a picture parameter set (PPS), or a slice header of the encoding target unit and transmitted to the decoder.
- SPS sequence parameter set
- PPS picture parameter set
- slice header of the encoding target unit
- the encoder may encode a motion information encoding indicator indicating whether motion information extracted from a reference picture is encoded and transmitted to the decoder.
- the motion information encoding indicator may indicate whether the extracted motion information is used for inter prediction and motion compensation.
- the encoded motion information encoding indicator may be included in the bitstream and transmitted to the decoder.
- the motion information encoding indicator may be represented by a syntax element called coded_motion_vector_present_flag.
- the syntax element may be encoded in a picture parameter set or slice header.
- the encoder may encode the extracted temporal motion information and transmit the encoded temporal motion information to the decoder. At this time, the decoder may decode the transmitted temporal motion information. The decoder may perform inter prediction and motion compensation on a decoding target unit by using the decoded temporal motion information. If the value of coded_motion_vector_present_flag is 0, the encoder may not code the extracted temporal motion information. In this case, the decoder may not use the extracted temporal motion information in performing inter prediction and motion compensation.
- the encoder may encode the motion information value extracted from the reference picture.
- the encoded motion information value may be included in the bitstream and transmitted to the decoder.
- the encoder may encode the extracted motion vector values using difference pulse code modulation (DPCM).
- DPCM difference pulse code modulation
- the encoder may perform prediction for each motion vector by using the DPCM.
- the first motion vector may be represented by extracted_motion_vector_1 and the second motion vector may be represented by extracted_motion_vector_2.
- the first motion vector value may be used as a prediction value for the second motion vector value.
- the encoder may obtain a motion vector difference with respect to the second motion vector by using the prediction value (first motion vector value). This can be represented by Equation 2 as an example.
- motion_vector_difference_2 may represent a motion vector difference with respect to the second motion vector.
- the encoder may encode the derived motion vector difference value and transmit the encoded motion vector difference value to the decoder.
- the encoder may encode the extracted motion information value itself without performing prediction on the extracted motion information.
- the encoded motion information value may be included in the bitstream and transmitted to the decoder.
- the number of motion information extracted from the reference picture may be two or more.
- the encoder may select one of the extracted plurality of motion information and use the inter prediction and / or motion compensation for the encoding target unit.
- the encoder may encode a motion information index indicating which motion information of the extracted plurality of motion information is used.
- the encoded motion information index may be included in the bitstream and transmitted to the decoder.
- Table 5 below shows an embodiment of the extracted motion vectors when the number of motion vectors extracted from the reference picture is two or more.
- each motion vector may be assigned a motion information index.
- the encoder may encode and transmit the motion information index value 2 to the decoder.
- the decoder may derive a motion vector used for inter prediction and motion compensation by using the transmitted motion information index.
- the encoder may not encode the motion information index.
- the decoder can extract N (N is a positive integer) temporal motion information in the same manner as the encoder.
- the extracted temporal motion information may be used for inter prediction and motion compensation for a decoding target unit.
- the encoder may not encode the temporal motion information value and / or the motion information index extracted from the reference picture.
- the encoder can efficiently encode temporal motion information during image encoding.
- the encoder may not store the motion information in the reference picture in the memory, the memory requirement and the memory bandwidth are reduced and the error robustness is reduced in inter prediction and motion compensation for the encoding target unit. Can be improved. Therefore, the overall image coding efficiency can be improved.
- FIG. 7 is a flowchart schematically illustrating an embodiment of an inter prediction method in a decoder.
- the decoder may extract motion information of a reference picture with respect to the current picture (S710).
- the decoder may extract N pieces of motion information among the motion information included in the reference picture.
- the extracted motion information may be used for inter prediction and / or motion compensation of a decoding target unit in a current picture.
- N represents a positive integer, and in the embodiments described below, N means a positive integer.
- the decoder may extract motion information of the reference picture by using the same method as the motion information extraction method used in the encoder. In this case, the decoder may extract the same motion information as the temporal motion information extracted by the encoder. Embodiments of the motion information extraction method have been described above with reference to FIG.
- the encoder may encode the motion information value extracted from the reference picture, and the encoded motion information value may be included in the bitstream and transmitted to the decoder.
- the decoder since the decoder may derive motion information of the reference picture from the transmitted bitstream, the decoder may not perform the motion information extraction process.
- the decoder may decode motion information and / or information related to the motion information of the reference picture transmitted from the encoder (S720).
- the decoder may decode the information in a sequence parameter set (SPS), a picture parameter set (PPS), or a slice header.
- SPS sequence parameter set
- PPS picture parameter set
- slice header a slice header
- the decoder may decode a motion information encoding indicator indicating whether temporal motion information extracted from the encoder is encoded and transmitted to the decoder.
- the motion information encoding indicator may indicate whether temporal motion information extracted by the encoder is used for inter prediction and motion compensation.
- the motion information encoding indicator may be represented by a syntax element called coded_motion_vector_present_flag.
- the syntax element may be decoded in a picture parameter set or slice header.
- the encoder may encode temporal motion information extracted from the reference picture and transmit the encoded motion information to the decoder. At this time, the decoder may decode the transmitted temporal motion information. The decoder may perform inter prediction and motion compensation on a decoding target unit by using the decoded temporal motion information.
- the encoder may not code temporal motion information extracted from the reference picture. In this case, the decoder may not decode the extracted temporal motion information, and may not use the extracted temporal motion information in performing inter prediction and motion compensation.
- the encoder may encode the motion information value extracted from the reference picture and transmit the encoded information to the decoder.
- the decoder may receive and decode the transmitted motion information value.
- the encoder predicts the extracted motion vector value using DPCM, and then encodes the difference between the predicted motion vector value and the extracted motion vector value, that is, the motion vector difference. have.
- the encoded motion vector difference may be transmitted to the decoder, and the decoder may derive temporal motion information using the transmitted motion vector difference.
- the encoded motion vectors are called first motion vectors and second motion vectors, respectively.
- the first motion vector may be represented by extracted_motion_vector_1 and the second motion vector may be represented by extracted_motion_vector_2.
- the first motion vector value may be used as a prediction value for the second motion vector value.
- the encoder may transmit the motion vector difference with respect to the second motion vector to the decoder.
- the motion vector difference with respect to the second motion vector may be represented by motion_vector_difference_2.
- the decoder may decode the first motion vector value.
- the decoder may decode a motion vector difference with respect to the second motion vector, and derive a second motion vector value by adding the decoded motion vector difference to the first motion vector value. This can be represented by Equation 3 below as an example.
- the encoder may encode the extracted motion information value itself and transmit the encoded motion information value itself to the decoder without performing prediction on the extracted motion information.
- the decoder may decode the transmitted motion information value itself without performing prediction on the motion information.
- the number of motion information extracted from the reference picture may be two or more.
- the motion information may be extracted by the encoder and transmitted to the decoder or may be extracted by the decoder.
- the decoder may select one of the plurality of motion information and use the inter prediction and / or motion compensation for the decoding target unit.
- the encoder may encode the motion information index and transmit it to the decoder.
- the motion information index is an index indicating which motion information of the extracted plurality of motion information is used.
- the decoder may decode the received motion information index. Since the decoder may have a plurality of motion information identical to the motion information extracted by the encoder, the decoder may select motion information used for inter prediction and motion compensation using the decoded motion information index.
- Table 6 below shows an embodiment of the extracted motion vectors when the number of motion vectors extracted from the reference picture is two or more.
- each motion vector may be assigned a motion information index. For example, if the motion information index value transmitted from the encoder is 2, the decoder may use the motion vector [0, -1] assigned the index value of 2 for inter prediction and motion compensation.
- the encoder may not encode the motion information index.
- the decoder may not decode the motion information index.
- the decoder may extract N pieces of temporal motion information in the same manner as the encoder.
- the extracted temporal motion information may be used for inter prediction and motion compensation for a decoding target unit.
- the encoder may not encode the temporal motion information value and / or the motion information index extracted from the reference picture.
- the decoder may not decode the motion information value and the motion information index. That is, the above-described decoding process such as the motion information value and the motion information index may be omitted.
- the decoder may derive temporal motion information of the reference picture.
- the motion information of the reference picture derived from the decoder may be the same as the temporal motion information extracted from the encoder.
- the motion information of the reference picture derived from the decoder is referred to as reference motion information
- the motion vector of the reference picture derived from the decoder is called a reference motion vector.
- the decoder may perform inter prediction and / or motion compensation on the decoding target unit in the current picture by using the reference motion information derived by the above-described method (S730).
- the decoder may use the reference motion information when performing a motion vector prediction (MVP) and / or an advanced motion vector prediction (AMVP) for a decoding target unit.
- MVP motion vector prediction
- AMVP advanced motion vector prediction
- the decoder may use the reference motion vector as one of motion vector candidates.
- the decoder may perform inter prediction and / or motion compensation using the reference motion vector.
- the motion vector and temporal motion vector of the reconstructed neighboring unit may be used as the motion vector of the decoded unit, and the reference picture index of the reconstructed neighboring unit may be used as the reference picture index of the decoded unit.
- the direct mode a residual signal for the current decoding target unit may be decoded. However, since the residual signal may not exist in the skip mode, the decoder may not decode the residual signal.
- the decoder may perform inter prediction and / or motion compensation using a merge mode.
- the decoder may perform inter prediction and / or motion compensation by using at least one of the reconstructed neighboring unit's motion vector and temporal motion vector as the motion vector of the decoding target unit.
- the decoder may use the reference motion vector to derive the motion vector of the decoding target unit.
- the decoder may use the reference motion vector as one of the merge candidates included in the merge candidate list.
- the decoder may not decode the residual signal, and such an encoding mode may be called a merge skip mode.
- the reference motion information derived from the decoder may be two or more.
- the decoder may select one of the derived plurality of reference motion information and use the selected reference motion information for inter prediction and / or motion compensation for the encoding target unit.
- the decoder may decode the motion information index transmitted from the encoder and select reference motion information used for inter prediction and motion compensation using the decoded motion information index.
- the motion information index may indicate which reference motion information is used among the derived plurality of reference motion information.
- the decoder may use the reference motion information to derive the motion information of the decoding target unit.
- the reference motion information may be used as a prediction value for the motion information of the decoding target unit.
- the decoder may use the reference motion vector as the prediction motion vector for the decoding target unit.
- the predicted motion vector may mean a predicted value for the motion vector of the decoding target unit.
- the encoder may derive the motion vector difference between the motion vector of the encoding target unit and the motion vector extracted from the reference picture and transmit the difference to the decoder.
- the decoder may receive and decode the motion vector difference, and may add a decoded motion vector difference and a reference motion vector to derive a motion vector of the decoding target unit. This can be represented, for example, by the following equation (4).
- motion_vector motion_vector_difference + extracted_motion_vector
- motion_vector may represent a motion vector of the decoding target unit.
- motion_vector_difference may indicate a motion vector difference
- extracted_motion_vector may indicate a reference motion vector.
- the decoder can efficiently decode temporal motion information during image decoding.
- the decoder since the decoder may not store the motion information in the reference picture in the memory, when inter prediction and motion compensation for the decoding target unit, the memory requirement and the memory bandwidth are reduced, and the error robustness is reduced. Can be improved. Therefore, the overall image decoding efficiency can be improved.
- the methods are described based on a flowchart as a series of steps or blocks, but the present invention is not limited to the order of steps, and certain steps may occur in a different order or at the same time than other steps described above. Can be. Also, one of ordinary skill in the art appreciates that the steps shown in the flowcharts are not exclusive, that other steps may be included, or that one or more steps in the flowcharts may be deleted without affecting the scope of the present invention. I can understand.
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
Abstract
Description
Claims (17)
- 현재 픽쳐 내의 복호화 대상 유닛에 대한, 참조 움직임 정보를 도출하는 단계; 및
상기 도출된 참조 움직임 정보를 이용하여 상기 복호화 대상 유닛에 대한 움직임 보상(motion compensation)을 수행하는 단계를 포함하고,
상기 참조 움직임 정보는, 상기 현재 픽쳐에 대한 참조 픽쳐에 포함된 움직임 정보이고, 참조 픽쳐 리스트(reference picture list), 참조 픽쳐 인덱스(reference picture index), 움직임 벡터(motion vector), 예측 방향(prediction direction) 및 움직임 벡터 예측기(motion vector predictor) 중 적어도 하나를 포함하는 인터 예측 방법. - 청구항 1에 있어서, 상기 참조 움직임 정보 도출 단계는,
상기 참조 픽쳐에서 상기 참조 움직임 정보를 추출(extract)하는 단계를 더 포함하는 인터 예측 방법. - 청구항 2에 있어서, 상기 참조 움직임 정보 추출 단계는,
상기 참조 픽쳐 내의 복수의 움직임 정보 각각의 발생 회수를 카운트(count)하여 카운트 정보를 획득하는 단계; 및
상기 획득된 카운트 정보에 기반하여, 상기 참조 픽쳐 내의 복수의 움직임 정보 중에서 상기 참조 움직임 정보를 선택하는 단계를 더 포함하는 인터 예측 방법. - 청구항 2에 있어서, 상기 참조 움직임 정보 추출 단계는,
상기 참조 픽쳐 내의 움직임 정보에 대해 중간값 연산(median operation)을 수행하여 움직임 정보 중간값을 도출하는 단계; 및
상기 움직임 정보 중간값을 상기 참조 움직임 정보로 추출하는 단계를 더 포함하는 인터 예측 방법. - 청구항 2에 있어서, 상기 참조 움직임 정보 추출 단계는,
상기 참조 픽쳐 내의 움직임 정보에 대해 서브 샘플링(sub-sampling)을 수행하는 단계를 더 포함하는 인터 예측 방법. - 청구항 5에 있어서,
상기 서브 샘플링 수행 단계는,
상기 참조 픽쳐 내의 제1 크기의 블록에 포함된 복수의 제2 크기의 블록 중에서 소정의 위치의 블록을 선택하는 단계; 및
상기 선택된 블록에 해당되는 움직임 정보를 상기 참조 움직임 정보로 추출하는 단계를 더 포함하는 인터 예측 방법. - 청구항 6에 있어서,
상기 소정의 위치는 상기 제1 크기의 블록 내의 가장 좌측 상단 위치인 것을 특징으로 하는 인터 예측 방법. - 청구항 2에 있어서, 상기 참조 움직임 정보 추출 단계는,
상기 참조 픽쳐 내의 움직임 정보를 복수의 그룹으로 그룹화(grouping)하는 단계; 및
상기 복수의 그룹 각각에서, 움직임 정보의 발생 빈도를 기반으로, 소정의 개수의 움직임 정보를 상기 참조 움직임 정보로 선택하는 단계를 더 포함하고,
상기 그룹화 단계에서는, 상기 참조 픽쳐에 포함된 유닛의 깊이(depth) 값, 상기 참조 픽쳐에 포함된 유닛의 크기(size) 및 상기 참조 픽쳐에 포함된 유닛의 파티션(partition) 형태 중 적어도 하나의 특성을 기반으로, 상기 그룹화를 수행하는 것을 특징으로 하는 인터 예측 방법. - 청구항 2에 있어서, 상기 참조 움직임 정보 추출 단계는,
상기 참조 픽쳐를 복수의 영역으로 분할하는 단계; 및
상기 복수의 영역 각각에서, 움직임 정보의 발생 빈도를 기반으로, 소정의 개수의 움직임 정보를 상기 참조 움직임 정보로 선택하는 단계를 더 포함하는 인터 예측 방법. - 청구항 2에 있어서,
상기 참조 픽쳐의 개수가 2 이상인 경우,
상기 참조 움직임 정보 추출 단계는,
상기 참조 픽쳐 각각에서, 움직임 정보의 발생 빈도를 기반으로, 소정의 개수의 움직임 정보를 선택하는 단계를 더 포함하는 인터 예측 방법. - 청구항 10에 있어서, 상기 참조 움직임 정보 추출 단계는,
상기 참조 픽쳐 각각에 대해, 상기 현재 픽쳐로부터의 시간적 거리(temporal distance)를 도출하는 단계; 및
상기 도출된 시간적 거리를 기반으로, 상기 선택된 움직임 정보를 스케일링하는 단계를 더 포함하는 인터 예측 방법. - 청구항 1에 있어서,
상기 움직임 보상 수행 단계는,
상기 복호화 대상 유닛에 대한 움직임 벡터 차분(motion information difference)을 수신하여 복호화하는 단계;
상기 복호화 대상 유닛에 대한 예측 움직임 벡터(predicted motion vector)를 도출하는 단계;
상기 복호화된 움직임 벡터 차분 및 상기 도출된 예측 움직임 벡터를 이용하여, 상기 복호화 대상 유닛에 대한 움직임 벡터를 도출하는 단계; 및
상기 도출된 움직임 벡터를 이용하여, 상기 복호화 대상 유닛에 대한 움직임 보상을 수행하는 단계를 더 포함하는 인터 예측 방법. - 청구항 12에 있어서, 상기 예측 움직임 벡터 도출 단계는,
상기 참조 움직임 정보를 이용하여, 상기 복호화 대상 유닛에 대한 움직임 벡터 후보(motion vector candidate)를 생성하는 단계; 및
상기 움직임 벡터 후보를 이용하여, 상기 예측 움직임 벡터를 도출하는 단계를 더 포함하는 인터 예측 방법. - 청구항 1에 있어서,
상기 움직임 보상 수행 단계는,
머지 인덱스(merge index)를 수신하여 복호화하는 단계;
상기 참조 움직임 정보를 이용하여, 머지 후보 리스트(merge candidate list)를 생성하는 단계;
상기 머지 후보 리스트에 포함된 머지 후보 중에서 상기 머지 인덱스가 지시하는 움직임 정보를 선택하는 단계; 및
상기 선택된 움직임 정보를 이용하여, 상기 복호화 대상 유닛에 대한 움직임 보상을 수행하는 단계를 더 포함하는 인터 예측 방법. - 청구항 1에 있어서,
상기 참조 움직임 정보의 개수가 2 이상인 경우,
상기 움직임 보상 수행 단계는,
부호화된 움직임 정보 인덱스를 수신하여 복호화하는 단계;
상기 참조 움직임 정보 중에서, 상기 움직임 정보 인덱스가 지시하는 움직임 정보를 선택하는 단계; 및
상기 선택된 움직임 정보를 이용하여 상기 복호화 대상 유닛에 대한 움직임 보상을 수행하는 단계를 더 포함하는 인터 예측 방법. - 청구항 1에 있어서, 상기 참조 움직임 정보 도출 단계는,
부호화된 참조 움직임 정보를 수신하는 단계; 및
상기 수신된 참조 움직임 정보를 복호화하는 단계를 더 포함하는 인터 예측 방법. - 청구항 16에 있어서,
상기 부호화된 참조 움직임 정보의 개수가 2 이상인 경우,
상기 복호화 단계에서는,
DPCM(Differential Pulse Code Modulation)을 이용하여, 상기 수신된 참조 움직임 정보를 복호화하는 것을 특징으로 하는 인터 예측 방법.
Priority Applications (7)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US13/989,126 US10397599B2 (en) | 2010-12-17 | 2011-12-19 | Method and apparatus for inter prediction using motion vector candidate based on temporal motion prediction |
| JP2013540905A JP2014501091A (ja) | 2010-12-17 | 2011-12-19 | インター予測方法及びその装置 |
| US16/456,629 US10708614B2 (en) | 2010-12-17 | 2019-06-28 | Method and apparatus for inter prediction using motion vector candidate based on temporal motion prediction |
| US16/887,370 US11206424B2 (en) | 2010-12-17 | 2020-05-29 | Method and apparatus for inter prediction using motion vector candidate based on temporal motion prediction |
| US17/525,053 US11743486B2 (en) | 2010-12-17 | 2021-11-12 | Method and apparatus for inter prediction using motion vector candidate based on temporal motion prediction |
| US18/347,483 US12063385B2 (en) | 2010-12-17 | 2023-07-05 | Method and apparatus for inter prediction using motion vector candidate based on temporal motion prediction |
| US18/746,655 US12464157B2 (en) | 2010-12-17 | 2024-06-18 | Method and apparatus for inter prediction using motion vector candidate based on temporal motion prediction |
Applications Claiming Priority (4)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| KR10-2010-0130229 | 2010-12-17 | ||
| KR20100130229 | 2010-12-17 | ||
| KR1020110137042A KR20120068743A (ko) | 2010-12-17 | 2011-12-19 | 인터 예측 방법 및 그 장치 |
| KR10-2011-0137042 | 2011-12-19 |
Related Child Applications (2)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US13/989,126 A-371-Of-International US10397599B2 (en) | 2010-12-17 | 2011-12-19 | Method and apparatus for inter prediction using motion vector candidate based on temporal motion prediction |
| US16/456,629 Continuation US10708614B2 (en) | 2010-12-17 | 2019-06-28 | Method and apparatus for inter prediction using motion vector candidate based on temporal motion prediction |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| WO2012081949A2 true WO2012081949A2 (ko) | 2012-06-21 |
| WO2012081949A3 WO2012081949A3 (ko) | 2012-09-07 |
Family
ID=46245254
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/KR2011/009772 Ceased WO2012081949A2 (ko) | 2010-12-17 | 2011-12-19 | 인터 예측 방법 및 그 장치 |
Country Status (1)
| Country | Link |
|---|---|
| WO (1) | WO2012081949A2 (ko) |
Cited By (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2015167373A (ja) * | 2011-09-09 | 2015-09-24 | ケィティ、コーポレーションKt Corporation | 時間的な予測動きベクトル誘導方法及びこのような方法を用いる装置 |
| CN105308967A (zh) * | 2013-04-05 | 2016-02-03 | 三星电子株式会社 | 根据用于多视点视频的预测结构的视频流编码方法及其装置以及根据用于多视点视频的预测结构的视频流解码方法及其装置 |
| WO2016068685A1 (ko) * | 2014-10-31 | 2016-05-06 | 삼성전자 주식회사 | 고정밀 스킵 부호화를 이용한 비디오 부호화 장치 및 비디오 복호화 장치 및 그 방법 |
| JP2017085611A (ja) * | 2011-09-23 | 2017-05-18 | ケィティ、コーポレーションKt Corporation | マージ候補ブロック誘導方法及びこのような方法を用いる装置 |
| WO2018026222A1 (ko) * | 2016-08-03 | 2018-02-08 | 주식회사 케이티 | 비디오 신호 처리 방법 및 장치 |
| CN111163322A (zh) * | 2020-01-08 | 2020-05-15 | 绍兴文理学院 | 对基于历史运动矢量的索引进行映射的编码及解码方法 |
Family Cites Families (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| KR100557576B1 (ko) * | 2002-12-10 | 2006-03-03 | 엘지전자 주식회사 | 픽쳐 모션벡터 예측방법 |
| KR100584603B1 (ko) * | 2004-08-03 | 2006-05-30 | 학교법인 대양학원 | 다시점 영상의 다이렉트 모드 움직임 예측 방법 및 장치 |
| KR100907174B1 (ko) * | 2008-09-11 | 2009-07-09 | 엘지전자 주식회사 | 모션벡터 결정방법 |
-
2011
- 2011-12-19 WO PCT/KR2011/009772 patent/WO2012081949A2/ko not_active Ceased
Cited By (39)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2015167373A (ja) * | 2011-09-09 | 2015-09-24 | ケィティ、コーポレーションKt Corporation | 時間的な予測動きベクトル誘導方法及びこのような方法を用いる装置 |
| JP2016007040A (ja) * | 2011-09-09 | 2016-01-14 | ケィティ、コーポレーションKt Corporation | 時間的な予測動きベクトル誘導方法及びこのような方法を用いる装置 |
| JP2016007039A (ja) * | 2011-09-09 | 2016-01-14 | ケィティ、コーポレーションKt Corporation | 時間的な予測動きベクトル誘導方法及びこのような方法を用いる装置 |
| US11089333B2 (en) | 2011-09-09 | 2021-08-10 | Kt Corporation | Method for deriving a temporal predictive motion vector, and apparatus using the method |
| US10805639B2 (en) | 2011-09-09 | 2020-10-13 | Kt Corporation | Method for deriving a temporal predictive motion vector, and apparatus using the method |
| US10523967B2 (en) | 2011-09-09 | 2019-12-31 | Kt Corporation | Method for deriving a temporal predictive motion vector, and apparatus using the method |
| JP2017098967A (ja) * | 2011-09-09 | 2017-06-01 | ケィティ、コーポレーションKt Corporation | 時間的な予測動きベクトル誘導方法及びこのような方法を用いる装置 |
| JP2018139422A (ja) * | 2011-09-09 | 2018-09-06 | ケィティ、コーポレーションKt Corporation | 時間的な予測動きベクトル誘導方法及びこのような方法を用いる装置 |
| US9924185B2 (en) | 2011-09-23 | 2018-03-20 | Kt Corporation | Method for inducing a merge candidate block and device using same |
| JP2017085611A (ja) * | 2011-09-23 | 2017-05-18 | ケィティ、コーポレーションKt Corporation | マージ候補ブロック誘導方法及びこのような方法を用いる装置 |
| US9930359B2 (en) | 2011-09-23 | 2018-03-27 | Kt Corporation | Method for inducing a merge candidate block and device using same |
| JP2018088697A (ja) * | 2011-09-23 | 2018-06-07 | ケィティ、コーポレーションKt Corporation | マージ候補ブロック誘導方法及びこのような方法を用いる装置 |
| JP2018088695A (ja) * | 2011-09-23 | 2018-06-07 | ケィティ、コーポレーションKt Corporation | マージ候補ブロック誘導方法及びこのような方法を用いる装置 |
| JP2018088696A (ja) * | 2011-09-23 | 2018-06-07 | ケィティ、コーポレーションKt Corporation | マージ候補ブロック誘導方法及びこのような方法を用いる装置 |
| US10165295B2 (en) | 2011-09-23 | 2018-12-25 | Kt Corporation | Method for inducing a merge candidate block and device using same |
| US10165294B2 (en) | 2011-09-23 | 2018-12-25 | Kt Corporation | Method for inducing a merge candidate block and device using same |
| US10182237B2 (en) | 2011-09-23 | 2019-01-15 | Kt Corporation | Method for inducing a merge candidate block and device using same |
| CN105308967A (zh) * | 2013-04-05 | 2016-02-03 | 三星电子株式会社 | 根据用于多视点视频的预测结构的视频流编码方法及其装置以及根据用于多视点视频的预测结构的视频流解码方法及其装置 |
| CN107113440B (zh) * | 2014-10-31 | 2020-10-13 | 三星电子株式会社 | 一种由视频解码装置执行的视频解码方法 |
| CN112188204B (zh) * | 2014-10-31 | 2024-04-05 | 三星电子株式会社 | 使用高精度跳过编码的视频编码设备和视频解码设备及其方法 |
| US10602179B2 (en) | 2014-10-31 | 2020-03-24 | Samsung Electronics Co., Ltd. | Video encoding device and video decoding device using high-precision skip encoding and method thereof |
| US12495157B2 (en) | 2014-10-31 | 2025-12-09 | Samsung Electronics Co., Ltd. | Video encoding device and video decoding device using high-precision skip encoding and method thereof |
| WO2016068685A1 (ko) * | 2014-10-31 | 2016-05-06 | 삼성전자 주식회사 | 고정밀 스킵 부호화를 이용한 비디오 부호화 장치 및 비디오 복호화 장치 및 그 방법 |
| US12052435B2 (en) | 2014-10-31 | 2024-07-30 | Samsung Electronics Co., Ltd. | Video encoding device and video decoding device using high-precision skip encoding and method thereof |
| CN112188207A (zh) * | 2014-10-31 | 2021-01-05 | 三星电子株式会社 | 使用高精度跳过编码的视频编码设备和视频解码设备及其方法 |
| CN112188204A (zh) * | 2014-10-31 | 2021-01-05 | 三星电子株式会社 | 使用高精度跳过编码的视频编码设备和视频解码设备及其方法 |
| US10992951B2 (en) | 2014-10-31 | 2021-04-27 | Samsung Electronics Co., Ltd. | Video encoding device and video decoding device using high-precision skip encoding and method thereof |
| CN107113440A (zh) * | 2014-10-31 | 2017-08-29 | 三星电子株式会社 | 使用高精度跳过编码的视频编码设备和视频解码设备及其方法 |
| US11228782B2 (en) | 2014-10-31 | 2022-01-18 | Samsung Electronics Co., Ltd. | Video encoding device and video decoding device using high-precision skip encoding and method thereof |
| CN112188207B (zh) * | 2014-10-31 | 2023-10-20 | 三星电子株式会社 | 使用高精度跳过编码的视频编码设备和视频解码设备及其方法 |
| US11523130B2 (en) | 2014-10-31 | 2022-12-06 | Samsung Electronics Co., Ltd. | Video encoding device and video decoding device using high-precision skip encoding and method thereof |
| US11509926B2 (en) | 2014-10-31 | 2022-11-22 | Samsung Electronics Co., Ltd. | Video encoding device and video decoding device using high-precision skip encoding and method thereof |
| US11503305B2 (en) | 2016-08-03 | 2022-11-15 | Kt Corporation | Method and apparatus for processing video signal |
| CN109644267A (zh) * | 2016-08-03 | 2019-04-16 | 株式会社Kt | 视频信号处理方法和装置 |
| WO2018026222A1 (ko) * | 2016-08-03 | 2018-02-08 | 주식회사 케이티 | 비디오 신호 처리 방법 및 장치 |
| US12069279B2 (en) | 2016-08-03 | 2024-08-20 | Kt Corporation | Method and apparatus for processing video signal |
| US12495148B2 (en) | 2016-08-03 | 2025-12-09 | Kt Corporation | Method and apparatus for processing video signal |
| CN111163322B (zh) * | 2020-01-08 | 2022-08-30 | 绍兴文理学院 | 对基于历史运动矢量的索引进行映射的编码及解码方法 |
| CN111163322A (zh) * | 2020-01-08 | 2020-05-15 | 绍兴文理学院 | 对基于历史运动矢量的索引进行映射的编码及解码方法 |
Also Published As
| Publication number | Publication date |
|---|---|
| WO2012081949A3 (ko) | 2012-09-07 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| JP7585535B2 (ja) | 映像復号化方法 | |
| JP7387841B2 (ja) | 映像復号化方法 | |
| WO2012081949A2 (ko) | 인터 예측 방법 및 그 장치 |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 11849462 Country of ref document: EP Kind code of ref document: A2 |
|
| ENP | Entry into the national phase |
Ref document number: 2013540905 Country of ref document: JP Kind code of ref document: A |
|
| WWE | Wipo information: entry into national phase |
Ref document number: 13989126 Country of ref document: US |
|
| NENP | Non-entry into the national phase |
Ref country code: DE |
|
| 122 | Ep: pct application non-entry in european phase |
Ref document number: 11849462 Country of ref document: EP Kind code of ref document: A2 |





