WO2006129280A2 - Video decoder with hybrid reference texture - Google Patents

Video decoder with hybrid reference texture Download PDF

Info

Publication number
WO2006129280A2
WO2006129280A2 PCT/IB2006/051732 IB2006051732W WO2006129280A2 WO 2006129280 A2 WO2006129280 A2 WO 2006129280A2 IB 2006051732 W IB2006051732 W IB 2006051732W WO 2006129280 A2 WO2006129280 A2 WO 2006129280A2
Authority
WO
WIPO (PCT)
Prior art keywords
decoding
hybrid
macroblocks
reference texture
field
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/IB2006/051732
Other languages
French (fr)
Other versions
WO2006129280A3 (en
Inventor
Stéphane VALENTE
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Koninklijke Philips NV
NXP BV
Original Assignee
NXP BV
Koninklijke Philips Electronics NV
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by NXP BV, Koninklijke Philips Electronics NV filed Critical NXP BV
Priority to US11/916,321 priority Critical patent/US8520738B2/en
Priority to JP2008514286A priority patent/JP2008543209A/en
Priority to EP06745050A priority patent/EP1894415A2/en
Priority to CN2006800196612A priority patent/CN101189880B/en
Publication of WO2006129280A2 publication Critical patent/WO2006129280A2/en
Publication of WO2006129280A3 publication Critical patent/WO2006129280A3/en
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/42Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation
    • H04N19/423Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation characterised by memory arrangements
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/134Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
    • H04N19/157Assigned coding mode, i.e. the coding mode being predefined or preselected to be further used for selection of another element or parameter
    • H04N19/16Assigned coding mode, i.e. the coding mode being predefined or preselected to be further used for selection of another element or parameter for a given display mode, e.g. for interlaced or progressive display mode
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/44Decoders specially adapted therefor, e.g. video decoders which are asymmetric with respect to the encoder
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/503Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
    • H04N19/51Motion estimation or motion compensation

Definitions

  • the present invention relates to a video decoder for decoding a bit stream corresponding to pictures of a video signal, the coded pictures being likely to include macroblocks coded in a progressive and in an interlaced way. More particularly, the invention relates to a decoder including a decoding unit for decoding macroblocks coded in a progressive way.
  • the MPEG-4 standard defines a syntax for video bit streams which allows interoperability between various encoders and decoders. Standards describe many video tools, but implementing all of them can result in a too high complexity for most applications. To offer more flexibility in the choice of available tools and encoder/decoder complexity, the standard further defines profiles, which are subsets of the syntax limited to particular tools.
  • the Simple Profile is a subset of the entire bit stream syntax which includes in MPEG terminology: I and P VOPs, AC/DC prediction, 1 or 4 motion vectors per macroblock, unrestricted motion vectors and half pixel motion compensation for progressive pictures.
  • the Advanced Simple Profile is a superset of the SP syntax: it includes the SP coding tools, and adds B VOPs, global motion compensation, interlaced pictures, quarter pixel motion compensation where interpolation filters are different from the ones used in half-pixel motion compensation, and other tools dedicated to the processing of interlaced pictures.
  • the document US RE38,564 E discloses a motion estimation and compensation technique for interlaced digital video such as video object planes (VOPs).
  • VOPs video object planes
  • Predictor motion vectors for use in differentially encoding a current field coded macroblock are obtained using the median of motion vectors of surrounding blocks or macroblocks. When a surrounding macroblock is itself interlaced an average motion vector for that macroblock is used.
  • This document also discloses a decoder that is provided with functions enabling the direct decoding of field coded macroblocks as defined in ASP.
  • interlacing modifies two low- level processes: motion compensation and inverse Direct Cosine Transform (DCT in the following).
  • DCT Direct Cosine Transform
  • a video decoder including a hybrid reference construction unit for constructing, for each reference picture, a hybrid reference texture which has the property of representing said reference texture in a frame-based and in a field-based manner, said hybrid reference texture being used by said decoding unit for decoding interlaced macroblocks.
  • said hybrid reference texture includes the reference texture in frame structure as found in the bit stream and a reference texture in field structure obtained by de-interlacing and extracting the two fields.
  • said hybrid texture is stored in a memory.
  • said hybrid reference texture is extended with repetitive pixel padding, the two fields of the reference texture in field structure being extended independently from each other.
  • said hybrid reference construction unit is activated on a picture basis when a flag, in a corresponding video object layer header, is set to a value indicating that coded pictures in this video object layer are interlaced.
  • the invention also relates to a method for decoding a bit stream corresponding to pictures of a video signal, the coded pictures being likely to include macroblocks coded in a progressive and in an interlaced way, said method including a decoding step for decoding macroblocks coded in a progressive way.
  • Said method is characterized in that it includes a hybrid reference construction step for constructing, for each reference picture, a hybrid reference texture HRT which has the property of representing said reference texture in a frame-based and in a field- based manner.
  • the invention also relates to a computer program product comprising program instructions for implementing, when said program is executed by a processor, a decoding method as disclosed above.
  • the invention finds application in the playback of video standards as MPEG- 4 and DivX streams on mobile phones in which a video encoder as described above is advantageously implemented.
  • Fig.1 illustrates a macroblock structure in frame DCT coding
  • Fig.2 illustrates a macroblock structure in field DCT coding
  • - Fig. 3 represents a video decoder according to the invention
  • - Fig.4 illustrates the construction of a hybrid reference texture according to the invention
  • - Fig.5 gives an example of an advantageous implementation for the construction of a hybrid reference texture.
  • the inverse DCT can be either a frame DCT or a field DCT as specified by a syntax element called dct type included in the bit stream for each macroblock with texture information.
  • dct type flag is set to 0 for a particular macroblock
  • the macroblock is frame coded and the DCT coefficients of luminance data encode 8*8 blocks that are composed of lines from two fields alternatively. This mode is illustrated in figure 1.
  • Two fields TF and BF are respectively represented by hatched part and blank part.
  • Figure 1 illustrates the frame structure of the 8*8 blocks Bl, B2, B3, B4 of an interlaced macroblock MB after frame DCT coding.
  • FIG. 2 illustrates the frame structure of the 8*8 blocks Bl', B2', B3', B4' of an interlaced macroblock MB after field DCT coding.
  • the luminance blocks Bl ', B2', B3' and B4' have then to be inverse permuted back to frame macroblocks. It is here reminded that, generally, even if field DCT is selected for a particular macroblock, the chrominance texture is still coded by frame DCT.
  • the motion compensation can also either be frame-based or field-based for each macroblock. This feature is specified by a syntax element called field_prediction at the macroblock level in P and S-VOPs, for non global motion compensation (GMC) macroblocks. Effectively, it has to be noted that global motion compensation is always frame-based in interlaced pictures.
  • non-GMC motion compensation is performed just like in the non-interlaced case. This can be done either with a single motion vector applied to 16*16 blocks in mode 1-MV, or with 4 motion vectors applied to 8*8 blocks in mode 4-MV. Chrominance motion vectors are always inferred from the luminance ones. If the field_prediction flag is set to 1, non-GMC blocks are predicted with two motion vectors, one for each field, applied to 16*8 blocks of each field. Like in the field DCT case, the predicted blocks have to be permuted back to frame macroblocks after motion compensation.
  • field based predictions may result in 8*4 predictions for chrominance blocks, by displacement of one chroma line out of two, which correspond to one field only in the 4:2:0 interlaced color format.
  • Figure 3 schematically represents a video decoder DEC for decoding a bit stream BS in pictures P of a video signal.
  • the bit stream is likely to include macroblocks coded in a progressive way and in an interlaced way.
  • the decoder DEC includes a decoding unit DEU for decoding macroblocks coded in a progressive way. It is the case for MPEG-4 Simple Profile decoding functions that can only reconstruct frame-based 8*8 inverse DCT and motion compensate 16*16 or 8*8 frame-based blocks for the luminance channel and 8*8 blocks for the chrominance ones. Therefore said decoding unit DEU can only directly handle the decoding in a frame-based structure of macroblock types 1, 3 and 5 in Table 1.
  • Macroblocks of type 2 can be decoded this way by the decoding unit DEU and could then be re-interlaced by a re-interlacing unit RIU to obtain the final interlaced structure of the decoded macroblock as illustrated on the left hand side of figure 2.
  • This technique cannot directly be applied for macroblocks of types 4 and 8. In fact, it would need to add the 8*8 field-based DCT pixels to 8*8 blocks of motion prediction having the same "one field only" structure (as illustrated on the right side of figure 2), but said decoding unit DEU cannot form such a "one field only" motion- prediction because it can only displace 8*8 blocks of pixels in a frame-based manner from a reference texture.
  • the decoder DEC includes a hybrid reference construction unit HRCU for calculating a hybrid reference texture HRT from each reference texture RT.
  • a hybrid reference texture HRT is represented in figure 4
  • the hybrid reference texture HRT has the property of representing a reference texture both in a frame-based manner FR-HRT, with interlaced fields as represented on the left side of figure 4, and in a field-based manner FI-HRT, each field TF and BF being accessible independently from the other one as represented on right side of figure 4.
  • Said hybrid reference texture HRT, constituted of the reference texture in frame structure FR-HRT and of the reference texture in field structure FI-HRT is stored in a memory MEM and is addressable by said decoding unit DEU.
  • said decoding unit DEU can form motion prediction with pixels coming from both fields TF and BF of from one field only, still by displacing 8*8 blocks of pixels in a frame-based manner.
  • said hybrid reference construction unit HRCU de-interlaces the luminance component of the reference texture in a frame-based manner FR-HRT in order to extract its two separate fields TF and BF. Once split, the two fields TF and BF are placed in a reference texture memory MEM, next to the original frame representation FR-HRT and forming said reference texture in field structure FI-HRT.
  • the invention enables to process macroblocks of type 4 and 8. Effectively, using this hybrid reference texture HRT, it becomes possible to reconstruct the luminance prediction of a macroblock either in the frame or in field mode as explained hereinafter.
  • the motion vector will be considered by said decoding unit DEU as pointing on the left hand or on the right hand side of said reference texture HRT.
  • the decoding unit DEU By passing an 8*8 motion vector that points to the left hand side of the reference texture FR-HRT, the decoding unit DEU forms an 8*8 motion prediction that comprises the two fields TF and BF.
  • This mode is suitable for instance for macroblocks of type 3 and 5, where the decoding unit DEU can also directly add an 8*8 frame-based inverse DCT to reconstruct the picture.
  • the decoding unit DEU forms an 8*8 motion prediction from one field only. This last mode is suitable for macroblocks of type 4 and 8.
  • each 16*16 frame-based decoded motion vector is translated into four 8*8 motion vectors pointing to the reference texture in field structure FI-HRT.
  • the motion prediction will thus consist of two 8*8 blocks containing predictions for the odd field and two other 8*8 blocks for the even field. Since the DCT coefficients for macroblocks of type 4 encode pixels coming also from one field only, the decoding unit DEU can directly add the 8*8 inverse DCT of field-based pixels to the 8*8 motion predictions that contain pixels relevant from one odd or even field only, without any field structure mismatch.
  • the top and bottom field-based motion vectors decoded from the bitstream are translated to point to the field representation FI-HRT.
  • the "one field only" motion predictions for the luminance channel can also directly be added to the "one field only” 8*8 inverse DCTs.
  • the same hybrid reference texture HRT can also be used by the decoding unit DEU to form the motion prediction, but, this time, without directly adding the frame-based inverse DCT that simultaneously encodes residual pixels for both fields. Effectively, the motion prediction has to be re- interlaced by the re-interlacing unit RIU to a frame-based structure before applying the residual DCT texture in said decoding unit DEU.
  • each region of the hybrid reference texture HRT is extended with repetitive pixel padding PAD.
  • the frame-based reference texture FR-HRT is extended by 16 pixels in each direction, constituting the padding PAD.
  • the field-based reference texture is extended by 8 pixels in each direction, independently for each field TF and BF of the field-based hybrid reference texture FI-HRT, as illustrated in figure 5.
  • Two repetitive pixel padding TPAD and BPAD are respectively obtained for each of the two fields TF and BF.
  • the padding process is as follows: each sample at the boundary of a region is replicated horizontally to the left and right direction in order to fill the extended area around the field-based or frame-based representation.
  • the remaining unfilled pixels in the extended regions are padded by a similar process, but in the vertical direction.
  • This padding in extended regions is designed to support the "unrestricted motion compensation" MPEG-4 tool, which allows motion vectors to point outside of the reference texture HRT area.
  • motion prediction uses pixels replicated from the closest boundary. Clipping the motion vectors passed in the decoding unit DEU so that they do not point beyond the extended areas of each frame-based or field-based region achieves the correct motion prediction.
  • the invention is particularly interesting for processing of video signals on mobile devices like mobile phones.
  • MPEG-4 or DivX streams can thus be processed by reusing an SP decoding unit to decode ASP streams.
  • Said hardware or software items can be implemented in several manners, such as by means of wired electronic circuits or by means of an integrated circuit that is suitable programmed respectively.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

The present invention relates to a video decoder (DEC) for decoding a bit stream (BS) corresponding to pictures (P) of a video signal, the coded pictures being likely to include macroblocks coded in a progressive and in an interlaced way. The decoder includes a decoding unit (DEU) for decoding macroblocks coded in a progressive way, and a hybrid reference construction unit (HRCU) for constructing, for each reference picture, a hybrid reference texture (HRT) which has the property of representing said reference picture in a frame-based and in a field-based manner. Said hybrid reference texture is used by said decoding unit for decoding interlaced macroblocks.

Description

"VIDEO DECODER WITH HYBRID REFERENCE TEXTURE
FIELD OF THE INVENTION
The present invention relates to a video decoder for decoding a bit stream corresponding to pictures of a video signal, the coded pictures being likely to include macroblocks coded in a progressive and in an interlaced way. More particularly, the invention relates to a decoder including a decoding unit for decoding macroblocks coded in a progressive way.
BACKGROUND OF THE INVENTION
As indicated in "Information Technology - Coding of audio-visual objects - Part 2: Visual, Amendment 1: Visual extensions", ISO/IEC 14496-2: 1999/Amd. 1 :2000, ISO/IEC JTV 1/SC 29/WG 11 N 3056, the MPEG-4 standard defines a syntax for video bit streams which allows interoperability between various encoders and decoders. Standards describe many video tools, but implementing all of them can result in a too high complexity for most applications. To offer more flexibility in the choice of available tools and encoder/decoder complexity, the standard further defines profiles, which are subsets of the syntax limited to particular tools. For instance, the Simple Profile (SP) is a subset of the entire bit stream syntax which includes in MPEG terminology: I and P VOPs, AC/DC prediction, 1 or 4 motion vectors per macroblock, unrestricted motion vectors and half pixel motion compensation for progressive pictures. The Advanced Simple Profile (ASP) is a superset of the SP syntax: it includes the SP coding tools, and adds B VOPs, global motion compensation, interlaced pictures, quarter pixel motion compensation where interpolation filters are different from the ones used in half-pixel motion compensation, and other tools dedicated to the processing of interlaced pictures.
The document US RE38,564 E discloses a motion estimation and compensation technique for interlaced digital video such as video object planes (VOPs). Predictor motion vectors for use in differentially encoding a current field coded macroblock are obtained using the median of motion vectors of surrounding blocks or macroblocks. When a surrounding macroblock is itself interlaced an average motion vector for that macroblock is used. This document also discloses a decoder that is provided with functions enabling the direct decoding of field coded macroblocks as defined in ASP.
Nevertheless, interlacing modifies two low- level processes: motion compensation and inverse Direct Cosine Transform (DCT in the following). In some devices with limited CPU resources or power resources, it can be advantageous to use hardware accelerated functions to carry on some of the decoding operations, even if the hardware acceleration devices are not capable to perform the decoding operations in a conformant way. This results in decoding errors which are penalizing in the case of interlaced macroblocks in interlaced pictures.
SUMMARY OF THE INVENTION
Accordingly, it is an object of the invention to provide a video decoder that uses a decoding unit for decoding progressive pictures and macroblocks and that avoids penalizing errors concerning the decoding of interlaced pictures.
To this end, there is provided a video decoder including a hybrid reference construction unit for constructing, for each reference picture, a hybrid reference texture which has the property of representing said reference texture in a frame-based and in a field-based manner, said hybrid reference texture being used by said decoding unit for decoding interlaced macroblocks.
It is thus provided a pseudo-ASP decoder that relies on a decoding unit able to process progressive pictures and, in the case of MPEG-4, on MPEG-4 SP acceleration functions. In an embodiment, said hybrid reference texture includes the reference texture in frame structure as found in the bit stream and a reference texture in field structure obtained by de-interlacing and extracting the two fields.
Advantageously, said hybrid texture is stored in a memory. In an advantageous implementation, said hybrid reference texture is extended with repetitive pixel padding, the two fields of the reference texture in field structure being extended independently from each other.
In an embodiment, said hybrid reference construction unit is activated on a picture basis when a flag, in a corresponding video object layer header, is set to a value indicating that coded pictures in this video object layer are interlaced.
The invention also relates to a method for decoding a bit stream corresponding to pictures of a video signal, the coded pictures being likely to include macroblocks coded in a progressive and in an interlaced way, said method including a decoding step for decoding macroblocks coded in a progressive way. Said method is characterized in that it includes a hybrid reference construction step for constructing, for each reference picture, a hybrid reference texture HRT which has the property of representing said reference texture in a frame-based and in a field- based manner. The invention also relates to a computer program product comprising program instructions for implementing, when said program is executed by a processor, a decoding method as disclosed above.
The invention finds application in the playback of video standards as MPEG- 4 and DivX streams on mobile phones in which a video encoder as described above is advantageously implemented.
BRIEF DESCRIPTION OF THE DRAWINGS
Additional objects, features and advantages of the invention will become apparent upon reading the following detailed description and upon reference to the accompanying drawings in which:
- Fig.1 illustrates a macroblock structure in frame DCT coding,
- Fig.2 illustrates a macroblock structure in field DCT coding,
- Fig. 3 represents a video decoder according to the invention,
- Fig.4 illustrates the construction of a hybrid reference texture according to the invention, - Fig.5 gives an example of an advantageous implementation for the construction of a hybrid reference texture.
DETAILED DESCRIPTION OF THE INVENTION
In the following description, well-known functions or constructions by the person skilled in the art are not described in detail since they would obscure the invention in unnecessary detail.
When interlaced pictures are used in a coding, the inverse DCT can be either a frame DCT or a field DCT as specified by a syntax element called dct type included in the bit stream for each macroblock with texture information. When the dct type flag is set to 0 for a particular macroblock, the macroblock is frame coded and the DCT coefficients of luminance data encode 8*8 blocks that are composed of lines from two fields alternatively. This mode is illustrated in figure 1. Two fields TF and BF are respectively represented by hatched part and blank part. Figure 1 illustrates the frame structure of the 8*8 blocks Bl, B2, B3, B4 of an interlaced macroblock MB after frame DCT coding.
When the dct type flag is set to 1 for a particular macroblock, the macroblock is field coded and the DCT coefficients of luminance data are formed such that a 8*8 block consists of data from one field only. This mode is illustrated on figure 2. Figure 2 illustrates the frame structure of the 8*8 blocks Bl', B2', B3', B4' of an interlaced macroblock MB after field DCT coding. In classical inverse DCT, the luminance blocks Bl ', B2', B3' and B4' have then to be inverse permuted back to frame macroblocks. It is here reminded that, generally, even if field DCT is selected for a particular macroblock, the chrominance texture is still coded by frame DCT.
The motion compensation can also either be frame-based or field-based for each macroblock. This feature is specified by a syntax element called field_prediction at the macroblock level in P and S-VOPs, for non global motion compensation (GMC) macroblocks. Effectively, it has to be noted that global motion compensation is always frame-based in interlaced pictures.
If the field_prediction flag is set to 0, non-GMC motion compensation is performed just like in the non-interlaced case. This can be done either with a single motion vector applied to 16*16 blocks in mode 1-MV, or with 4 motion vectors applied to 8*8 blocks in mode 4-MV. Chrominance motion vectors are always inferred from the luminance ones. If the field_prediction flag is set to 1, non-GMC blocks are predicted with two motion vectors, one for each field, applied to 16*8 blocks of each field. Like in the field DCT case, the predicted blocks have to be permuted back to frame macroblocks after motion compensation.
Moreover, field based predictions may result in 8*4 predictions for chrominance blocks, by displacement of one chroma line out of two, which correspond to one field only in the 4:2:0 interlaced color format.
During encoding, in non-GMC macroblocks, frame and field DCT and frame and field motion prediction can be applied independently from each other. Table 1 summarizes the different combinations that may arise in I-, P- and S-VOPs (Sprite VOPs) of ASP streams excluding GMC macroblocks.
Table 1
Figure imgf000007_0001
Figure imgf000008_0001
Figure 3 schematically represents a video decoder DEC for decoding a bit stream BS in pictures P of a video signal. The bit stream is likely to include macroblocks coded in a progressive way and in an interlaced way. The decoder DEC includes a decoding unit DEU for decoding macroblocks coded in a progressive way. It is the case for MPEG-4 Simple Profile decoding functions that can only reconstruct frame-based 8*8 inverse DCT and motion compensate 16*16 or 8*8 frame-based blocks for the luminance channel and 8*8 blocks for the chrominance ones. Therefore said decoding unit DEU can only directly handle the decoding in a frame-based structure of macroblock types 1, 3 and 5 in Table 1.
For macroblocks types 2, 4 and 8 (Table 1), some field-based DCT has to be applied on the reconstructed picture. This kind of inverse DCT, when decoded by an MPEG-4 simple profile decoding in a frame-based structure, results in decoded blocks of 8*8 pixels belonging to a single field, just like on the right hand side of figure Ib. The top 8*8 blocks correspond to the top field and the bottom 8*8 blocks to the bottom field.
Macroblocks of type 2 can be decoded this way by the decoding unit DEU and could then be re-interlaced by a re-interlacing unit RIU to obtain the final interlaced structure of the decoded macroblock as illustrated on the left hand side of figure 2.This technique cannot directly be applied for macroblocks of types 4 and 8. In fact, it would need to add the 8*8 field-based DCT pixels to 8*8 blocks of motion prediction having the same "one field only" structure (as illustrated on the right side of figure 2), but said decoding unit DEU cannot form such a "one field only" motion- prediction because it can only displace 8*8 blocks of pixels in a frame-based manner from a reference texture.
Thus for decoding macroblocks of type 4 and 8, the decoder DEC includes a hybrid reference construction unit HRCU for calculating a hybrid reference texture HRT from each reference texture RT. Such a hybrid reference texture HRT is represented in figure 4, the hybrid reference texture HRT has the property of representing a reference texture both in a frame-based manner FR-HRT, with interlaced fields as represented on the left side of figure 4, and in a field-based manner FI-HRT, each field TF and BF being accessible independently from the other one as represented on right side of figure 4. Said hybrid reference texture HRT, constituted of the reference texture in frame structure FR-HRT and of the reference texture in field structure FI-HRT is stored in a memory MEM and is addressable by said decoding unit DEU. Therefore, said decoding unit DEU can form motion prediction with pixels coming from both fields TF and BF of from one field only, still by displacing 8*8 blocks of pixels in a frame-based manner. Practically, said hybrid reference construction unit HRCU de-interlaces the luminance component of the reference texture in a frame-based manner FR-HRT in order to extract its two separate fields TF and BF. Once split, the two fields TF and BF are placed in a reference texture memory MEM, next to the original frame representation FR-HRT and forming said reference texture in field structure FI-HRT.
The invention enables to process macroblocks of type 4 and 8. Effectively, using this hybrid reference texture HRT, it becomes possible to reconstruct the luminance prediction of a macroblock either in the frame or in field mode as explained hereinafter.
Considering the value of the field_prediction flag, the motion vector will be considered by said decoding unit DEU as pointing on the left hand or on the right hand side of said reference texture HRT. By passing an 8*8 motion vector that points to the left hand side of the reference texture FR-HRT, the decoding unit DEU forms an 8*8 motion prediction that comprises the two fields TF and BF. This mode is suitable for instance for macroblocks of type 3 and 5, where the decoding unit DEU can also directly add an 8*8 frame-based inverse DCT to reconstruct the picture. By passing an 8*8 motion vector that points to the right hand side of the reference texture FI-HRT, the decoding unit DEU forms an 8*8 motion prediction from one field only. This last mode is suitable for macroblocks of type 4 and 8.
Effectively, for macroblocks of type 4, each 16*16 frame-based decoded motion vector is translated into four 8*8 motion vectors pointing to the reference texture in field structure FI-HRT. The motion prediction will thus consist of two 8*8 blocks containing predictions for the odd field and two other 8*8 blocks for the even field. Since the DCT coefficients for macroblocks of type 4 encode pixels coming also from one field only, the decoding unit DEU can directly add the 8*8 inverse DCT of field-based pixels to the 8*8 motion predictions that contain pixels relevant from one odd or even field only, without any field structure mismatch.
Similarly, for macroblocks of type 8, the top and bottom field-based motion vectors decoded from the bitstream are translated to point to the field representation FI-HRT. The "one field only" motion predictions for the luminance channel can also directly be added to the "one field only" 8*8 inverse DCTs. After re-interlacing of the decoded macroblocks in both cases in a re-interlacing unit RIU connected to said decoding unit DEU, the correct frame-based reconstruction is obtained.
For macroblocks of type 7, the same hybrid reference texture HRT can also be used by the decoding unit DEU to form the motion prediction, but, this time, without directly adding the frame-based inverse DCT that simultaneously encodes residual pixels for both fields. Effectively, the motion prediction has to be re- interlaced by the re-interlacing unit RIU to a frame-based structure before applying the residual DCT texture in said decoding unit DEU.
In a preferred embodiment, as illustrated in figure 5, each region of the hybrid reference texture HRT is extended with repetitive pixel padding PAD. Thus, the frame-based reference texture FR-HRT is extended by 16 pixels in each direction, constituting the padding PAD. The field-based reference texture is extended by 8 pixels in each direction, independently for each field TF and BF of the field-based hybrid reference texture FI-HRT, as illustrated in figure 5. Two repetitive pixel padding TPAD and BPAD are respectively obtained for each of the two fields TF and BF. The padding process is as follows: each sample at the boundary of a region is replicated horizontally to the left and right direction in order to fill the extended area around the field-based or frame-based representation. Then, the remaining unfilled pixels in the extended regions are padded by a similar process, but in the vertical direction.This padding in extended regions is designed to support the "unrestricted motion compensation" MPEG-4 tool, which allows motion vectors to point outside of the reference texture HRT area. In this case, motion prediction uses pixels replicated from the closest boundary. Clipping the motion vectors passed in the decoding unit DEU so that they do not point beyond the extended areas of each frame-based or field-based region achieves the correct motion prediction.
The invention is particularly interesting for processing of video signals on mobile devices like mobile phones. MPEG-4 or DivX streams can thus be processed by reusing an SP decoding unit to decode ASP streams.
It is to be understood that the present invention is not limited to the aforementioned embodiments and variations and modifications may be made without departing from the spirit and scope of the invention as defined in the appended claims. In the respect, the following closing remarks are made.
There are numerous ways of implementing functions of the method according to the invention by means of items of hardware or software, or both, provided that a single item of hardware or software can carries out several functions. It does not exclude that an assembly of items of hardware or software or both carry out a function, thus forming a single function without modifying the method of processing the drift frequency in accordance with the invention.
Said hardware or software items can be implemented in several manners, such as by means of wired electronic circuits or by means of an integrated circuit that is suitable programmed respectively.
Any reference sign in the following claims should not be construed as limiting the claim. It will be obvious that the use of the verb "to include" and its conjugations do not exclude the presence of any other steps or elements besides those defined in any claim. The article "a" or "an" preceding an element or step does not exclude the presence of a plurality of such elements or steps.

Claims

1. A video decoder (DEC) for decoding a bit stream (BS) corresponding to pictures (P) of a video signal, the coded pictures being likely to include macroblocks coded in a progressive and in an interlaced way, said decoder including a decoding unit (DEU) for decoding macroblocks coded in a progressive way, characterized in that said video decoder includes a hybrid reference construction unit (HRCU) for constructing, for each reference picture, a hybrid reference texture (HRT) which has the property of representing said reference picture in a frame-based and in a field-based manner, said hybrid reference texture being used by said decoding unit for decoding interlaced macroblocks.
2. A video decoder wherein said hybrid reference texture includes the reference texture in frame structure (FR-HRT) as found in the bit stream and a reference texture in field structure (FI-HRT) obtained by de-interlacing and extracting the two fields in the reference picture.
3. A video decoder wherein said hybrid reference texture is stored in a memory.
4. A video decoder as claimed in claim 2, wherein said hybrid reference texture is extended with repetitive pixel padding, the two fields of the reference texture in field structure (FI-HRT) being extended independently from each other.
5. A video decoder as claimed in one of the preceding claims, wherein said hybrid reference construction unit (HRCU) is activated on a picture basis when a flag, decoded or inferred from the bitstream, is set to a value indicating which coded pictures are interlaced.
6. A method of decoding a bit stream in pictures of a video signal, coded pictures being likely to include macroblocks coded in a progressive and in an interlaced way, said method including a decoding step for decoding macroblocks coded in a progressive way, characterized in that said method includes a hybrid reference construction step for constructing, for each reference picture, a hybrid reference texture which has the property of representing said reference picture in a frame- based and in a field-based manner, said hybrid reference texture being used in said decoding step for decoding interlaced macroblocks.
7. A computer program product comprising program instructions for implementing, when said program is executed by a processor, a decoding method as claimed in claim 6.
8. A mobile device including a video decoder as claimed in one of claims 1 to 5.
PCT/IB2006/051732 2005-06-03 2006-05-31 Video decoder with hybrid reference texture Ceased WO2006129280A2 (en)

Priority Applications (4)

Application Number Priority Date Filing Date Title
US11/916,321 US8520738B2 (en) 2005-06-03 2006-05-31 Video decoder with hybrid reference texture
JP2008514286A JP2008543209A (en) 2005-06-03 2006-05-31 Video decoder with hybrid reference texture
EP06745050A EP1894415A2 (en) 2005-06-03 2006-05-31 Video decoder with hybrid reference texture
CN2006800196612A CN101189880B (en) 2005-06-03 2006-05-31 Video decoder with mixed reference textures

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
EP05300453 2005-06-03
EP05300453.7 2005-06-03

Publications (2)

Publication Number Publication Date
WO2006129280A2 true WO2006129280A2 (en) 2006-12-07
WO2006129280A3 WO2006129280A3 (en) 2007-02-15

Family

ID=37119120

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/IB2006/051732 Ceased WO2006129280A2 (en) 2005-06-03 2006-05-31 Video decoder with hybrid reference texture

Country Status (5)

Country Link
US (1) US8520738B2 (en)
EP (1) EP1894415A2 (en)
JP (1) JP2008543209A (en)
CN (1) CN101189880B (en)
WO (1) WO2006129280A2 (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20110032996A1 (en) * 2009-08-04 2011-02-10 Polycom, Inc. Using dual hdvicp coprocessor to accelerate dm6467 h.264 decoder
JP6050583B2 (en) * 2011-12-27 2016-12-21 株式会社メガチップス Image processing apparatus and memory access control method
JP6410495B2 (en) * 2014-07-07 2018-10-24 ルネサスエレクトロニクス株式会社 Image encoding device, image decoding device, and image communication system

Family Cites Families (24)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0484140B1 (en) * 1990-10-31 1996-12-18 Victor Company Of Japan, Ltd. Compression method of interlace moving image signals
US5369449A (en) * 1991-11-08 1994-11-29 Matsushita Electric Industrial Co., Ltd. Method for predicting move compensation
US5227878A (en) * 1991-11-15 1993-07-13 At&T Bell Laboratories Adaptive coding and decoding of frames and fields of video
JP2636622B2 (en) * 1992-03-13 1997-07-30 松下電器産業株式会社 Video signal encoding method and decoding method, and video signal encoding apparatus and decoding apparatus
TW283289B (en) * 1994-04-11 1996-08-11 Gen Instrument Corp
US5767898A (en) * 1994-06-23 1998-06-16 Sanyo Electric Co., Ltd. Three-dimensional image coding by merger of left and right images
JPH08331567A (en) * 1995-06-02 1996-12-13 Sony Corp Image information encoding device, image information decoding device, and image information recording / reproducing system
USRE38564E1 (en) 1997-03-07 2004-08-10 General Instrument Corporation Motion estimation and compensation of video object planes for interlaced digital video
KR100285599B1 (en) * 1998-03-14 2001-04-02 전주범 Device and method for texture padding for motion estimation in alternate line encoding
CN1159923C (en) * 1998-04-28 2004-07-28 株式会社大宇电子 Method and device for image signal self-adaptive coding
US6584156B1 (en) 1998-07-17 2003-06-24 Stream Machine Company LSI Architecture and implementation of MPEG video codec
JP2000350212A (en) * 1999-06-04 2000-12-15 Matsushita Electric Ind Co Ltd Video signal decoding device and video signal display system
US6633344B1 (en) 1999-07-15 2003-10-14 Thomson Licensing S.A. Processing of progressive video signals in digital TV receivers
US6931062B2 (en) * 2001-04-11 2005-08-16 Koninklijke Philips Electronics N.V. Decoding system and method for proper interpolation for motion compensation
US7215708B2 (en) * 2001-05-22 2007-05-08 Koninklijke Philips Electronics N.V. Resolution downscaling of video images
EP1294194B8 (en) 2001-09-10 2010-08-04 Texas Instruments Incorporated Apparatus and method for motion vector estimation
US7092442B2 (en) * 2002-12-19 2006-08-15 Mitsubishi Electric Research Laboratories, Inc. System and method for adaptive field and frame video encoding using motion activity
JP2004328634A (en) * 2003-04-28 2004-11-18 Sony Corp Image decoding apparatus and method
US20050013498A1 (en) * 2003-07-18 2005-01-20 Microsoft Corporation Coding of motion vector information
US7724827B2 (en) * 2003-09-07 2010-05-25 Microsoft Corporation Multi-layer run level encoding and decoding
US8107531B2 (en) * 2003-09-07 2012-01-31 Microsoft Corporation Signaling and repeat padding for skip frames
US8270480B2 (en) * 2004-09-16 2012-09-18 Thomson Licensing Method and apparatus for rapid video and field coding
US7839933B2 (en) * 2004-10-06 2010-11-23 Microsoft Corporation Adaptive vertical macroblock alignment for mixed frame video sequences
JP4764807B2 (en) * 2006-11-30 2011-09-07 富士通セミコンダクター株式会社 Motion vector detection apparatus and motion vector detection method

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
HIROSHI NAKAYAMA ET AL.: "An MPEG-4 Video LSI with an Error-Resilient Codec Core based on a Fast Motion Estimation Algorithm", IEEE ISSCC, vol. 1, 2002
SHIZHONG LIU ET AL.: "Local Bandwidth Constrained Fast Inverse Motion Compensation for DCT-Domain Video Transcoding", IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, vol. 12, no. 5, May 2002 (2002-05-01)
THOMAS SIKORA: "MPEG Digital Video-Coding Standards", IEEE SIGNAL PROCESSINGMAGAZINE, September 1997 (1997-09-01), pages 82 - 100

Also Published As

Publication number Publication date
US20080212684A1 (en) 2008-09-04
CN101189880B (en) 2010-06-16
WO2006129280A3 (en) 2007-02-15
US8520738B2 (en) 2013-08-27
JP2008543209A (en) 2008-11-27
CN101189880A (en) 2008-05-28
EP1894415A2 (en) 2008-03-05

Similar Documents

Publication Publication Date Title
Schafer et al. Digital video coding standards and their role in video communications
US20160227221A1 (en) Method of Residue Differential Pulse-Code Modulation for HEVC Range Extension
US6931062B2 (en) Decoding system and method for proper interpolation for motion compensation
WO2004059980A1 (en) Method and apparatus for encoding and decoding stereoscopic video
KR102905256B1 (en) Encoders and decoders, encoding methods and decoding methods with profile and level-dependent coding options
KR100711597B1 (en) Decoding Apparatus and Method for Deriving Reduced Resolution Video Signals
WO2006111915A1 (en) Efficient video decoding accelerator
US8520738B2 (en) Video decoder with hybrid reference texture
EP1081959A1 (en) Method for recognizing a progressive or an interlaced content in a video sequence
EP1894414B1 (en) Multiple pass video decoding method and device
US8199808B2 (en) Decoding method and decoder with rounding means
CN100551059C (en) device for generating progressive frames from interlaced encoded frames
US8126051B2 (en) Video encoding and decoding methods and corresponding encoding and decoding devices
US20080205524A1 (en) Multiple Instance Video Decoder For Macroblocks Coded in Progressive and an Interlaced Way
JP3798432B2 (en) Method and apparatus for encoding and decoding digital images
WO2004088989A1 (en) Video encoding and decoding methods and corresponding devices
JP2009518907A (en) Method for predicting motion and texture data
JP5148503B2 (en) Method for predicting motion and texture data

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application
WWE Wipo information: entry into national phase

Ref document number: 2006745050

Country of ref document: EP

WWE Wipo information: entry into national phase

Ref document number: 11916321

Country of ref document: US

Ref document number: 2008514286

Country of ref document: JP

Ref document number: 200680019661.2

Country of ref document: CN

NENP Non-entry into the national phase

Ref country code: DE

WWW Wipo information: withdrawn in national office

Ref document number: DE

WWP Wipo information: published in national office

Ref document number: 2006745050

Country of ref document: EP