WO2010004424A2 - Systems, methods, and media for providing selectable video using scalable video coding - Google Patents

Info

Publication number
WO2010004424A2
Authority
WO
WIPO (PCT)
Prior art keywords
content sequence
svc
stream
video
svc stream
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/IB2009/006449
Other languages
French (fr)
Other versions
WO2010004424A3 (en)
Inventor
Sagee Ben-Zedeff
Yair Wiener
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Radvision Ltd
Original Assignee
Radvision Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Radvision Ltd
Priority to EP09794060.5A (patent EP2324640B1)
Priority to JP2011517265A (patent JP5519663B2)
Priority to CN2009801327357A (patent CN102138325B)
Publication of WO2010004424A2
Publication of WO2010004424A3
Anticipated expiration
Legal status: Ceased

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00 Television systems
    • H04N7/14 Systems for two-way working
    • H04N7/15 Conference systems
    • H04N7/152 Multipoint control units therefor
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/30 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/30 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability
    • H04N19/34 Scalability techniques involving progressive bit-plane based encoding of the enhancement layer, e.g. fine granular scalability [FGS]
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20 Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23 Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/234 Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs
    • H04N21/2343 Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements
    • H04N21/234327 Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements by decomposing into layers, e.g. base layer and one or more enhancement layers
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20 Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/25 Management operations performed by the server for facilitating the content distribution or administrating data related to end-users or client devices, e.g. end-user or client device authentication, learning user preferences for recommending movies
    • H04N21/266 Channel or content management, e.g. generation and management of keys and entitlement messages in a conditional access system, merging a VOD unicast channel into a multicast channel
    • H04N21/2662 Controlling the complexity of the video stream, e.g. by scaling the resolution or bitrate of the video stream based on the client capabilities

Definitions

  • the disclosed subject matter relates to systems, methods, and media for providing selectable video using scalable video coding.
  • Digital video systems have become widely used for varying purposes ranging from entertainment to video conferencing. Many digital video systems require providing different video signals to different recipients. This can be a quite complex process.
  • systems, methods, and media for providing selectable video using scalable video coding are provided.
  • systems for providing selectable video using scalable video coding are provided, the systems comprising: a scalable video coding capable encoder that receives a base content sequence and at least one added content sequence that has different content from the base content stream and that produces at least one SVC stream that includes a basic layer, that corresponds to the base content sequence, and at least one enhanced layer, that corresponds to content in the at least one added content sequence; and a digital processing device that controls whether the at least one enhanced layer in the SVC stream is displayed at a destination for the SVC stream.
  • methods for providing selectable video using scalable video coding comprising: receiving a base content sequence and at least one added content sequence that has different content from the base content stream; encoding from the base content sequence and the at least one added content sequence at least one SVC stream that includes a basic layer, that corresponds to the base content sequence, and at least one enhanced layer, that corresponds to content in the at least one added content sequence; and controlling whether the at least one enhanced layer in the SVC stream is displayed at a destination for the SVC stream.
  • computer-readable media containing computer-executable instructions that, when executed by a processor, cause the processor to perform a method for providing selectable video using scalable video coding are provided, the method comprising: receiving a base content sequence and at least one added content sequence that has different content from the base content stream; encoding from the base content sequence and the at least one added content sequence at least one SVC stream that includes a basic layer, that corresponds to the base content sequence, and at least one enhanced layer, that corresponds to content in the at least one added content sequence; and controlling whether the at least one enhanced layer in the SVC stream is displayed at a destination for the SVC stream.
  • FIG. 1 is a diagram of signals provided to and received from an SVC-capable encoder in accordance with some embodiments of the disclosed subject matter.
  • FIG. 2 is a diagram of an SVC-capable encoder in accordance with some embodiments of the disclosed subject matter.
  • FIG. 3 is a diagram of a video distribution system in accordance with some embodiments of the disclosed subject matter.
  • FIG. 4 is a diagram illustrating the combination of basic and enhanced layers in accordance with some embodiments of the disclosed subject matter.
  • FIG. 5 is a diagram of a video conferencing system in accordance with some embodiments of the disclosed subject matter.
  • FIG. 6 is a diagram of different user end point displays in accordance with some embodiments of the disclosed subject matter.
  • two or more video signals can be provided to a scalable video coding (SVC)-capable encoder so that a basic layer and one or more enhanced layers are produced by the encoder.
  • the basic layer can be used to provide base video content and the enhanced layer(s) can be used to modify that base video content with enhanced video content.
  • the enhanced layer(s) can be controlled.
  • a scalable video protocol may include any video compression protocol that allows decoding of different representations of video from data encoded using that protocol.
  • the different representations of video may include different resolutions (spatial scalability), frame rates (temporal scalability), bit rates (SNR scalability), portions of content, and/or any other suitable characteristic.
  • Different representations may be encoded in different subsets of the data, or may be encoded in the same subset of the data, in different embodiments.
  • some scalable video protocols may use layering that provides one or more representations (such as a high resolution image of a user) of a video signal in one layer and one or more other representations (such as a low resolution image of the user) of the video signal in another layer.
  • some scalable video protocols may split up a data stream (e.g., in the form of packets) so that different representations of a video signal are found in different portions of the data stream.
  • Examples of scalable video protocols may include the Scalable Video Coding (SVC) protocol defined by the Scalable Video Coding Extension of the H.264/AVC Standard (Annex G) from the International Telecommunication Union (ITU), the MPEG2 protocol defined by the Motion Picture Experts Group, the H.263 (Annex O) protocol from the ITU, and the MPEG4 part 2 FGS protocol from the Motion Picture Experts Group, each of which is hereby incorporated by reference herein in its entirety.
  • SVC Scalable Video Coding
  • Annex G Scalable Video Coding Extension of the H.264/AVC Standard
  • MPEG2 protocol defined by the Motion Picture Experts Group
  • H.263 (Annex O) protocol from the ITU.
  • MPEG4 part 2 FGS protocol from the Motion Picture Experts Group
  • a base content sequence 102 can be supplied to an SVC-capable encoder 106.
  • One or more added content sequences 1-N 104 can also be supplied to the SVC-capable encoder.
  • the encoder can then provide an SVC stream 108 containing a basic layer 110 and one or more enhanced layers 112.
  • Base content sequence 102 can be any suitable video signal containing any suitable content.
  • base content sequence can be video content that is fully or partially in a low-resolution format. This low-resolution video content may be suitable as a teaser to entice a viewer to purchase a higher resolution version of the content, as a more particular example.
  • base content sequence can be video content that is fully or partially distorted to prevent complete viewing of the video content.
  • base content sequence can be video content that is missing text (such as closed captioning, translations, etc.) or graphics (such as logos, icons, advertisements, etc.) that may be desirable for some viewers.
  • Added content sequence(s) 104 can be any suitable content that provides a desired total content sequence.
  • base content sequence 102 includes low-resolution content
  • added content sequence(s) 104 can be a higher resolution sequence of the same content.
  • base content sequence 102 is video content that is missing desired text or graphics
  • added content sequence(s) 104 can be the video content with the desired text or graphics.
  • the resolution and other parameters of the base content sequence and added content sequence(s) can be identical.
  • in case that added content is restricted to a small part of a display screen (e.g., as in the case of a logo or a caption), it may be beneficial to position the content in the added content sequence so that it is aligned to macroblock (MB) boundaries. This may improve the visual quality of the one or more enhancement layers encoded by the SVC encoder.
  • SVC-capable encoder 106 can be any suitable SVC-capable encoder for providing an SVC stream.
  • SVC-capable encoder 106 can implement a layered approach (similar to Coarse Grained Scalability) in which two layers are defined (basic and enhanced), the spatial resolution factor is set to one, intra prediction is applied only to the basic layer, the quantization error between a low-quality sequence and a higher-quality sequence is encoded using residual coding, and motion data, up-sampling, and/or other trans-coding is not performed.
  • a layered approach similar to Coarse Grained Scalability
  • SVC-capable encoder 106 can be implemented using the Joint Scalable Video Model (JSVM) software from the Scalable Video Coding (SVC) project of the Joint Video Team (JVT) of the ISO/IEC Moving Pictures Experts Group (MPEG) and the ITU-T Video Coding Experts Group (VCEG). Examples of configuration files for configuring the JSVM software are illustrated in the Appendix below. Any other suitable configuration for an SVC-capable encoder can additionally or alternatively be used.
  • JSVM Joint Scalable Video Model
  • SVC Scalable Video Coding
  • JVT Joint Video Team
  • MPEG Moving Pictures Experts Group
  • VCEG ITU-T Video Coding Experts Group
  • SVC-capable encoder 106 can provide SVC stream 108, which can include basic layer 110 and one or more enhanced layers 112.
  • the basic layer when decoded, can provide the signal in base content sequence 102.
  • the one or more enhanced layers 112, when decoded, can provide any suitable content that, when combined with basic layer 110, can be used to provide a desired video content.
  • Decoding of the SVC stream can be performed by any suitable SVC decoder, and the basic layer can be decoded by any suitable AVC decoder in some embodiments.
  • FIG. 1 illustrates a single SVC stream 108 with one basic layer 110 and one or more enhanced layers 112
  • multiple SVC streams 108 can be produced by SVC-capable encoder 106.
  • three SVC streams 108 can be produced wherein each of the streams includes the basic layer and a respective one of the enhanced layers.
  • any one or more of the streams can include more than one enhanced layer in addition to a basic layer.
  • SVC-capable encoder 106 can receive a base content sequence 102 and an added-content sequence 104.
  • the base content sequence 102 can then be processed by motion compensation and intra prediction mechanism 202.
  • This mechanism can perform any suitable SVC motion compensation and intra prediction processes.
  • a residual texture signal 204 (produced by motion compensation and intra prediction mechanism 202) may then be quantized and provided together with the motion signal 206 to entropy coding mechanism 208.
  • Entropy coding mechanism 208 may then perform any suitable entropy coding function and provide the resulting signal to multiplexer 210.
  • Data from motion compensation and intra prediction mechanism 202 can then be used by inter-layer prediction techniques 220, along with added content sequence 104, to drive motion compensation and intra prediction mechanism 212.
  • Any suitable data from motion compensation and intra prediction mechanism 202 can be used.
  • Any suitable SVC inter-layer prediction techniques 220 and any suitable SVC motion compensation and intra prediction processes in mechanism 212 can be used.
  • a residual texture signal 214 (produced by motion compensation and intra prediction mechanism 212) may then be quantized and provided together with the motion signal 216 to entropy coding mechanism 218.
  • Entropy coding mechanism 218 may then perform any suitable entropy coding function and provide the resulting signal to multiplexer 210.
  • Multiplexer 210 can then combine the resulting signals from entropy coding mechanisms 208 and 218 as an SVC compliant stream.
  • Side information can also be provided to encoder 106 in some embodiments.
  • This side information can identify, for example, a region of an image where content corresponding to a difference between the base content sequence and an added content sequence is (e.g., where a logo or text may be located). The side information can then be used in a mode decision step within block 212 to determine whether to process the added content sequence or not.
  • FIG. 3 illustrates an example of a video distribution system 300 in accordance with some embodiments.
  • a distribution controller 306 can receive a base content sequence as video from a base video source 302 and an added content sequence as video from an added video source 304. These sequences can be provided to an SVC-capable encoder 308 that is part of distribution controller 306. The SVC-capable encoder 308 can then produce an SVC stream that includes a basic layer and at least one enhanced layer as described above, and provide this stream to one or more video displays 312, 314, and 316.
  • the distribution controller can also include a controller 310 that provides a control signal to the one or more video displays 312, 314, and 316. This control signal can indicate what enhanced content (if any) a video display is to display.
  • a separate component (e.g., a network component such as a router, gateway, etc.)
  • a controller like controller 310 for example
  • Controller 310 or a similar mechanism in a network component, display, endpoint, etc., may use any suitable software and/or hardware to control which enhancement layers are presented and/or which packets of an SVC stream are concealed.
  • these devices may include a digital processing device that may include one or more of a microprocessor, a processor, a controller, a microcontroller, a programmable logic device, and/or any other suitable hardware and/or software for controlling which enhancement layers are presented and/or which packets of an SVC stream are concealed.
  • a base content sequence 402 and three added content sequences 404, 406, and 408 may be provided to encoder 308.
  • the encoder may then produce basic layer 410 and enhancement layers 412, 414, and 416. These layers may then be formed into three SVC streams: one with layers 410 and 412; another with layers 410 and 414; and yet another with layers 410 and 416.
  • Each of the three SVC streams may be addressed to a different one of video displays 312, 314, and 316 and presented as shown in displays 418, 420, and 422, respectively.
  • FIGS. 5 and 6 illustrate a video conferencing system 500 in accordance with some embodiments.
  • system 500 includes a multipoint conferencing unit (MCU) 502.
  • MCU 502 can include an SVC-capable encoder 504 and a video generator 506.
  • Video generator 506 may generate a continuous presence (CP) layout in any suitable fashion and provide this layout as a base content sequence to SVC-capable encoder 504.
  • the SVC-capable encoder may also receive as added content sequences current speaker video, previous speaker video, and other participant video from current speaker end point 508, previous speaker end point 510, and other participant end points 512, 514, and 516, respectively. SVC streams can then be provided from encoder 504 to current speaker end point 508, previous speaker end point 510, and other participant end points 512, 514, and 516 and be controlled as described below in connection with FIG. 6.
  • As illustrated in FIG. 6, the display on current speaker end point 508 may be controlled so that the user sees a CP layout from the basic layer (which may include graphics 602 and text 604) along with enhanced layers corresponding to the previous speaker and one or more of the other participants, as shown in display 608.
  • the basic layer which may include graphics 602 and text 604
  • the display on previous speaker end point 510 may be controlled so that the user sees a CP layout from the basic layer along with enhanced layers corresponding to the current speaker and one or more of the other participants, as shown in display 610.
  • the display on other participant end points 512, 514, and 516 may be controlled so that the user sees a CP layout from the basic layer along with enhanced layers corresponding to the current speaker and the previous speaker, as shown in display 612. In this way, no user of an endpoint sees video of himself or herself.
  • Although FIG. 5 illustrates different SVC streams going from the SVC-capable encoder to endpoints 508, 510, 512, 514, and 516, in some embodiments these streams may all be identical, and a separate control signal (not shown) for selecting which enhanced layers are presented on each end point may be provided. Additionally or alternatively, the SVC-capable encoder or any other suitable component may select to provide only certain enhanced layers as part of an SVC stream, based on the destination for the stream, using packet concealment or any other suitable technique.
  • JSVM 9.1 encoder in some embodiments is shown below
  • FrameRate 30 # Maximum frame rate [Hz]
  • BaseLayerMode 1 # Base layer mode (0: AVC w large DPB,
  • SearchMode 4 # Search mode (0: BlockSearch,
  • encoder.cfg file
  • JSVM 9.1 encoder file
  • MeQP3 32.00 # QP for motion estimation / mode decision (stage 1)
  • MeQP4 32.00 # QP for motion estimation / mode decision (stage 1)
  • encoder.cfg file that may be used with a JSVM 9.1 encoder in some embodiments is shown below
  • MeQP2 32.00 # QP for motion estimation / mode decision (stage 1)

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Databases & Information Systems (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)

Abstract

Systems, methods, and media for providing selectable video using scalable video coding are provided. In some embodiments, systems for providing selectable video using scalable video coding are provided, the systems comprising: a scalable video coding capable encoder that receives a base content sequence and at least one added content sequence that has different content from the base content stream and that produces at least one SVC stream that includes a basic layer, that corresponds to the base content sequence, and at least one enhanced layer, that corresponds to content in the at least one added content sequence; and a digital processing device that controls whether the at least one enhanced layer in the SVC stream is displayed at a destination for the SVC stream.

Description

SYSTEMS, METHODS, AND MEDIA FOR PROVIDING SELECTABLE VIDEO USING SCALABLE VIDEO CODING
Cross Reference to Related Application
[0001] This application claims the benefit of United States Patent Application No. 12/170,674, filed July 10, 2008, which is hereby incorporated by reference herein in its entirety.
Technical Field
[0002] The disclosed subject matter relates to systems, methods, and media for providing selectable video using scalable video coding.
Background
[0003] Digital video systems have become widely used for varying purposes ranging from entertainment to video conferencing. Many digital video systems require providing different video signals to different recipients. This can be a quite complex process.
[0004] For example, traditionally, when different content is desired to be provided to different recipients, a separate video encoder would need to be provided for each recipient. In this way, the video for that recipient would be encoded for that user by the corresponding encoder. Dedicated encoders for individual users may be prohibitively expensive, however, both in terms of processing power and bandwidth.
[0005] Accordingly, it is desirable to provide mechanisms for controlling video signals.
Summary
[0006] Systems, methods, and media for providing selectable video using scalable video coding are provided. In some embodiments, systems for providing selectable video using scalable video coding are provided, the systems comprising: a scalable video coding capable encoder that receives a base content sequence and at least one added content sequence that has different content from the base content stream and that produces at least one SVC stream that includes a basic layer, that corresponds to the base content sequence, and at least one enhanced layer, that corresponds to content in the at least one added content sequence; and a digital processing device that controls whether the at least one enhanced layer in the SVC stream is displayed at a destination for the SVC stream.
[0007] In some embodiments, methods for providing selectable video using scalable video coding are provided, the methods comprising: receiving a base content sequence and at least one added content sequence that has different content from the base content stream; encoding from the base content sequence and the at least one added content sequence at least one SVC stream that includes a basic layer, that corresponds to the base content sequence, and at least one enhanced layer, that corresponds to content in the at least one added content sequence; and controlling whether the at least one enhanced layer in the SVC stream is displayed at a destination for the SVC stream.
[0008] In some embodiments, computer-readable media containing computer-executable instructions that, when executed by a processor, cause the processor to perform a method for providing selectable video using scalable video coding are provided, the method comprising: receiving a base content sequence and at least one added content sequence that has different content from the base content stream; encoding from the base content sequence and the at least one added content sequence at least one SVC stream that includes a basic layer, that corresponds to the base content sequence, and at least one enhanced layer, that corresponds to content in the at least one added content sequence; and controlling whether the at least one enhanced layer in the SVC stream is displayed at a destination for the SVC stream.
Brief Description of the Drawings
[0009] FIG. 1 is a diagram of signals provided to and received from an SVC-capable encoder in accordance with some embodiments of the disclosed subject matter.
[0010] FIG. 2 is a diagram of an SVC-capable encoder in accordance with some embodiments of the disclosed subject matter.
[0011] FIG. 3 is a diagram of a video distribution system in accordance with some embodiments of the disclosed subject matter.
[0012] FIG. 4 is a diagram illustrating the combination of basic and enhanced layers in accordance with some embodiments of the disclosed subject matter.
[0013] FIG. 5 is a diagram of a video conferencing system in accordance with some embodiments of the disclosed subject matter.
[0014] FIG. 6 is a diagram of different user end point displays in accordance with some embodiments of the disclosed subject matter.
Detailed Description
[0015] Systems, methods, and media for providing selectable video using scalable video coding are provided. In accordance with various embodiments, two or more video signals can be provided to a scalable video coding (SVC)-capable encoder so that a basic layer and one or more enhanced layers are produced by the encoder. The basic layer can be used to provide base video content and the enhanced layer(s) can be used to modify that base video content with enhanced video content. By controlling when the enhanced layer(s) are available (e.g., by concealing corresponding packets), the availability of the enhanced video content to a video display can be controlled.
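As an illustration of the packet concealment mentioned in paragraph [0015], the following is a minimal sketch rather than the disclosed implementation. It assumes each packet carries a layer identifier in which 0 denotes the basic layer and positive integers denote enhanced layers. The `Packet` class, the `layer_id` field, and the `conceal_layers` function are hypothetical names introduced only for this example.

```python
from dataclasses import dataclass
from typing import Iterable, List, Set


@dataclass(frozen=True)
class Packet:
    # Hypothetical packet model: 0 = basic layer, 1..N = enhanced layers.
    layer_id: int
    payload: bytes


def conceal_layers(packets: Iterable[Packet], allowed_enhanced: Set[int]) -> List[Packet]:
    """Keep every basic-layer packet and only the enhanced-layer packets
    that the destination is allowed to display; all others are concealed."""
    return [p for p in packets if p.layer_id == 0 or p.layer_id in allowed_enhanced]


# Example stream: basic layer plus two enhanced layers (e.g., a logo and captions).
stream = [
    Packet(0, b"base-0"),
    Packet(1, b"logo-0"),
    Packet(2, b"captions-0"),
    Packet(0, b"base-1"),
]

# A display entitled only to enhanced layer 2 receives layers 0 and 2.
visible = conceal_layers(stream, allowed_enhanced={2})
print([p.layer_id for p in visible])  # prints [0, 2, 0]
```

Because the basic layer is always retained in this sketch, a destination with an empty `allowed_enhanced` set still receives decodable base content.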
[0016] A scalable video protocol may include any video compression protocol that allows decoding of different representations of video from data encoded using that protocol. The different representations of video may include different resolutions (spatial scalability), frame rates (temporal scalability), bit rates (SNR scalability), portions of content, and/or any other suitable characteristic. Different representations may be encoded in different subsets of the data, or may be encoded in the same subset of the data, in different embodiments. For example, some scalable video protocols may use layering that provides one or more representations (such as a high resolution image of a user) of a video signal in one layer and one or more other representations (such as a low resolution image of the user) of the video signal in another layer. As another example, some scalable video protocols may split up a data stream (e.g., in the form of packets) so that different representations of a video signal are found in different portions of the data stream. Examples of scalable video protocols may include the Scalable Video Coding (SVC) protocol defined by the Scalable Video Coding Extension of the H.264/AVC Standard (Annex G) from the International Telecommunication Union (ITU), the MPEG2 protocol defined by the Motion Picture Experts Group, the H.263 (Annex O) protocol from the ITU, and the MPEG4 part 2 FGS protocol from the Motion Picture Experts Group, each of which is hereby incorporated by reference herein in its entirety.
[0017] Turning to FIG. 1, an illustration of a generalized approach 100 to encoding video in some embodiments is provided. As shown, a base content sequence 102 can be supplied to an SVC-capable encoder 106. One or more added content sequences 1-N 104 can also be supplied to the SVC-capable encoder. In response to receiving these sequences, the encoder can then provide an SVC stream 108 containing a basic layer 110 and one or more enhanced layers 112.
[0018] Base content sequence 102 can be any suitable video signal containing any suitable content. For example, in some embodiments, base content sequence can be video content that is fully or partially in a low-resolution format. This low-resolution video content may be suitable as a teaser to entice a viewer to purchase a higher resolution version of the content, as a more particular example. As another example, in some embodiments, base content sequence can be video content that is fully or partially distorted to prevent complete viewing of the video content. As another example, in some embodiments, base content sequence can be video content that is missing text (such as closed captioning, translations, etc.) or graphics (such as logos, icons, advertisements, etc.) that may be desirable for some viewers.
[0019] Added content sequence(s) 104 can be any suitable content that provides a desired total content sequence. For example, when base content sequence 102 includes low-resolution content, added content sequence(s) 104 can be a higher resolution sequence of the same content. As another example, when base content sequence 102 is video content that is missing desired text or graphics, added content sequence(s) 104 can be the video content with the desired text or graphics.
[0020] In some embodiments, the resolution and other parameters of the base content sequence and added content sequence(s) can be identical. In some embodiments, when added content is restricted to a small part of a display screen (e.g., as in the case of a logo or a caption), it may be beneficial to position the content in the added content sequence so that it is aligned to macroblock (MB) boundaries. This may improve the visual quality of the one or more enhancement layers encoded by the SVC encoder.
[0021] SVC-capable encoder 106 can be any suitable SVC-capable encoder for providing an SVC stream. For example, in some embodiments, SVC-capable encoder 106 can implement a layered approach (similar to Coarse Grained Scalability) in which two layers are defined (basic and enhanced), the spatial resolution factor is set to one, intra prediction is applied only to the basic layer, the quantization error between a low-quality sequence and a higher-quality sequence is encoded using residual coding, and motion data, up-sampling, and/or other trans-coding is not performed. As another example, SVC-capable encoder 106 can be implemented using the Joint Scalable Video Model (JSVM) software from the Scalable Video Coding (SVC) project of the Joint Video Team (JVT) of the ISO/IEC Moving Pictures Experts Group (MPEG) and the ITU-T Video Coding Experts Group (VCEG). Examples of configuration files for configuring the JSVM software are illustrated in the Appendix below. Any other suitable configuration for an SVC-capable encoder can additionally or alternatively be used.
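The macroblock alignment suggested in paragraph [0020] can be sketched as follows. This is only an illustrative calculation, assuming the 16x16 luma macroblock size used by H.264/AVC. The function names `snap_to_macroblock` and `macroblock_aligned_region` are hypothetical, and the source does not specify how the positioning is performed.

```python
MB_SIZE = 16  # H.264/AVC luma macroblocks are 16x16 samples


def snap_to_macroblock(x: int, y: int, mb: int = MB_SIZE) -> tuple:
    """Move the top-left corner of an overlay (e.g., a logo) to the
    nearest macroblock boundary in each dimension."""
    snap = lambda v: ((v + mb // 2) // mb) * mb
    return snap(x), snap(y)


def macroblock_aligned_region(x: int, y: int, width: int, height: int,
                              mb: int = MB_SIZE) -> tuple:
    """Return the smallest macroblock-aligned rectangle (x, y, width, height)
    that fully covers the given overlay rectangle."""
    left = (x // mb) * mb
    top = (y // mb) * mb
    right = -(-(x + width) // mb) * mb    # ceiling to the next MB boundary
    bottom = -(-(y + height) // mb) * mb
    return left, top, right - left, bottom - top


# A 100x30 logo placed at (21, 10) touches macroblocks in the region below.
print(snap_to_macroblock(21, 10))                  # prints (16, 16)
print(macroblock_aligned_region(21, 10, 100, 30))  # prints (16, 0, 112, 48)
```

Snapping the overlay position reduces the number of macroblocks that contain both changed and unchanged samples, which is one plausible reason alignment could help the enhancement layer's visual quality.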
[0022] As mentioned above, SVC-capable encoder 106 can provide SVC stream 108, which can include basic layer 110 and one or more enhanced layers 112. The basic layer, when decoded, can provide the signal in base content sequence 102. The one or more enhanced layers 112, when decoded, can provide any suitable content that, when combined with basic layer 110, can be used to provide a desired video content. Decoding of the SVC stream can be performed by any suitable SVC decoder, and the basic layer can be decoded by any suitable AVC decoder in some embodiments.
[0023] While FIG. 1 illustrates a single SVC stream 108 with one basic layer 110 and one or more enhanced layers 112, in some embodiments multiple SVC streams 108 can be produced by SVC-capable encoder 106. For example, when three enhanced layers 112 are produced, three SVC streams 108 can be produced wherein each of the streams includes the basic layer and a respective one of the enhanced layers. As another example, when multiple SVC streams are produced, any one or more of the streams can include more than one enhanced layer in addition to a basic layer.
[0024] Turning to FIG. 2, a more detailed illustration of an SVC-capable encoder 106 that can be used in some embodiments is provided. As shown, SVC-capable encoder 106 can receive a base content sequence 102 and an added content sequence 104. The base content sequence 102 can then be processed by motion compensation and intra prediction mechanism 202. This mechanism can perform any suitable SVC motion compensation and intra prediction processes. A residual texture signal 204 (produced by motion compensation and intra prediction mechanism 202) may then be quantized and provided together with the motion signal 206 to entropy coding mechanism 208. Entropy coding mechanism 208 may then perform any suitable entropy coding function and provide the resulting signal to multiplexer 210.
[0025] Data from motion compensation and intra prediction mechanism 202 can then be used by inter-layer prediction techniques 220, along with added content sequence 104, to drive motion compensation and intra prediction mechanism 212. Any suitable data from motion compensation and intra prediction mechanism 202 can be used. Any suitable SVC inter-layer prediction techniques 220 and any suitable SVC motion compensation and intra prediction processes in mechanism 212 can be used. A residual texture signal 214 (produced by motion compensation and intra prediction mechanism 212) may then be quantized and provided together with the motion signal 216 to entropy coding mechanism 218. Entropy coding mechanism 218 may then perform any suitable entropy coding function and provide the resulting signal to multiplexer 210. Multiplexer 210 can then combine the resulting signals from entropy coding mechanisms 208 and 218 as an SVC-compliant stream.
[0026] Side information can also be provided to encoder 106 in some embodiments. This side information can identify, for example, a region of an image where content corresponding to a difference between the base content sequence and an added content sequence is located (e.g., where a logo or text may be located). The side information can then be used in a mode decision step within block 212 to determine whether to process the added content sequence or not.
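By way of illustration only, one way the side information of paragraph [0026] might inform a mode decision is sketched below. The sketch assumes that the side information is a single rectangle in pixel coordinates and that the decision is made per 16x16 macroblock; both are assumptions, and the names Region, macroblock_in_region, and enhancement_macroblocks are hypothetical.

```python
from dataclasses import dataclass

MB_SIZE = 16  # assumed macroblock size in luma samples


@dataclass(frozen=True)
class Region:
    """Hypothetical side information: where added content differs from base."""
    x: int
    y: int
    width: int
    height: int


def macroblock_in_region(mb_col, mb_row, region, mb_size=MB_SIZE):
    """Return True when the macroblock overlaps the flagged region."""
    mb_left = mb_col * mb_size
    mb_top = mb_row * mb_size
    overlaps_horizontally = (mb_left < region.x + region.width
                             and mb_left + mb_size > region.x)
    overlaps_vertically = (mb_top < region.y + region.height
                           and mb_top + mb_size > region.y)
    return overlaps_horizontally and overlaps_vertically


def enhancement_macroblocks(frame_width, frame_height, region, mb_size=MB_SIZE):
    """List (col, row) macroblocks for which added content would be processed."""
    cols = frame_width // mb_size
    rows = frame_height // mb_size
    return [(col, row)
            for row in range(rows)
            for col in range(cols)
            if macroblock_in_region(col, row, region, mb_size)]


# Hypothetical logo region in the top-right corner of a 352x288 frame.
selected = enhancement_macroblocks(352, 288, Region(240, 0, 112, 64))
print(len(selected), "of", (352 // 16) * (288 // 16), "macroblocks selected")
```

Macroblocks outside the region could then be skipped in the enhanced layer, which is one possible reading of "whether to process the added content sequence or not."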
[0027] FIG. 3 illustrates an example of a video distribution system 300 in accordance with some embodiments. As shown, a distribution controller 306 can receive a base content sequence as video from a base video source 302 and an added content sequence as video from an added video source 304. These sequences can be provided to an SVC-capable encoder 308 that is part of distribution controller 306. The SVC-capable encoder 308 can then produce an SVC stream that includes a base layer and at least one enhanced layer as described above, and provide this stream to one or more video displays 312, 314, and 316. The distribution controller can also include a controller 310 that provides a control signal to the one or more video displays 312, 314, and 316. This control signal can indicate what enhanced content (if any) a video display is to display. Additionally or alternatively to using a controller 310 that is part of distribution controller 306 and is coupled to displays 312, 314, and 316, in some embodiments, a separate component (e.g., a network component such as a router or gateway) may be provided between encoder 308 and displays 312, 314, and 316 that contains a controller (like controller 310, for example) that determines what portions (e.g., layers) of the SVC stream can pass through to displays 312, 314, and 316.
[0028] Controller 310, or a similar mechanism in a network component, display, endpoint, etc., may use any suitable software and/or hardware to control which enhancement layers are presented and/or which packets of an SVC stream are concealed. For example, these devices may include a digital processing device that may include one or more of a microprocessor, a processor, a controller, a microcontroller, a programmable logic device, and/or any other suitable hardware and/or software for controlling which enhancement layers are presented and/or which packets of an SVC stream are concealed.
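By way of illustration only, the layer selection that a controller or an intermediate network component may perform can be sketched as a filter over packets. The Packet type and its layer_id field are hypothetical simplifications introduced for this sketch; an actual SVC stream identifies layers through fields in its NAL unit headers, which the sketch does not parse.

```python
from dataclasses import dataclass

BASE_LAYER = 0  # assumed identifier for the basic layer


@dataclass(frozen=True)
class Packet:
    """Hypothetical packet carrying data for exactly one layer."""
    layer_id: int
    payload: bytes


def select_packets(packets, allowed_enhanced_layers):
    """Keep basic-layer packets and packets of the allowed enhanced layers.

    Packets of all other enhanced layers are dropped, so they never reach
    the display (one simple reading of "concealing" packets).
    """
    allowed = {BASE_LAYER} | set(allowed_enhanced_layers)
    return [packet for packet in packets if packet.layer_id in allowed]


stream = [
    Packet(0, b"base"), Packet(1, b"logo"), Packet(2, b"caption"),
    Packet(0, b"base"), Packet(1, b"logo"), Packet(2, b"caption"),
]

# A display that is to present only the second enhanced layer.
forwarded = select_packets(stream, {2})
print([packet.layer_id for packet in forwarded])
```

Because the basic layer is always retained, a display that is allowed no enhanced layers still receives a decodable stream.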
[0029] Turning to FIG. 4, an example of how such a distribution system may be used in some embodiments is shown. As illustrated, a base content sequence 402 and three added content sequences 404, 406, and 408 may be provided to encoder 308. The encoder may then produce basic layer 410 and enhancement layers 412, 414, and 416. These layers may then be formed into three SVC streams: one with layers 410 and 412; another with layers 410 and 414; and yet another with layers 410 and 416. Each of the three SVC streams may be addressed to a different one of video displays 312, 314, and 316 and presented as shown in displays 418, 420, and 422, respectively.
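By way of illustration only, the grouping of layers into per-display streams described for FIG. 4 can be sketched as follows. The integers are simply the reference numerals from the figure used as stand-ins for the layers and displays, and the function name build_streams is hypothetical.

```python
def build_streams(basic_layer, enhancement_layers, displays):
    """Pair each display with the basic layer plus one enhancement layer."""
    if len(enhancement_layers) != len(displays):
        raise ValueError("one enhancement layer is expected per display")
    return {
        display: [basic_layer, layer]
        for display, layer in zip(displays, enhancement_layers)
    }


# Reference numerals from FIG. 4 stand in for the actual layers and displays.
streams = build_streams(410, [412, 414, 416], [312, 314, 316])
for display, layers in streams.items():
    print(f"display {display}: layers {layers}")
```

Every stream shares the same basic layer, so the base content needs to be encoded only once.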
[0030] Additionally or alternatively to providing three SVC streams, a single stream may be generated and only selected portions (e.g., packets) utilized at each of video displays 312, 314, and 316. The selection of portions may be performed at the displays or at a component between the encoder and the displays as described above in some embodiments.
[0031] FIGS. 5 and 6 illustrate a video conferencing system 500 in accordance with some embodiments. As shown, system 500 includes a multipoint conferencing unit (MCU) 502. MCU 502 can include an SVC-capable encoder 504 and a video generator 506. Video generator 506 may generate a continuous presence (CP) layout in any suitable fashion and provide this layout as a base content sequence to SVC-capable encoder 504. The SVC-capable encoder may also receive as added content sequences current speaker video, previous speaker video, and other participant video from current speaker end point 508, previous speaker end point 510, and other participant end points 512, 514, and 516, respectively. SVC streams can then be provided from encoder 504 to current speaker end point 508, previous speaker end point 510, and other participant end points 512, 514, and 516 and be controlled as described below in connection with FIG. 6.
[0032] As illustrated in FIG. 6, the display on current speaker end point 508 may be controlled so that the user sees a CP layout from the basic layer (which may include graphics 602 and text 604) along with enhanced layers corresponding to the previous speaker and one or more of the other participants, as shown in display 608. The display on previous speaker end point 510 may be controlled so that the user sees a CP layout from the basic layer along with enhanced layers corresponding to the current speaker and one or more of the other participants, as shown in display 610. The display on other participant end points 512, 514, and 516 may be controlled so that the user sees a CP layout from the basic layer along with enhanced layers corresponding to the current speaker and the previous speaker, as shown in display 612. In this way, no user of an end point sees video of himself or herself.
[0033] Although FIG. 5 illustrates different SVC streams going from the SVC-capable encoder to end points 508, 510, 512, 514, and 516, in some embodiments, these streams may all be identical and a separate control signal (not shown) for selecting which enhanced layers are presented on each end point may be provided. Additionally or alternatively, the SVC-capable encoder or any other suitable component may select to provide only certain enhanced layers as part of an SVC stream based on the destination for the stream using packet concealment or any other suitable technique.
[0034] Although the invention has been described and illustrated in the foregoing illustrative embodiments, it is understood that the present disclosure has been made only by way of example, and that numerous changes in the details of implementation of the invention can be made without departing from the spirit and scope of the invention, which is only limited by the claims which follow. Features of the disclosed embodiments can be combined and rearranged in various ways.
APPENDIX
[0035] An example of an "encoder.cfg" configuration file that may be used with a JSVM 9.1 encoder in some embodiments is shown below.
# Scalable H.264/AVC Extension Configuration File
#============================== GENERAL ==============================
OutputFile               test.264   # Bitstream file
FrameRate                30         # Maximum frame rate [Hz]
MaxDelay                 0          # Maximum structural delay [ms] (required for interactive communication)
FramesToBeEncoded        30         # Number of frames (at input frame rate)
CgsSnrRefinement         1          # (0: SNR layers as CGS, 1: SNR layers as MGS)
EncodeKeyPictures        1          # Key pictures at temp. level 0 [0: FGS only, 1: FGS&MGS, 2: always (useless)]
MGSControl               1          # (0: ME+MC using current layer, 1: ME using EL ref. pics, 2: ME+MC using EL ref. pics)
MGSKeyPicMotRef          1          # Motion refinement for MGS key pics (0: off, 1: one)
#============================== MCTF ==============================
GOPSize                  1          # GOP Size (at maximum frame rate) (no temporal scalability)
IntraPeriod              -1         # Intra Period
NumberReferenceFrames    1          # Number of reference pictures
BaseLayerMode            1          # Base layer mode (0: AVC w large DPB, 1: AVC compatible, 2: AVC w subseq SEI)
#============================== MOTION SEARCH ==============================
SearchMode               4          # Search mode (0: BlockSearch, 4: FastSearch)
SearchFuncFullPel        0          # Search function full pel (0: SAD, 1: SSE, 2: HADAMARD, 3: SAD-YUV)
SearchFuncSubPel         0          # Search function sub pel (0: SAD, 1: SSE, 2: HADAMARD)
SearchRange              16         # Search range (Full Pel)
BiPredIter               2          # Max iterations for bi-pred search
IterSearchRange          2          # Search range for iterations (0: normal)
#============================== LOOP FILTER ==============================
LoopFilterDisable        0          # Loop filter idc (0: on, 1: off, 2: on except for slice boundaries)
LoopFilterAlphaC0Offset  0          # AlphaOffset (-6..+6): valid range
LoopFilterBetaOffset     0          # BetaOffset (-6..+6): valid range
#============================== LAYER DEFINITION ==============================
NumLayers                2          # Number of layers
LayerCfg                 base_content.cfg    # Layer configuration file
LayerCfg                 added_content.cfg   # Layer configuration file
#LayerCfg                ..\..\..\data\layer2.cfg   # Layer configuration file
#LayerCfg                ..\..\..\data\layer3.cfg   # Layer configuration file
#LayerCfg                ..\..\..\data\layer4.cfg   # Layer configuration file
#LayerCfg                layer5.cfg                 # Layer configuration file
#LayerCfg                layer6.cfg                 # Layer configuration file
#LayerCfg                ..\..\..\data\layer7.cfg   # Layer configuration file
PreAndSuffixUnitEnable              # Add prefix and suffix unit (0: off, 1: on); shall always be on in SVC contexts (i.e., when there are FGS/CGS/spatial enhancement layers)
MMCOBaseEnable                      # MMCO for base representation (0: off, 1: on)
TLNestingFlag            0          # Sets the temporal level nesting flag (0: off, 1: on)
TLPicIdxEnable           0          # Add picture index for the lowest temporal level (0: off, 1: on)
#============================== RCDO ==============================
RCDOBlockSizes                      # Restrict block sizes for MC (0: off, 1: in EL, 2: in all layers)
RCDOMotionCompensationY             # Simplified MC for luma (0: off, 1: in EL, 2: in all layers)
RCDOMotionCompensationC             # Simplified MC for chroma (0: off, 1: in EL, 2: in all layers)
RCDODeblocking                      # Simplified deblocking (0: off, 1: in EL, 2: in all layers)
#============================== HRD ==============================
EnableNalHRD
EnableVclHRD
[0036] An example of a "base_content.cfg" configuration file (as referenced in the "encoder.cfg" file) that may be used with a JSVM 9.1 encoder in some embodiments is shown below.
# Layer Configuration File
#============================== INPUT / OUTPUT ==============================
SourceWidth      352                # Input frame width
SourceHeight     288                # Input frame height
FrameRateIn      30                 # Input frame rate [Hz]
FrameRateOut     30                 # Output frame rate [Hz]
InputFile        base_content.yuv   # Input file
ReconFile        rec_layer0.yuv     # Reconstructed file
SymbolMode       0                  # 0=CAVLC, 1=CABAC
#============================== CODING ==============================
ClosedLoop                          # Closed-loop control (0,1: at H rate, 2: at L+H rate)
FRExt            0                  # FREXT mode (0: off, 1: on)
MaxDeltaQP       0                  # Max. absolute delta QP
QP               32.0               # Quantization parameters
NumFGSLayers     0                  # Number of FGS layers (1 layer - ~ delta QP = 6)
FGSMotion                           # Motion refinement in FGS layers (0: off, 1: on)
#============================== CONTROL ==============================
MeQP0            32.00              # QP for motion estimation / mode decision (stage 0)
MeQP1            32.00              # QP for motion estimation / mode decision (stage 1)
MeQP2            32.00              # QP for motion estimation / mode decision (stage 2)
MeQP3            32.00              # QP for motion estimation / mode decision (stage 3)
MeQP4            32.00              # QP for motion estimation / mode decision (stage 4)
MeQP5            32.00              # QP for motion estimation / mode decision (stage 5)
InterLayerPred   0                  # Inter-layer Prediction (0: no, 1: yes, 2: adaptive)
BaseQuality      3                  # Base quality level (0, 1, 2, 3) (0: no, 3: all)
[0037] An example of an "added_content.cfg" configuration file (as referenced in the "encoder.cfg" file) that may be used with a JSVM 9.1 encoder in some embodiments is shown below.
# Layer Configuration File
#============================== INPUT / OUTPUT ==============================
SourceWidth      352                # Input frame width
SourceHeight     288                # Input frame height
FrameRateIn      30                 # Input frame rate [Hz]
FrameRateOut     30                 # Output frame rate [Hz]
InputFile        added_content.yuv  # Input file
ReconFile        rec_layer0.yuv     # Reconstructed file
SymbolMode       0                  # 0=CAVLC, 1=CABAC
#============================== CODING ==============================
ClosedLoop       1                  # Closed-loop control (0,1: at H rate, 2: at L+H rate)
FRExt            0                  # FREXT mode (0: off, 1: on)
MaxDeltaQP       0                  # Max. absolute delta QP
QP               32.0               # Quantization parameters
NumFGSLayers     0                  # Number of FGS layers (1 layer - ~ delta QP = 6)
FGSMotion        0                  # Motion refinement in FGS layers (0: off, 1: on)
#============================== CONTROL ==============================
MeQP0            32.00              # QP for motion estimation / mode decision (stage 0)
MeQP1            32.00              # QP for motion estimation / mode decision (stage 1)
MeQP2            32.00              # QP for motion estimation / mode decision (stage 2)
MeQP3            32.00              # QP for motion estimation / mode decision (stage 3)
MeQP4            32.00              # QP for motion estimation / mode decision (stage 4)
MeQP5            32.00              # QP for motion estimation / mode decision (stage 5)
InterLayerPred   0                  # Inter-layer Prediction (0: no, 1: yes, 2: adaptive)
BaseQuality      3                  # Base quality level (0, 1, 2, 3) (0: no, 3: all)
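By way of illustration only, layer configuration files of the kind shown in this Appendix can be checked before encoding with a short script. The sketch below assumes the simple "Key value # comment" line format seen above, parses two layer configurations, and verifies that both use the same frame size, consistent with a spatial resolution factor of one. The function names and the inline sample values are hypothetical and are not part of the JSVM software.

```python
def parse_layer_config(text):
    """Parse 'Key value  # comment' lines into a dict of string values."""
    settings = {}
    for line in text.splitlines():
        line = line.split("#", 1)[0].strip()  # drop comments and blank lines
        if not line:
            continue
        parts = line.split(None, 1)
        if len(parts) == 2:                   # ignore keys that have no value
            settings[parts[0]] = parts[1].strip()
    return settings


def same_resolution(base_cfg, added_cfg):
    """Return True when both layers declare the same width and height."""
    keys = ("SourceWidth", "SourceHeight")
    return all(
        base_cfg.get(key) is not None and base_cfg.get(key) == added_cfg.get(key)
        for key in keys
    )


# Illustrative sample values only.
BASE_SAMPLE = """
# Layer Configuration File
SourceWidth   352   # Input frame width
SourceHeight  288   # Input frame height
"""
MATCHING_SAMPLE = BASE_SAMPLE
MISMATCHED_SAMPLE = BASE_SAMPLE.replace("288", "240")

base = parse_layer_config(BASE_SAMPLE)
print(same_resolution(base, parse_layer_config(MATCHING_SAMPLE)))
print(same_resolution(base, parse_layer_config(MISMATCHED_SAMPLE)))
```

A check of this kind would flag a layer whose height was mistyped before any encoding time is spent.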

Claims

What is claimed is:
1. A system for providing selectable video using scalable video coding, comprising: a scalable video coding capable encoder that receives a base content sequence and at least one added content sequence that has different content from the base content stream and that produces at least one SVC stream that includes a basic layer, that corresponds to the base content sequence, and at least one enhanced layer, that corresponds to content in the at least one added content sequence; and a digital processing device that controls whether the at least one enhanced layer in the SVC stream is displayed at a destination for the SVC stream.
2. The system of claim 1, further comprising an SVC decoder that receives and decodes the SVC stream.
3. The system of claim 2, wherein the SVC decoder complies with the Scalable Video Coding Extension of the H.264/AVC Standard.
4. The system of claim 1, wherein the base content sequence is a low resolution version of video in at least one of the at least one added content sequence.
5. The system of claim 1, wherein the base content sequence contains distorted video.
6. The system of claim 1, wherein the at least one added content sequence includes text.
7. The system of claim 1, wherein at least one of the at least one added content sequence includes graphics.
8. The system of claim 1, wherein the digital processing device controls whether the at least one enhanced layer in the SVC stream is displayed at the destination for the SVC stream by concealing packets associated with the at least one enhanced layer.
9. The system of claim 1, wherein the digital processing device controls whether the at least one enhanced layer in the SVC stream is displayed at the destination for the SVC stream by providing a control signal to the destination.
10. The system of claim 1, wherein the basic layer and at least one of the at least one enhanced layer are used to form a continuous presence layout for a video conference.
11. A method for providing selectable video using scalable video coding, comprising: receiving a base content sequence and at least one added content sequence that has different content from the base content stream; encoding from the base content sequence and the at least one added content sequence at least one SVC stream that includes a basic layer, that corresponds to the base content sequence, and at least one enhanced layer, that corresponds to content in the at least one added content sequence; and controlling whether the at least one enhanced layer in the SVC stream is displayed at a destination for the SVC stream.
12. The method of claim 11, further comprising receiving and decoding the SVC stream.
13. The method of claim 12, wherein the decoding complies with the Scalable Video Coding Extension of the H.264/AVC Standard.
14. The method of claim 11, wherein the base content sequence is a low resolution version of video in at least one of the at least one added content sequence.
15. The method of claim 11, wherein the base content sequence contains distorted video.
16. The method of claim 11, wherein the at least one added content sequence includes text.
17. The method of claim 11, wherein at least one of the at least one added content sequence includes graphics.
18. The method of claim 11, wherein the controlling controls whether the at least one enhanced layer in the SVC stream is displayed at the destination for the SVC stream by concealing packets associated with the at least one enhanced layer.
19. The method of claim 11, wherein the controlling controls whether the at least one enhanced layer in the SVC stream is displayed at the destination for the SVC stream by providing a control signal to the destination.
20. The method of claim 11, wherein the basic layer and at least one of the at least one enhanced layer are used to form a continuous presence layout for a video conference.
21. A computer-readable medium containing computer-executable instructions that, when executed by a processor, cause the processor to perform a method for providing selectable video using scalable video coding, the method comprising: receiving a base content sequence and at least one added content sequence that has different content from the base content stream; encoding from the base content sequence and the at least one added content sequence at least one SVC stream that includes a basic layer, that corresponds to the base content sequence, and at least one enhanced layer, that corresponds to content in the at least one added content sequence; and controlling whether the at least one enhanced layer in the SVC stream is displayed at a destination for the SVC stream.
22. The medium of claim 21, wherein the method further comprises receiving and decoding the SVC stream.
23. The medium of claim 22, wherein the decoding complies with the Scalable Video Coding Extension of the H.264/AVC Standard.
24. The medium of claim 21, wherein the base content sequence is a low resolution version of video in at least one of the at least one added content sequence.
25. The medium of claim 21, wherein the base content sequence contains distorted video.
26. The medium of claim 21, wherein the at least one added content sequence includes text.
27. The medium of claim 21, wherein at least one of the at least one added content sequence includes graphics.
28. The medium of claim 21, wherein the controlling controls whether the at least one enhanced layer in the SVC stream is displayed at the destination for the SVC stream by concealing packets associated with the at least one enhanced layer.
29. The medium of claim 21, wherein the controlling controls whether the at least one enhanced layer in the SVC stream is displayed at the destination for the SVC stream by providing a control signal to the destination.
30. The medium of claim 21, wherein the basic layer and at least one of the at least one enhanced layer are used to form a continuous presence layout for a video conference.
PCT/IB2009/006449 2008-07-10 2009-07-09 Systems, methods, and media for providing selectable video using scalable video coding Ceased WO2010004424A2 (en)

Priority Applications (3)

Application Number Priority Date Filing Date Title
EP09794060.5A EP2324640B1 (en) 2008-07-10 2009-07-09 Systems, methods, and media for providing selectable video using scalable video coding
JP2011517265A JP5519663B2 (en) 2008-07-10 2009-07-09 System, method and medium for providing selectable video using scalable video coding
CN2009801327357A CN102138325B (en) 2008-07-10 2009-07-09 Systems and methods for providing selectable video using scalable video coding

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US12/170,674 US9532001B2 (en) 2008-07-10 2008-07-10 Systems, methods, and media for providing selectable video using scalable video coding

Publications (2)

Publication Number Publication Date
WO2010004424A2 (en) 2010-01-14
WO2010004424A3 (en) 2010-04-22

Family

ID=41505145

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/IB2009/006449 Ceased WO2010004424A2 (en) 2008-07-10 2009-07-09 Systems, methods, and media for providing selectable video using scalable video coding

Country Status (5)

Country Link
US (1) US9532001B2 (en)
EP (1) EP2324640B1 (en)
JP (1) JP5519663B2 (en)
CN (1) CN102138325B (en)
WO (1) WO2010004424A2 (en)

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070206673A1 (en) 2005-12-08 2007-09-06 Stephen Cipolli Systems and methods for error resilience and random access in video communication systems

Also Published As

Publication number Publication date
EP2324640A4 (en) 2011-08-10
JP2011527546A (en) 2011-10-27
EP2324640B1 (en) 2017-03-22
JP5519663B2 (en) 2014-06-11
WO2010004424A3 (en) 2010-04-22
US9532001B2 (en) 2016-12-27
CN102138325A (en) 2011-07-27
US20100008416A1 (en) 2010-01-14
EP2324640A2 (en) 2011-05-25
CN102138325B (en) 2013-10-16

Legal Events

Date Code Title Description
WWE WIPO information: entry into national phase Ref document number: 200980132735.7; Country of ref document: CN
121 EP: the EPO has been informed by WIPO that EP was designated in this application Ref document number: 09794060; Country of ref document: EP; Kind code of ref document: A2
ENP Entry into the national phase Ref document number: 2011517265; Country of ref document: JP; Kind code of ref document: A
NENP Non-entry into the national phase Ref country code: DE
REEP Request for entry into the European phase Ref document number: 2009794060; Country of ref document: EP
WWE WIPO information: entry into national phase Ref document number: 2009794060; Country of ref document: EP