WO2013055162A2 - Procédé de transmission de données multimédia par l'intermédiaire d'un réseau ip hétérogène indépendamment d'un codec multimédia - Google Patents

Procédé de transmission de données multimédia par l'intermédiaire d'un réseau ip hétérogène indépendamment d'un codec multimédia Download PDF

Info

Publication number
WO2013055162A2
WO2013055162A2 PCT/KR2012/008337 KR2012008337W WO2013055162A2 WO 2013055162 A2 WO2013055162 A2 WO 2013055162A2 KR 2012008337 W KR2012008337 W KR 2012008337W WO 2013055162 A2 WO2013055162 A2 WO 2013055162A2
Authority
WO
WIPO (PCT)
Prior art keywords
media
mmt
data
unit
encoded
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/KR2012/008337
Other languages
English (en)
Korean (ko)
Other versions
WO2013055162A3 (fr
Inventor
배성준
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Electronics and Telecommunications Research Institute ETRI
Original Assignee
Electronics and Telecommunications Research Institute ETRI
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from KR1020120112544A external-priority patent/KR20130040132A/ko
Application filed by Electronics and Telecommunications Research Institute ETRI filed Critical Electronics and Telecommunications Research Institute ETRI
Priority to US14/351,776 priority Critical patent/US20140282799A1/en
Publication of WO2013055162A2 publication Critical patent/WO2013055162A2/fr
Publication of WO2013055162A3 publication Critical patent/WO2013055162A3/fr
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/238Interfacing the downstream path of the transmission network, e.g. adapting the transmission rate of a video stream to network bandwidth; Processing of multiplex streams
    • H04N21/2381Adapting the multiplex stream to a specific network, e.g. an Internet Protocol [IP] network
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/236Assembling of a multiplex stream, e.g. transport stream, by combining a video stream with other content or additional data, e.g. inserting a URL [Uniform Resource Locator] into a video stream, multiplexing software data into a video stream; Remultiplexing of multiplex streams; Insertion of stuffing bits into the multiplex stream, e.g. to obtain a constant bit-rate; Assembling of a packetised elementary stream
    • H04N21/23605Creation or processing of packetized elementary streams [PES]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/60Network structure or processes for video distribution between server and client or between remote clients; Control signalling between clients, server and network components; Transmission of management data between server and client, e.g. sending from server to client commands for recording incoming content stream; Communication details between server and client 
    • H04N21/61Network physical structure; Signal processing
    • H04N21/6106Network physical structure; Signal processing specially adapted to the downstream path of the transmission network
    • H04N21/6125Network physical structure; Signal processing specially adapted to the downstream path of the transmission network involving transmission via Internet
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/60Network structure or processes for video distribution between server and client or between remote clients; Control signalling between clients, server and network components; Transmission of management data between server and client, e.g. sending from server to client commands for recording incoming content stream; Communication details between server and client 
    • H04N21/63Control signaling related to video distribution between client, server and network components; Network processes for video distribution between server and clients or between remote clients, e.g. transmitting basic layer and enhancement layers over different transmission paths, setting up a peer-to-peer communication via Internet between remote STB's; Communication protocols; Addressing
    • H04N21/643Communication protocols
    • H04N21/64322IP

Definitions

  • the present invention relates to a method of transmitting media data, and more particularly, to a method of transmitting encoded media data in a system for transmitting encoded media data through a heterogeneous IP network.
  • the MPEG-2 system has standardized MPEG-2 Transport Stream (TS) technology as a standard for packetization, synchronization, and multiplexing for transmitting AV (Audio Video) content in a broadcasting network.
  • TS MPEG-2 Transport Stream
  • MPEG-2 TS is inefficient in a new environment in which the network is All IP (Internet Protocol).
  • a new media transmission technology is required in a system for transmitting encoded media data through a heterogeneous IP network in consideration of the new media transmission environment and the expected media transmission environment.
  • a media unit that is a part of a media encoder that is accessed and processed rather than a part that the system processes.
  • the header of the media unit has a complicated structure, which makes it difficult to implement and reduces the availability.
  • the present invention provides a method for transmitting encoded media data in a system for transmitting encoded media fragment data through a heterogeneous IP network for transmitting encoded media data.
  • MU media unit
  • coded media fragment data includes a random access point (random access point), and coded media data in which the coded media fragment data is encoded.
  • an indicator indicating whether the encoded media fragment data includes the last part of the encoded media data.
  • the header unit may further include an indicator indicating whether the encoded media fragment data requires lossless transmission.
  • the step of receiving a media unit (MU) having a format independent of a specific media codec from the media codec layer may include a media processing unit having a format independent of a specific media codec from the media codec layer.
  • MPU media processing unit having a format independent of a specific media codec from the media codec layer.
  • Encapsulating the generated media unit to generate an MMT asset may include generating an MMT asset by encapsulating the generated media processing unit.
  • the step of receiving a Media Processing Unit (MPU) having a format independent of a specific media codec from the media codec layer may include a media fragment unit having a format independent of a specific media codec from the media codec layer.
  • the method may include receiving a Fragment Unit (MFU) and generating a Media Processing Unit (MPU) by encapsulating the media fragment unit.
  • MFU Fragment Unit
  • MPU Media Processing Unit
  • the header unit may be an indicator indicating whether the encoded media fragment data includes a random access point, and the encoded media fragment data indicates a start portion of an access unit (AU). And an indicator indicating whether or not to include and an indicator indicating whether the encoded media fragment data includes the last part of the access unit (AU).
  • MMT MPEG Media Transport
  • the present invention can simplify the implementation of the media unit by configuring the media unit header with “general” information that can assist in the transport and consumption of the media.
  • the present invention is to configure the media unit header to the information transmitted by the media encoder to the system, excluding the information dependent on the specific media codec, so that the media data encoded regardless of the specific media codec is heterogeneous IP network ( Systems for transmission over heterogeneous IP networks allow for more efficient media streaming.
  • FIG. 1 is a conceptual diagram illustrating an MMT hierarchical structure.
  • FIG. 2 is a conceptual diagram illustrating a format of unit information (or data or packet) used for each layer of the MMT hierarchical structure.
  • 3 is a conceptual diagram of an MMT package configuration.
  • MFU media fragment unit
  • MFU media fragment unit
  • FIG. 6 is a flowchart illustrating a method of transmitting encoded media fragment data according to an embodiment of the present invention.
  • FIG. 7 is a flowchart illustrating a method of transmitting encoded media fragment data according to another embodiment of the present invention.
  • FIG. 8 is a flowchart illustrating a method of transmitting encoded media fragment data according to another embodiment of the present invention.
  • first and second may be used to describe various components, but the components should not be limited by the terms. The terms are used only for the purpose of distinguishing one component from another.
  • the first component may be referred to as the second component, and similarly, the second component may also be referred to as the first component.
  • MMT MPEG Media Transport
  • the content component or media component is defined as a media of a single type or a subset of the media of a single type. , Video tracks, movie subtitles, or a video enhancement layer of video.
  • Content is defined as a set of content components, and may be, for example, a movie or a song.
  • a presentation is defined as an operation performed by one or more devices to allow a user to experience one content component or one service (eg, watch a movie).
  • a service is defined as one or more content components that are transmitted for presentation or storage.
  • Service information is defined as metadata describing one service, characteristics and components of the service.
  • An access unit is the smallest data entity and may have time information as an attribute.
  • the AU is not defined.
  • An MMT asset is a logical data entity consisting of at least one MPU with the same MMT asset ID or a specific chunk of data with a format defined by other standards.
  • the MMT asset is the largest data unit to which the same composition information and transmission characteristics apply.
  • MMT Asset Delivery Characteristics is a description related to QoS requirements for delivering MMT assets. MMT-ADC is expressed without knowing the specific transmission environment.
  • MMT Composition Information describes spatial and temporal relationships between MMT assets.
  • MFU Media Fragment Unit
  • AU access unit
  • An MMT package is a collection of logically structured data and includes at least one MMT asset, MMT composition information, MMT asset asset, and descriptive information.
  • the MMT packet is a format of data generated or consumed by the MMT protocol.
  • the MMT payload format is a format for payload of an MMT package or MMT signaling message to be delivered by an MMT protocol or an internet application layer protocol (eg, RTP).
  • the Media Processing Unit is a generic container that is independent of any particular media codec and contains at least one AU and information related to additional transmission and consumption. For non-temporal data, the MPU accepts a portion of data that does not fall within the AU range. MPU is encoded media data that can be processed completely and independently. In this context, processing means encapsulation or packetization into an MMT package for transmission.
  • Non-timed data defines all data elements that are consumed without specifying time.
  • Non-timed data can have a time range within which the data can be executed or started.
  • Timed data defines data elements associated with a particular time to be decoded and presented.
  • Media data refers to data elements including both non-timed data and timed-data.
  • the media unit refers to a container including a media fragment unit (MFU) or a media processing unit (MPU).
  • MFU media fragment unit
  • MPU media processing unit
  • FIG. 1 is a conceptual diagram illustrating an MMT hierarchical structure.
  • the MMT layer includes an encapsulation layer, a delivery layer, and an S layer.
  • the MMT layer operates on a transport layer.
  • the encapsulation layer may be responsible for, for example, packetization, fragmentation, synchronization, multiplexing, and the like of transmitted media.
  • the encapsulation functional area defines the logical structure of the format of the media content, the MMT package, and the data units to be processed by the MMT compliant entity.
  • the MMT package specifies the components that contain the media content and the relationships between them.
  • the format of the data units is defined to encapsulate the encoded media to be stored or transmitted in the payload of the transport protocol and to be easily converted between them.
  • Encapsulation layer (E-layer), as shown in Figure 1, MMT E.1 Layer (MMT E.1 Layer), MMT E.2 Layer (MMT E.2 Layer) and MMT E.3 Layer (MMT) E.3 Layer).
  • MMT E.1 Layer MMT E.1 Layer
  • MMT E.2 Layer MMT E.2 Layer
  • MMT E.3 Layer MMT E.3 Layer
  • the E.3 layer encapsulates a Media Fragment Unit (MFU) provided from the Media Codec (A) layer to create a Media Processing Unit (MPU).
  • MFU Media Fragment Unit
  • A Media Codec
  • MPU Media Processing Unit
  • Encoded media data from the upper layer is encapsulated in MFU.
  • the type and value of the encoded media can be abstracted to allow the MFU to be generally used in a particular codec technology. This allows the lower layer to process the MFU without access to the encapsulated encoded media.
  • the lower layer retrieves the required encoded media data from the network or storage buffer and sends it to the media decoder.
  • the MFU has enough information media subunits to perform this operation.
  • the MFU may have a format, independent of any particular codec, that can carry data units that can be consumed independently in the media decoder.
  • the MFU can be, for example, a picture or slice of the video.
  • MFUs One or a group of multiple MFUs that can be independently transmitted and decoded create an MPU.
  • Non-temporal media that are independently transportable and executable also create an MPU.
  • MPUs describe internal structures such as the arrangement and pattern of MFUs that allow for quick access and partial consumption of MFUs.
  • the E.2 layer encapsulates the MPUs created in the E.3 layer to generate MMT assets.
  • An MMT asset is a data entity made up of one or more MPUs from a single data source, and is a data unit in which composition information (CI) and transport characteristics (TC) are defined. Multiplexed by load format and transmitted by MMT protocol.
  • MMT assets can correspond to packetized elementary streams (PES), for example video, audio, program information, MPEG-U widgets, JPEG images, MPEG 4 file format, M2TS (MPEG transport stream), etc.
  • PES packetized elementary streams
  • the E.1 layer creates an MMT package by encapsulating the MMT asset generated in the E.2 layer.
  • the MMT asset is packaged with MMT composition information (MMT-CI) for later response of the same user experience together or separately with other functional areas—transmission area and signal area.
  • MMT-CI MMT composition information
  • the MMT package is also packaged with a transmission characteristic that selects an appropriate transmission method for each MMT asset to satisfy the haptic quality of the MMT asset.
  • the MMT package may be composed of one or more MMT assets together with additional information such as composition information and transport characteristics.
  • Composition information includes information about a relationship between MMT assets, and when one content consists of a plurality of MMT packages, it indicates a relationship between a plurality of MMT packages. It may further include information.
  • the transport characteristics may include transmission characteristic information necessary for determining a delivery condition of an MMT asset or an MMT packet, and may include, for example, a traffic description parameter and a QoS descriptor. ) May be included.
  • the MMT package may correspond to a program of MPEG-2 TS.
  • the delivery layer may perform, for example, network flow multiplexing, network packetization, and QoS control of media transmitted through a network.
  • the delivery functional area defines the application layer protocol and format of the payload.
  • the application layer protocol in the present invention provides enhanced features for the delivery of MMT packages as compared to conventional application layer protocols for the transmission of multimedia including multiplexing.
  • the payload format is defined to carry coded media data regardless of the media type or encoding method.
  • the transport layer (D-layer), as shown in Figure 1, MMT D.1 Layer (MMT D.1 Layer), MMT D.2 Layer (MMT D.2 Layer) and MMT D.3 Layer (MMT) D.3 Layer).
  • the D.1 layer receives the MMT package generated in the E.1 layer and generates an MMT payload format.
  • the MMT payload format is a payload format for carrying MMT assets and for transmitting information for consumption by the MMT application protocol or other existing application transport protocol such as RTP.
  • the MMT payload may include a fragment of the MFU along with information such as AL-FEC.
  • the D.2 layer receives the MMT payload format generated in the D.1 layer and generates an MMT transport packet or an MMT packet.
  • the MMT transport packet or MMT packet is a data format used in an application transport protocol for MMT.
  • D.3 layer supports QoS by providing the function of exchanging information between layers by cross-layer design.
  • the D.3 layer may perform QoS control using QoS parameters of the MAC / PHY layer.
  • the S layer performs a signaling function. For example, signaling functions for session initialization / control / management of transmitted media, server-based and / or client-based trick modes, service discovery, synchronization, etc. Can be done.
  • the signaling functional area defines the format of the message that manages the delivery and consumption of the MMT package.
  • the message for consumption management is used to transmit the structure of the MMT package, and the message for delivery management is used to transmit the structure of the payload format and the configuration of the protocol.
  • the S layer may include an MMT S.1 layer and an MMT S.2 layer.
  • S.1 layer includes service discovery, media session initialization / termination of media, media session presentation / control of media, delivery (D) layer and encapsulation (E).
  • the interface function with the layer can be performed.
  • the S.1 layer may define the format of control messages between applications for media presentation session management.
  • the S.2 layer is responsible for flow control, delivery session management, delivery session monitoring, error control, and hybrid network synchronization control. It is possible to define the format of the control message exchanged between delivery end-points of the D-layer.
  • the S.2 layer supports delivery session establishment and release, delivery session monitoring, flow control, error control, resource scheduling for established delivery sessions, and synchronization in a complex delivery environment to support the behavior of the delivery layer.
  • Signaling for adaptive delivery, and signaling for adaptive delivery. Required signaling may be provided between a sender and a receiver. That is, the S.2 layer may provide signaling required between the sender and the receiver in order to support the operation of the transport layer as described above.
  • the S.2 layer may be responsible for interfacing with the transport layer and the encapsulation layer.
  • FIG. 2 illustrates a format of unit information (or data or packet) used for each layer of the MMT hierarchical structure of FIG. 1.
  • the media fragment unit (MFU) 130 may include coded media fragment data 132 and a media fragment unit header (MFUH) 134.
  • the media fragment unit 130 has a general container format independent of a specific codec and may carry the smallest data unit that can be consumed independently in a media decoder.
  • the MFUH 134 may include additional information such as media characteristics-for example, loss-tolerance.
  • MFU) 130 may be, for example, a picture or slice of a video.
  • the Media Fragment Unit may define a format that encapsulates a portion of the AU in the transport layer to perform adaptive transmission in the range of the MFU.
  • the MFU may be used to transmit certain types of encoded media so that portions of the AU can be independently decoded or discarded.
  • the MFU has an identifier for distinguishing one MFU from other MFUs and may have general relationship information between MFUs in a single AU.
  • the dependency relationship between MFUs in a single AU is described, and the relative priority of the MFUs can be described as part of such information.
  • the information can be used to handle the transmission at the lower transport layer.
  • the transport layer may omit the transmission of MFUs that may be discarded to support QoS transmission in insufficient bandwidth. Detailed description of the MFU structure will be given later.
  • the MPU is a collection of media fragment units including a plurality of media fragment units 130.
  • the MPU may have a general container format independent of a specific codec and may include media data equivalent to an access unit.
  • the MPU may have a timed data unit or a non-timed data unit.
  • MPU is data that is independently and completely processed by an entity following the MMT, and the processing may include encapsulation and packetization.
  • An MPU may consist of at least one MFU or have a portion of data having a format defined by another standard.
  • a single MPU may accommodate the integral number or non-time data of at least one AU.
  • an AU may be delivered from at least one MFU, but one AU may not be divided into multiple MPUs.
  • one MPU receives a portion of non-time data that has been independently and completely processed by an entity that complies with the MMT.
  • An MPU can be uniquely identified within an MMT package with a sequence number and an associated asset ID that distinguishes it from other MPUs.
  • the MPU may have at least one random access point.
  • the first byte of the MPU payload can always start with a random access point. In time data, this fact means that the decoding order of the first MFU in the MPU payload is always zero.
  • the presentation period and decoding order of each AU can be sent to inform the presentation time.
  • the MPU does not have its initial presentation time, and the presentation time of the first AU of one MPU may be described in the composition information.
  • the composition information may specify the first presentation time of the MPU. Details will be described later.
  • the MMT asset 150 is a collection of MPUs composed of a plurality of MPUs.
  • the MMT asset 150 is a data entity composed of multiple MPUs (timed or non-timed data) from a single data source, and the MMT asset information 152 is an asset packaging metadata (Asset). Additional information such as packaging metadata) and data type.
  • MMT asset 150 may include, for example, video, audio, program information, MPEG-U widgets, JPEG images, MPEG 4 FF (File Format), packetized elementary streams (PES), and MPEG transport (M2TS). streams).
  • MMT Assets are logical data entities that contain encoded media data.
  • the MMT asset may be composed of an MMT asset header and encoded media data.
  • the encoded media data may be a collective reference group of MPUs with the same MMT asset ID.
  • Types of data that can be individually consumed by an entity directly connected to the MMT client may be considered as separate MMT assets. Examples of data types that can be considered as individual MMT assets include MPEG-2 TS, PES, MP4 files, MPEG-U Widget Package, and JPEG files.
  • the encoded media of the MMT asset may be time data or non-time data.
  • Temporal data is audiovisual media data that requires synchronized decoding and presentation of specific data at specified times.
  • Non-timed data is data of a data type that can be decoded and provided at any time in accordance with the provision of a service or user interaction.
  • a service provider may create a multimedia service by integrating MMT assets and putting MMT assets on a space-time axis.
  • the MMT package 160 is a collection of MMT assets including one or more MMT assets 150.
  • MMT assets in an MMT package may be multiplexed or concatenated.
  • the MMT package is a container format for MMT asset and configuration information.
  • the MMT package provides a repository of MMT assets and configuration information for the MMT program.
  • the MMT program provider generates configuration information by encapsulating the encoded data into MMT assets and describing the temporal and spatial layout of the MMT assets and their transmission characteristics.
  • MU and MMT assets can be sent directly in the D.1 payload format.
  • the configuration information may be sent by the C.1 Presentation Session Management message.
  • MMT program providers and clients that allow relaying or future reuse of MMT programs store them in MMT package format.
  • the MMT program provider determines which transmission path (eg, broadcast or broadband) the MMT asset will be provided to the client.
  • Configuration information in the MMT package is transmitted in a C.1 presentation session management message along with transmission related information.
  • the client receives the C.1 Presentation Session Management message to know which MMT programs are available and how to receive the MMT assets for the corresponding MMT program.
  • the MMT package can also be transmitted by the D.1 payload format.
  • the MMT package is packetized and delivered in D.1 payload format.
  • the client receives the packetized MMT package and configures all or part of it, where it consumes the MMT program.
  • the package information 165 of the MMT package 160 may include configuration information.
  • the configuration information may include additional information such as a list of MMT assets, package identification information, composition information 162, and transport characteristics 164.
  • Composition information 162 includes information about a relationship between MMT assets 150.
  • composition information 162 may further include information for indicating a relationship between a plurality of MMT packages when one content includes a plurality of MMT packages.
  • Composition information 162 may include information about temporal, spatial and adaptive relations in an MMT package.
  • Composition Information in the MMT provides information about the spatial and temporal relationships between MMT assets in the MMT package.
  • MMT-CI is an explanatory language that extends HTML5 to provide such information. If HTML5 is designed to describe page-based presentations of text-based content, MMT-CI mainly represents spatial relationships between sources. In order to support the presentation of the temporal relationship between MMT assets, information related to MMT assets in an MMT package, such as presentation resources, time information for determining the order in which MMT assets are sent and consumed, and various MMT assets are consumed in HTML5. It can be extended to have additional properties of media elements. Detailed description will be described later.
  • the transport characteristics information 164 may include information on transmission characteristics and may provide information necessary for determining a delivery condition of each MMT asset (or MMT packet).
  • the transmission characteristic information may include a traffic description parameter and a QoS descriptor.
  • the traffic description parameter may include bitrate information, priority information, or the like for the media fragment unit (MFU) 130 or the MPU.
  • the bitrate information is for example information about whether the MMT asset is Variable BitRate (VBR) or Constant BitRate (CBR), guaranteed bitrate for the Media Fragment Unit (MFU) (or MPU). ), The maximum bit rate for the media fragment unit (MFU) (or MPU).
  • the traffic description parameter may be used for resource reservation between servers, clients, and other components on a delivery path, for example, maximum size information of a media fragment unit (MFU) (or MPU) in an MMT asset. It may include.
  • the traffic description parameter may be updated periodically or aperiodically.
  • the QoS descriptor includes information for QoS control and may include, for example, delay information and loss information.
  • the loss information may include, for example, a loss indicator of whether delivery loss of the MMT asset is allowed or not.
  • a loss indicator of '1' may indicate 'lossless', and a '0' indicates 'lossy'.
  • the delay information may include a delay indicator used to distinguish the sensitivity of the transmission delay of the MMT asset.
  • the delay indicator may indicate whether the type of the MMT asset is conversation, interactive, real time, and non-realtime.
  • One content may consist of one MMT package.
  • one content may consist of a plurality of MMT packages.
  • composition information or composition information indicating temporal, spatial, and adaptive relations between the plurality of MMT packages may exist inside one MMT package or outside the MMT package.
  • some of the content components are transmitted through a broadcast network and the rest of the content components are transmitted through a broadband network.
  • a broadband network For example, in the case of a plurality of audio visual streams constituting one multi-view service, one stream may be transmitted to a broadcasting network and the other stream may be transmitted to a broadband network, and each AV stream may be multiplexed and transmitted to a client terminal. Can be individually received and stored.
  • application software such as a widget is transmitted to a broadband network and an AV stream (AV program) is transmitted to an existing broadcasting network.
  • the entire plurality of AV streams may be a single MMT package, and in this case, one of the plurality of streams may be stored in only one client terminal.
  • the storage content becomes part of the MMT package, and the client terminal needs to rewrite the composition information or the configuration information, and the rewritten content becomes a new MMT package independent of the server. .
  • each AV stream may be one MMT package, and in this case, a plurality of MMT packages constitute one content, and storage Storage is recorded in MMT package units and requires composition information or configuration information indicating a relationship between MMT packages.
  • composition information or configuration information included in one MMT package may refer to an MMT asset in another MMT package, and may refer to an outside of an MMT package that refers to the MMT package in an out-band situation. I can express it.
  • the MMT package 160 is controlled through a control (C) layer.
  • the MMT control message may include an information table for service discovery.
  • the server dividing the multimedia content into a plurality of segments allocates URL information to a plurality of segments divided into a predetermined number, and stores URL information about each segment in a media information file and transmits the URL information to the client.
  • the media information file may be called various names such as “media presentation description (MPD)” or “manifest file” according to a standardization organization that standardizes HTTP streaming.
  • MPD media presentation description
  • MPD media presentation description
  • the cross-layer interface is described below.
  • the Cross Layer Interface provides a means for supporting QoS in a single entity by exchanging QoS related information between lower layers including the application layer and the MAC / PHY layer.
  • the lower layer provides bottom-up QoS information such as network channel state, while the application layer provides information related to media characteristics as top-down QoS information.
  • the cross layer interface provides an integrated interface between the application layer and various network layers including IEE802.11 WiFi, IEEE 802.16 WiMAX, 3G, 4G LTE, etc.
  • Common network parameters of popular network standards are extracted as NAM parameters for static and dynamic QoS control of real-time media applications over various networks.
  • the NAM parameter may include a BER value that is a bit error rate. BER can be measured at the PHY or MAC layer.
  • the NAM also provides the identification of the underlying network, possible bit rates, buffer conditions, peak bit rates, service unit sizes, and service data unit loss rates.
  • the first way is to provide an absolute value.
  • the second is to provide relative values.
  • the second method can be used to update the NAM while connected.
  • the application layer provides top-down QoS information related to media characteristics for lower layers.
  • top-down information such as MMT asset level information and packet level information.
  • MMT asset information is used for capacity exchange and / or resource (re) allocation at lower layers.
  • Packet level top down information is recorded in the appropriate field of every packet for the lower layer to identify the QoS level it supports.
  • the lower layer provides bottom-up QoS information to the application layer.
  • the lower layer provides information regarding network conditions that change over time, enabling faster and more accurate QoS control at the application layer.
  • Bottom-up information is expressed in an abstracted form to support heterogeneous network environments. These parameters are measured at the lower layer and read at the application layer periodically or at the request of the MMT application.
  • MMT system a system for transmitting encoded media data through a heterogeneous IP network
  • information that can be provided by the media encoder in relation to the media data is indicated.
  • MMT system a heterogeneous IP network
  • a method for transmitting coded media fragment data through a heterogeneous IP network may be performed as shown in FIG. 6.
  • a media unit (MU) having a format independent of a specific media codec is provided from a media codec layer (S610).
  • the provided media unit is encapsulated to generate an MMT asset (S620).
  • the MMT package is encapsulated to generate an MMT package (S630), the MMT package is received to generate an MMT payload format (S640), and the MMT payload format is received to MMT.
  • a transport packet (MMT Transport Packet) is generated (S650).
  • a number of indicators may be included in the media unit to convey information obtained by the encoder during encoding to the system.
  • An indicator generated by the encoder in the media unit in relation to the media data with the minimum information that the media encoder can provide according to an embodiment of the present invention is encoded by flag_RAP indicating a random access point (RAP).
  • Flag_start_media and flag_end_media representing the start and end of the media data
  • the flag_lossless indicator representing whether the critical (critical) when transmitting the coded media fragment data.
  • the critical status indicates whether the media unit transmits a serious problem in processing at the decoder when a bit or a packet of encoded media fragment data is lost.
  • the flag_lossless indicator indicates that the criticality is very low data.
  • Each of the above-mentioned indicators may be omitted in part as necessary.
  • flag_RAP is an indicator indicating whether encoded media fragment data has a random access point (RAP). If the value of the Flag_RAP indicator is 1, it indicates that the encoded media fragment data includes a start point that can be accessed randomly. On the contrary, if the value of the Flag_RAP indicator is 0, it indicates that the encoded media fragment data does not have a starting point for random access.
  • flag_start_of_media indicates whether the encoded media fragment data has a first fragment of encoded media data sharing the same presentation time stamp (PTS). If the value of the flag_start_of_media indicator is 1, the encoded media fragment data included in the media unit has the first fragment of the encoded media data sharing the same PTS. In the same way, if the value of the flag_start_of_media indicator is not 1, it indicates that the fragment of the encoded media fragment data included in the media unit is not the first fragment of the encoded media data sharing the same PTS.
  • PTS presentation time stamp
  • flag_end_of_media indicates whether the encoded media fragment data included in the media unit has a last fragment of encoded media data sharing the same PTS. That is, if the value of flag_end_of_media is 1, the encoded media fragment data included in the corresponding media unit has the last fragment of the encoded media data sharing the same PTS.
  • flag_lossless indicates whether the encoded media fragment data included in the media unit is critical to loss of bits or packets. If the value of the flag_lossless indicator is 1, it indicates that the coded media fragment data contains data that may cause serious problems in the decoding process at the decoder in the case of bit or packet loss.
  • a method for transmitting coded media fragment data through a heterogeneous IP network may be performed as shown in FIG. 7.
  • a method of transmitting encoded media data starts with receiving a Media Processing Unit (MPU) having a format independent of a specific media codec from a media codec layer (S710).
  • An MMT asset is generated by encapsulating the provided Media Processing Unit (MPU) (S720).
  • an MMT package is generated by encapsulating the MMT asset (S730), the MMT package is received to generate an MMT Payload format (S740), and the MMT payload format is received.
  • a MMT transport packet is generated (S750).
  • a plurality of indicators may be included in the media processing unit (MPU) to convey information obtained by the encoder to the system during encoding.
  • MPU media processing unit
  • a processing unit (Media Processing Unit; MPU) includes a header portion and a data portion.
  • the data portion contains encoded media fragment data.
  • the header part includes a flag_RAP indicator indicating whether the encoded media fragment data includes a random access point, a flag_start_of_media indicator indicating whether the encoded media fragment data includes a start portion of encoded media data, and A flag_end_of_media indicator indicating whether the encoded media fragment data includes the last part of the encoded media data and the header portion include a flag_lossless indicator indicating whether the encoded media fragment data requires lossless transmission.
  • the encoded media data is divided into encoded media fragment data. Generates a sequence number of encoded media fragment data during the partitioning process. Alternatively, the sequence number of the encoded media fragment data may be sequentially inserted into the MPU without generating the sequence number. During the encoding process, it may be determined whether the encoded media fragment data is the first or last encoded media fragment data of the encoded media data and whether the encoded media fragment data has a random access point.
  • the first coded media fragment data is separated from the coded media data. It is determined whether the separated coded media fragment data has a random access point, and the separated coded media fragment data is inserted into the MPU. Since the encoded media fragment data, the sequence number of the encoded media fragment data generated in flag_start_media is stored. In another embodiment of the present invention, when the sequence number of the encoded media fragment data is not generated, the relative position at which the encoded media fragment data is inserted in the MPU is stored. It also indicates whether the flag_RAP has a random access point. In an embodiment of the present invention, the flag_RAP indicator may accommodate sequence numbers of encoded media fragment data.
  • the flag_RAP indicator is generated when the flag_RAP is a bslbf (bit string, left bit first) indicator in which the left bit has priority using the relative position where the media fragment data encoded in the MPU is inserted. If the encoded media fragment data has random access points, 1 is stored in the first bit of flag_RAP. If the encoded media fragment data does not have random access points, 0 is stored in the first bit. In the same way, the Lossless_flags value can be maintained.
  • the above steps are repeated for the next encoded media fragment data of the encoded media data.
  • the generated encoded media fragment data is the last encoded media fragment data of the encoded media data
  • the sequence number of the generated encoded media fragment data is stored in flag_end_of_media of the MPU.
  • a method for transmitting coded media fragment data through a heterogeneous IP network may be performed as shown in FIG. 8.
  • a media fragment unit (MFU) having a format independent of a specific media codec is provided from a media codec layer (S810).
  • the provided media fragment unit is encapsulated to generate a media processing unit (MPU) (S820).
  • An MMT asset is generated by encapsulating a media processing unit (MPU) (S830).
  • the MMT package is generated by encapsulating the MMT asset (S840), the MMT package is received to generate the MMT payload format (S850), and the MMT payload format is received.
  • a MMT Transport Packet is generated (S860).
  • a plurality of indicators may be included in the media fragment unit (MFU) to convey information obtained by the encoder during encoding to the system.
  • MFU media fragment unit
  • the aforementioned Media Fragment Unit may be used as an output of a media encoder (e.g. video encoder or audio encoder).
  • MFU may include an MFU header (mfu_header) field as a header part and a media data (media_data) field as a data part.
  • the MFU separates an access unit (AU) into one or more pieces of encoded media fragment data, and separates and receives each part in each MFU.
  • the separated AU data is accommodated in the media data (media_data) field field of each MFU.
  • the header structure of the MFU according to an embodiment of the present invention is shown in FIG. 5.
  • the MFU header (mfu_header) field may be used as a path for delivering information to the system in a form in which the content is written in the media encoder and passed to the system.
  • the size of the MFU header may be designed to have a fixed size as a single byte for simplicity. That is, the size of the MFU header may be limited to a single byte for simplicity.
  • the header of the MFU may consist of the minimum information that the media encoder can provide. Accordingly, the MFU header according to an embodiment of the present invention minimizes the header size and simplifies the structure of the header. This is because the MFU is not processed by the system but is accessed / processed by the media encoder. Therefore, when the MFU header has a complicated structure, implementation is difficult and the availability is reduced.
  • the MFU header is composed of general information that can help media transmission and consumption as the media encoder knows best.
  • MFU header when MFU header is configured, information dependent on a specific media or a specific media codec is excluded as much as possible.
  • the MFU header may include a reserved area, a flag_RAP, a flag_lossless, a flag_start_of_AU, and a flag_end_of_AU indicator.
  • the flag_RAP, flag_lossless, flag_start_of_AU, and flag_end_of_AU indicators are designed to occupy one bit of space.
  • a reserved area of 4 bits can be set for future use scalability. Each indicator may be omitted in some cases as necessary.
  • flag_RAP is an indicator indicating whether the coded media fragment data included in the MFU has a random access point (RAP). If the value of the Flag_RAP indicator is 1, it indicates that the encoded media fragment data included in the corresponding MFU includes a starting point for random access. On the contrary, if the value of the Flag_RAP indicator is 0, it indicates that the encoded media fragment data included in the corresponding MFU does not have a starting point for random access.
  • flag_lossless is an indicator indicating whether media data included in the MFU is critical to loss of bits or packets. If the value of the flag_lossless indicator is 1, it indicates that the coded media fragment data included in the corresponding MFU includes data that may cause serious problems to the decoder process when bits or packets are lost.
  • flag_start_of_AU indicates whether coded media fragment data included in the corresponding MFU has a first fragment of an access unit (AU) that shares the same presentation time stamp (PTS). If the value of the flag_start_of_AU indicator is 1, the encoded media fragment data included in the corresponding MFU has the first fragment of the AU sharing the same PTS. In the same manner, if the value of the flag_start_of_AU indicator is not 1, it indicates that the fragment of the coded media fragment data included in the corresponding MFU is not the first fragment of the AU sharing the same PTS.
  • flag_end_of_AU indicates whether coded media fragment data included in the corresponding MFU has the last fragment of the AU sharing the same PTS. That is, if the value of flag_end_of_AU is 1, the encoded media fragment data included in this MFU has the last fragment of the AU sharing the same PTS.
  • the MFU generated in the above-described embodiment may be sequentially stored in the MPU to transmit media data without having to allocate the MFU sequence number of the MFU. By not assigning an MFU sequence number, the size of data to be transmitted can be reduced.
  • An MFU sequence number is a serial number of an MFU, and is defined as distinguishing an MFU between one MPU.
  • the decoder on the receiving side decapsulates the received MPUs in the order in which they are received to obtain MFUs sequentially stored in the MPUs.
  • the decoder refers to the header of the MFU to identify the start of random access, the start and end of the media data, and MFUs with data that can cause serious problems to the decoder process in the event of loss of bits or packets require retransmission or decoding of that part. Can be omitted.
  • an MFU sequence number indicating the relationship between the separated AU fragments may be generated by displaying a separate field in the MFU.
  • a separate MFU sequence number field may be used for the MFU header (mfu_header).
  • the MFU sequence number may be stored for each MFU by using a separate MFU sequence number field in addition to the MFU header (mfu_header) field and the media data (media_data) field.
  • the MFU sequence number may be included in a specific portion of the media data (media_data) field in which the encoded media fragment data is accommodated.
  • the encoder and decoder may use the MFU sequence number to ensure that the order between the media data and the MFU matches.
  • a method for transmitting coded media fragment data through a heterogeneous IP network may be performed as shown in FIG. 8.
  • a media fragment unit (MFU) having a format independent of a specific media codec is provided from a media codec layer (S810).
  • the provided media fragment unit is encapsulated to generate a media processing unit (MPU) (S820).
  • An MMT asset is generated by encapsulating a media processing unit (MPU) (S830).
  • an MMT package is generated by encapsulating the MMT asset (S840), an MMT package is generated by generating an MMT payload format (S850), and an MMT payload format is received.
  • a MMT Transport Packet is generated (S860).
  • a plurality of indicators may be included in the media processing unit (MPU) to convey information obtained by the encoder to the system during encoding.
  • MPU media processing unit
  • the MFU stores only encoded media fragment data, sequentially stores the MFU in the MPU, and corresponds to the first encoded media fragment data of the AU and the last encoded media fragment data in the header of the MPU.
  • MFU corresponding to the encoded media fragment data having a start point of random access may be indicated.
  • the MFU coded media fragment data and MFU sequence numbers are stored in the MFU, and the MFU corresponding to the first encoded media fragment data of the AU and the last encoded media fragment data is indicated in the header of the MPU.
  • the MFU corresponding to the encoded media fragment data having the start point of random access may be indicated.
  • the MFU includes an MFU sequence number and encoded media fragment data.
  • the MPU may include Subsample_start_id, Subsample_end_id, and RAP_flags indicators.
  • Subsample_start_id and Subsample_end_id specify a range of MFUs of the corresponding AU.
  • the mfu_sequence_id specified by the subsample_start_id indicates the MFU sequence number of the first MFU of the MFU belonging to the AU.
  • the mfu_sequence_id specified by the subsample_end_id indicates the last MFU sequence number of the MFU belonging to the AU.
  • the RAP_flags indicates whether the corresponding AU has a random access point (RAP). If the AU has a random access point, RAP_flags may have a value of 1. This may be implemented by indicating whether each MFU has a random access point (RAP) with bits having a value of 1 or 0 in successive relative order of MFUs belonging to the MPU. If any of the MFUs belonging to one AU has a random access point, RAP_flags has a nonzero value, and if all MFUs do not have a random access point, RAP_flags has a value of zero. Alternatively, RAP_flags may accommodate an MFU sequence number.
  • the MPU may include Lossless_flags for each MFU indicating whether the data is critical data that can cause serious problems in decoder processing when bits or packets are lost.
  • Lossless_flags may have a value of 1 when the AU has critical data.
  • a bit may have a value of 1 or 0 in a sequential order of MFUs belonging to the MPU to indicate whether each MFU has critical data. If any of the MFUs belonging to one AU has critical data, Lossless_flags has a nonzero value, and Lossless_flags has a value of zero unless all MFUs have random access points.
  • Whether it has critical data may be indicated at the MMT asset level or at the MMT transport packet level rather than at the MFU level. It is possible to indicate that lossless transmission is required for one asset while indicating whether data loss is allowed at the MMT asset level, or lossless transmission may be required for transmitted packets while indicating whether data loss is allowed at the MMT transport packet level. .

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
  • Data Exchanges In Wide-Area Networks (AREA)

Abstract

La présente invention concerne un procédé pour transmettre des données multimédias par l'intermédiaire d'un réseau IP hétérogène, le procédé transmettant de manière efficace, indépendamment d'un codec multimédia spécifique, une diffusion en continu multimédia dans le format d'une unité de données de fragment multimédia codé, comprenant un en-tête d'une unité multimédia ayant une dimension réduite au minimum et une structure simplifiée.
PCT/KR2012/008337 2011-10-13 2012-10-12 Procédé de transmission de données multimédia par l'intermédiaire d'un réseau ip hétérogène indépendamment d'un codec multimédia Ceased WO2013055162A2 (fr)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US14/351,776 US20140282799A1 (en) 2011-10-13 2012-10-12 Method for transmitting media data via a heterogeneous ip network independently of a media codec

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
KR10-2011-0104853 2011-10-13
KR20110104853 2011-10-13
KR1020120112544A KR20130040132A (ko) 2011-10-13 2012-10-10 이종 ip 네트워크를 통한 미디어 코덱에 독립적인 미디어 데이터 전송 방법
KR10-2012-0112544 2012-10-10

Publications (2)

Publication Number Publication Date
WO2013055162A2 true WO2013055162A2 (fr) 2013-04-18
WO2013055162A3 WO2013055162A3 (fr) 2013-06-13

Family

ID=48082683

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/KR2012/008337 Ceased WO2013055162A2 (fr) 2011-10-13 2012-10-12 Procédé de transmission de données multimédia par l'intermédiaire d'un réseau ip hétérogène indépendamment d'un codec multimédia

Country Status (1)

Country Link
WO (1) WO2013055162A2 (fr)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106105235A (zh) * 2014-01-09 2016-11-09 三星电子株式会社 在多媒体传输系统中发送媒体数据相关信息的方法和装置

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR101777347B1 (ko) * 2009-11-13 2017-09-11 삼성전자주식회사 부분화에 기초한 적응적인 스트리밍 방법 및 장치
KR101737084B1 (ko) * 2009-12-07 2017-05-17 삼성전자주식회사 메인 콘텐트에 다른 콘텐트를 삽입하여 스트리밍하는 방법 및 장치

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106105235A (zh) * 2014-01-09 2016-11-09 三星电子株式会社 在多媒体传输系统中发送媒体数据相关信息的方法和装置
CN111417013A (zh) * 2014-01-09 2020-07-14 三星电子株式会社 在多媒体传输系统中发送媒体数据相关信息的方法和装置
US10779035B2 (en) 2014-01-09 2020-09-15 Samsung Electronics Co., Ltd. Method and apparatus of transmitting media data related information in multimedia transmission system
CN111417013B (zh) * 2014-01-09 2021-12-10 三星电子株式会社 多媒体传输系统中发送和接收数据相关信息的方法和装置
US11297381B2 (en) 2014-01-09 2022-04-05 Samsung Electronics Co., Ltd. Method and apparatus of transmitting media data related information in multimedia transmission system

Also Published As

Publication number Publication date
WO2013055162A3 (fr) 2013-06-13

Similar Documents

Publication Publication Date Title
WO2013077697A1 (fr) Procédé de fourniture hybride d'ensemble mmt et de contenu et procédé de réception de contenu
US10728591B2 (en) Method of configuring and transmitting an MMT transport packet
WO2013077698A1 (fr) Procédé de liaison de média mmt et de média dash
WO2013169084A1 (fr) Procédé de transmission hybride par extension de format de paquets mmt
US20150181003A1 (en) Method and apparatus for transmitting and receiving packets in hybrid transmission service of mmt
WO2013141666A1 (fr) Procédé de transmission et procédé de réception hybrides pour un contenu vidéo svc empaqueté dans un mmt
WO2013162312A1 (fr) Procédé et appareil permettant l'émission-réception de données destinées à un système de transmission multimédia
KR20130040132A (ko) 이종 ip 네트워크를 통한 미디어 코덱에 독립적인 미디어 데이터 전송 방법
WO2011059274A2 (fr) Procédé et appareil de diffusion adaptative en flux
JP2017163552A (ja) マルチメディアサービス送受信方法及び装置
WO2013112027A1 (fr) Procédé d'estimation de gigue de réseau dans un appareil de transmission de données multimédia codées
KR20130085987A (ko) 이종망 네트워크에서 미디어 프래그먼트 유닛으로 나누어진 액세스 유닛을 가지는 미디어 데이터를 전송하는 방법
US20150020138A1 (en) Mmt asset structure for supporting random access to systems for transmitting coded media data in heterogeneous networks, and method and apparatus for generating same
WO2018174367A1 (fr) Procédé et dispositif de transmission et de réception de signal de diffusion
KR102049039B1 (ko) Mmt 전송 패킷의 설정 방법 및 전송 방법
WO2016043432A1 (fr) Procédé et appareil permettant de transmettre ou de recevoir un contenu multimédia
WO2013055168A1 (fr) Procédé de transmission de données multimédias reliées à des informations de composition et des informations de caractéristiques de transmission par un réseau ip hétérogène
WO2013109094A1 (fr) Procédé de transmission de données multimédia ayant une unité d'accès divisée en unités de fragments multimédia dans un réseau hétérogène
WO2014010830A1 (fr) Procédé et appareil de transmission et de réception de paquets dans un service de transmission hybride de mmt
WO2013187667A1 (fr) Procédé d'adaptation de débit utilisant un taux d'erreur sur les bits pour un service multimédia et appareil associé
WO2013119023A1 (fr) Structure d'éléments de contenu mmt pour prendre en charge un accès aléatoire à des systèmes qui sont utilisés afin de transmettre des données multimédias codées dans des réseaux hétérogènes, et procédé et appareil adaptés pour créer cette structure
WO2013055162A2 (fr) Procédé de transmission de données multimédia par l'intermédiaire d'un réseau ip hétérogène indépendamment d'un codec multimédia
WO2017142347A1 (fr) Procédé et dispositif permettant de fournir des informations relatives à un contenu d'un service multimédia
KR20130040151A (ko) 콤포지션 정보 및 전송 특성 정보가 연동된 미디어 데이터를 이종 ip 네트워크를 통하여 전송하는 방법
KR20130040144A (ko) 엠엠티 시스템에서의 패킷 전송 장치 및 방법, 및 패킷 수신 장치 및 방법

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 12840255

Country of ref document: EP

Kind code of ref document: A2

WWE Wipo information: entry into national phase

Ref document number: 14351776

Country of ref document: US

122 Ep: pct application non-entry in european phase

Ref document number: 12840255

Country of ref document: EP

Kind code of ref document: A2