WO2022052102A1 - 一种通信方法及装置 - Google Patents
一种通信方法及装置 Download PDFInfo
- Publication number
- WO2022052102A1 WO2022052102A1 PCT/CN2020/115045 CN2020115045W WO2022052102A1 WO 2022052102 A1 WO2022052102 A1 WO 2022052102A1 CN 2020115045 W CN2020115045 W CN 2020115045W WO 2022052102 A1 WO2022052102 A1 WO 2022052102A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- video frame
- data
- video
- frame
- video frames
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/60—Network structure or processes for video distribution between server and client or between remote clients; Control signalling between clients, server and network components; Transmission of management data between server and client, e.g. sending from server to client commands for recording incoming content stream; Communication details between server and client
- H04N21/63—Control signaling related to video distribution between client, server and network components; Network processes for video distribution between server and clients or between remote clients, e.g. transmitting basic layer and enhancement layers over different transmission paths, setting up a peer-to-peer communication via Internet between remote STB's; Communication protocols; Addressing
- H04N21/637—Control signals issued by the client directed to the server or network components
- H04N21/6371—Control signals issued by the client directed to the server or network components directed to network
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L65/00—Network arrangements, protocols or services for supporting real-time applications in data packet communication
- H04L65/80—Responding to QoS
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/442—Monitoring of processes or resources, e.g. detecting the failure of a recording device, monitoring the downstream bandwidth, the number of times a movie has been viewed, the storage space available from the internal hard disk
- H04N21/44209—Monitoring of downstream path of the transmission network originating from a server, e.g. bandwidth variations of a wireless network
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/60—Network structure or processes for video distribution between server and client or between remote clients; Control signalling between clients, server and network components; Transmission of management data between server and client, e.g. sending from server to client commands for recording incoming content stream; Communication details between server and client
- H04N21/63—Control signaling related to video distribution between client, server and network components; Network processes for video distribution between server and clients or between remote clients, e.g. transmitting basic layer and enhancement layers over different transmission paths, setting up a peer-to-peer communication via Internet between remote STB's; Communication protocols; Addressing
- H04N21/631—Multimode Transmission, e.g. transmitting basic layers and enhancement layers of the content over different transmission paths or transmitting with different error corrections, different keys or with different transmission protocols
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L43/00—Arrangements for monitoring or testing data switching networks
- H04L43/08—Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters
- H04L43/0823—Errors, e.g. transmission errors
- H04L43/0829—Packet loss
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L43/00—Arrangements for monitoring or testing data switching networks
- H04L43/08—Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters
- H04L43/0852—Delays
Definitions
- the present application relates to the field of communication technologies, and in particular, to a communication method and device.
- sensors in the helmet can sense the user's position, movement changes, etc., and generate user information including viewing angle, line of sight, and movement rate.
- the user information can be transmitted to the XR server by means of uplink transmission.
- the path of uplink transmission can be helmet->terminal device->wireless network->XR server.
- the XR server can generate new video data according to the user's location, sight line and other information, combined with the user's scene in the game or real scene.
- the new video data can be transmitted to the helmet by means of downstream transmission.
- the downlink transmission path can be XR server -> wireless network -> terminal device -> helmet, and finally the above video data is displayed to the user through the helmet.
- the wireless network is, for example, a 3rd generation partnership project (3GPP) network, such as a long term evolution (LTE) or the 5th generation (5G) network. Since the data volume of the video frame transmitted in the downlink is usually large, how to optimize the downlink transmission mode is a technical problem to be solved in the present application.
- 3GPP 3rd generation partnership project
- LTE long term evolution
- 5G 5th generation
- the present application provides a communication method and apparatus. After receiving a downlink video frame, a terminal device can feed back frame parameters of the downlink video frame to a network device, thereby facilitating the optimization of the transmission of the downlink video frame.
- a communication method is provided, the execution subject of the method is a terminal device, and can also be a component (chip, circuit or others) provided in the terminal device, and the method includes: the terminal device receives from a network device at least one video frame; the terminal device determines a video frame parameter according to the at least one video frame; the terminal device sends the video frame parameter to the network device.
- the terminal device may use the video frame as the granularity to report the reception situation of the video frame, that is, the above-mentioned video frame parameter.
- the terminal device determines the video frame parameters according to the received video frames, and sends the video frame parameters to the network device, so that the network device can know the reception of the downlink video frame, which is convenient for optimizing the downlink transmission of the video frame .
- the video frame parameters include at least one of the following: extended delay parameters, frame spacing parameters, packet loss parameters, late arrival parameters, base layer and enhancement layer parameters.
- the video frame parameters reported by the terminal device can intuitively and accurately display the reception situation of the downlink video frame in different dimensions, which is convenient for the network device to optimize the transmission of the downlink video frame.
- the extension delay parameter is used to indicate at least one of the following: extension delay of the first video frame, average extension delay of multiple video frames, maximum extension delay of multiple video frames, multiple video frames The minimum extension delay of a frame, the variance of the extension delays of multiple video frames; wherein, the extension delay is the time between the terminal device successfully receiving the first data packet of a video frame and successfully receiving the last data packet of the video frame. Time length, the first video frame or the multiple video frames are one or more video frames in at least one video frame received by the terminal device.
- the terminal device can periodically report the above-mentioned extended delay parameters, and/or report the extended delay parameters under different conditions, and the reporting conditions or the contents of the reported extended delay parameters may or may not correspond to .
- the content and/or method of reporting extended delay parameters can be flexibly set to meet various reporting requirements.
- the frame spacing parameter is used to indicate at least one of the following: the frame spacing of adjacent video frames, the average value of multiple frame spacings, the maximum value of multiple frame spacings, and the minimum value of multiple frame spacings, A variance of a plurality of frame spacings; wherein the terminal device receives a plurality of video frames from the network device.
- the network device can know the frame spacing of the downlink video frame. For example, if the frame spacing of the video frames is found to be unstable, the network device may subsequently increase the size of the buffer to ensure that the video frames are sent at the same interval as possible.
- the packet loss parameter includes at least one of the following: the number of video frames in which packet loss occurs, the proportion of video frames in which packet loss occurs, and packet loss occurs in consecutive K video frames, where K is greater than or equal to A positive integer of 1.
- the network device can learn the packet loss situation of the downlink video frame. If the network device finds that there are fewer video frames lost, but many packets are lost once the packets are lost, the network device can determine that the wireless signal has a longer deep decay time, and can send video frames in a new frequency band.
- the method further includes: the terminal device determines whether the packet data convergence protocol PDCP sequence numbers SN of the data packets included in the received video frame are continuous; If the PDCP SNs are discontinuous, it is determined that the video frame is lost.
- the terminal device can determine whether the video frame is lost or not according to whether the PDCP sequence number SN is continuous, which is easy to implement.
- the lateness parameter indicates at least one of the following: the number of late video frames, the proportion of late video frames, the time difference between the actual receiving moment of one late video frame and the correct receiving moment, the number of late video frames The average value of the time difference between the actual receiving time and the correct receiving time, the maximum time difference between the actual receiving time and the correct receiving time among multiple late video frames, and the minimum time difference between the actual receiving time and the correct receiving time among multiple late video frames , the variance of the time difference between the actual receiving moment and the correct receiving moment in multiple late video frames.
- the network device can know the lateness of the downlink video frame, which is convenient for optimizing the transmission of the downlink video frame.
- each video frame includes base layer data and enhancement layer data;
- the base layer and enhancement layer parameters include at least one of the following: in the second video frame, the terminal device receives the base layer data and The time difference of the enhancement layer data, the second video frame is a video frame in the at least one video frame; the number of the third video frame, the ratio of the third video frame, or the number of packet loss in the enhancement layer in the third video frame At least one of, the third video frame is a video frame in which the base layer data has no packet loss, and the enhancement layer data is packet loss, and the third video frame is a video frame in the at least one video frame; the fourth video frame At least one of the number of frames, the ratio of the fourth video frame, or the number of lost packets in the base layer in the fourth video frame, where the fourth video frame is a video frame with no packet loss in the enhancement layer data and packet loss in the base layer data , and the fourth video frame is a video frame in the at least one video frame.
- the network device finds that the time difference between the base layer and the enhancement layer of the same video frame is too large, the subsequent transmission interval can be reduced.
- the video frame includes one or more data slices after network encoding
- the video frame parameter also includes at least one of the following: data slices that the terminal device needs to receive when successfully decoding a video frame.
- the number and/or the data volume of the data fragment the number of data fragments and/or the data volume of the data fragment that the terminal device needs to successfully decode the base layer of a video frame and/or the data fragment;
- the terminal The number of data slices and/or the data volume of the data slices that the device needs to receive to successfully decode the enhancement layer of a video frame.
- the network device can adjust the redundancy rate of the base layer or the enhancement layer in the network coding process according to the reporting situation of the terminal device, and optimize the transmission of downlink video frames.
- a communication method is provided.
- the main body of the method is a network device, and it can also be a component (chip, circuit or others) configured in the network device, including: the network device sends at least one video frame to the terminal device; The network device receives video frame parameters from the terminal device, where the video frame parameters are determined according to the at least one video frame.
- the above-mentioned video frame parameters may be reported with the video frame as the granularity.
- the network device can obtain the reception situation of the downlink video frame, which facilitates the optimization of the transmission process of the downlink video frame.
- the video frame parameters include at least one of the following: extended delay parameters, frame spacing parameters, packet loss parameters, late arrival parameters, base layer and enhancement layer parameters.
- the network device can intuitively and accurately know the reception situation of the downlink video frame in multiple dimensions, which facilitates the optimization of the transmission scheme of the downlink video frame.
- the extension delay parameter is used to indicate at least one of the following: extension delay of the first video frame, extension delay of multiple video frames, maximum extension delay of multiple video frames, multiple video frames
- the minimum extension delay is the variance of the extension delay of multiple video frames; wherein, the extension delay is the length of time between the terminal device successfully receiving the first data packet of a video frame and successfully receiving the last data packet of the video frame , the first video frame or the multiple video frames are one or more video frames in at least one video frame received by the terminal device.
- the network device can adjust the scheduling policy according to the size of the extended delay. For example, if the extension delay is too large, more resources are scheduled to the terminal device, or the modulation and coding scheme (MCS) is adjusted to improve user experience; if the extension delay is too small, the scheduling resources can be reduced , or adjust the MCS to save resources, etc.
- MCS modulation and coding scheme
- the frame spacing parameter is used to indicate at least one of the following: the frame spacing of adjacent video frames, the average value of multiple frame spacings, the maximum value of multiple frame spacings, and the minimum value of multiple frame spacings, A variance of a plurality of frame spacings; wherein the terminal device receives a plurality of video frames from the network device.
- the network device can subsequently increase the size of the buffer to ensure that the video frames are sent at the same interval as possible.
- the packet loss parameter includes at least one of the following: the number of video frames with packet loss, the ratio of video frames with packet loss, and K consecutive video frames with packet loss, where K is greater than or equal to A positive integer of 1.
- the network device determines that the wireless signal has a longer deep decay time, and can send video frames in a new frequency band.
- the lateness parameter indicates at least one of the following: the number of late video frames, the proportion of late video frames, the time difference between the actual receiving moment of one late video frame and the correct receiving moment, the number of late video frames The average value of the time difference between the actual receiving time and the correct receiving time, the maximum time difference between the actual receiving time and the correct receiving time among multiple late video frames, and the minimum time difference between the actual receiving time and the correct receiving time among multiple late video frames , the variance of the time difference between the actual receiving moment and the correct receiving moment in multiple late video frames.
- each video frame includes base layer data and enhancement layer data;
- the base layer and enhancement layer parameters include at least one of the following: in the second video frame, the terminal device receives the base layer data and The time difference of the enhancement layer data, the second video frame is a video frame in the at least one video frame; the number of the third video frame, the ratio of the third video frame, or the number of packet loss in the enhancement layer in the third video frame At least one of, the third video frame is a video frame with no packet loss in the base layer data and packet loss in the enhancement layer data, and the third video frame is a video frame in the at least one video frame; the fourth At least one of the number of video frames, the ratio of the fourth video frame, or the number of lost packets of the base layer in the fourth video frame, where the fourth video frame is the enhancement layer data without packet loss, and the base layer data packet loss.
- video frames, and the fourth video frame is a video frame of the at least one video frame.
- the network device finds that the time difference between the base layer and the enhancement layer of the same video frame is too large, the transmission interval between the two can be reduced subsequently.
- the video frame includes one or more data slices after network encoding
- the video frame parameter further includes at least one of the following: the terminal device needs to receive a video frame that needs to be successfully decoded.
- the number of data fragments and/or the data volume of the data fragment the number of data fragments and/or the data volume of the data fragment that the terminal device needs to successfully decode the base layer of a video frame ; the number of data slices and/or the data volume of the data slices that the terminal device needs to receive to successfully decode the enhancement layer of a video frame.
- the network device can adjust the redundancy rate of the base layer or the enhancement layer in the network coding process according to the reporting situation of the terminal device, and optimize the transmission of downlink video frames.
- an apparatus in a third aspect, is provided, and the beneficial effects can be found in the description of the first aspect.
- the apparatus has the function of implementing the behavior in the method embodiment of the first aspect.
- the functions can be implemented by executing corresponding hardware or software.
- the hardware or software may include one or more units corresponding to the above functions.
- the apparatus may include: a communication unit for receiving at least one video frame from a network device; a processing unit for determining a video frame parameter according to the at least one video frame; and a communication unit for further using for sending the video frame parameters to the network device. These units may perform the corresponding functions in the method examples of the first aspect. For details, refer to the detailed descriptions in the method examples, which will not be repeated here.
- an apparatus in a fourth aspect, is provided, and the beneficial effects can be found in the description of the first aspect.
- the apparatus may be the terminal device in the method of the first aspect, or a chip provided in the terminal device.
- the apparatus includes a communication interface, a processor, and optionally, a memory.
- the memory is used to store computer programs or instructions, and the processor is coupled to the memory and the communication interface, and when the processor executes the computer program or instructions, the apparatus executes the method performed by the terminal device in the method of the first aspect.
- an apparatus in a fifth aspect, is provided, and the beneficial effects can be found in the description of the second aspect.
- the apparatus has the function of implementing the behavior in the method embodiment of the second aspect.
- the functions can be implemented by executing corresponding hardware or software.
- the hardware or software may include one or more units corresponding to the above functions.
- the apparatus may include: a communication unit, configured to send at least one video frame to a terminal device; the communication unit, further configured to receive a video frame parameter from the terminal device, where the video frame parameter is determined from the at least one video frame.
- the processing unit may be configured to optimize the transmission of downlink video frames according to the video frame parameters.
- an apparatus in a sixth aspect, is provided, and the beneficial effects can be found in the description of the second method.
- the apparatus may be the network device in the second aspect above, or a chip provided in the network device.
- the apparatus includes a communication interface, a processor, and optionally, a memory.
- the memory is used to store computer programs or instructions, and the processor is coupled to the memory and the communication interface, and when the processor executes the computer program or instructions, the apparatus is made to execute the method executed by the network device in the method embodiment of the second aspect. method.
- a computer program product includes: computer program code, when the computer program code is executed, the method performed by the terminal device in the first aspect above is executed.
- a computer program product comprising: computer program code, when the computer program code is executed, the method performed by the network device in the above second aspect is executed.
- a chip system in a ninth aspect, includes a processor for implementing the function of the terminal device in the method of the first aspect above.
- the chip system further includes a memory for storing program instructions and/or data.
- the chip system may be composed of chips, or may include chips and other discrete devices.
- a chip system includes a processor for implementing the function of the network device in the method of the second aspect.
- the chip system further includes a memory for storing program instructions and/or data.
- the chip system may be composed of chips, or may include chips and other discrete devices.
- a computer-readable storage medium where a computer program is stored in the computer-readable storage medium, and when the computer program is executed, the method executed by the terminal device in the above-mentioned first aspect is implemented.
- a twelfth aspect provides a computer-readable storage medium, where a computer program is stored in the computer-readable storage medium, and when the computer program is executed, the method executed by the network device in the second aspect above is implemented.
- a thirteenth method provides a communication system, including the apparatus of the third aspect or the fourth aspect, and the apparatus of the fifth aspect or the sixth aspect.
- FIG. 1 is a schematic diagram of a network architecture provided by an embodiment of the present application.
- FIG. 2 is another schematic diagram of a network architecture provided by an embodiment of the present application.
- FIG. 3 is a schematic diagram of adjacent video frames provided by an embodiment of the present application.
- FIG. 4 is a schematic diagram of network coding provided by an embodiment of the present application.
- FIG. 5 is a flowchart of a communication method provided by an embodiment of the present application.
- FIG. 6 is another schematic diagram of adjacent video frames provided by an embodiment of the present application.
- FIG. 7 is a schematic diagram of a late video frame provided by an embodiment of the present application.
- FIG. 8 is a schematic diagram of a base layer and an enhancement layer provided by an embodiment of the present application.
- FIG. 9 is another schematic diagram of network coding provided by an embodiment of the present application.
- FIG. 10 is a schematic structural diagram of an apparatus provided by an embodiment of the present application.
- FIG. 11 is another schematic structural diagram of an apparatus provided by an embodiment of the present application.
- FIG. 1 shows a schematic diagram of a network architecture to which an embodiment of the present application is applied, including at least one of the following: a terminal device, an access network device, a core network (core network, CN) device, and a data network (DN) ).
- the access network device and the core network device can communicate through a next generation (NG) interface, and different access network devices can communicate through the Xn interface.
- NG next generation
- a terminal device which may be referred to as a terminal for short, is a device with a wireless transceiver function.
- Terminal equipment can be mobile or fixed.
- Terminal equipment can be deployed on land, including indoor or outdoor, handheld or vehicle; can also be deployed on water (such as ships, etc.); can also be deployed in the air (such as aircraft, balloons and satellites, etc.).
- the terminal device can be a mobile phone (mobile phone), a tablet computer (pad), a computer with wireless transceiver function, a virtual reality (virtual reality, VR) terminal device, an augmented reality (augmented reality, AR) terminal device, industrial control ( Wireless terminal equipment in industrial control, wireless terminal equipment in self driving, wireless terminal equipment in remote medical, wireless terminal equipment in smart grid, transportation security (transportation) Wireless terminal equipment in safety), wireless terminal equipment in a smart city, and/or wireless terminal equipment in a smart home.
- a virtual reality (virtual reality, VR) terminal device an augmented reality (augmented reality, AR) terminal device
- industrial control Wireless terminal equipment in industrial control, wireless terminal equipment in self driving, wireless terminal equipment in remote medical, wireless terminal equipment in smart grid, transportation security (transportation) Wireless terminal equipment in safety
- wireless terminal equipment in a smart city and/or wireless terminal equipment in a smart home.
- the terminal device may also be a cellular phone, a cordless phone, a session initiation protocol (SIP) phone, a wireless local loop (WLL) station, a personal digital assistant (PDA), a wireless communication Functional handheld or computing device, in-vehicle device, wearable device, terminal device in the 5th generation (5G) network or terminal device in the evolved public land mobile network (PLMN) Wait.
- the terminal device may also be called user equipment (UE), and the terminal device can communicate with multiple access network devices of different technologies.
- the terminal device can communicate with access network devices that support long term evolution (LTE). It can also communicate with 5G-capable access network devices, and it can also be dual-connected with LTE-capable access network devices and 5G-capable access network devices.
- LTE long term evolution
- 5G-capable access network devices and it can also be dual-connected with LTE-capable access network devices and 5G-capable access network devices.
- An access network device can also be called a radio access network (RAN) device, which is a device that connects a terminal device to a wireless network and can provide wireless resource management, service quality management, and data encryption for the terminal device. and compression.
- RAN radio access network
- Access network equipment includes but is not limited to:
- next generation base station generation nodeB, gNB
- evolved node B evolved node B
- eNB radio network controller
- RNC radio network controller
- node B node B
- base station controller base station controller
- BSC base transceiver station
- home base station for example, home evolved node B, or home node B, HNB
- base band unit base band unit, BBU
- transceiver point transmitmitting and receiving point, TRP), transmitting point (TP), and/or mobile switching center, etc.
- the access network device may also be a wireless controller, a centralized unit (centralized unit, CU), and/or a distributed unit (distributed unit, DU) in a cloud radio access network (cloud radio access network, CRAN) scenario.
- the access network device may be a relay station, an access point, a vehicle-mounted device, an access network device in a 5G network, or an access network device in an evolved public land mobile network (PLMN), and the like.
- PLMN evolved public land mobile network
- the access network device may include a centralized unit (central unit, CU) and a distributed unit (distributed unit, DU), that is, the functions of the access network device can be split , some functional parts of the access network equipment are placed in the CU, and the remaining part of the functions are deployed in the DU, and multiple DUs share one CU, which saves costs and facilitates network expansion.
- the functions of the CU and DU may be divided according to the protocol stack. For example, the Radio Resource Control (RRC) layer, the Service Data Adaptation Protocol (SDAP) layer and the Packet Data Convergence Protocol (PDCP) layer are deployed in the CU.
- RRC Radio Resource Control
- SDAP Service Data Adaptation Protocol
- PDCP Packet Data Convergence Protocol
- the rest of the radio link control (radio link control, RLC) layer, medium access control (medium access control, MAC) layer and physical layer (physical, PHY) layer is deployed in the DU.
- the CU and DU can be connected through the FI interface.
- the CU can be connected to the core network on behalf of the access network device through the NG interface, and the CU can also be connected to other access network devices on behalf of the access network device through the Xn interface. Further, the functions of CU can also be divided into:
- CU-CP Central unit-control plane: mainly includes the RRC layer in the CU and the control plane in the PDCP layer;
- CU-UP Central unit-user plane: mainly includes the SDAP layer in the CU and the user plane in the PDCP layer.
- Core network equipment is mainly used to manage terminal equipment and provide gateways for communication with external networks.
- the core network equipment may include one or more of the following network elements: access and mobility management function (AMF) network element, session management function (session management function, SMF) network element, user plane function ( user plane function, UPF) network element, policy control function (policy control function, PCF) network element, application function (application function, AF) network element, unified data management (unified data management, UDM) network element, authentication server function ( authentication server function, AUSF) network element, network slice selection function (network slice selection function, NSSF) network element.
- AMF access and mobility management function
- SMF session management function
- UPF user plane function
- policy control function policy control function
- PCF policy control function
- application function application function, AF
- UDM authentication server function
- AUSF authentication server function
- network slice selection function network slice selection function
- the SMF network element is mainly responsible for session management in the mobile network, such as session establishment, modification, and release; specific functions such as assigning IP addresses to users and selecting UPF network elements that provide message forwarding functions.
- the UPF network element is mainly responsible for the forwarding and reception of user data; in the downlink transmission, the UPF network element can receive user data from the data network (DN) and transmit it to the terminal equipment through the access network equipment; in the uplink transmission, the UPF network element The network element may receive user data from the terminal device through the access network device, and forward the user data to the DN.
- the transmission resources and scheduling functions in the UPF network element that provide services for the terminal equipment may be managed and controlled by the SMF network element.
- the PCF network element mainly supports and provides a unified policy framework to control network behavior, provides policy rules to control plane network functions, and is responsible for acquiring user subscription information related to policy decisions.
- AF network elements mainly support interaction with the core network of wireless networks, such as 3rd generation partnership project (3GPP) networks, to provide services, such as influencing data routing decisions, policy control functions, or providing third-party services to the network side some services.
- 3GPP 3rd generation partnership project
- UDM network elements are mainly used for generating authentication credential, user identification processing (such as storing and managing user permanent identity, etc.), access authorization control and contract data management.
- the AUSF network element is mainly used to perform authentication when the terminal device accesses the network, including receiving the authentication request sent by the security anchor function (SEAF), selecting the authentication method, and sending the authentication storage and processing function (authentication repository and processing function, ARPF) request authentication vector, etc.
- the NSSF network element is mainly used to select network slice instances for terminal equipment, determine the allowed network slice selection assistance information (NSSAI), configure NSSAI, and determine the AMF set serving terminal equipment.
- NSSAI network slice selection assistance information
- network elements or network element names in the core network may be different.
- the fifth-generation mobile communication system is used as an example for description, which is not intended to limit the present application.
- the DN may be a service network that provides data service services for users.
- the DN may be an IP multimedia service (IP multi-media service) network, the Internet, or the like.
- the terminal device may establish a protocol data unit (protocol data unit, PDU) session from the terminal device to the DN to access the DN and the like.
- protocol data unit protocol data unit
- the extended reality (XR) service provides users with an immersive multimedia experience by interacting with an XR application server through an application layer user device (eg, a helmet).
- an application layer user device eg, a helmet
- the XR application server may be located on the DN side in the framework shown in FIG. 1
- the helmet may be located on the terminal device side in the framework shown in FIG. 1 .
- the helmet and the terminal device may be one body or separate in physical form, which is not limited.
- the sensors in the helmet can sense the user's position, movement changes, etc., and generate user information including viewing angle, line of sight, and movement rate.
- the helmet can transmit user information to the XR server through the terminal equipment, access network equipment, UPF network element, etc. in the architecture shown in FIG. 1 .
- the XR server can generate video frames by combining the user information reported by the helmet and the user's scene in the game or real scene.
- the XR server can transmit video frames to the helmet through the UPF network element, access network equipment, terminal equipment, etc. in the architecture shown in FIG. 1, and finally display the above video frames to the user through the helmet.
- the terminal device receives the downlink video frame, how to feed back the reception status of the video frame to the network device, so as to optimize the transmission of the downlink video frame, is a technical problem to be solved in the embodiment of the present application.
- the present application provides a communication method and apparatus.
- the method includes: a terminal device receives at least one video frame from an access network device; the terminal device determines a video frame parameter according to the at least one video frame; the terminal device sends to the access network device
- the video frame parameters enable the access network device to know the reception status of the downlink video frame, which is convenient for optimizing the transmission of the downlink video frame.
- XR services mainly include:
- Uplink business user information generated by the helmet, including user location and line of sight, etc., with a small amount of data;
- Downlink business The XR server generates video frames according to the user information reported by the helmet, with a large amount of data.
- the XR server generates a video frame at a rate of 60 frames per second, that is, an original video frame is generated every 16.67ms.
- the original video frames are encoded and compressed to form different types of video frames.
- typical video frames may include I-frames, P-frames, B-frames, and the like.
- the size of each video frame may vary from 1000 kilobits (kbit) to 10000 kilobits (kbit) in size.
- the basic processing is to divide the video frame into N pictures per second, and encode each picture as a video frame. Since each video frame contains many pixels, direct transmission will consume a lot of bandwidth. Therefore, the video service can be compressed and then transmitted.
- the video frames can be grouped, the first frame of each group is the reference frame, and the subsequent frames are the dependent frames.
- intra-frame compression is performed on the reference frame, that is, only the code stream within the frame is referenced during compression, and other frames are not referenced.
- the decompressor receives a reference frame and can decompress independently without other frames; for the dependent frames following the reference frame, inter-frame compression is performed with reference to the reference frame, that is, the compression refers to its own frame.
- the code stream also refers to other frames, such as the reference frame.
- the reference frame may also be called an I frame, and its size is the largest.
- Dependent frames include P frames and B frames, which are smaller in size. P frames refer to only relying on the previous frame during decoding, and B frames refer to not only relying on the previous frame but also on the subsequent frame during decoding.
- the encoded data stream is divided into multiple data packets according to the standard of 1500 bytes (considering the packet header overhead, the actual divided data pure payload may be slightly less than 1500 bytes).
- the downlink video data is represented by the UPF receiving a cluster of downlink data packets from the XR server every 16.67ms, and a cluster of downlink data packets may include multiple data packets with a size of about 1500 bytes.
- the cluster of downlink data packets is a downlink video frame.
- the video frames are compressed before transmission.
- the current compression method can usually achieve a compression efficiency of 300:1, that is, a 300 megabit (Mbit) file can be compressed to about 1 megabit (Mbit).
- Mbit 300 megabit
- the chrominance division becomes finer
- the size of the video frame becomes larger and larger.
- the amount of compressed data will put pressure on the transmission network. still too big.
- the capacity of the wireless network fluctuates greatly, when the channel quality is poor, the amount of data transmitted by the channel also decreases, and high-definition video frames cannot be transmitted, and users experience mosaics and even image frames.
- video layered coding is introduced, that is, the information of video frames is divided into two categories in the process of compression coding, namely base layer data and enhancement layer data.
- the ratio of the data amounts of the above two types of data may be 1:9.
- the channel capacity is small, if the user only receives the basic layer data, the image with lower resolution can be obtained through the display, but there is no mosaic.
- the channel capacity is large, the user can receive the data of the base layer and the data of the enhancement layer at the same time, and can obtain a higher-resolution image.
- the sum of the data volume of the base layer data and the enhancement layer data is larger than that of the non-layered video.
- the layered coding method can better cope with the rapid changes of the wireless channel, reduce the situation of freezing and mosaic, and improve the user experience.
- the aforementioned enhancement layer may include one layer or be divided into multiple layers, which is not limited. The more layers there are, the more suitable it is for wireless channels with different capacities.
- the network coding mechanism is introduced, and its core idea is to encode a group of data packets uniformly.
- 10 data packets included in a video frame can be divided into a group, and unified network coding can be performed.
- several small data blocks are output. Data blocks are transferred in batches.
- the receiver receives all the small data blocks, it decodes the entire data packet. If it is found that a certain data packet cannot be decoded correctly, the sender can be notified to send some more small data blocks.
- the small data blocks sent by the sender are not the data blocks that have been transmitted before, but the small data blocks newly generated by the network coding module. The receiver can decode all the received small data blocks uniformly.
- a PDCP PDU is generated, transmitted to the RLC layer, and an RLC PDU is generated after being processed by the RLC.
- the RLC PDU can be network-coded at the RLC layer to obtain several small data blocks, which are transmitted to the MAC layer, and processed by the MAC layer to obtain a MAC PDU.
- Each MAC PDU may include a logical channel identity document (LCID), Fields such as length (length, Len) and data (data).
- the small data blocks formed after network coding may also be referred to as data slices.
- data sharding is used as an example for description.
- At least one item (a) of a, b, or c may represent: a, b, c, ab, ac, bc, or abc, where a, b, and c may be single or multiple .
- words such as “first” and “second” are used to distinguish the same or similar items with basically the same function and effect. Those skilled in the art can understand that the words “first”, “second” and the like do not limit the quantity and execution order, and the words “first”, “second” and the like are not necessarily different.
- the network architecture and service scenarios described in the embodiments of the present application are for the purpose of illustrating the technical solutions of the embodiments of the present application more clearly, and do not constitute limitations on the technical solutions provided by the embodiments of the present application.
- the technical solutions provided in the embodiments of the present application are also applicable to similar technical problems.
- the network elements involved in the embodiments of this application include access network equipment, terminal equipment, helmets, XR servers, and the like.
- the access network device may be of a CU/DU architecture, and in this case, the access network device includes two network elements, CU and DU.
- the access network device may also be of a CP-UP architecture, in which case the access network device includes three network elements: CU-CP, CU-UP, and DU.
- the access network equipment may also be of an open radio access network (ORAN) architecture.
- the access network equipment includes CU-CP, CU-UP, DU, and a near real-time access network intelligent controller (RAN intelligent controller, RIC) four network elements or even more network elements.
- RAN intelligent controller a near real-time access network intelligent controller
- the DU can also be separated into DU-H and DU-L, etc., to support the separation of lower layers such as the remote physical layer.
- the network device (or base station) in the following embodiments may be one network element in the access network device, or may include multiple network elements in the access network device.
- a flow of a communication method including:
- Step 501 The UE receives at least one video frame from the base station.
- each video frame may include one or more data packets, and the size of each data packet is about 1500 bytes.
- the data packet may also be referred to as an internet protocol (Internet protocol, IP) packet, which is described below by taking the data packet as an example.
- IP internet protocol
- the data packets included in each video frame may be video data, or video data and audio data, etc., which are not limited.
- Step 502 The UE determines video frame parameters according to the at least one video frame.
- the UE may determine the video frame parameters with the video frame as the granularity.
- Step 503 The UE sends the video frame parameters to the base station.
- the UE may report the video frame parameters in the following manner:
- the UE reports the video frame parameters once per cycle, and the reporting cycle can be configured by the base station, the SMF network element, or the XR server.
- the UE receives a configuration information element from the base station, the configuration information element indicates a reporting period, and the UE reports the video frame parameters according to the reporting period indicated by the configuration information element.
- the SMF network element or the XR server sends the information of the reporting period to the base station, and the base station generates a configuration cell indicating the reporting period according to the information and sends it to the UE, or the base station directly reports the information of the reporting period. It is sent to the UE as a configuration information element, and the UE reports the video frame parameters according to the reporting period indicated by the configuration information element.
- the UE finds that the reporting conditions are met, it reports the video frame parameters once, and the reporting conditions can be configured by the base station, the SMF network element, or the XR service.
- the UE counts video frame parameters. And judge whether the pre-configured conditions are satisfied, if the conditions are satisfied, the video frame parameters are reported, otherwise, the video frame parameters are not reported.
- the reporting period and reporting conditions can be configured by the base station, the SMF network element, or the XR server.
- the configuration method of the reporting condition is the same as that of the reporting period, and details are not repeated here.
- the object to which the UE reports the video frame parameters may be a base station, a core network element, such as an SMF, or a UPF, or even an XR server.
- the receiving object of the video frame parameters is the core network element, or the XR server, etc.
- the base station forwards the received video frame parameters to the core network element or the XR server.
- the above description is given by taking the receiving object of the video frame parameter as an example. In fact, there may also be multiple receiving objects of the video frame parameter.
- the receiving objects of the above video frame parameters may include base stations, core network elements, XR servers, and the like.
- each receiving object can perform corresponding optimization respectively.
- the base station can optimize the scheduling of radio resources.
- the SMF network element can optimize the user's service configuration.
- the UPF network element can prioritize the forwarding priority of the data packet.
- the XR server may optimize the encoding and/or compression method of the downlink video frame.
- the UE obtains the video frame parameters according to the received video frames, and reports the reception status of the video frames, so that the network device can intuitively and accurately obtain the reception status of the video frames, which is convenient for Optimize the transmission scheme of video frames.
- the UE uses data packets as granularity to report the reception status of each data packet, such as packet loss and/or data packet delay.
- This reporting method is not accurate enough for the statistics of video services for the following reasons. For example, UE loses two packets, there are two cases:
- Case 1 Two lost packets, located in different video frames
- the above two situations correspond to the same network performance, and the number of lost data packets is the same, but from the perspective of user experience, the user experience corresponding to the above two situations is different.
- the user can continuously see two pictures with relatively poor picture quality.
- the user sees one picture with poor quality, and the next picture is clear.
- the UE reports the reception status of the data packet with the data packet as the granularity, which is difficult to reflect the above two differences.
- the UE reports the reception status of the video frame by using the video frame as the granularity, which can more accurately reflect the reception status of the video frame and facilitates optimization of the entire video transmission scheme.
- the video frame parameters reported by the UE may include at least one of the following:
- the spread delay parameter may indicate at least one of the following: the spread delay of the first video frame, the average spread delay of multiple video frames, the maximum spread of multiple video frames Delay, minimum spread delay of multiple video frames, spread delay variance of multiple video frames.
- the first video frame or multiple video frames are one or more video frames in at least one video frame received by the UE.
- a picture can form one or more video frames, and each video frame can be divided into multiple data packets.
- one video frame is received, that is, multiple data packets are received.
- the extended delay may refer to the length of time between when the UE successfully receives the first data packet of a video frame and successfully receives the last data packet of a video frame.
- the video frame N is divided into 3 data packets, and the extension delay of the video frame N may be specifically: the UE successfully receives the first data packet of the video frame N, and the time until the UE successfully receives the video frame N The length of time between the 3rd packet.
- the UE may count the extension delay of each video frame; according to the extension delay of multiple video frames, obtain the average, maximum, minimum, or variance of the extension delays of multiple video frames, etc. , and report it.
- the UE may also report the extension delay of a single video frame, that is, the extension delay of the first video frame, which is not limited.
- the manner in which the UE reports the extended delay parameter may include any of the following:
- the reporting period is configured by the base station or SMF network element or XR service, and the reporting period is once every period.
- the reporting conditions are configured by the base station or SMF network element or XR server. If the UE finds that the reporting conditions are met, it will report it once.
- the UE counts the extended delay of each video frame in the period, and judges whether the pre-configured reporting conditions are met; if the reporting conditions are met, the UE reports; otherwise, it does not report.
- the period and reporting conditions can be configured by the base station or SMF network element or XR server, etc.
- the reporting condition may correspond to the reported content.
- the reporting condition may include that when the extended delay of a single video frame is greater than or equal to the threshold A1, the extended delay parameter of the single video frame is reported; or, the reporting condition may include a single video frame When the extension delay is less than or equal to the threshold B1, the extension delay parameter of the single video frame is reported.
- the reporting condition may include the average extension delay of multiple video frames (that is, the average extension delay of multiple video frames) when the delay is greater than or equal to the threshold A2, the average extension delay of multiple video frames is reported; or, The reporting condition may include that when the average extension delay of multiple video frames is less than or equal to the threshold B2, the average extension delay of multiple video frames is reported.
- the reporting of the maximum, minimum, or variance of the extension delay of multiple video frames is similar.
- the reporting condition may not correspond to the reported content.
- the reporting condition may include that when the extension delay of a single video frame is greater than or equal to the threshold A3, the average extension of multiple video frames including the single video frame is reported.
- the reporting condition may include that when the extension delay of a single video frame is greater than or equal to the threshold B3, the average extension delay of multiple video frames including the single video frame is reported; wherein, the single video frame may be The first video frame, the middle video frame or the last video frame of multiple video frames, and the number of multiple video frames can be preset.
- the reporting of the maximum, minimum, or variance of the extension delay of multiple video frames is similar.
- the thresholds A1-A3 the values of B1-B3 are not limited, and they may be the same or different.
- the network device After the network device learns the extended delay parameters, it can adjust the scheduling policy according to the size of the extended delay. For example, if the extended delay is too large, it will schedule more resources to the UE, or adjust the modulation and coding scheme (MCS). ) to improve user experience; if the extended delay is too small, scheduling resources can be reduced, or MCS can be adjusted to save resources.
- MCS modulation and coding scheme
- the frame spacing parameter which is used to indicate at least one of the following: the frame spacing of adjacent video frames, the average value of multiple frame spacings, the maximum value of multiple frame spacings, and the minimum value of multiple frame spacings, The variance of multiple frame spacings.
- the UE receives multiple video frames from the base station.
- the frame gap can also be referred to as the video frame gap.
- the time difference between adjacent video frames is the same, and the specific value depends on the frame rate. For example, at 60 frames per second, the time difference between two adjacent video frames is 16.67ms. However, since the time it takes to transmit each video frame is not exactly the same over a network, especially a wireless network. Therefore, when the UE receives a video frame, it does not receive a video frame every 16.67ms.
- the UE may determine the frame spacing of adjacent video frames according to the reference positions of two adjacent video frames, and the reference position may be a start position, a middle position, an end position or any other position, etc., which is not limited.
- the UE may count the successful reception times of the first data packets corresponding to two adjacent video frames respectively, and calculate the time difference between the two reception times as the frame spacing of adjacent video frames.
- video frame N and video frame N+1 are adjacent video frames, each including 3 data packets.
- the UE can separately count the moment when the first data packet in the video frame N is successfully received, and the moment when the first data packet in the video frame N+1 is successfully received; the time difference between the two moments is the video frame N and the video frame.
- N+1 frame spacing alternatively, the UE may also count the successful reception times of the last data packets corresponding to two adjacent video frames respectively, and calculate the time difference between the two reception times as the frame spacing of adjacent video frames.
- Packet loss parameters include at least one of the following: the number of video frames with packet loss, the proportion of video frames with packet loss, and K consecutive video frames with packet loss, K is a positive value greater than or equal to 1. Integer.
- the UE finds packet loss after receiving multiple data packets of a video frame, it is regarded as a "video frame with packet loss". Specifically, the UE can judge whether a packet loss occurs through the PDCP sequence number (sequence number, SN). If the PDCP SN corresponding to the received data packet is not continuous, it is regarded as a packet loss. For example, a video frame includes M data packets, and the PDCP SNs of the M data packets are 1 to M in sequence. When the UE receives the Mth data packet, the UE finds that the previous data packet whose PDCP SN is N has not been received.
- the UE can determine whether the video frame is lost when it receives the end marker of the video frame; it can also send the data packet corresponding to the video frame to the IP layer or the real-time transport protocol (real-time transport protocol). , RTP) layer submission time, determine whether the video frame has lost packets; also can determine whether the video frame has packet loss at the moment when the data packet corresponding to the video frame is played at the application layer, etc., without limitation.
- the UE can also count that several video frames are continuously lost. For example, within a period of time, the UE receives 10 video frames from the base station, and the sequence numbers are 1 to 10 in sequence. Among them, the video frame No. 1 has packet loss, the video frame No. 2 to No. 5 has no packet loss, the video frame No. 6 to No. 8 has packet loss, and the video frame No. 9 and No. 10 have no packet loss, the UE can report No. 6 to On the 8th, three consecutive video frames are lost.
- Late parameter the late parameter is used to indicate at least one of the following: the number of late video frames, the proportion of late video frames, the time difference between the actual receiving time of a late video frame and the correct receiving time, multiple late video frames The average value of the time difference between the actual receiving time and the correct receiving time, the maximum time difference between the actual receiving time and the correct receiving time among multiple late video frames, the minimum time difference between the actual receiving time and the correct receiving time among the multiple late video frames value, the variance of the time difference between the actual receiving time and the correct receiving time in multiple late video frames, etc.
- the encoder in the XR application server will select a video frame in front of it as a reference frame for compression during the process of compressing the video frame.
- a video frame in front of it As shown in Figure 7, if some data packets in a certain video frame are not successfully transmitted within a predefined delay, the transmission will be successful only after the delay budget. Although this video frame cannot be displayed by the display, it can still be useful as a reference frame for subsequent video frames.
- the UE can count the number of such video frames, the proportion in all video frames, how long the correct reception time is later than the delay budget and other information, and report it to the base station.
- the parameters specifically reported by the UE may be specifically: in a period of time, the number of late video frames, the proportion of late video frames, the difference between the actual receiving moment of a late video frame and the correct receiving moment
- the "actual receiving moment and correct receiving moment of the video frame" may be defined in the following manner: the actual receiving moment of the video frame may refer to the moment when the UE receives all data packets corresponding to a video frame. It is assumed that the Nth video frame includes 10 data packets, and the actual reception moment of the Nth video frame may be the moment when the UE receives all 10 data packets of the Nth video frame. First, a correctly received video frame is defined.
- the video frame can be regarded as a correctly received video frame. If the video frame i is correctly received, then the correct reception time of the video frame i+M is: the actual reception time of the last data packet of the video frame i+M* the time difference between adjacent video frames .
- the time difference between adjacent video frames may be 16.67 ms, and M is a positive integer greater than or equal to 1.
- a late video frame is defined as "received after the delay budget, but can still be used as a reference frame for other video frames following it” as an example for description, which is not a limitation of the embodiments of the present application.
- a late video frame may also be defined as "a video frame received after the first delay budget", or as “after the first delay budget and after the second delay budget” video frames received before the budget”, or defined as "video frames that are late enough to meet the preset time requirement", etc.
- video layered coding is adopted, for example, it is divided into base layer and enhancement layer.
- the UE can receive data of two streams or two data radio bearers ((data radio bearer, DRB) or two data channels, which are the base layer data and the enhancement layer data respectively.
- the UE can count the data of these two layers respectively.
- the UE receives the time difference between the base layer data and the enhancement layer data.
- the UE may count the time difference between reference positions in the base layer data and the enhancement layer data in the same video frame.
- the reference position may be a frame start position, a frame end position, a middle position, or any position, etc., which is not limited.
- the base layer includes 3 data packets
- the enhancement layer includes 4 data packets
- the frame start time difference between the base layer and the enhancement layer of video frame N can be counted, or the video frame N can be counted.
- the UE can report the time difference between the base layer data and the enhancement layer data in a single video frame, and the UE can also report the average, maximum, and minimum time differences between the base layer data and the enhancement layer data in multiple video frames. or variance etc.
- the video frame is divided into two layers as an example for description.
- the video frame may also be divided into three layers, four layers or more layers.
- the entire video frame is divided into a base layer, a first enhancement layer and a second enhancement layer.
- the entire video frame is divided into a base layer, a first enhancement layer, a second enhancement layer and a third enhancement layer.
- the UE can report the time difference between the first enhancement layer data and the base layer data, the time difference between the second enhancement layer and the base layer data, etc., and can even report the time difference between different enhancement layers. etc., without limitation.
- the number, proportion, or number of lost video frames in the enhancement layer or multiple of the number, proportion, and number of lost packets in the enhancement layer.
- the UE may count one or more pieces of information such as the number, proportion, and number of lost video frames in the enhancement layer without packet loss at the base layer, and report it to the base station.
- the base station can adjust the transmission strategy of the enhancement layer according to the information.
- the video frame is divided into two layers, the base layer and the enhancement layer.
- the video frame may be divided into three layers, four layers or more.
- the UE can count the number, proportion or number of lost video frames in the base layer, but the first enhancement layer has no packet loss; the base layer has no packet loss, but the second enhancement layer The number, proportion or number of lost video frames, etc.; even the UE can report that the base layer and the first enhancement layer have no packet loss, but the number, proportion or number of lost packets of the second enhancement layer, etc., or, The UE may report that there is no packet loss in the base layer and the second enhancement layer, but the number of video frames, the ratio or the number of packets lost in the first enhancement layer, or the UE may report that there is no packet loss in the base layer, but the first enhancement layer and The number of video frames, the ratio or the number of lost packets in the second enhancement layer at the same time.
- the above description is given by taking the example that the video frame is divided into two layers, the base layer and the enhancement layer.
- the video frame may be divided into three layers, four layers or even more layers.
- the UE can count that the first enhancement layer has no packet loss, but the base layer has the number, proportion or number of lost packets of the video frame.
- the UE may count the number, proportion, or number of lost video frames in the second enhancement layer, but there is no packet loss in the base layer.
- the UE may count the number of video frames, the ratio or the number of lost packets in the first enhancement layer and the second enhancement layer without packet loss at the same time, but the base layer has lost packets.
- the manner in which the UE specifically reports the above-mentioned frame spacing parameter, packet loss parameter, lateness parameter, or base layer and enhancement layer parameters is similar to the aforementioned manner in which the UE reports the extended delay parameter, and will not be described one by one.
- the UE reports various video frame parameters with the video frame as the granularity, so that the network can more intuitively understand the transmission status of the video frame and optimize the network algorithm.
- network coding may be adopted.
- the base station may perform network coding on the data of the video service, and then transmit it to the UE.
- the base station can generally perform network coding on multiple data packets included in one video frame in a unified manner.
- the video frame parameters reported by the UE may include at least one of the following:
- the difference is only that the video frame in the above-mentioned first embodiment has not undergone network encoding, and the video frame in the embodiment of the present application has undergone network encoding.
- the base station after receiving the extended delay parameter, if the base station finds that the extended delay is too large and exceeds the decoding time of a general video decoder, the base station can subsequently select an appropriate opportunity to send the first one of the subsequent video frames. data packet, ensuring that the last data packet of the video frame is sent within the appropriate extended delay.
- the difference is only that the video frame in the above Embodiment 1 has not undergone network coding, and the video frame in this embodiment of the present application has undergone network coding.
- the base station may subsequently increase the size of the buffer to ensure that the video frames are sent at the same interval as possible.
- the difference is only that the video frame in the first embodiment has not undergone network encoding, and the video frame in the embodiment of the present application has undergone network encoding.
- the base station determines that the wireless signal has a longer deep decay time and can send video frames in a new frequency band.
- the difference is only that the video frame in the first embodiment is not network encoded, and the video frame in the embodiment of the present application is network encoded.
- the base station may reduce the transmission interval between the two subsequently.
- Video frames are network-coded and include one or more data fragments, and the data fragmentation parameters include parameters for at least one of the following:
- the original data after the video frame is network encoded, the original data will be redundant, and the amount of data will increase.
- the data volume is 1000 kilobits (Kbit)
- the data volume is 1500 kilobits (Kbit).
- the UE can perform network decoding and successfully recover all the data of the video frame, then the UE can report the data that needs to be received to indicate that the UE successfully decodes a video frame
- the amount of data to slice e.g. "requires 1200 kilobits for successful network decoding", the parameter.
- the UE may report the number of data slices that the UE needs to receive to successfully decode a video frame, for example, a parameter of "requires N data slices to successfully perform network decoding". After the base station obtains the reported information, it can reduce the redundancy rate of network coding for subsequent video frames.
- the UE can also count and report "the amount of data required to receive and successfully decode a video frame by itself". It can be understood that what the UE reports may be the number of data fragments or the amount of data required to successfully decode a single video frame, or it may be the average of the number of data fragments or the amount of data required by the UE to successfully decode multiple video frames. , maximum, minimum, or variance, etc.
- the base station may perform network coding on the base layer and the enhancement layer respectively.
- the UE can count the number of data fragments or the amount of data required to successfully decode the base layer data, and report it.
- the UE can count the number of data slices or the amount of data required to successfully decode the enhancement layer data.
- the above-mentioned number or amount of data slices may be the number or amount of data slices required for the base layer or enhancement layer of a single video frame, or may be the data slices required for the base layer or enhancement layer in multiple video frames.
- the number of slices or the average value, maximum value, minimum value or variance of the data amount, etc., are not limited.
- the base station may adjust the redundancy rate of the base layer or the enhancement layer in the network coding process according to the reporting situation of the UE.
- FIG. 10 is a schematic block diagram of an apparatus 1000 provided by an embodiment of the present application, which is used to implement the functions of a terminal device or a network device in the foregoing method.
- the apparatus 1000 may be a software unit or a system-on-a-chip.
- the chip system may be composed of chips, and may also include chips and other discrete devices.
- the apparatus includes a communication unit 1001 and may further include a processing unit 1002 .
- the communication unit 1001 can communicate with the outside.
- the processing unit 1002 is used for processing.
- the communication unit 1001 may also be referred to as a communication interface, a transceiver unit, an input/output interface, and the like.
- the apparatus 1000 may implement the steps performed by the terminal device in the foregoing embodiment.
- the apparatus 1000 may be a terminal device, or a chip or circuit configured in the terminal device.
- the communication unit 1001 performs the transceiving operations in the above embodiments, and the processing unit 1002 is configured to perform the processing-related operations of the terminal device in the above method embodiments.
- the communication unit 1001 is configured to receive at least one video frame from a network device; the processing unit 1002 is configured to determine a video frame parameter according to the at least one video frame; the communication unit 1001 is further configured to send all video frames to the network device. describe the video frame parameters.
- the video frame parameters include at least one of the following: extended delay parameters, frame spacing parameters, packet loss parameters, late arrival parameters, base layer and enhancement layer parameters.
- the extension delay parameter is used to indicate at least one of the following: extension delay of the first video frame, average extension delay of multiple video frames, maximum extension delay of multiple video frames, multiple video frames The minimum extension delay of a frame, the variance of the extension delays of multiple video frames; wherein, the extension delay is the time between the terminal device successfully receiving the first data packet of a video frame and successfully receiving the last data packet of the video frame. Time length, the first video frame or the multiple video frames are one or more video frames in at least one video frame received by the terminal device.
- the frame spacing parameter is used to indicate at least one of the following: the frame spacing of adjacent video frames, the average value of multiple frame spacings, the maximum value of multiple frame spacings, and the minimum value of multiple frame spacings, A variance of a plurality of frame spacings; wherein the terminal device receives a plurality of video frames from the network device.
- the packet loss parameter includes at least one of the following: the number of video frames in which packet loss occurs, the proportion of video frames in which packet loss occurs, and packet loss occurs in consecutive K video frames, where K is greater than or equal to A positive integer of 1.
- the processing unit 1002 is also used for judging whether the packet data convergence protocol PDCP sequence number SN of the data packet included in the received video frame is continuous, and when the PDCP SN of the data packet included in the received video frame is discontinuous, then determine. Packet loss occurs in the video frame.
- the lateness parameter indicates at least one of the following: the number of late video frames, the proportion of late video frames, the time difference between the actual receiving moment of one late video frame and the correct receiving moment, the number of late video frames The average value of the time difference between the actual receiving time and the correct receiving time, the maximum time difference between the actual receiving time and the correct receiving time among multiple late video frames, and the minimum time difference between the actual receiving time and the correct receiving time among multiple late video frames , the variance of the time difference between the actual receiving moment and the correct receiving moment in multiple late video frames.
- each video frame includes base layer data and enhancement layer data;
- the base layer and enhancement layer parameters include at least one of the following: in the second video frame, the terminal device receives the base layer data and the enhancement layer data.
- time difference the second video frame is a video frame in the at least one video frame; at least one of the number of third video frames, the ratio of the third video frame, or the number of packet loss in the enhancement layer in the third video frame
- the third video frame is a video frame with no packet loss in the base layer data and packet loss in the enhancement layer data
- the third video frame is a video frame in the at least one video frame
- the number of the fourth video frame The ratio of the fourth video frame, or at least one of the number of lost packets in the base layer in the fourth video frame
- the fourth video frame is a video frame with no packet loss in the enhancement layer data and packet loss in the base layer data
- the fourth video frame is a video frame in the at least one video frame.
- the video frame includes one or more data slices after network encoding
- the video frame parameter also includes at least one of the following: data slices that the terminal device needs to receive when successfully decoding a video frame.
- the number and/or the data volume of the data fragment the number of data fragments and/or the data volume of the data fragment that the terminal device needs to successfully decode the base layer of a video frame and/or the data fragment;
- the terminal The number of data slices and/or the data volume of the data slices that the device needs to receive to successfully decode the enhancement layer of a video frame.
- the apparatus 1000 may implement the steps performed by the network device in the above method embodiments, and the apparatus 1000 may be a network device, or a chip or circuit configured in the network device.
- the communication unit 1001 is configured to perform the transceiving operation of the network device in the above method embodiment
- the processing unit 1002 is configured to perform the processing related operation of the network device in the above method embodiment.
- the communication unit 1001 is configured to send at least one video frame to the terminal device; the communication unit 1001 is further configured to receive video frame parameters from the terminal device, where the video frame parameters are determined according to the at least one video frame of.
- the processing unit 1002 may be configured to optimize the transmission of downlink video frames according to the video frame parameters.
- the video frame parameters include at least one of the following: extended delay parameters, frame spacing parameters, packet loss parameters, late arrival parameters, base layer and enhancement layer parameters.
- the extension delay parameter is used to indicate at least one of the following: the extension delay of the first video frame, the extension delay of multiple video frames, the maximum extension delay of multiple video frames, the minimum extension delay of multiple video frames.
- Extended delay the variance of the extended delay of multiple video frames; wherein, the extended delay is the length of time between the terminal device successfully receiving the first data packet of a video frame to successfully receiving the last data packet of the video frame, so The first video frame or the multiple video frames are one or more video frames in at least one video frame received by the terminal device.
- the frame spacing parameter is used to indicate at least one of the following: the frame spacing of adjacent video frames, the average value of multiple frame spacings, the maximum value of multiple frame spacings, and the minimum value of multiple frame spacings, A variance of a plurality of frame spacings; wherein the terminal device receives a plurality of video frames from the network device.
- the packet loss parameter includes at least one of the following: the number of video frames with packet loss, the ratio of video frames with packet loss, and K consecutive video frames with packet loss, where K is greater than or equal to A positive integer of 1.
- the lateness parameter indicates at least one of the following: the number of late video frames, the proportion of late video frames, the time difference between the actual receiving moment of one late video frame and the correct receiving moment, the number of late video frames The average value of the time difference between the actual receiving time and the correct receiving time, the maximum time difference between the actual receiving time and the correct receiving time among multiple late video frames, and the minimum time difference between the actual receiving time and the correct receiving time among multiple late video frames , the variance of the time difference between the actual receiving moment and the correct receiving moment in multiple late video frames.
- each video frame includes base layer data and enhancement layer data; the base layer and enhancement layer parameters include at least one of the following: in the second video frame, the terminal device receives the base layer data and the enhancement layer data.
- time difference the second video frame is a video frame in the at least one video frame; at least one of the number of third video frames, the ratio of the third video frame, or the number of packet loss in the enhancement layer in the third video frame Item
- the third video frame is a video frame with no packet loss in the base layer data and packet loss in the enhancement layer data
- the third video frame is a video frame in the at least one video frame
- the number of fourth video frames the ratio of the fourth video frame, or at least one of the number of lost packets at the base layer in the fourth video frame
- the fourth video frame being a video frame with no packet loss in the enhancement layer data and packet loss at the base layer
- the fourth video frame is a video frame in the at least one video frame.
- the video frame includes one or more data fragments after network encoding
- the video frame parameter further includes at least one of the following: the data fragment that the terminal device needs to receive when successfully decoding a video frame.
- the number and/or the data volume of the data fragment the number of data fragments and/or the data volume of the data fragment that the terminal device needs to successfully decode the base layer of a video frame and/or the data fragment;
- the terminal The number of data slices and/or the data volume of the data slices that the device needs to receive to successfully decode the enhancement layer of a video frame.
- the division of units in the embodiments of the present application is schematic, and is only a logical function division. In actual implementation, there may be other division methods.
- the functional units in the various embodiments of the present application may be integrated into one processing unit. In the device, it can also exist physically alone, or two or more units can be integrated into one unit.
- the above-mentioned integrated units may be implemented in the form of hardware, or may be implemented in the form of software functional units.
- the function of the communication unit in the above embodiments may be implemented by a transceiver, and the function of the processing unit may be implemented by a processor.
- the transceiver may include a transmitter and/or a receiver, etc., for respectively implementing the functions of the transmitting unit and/or the receiving unit.
- FIG. 11 The following description is given with reference to FIG. 11 as an example.
- FIG. 11 is a schematic block diagram of an apparatus 1100 provided by an embodiment of the present application.
- the apparatus 1100 shown in FIG. 11 may be a hardware circuit implementation of the apparatus shown in FIG. 10 .
- the apparatus may perform the functions of the terminal device or the network device in the foregoing method embodiments.
- FIG. 11 only shows the main components of the communication device.
- the communication apparatus 1100 shown in FIG. 11 includes at least one processor 1101 .
- Communication apparatus 1100 may also include at least one memory 1102 for storing program instructions and/or data.
- Memory 1102 and processor 1101 are coupled.
- the coupling in the embodiments of the present application is an indirect coupling or communication connection between devices, units or modules, which may be in electrical, mechanical or other forms, and is used for information exchange between devices, units or modules.
- the processor 1101 may cooperate with the memory 1102 , the processor 1101 may execute program instructions stored in the memory 1102 , and at least one of the at least one memory 1102 may be included in the processor 1101 .
- the apparatus 1100 may also include a communication interface 1103 for communicating with other devices through a transmission medium, so that the communication apparatus 1100 may communicate with other devices.
- the communication interface may be a transceiver, a circuit, a bus, a module, or other types of communication interfaces.
- the transceiver when the communication interface is a transceiver, the transceiver may include an independent receiver and an independent transmitter; it may also be a transceiver integrating a transceiver function, or an interface circuit.
- connection medium between the above-mentioned processor 1101 , the memory 1102 , and the communication interface 1103 is not limited in the embodiments of the present application.
- the memory 1102, the processor 1101, and the communication interface 1103 are connected through a communication bus 1104 in FIG. 11.
- the bus is represented by a thick line in FIG. 11, and the connection between other components is only a schematic illustration. , not as a limitation.
- the bus may include an address bus, a data bus, a control bus, and the like. For convenience of presentation, only one thick line is used in FIG. 11 , but it does not mean that there is only one bus or one type of bus or the like.
- the apparatus 1100 is configured to implement the steps performed by the terminal device in the above method embodiments.
- the communication interface 1103 is configured to perform the transceiving related operations of the terminal device in the above method embodiments
- the processor 1101 is configured to perform the processing related operations of the terminal device in the above method embodiments.
- the communication interface 1103 is configured to receive at least one video frame from the network device; the processor 1101 is configured to determine video frame parameters according to the at least one video frame; the communication interface 1103 is further configured to send all video frames to the network device. describe the video frame parameters.
- the video frame parameters include at least one of the following: extended delay parameters, frame spacing parameters, packet loss parameters, late arrival parameters, base layer and enhancement layer parameters.
- the extension delay parameter is used to indicate at least one of the following: extension delay of the first video frame, average extension delay of multiple video frames, maximum extension delay of multiple video frames, multiple video frames The minimum extension delay of a frame, the variance of the extension delays of multiple video frames; wherein, the extension delay is the time between the terminal device successfully receiving the first data packet of a video frame and successfully receiving the last data packet of the video frame. Time length, the first video frame or the multiple video frames are one or more video frames in at least one video frame received by the terminal device.
- the frame spacing parameter is used to indicate at least one of the following: the frame spacing of adjacent video frames, the average value of multiple frame spacings, the maximum value of multiple frame spacings, and the minimum value of multiple frame spacings, A variance of a plurality of frame spacings; wherein the terminal device receives a plurality of video frames from the network device.
- the packet loss parameter includes at least one of the following: the number of video frames in which packet loss occurs, the proportion of video frames in which packet loss occurs, and packet loss occurs in consecutive K video frames, where K is greater than or equal to A positive integer of 1.
- the processor 1101 is also used to judge whether the packet data convergence protocol PDCP sequence number SN of the data packet included in the received video frame is continuous, and when the PDCP SN of the data packet included in the received video frame is discontinuous, then determine Packet loss occurs in the video frame.
- the lateness parameter indicates at least one of the following: the number of late video frames, the proportion of late video frames, the time difference between the actual receiving moment of one late video frame and the correct receiving moment, the number of late video frames The average value of the time difference between the actual receiving time and the correct receiving time, the maximum time difference between the actual receiving time and the correct receiving time among multiple late video frames, and the minimum time difference between the actual receiving time and the correct receiving time among multiple late video frames , the variance of the time difference between the actual receiving moment and the correct receiving moment in multiple late video frames.
- each video frame includes base layer data and enhancement layer data;
- the base layer and enhancement layer parameters include at least one of the following: in the second video frame, the terminal device receives the base layer data and the enhancement layer data.
- time difference the second video frame is a video frame in the at least one video frame; at least one of the number of third video frames, the ratio of the third video frame, or the number of packet loss in the enhancement layer in the third video frame
- the third video frame is a video frame with no packet loss in the base layer data and packet loss in the enhancement layer data
- the third video frame is a video frame in the at least one video frame
- the number of the fourth video frame The ratio of the fourth video frame, or at least one of the number of lost packets in the base layer in the fourth video frame
- the fourth video frame is a video frame with no packet loss in the enhancement layer data and packet loss in the base layer data
- the fourth video frame is a video frame in the at least one video frame.
- the video frame includes one or more data slices after network encoding
- the video frame parameter also includes at least one of the following: data slices that the terminal device needs to receive when successfully decoding a video frame.
- the number and/or the data volume of the data fragment the number of data fragments and/or the data volume of the data fragment that the terminal device needs to successfully decode the base layer of a video frame and/or the data fragment;
- the terminal The number of data slices and/or the data volume of the data slices that the device needs to receive to successfully decode the enhancement layer of a video frame.
- the apparatus 1100 is configured to implement the steps performed by the network device in the above method embodiments.
- the communication interface 1103 is configured to perform the transceiving related operations of the network device in the above method embodiments
- the processor 1101 is configured to perform the processing related operations of the network device in the above method embodiments.
- the communication interface 1103 is used to send at least one video frame to the terminal device; the communication interface 1103 is also used to receive video frame parameters from the terminal device, where the video frame parameters are determined according to the at least one video frame of.
- the processor 1101 may be configured to optimize the transmission of downlink video frames according to the video frame parameters.
- the video frame parameters include at least one of the following: extended delay parameters, frame spacing parameters, packet loss parameters, late arrival parameters, base layer and enhancement layer parameters.
- the extension delay parameter is used to indicate at least one of the following: the extension delay of the first video frame, the extension delay of multiple video frames, the maximum extension delay of multiple video frames, the minimum extension delay of multiple video frames.
- Extended delay the variance of the extended delay of multiple video frames; wherein, the extended delay is the length of time between the terminal device successfully receiving the first data packet of a video frame to successfully receiving the last data packet of the video frame, so The first video frame or the multiple video frames are one or more video frames in at least one video frame received by the terminal device.
- the frame spacing parameter is used to indicate at least one of the following: the frame spacing of adjacent video frames, the average value of multiple frame spacings, the maximum value of multiple frame spacings, and the minimum value of multiple frame spacings, A variance of a plurality of frame spacings; wherein the terminal device receives a plurality of video frames from the network device.
- the packet loss parameter includes at least one of the following: the number of video frames with packet loss, the ratio of video frames with packet loss, and K consecutive video frames with packet loss, where K is greater than or equal to A positive integer of 1.
- the lateness parameter indicates at least one of the following: the number of late video frames, the proportion of late video frames, the time difference between the actual receiving moment of one late video frame and the correct receiving moment, the number of late video frames The average value of the time difference between the actual receiving time and the correct receiving time, the maximum time difference between the actual receiving time and the correct receiving time among multiple late video frames, and the minimum time difference between the actual receiving time and the correct receiving time among multiple late video frames , the variance of the time difference between the actual receiving moment and the correct receiving moment in multiple late video frames.
- each video frame includes base layer data and enhancement layer data; the base layer and enhancement layer parameters include at least one of the following: in the second video frame, the terminal device receives the base layer data and the enhancement layer data.
- time difference the second video frame is a video frame in the at least one video frame; at least one of the number of third video frames, the ratio of the third video frame, or the number of packet loss in the enhancement layer in the third video frame Item
- the third video frame is a video frame with no packet loss in the base layer data and packet loss in the enhancement layer data
- the third video frame is a video frame in the at least one video frame
- the number of fourth video frames the ratio of the fourth video frame, or at least one of the number of lost packets at the base layer in the fourth video frame
- the fourth video frame being a video frame with no packet loss in the enhancement layer data and packet loss at the base layer
- the fourth video frame is a video frame in the at least one video frame.
- the video frame includes one or more data fragments after network encoding
- the video frame parameter further includes at least one of the following: the data fragment that the terminal device needs to receive when successfully decoding a video frame.
- the number and/or the data volume of the data fragment the number of data fragments and/or the data volume of the data fragment that the terminal device needs to successfully decode the base layer of a video frame and/or the data fragment;
- the terminal The number of data slices and/or the data volume of the data slices that the device needs to receive to successfully decode the enhancement layer of a video frame.
- an embodiment of the present application further provides an apparatus, where the apparatus is configured to execute the method in the method embodiment of FIG. 5 above.
- a computer-readable storage medium includes a program, when the program is executed by a processor, the method in the method embodiment of FIG. 5 above is executed.
- a computer program product comprising computer program code, when the computer program code is run on a computer, causes the computer to implement the method in the method embodiment of FIG. 5 above.
- a chip comprising: a processor, the processor is coupled with a memory, the memory is used for storing a program or an instruction, when the program or the instruction is executed by the processor, it causes the apparatus to execute the above method implementation in FIG. 5 method in the example.
- a system includes at least one of a terminal device and a network device that perform the above method embodiments.
- the processor may be a general-purpose processor, a digital signal processor, an application-specific integrated circuit, a field programmable gate array or other programmable logic device, a discrete gate or transistor logic device, or a discrete hardware component, which can be implemented or executed
- a general purpose processor may be a microprocessor or any conventional processor or the like.
- the steps of the methods disclosed in conjunction with the embodiments of the present application may be directly embodied as executed by a hardware processor, or executed by a combination of hardware and software modules in the processor.
- the memory may be a non-volatile memory, such as a hard disk drive (HDD) or a solid-state drive (SSD), etc., or may also be a volatile memory (volatile memory), for example Random-access memory (RAM).
- Memory is, but is not limited to, any other medium that can be used to carry or store desired program code in the form of instructions or data structures and that can be accessed by a computer.
- the memory in this embodiment of the present application may also be a circuit or any other device capable of implementing a storage function, for storing program instructions and/or data.
- the methods provided in the embodiments of the present application may be implemented in whole or in part by software, hardware, firmware, or any combination thereof.
- software When implemented in software, it can be implemented in whole or in part in the form of a computer program product.
- the computer program product includes one or more computer instructions. When the computer program instructions are loaded and executed on a computer, all or part of the processes or functions described in the embodiments of the present invention are generated.
- the computer may be a general purpose computer, a special purpose computer, a computer network, network equipment, user equipment, or other programmable apparatus.
- the computer instructions may be stored in or transmitted from one computer-readable storage medium to another computer-readable storage medium, for example, the computer instructions may be downloaded from a website site, computer, server, or data center Transmission to another website site, computer, server or data center by wire (eg coaxial cable, optical fiber, digital subscriber line (DSL)) or wireless (eg infrared, wireless, microwave, etc.).
- the computer-readable storage medium can be any available media that can be accessed by a computer, or a data storage device such as a server, data center, etc. that includes one or more available media integrated.
- the usable media may be magnetic media (eg, floppy disks, hard disks, magnetic tapes), optical media (eg, digital video discs (DVD)), or semiconductor media (eg, SSDs), and the like.
Landscapes
- Engineering & Computer Science (AREA)
- Signal Processing (AREA)
- Computer Networks & Wireless Communication (AREA)
- Multimedia (AREA)
- Environmental & Geological Engineering (AREA)
- Databases & Information Systems (AREA)
- Mobile Radio Communication Systems (AREA)
- Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
Abstract
一种通信方法及装置,该方法包括:终端设备从网络设备接收至少一个视频帧;终端设备根据所述至少一个视频帧,确定视频帧参数;终端设备向网络设备发送所述视频帧参数,从而使得网络设备可以得知下行视频帧的接收情况,便于优化下行视频帧的传输。
Description
本申请涉及通信技术领域,尤其涉及一种通信方法及装置。
在一种扩展现实(extend reality,XR)业务场景中,头盔内的传感器可以感知用户的位置、动作变化等,生成包括视角、视线、运动速率等的用户信息。可采用上行传输方式将用户信息传输到XR服务器。上行传输的路径可为头盔—>终端设备—>无线网络—>XR服务器。XR服务器可根据用户的位置、视线等信息,结合用户在游戏或实景中的场景,生成新的视频数据。可采用下行传输方式将新的视频数据传输到头盔。下行传输的路径可以为XR服务器—>无线网络—>终端设备—>头盔,最终通过头盔向用户展示上述视频数据。无线网络例如为第三代合作伙伴计划(3rd generation partnership project,3GPP)网络,例如长期演进(long term evolution,LTE)或第五代(the 5th generation,5G)网络等。由于下行传输的视频帧的数据量通常较大,如何优化下行传输方式,是本申请待解决的技术问题。
发明内容
本申请提供一种通信方法及装置,终端设备在接收到下行视频帧后,可向网络设备反馈下行视频帧的帧参数,从而便于优化下行视频帧的传输。
第一方面,提供一种通信方法,该方法的执行主体为终端设备,还可以为设置于终端设备中的部件(芯片,电路或其它等),该方法包括:终端设备从网络设备接收至少一个视频帧;所述终端设备根据所述至少一个视频帧,确定视频帧参数;所述终端设备向所述网络设备发送所述视频帧参数。可选的,终端设备可以视频帧为粒度,上报视频帧的接收情况,即上述视频帧参数。
通过实施上述方法,终端设备根据所接收的视频帧,确定视频帧参数,且向网络设备发送所述视频帧参数,从而使得网络设备可以获知下行视频帧的接收情况,便于优化视频帧的下行传输。
在一种可能的设计中,所述视频帧参数包括以下至少一项:扩展时延参数,帧间距参数,丢包参数,迟到参数,基本层和增强层参数。
通过实施上述方法,终端设备上报的视频帧参数,在不同维度,直观和准确的展示下行视频帧的接收情况,便于网络设备对下行视频帧传输的优化。
可选的,所述扩展时延参数用于指示以下至少一项:第一视频帧的扩展时延,多个视频帧的平均扩展时延,多个视频帧的最大扩展时延,多个视频帧的最小扩展时延,多个视频帧的扩展时延的方差;其中,所述扩展时延为所述终端设备成功接收一个视频帧的首数据包至成功接收该视频帧的尾数据包间的时间长度,所述第一视频帧或所述多个视频帧为所述终端设备接收到的至少一个视频帧中的一个或多个视频帧。
通过实施上述方法,终端设备可周期性上报上述扩展时延参数,和/或在不同的条件下上报扩展时延参数,所述上报条件或上报的扩展时延参数的内容可对应,也可不对应。对 于终端设备侧,可灵活的设置上报扩展时延参数的内容和/或方式,满足各种上报需求。
可选的,所述帧间距参数用于指示以下至少一项:相邻的视频帧的帧间距,多个帧间距的平均值,多个帧间距的最大值,多个帧间距的最小值,多个帧间距的方差;其中,所述终端设备从所述网络设备接收多个视频帧。
通过实施上述方法,网络设备可以获知下行视频帧的帧间距的情况。比如,若发现视频帧的帧间距不稳定,则网络设备后续可增加存缓器buffer的大小,尽量保证间隔相同的时间发送视频帧。
可选的,所述丢包参数,包括以下至少一项:发生丢包的视频帧的数量,发生丢包的视频帧的比例,连续K个视频帧发生丢包,所述K为大于或等于1的正整数。
通过实施上述方法,网络设备可以获知下行视频帧的丢包情况。网络设备如果发现丢包的视频帧较少,但是一旦丢包就丢多个包,则网络设备可以确定无线信号深衰时间较长,可以更换新的频段发送视频帧等。
在一种可能的设计中,还包括:所述终端设备判断接收到的视频帧所包括数据包的分组数据汇聚协议PDCP序列号SN是否连续;所述终端设备所接收视频帧所包括数据包的PDCP SN不连续,则确定所述视频帧发生丢包。
通过实施上述方法,终端设备可通过PDCP序列号SN是否连续,来判定视频帧是否丢包,易于实现。
可选的,所述迟到参数指示以下至少一项:迟到视频帧的数量,迟到视频帧所占的比例,一个迟到视频帧的实际接收时刻与正确接收时刻间的时间差,多个迟到视频帧的实际接收时刻与正确接收时刻间时间差的平均值,多个迟到视频帧中实际接收时刻与正确接收时刻间时间差的最大值,多个迟到视频帧中实际接收时刻与正确接收时刻间时间差的最小值,多个迟到视频帧中实际接收时刻与正确接收时刻间时间差的方差。
通过实施上述方法,网络设备可获知下行视频帧的迟到情况,便于优化下行视频帧的传输。
在一种可能的设计中,每个视频帧包括基本层数据和增强层数据;所述基本层和增强层参数包括以下至少一项:第二视频帧中,所述终端设备接收基本层数据和增强层数据的时间差,所述第二视频帧为所述至少一个视频帧中的视频帧;第三视频帧的数量,第三视频帧的比例,或第三视频帧中增强层丢包的数量中的至少一个,所述第三视频帧为基本层数据没有丢包,增强层数据丢包的视频帧,且所述第三视频帧为所述至少一个视频帧中的视频帧;第四视频帧的数量,第四视频帧的比例,或第四视频帧中基本层丢包的数量中的至少一个,所述第四视频帧为增强层数据没有丢包,基本层数据丢包的视频帧,且所述第四视频帧为所述至少一个视频帧中的视频帧。
通过实施上述方法,网络设备如果发现同一视频帧的基本层和增强层的时间差太大,后续可输少两者的传输间隔。
可选的,所述视频帧经过网络编码后,包括一个或多个数据分片,所述视频帧参数还包括以下至少一项:所述终设备成功解码一个视频帧所需要接收的数据分片的数量和/或所述数据分片的数据量;所述终端设备成功解码一个视频帧的基本层所需要接收的数据分片的数量和/或所述数据分片的数据量;所述终端设备成功解码一个视频帧的增强层所需要接收的数据分片的数量和/或所述数据分片的数据量。
通过实施上述方法,网络设备可根据终端设备的上报情况,调整网络编码过程中基本 层或增强层的冗余率等,优化下行视频帧的传输。
第二方面,提供一种通信方法,该方法的主体为网络设备,还可以为配置于网络设备中的部件(芯片,电路或其它等),包括:网络设备向终端设备发送至少一个视频帧;所述网络设备接收来自所述终端设备的视频帧参数,所述视频帧参数是根据所述至少一个视频帧所确定的。可选的,上述视频帧参数可以是以视频帧为粒度上报的。
通过实施上述方法,网络设备可获得下行视频帧的接收情况,便于对下行视频帧的传输过程进行优化。
在一种可能的设计中,所述视频帧参数包括以下至少一项:扩展时延参数,帧间距参数,丢包参数,迟到参数,基本层和增强层参数。
通过实施上述方法,网络设备可以在多个维度,直观和准确的获知下行视频帧的接收情况,便于优化下行视频帧的传输方案。
可选的,所述扩展时延参数用于指示以下至少一项:第一视频帧的扩展时延,多个视频帧的扩展时延,多个视频帧的最大扩展时延,多个视频帧的最小扩展时延,多个视频帧的扩展时延方差;其中,所述扩展时延为所述终端设备成功接收一个视频帧的首数据包至成功接收该视频帧的尾数据包间的时间长度,所述第一视频帧或所述多个视频帧为所述终端设备接收到的至少一个视频帧中的一个或多个视频帧。
通过实施上述方法,网络设备在获知上述扩展时延参数之后,可以根据扩展时延的大小调整调度策略。例如,如果扩展时延过大,则调度更多的资源给终端设备,或者调整调制和编码方案(modulation and coding scheme,MCS),从而提高用户体验;如果扩展时延过小,可以减少调度资源,或者调整MCS,从而节约资源等。
可选的,所述帧间距参数用于指示以下至少一项:相邻的视频帧的帧间距,多个帧间距的平均值,多个帧间距的最大值,多个帧间距的最小值,多个帧间距的方差;其中,所述终端设备从所述网络设备接收多个视频帧。
通过实施上述方法,网络设备如果发现视频帧的帧间距不稳定,则网络设备后续可增加存缓器buffer的大小,尽量保证间隔相同的时间发送视频帧。
可选的,所述丢包参数,包括以下至少一项:发生丢包的视频帧的数量,发生丢包的视频帧的比值,连续K个视频帧发生丢包,所述K为大于或等于1的正整数。
过实施上述方法,网络设备如果发现丢包的视频帧较少,但是一旦丢包就丢多个包,则网络设备确定无线信号深衰时间较长,可以更换新的频段发送视频帧。
可选的,所述迟到参数指示以下至少一项:迟到视频帧的数量,迟到视频帧所占的比例,一个迟到视频帧的实际接收时刻与正确接收时刻间的时间差,多个迟到视频帧的实际接收时刻与正确接收时刻间时间差的平均值,多个迟到视频帧中实际接收时刻与正确接收时刻间时间差的最大值,多个迟到视频帧中实际接收时刻与正确接收时刻间时间差的最小值,多个迟到视频帧中实际接收时刻与正确接收时刻时间差的方差。
在一种可能的设计中,每个视频帧包括基本层数据和增强层数据;所述基本层和增强层参数包括以下至少一项:第二视频帧中,所述终端设备接收基本层数据和增强层数据的时间差,所述第二视频帧为所述至少一个视频帧中的视频帧;第三视频帧的数量,第三视频帧的比例,或第三视频帧中增强层丢包的数量中的至少一项,所述第三视频帧为基本层数据没有丢包,增强层数据丢包的视频帧,且所述第三视频帧为所述至少一个视频帧中的视频帧;第四视频帧的数量,第四视频帧的比例,或第四视频帧中基本层丢包的数量中的 至少一项,所述第四视频帧为增强层数据没有丢包,基本层数据丢包的视频帧,且所述第四视频帧为所述至少一个视频帧中的视频帧。
通过实施上述方法,网络设备如果发现同一视频帧的基本层和增强层的时间差太大,则后续可减少两者的传输间隔。
在一种可能的设计中,所述视频帧经过网络编码后,包括一个或多个数据分片,所述视频帧参数还包括以下至少一项:所述终端设备成功解码一个视频帧所需要接收的数据分片的数量和/或所述数据分片的数据量;所述终端设备成功解码一个视频帧的基本层所需要接收的数据分片的数量和/或所述数据分片的数据量;所述终端设备成功解码一个视频帧的增强层所需要接收的数据分片的数量和/或所述数据分片的数据量。
通过实施上述方法,网络设备可根据终端设备的上报情况,调整网络编码过程中基本层或增强层的冗余率等,优化下行视频帧的传输。
第三方面,提供一种装置,有益效果可参见第一方面的描述。所述装置具有实现上述第一方面的方法实施例中行为的功能。所述功能可以通过执行相应的硬件或软件实现。所述硬件或软件可包括一个或多个上述功能相对应的单元。在一种可能的设计中,该装置可包括:通信单元,用于从网络设备接收至少一个视频帧;处理单元,用于根据所述至少一个视频帧,确定视频帧参数;通信单元,还用于向所述网络设备发送所述视频帧参数。这些单元可以执行上述第一方面方法示例中的相应功能,具体参见方法示例中的详细描述,此处不做赘述。
第四方面,提供一种装置,有益效果可参见第一方面的描述。该装置可以为上述第一方面方法中的终端设备,或者为设置在终端设备中的芯片。该装置包括通信接口以及处理器,可选的,还包括存储器。其中,该存储器用于存储计算机程序或指令,处理器与存储器、通信接口耦合,当处理器执行所述计算机程序或指令时,使装置执行上述第一方面方法中由终端设备所执行的方法。
第五方面,提供一种装置,有益效果可参见第二方面的描述。所述装置具有实现上述第二方面的方法实施例中行为的功能。所述功能可以通过执行相应的硬件或软件实现。所述硬件或软件可包括一个或多个上述功能相对应的单元。在一种可能的设计中,该装置可以包括:通信单元,用于向终端设备发送至少一个视频帧;通信单元,还用于接收来自所述终端设备的视频帧参数,所述视频帧参数是根据所述至少一个视频帧所确定的。可选的,处理单元,可用于根据所述视频帧参数,优化下行视频帧的传输。这些单元可以执行上述第二方面方法示例中的相应功能,具体参见方法示例中的详细描述,此处不再赘述。
第六方面,提供一种装置,有益效果可参见第二方法的描述。该装置可以为上述第二方面中的网络设备,或者为设置在网络设备中的芯片。该装置包括通信接口以及处理器,可选的,还包括存储器。其中,该存储器用于存储计算机程序或指令,处理器与存储器、通信接口耦合,当处理器执行所述计算机程序或指令时,使装置执行上述第二方面方法实施例中由网络设备所执行的方法。
第七方面,提供了一种计算机程序产品,所述计算机程序产品包括:计算机程序代码,当所述计算机程序代码并运行时,使得上述第一方面中由终端设备执行的方法被执行。
第八方面,提供了一种计算机程序产品,所述计算机程序产品包括:计算机程序代码,当所述计算机程序代码被运行时,使得上述第二方面中由网络设备执行的方法被执行。
第九方面,提供一种芯片系统,该芯片系统包括处理器,用于实现上述第一方面的方 法中终端设备的功能。在一种可能的设计中,所述芯片系统还包括存储器,用于保存程序指令和/或数据。该芯片系统,可以由芯片构成,也可以包括芯片和其他分立器件。
第十方面,提供一种芯片系统,该芯片系统包括处理器,用于实现上述第二方面的方法中网络设备的功能。在一种可能的设计中,所述芯片系统还包括存储器,用于保存程序指令和/或数据。该芯片系统,可以由芯片构成,也可以包括芯片和其他分立器件。
第十一方面,提供一种计算机可读存储介质,该计算机可读存储介质存储有计算机程序,当该计算机程序被运行时,实现上述第一方面中由终端设备执行的方法。
第十二方面,提供一种计算机可读存储介质,该计算机可读存储介质存储有计算机程序,当该计算机程序被运行时,实现上述第二方面中由网络设备执行的方法。
第十三方法,提供一种通信系统,包括上述第三方面或第四方面的装置,和第五方面或第六方面的装置。
图1为本申请实施例提供的网络架构的一示意图;
图2为本申请实施例提供的网络架构的另一示意图;
图3为本申请实施例提供的相邻视频帧的一示意图;
图4为本申请实施例提供的网络编码的一示意图;
图5为本申请实施例提供的通信方法的一流程图;
图6为本申请实施例提供的相邻视频帧的另一示意图;
图7为本申请实施例提供的迟到视频帧的示意图;
图8为本申请实施例提供的基本层和增强层的示意图;
图9为本申请实施例提供的网络编码的另一示意图;
图10为本申请实施例提供的装置的一结构示意图;
图11为本申请实施例提供的装置的另一结构示意图。
图1示出了本申请实施例应用的一种网络架构的示意图,包括以下至少一项:终端设备,接入网设备,核心网(core network,CN)设备,和数据网络(data network,DN)。接入网设备和核心网设备之间可通过下一代(next generation,NG)接口进行通信,不同接入网设备之间可通过Xn接口通信。
1、终端设备
终端设备可以简称为终端,是一种具有无线收发功能的设备。终端设备可以是移动的,或固定的。终端设备可以部署在陆地上,包括室内或室外,手持或车载;也可以部署在水面上(如轮船等);还可以部署在空中(例如飞机、气球和卫星上等)。所述终端设备可以是手机(mobile phone)、平板电脑(pad)、带无线收发功能的电脑、虚拟现实(virtual reality,VR)终端设备、增强现实(augmented reality,AR)终端设备、工业控制(industrial control)中的无线终端设备、无人驾驶(self driving)中的无线终端设备、远程医疗(remote medical)中的无线终端设备、智能电网(smart grid)中的无线终端设备、运输安全(transportation safety)中的无线终端设备、智慧城市(smart city)中的无线终端设备、和/或智慧家庭(smart home) 中的无线终端设备。终端设备还可以是蜂窝电话、无绳电话、会话启动协议(session initiation protocol,SIP)电话、无线本地环路(wireless local loop,WLL)站、个人数字助理(personal digital assistant,PDA)、具有无线通信功能的手持设备或计算设备、车载设备、可穿戴设备,第五代(the 5th generation,5G)网络中的终端设备或者演进的公用陆地移动通信网络(public land mobile network,PLMN)中的终端设备等。终端设备也可以称为用户设备(user equipment,UE),终端设备可以与不同技术的多个接入网设备进行通信,例如,终端设备可以与支持长期演进(long term evolution,LTE)的接入网设备通信,也可以与支持5G的接入网设备通信,还可以与支持LTE的接入网设备以及支持5G的接入网设备的双连接。本申请实施例并不限定。
2、接入网设备
接入网设备还可称为无线接入网(radio access network,RAN)设备,是一种将终端设备接入到无线网络的设备,可以为终端设备提供无线资源管理、服务质量管理、数据加密和压缩等功能。接入网设备包括但不限于:
5G中的下一代基站(generation nodeB,gNB)、演进型节点B(evolved node B,eNB)、无线网络控制器(radio network controller,RNC)、节点B(node B,NB)、基站控制器(base station controller,BSC)、基站收发台(base transceiver station,BTS)、家庭基站(例如,home evolved nodeB,或home node B,HNB)、基带单元(base band unit,BBU)、收发点(transmitting and receiving point,TRP)、发射点(transmitting point,TP)、和/或移动交换中心等。或者,接入网设备还可以是云无线接入网络(cloud radio access network,CRAN)场景下的无线控制器、集中单元(centralized unit,CU)、和/或分布单元(distributed unit,DU)。或者,接入网设备可以为中继站、接入点、车载设备、5G网络中的接入网设备,或者演进的公共陆地移动网络(public land mobile network,PLMN)中的接入网设备等。
在一些实施例中,如图2所示,接入网设备可以包括集中式单元(central unit,CU)和分布式单元(distributed unit,DU),即可对接入网设备的功能进行拆分,将接入网设备的部分功能部置在CU,剩余的部分功能部署在DU,多个DU共用一个CU,节省成本,易于网络扩展。可选的,CU和DU的功能可能按照协议栈划分。例如,将无线资源控制(radio resource control,RRC)层,业务数据适配协议(service data adaptation protocol,SDAP)层和分组数据汇聚协议(packet data convergence protocol,PDCP)层部署在CU。其余的无线链路控制(radio link control,RLC)层,媒体接入控制(medium access control,MAC)层和物理层(physical,PHY)层部署在DU。CU和DU之间可通过FI接口连接。CU可以代表接入网设备通过NG接口与核心网相连,CU还可以代表接入网设备通过Xn接口和其它接入网设备相连。更进一步的,CU的功能还可被划分为:
1、集中单元-控制平面(central unit–control plane,CU-CP):主要包括了CU中的RRC层,以及PDCP层中的控制面;
2、集中单元-用户平面(central unit–user plane,CU-UP):主要包括了CU中的SDAP层,以及PDCP层中的用户面。
3、核心网设备
核心网设备主要用于对终端设备进行管理并提供与外网通信的网关。核心网设备可以包括以下中的一个或多个网元:接入和移动管理功能(access and mobility management function,AMF)网元、会话管理功能(session management function,SMF)网元、用户面 功能(user plane function,UPF)网元、策略控制功能(policy control function,PCF)网元、应用功能(application function,AF)网元、统一数据管理(unified data management,UDM)网元、认证服务器功能(authentication server function,AUSF)网元、网络切片选择功能(network slice selection function,NSSF)网元。其中,AMF网元主要负责移动网络中的移动性管理,如用户位置更新、用户注册网络、用户切换等。SMF网元主要负责移动网络中的会话管理,如会话建立、修改、释放;具体功能如为用户分配IP地址、选择提供报文转发功能的UPF网元等。UPF网元主要负责用户数据的转发和接收;在下行传输中,UPF网元可以从数据网络(data network,DN)接收用户数据,通过接入网设备传输给终端设备;在上行传输中,UPF网元可以通过接入网设备从终端设备接收用户数据,向DN转发该用户数据。可选的,UPF网元中为终端设备提供服务的传输资源和调度功能可以由SMF网元管理控制。PCF网元主要支持提供统一的策略框架来控制网络行为,提供策略规则给控制面网络功能,同时负责获取与策略决策相关的用户签约信息。AF网元主要支持与无线网络,例如第三代合作伙伴计划(3rd generation partnership project,3GPP)网络,的核心网交互来提供服务,例如影响数据路由决策,策略控制功能或者向网络侧提供第三方的一些服务。UDM网元主要用于生成认证信任状、用户标识处理(如存储和管理用户永久身份等)、接入授权控制和签约数据管理等。AUSF网元主要用于在终端设备接入网络时执行认证,包括接收安全锚点功能(security anchor function,SEAF)发送的鉴权请求,选择鉴权方法,以及向鉴权存储和处理功能(authentication repository and processing function,ARPF)请求鉴权向量等。NSSF网元主要用于为终端设备选择网络切片实例,确定允许的网络切片选择辅助信息(network slice selection assistance information,NSSAI)、配置NSSAI和确定服务终端设备的AMF集。在不同的通信系统中,核心网中的网元或网元名称可以不同。在上述图1所示的示意图中,是以第五代移动通信系统为例进行说明的,并不作为对本申请的限定。
4、DN
DN可以是为用户提供数据业务服务的服务网络。例如,DN可以是IP多媒体业务(IP multi-media service)网络或互联网(internet)等。其中,终端设备可以建立从终端设备到DN的协议数据单元(protocol data unit,PDU)会话,来访问DN等。
在一些实施例中,扩展现实(extend reality,XR)业务通过应用层用户设备(如头盔)与XR应用服务器交互为用户提供沉浸式多媒体体验。可选的,所述XR应用服务器可位于上述图1所示框架中的DN侧,头盔可位于上述图1所示架构中的终端设备侧。头盔与终端设备在物理形态上可以是一体的,也可以是分体的,不作限定。
在上行传输过程中,头盔内的传感器可以感知用户的位置、动作变化等,生成包括视角、视线和运动速率等的用户信息。头盔可通过上述图1所示架构中的终端设备、接入网设备、UPF网元等,向XR服务器传输用户信息。
在下行传输过程中,XR服务器可结合头盔上报的用户信息和用户在游戏或实景中的场景,生成视频帧。XR服务器可通过上述图1所示架构中的UPF网元、接入网设备、终端设备等,向头盔传输视频帧,且最终通过头盔向用户展示上述视频帧。在下行传输过程中,终端设备在接收到下行视频帧时,如何向网络设备反馈视频帧的接收情况,以便优化下行视频帧的传输,是本申请实施例待解决的技术问题。
本申请提供一种通信方法及装置,该方法包括:终端设备从接入网设备接收至少一个 视频帧;终端设备根据所述至少一个视频帧,确定视频帧参数;终端设备向接入网设备发送所述视频帧参数,从而使得接入网设备可以得知下行视频帧的接收情况,便于优化下行视频帧的传输。
为了便于理解,对本申请实施例涉及的通信名词或术语进行解释说明。
1、XR的业务模型。可选的,XR业务主要包括:
上行业务:头盔生成的包括用户位置和视线等的用户信息,数据量较小;
下行业务:XR服务器根据头盔上报的用户信息,生成的视频帧,数据量较大。
其中,如图3所示,对于下行业务,XR服务器按每秒60帧的速率,生成视频帧,即每16.67ms生成一个原始视频帧。原始视频帧经过编码、压缩后,形成不同类型的视频帧。例如,典型的视频帧可包括I帧、P帧和B帧等。每个视频帧的尺寸可从1000千比特(kbit)到10000千比特(kbit)大小不等。可选的,对于视频业务,基本的处理是将视频帧划分为每秒N个画面,每个画面作为一个视频帧进行编码。由于每个视频帧包含的像素较多,直接传输会占用很大的带宽。所以,可以对视频业务进行压缩再传输。由于视频业务的天然性,只要镜头不切换,相邻的几个帧之间,通常大部分画面的内容是相同的,只有少量内容不同。所以,可以将视频帧分组,每组的第一帧是基准帧,后续的帧是依赖帧。压缩时,对基准帧进行帧内压缩,即压缩时只参考帧内自身的码流,不参考其它帧。这样,解压缩方接收到一个基准帧,不需要其它的帧,就可以独立的解压缩;对基准帧后面的依赖帧,则参照基准帧进行帧间压缩,即压缩时既参考帧内自身的码流,也参考其它帧,比如参照基准帧等。这样,压缩时可以大大提升压缩率,减少压缩后的数据尺寸。其中,基准帧又可称为I帧,其尺寸最大。依赖帧包括P帧和B帧,其尺寸较小,P帧指解码时只依赖前面的帧,B帧指解码时不仅依赖前面的帧,也依赖后面的帧。
在一些实施例中,由于网络限制,按1500字节(考虑到包头开销,实际划分的数据纯负荷量可略小于1500字节)为标准,将编码后的数据流划分为多个数据包。从无线网络角度,下行视频数据体现为UPF每16.67ms从XR服务器接收到一簇下行数据包,一簇下行数据包可包括多个尺寸约为1500字节的数据包。该一簇下行数据包即为一个下行视频帧。
2、视频分层编码
由于原始视频帧的尺寸太大,对传输网络带来很大压力。所述视频帧在传输前进行压缩。目前的压缩方式通常可以达到300:1的压缩效率,即一个300兆比特(Mbit)的文件,可以压缩至1兆比特(Mbit)左右。但是随着视频帧的分辨率越来越高,色度划分越来越细,视频帧的尺寸也越来越大,按上述300:1的压缩效率,压缩后的数据量对传输网络的压力依然太大。尤其对于无线网络,由于无线网络的容量存在很大波动,当信道质量较差时,信道传输的数据量也随之下降,无法传输高清视频帧,用户感受到马赛克,甚至图像卡帧。
基于上述情况,引入视频分层编码,即在压缩编码过程中将视频帧的信息分为两类,分别为基本层数据和增强层数据。在一些实施例中,上述两类数据的数据量之比可为1:9。当信道容量较小时,用户如果仅接收到基本层数据,可以通过显示器获得分辨率较低的图像,但是没有马赛克。如果信道容量较大,用户同时接收到基本层数据和增强层数据,可以获得分辨率较高的图像。与不分层的高清视频相比,基本层数据和增强层数据的数据量之和,大于不分层视频的数据量。分层编码的方式可以更好地应对无线信道快速变化的情 况,减少卡顿和马赛克的情况,提升用户感受。
在实际分层编码中,上述增强层可以包括一层,也可以被划分为多层,不作限定。层数越多,与不同容量的无线信道越契合。
3、网络编码
由于无线信道情况不稳定,可能发生传输错误,通常采用重传的方式进行弥补。重传能解决错包的问题,但是会引入额外的时延,效率也较低。对于实时多媒体业务,需要传输的数据量很大,几乎每个时隙都有数据包需要传输。如果额外插入一个重传的数据包,后续每个时隙的数据都向后推迟,从而带来时延增加。另外,实时多媒体业务的数据包通常较大,而错误的可能仅是其中一小部分。如果因为数据包的一小部分传输错误而重新传输整个数据包,会浪费无线资源,造成效率降低。
基于上述情况,引入了网络编码机制,其核心思想是将一组数据包统一进行编码。在一种示例中,如图4所示,可将一个视频帧包括的10个数据包划分为一组,进行统一网络编码,编码后输出若干个小数据块,再根据空口情况,将这些小数据块分批传输。接收方收到所有小数据块后,对整个数据包解码。如果发现某个数据包无法正确解出,可以通知发送方再发送一些小数据块。发送方再发送的这些小数据块,并非是前面已经传输过的数据块,而是网络编码模块新生成的小数据块,接收方可以将收到的所有小数据块统一进行解码。
在一些实施例中,如图4所示,针对每个数据包,经过PDCP层的处理后,生成PDCP PDU,传输到RLC层,经RLC的处理后,生成RLC PDU。在RLC层可对RLC PDU进行网络编码,得到若干个小数据块,传输到MAC层,经MAC层的处理,得到MAC PDU,每个MAC PDU可包括逻辑信道标识(logical channelidentity document,LCID)、长度(length,Len)和数据(data)等字段。可选的,经网络编码后形成的小数据块,还可称为数据分片。在以下实施例中,以数据分片为例进行描述。
采用上述网络编码的方式,不需要设计实时的反馈机制,发送方也不需要重传整个原始数据包,提高了无线传输的效率,降低了传输时延。
在本申请的描述中,除非另有说明,“/”表示前后关联的对象是一种“或”的关系,例如,A/B可以表示A或B;本申请中的“和/或”仅仅是一种描述关联对象的关联关系,表示可以存在三种关系,例如,A和/或B,可以表示:单独存在A,同时存在A和B,单独存在B这三种情况,其中A,B可以是单数或者复数。并且,在本申请的描述中,除非另有说明,“多个”是指两个或多于两个。“以下至少一项(个)”或其类似表达,是指的这些项中的任意组合,包括单项(个)或复数项(个)的任意组合。例如,a,b,或c中的至少一项(个),可以表示:a,b,c,a-b,a-c,b-c,或a-b-c,其中a,b,c可以是单个,也可以是多个。另外,为了便于清楚描述本申请实施例的技术方案,在本申请的实施例中,采用了“第一”、“第二”等字样对功能和作用基本相同的相同项或相似项进行区分。本领域技术人员可以理解“第一”、“第二”等字样并不对数量和执行次序进行限定,并且“第一”、“第二”等字样也并不限定一定不同。
此外,本申请实施例描述的网络架构以及业务场景是为了更加清楚的说明本申请实施例的技术方案,并不构成对于本申请实施例提供的技术方案的限定,本领域普通技术人员可知,随着网络架构的演变和新业务场景的出现,本申请实施例提供的技术方案对于类似的技术问题,同样适用。
本申请实施例中涉及的网元包括接入网设备、终端设备、头盔和XR服务器等。其中,接入网设备可以是CU/DU架构的,这时接入网设备包括CU和DU两个网元。或者,接入网设备也可以是CP-UP架构的,这时接入网设备包含CU-CP、CU-UP和DU三个网元。或者,接入网设备还可以是开放无线接入网(open radio access network,ORAN)架构的,这时接入网设备包括CU-CP、CU-UP、DU、近实时接入网智能控制器(RAN intelligent controller,RIC)四个网元甚至更多网元。进一步,DU还可被分离成DU-H和DU-L等,以支持远端物理层等低层分离。以下实施例中的网络设备(或基站)可以是接入网设备中的一个网元,或者可以包括接入网设备中的多个网元。
为了便于理解,以终端设备为UE,网络设备为基站为例进行描述。如图5所示,提供一种通信方法的流程,包括:
步骤501:UE从基站接收至少一个视频帧。可选的,如前所述,每个视频帧中可包括一个或多个数据包,每个数据包的大小约为1500字节。数据包还可称为网际协议(internet protocol,IP)包,以下以数据包为例进行描述。可选的,每个视频帧中包括的数据包可以为视频数据,或者为视频数据和音频数据等,不作限定。
步骤502:UE根据所述至少一个视频帧,确定视频帧参数。
可选的,UE可以以视频帧为粒度,确定视频帧参数。
步骤503:UE向基站发送所述视频帧参数。可选的,UE可以采用以下方式,上报视频帧参数:
-周期性上报。UE每周期上报一次视频帧参数,可以由基站,SMF网元,或XR服务器等配置上报周期。其中,由基站配置时,UE从基站接收配置信元,该配置信元指示上报周期,UE根据该配置信元指示的上报周期上报视频帧参数。由SMF网元或XR服务器配置时,SMF网元或XR服务器将上报周期的信息发送给基站,基站根据该信息生成指示上报周期的配置信元,发送给UE,或者基站直接将上报周期的信息作为配置信元发送给UE,UE根据该配置信元指示的上报周期上报视频帧参数。
-条件触发上报。UE如果发现满足上报条件,则上报一次视频帧参数,可以由基站,SMF网元,或XR服务等配置上报条件。
-周期+条件触发上报。每个周期结束时,UE统计视频帧参数。并判断预先配置的条件是否满足,如果满足条件则上报视频帧参数,否则不上报。同理,可以由基站、SMF网元或XR服务器等配置上报周期和上报条件。
其中,上报条件的配置方式同上报周期,不再赘述。UE上报所述视频帧参数的对象,即视频帧参数的接收对象,可以是基站,也可以是核心网网元,例如,SMF、或UPF等,甚至可以是XR服务器。当然,若视频帧参数的接收对象是核心网网元,或XR服务器等,基站将接收到的视频帧参数转发给核心网网元,或XR服务器。上述是以视频帧参数的接收对象为一个为例进行描述的,实际上,视频帧参数的接收对象也可以为多个。比如,上述视频帧参数的接收对象,可包括基站、核心网网元和XR服务器等。各个接收对象在接收到视频帧参数后,可分别进行对应的优化。例如,基站在接收到视频帧参数后,可针对无线资源的调度进行优化。SMF网元在接收到视频帧参数后,可针对用户的业务配置进行优化。UPF网元在接收到视频帧参数后,可针对数据包的转发优先级进行优先。XR服务器在接收到视频帧参数后,可针对下行视频帧的编码、和/或压缩方式等进行优化。
通过上述可以看出,在本申请实施例中,UE根据接收到的视频帧,得到视频帧参数,以上报视频帧的接收情况,可使得网络设备直观和准确的获得视频帧的接收状况,便于优化视频帧的传输方案。
在一种方案中,针对视频业务,UE以数据包为粒度,上报每个数据包的接收情况,比如丢包、和/或数据包时延等。这种上报方式对于视频业务的统计情况不够准确,理由如下。例如,UE丢失了两个数据包,有两种情况:
情况1:丢失的两个数据包,位于不同的视频帧;
情况2:丢失的两个数据包,位于同一视频帧。
从网络性能的角度,上述两种情况对应相同的网络性能,丢失的数据包的数量相同,但从用户感受的角度,上述两种情况对应的用户感受不同。对上述情况1,用户可以连续看到两个画质比较差的画面。对上述情况2,用户看到一个画质很差的画面,而下一个画面则很清晰。UE以数据包为粒度上报数据包的接收情况,难以反映上述两种区别。而在本申请一实施例中,UE以视频帧为粒度,上报视频帧的接收情况,可更准确的反映视频帧的接收状况,便于优化整个视频传输方案。
实施例一,在本申请实施例中,UE上报的视频帧参数可以包括以下至少一项:
1、扩展时延(spread delay)参数,所述扩展时延参数可指示以下至少一项:第一视频帧的扩展时延,多个视频帧的平均扩展时延,多个视频帧的最大扩展时延,多个视频帧的最小扩展时延,多个视频帧的扩展时延方差。其中,第一视频帧或多个视频帧为UE接收到的至少一个视频帧中的一个或多个视频帧。
一个画面经过编码、压缩等,可形成一个或多个视频帧,每个视频帧可以被划分为多个数据包。此时,从UE角度,接收一个视频帧,即接收多个数据包。扩展时延可指UE成功接收一个视频帧的首数据包至成功接收一个视频帧的尾数据包之间的时间长度。例如,如图6所示,视频帧N被划分为3个数据包,视频帧N的扩展时延可具体为:UE成功接收视频帧N的第一个数据包,至成功接收视频帧N的第3个数据包间的时间长度。
在本申请实施例中,UE可以统计每个视频帧的扩展时延;根据多个视频帧的扩展时延,得到多个视频帧扩展时延的平均值、最大值、最小值,或者方差等,并上报。或者,在本申请实施例中,UE也可以上报单个视频帧的扩展时延,即上述第一视频帧的扩展时延,不作限定。
示例的,UE上报扩展时延参数的方式,可包括以下任一种:
-周期性上报,由基站或SMF网元或XR服务配置上报周期,每周期上报一次。
-条件触发上报,由基站或SMF网元或XR服务器配置上报条件。UE如果发现满足上报条件,则上报一次。
-周期+条件触发上报,每个周期结束时,UE统计该周期内每个视频帧的扩展时延,并判断预先配置的上报条件是否满足;如果满足上报条件,则UE上报;否则不上报。同样,周期和上报条件可以由基站或SMF网元或XR服务器等配置。
其中,上述上报条件可以根据需要进行设定,不作限制。上报条件可以跟上报的内容相应,例如,上报条件可以包括单个视频帧的扩展时延大于或等于门限A1时,则上报该单个视频帧的扩展时延参数;或者,上报条件可以包括单个视频帧的扩展时延小于或等于门限B1时,则上报该单个视频帧的扩展时延参数。再如,上报条件可以包括多个视频帧的平均扩展(即多个视频帧扩展时延的平均值)时延大于或等于门限A2时,则上报多个 视频帧的平均扩展时延;或者,上报条件可以包括多个视频帧的平均扩展时延小于或等于门限B2时,则上报多个视频帧的平均扩展时延。多个视频帧扩展时延最大值、最小值,或者方差的上报与之类似。或者,上报条件可以跟上报的内容不相应,例如,上报条件可以包括单个视频帧的扩展时延大于或等于门限A3时,则上报包括该单个视频帧在内的多个视频帧的平均扩展时延;或者,上报条件可以包括单个视频帧的扩展时延大于或等于门限B3时,则上报包括该单个视频帧在内的多个视频帧的平均扩展时延;其中,该单个视频帧可以为多个视频帧的首个视频帧,中间视频帧或末尾视频帧,多个视频帧的数量可以预先设定。多个视频帧扩展时延最大值、最小值,或者方差的上报与之类似。关于门限A1-A3,B1-B3的取值不做限定,且他们可以相同,也可以不相同。
网络设备获知扩展时延参数之后,可以根据扩展时延的大小调整调度策略,例如如果扩展时延过大,则调度更多的资源给UE,或者调整调制和编码方案(modulation and coding scheme,MCS),从而提高用户体验;如果扩展时延过小,可以减少调度资源,或者调整MCS,从而节约资源。
2、帧间距参数,帧间距参数用于指示以下至少一项:相邻的视频帧的帧间距,多个帧间距的平均值,多个帧间距的最大值,多个帧间距的最小值,多个帧间距的方差。其中,UE从基站接收多个视频帧。
可选的,帧间距还可称为video frame gap。XR应用服务器中的视频编码器发出的各个视频帧,相邻视频帧之间的时间差都相同,具体值取决于帧率。例如,在每秒60帧的情况下,两个相邻视频帧之间的时间差为16.67ms。但是,由于网络传输每个视频帧所用的时间并不完全相同,特别是无线网络。所以UE在接收视频帧时,并非每16.67ms接收一个视频帧。UE可以根据两个相邻视频帧的参考位置,确定相邻视频帧的帧间距,所述参考位置可以是起始位置,中间位置,结束位置或任一其它位置等,不作限定。例如,UE可以统计相邻两个视频帧分别对应的第一个数据包的成功接收时刻,计算两个接收时刻间的时间差,作为相邻视频帧的帧间距。例如,如图6所示,视频帧N和视频帧N+1为相邻视频帧,各自包括3个数据包。UE可分别统计成功接收视频帧N中的第一个数据包的时刻,和视频帧N+1中的第一个数据包的时刻;两个时刻间的时间差,即为视频帧N与视频帧N+1的帧间距;或者,UE也可以统计相邻两个视频帧分别对应的最后一个数据包的成功接收时刻,计算两个接收时刻的时间差,作为相邻视频帧的帧间距。
3、丢包参数,丢包参数包括以下至少一项:发生丢包的视频帧的数量,发生丢包的视频帧的比例,连续K个视频帧发生丢包,K为大于或等于1的正整数。
其中,UE如果接收到一个视频帧的多个数据包后,发现有丢包,则视为“发生丢包的视频帧”。具体的,UE可以通过PDCP序列号(sequence number,SN)判断是否发生丢包,如果接收到的数据包所对应的PDCP SN不连续,则视为丢包。例如,一个视频帧包括M个数据包,M个数据包的PDCP SN依次为1至M。当UE接收到第M个数据包时,UE发现前面PDCP SN为N的数据包没有收到,可以认为该视频帧的PDCP SN不连续,则可以判定该视频帧发生了丢包。可选的,上述PDCP SN是否连续,还可替换为PDCP SN是否丢失等。具体的,UE可以在收到一个视频帧的结束标记的时刻,确定该视频帧是否发生丢包;也可以在该视频帧所对应的数据包向IP层或实时传输协议(real-time transport protocol,RTP)层递交的时刻,确定该视频帧是否发生丢包;也可以在该视频帧所对应的数据包在应用层播放的时刻,确定该视频帧是否发生丢包等,不予限制。
在一些实施例中,在一段时间内,UE可确定发生丢包的视频帧数量,丢包的视频帧占该段时间内的总视频帧的比例等。比如,在一段时间内,UE共接收到来自基站的5个视频帧。其中,3个视频帧发生丢包,则发生丢包的视频帧的数量为3,丢包的视频帧所占的比例为3/5=60%。UE还可以统计,发生丢包的视频帧中,丢1个数据包的视频帧占所有丢包视频帧的比例,丢2个数据包的视频帧占所有丢包视频帧的比例,丢3个数据包的视频帧占所有丢包视频帧的比例等。沿用上述举例,在一段时间内,有3个视频帧发生丢包,其中1个视频帧丢失了1个数据包,2个视频帧丢失了2个数据包,则上述丢1个数据包的视频帧占所有丢包视频帧的比例为1/3=33.3%,丢2个数据包的视频帧占所有丢包视频帧的比例为2/3=66.7%。UE还可以统计,有几个视频帧连续发生丢包。比如,在一段时间内,UE从基站接收到10个视频帧,序号依次为1至10。其中,1号视频帧发生丢包,2号至5号视频帧没有丢包,6号至8号视频帧发生丢包,9号和10号视频帧没有丢包,则UE可以上报6号至8号,连续3个视频帧发生丢包。
4、迟到参数,迟到参数用于指示以下至少一项:迟到视频帧的数量,迟到视频帧所占的比例,一个迟到视频帧的实际接收时刻与正确接收时刻间的时间差,多个迟到视频帧的实际接收时刻与正确接收时刻间时间差的平均值,多个迟到视频帧中实际接收时刻与正确接收时刻间时间差的最大值,多个迟到视频帧中实际接收时刻与正确接收时刻间时间差的最小值,多个迟到视频帧中实际接收时刻与正确接收时刻间时间差的方差等。
由于连续的视频帧之间相关性比较高,XR应用服务器中的编码方在对视频帧进行压缩的过程中,会选定其前面的某个视频帧作为参考帧,进行压缩。如图7所示,所以如果某个视频帧内的部分数据包在预定义的时延内没有成功传输,在时延预算之后才传输成功。虽然该视频帧无法通过显示器显示,但其可以作为后续视频帧的参考帧,依然是有用的。UE可以统计这类视频帧的数量,在所有视频帧中所占的比例,正确接收时刻比时延预算晚了多长时间等信息,并向基站上报。在本申请实施例中,UE所具体上报的参数可具体为:在一段时间内,迟到视频帧的数量,迟到视频帧所占的比例,一个迟到视频帧的实际接收时刻与正确接收时刻间的时间差,多个迟到视频帧的实际接收时刻与正确接收时刻间时间差的最大值,最小值,或方差等,不作限定。比如,在一段时间内,UE从基站接收到10个视频帧,1号至5号视频帧正确到达,6号至10号视频帧迟到。则迟到视频帧的数量为5个,迟到视频帧所占的比例为5/10=50%。6号视频帧的实际到达时刻为2020年08月28日15点05分35秒,6号视频帧的正确到达时刻应该为2020年08月28日15点05分20秒,则6号数据包的实际到达时刻与正确到达时刻间的时间差为15秒。在一种可能的实现方式中,可以采用以下方式定义“视频帧的实际接收时刻和正确接收时刻”:视频帧的实际接收时刻可以指UE接收到一个视频帧对应的所有数据包的时刻。设定第N号视频帧包括10个数据包,所述第N号视频帧的实际接收时刻可以为UE接收到该第N号视频帧的全部10个数据包的时刻。首先定义正确接收的视频帧,若一个视频帧的全部数据包都在预定义的时刻之前接收到,则该视频帧可认为是正确接收的视频帧。设定第i号视频帧是正确接收的,则第i+M号视频帧的正确接收时刻为:第i号视频帧的最后一个数据包的实际接收时刻+M*相邻视频帧间的时间差。可选的,相邻视频帧间的时间差可以为16.67ms,M为大于或等于1的正整数。
在上述描述中,是以将迟到视频帧定义为“在时延预算之后接收到,但仍然可以作为其后面其它视频帧的参考帧”为例进行描述,并不作为对本申请实施例的限定。比如,在 本申请实施例中,还可以将迟到视频帧定义为“在第一时延预算之后收到的视频帧”,或者,定义为“在第一时延预算之后且在第二时延预算之前收到的视频帧”,或者定义为“迟到时长满足预设时间要求的视频帧”等等。
5、基本层和增强层参数
如果采用视频分层编码,例如分为基本层和增强层。UE可接收到两个流flow或两个数据无线承载((data radio bearer,DRB)或两个数据通道的数据,分别为基本层数据和增强层数据。UE可以分别统计这两个层的数据接收情况,并上报以下参数中的一个或多个:
1、同一视频帧中,UE接收基本层数据和增强层数据的时间差。可选的,UE可以统计同一视频帧中,基本层数据和增强层数据中参考位置的时间差。所述参考位置可以为帧起始位置、帧结束位置,中间位置,或任意一个位置等,不作限定。如图8所示,针对视频帧N,基本层包括3个数据包,增强层包括4个数据包,可统计视频帧N的基本层和增强层的帧起始时间差,或者可统计视频帧N的基本层和增强层的帧结束时间差等。可以理解的,UE可以上报单个视频帧中基本层数据和增强层数据之间的时间差,UE也可以上报多个视频帧中基本层数据和增强层数据间时间差的平均值、最大值、最小值或方差等。
在上述描述中,以视频帧分为两层为例进行说明,实际上,视频帧还可以分为三层、四层或者更多层。比如,针对三层的情况,整个视频帧被分为基本层,第一增强层和第二增强层。针对四层的情况,整个视频帧被分为基本层,第一增强层,第二增强层和第三增强层。以上述视频帧被分三层为例,UE可分别上报第一增强层数据和基本层数据间的时间差,第二增强层和基本层数据的时间差等,甚至可以上报不同增强层之间的时间差等,不作限定。
2、同一视频帧中,基本层没有丢包,但增强层丢包的视频帧数量、比例或丢包数量等,或者增强层丢包的视频帧数量、比例和丢包数量等中的多个。由于网络对基本层数据采取更鲁棒的传输策略,所以增强层的数据包发生丢包的概率更大。UE可统计基本层没有丢包,但增强层有丢包的视频帧的数量、比例和丢包数量等信息中的一个或多个,并上报基站。基站可以依据该信息调整增强层的传输策略。
上述是以视频帧被分为基本层和增强层两层为例进行描述的,实际中视频帧可以被分为三层、四层或更多层。以视频帧被分为三层为例,UE可统计基本层没有丢包,但第一增强层丢包的视频帧数量、比例或丢包数量等;基本层没有丢包,但第二增强层丢包的视频帧数量、比例或丢包数量等;甚至UE可以上报基本层和第一增强层没有丢包,但第二增强层丢包的视频帧数量、比例或丢包数量等,或者,UE可以上报基本层和第二增强层没有丢包,但第一增强层丢包的视频帧数量、比例或丢包数量等,或者,UE可以上报基本层没有丢包,但第一增强层和第二增强层同时丢包的视频帧数量、比例或丢包数量等。
3、同一视频帧中,增强层没有丢包,但基本层丢包的视频帧数量、比例或丢包数量等,或者基本层丢包的视频帧数量、比例和丢包数量等中的多个。由于网络对基本层数据采取更鲁棒的传输策略,虽然这种情况出现的概率较低,但是不等于没有。UE可向基站上报这种情况,基站可以据此分析基本层丢包的原因,优化网络部署。
同样的,上述是以视频帧被分为基本层和增强层两层为例进行描述的,实际中视频帧可以被分为三层、四层甚至更多层。以视频帧被分为三层为例,UE可统计第一增强层没有丢包,但基本层有丢包的视频帧数量、比例或丢包数量等。或者,UE可统计第二增强层没有丢包,但基本层有丢包的视频帧数量、比例或丢包数量等。或者,UE可以统计第 一增强层和第二增强层同时没有丢包,但基本层丢包的视频帧数量、比例或丢包数量等。
可选的,UE具体上报上述帧间距参数、丢包参数、迟到参数或基本层和增强层参数的方式,与前述UE上报扩展时延参数的方式相似,不再一一说明。
通过上述可以看出,UE以视频帧为粒度,上报各种视频帧参数,从而使网络更加直观地了解视频帧的传输情况,优化网络算法。
以上以扩展时延为例,描述了上报条件,关于其他参数的上报条件可以参照扩展时延。
实施例二,由于视频业务的特殊性,可以采用网络编码的方式。具体的,可以由基站对视频业务的数据进行网络编码,再传输至UE。基站通常可将一个视频帧所包括的多个数据包统一进行网络编码。在这种情况下,UE上报的视频帧参数可包括以下至少一项:
1、扩展时延参数
同实施例一中的扩展时延参数,区别仅在于上述实施例一中的视频帧未经过网络编码,本申请实施例中的视频帧经过网络编码。在一些实施例中,基站在接收到扩展时延参数后,若发现扩展时延太大,超出通用的视频解码器的解码时长,则基站后续可以选择恰当的时机发送后续视频帧的第一个数据包,确保在合适的扩展时延内发送完该视频帧的最后一个数据包。
2、帧间距参数
同实施例一中的帧间距参数,区别仅在于上述实施例一中的视频帧未经过网络编码,本申请实施例中的视频帧经过网络编码。在一些实施例中,基站如果发现视频帧的帧间距不稳定,则基站后续可增加存缓器buffer的大小,尽量保证间隔相同的时间发送视频帧。
3、丢包参数
同实施例一中的丢包参数,区别仅在于实施一中的视频帧未经过网络编码,本申请实施例中的视频帧经过网络编码。在一些实施例中,基站如果发现丢包的视频帧较少,但是一旦丢包就丢多个包,则基站确定无线信号深衰时间较长,可以更换新的频段发送视频帧。
4、基本层和增强层参数
同实施例一中的基本层和增强层参数,区别仅在于实施例一中的视频帧未经过网络编码,本申请实施例中的视频帧经过网络编码。在一些实施例中,基站如果发现同一视频帧的基本层和增强层的时间差太大,后续可输少两者的传输间隔。
5、数据分片参数;视频帧经过网络编码,包括一个或多个数据分片,所述数据分片参数包括用于以下至少一项的参数:
-UE成功解码一个视频帧所需要接收的数据分片的数量和/或数据量;
其中,视频帧经过网络编码,原有的数据会发生冗余,数据量会增加。冗余越多,对网络传输中的丢包率容忍度越大,对网络传输容量的要求越大。比如,一个视频帧的数据包,在网络编码前,数据量为1000千比特(Kbit),进行网络编码后,数据量为1500千比特(Kbit)。但是,UE接收到1200千比特(Kbit),即使中间有部分丢包,UE也能做网络解码并成功恢复视频帧的全部数据,则UE可以上报指示UE成功解码一个视频帧所需要接收的数据分片的数据量,例如“需要1200千比特就能成功进行网络解码”,的参数。类似的,UE可以上报UE成功解码一个视频帧所需要接收的数据分片的数量,例如“需要N个数据分片就能成功进行网络解码”,的参数。基站获得该上报信息后,可以对后续视频帧,下调网络编码的冗余率。同时,为方便基站控制网络编码的冗余率,UE还可以统计并上报“自己接收到并成功解码一个视频帧所需要的数据量”。可以理解的是,UE上报的 可以是成功解码单个视频帧所需要数据分片的数量或数据量,或者,可以是UE成功解码多个视频帧所需要数据分片的数量或数据量的平均值、最大值、最小值或方差等。
-UE成功解码一个视频帧的基本层所需要接收的数据分片的数量和/或数据量。
-UE成功解码一个视频帧的增强层所需要接收的数据分片的数量和/或数据量。
如图9所示,在采用视频分层编码+网络编码的情况下,基站可以针对基本层和增强层分别进行网络编码。UE在接收到基本层数据后,可统计成功解码基本层数据所需数据分片的数量或数据量,并上报。UE在接收到增强层数据后,可统计成功解码增强层数据所需数据分片的数量或数据量等。当然,上述数据分片的数量或数据量可以是单个视频帧的基本层或增强层所需数据分片的数量或数据量,也可以是多个视频帧中基本层或增强层所需数据分片的数量或数据量的平均值、最大值、最小值或方差等,不作限定。可选的,基站可以根据UE的上报情况,调整网络编码过程中基本层或增强层的冗余率。
以上结合图1至图9详细说明了本申请实施例所提供的方法。以下结合图10和图11详细说明本申请实施例所提供的装置。应理解,装置与方法实施例的描述相互对应,装置中未详细描述的内容可参见上文方法实施例中的描述。
图10是本申请实施例提供的装置1000的示意性框图,用于实现上述方法中终端设备或网络设备的功能。例如,装置1000可以为软件单元或芯片系统。所述芯片系统可以由芯片构成,也可以包括芯片和其它分立器件。该装置包括通信单元1001,还可包括处理单元1002。通信单元1001,可以与外部进行通信。处理单元1002,用于进行处理。通信单元1001,还可以称为通信接口、收发单元、输入\输出接口等。
在一种示例中,装置1000可实现上述实施例中终端设备执行的步骤。所述装置1000可以是终端设备,或者配置于终端设备中的芯片或电路。通信单元1001执行上文实施例的收发操作,处理单元1002用于执行上文方法实施例中终端设备的处理相关操作。
例如,通信单元1001,用于从网络设备接收至少一个视频帧;处理单元1002,用于根据所述至少一个视频帧,确定视频帧参数;通信单元1001,还用于向所述网络设备发送所述视频帧参数。
可选的,所述视频帧参数包括以下至少一项:扩展时延参数,帧间距参数,丢包参数,迟到参数,基本层和增强层参数。
可选的,所述扩展时延参数用于指示以下至少一项:第一视频帧的扩展时延,多个视频帧的平均扩展时延,多个视频帧的最大扩展时延,多个视频帧的最小扩展时延,多个视频帧的扩展时延的方差;其中,所述扩展时延为所述终端设备成功接收一个视频帧的首数据包至成功接收该视频帧的尾数据包间的时间长度,所述第一视频帧或所述多个视频帧为所述终端设备接收到的至少一个视频帧中的一个或多个视频帧。
可选的,所述帧间距参数用于指示以下至少一项:相邻的视频帧的帧间距,多个帧间距的平均值,多个帧间距的最大值,多个帧间距的最小值,多个帧间距的方差;其中,所述终端设备从所述网络设备接收多个视频帧。
可选的,所述丢包参数,包括以下至少一项:发生丢包的视频帧的数量,发生丢包的视频帧的比例,连续K个视频帧发生丢包,所述K为大于或等于1的正整数。
可选的,处理单元1002,还用于判断接收到的视频帧所包括数据包的分组数据汇聚协议PDCP序列号SN是否连续,在接收视频帧所包括数据包的PDCP SN不连续时,则确定所述视频帧发生丢包。
可选的,所述迟到参数指示以下至少一项:迟到视频帧的数量,迟到视频帧所占的比例,一个迟到视频帧的实际接收时刻与正确接收时刻间的时间差,多个迟到视频帧的实际接收时刻与正确接收时刻间时间差的平均值,多个迟到视频帧中实际接收时刻与正确接收时刻间时间差的最大值,多个迟到视频帧中实际接收时刻与正确接收时刻间时间差的最小值,多个迟到视频帧中实际接收时刻与正确接收时刻间时间差的方差。
可选的,每个视频帧包括基本层数据和增强层数据;所述基本层和增强层参数包括以下至少一项:第二视频帧中,所述终端设备接收基本层数据和增强层数据的时间差,所述第二视频帧为所述至少一个视频帧中的视频帧;第三视频帧的数量,第三视频帧的比例,或第三视频帧中增强层丢包的数量中的至少一个,所述第三视频帧为基本层数据没有丢包,增强层数据丢包的视频帧,且所述第三视频帧为所述至少一个视频帧中的视频帧;第四视频帧的数量,第四视频帧的比例,或第四视频帧中基本层丢包的数量中的至少一个,所述第四视频帧为增强层数据没有丢包,基本层数据丢包的视频帧,且所述第四视频帧为所述至少一个视频帧中的视频帧。
可选的,所述视频帧经过网络编码后,包括一个或多个数据分片,所述视频帧参数还包括以下至少一项:所述终设备成功解码一个视频帧所需要接收的数据分片的数量和/或所述数据分片的数据量;所述终端设备成功解码一个视频帧的基本层所需要接收的数据分片的数量和/或所述数据分片的数据量;所述终端设备成功解码一个视频帧的增强层所需要接收的数据分片的数量和/或所述数据分片的数据量。
在一种示例中,装置1000可实现上文方法实施例中网络设备执行的步骤,装置1000可以是网络设备,或者配置于网络设备中的芯片或电路。通信单元1001用于执行上文方法实施例中网络设备的收发操作,处理单元1002用于执行上文方法实施例中网络设备的处理相关操作。
比如,通信单元1001,用于向终端设备发送至少一个视频帧;通信单元1001,还用于接收来自所述终端设备的视频帧参数,所述视频帧参数是根据所述至少一个视频帧所确定的。可选的,处理单元1002,可用于根据所述视频帧参数,优化下行视频帧的传输。
可选的,所述视频帧参数包括以下至少一项:扩展时延参数,帧间距参数,丢包参数,迟到参数,基本层和增强层参数。
可选的,扩展时延参数用于指示以下至少一项:第一视频帧的扩展时延,多个视频帧的扩展时延,多个视频帧的最大扩展时延,多个视频帧的最小扩展时延,多个视频帧的扩展时延方差;其中,所述扩展时延为所述终端设备成功接收一个视频帧的首数据包至成功接收该视频帧的尾数据包间的时间长度,所述第一视频帧或所述多个视频帧为所述终端设备接收到的至少一个视频帧中的一个或多个视频帧。
可选的,所述帧间距参数用于指示以下至少一项:相邻的视频帧的帧间距,多个帧间距的平均值,多个帧间距的最大值,多个帧间距的最小值,多个帧间距的方差;其中,所述终端设备从所述网络设备接收多个视频帧。
可选的,所述丢包参数,包括以下至少一项:发生丢包的视频帧的数量,发生丢包的视频帧的比值,连续K个视频帧发生丢包,所述K为大于或等于1的正整数。
可选的,所述迟到参数指示以下至少一项:迟到视频帧的数量,迟到视频帧所占的比例,一个迟到视频帧的实际接收时刻与正确接收时刻间的时间差,多个迟到视频帧的实际接收时刻与正确接收时刻间时间差的平均值,多个迟到视频帧中实际接收时刻与正确接收 时刻间时间差的最大值,多个迟到视频帧中实际接收时刻与正确接收时刻间时间差的最小值,多个迟到视频帧中实际接收时刻与正确接收时刻时间差的方差。
可选的,每个视频帧包括基本层数据和增强层数据;所述基本层和增强层参数包括以下至少一项:第二视频帧中,所述终端设备接收基本层数据和增强层数据的时间差,所述第二视频帧为所述至少一个视频帧中的视频帧;第三视频帧的数量,第三视频帧的比例,或第三视频帧中增强层丢包的数量中的至少一项,所述第三视频帧为基本层数据没有丢包,增强层数据丢包的视频帧,且所述第三视频帧为所述至少一个视频帧中的视频帧;第四视频帧的数量,第四视频帧的比例,或第四视频帧中基本层丢包的数量中的至少一项,所述第四视频帧为增强层数据没有丢包,基本层数据丢包的视频帧,且所述第四视频帧为所述至少一个视频帧中的视频帧。
可选的,所述视频帧经过网络编码后,包括一个或多个数据分片,所述视频帧参数还包括以下至少一项:所述终端设备成功解码一个视频帧所需要接收的数据分片的数量和/或所述数据分片的数据量;所述终端设备成功解码一个视频帧的基本层所需要接收的数据分片的数量和/或所述数据分片的数据量;所述终端设备成功解码一个视频帧的增强层所需要接收的数据分片的数量和/或所述数据分片的数据量。
本申请实施例中对单元的划分是示意性的,仅仅为一种逻辑功能划分,实际实现时可以有另外的划分方式,另外,在本申请各个实施例中的各功能单元可以集成在一个处理器中,也可以是单独物理存在,也可以两个或两个以上单元集成在一个单元中。上述集成的单元既可以采用硬件的形式实现,也可以采用软件功能单元的形式实现。
可以理解的是,上述实施例中的通信单元的功能可以由收发器实现,处理单元的功能可以由处理器实现。收发器可以包括发射器和/或接收器等,分别用于实现发送单元和/或接收单元的功能。以下结合图11举例进行说明。
图11是本申请实施例提供的装置1100的示意性框图,图11所示的装置1100可以为图10所示的装置的一种硬件电路的实现方式。该装置可执行上述方法实施例中终端设备或网络设备的功能。为了便于说明,图11仅示出该通信装置的主要部件。
图11所示的通信装置1100包括至少一个处理器1101。通信装置1100还可以包括至少一个存储器1102,用于存储程序指令和/或数据。存储器1102和处理器1101耦合。本申请实施例中的耦合是装置、单元或模块之间的间接耦合或通信连接,可以是电性、机械性或其它的形式,用于装置、单元或模块之间的信息交互。处理器1101可以和存储器1102协同操作,处理器1101可以执行存储器1102中存储的程序指令,所述至少一个存储器1102中的至少一个可以包括于处理器1101中。
装置1100还可以包括通信接口1103,用于通过传输介质和其它设备进行通信,从而用于通信装置1100可以和其它设备进行通信。在本申请实施例中,通信接口可以是收发器、电路、总线、模块或其它类型的通信接口。在本申请实施例中,通信接口为收发器时,收发器可以包括独立的接收器、独立的发射器;也可以集成收发功能的收发器、或者是接口电路。
应理解,本申请实施例中不限定上述处理器1101、存储器1102以及通信接口1103之间的连接介质。本申请实施例在图11中以存储器1102、处理器1101以及通信接口1103之间通过通信总线1104连接,总线在图11中以粗线表示,其它部件之间的连接方式,仅 是示意性说明,并不作为限定。所述总线可以包括地址总线、数据总线、控制总线等。为了便于表示,图11中仅用一条粗线表示,但并不表示仅有一根总线或一种类型的总线等。
在一种示例中,装置1100用于实现上文方法实施例中终端设备执行的步骤。通信接口1103用于执行上文方法实施例中终端设备的收发相关操作,处理器1101用于执行上文方法实施例中终端设备的处理相关操作。
例如,通信接口1103,用于从网络设备接收至少一个视频帧;处理器1101,用于根据所述至少一个视频帧,确定视频帧参数;通信接口1103,还用于向所述网络设备发送所述视频帧参数。
可选的,所述视频帧参数包括以下至少一项:扩展时延参数,帧间距参数,丢包参数,迟到参数,基本层和增强层参数。
可选的,所述扩展时延参数用于指示以下至少一项:第一视频帧的扩展时延,多个视频帧的平均扩展时延,多个视频帧的最大扩展时延,多个视频帧的最小扩展时延,多个视频帧的扩展时延的方差;其中,所述扩展时延为所述终端设备成功接收一个视频帧的首数据包至成功接收该视频帧的尾数据包间的时间长度,所述第一视频帧或所述多个视频帧为所述终端设备接收到的至少一个视频帧中的一个或多个视频帧。
可选的,所述帧间距参数用于指示以下至少一项:相邻的视频帧的帧间距,多个帧间距的平均值,多个帧间距的最大值,多个帧间距的最小值,多个帧间距的方差;其中,所述终端设备从所述网络设备接收多个视频帧。
可选的,所述丢包参数,包括以下至少一项:发生丢包的视频帧的数量,发生丢包的视频帧的比例,连续K个视频帧发生丢包,所述K为大于或等于1的正整数。
可选的,处理器1101,还用于判断接收到的视频帧所包括数据包的分组数据汇聚协议PDCP序列号SN是否连续,在接收视频帧所包括数据包的PDCP SN不连续时,则确定所述视频帧发生丢包。
可选的,所述迟到参数指示以下至少一项:迟到视频帧的数量,迟到视频帧所占的比例,一个迟到视频帧的实际接收时刻与正确接收时刻间的时间差,多个迟到视频帧的实际接收时刻与正确接收时刻间时间差的平均值,多个迟到视频帧中实际接收时刻与正确接收时刻间时间差的最大值,多个迟到视频帧中实际接收时刻与正确接收时刻间时间差的最小值,多个迟到视频帧中实际接收时刻与正确接收时刻间时间差的方差。
可选的,每个视频帧包括基本层数据和增强层数据;所述基本层和增强层参数包括以下至少一项:第二视频帧中,所述终端设备接收基本层数据和增强层数据的时间差,所述第二视频帧为所述至少一个视频帧中的视频帧;第三视频帧的数量,第三视频帧的比例,或第三视频帧中增强层丢包的数量中的至少一个,所述第三视频帧为基本层数据没有丢包,增强层数据丢包的视频帧,且所述第三视频帧为所述至少一个视频帧中的视频帧;第四视频帧的数量,第四视频帧的比例,或第四视频帧中基本层丢包的数量中的至少一个,所述第四视频帧为增强层数据没有丢包,基本层数据丢包的视频帧,且所述第四视频帧为所述至少一个视频帧中的视频帧。
可选的,所述视频帧经过网络编码后,包括一个或多个数据分片,所述视频帧参数还包括以下至少一项:所述终设备成功解码一个视频帧所需要接收的数据分片的数量和/或所述数据分片的数据量;所述终端设备成功解码一个视频帧的基本层所需要接收的数据分片的数量和/或所述数据分片的数据量;所述终端设备成功解码一个视频帧的增强层所需要接 收的数据分片的数量和/或所述数据分片的数据量。
在另一种示例中,装置1100用于实现上文方法实施例中网络设备执行的步骤。通信接口1103用于执行上文方法实施例中网络设备的收发相关操作,处理器1101用于执行上文方法实施例中网络设备的处理相关操作。
比如,通信接口1103,用于向终端设备发送至少一个视频帧;通信接口1103,还用于接收来自所述终端设备的视频帧参数,所述视频帧参数是根据所述至少一个视频帧所确定的。可选的,处理器1101,可用于根据所述视频帧参数,优化下行视频帧的传输。
可选的,所述视频帧参数包括以下至少一项:扩展时延参数,帧间距参数,丢包参数,迟到参数,基本层和增强层参数。
可选的,扩展时延参数用于指示以下至少一项:第一视频帧的扩展时延,多个视频帧的扩展时延,多个视频帧的最大扩展时延,多个视频帧的最小扩展时延,多个视频帧的扩展时延方差;其中,所述扩展时延为所述终端设备成功接收一个视频帧的首数据包至成功接收该视频帧的尾数据包间的时间长度,所述第一视频帧或所述多个视频帧为所述终端设备接收到的至少一个视频帧中的一个或多个视频帧。
可选的,所述帧间距参数用于指示以下至少一项:相邻的视频帧的帧间距,多个帧间距的平均值,多个帧间距的最大值,多个帧间距的最小值,多个帧间距的方差;其中,所述终端设备从所述网络设备接收多个视频帧。
可选的,所述丢包参数,包括以下至少一项:发生丢包的视频帧的数量,发生丢包的视频帧的比值,连续K个视频帧发生丢包,所述K为大于或等于1的正整数。
可选的,所述迟到参数指示以下至少一项:迟到视频帧的数量,迟到视频帧所占的比例,一个迟到视频帧的实际接收时刻与正确接收时刻间的时间差,多个迟到视频帧的实际接收时刻与正确接收时刻间时间差的平均值,多个迟到视频帧中实际接收时刻与正确接收时刻间时间差的最大值,多个迟到视频帧中实际接收时刻与正确接收时刻间时间差的最小值,多个迟到视频帧中实际接收时刻与正确接收时刻时间差的方差。
可选的,每个视频帧包括基本层数据和增强层数据;所述基本层和增强层参数包括以下至少一项:第二视频帧中,所述终端设备接收基本层数据和增强层数据的时间差,所述第二视频帧为所述至少一个视频帧中的视频帧;第三视频帧的数量,第三视频帧的比例,或第三视频帧中增强层丢包的数量中的至少一项,所述第三视频帧为基本层数据没有丢包,增强层数据丢包的视频帧,且所述第三视频帧为所述至少一个视频帧中的视频帧;第四视频帧的数量,第四视频帧的比例,或第四视频帧中基本层丢包的数量中的至少一项,所述第四视频帧为增强层数据没有丢包,基本层数据丢包的视频帧,且所述第四视频帧为所述至少一个视频帧中的视频帧。
可选的,所述视频帧经过网络编码后,包括一个或多个数据分片,所述视频帧参数还包括以下至少一项:所述终端设备成功解码一个视频帧所需要接收的数据分片的数量和/或所述数据分片的数据量;所述终端设备成功解码一个视频帧的基本层所需要接收的数据分片的数量和/或所述数据分片的数据量;所述终端设备成功解码一个视频帧的增强层所需要接收的数据分片的数量和/或所述数据分片的数据量。
进一步的,本申请实施例还提供一种装置,所述装置用于执行上文图5方法实施例中的方法。一种计算机可读存储介质,包括程序,当所述程序被处理器运行时,上文图5方法实施例中的方法被执行。一种计算机程序产品,所述计算机程序产品包括计算机程序代 码,当所述计算机程序代码在计算机上运行时,使得计算机实现上文图5方法实施例中的方法。一种芯片,包括:处理器,所述处理器与存储器耦合,所述存储器用于存储程序或指令,当所述程序或指令被所述处理器执行时,使得装置执行上文图5方法实施例中的方法。一种系统,包括执行上文方法实施例的终端设备和网络设备中的至少一个。
本申请实施例中,处理器可以是通用处理器、数字信号处理器、专用集成电路、现场可编程门阵列或者其他可编程逻辑器件、分立门或者晶体管逻辑器件、分立硬件组件,可以实现或者执行本申请实施例中的公开的各方法、步骤及逻辑框图。通用处理器可以是微处理器或者任何常规的处理器等。结合本申请实施例所公开的方法的步骤可以直接体现为硬件处理器执行完成,或者用处理器中的硬件及软件模块组合执行完成。
在本申请实施例中,存储器可以是非易失性存储器,比如硬盘(hard disk drive,HDD)或固态硬盘(solid-state drive,SSD)等,还可以是易失性存储器(volatile memory),例如随机存取存储器(random-access memory,RAM)。存储器是能够用于携带或存储具有指令或数据结构形式的期望的程序代码并能够由计算机存取的任何其他介质,但不限于此。本申请实施例中的存储器还可以是电路或者其它任意能够实现存储功能的装置,用于存储程序指令和/或数据。
本申请实施例提供的方法中,可以全部或部分地通过软件、硬件、固件或者其任意组合来实现。当使用软件实现时,可以全部或部分地以计算机程序产品的形式实现。所述计算机程序产品包括一个或多个计算机指令。在计算机上加载和执行所述计算机程序指令时,全部或部分地产生按照本发明实施例所述的流程或功能。所述计算机可以是通用计算机、专用计算机、计算机网络、网络设备、用户设备或者其他可编程装置。所述计算机指令可以存储在计算机可读存储介质中,或者从一个计算机可读存储介质向另一个计算机可读存储介质传输,例如,所述计算机指令可以从一个网站站点、计算机、服务器或数据中心通过有线(例如同轴电缆、光纤、数字用户线(digital subscriber line,简称DSL))或无线(例如红外、无线、微波等)方式向另一个网站站点、计算机、服务器或数据中心进行传输。所述计算机可读存储介质可以是计算机可以存取的任何可用介质或者是包含一个或多个可用介质集成的服务器、数据中心等数据存储设备。所述可用介质可以是磁性介质(例如,软盘、硬盘、磁带)、光介质(例如,数字视频光盘(digital video disc,简称DVD))、或者半导体介质(例如,SSD)等。
显然,本领域的技术人员可以对本申请进行各种改动和变型而不脱离本申请的范围。这样,倘若本申请的这些修改和变型属于本申请权利要求及其等同技术的范围之内,则本申请也意图包含这些改动和变型在内。
Claims (21)
- 一种通信方法,其特征在于,包括:终端设备从网络设备接收至少一个视频帧;所述终端设备根据所述至少一个视频帧,确定视频帧参数;所述终端设备向所述网络设备发送所述视频帧参数。
- 如权利要求1所述的方法,其特征在于,所述视频帧参数包括以下至少一项:扩展时延参数,帧间距参数,丢包参数,迟到参数,基本层和增强层参数。
- 如权利要求2所述的方法,其特征在于,所述扩展时延参数用于指示以下至少一项:第一视频帧的扩展时延,多个视频帧的平均扩展时延,多个视频帧的最大扩展时延,多个视频帧的最小扩展时延,多个视频帧的扩展时延的方差;其中,所述扩展时延为所述终端设备成功接收一个视频帧的首数据包至成功接收该视频帧的尾数据包间的时间长度,所述第一视频帧或所述多个视频帧为所述终端设备接收到的至少一个视频帧中的一个或多个视频帧。
- 如权利要求2或3所述的方法,其特征在于,所述帧间距参数用于指示以下至少一项:相邻的视频帧的帧间距,多个帧间距的平均值,多个帧间距的最大值,多个帧间距的最小值,多个帧间距的方差;其中,所述终端设备从所述网络设备接收多个视频帧。
- 如权利要求2至4任一项所述的方法,其特征在于,所述丢包参数,包括以下至少一项:发生丢包的视频帧的数量,发生丢包的视频帧的比例,连续K个视频帧发生丢包,所述K为大于或等于1的正整数。
- 如权利要求5所述的方法,其特征在于,还包括:所述终端设备判断接收到的视频帧所包括数据包的分组数据汇聚协议PDCP序列号SN是否连续;所述终端设备所接收视频帧所包括数据包的PDCP SN不连续,则确定所述视频帧发生丢包。
- 如权利要求2至6任一项所述的方法,其特征在于,所述迟到参数指示以下至少一项:迟到视频帧的数量,迟到视频帧所占的比例,一个迟到视频帧的实际接收时刻与正确接收时刻间的时间差,多个迟到视频帧的实际接收时刻与正确接收时刻间时间差的平均值,多个迟到视频帧中实际接收时刻与正确接收时刻间时间差的最大值,多个迟到视频帧中实际接收时刻与正确接收时刻间时间差的最小值,多个迟到视频帧中实际接收时刻与正确接收时刻间时间差的方差。
- 如权利要求2至7任一项所述的方法,其特征在于,每个视频帧包括基本层数据和增强层数据;所述基本层和增强层参数包括以下至少一项:第二视频帧中,所述终端设备接收基本层数据和增强层数据的时间差,所述第二视频帧为所述至少一个视频帧中的视频帧;第三视频帧的数量,第三视频帧的比例,或第三视频帧中增强层丢包的数量中的至少一个,所述第三视频帧为基本层数据没有丢包,增强层数据丢包的视频帧,且所述第三视频帧为所述至少一个视频帧中的视频帧;第四视频帧的数量,第四视频帧的比例,或第四视频帧中基本层丢包的数量中的至少一个,所述第四视频帧为增强层数据没有丢包,基本层数据丢包的视频帧,且所述第四视频帧为所述至少一个视频帧中的视频帧。
- 如权利要求1至8中任一项所述的方法,其特征在于,所述视频帧经过网络编码后,包括一个或多个数据分片,所述视频帧参数还包括以下至少一项:所述终设备成功解码一个视频帧所需要接收的数据分片的数量和/或所述数据分片的数据量;所述终端设备成功解码一个视频帧的基本层所需要接收的数据分片的数量和/或所述数据分片的数据量;所述终端设备成功解码一个视频帧的增强层所需要接收的数据分片的数量和/或所述数据分片的数据量。
- 一种通信方法,其特征在于,包括:网络设备向终端设备发送至少一个视频帧;所述网络设备接收来自所述终端设备的视频帧参数,所述视频帧参数是根据所述至少一个视频帧所确定的。
- 如权利要求10所述的方法,其特征在于,所述视频帧参数包括以下至少一项:扩展时延参数,帧间距参数,丢包参数,迟到参数,基本层和增强层参数。
- 如权利要求11所述的方法,其特征在于,所述扩展时延参数用于指示以下至少一项:第一视频帧的扩展时延,多个视频帧的扩展时延,多个视频帧的最大扩展时延,多个视频帧的最小扩展时延,多个视频帧的扩展时延方差;其中,所述扩展时延为所述终端设备成功接收一个视频帧的首数据包至成功接收该视频帧的尾数据包间的时间长度,所述第一视频帧或所述多个视频帧为所述终端设备接收到的至少一个视频帧中的一个或多个视频帧。
- 如权利要求11或12所述的方法,其特征在于,所述帧间距参数用于指示以下至少一项:相邻的视频帧的帧间距,多个帧间距的平均值,多个帧间距的最大值,多个帧间距的最小值,多个帧间距的方差;其中,所述终端设备从所述网络设备接收多个视频帧。
- 如权利要求11至13任一项所述的方法,其特征在于,所述丢包参数,包括以下至少一项:发生丢包的视频帧的数量,发生丢包的视频帧的比值,连续K个视频帧发生丢包,所述K为大于或等于1的正整数。
- 如权利要求11至14任一项所述的方法,其特征在于,所述迟到参数指示以下至少一项:迟到视频帧的数量,迟到视频帧所占的比例,一个迟到视频帧的实际接收时刻与正确接收时刻间的时间差,多个迟到视频帧的实际接收时刻与正确接收时刻间时间差的平均值,多个迟到视频帧中实际接收时刻与正确接收时刻间时间差的最大值,多个迟到视频帧中实际接收时刻与正确接收时刻间时间差的最小值,多个迟到视频帧中实际接收时刻与正确接收时刻时间差的方差。
- 如权利要求11至15任一项所述的方法,其特征在于,每个视频帧包括基本层数据和增强层数据;所述基本层和增强层参数包括以下至少一项:第二视频帧中,所述终端设备接收基本层数据和增强层数据的时间差,所述第二视频 帧为所述至少一个视频帧中的视频帧;第三视频帧的数量,第三视频帧的比例,或第三视频帧中增强层丢包的数量中的至少一项,所述第三视频帧为基本层数据没有丢包,增强层数据丢包的视频帧,且所述第三视频帧为所述至少一个视频帧中的视频帧;第四视频帧的数量,第四视频帧的比例,或第四视频帧中基本层丢包的数量中的至少一项,所述第四视频帧为增强层数据没有丢包,基本层数据丢包的视频帧,且所述第四视频帧为所述至少一个视频帧中的视频帧。
- 如权利要求10至16任一项所述的方法,其特征在于,所述视频帧经过网络编码后,包括一个或多个数据分片,所述视频帧参数还包括以下至少一项:所述终端设备成功解码一个视频帧所需要接收的数据分片的数量和/或所述数据分片的数据量;所述终端设备成功解码一个视频帧的基本层所需要接收的数据分片的数量和/或所述数据分片的数据量;所述终端设备成功解码一个视频帧的增强层所需要接收的数据分片的数量和/或所述数据分片的数据量。
- 一种装置,其特征在于,包括用于执行权利要求1至9任一项所述的方法的各步骤的单元,或者包括用于执行权利要求10至17任一项所述的方法的各步骤的单元。
- 一种装置,其特征在于,包括至少一个处理器和接口电路,所述至少一个处理器用于通过所述接口电路与其它装置通信,并执行权利要求1至9任一项所述的方法,或者,执行权利要求10至17任一项所述的方法。
- 一种装置,其特征在于,包括处理器,用于调用存储器中存储的程序,以执行如权利要求1至9任一项所述的方法,或者,执行如权利要求10至17任一项所述的方法。
- 一种计算机可读存储介质,其特征在于,包括程序,当所述程序被处理器运行时,如权利要求1至9中任一项所述的方法被执行,或者,如权利要求10至17中任一项所述的方法被执行。
Priority Applications (4)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202080103785.9A CN116114254B (zh) | 2020-09-14 | 2020-09-14 | 一种通信方法及装置 |
| EP20952890.0A EP4199526A4 (en) | 2020-09-14 | 2020-09-14 | Communication method and apparatus |
| PCT/CN2020/115045 WO2022052102A1 (zh) | 2020-09-14 | 2020-09-14 | 一种通信方法及装置 |
| US18/180,879 US20230231787A1 (en) | 2020-09-14 | 2023-03-09 | Communication method and an apparatus |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| PCT/CN2020/115045 WO2022052102A1 (zh) | 2020-09-14 | 2020-09-14 | 一种通信方法及装置 |
Related Child Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US18/180,879 Continuation US20230231787A1 (en) | 2020-09-14 | 2023-03-09 | Communication method and an apparatus |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2022052102A1 true WO2022052102A1 (zh) | 2022-03-17 |
Family
ID=80630181
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/CN2020/115045 Ceased WO2022052102A1 (zh) | 2020-09-14 | 2020-09-14 | 一种通信方法及装置 |
Country Status (4)
| Country | Link |
|---|---|
| US (1) | US20230231787A1 (zh) |
| EP (1) | EP4199526A4 (zh) |
| CN (1) | CN116114254B (zh) |
| WO (1) | WO2022052102A1 (zh) |
Cited By (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2024067103A1 (zh) * | 2022-09-29 | 2024-04-04 | 中兴通讯股份有限公司 | 视频业务的通话质量确定方法及装置 |
Families Citing this family (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2022188726A1 (en) * | 2021-03-09 | 2022-09-15 | Mediatek Singapore Pte. Ltd. | Methods and apparatus for extended reality enhancement in mobile communications |
| US11963200B2 (en) * | 2021-03-09 | 2024-04-16 | Amdocs Development Limited | System, method, and computer program for resource orchestration in a 5G network based on user equipment behavior analystics |
| CN120282296A (zh) * | 2023-12-29 | 2025-07-08 | 荣耀终端股份有限公司 | 数据同步方法及设备 |
| US12556225B2 (en) * | 2024-03-13 | 2026-02-17 | Infineon Technologies Americas Corp. | Adaptive frequency hopping (AFH) channel classification algorithm for narrowband wireless radio in battery management system-like (BMS-like) environment |
Citations (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN108174234A (zh) * | 2018-01-12 | 2018-06-15 | 珠海全志科技股份有限公司 | 一种流媒体传输方法及系统 |
| CN108270738A (zh) * | 2016-12-30 | 2018-07-10 | 北京华为数字技术有限公司 | 一种视频处理方法及网络设备 |
| US20190174139A1 (en) * | 2017-12-06 | 2019-06-06 | Avago Technologies International Sales Pte. Limited | Video decoder rate model and verification circuit |
| US20190364302A1 (en) * | 2007-12-05 | 2019-11-28 | Sony Interactive Entertainment America Llc | System and method for remote-hosted video game streaming and feedback from client on received frames |
| CN110881136A (zh) * | 2019-11-14 | 2020-03-13 | 腾讯科技(深圳)有限公司 | 视频帧率控制方法、装置、计算机设备及存储介质 |
| CN111107440A (zh) * | 2019-12-26 | 2020-05-05 | 腾讯科技(深圳)有限公司 | 视频传输控制方法及装置、设备、存储介质 |
Family Cites Families (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US9325998B2 (en) * | 2003-09-30 | 2016-04-26 | Sharp Laboratories Of America, Inc. | Wireless video transmission system |
| CN102547376B (zh) * | 2010-12-28 | 2014-08-06 | 中国移动通信集团公司 | 一种视频数据传输方法、装置和系统 |
| US9154396B2 (en) * | 2013-03-14 | 2015-10-06 | Arris Technology, Inc. | Passive measurement of available link bandwidth |
| CN105916059A (zh) * | 2016-04-29 | 2016-08-31 | 北京奇虎科技有限公司 | 视频传输处理方法及装置 |
| CN106791910B (zh) * | 2016-12-02 | 2019-11-05 | 浙江宇视科技有限公司 | 视频帧处理方法及装置 |
| CN109151612B (zh) * | 2017-06-27 | 2020-10-16 | 华为技术有限公司 | 一种视频传输方法、设备及系统 |
-
2020
- 2020-09-14 CN CN202080103785.9A patent/CN116114254B/zh active Active
- 2020-09-14 EP EP20952890.0A patent/EP4199526A4/en active Pending
- 2020-09-14 WO PCT/CN2020/115045 patent/WO2022052102A1/zh not_active Ceased
-
2023
- 2023-03-09 US US18/180,879 patent/US20230231787A1/en active Pending
Patent Citations (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20190364302A1 (en) * | 2007-12-05 | 2019-11-28 | Sony Interactive Entertainment America Llc | System and method for remote-hosted video game streaming and feedback from client on received frames |
| CN108270738A (zh) * | 2016-12-30 | 2018-07-10 | 北京华为数字技术有限公司 | 一种视频处理方法及网络设备 |
| US20190174139A1 (en) * | 2017-12-06 | 2019-06-06 | Avago Technologies International Sales Pte. Limited | Video decoder rate model and verification circuit |
| CN108174234A (zh) * | 2018-01-12 | 2018-06-15 | 珠海全志科技股份有限公司 | 一种流媒体传输方法及系统 |
| CN110881136A (zh) * | 2019-11-14 | 2020-03-13 | 腾讯科技(深圳)有限公司 | 视频帧率控制方法、装置、计算机设备及存储介质 |
| CN111107440A (zh) * | 2019-12-26 | 2020-05-05 | 腾讯科技(深圳)有限公司 | 视频传输控制方法及装置、设备、存储介质 |
Non-Patent Citations (2)
| Title |
|---|
| MOTOROLA: "Video Services over LTE-Advanced", 3GPP DRAFT; R1-083233_LTE_VIDEO_SERVICE_MODEL_FINAL, 3RD GENERATION PARTNERSHIP PROJECT (3GPP), MOBILE COMPETENCE CENTRE ; 650, ROUTE DES LUCIOLES ; F-06921 SOPHIA-ANTIPOLIS CEDEX ; FRANCE, no. Jeju; 20080812, 12 August 2008 (2008-08-12), Mobile Competence Centre ; 650, route des Lucioles ; F-06921 Sophia-Antipolis Cedex ; France , XP050316648 * |
| See also references of EP4199526A4 * |
Cited By (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2024067103A1 (zh) * | 2022-09-29 | 2024-04-04 | 中兴通讯股份有限公司 | 视频业务的通话质量确定方法及装置 |
Also Published As
| Publication number | Publication date |
|---|---|
| US20230231787A1 (en) | 2023-07-20 |
| CN116114254A (zh) | 2023-05-12 |
| EP4199526A4 (en) | 2023-08-23 |
| EP4199526A1 (en) | 2023-06-21 |
| CN116114254B (zh) | 2025-11-07 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| EP4391639A1 (en) | Communication method and apparatus | |
| JP6907444B2 (ja) | データ伝送方法、通信デバイス、端末、および基地局 | |
| US20230231787A1 (en) | Communication method and an apparatus | |
| CN113766567B (zh) | 一种通信方法及装置 | |
| EP4009696B1 (en) | Compression dictionary updates in wireless communications | |
| US20240031870A1 (en) | Media data transmission method and communication apparatus | |
| WO2021213000A1 (zh) | 媒体报文的传输方法、装置及系统 | |
| EP4287636B1 (en) | Video transmission method | |
| US20240305574A1 (en) | Quality of service qos management method and apparatus | |
| CN114007087B (zh) | 一种媒体流切换方法及装置 | |
| CN117768943A (zh) | 一种通信系统中传输数据的方法和通信装置 | |
| WO2022056863A1 (zh) | 一种切换方法及装置 | |
| CN115334574A (zh) | 应用层测量方法及相关的装置 | |
| US20250373362A1 (en) | Joint source-channel encoding method and system | |
| US20250227244A1 (en) | Communication method and apparatus | |
| US20250365615A1 (en) | Data processing method and apparatus | |
| US20240284258A1 (en) | Communication method and apparatus | |
| CN111052650A (zh) | 用于操作网络实体的方法、网络实体、用于操作用户设备的方法以及用户设备 | |
| US12058039B2 (en) | Packet validity time enhancement for quality of service flows | |
| US12615546B2 (en) | Method and apparatus of reporting buffer status | |
| WO2023109743A1 (zh) | 传输数据的方法和通信装置 | |
| CN116321475A (zh) | 传输数据的方法和通信装置 | |
| WO2025247218A1 (zh) | 一种通信方法、装置及系统 | |
| WO2024169477A1 (zh) | 上行链路控制信息的发送方法及装置 | |
| WO2025077451A1 (zh) | 一种缓存状态报告的上报方法及相关设备 |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 20952890 Country of ref document: EP Kind code of ref document: A1 |
|
| ENP | Entry into the national phase |
Ref document number: 2020952890 Country of ref document: EP Effective date: 20230314 |
|
| NENP | Non-entry into the national phase |
Ref country code: DE |
|
| WWG | Wipo information: grant in national office |
Ref document number: 202080103785.9 Country of ref document: CN |