WO2024197810A1 - 一种数据处理方法、模型的训练方法以及相关设备 - Google Patents
一种数据处理方法、模型的训练方法以及相关设备 Download PDFInfo
- Publication number
- WO2024197810A1 WO2024197810A1 PCT/CN2023/085467 CN2023085467W WO2024197810A1 WO 2024197810 A1 WO2024197810 A1 WO 2024197810A1 CN 2023085467 W CN2023085467 W CN 2023085467W WO 2024197810 A1 WO2024197810 A1 WO 2024197810A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- data
- machine learning
- learning model
- module
- sub
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/044—Recurrent networks, e.g. Hopfield networks
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
- G06N3/0455—Auto-encoder networks; Encoder-decoder networks
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L27/00—Modulated-carrier systems
- H04L27/26—Systems using multi-frequency codes
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L41/00—Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
- H04L41/14—Network analysis or design
- H04L41/145—Network analysis or design involving simulating, designing, planning or modelling of a network
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L41/00—Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
- H04L41/16—Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks using machine learning or artificial intelligence
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N20/00—Machine learning
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/0464—Convolutional networks [CNN, ConvNet]
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/048—Activation functions
Definitions
- the present application relates to the field of communications, and in particular to a data processing method, a model training method, and related equipment.
- AI Artificial Intelligence
- A is the theory, method, technology and application system that uses digital computers or machines controlled by digital computers to simulate, extend and expand human intelligence, perceive the environment, acquire knowledge and use knowledge to obtain the best results.
- Artificial Intelligence is also the study of the design principles and implementation methods of various intelligent machines, so that machines have the functions of perception, reasoning and decision-making.
- the data to be processed can be input into a machine learning model to obtain processed data output by the model, wherein the processed data includes T sub-data, where T is an integer greater than or equal to 1.
- T can be flexibly determined according to actual conditions.
- the number of output channels of each machine learning model is fixed, which means that the machine learning model can only output a fixed number of sub-data.
- T changes another machine learning model needs to be used for processing, which requires storing multiple machine learning models, resulting in a large storage space overhead. Therefore, a machine learning model that is compatible with multiple values of T is urgently needed.
- the embodiments of the present application provide a data processing method, a model training method, and related equipment.
- a module in a first machine learning model is called at least once, a sub-data can be obtained.
- the number of calls of the module in the first machine learning model can be flexibly adjusted according to the value of T to generate T sub-data. In this way, the first machine learning model can be compatible with multiple values of T, and there is no need to store multiple machine learning models, thereby reducing the storage space overhead.
- the first aspect of the present application provides a data processing method, which can apply artificial intelligence technology to the field of communications.
- the method is applied to the first device side.
- the first device can be a device or a component that can be configured in the device (such as a chip, a chip system, etc.).
- the method includes: the first device obtains the value of T, T represents the number of sub-data included in the output data of the first machine learning model, and T is an integer greater than or equal to 1; the first device inputs the first data into the first machine learning model to obtain the second data generated by the first machine learning model, and the second data includes T sub-data.
- the first machine learning model includes one or more modules, and each time a module in the first machine learning model is called at least once, a sub-data is obtained; illustratively, each time a module in the first machine learning model is called, one module in the first machine learning model can be called, or multiple modules can be called.
- a sub-data can be obtained by calling a module in the first machine learning model at least once, after obtaining the value of T, the number of calls to the module in the first machine learning model can be flexibly adjusted according to the value of T to generate T sub-data, so that the first machine learning model can be compatible with multiple values of T, and it is no longer necessary to store multiple machine learning models, reducing the storage space overhead.
- the function of the first machine learning model includes any one or more of the following combinations: encoding, modulation, and generating a reference signal.
- the first data is the data that needs to be encoded
- the second data is the encoded data.
- the first data is the data that needs to be modulated
- the second data is the modulated data.
- the function of the first machine learning model is to generate a reference signal
- the first data may be the index number of multiple reference signals
- the second data may be a reference signal.
- the function of the first machine learning model is encoding and modulation
- the first data is the data that needs to be encoded and modulated
- the second data is the encoded and modulated data, etc.
- the multiple modules in the first machine learning model include a first module and at least one second module, and the first device inputs the first data into the first machine learning model to obtain the second data output by the first machine learning model, including: the first device inputs the first data into the first module to obtain the first sub-data generated by the first module, and the first sub-data is one of the T sub-data; the first feature information of the first data is input into the second module to obtain the second sub-data generated by the second module, and the second sub-data is one of the T sub-data.
- the first feature information includes the feature information generated when the module in the first machine learning model was last called for data processing; illustratively, the first feature information can be the feature information generated when the first module in the first machine learning model was last called for data processing, or it can be the feature information generated when the second module in the first machine learning model was last called for data processing.
- the first machine learning model includes a first module and at least one second module.
- the input of the first module of the first machine learning model is the entire first data
- the input of the second module is the feature information obtained when the module of the first machine learning model was called last time. Then, the input when the second module is called for the first time is the feature information obtained when the entire first data is processed by the first module.
- the multiple modules in the first machine learning model include a first module and at least one third module
- the first device inputs the first data into the first machine learning model to obtain the second data output by the first machine learning model, including: the first device inputs the first data into the first module, generates the first sub-data through the first module, the first sub-data is one of T sub-data, and the process of generating the first sub-data through the first module includes extracting features from the first data, that is, the feature information of the first data can be obtained in the process of generating the first sub-data through the first module; the first device calls the third module multiple times to obtain the third sub-data generated by the third module, the third sub-data is one of T sub-data, wherein the input of the third module includes the feature information of the first data, and the feature information of the first data is updated multiple times in the process of calling the third module multiple times.
- the feature information of the first data input to the third module is obtained in the process of generating the first sub-data by the first module; when the first device calls the third module for the second time and thereafter, the feature information of the first data input to the third module is obtained in the process of calling the third module last time.
- a third sub-data is generated according to the last updated feature information of the first data. Afterwards, it is helpful to have a more thorough understanding of the first data, thereby generating sub-data with better performance.
- the first device inputs the first feature information into the second module to obtain second sub-data generated by the second module, including: the first device linearly transforms the first feature information through the second module, and processes it with a first activation function to obtain the transformed feature information; linearly transforms the transformed feature information, and processes it with a second activation function to obtain the second sub-data.
- the above method since the above method is simple and easy to implement, it is not only beneficial to reduce the computer resources consumed in the process of generating the second data; and the number of parameters used in the second module shown in the above method is relatively small, which is beneficial to reduce the communication resources consumed when transmitting the parameters of the first machine learning model.
- At least one second module includes a plurality of second modules, wherein at least two of the plurality of second modules use different parameters; that is, the first device can generate a second sub-data each time it calls the second module, but different second modules may be called in the process of generating T-1 second data.
- the meaning of "two second modules using different parameters" may include any of the following differences: the same type of parameters are used in the two second modules, but the parameter values used in the two second modules are not exactly the same; or the types of parameters used in the two second modules are not exactly the same, etc.
- multiple second modules can be used in the first machine learning model, and at least two of the multiple second modules use different parameters, that is, T-1 second sub-data are generated by different second modules, which is beneficial to the matching between the parameters of the second module and the generated second sub-data, and thus is beneficial to obtaining second data with better performance.
- the first device inputs the first data into the first module to obtain the first sub-data generated by the first module, which may include: the first device obtains the first sub-data generated by the first module by calling the first module once or multiple times.
- each time the first device calls the first module to process the input data it may include: the first device linearly transforms the input data through the first module, and processes it with a third activation function to obtain the transformed input data; linearly transforms the transformed input data, and processes it with a fourth activation function to obtain the processing result of the first module.
- the input data of the first module may be the first data or the feature information of the first data.
- the method before the first device inputs the first data into the first machine learning model, the method further includes: the first device obtains the data to be processed and the value of H, where H is an integer greater than or equal to 1, and H indicates the length of the first data; if the length of the data to be processed is less than H, the data to be processed is padded to obtain the first data, and the length of the first data is H.
- the data to be processed is padded to obtain the first data with a length of H, and then the first data with a length of H is input into the first machine learning model, so that no matter how long the data to be processed is, the first machine learning model processes the first data with a length of H, which is not only conducive to compatibility with data to be processed of any length, but also conducive to reducing the difficulty of the first machine learning model in data processing to obtain second data with better performance.
- the first data includes data to be processed and padding data
- the padding data includes first identification information
- the first identification information is used to identify the value of T and/or the value of K
- K is the length of the data to be processed
- K is an integer greater than or equal to 1
- the first identification information can be used to identify the value of T and the value of K, and can also be used to identify the value of T, and can also be used to identify the value of K.
- the first device can use the first function to process the value of T and/or the value of K to obtain the first identification information.
- the conditions that the first function needs to meet include: limiting the value of the first identification information within a preset range, and being able to map different T values and/or K values to different values, that is, the value generated by the first function can uniquely identify a certain T value and/or K value, or in other words, the value generated by the first function can distinguish different T values and/or K values.
- the first data carries first identification information for identifying the value of T and/or the value of K.
- the first machine learning model can process the first data according to the value of T and/or the value of K, that is, according to the length of the output data of the first machine learning model and/or the length of the actual data to be processed, and then the second data output by the first machine learning model is conducive to obtaining second data with better performance.
- the size of the parameters in the first machine learning model is related to the value of H and the value of G, where G is the length of each sub-data.
- the size of the parameters in the first machine learning model is designed according to the length of the first data and the length of each sub-data in the T sub-data, which is conducive to reducing the number of parameters in the first machine learning model while meeting the output requirements, and is conducive to further reducing the communication resources consumed by transmitting the parameters of the first machine learning model.
- the parameters corresponding to the first machine learning model and/or the identification information of the aforementioned parameters may be carried in signaling to enable the parameters corresponding to the first machine learning model and/or the identification information of the aforementioned parameters to be transmitted between different devices.
- the parameters corresponding to the first machine learning model are carried in one or more of the following information: downlink control information DCI, uplink control information UCI, sidelink control information SCI, radio resource control RRC signaling, or media access control control element MAC CE.
- the identification information of the parameters is carried in any one or more of the following information: DCI, UCI, SCI, RRC signaling, MAC CE, physical broadcast channel PBCH, or physical random access channel PRACH.
- transmitting the identification information of the aforementioned at least one set of parameters and/or each set of parameters in a signaling has higher transmission efficiency and consumes less computer resources; in addition, the present solution provides a variety of signaling that can be used to transmit the identification information of the aforementioned at least one set of parameters and/or each set of parameters, thereby improving the implementation flexibility of the present solution.
- the second device is a receiving end of the second data, and the second device contains multiple sets of parameters corresponding to the first machine learning model and identification information of each set of parameters, and the method further includes: the first device sends second identification information to the second device, and the second identification information is used to indicate a set of parameters adopted by the first machine learning model in the first device.
- the second device contains multiple sets of parameters of the first machine learning model and identification information of each set of parameters. The first device only needs to send the second identification information to the second device, and the second device can know which set of parameters is adopted by the first machine learning model in the first device, and the communication resources occupied by transmitting the second identification information are relatively small, which is conducive to reducing the consumed communication resources.
- the present application provides a data processing method that can apply artificial intelligence technology to the field of communications.
- the method is applied to the second device side.
- the second device can be a device or a component that can be configured in the device (such as a chip, a chip system, etc.).
- the method includes: the second device obtains the second data, and then generates the first data based on the second data.
- the second data includes T sub-data, T is an integer greater than or equal to 1, and the second data is generated by a first machine learning model in the first device.
- the first machine learning model includes one or more modules, and each module in the first machine learning model is called at least once to obtain a sub-data.
- the second device can denoise the received signal and obtain the received second data (that is, the estimated first sub-data) from the denoised received signal.
- the received second data that is, the estimated first sub-data
- the denoised received signal is demodulated to obtain the received second data; for another example, if the second data is modulated data, that is, the function of the first machine learning model is modulation, or the function of the first machine learning model is encoding and modulation, then the denoised received signal can be directly determined as the second data.
- the parameters corresponding to the first machine learning model are carried in one or more of the following information: downlink control information DCI, uplink control information UCI, sidelink control information SCI, radio resource control RRC signaling, or media access control control element MAC CE; and/or, the identification information of the parameters is carried in any one or more of the following information: DCI, UCI, SCI, RRC signaling, MAC CE, physical broadcast channel PBCH, or physical random access channel PRACH.
- the second device has third data
- the third data includes multiple groups of parameters corresponding to the first machine learning model and identification information of each group of parameters
- the method further includes: the second device receives second identification information sent by the first device; and determines a group of parameters used by the first machine learning model in the first device according to the second identification information and the third data.
- the second device generates the first data according to the second data, including: the second device can demodulate and/or decode the received second data according to a group of parameters used by the first machine learning model in the first device to generate estimated first data.
- the present application provides a model training method, which can apply artificial intelligence technology to the field of communications.
- the method is applied to a training device, which can be a device or a component that can be configured in the device (such as a chip, a chip system, etc.), and the method includes: the training device obtains training data from a training data set, wherein the training data is used to obtain the value of the first data and T, and T is an integer greater than or equal to 1; illustratively, the training data may include the value of the data to be processed and T, and the data to be processed is used to obtain the first data, for example, the data to be processed is the same as the first data, or the first data is obtained after the data to be processed is filled; T is used to indicate the number of sub-data included in the output data of the first machine learning model, and at least two training data in the training data set include different values of T.
- the training device inputs the first data into the first machine learning model to obtain the second data generated by the first machine learning model, and the second data includes T sub-data, wherein the first machine learning model includes multiple modules, and each module in the first machine learning model is called at least once to obtain a sub-data generated by the module; based on the second data and the loss function, the first machine learning model is trained to obtain the trained first machine learning model.
- the second data is used to determine the signal to be sent
- the training device trains the first machine learning model based on the second data and the loss function, including: the training device obtains a received signal corresponding to the signal to be sent, and demodulates and/or decodes the received signal corresponding to the signal to be sent to obtain estimated data corresponding to the data to be processed; the training device trains the first machine learning model according to the estimated data and the loss function, and the loss function indicates the similarity between the estimated data and the data to be processed.
- the training device obtaining the received signal corresponding to the signal to be sent may include: the training device multiplies the signal to be sent by the channel matrix, and adds the multiplication result to the noise to obtain the received signal, and the above steps are to simulate the process of the signal to be sent being transmitted through the channel. Alternatively, the above steps are performed by two training devices. If the combination is completed, the training device obtaining the received signal corresponding to the signal to be sent may include: the first training device sends the signal to be sent to the second training device, and the second training device receives the received signal.
- a specific implementation method for training the first machine learning model is provided when the functions of the first machine learning model include encoding and/or modulation, which reduces the difficulty of implementing this solution, and the loss function uses the similarity between the estimated data and the data to be processed, that is, the goal of the loss function is to obtain estimated data with better performance.
- the loss function is more in line with the actual needs when sending data between devices, and the second data output by the trained first machine learning model is more in line with actual needs.
- the training device trains the first machine learning model based on the second data and a loss function, including: the training device obtains a received reference signal corresponding to the reference signal, and generates predicted channel information according to the received reference signal corresponding to the reference signal; the training device trains the first machine learning model according to the loss function, and the loss function indicates the similarity between the predicted channel information and the correct channel information.
- the training device acquiring the received reference signal corresponding to the reference signal may include: the training device multiplies the reference signal by a channel matrix, and adds the result of the multiplication to noise to obtain the received reference signal, and the aforementioned steps are to simulate the process of the reference signal being transmitted through a channel.
- the aforementioned steps are completed by two training devices in cooperation, and the training device acquiring the received reference signal corresponding to the reference signal may include: the first training device sends the reference signal to the second training device, and the second training device receives the received reference signal.
- a specific implementation method for training the first machine learning model when the function of the first machine learning model is to generate a reference signal is also provided, which expands the application scenarios of this solution and improves the implementation flexibility of this solution.
- the training device can also be used to execute the steps performed by the first device in the first aspect and various possible implementation methods of the first aspect.
- the specific implementation methods, meanings of terms and beneficial effects brought about by the steps in various possible implementation methods of the third aspect can all be referred to the first aspect and will not be repeated here.
- the present application provides a data processing device that can apply artificial intelligence technology to the field of communications, the data processing device comprising a processing module; wherein the processing module is used to obtain a value of T, where T is an integer greater than or equal to 1, and T represents the number of sub-data included in the output data of the first machine learning model;
- the processing module is also used to input the first data into the first machine learning model to obtain second data generated by the first machine learning model, where the second data includes T sub-data, wherein the first machine learning model includes one or more modules, and each time a module in the first machine learning model is called at least once, one sub-data is obtained.
- the functionality of the first machine learning model includes any one or a combination of the following: encoding, modulating, or generating a reference signal.
- the multiple modules in the first machine learning model include a first module and at least one second module
- the processing module is specifically used to: input the first data into the first module to obtain first sub-data generated by the first module, and the first sub-data is one of T sub-data; input the first feature information into the second module to obtain second sub-data generated by the second module, wherein the first feature information includes the feature information generated when the module in the first machine learning model was last called for data processing, the second sub-data is one of the T sub-data, and the module in the first machine learning model that was last called is the first module or the second module.
- the multiple modules in the first machine learning model include a first module and at least one third module
- the processing module is specifically used to: input the first data into the first module, generate first sub-data through the first module, the first sub-data is one of T sub-data, and the process of generating the first sub-data through the first module includes extracting features of the first data; call the third module multiple times to obtain third sub-data generated by the third module, the third sub-data is one of T sub-data, wherein the input of the third module includes feature information of the first data, and the feature information of the first data is updated multiple times in the process of calling the third module multiple times.
- the processing module is specifically used to: perform a linear transformation on the first feature information through the second module, and process it with a first activation function to obtain the transformed feature information; perform a linear transformation on the transformed feature information, and process it with a second activation function to obtain second sub-data.
- the at least one second module includes a plurality of second modules, wherein parameters adopted by at least two second modules among the plurality of second modules are different.
- the processing module is further used to obtain the value of the data to be processed and H, where H is an integer greater than or equal to 1, and H indicates the length of the first data; the processing module is further used to pad the data to be processed if the length of the data to be processed is less than H to obtain the first data, and the length of the first data is H.
- the first data includes data to be processed and padding data
- the padding data includes first identification information
- the first identification information is used to identify the value of T and/or the value of K
- K is the length of the data to be processed
- K is an integer greater than or equal to 1.
- the size of the parameters in the first machine learning model is related to the value of H and the value of G, where G is the length of each sub-data.
- the parameters corresponding to the first machine learning model are carried in one or more of the following information: downlink control information DCI, uplink control information UCI, sidelink control information SCI, radio resource control RRC signaling, or media access control control element MAC CE; and/or, the identification information of the parameters is carried in any one or more of the following information: DCI, UCI, SCI, RRC signaling, MAC CE, physical broadcast channel PBCH, or physical random access channel PRACH.
- a data processing device is applied to a first device, and the second device is a receiving end of second data.
- the second device contains multiple groups of parameters corresponding to the first machine learning model and identification information of each group of parameters.
- the data processing device also includes: a transceiver module, which is used to send second identification information to the second device, and the second identification information is used to indicate a set of parameters adopted by the first machine learning model in the first device.
- the present application provides a data processing device that can apply artificial intelligence technology to the field of communications, wherein the data processing device includes a processing module; wherein the processing module is used to obtain second data; and generate first data based on the second data.
- the second data includes T sub-data, where T is an integer greater than or equal to 1, and the second data is generated by a first machine learning model in a first device, and the first machine learning model includes one or more modules, and each module in the first machine learning model is called at least once to obtain a sub-data.
- the parameters corresponding to the first machine learning model are carried in one or more of the following information: downlink control information DCI, uplink control information UCI, sidelink control information SCI, radio resource control RRC information Command or media access control control element MAC CE; and/or, the identification information of the parameter is carried in any one or more of the following information: DCI, UCI, SCI, RRC signaling, MAC CE, physical broadcast channel PBCH or physical random access channel PRACH.
- the data processing device is applied to a second device, the second device has third data, the third data includes multiple groups of parameters corresponding to the first machine learning model and identification information of each group of parameters, and the data processing device further includes: a transceiver module, which is used to receive the second identification information sent by the first device; a processing module, which is also used to determine a group of parameters used by the first machine learning model in the first device according to the second identification information and the third data.
- the processing module is specifically used to generate the first data according to a group of parameters used by the first machine learning model in the first device and the second data.
- the present application provides a model training device that can apply artificial intelligence technology to the field of communications, and the model training device includes a processing module; wherein the processing module is used to obtain training data from a training data set, wherein the training data is used to obtain first data and T values, T is an integer greater than or equal to 1, and at least two training data in the training data set include different values of T; the processing module is also used to input the first data into a first machine learning model to obtain second data generated by the first machine learning model, the second data includes T sub-data, wherein the first machine learning model includes multiple modules, and each module in the first machine learning model is called at least once to obtain a sub-data generated by the module; the processing module is also used to train the first machine learning model based on the second data and the loss function to obtain the trained first machine learning model.
- the functionality of the first machine learning model includes any one or a combination of the following: encoding, modulating, or generating a reference signal.
- the multiple modules in the first machine learning model include a first module and at least one second module
- the processing module is specifically used to: input the first data into the first module to obtain first sub-data generated by the first module, and the first sub-data is one of T sub-data; input the first feature information into the second module to obtain second sub-data generated by the second module, wherein the first feature information includes the feature information generated when the module in the first machine learning model was last called for data processing, the second sub-data is one of the T sub-data, and the module in the first machine learning model that was last called is the first module or the second module.
- the processing module is further used to obtain the data to be processed from the training data; the processing module is further used to obtain the value of H, where H is an integer greater than or equal to 1, and H indicates the length of the first data; the processing module is further used to pad the data to be processed if the length of the data to be processed is less than H to obtain the first data, and the length of the first data is H.
- the second data is used to determine the signal to be sent
- the processing module is specifically used to: demodulate and/or decode the received signal corresponding to the signal to be sent to obtain estimated data corresponding to the data to be processed; train the first machine learning model according to the estimated data and the loss function, and the loss function indicates the similarity between the estimated data and the data to be processed.
- the processing module when the second data is a reference signal, based on the second data and the loss function, the processing module is specifically used to: generate predicted channel information according to a received reference signal corresponding to the reference signal; train the first machine learning model according to the loss function, the loss function indicating the predicted channel information and the correct channel information. The similarity between information.
- the present application provides a communication system that can apply artificial intelligence technology to the field of communications.
- the communication system may include a data processing device as in the fourth aspect and a data processing device as in the fifth aspect.
- the communication system further includes a training device for the model as in the fifth aspect.
- the present application provides a data processing method that can apply artificial intelligence technology to the field of communications, the method comprising: a third device obtains a first signaling, wherein the first signaling carries at least one set of parameters adopted by a first machine learning model and indication information corresponding to each set of parameters, the indication information is used to indicate the position of multiple parameters included in each set of parameters in the first machine learning model; and sends the first signaling to the first device.
- the third device and the second device can be the same device or different devices, which is not limited in the present application.
- the signaling when at least one set of parameters of the first machine learning model is transmitted via signaling, the signaling not only carries the aforementioned at least one set of parameters, but also carries indication information corresponding to each set of parameters, and the indication information is used to indicate the positions of multiple parameters included in each set of parameters in the first machine learning module.
- the first device After receiving the signaling, the first device can understand how to use the parameters carried in the signaling, and transmit at least one set of parameters of the first machine learning model by means of signaling, which is conducive to reducing the communication resources consumed in the parameter transmission process and improving the efficiency of the parameter transmission process.
- the first signaling is any one of the following: downlink control information DCI, uplink control information UCI, sidelink control information SCI, radio resource control RRC signaling, or media access control control element MAC CE.
- the present application provides a data processing method that can apply artificial intelligence technology to the field of communications, and the method includes: a first device receives a first signaling, wherein the first signaling carries at least one set of parameters adopted by a first machine learning model and indication information corresponding to each set of parameters, and the indication information is used to indicate the positions of multiple parameters included in each set of parameters in the first machine learning model.
- the first signaling is any one of the following: downlink control information DCI, uplink control information UCI, sidelink control information SCI, radio resource control RRC signaling, or media access control control element MAC CE.
- an embodiment of the present application provides a device, comprising at least one processor, at least one processor coupled to a memory, the memory being used to store programs or instructions; at least one processor being used to execute programs or instructions, so that the aforementioned device executes the method in any of the above aspects.
- an embodiment of the present application provides a computer-readable storage medium, in which a computer program is stored.
- the computer-readable storage medium is run on a computer, the computer executes the method in any of the above aspects.
- an embodiment of the present application provides a computer program product, which includes a program.
- the program When the program is run on a computer, the computer executes the method in any of the above aspects.
- the present application provides a chip system, which includes a processor for supporting a communication device to implement the functions involved in the above aspects, for example, sending or processing the data and/or information involved in the above methods.
- the chip system also includes a memory, which is used to store program instructions and data necessary for the communication device.
- the chip system can be composed of a chip, or it can include a chip and other discrete devices.
- FIG1 is a schematic diagram of an architecture of a wireless communication system provided in an embodiment of the present application.
- FIG2 is another schematic diagram of the architecture of a wireless communication system provided in an embodiment of the present application.
- FIG3 is a flow chart of a data processing method provided in an embodiment of the present application.
- FIG4 is another schematic diagram of a data processing method provided in an embodiment of the present application.
- FIG5 is a schematic diagram of a flow chart of a first device and a second device determining a set of parameters used by a first machine learning model according to an embodiment of the present application;
- FIG6 is a schematic diagram of a process for a first device according to an embodiment of the present application to obtain a set of parameters used by a first machine learning model
- FIG7 is a schematic diagram of first data provided in an embodiment of the present application.
- FIG8 is a schematic diagram of generating T sub-data using a first machine learning model according to an embodiment of the present application.
- FIG9 is a schematic diagram of a model training method provided in an embodiment of the present application.
- FIG10 is a schematic diagram of a structure of a data processing device provided in an embodiment of the present application.
- FIG11 is another schematic diagram of the structure of a data processing device provided in an embodiment of the present application.
- FIG12 is a schematic diagram of a training device for a model provided in an embodiment of the present application.
- FIG13 is a schematic diagram of a device provided in an embodiment of the present application.
- FIG14 is another schematic diagram of a device provided in an embodiment of the present application.
- FIG. 15 is a schematic diagram of the structure of a chip provided in an embodiment of the present application.
- first”, second, etc. in the specification and claims of this application and the above-mentioned drawings are used to distinguish similar objects (for example, to distinguish objects in the same embodiment), and are not necessarily used to describe a specific order or sequence.
- the objects defined by “first”, “second”, etc. may refer to different objects. It should be understood that the data used in this way can be interchangeable under appropriate circumstances so that the embodiments described herein can be implemented in an order other than that illustrated or described herein.
- Send and “receive” in the embodiments of the present application indicate the direction of signal transmission.
- send information to XX device can be understood as the destination of the information is XX device, which can include direct transmission through the air interface, and also include indirect transmission through the air interface by other units or modules.
- Receiveive information from YY device can be understood as the source of the information is YY device. The device may include receiving directly from the YY device through the air interface, or indirectly receiving from the YY device through the air interface from other units or modules.
- Send can also be understood as the "output” of the chip interface, and “receiving” can also be understood as the "input” of the chip interface.
- sending and receiving can be performed between devices or within a device, for example, sending or receiving between components, modules, chips, software modules or hardware modules within the device through a bus, wiring or interface. It is understandable that the information may be subjected to necessary processing between the source and destination of the information transmission, such as encoding, modulation, etc., but the destination can understand the valid information from the source. Similar expressions in this application can be understood similarly and will not be repeated.
- indication may include direct indication and indirect indication, and may also include explicit indication and implicit indication.
- the information indicated by a certain information is called information to be indicated.
- information to be indicated there are many ways to indicate the information to be indicated, such as but not limited to, the information to be indicated can be directly indicated, such as the information to be indicated itself or the index of the information to be indicated.
- the information to be indicated can also be indirectly indicated by indicating other information, wherein there is an association relationship between the other information and the information to be indicated; it is also possible to indicate only a part of the information to be indicated, while the other parts of the information to be indicated are known or agreed in advance, for example, the indication of specific information can be realized by means of the arrangement order of each information agreed in advance (such as predefined by the protocol), thereby reducing the indication overhead to a certain extent.
- the present application does not limit the specific method of indication. It can be understood that for the sender of the indication information, the indication information can be used to indicate the information to be indicated, and for the receiver of the indication information, the indication information can be used to determine the information to be indicated.
- the present application can apply artificial intelligence technology to the field of communications, and optionally, can apply artificial intelligence technology to the application scenario of signal transmission.
- the machine learning model can be used to perform any one or more of the following tasks: encoding, modulation, generating reference signals or other tasks in the field of communications.
- Figure 1 is a schematic diagram of the architecture of a wireless communication system provided by an embodiment of the present application.
- the method provided by the present application can be applied to a wireless communication system.
- the wireless communication system includes a network device 101 and a mobile station (MS) 102.
- MS mobile station
- a wireless connection can be established between the network device 101 and each terminal device, and a wireless connection can also be established between each terminal device.
- the network device 101 may refer to a device that provides wireless access services in a wireless network.
- the network device 101 may be a device that connects the mobile station 102 to the wireless network, and may also be called a base station; the aforementioned base station may be various forms of macro base stations, micro base stations, relay stations or access points, etc.
- the names of the network devices 101 having base station functions may be different.
- the base station may be called an evolved Node B (eNB), a Node B (NB), the next generation Node B (gNB) in the fifth generation (5G) communication system, a home base station (e.g., home evolved Node B, or home Node B, HNB), a base band unit (BBU), a wireless fidelity (Wi-Fi) access point (AP), a transmission reception point (TRP) or a radio network controller (RNC), etc.
- eNB evolved Node B
- NB next generation Node B
- gNB next generation Node B
- 5G fifth generation
- a home base station e.g., home evolved Node B, or home Node B, HNB
- BBU base band unit
- Wi-Fi wireless fidelity
- AP transmission reception point
- TRP transmission reception point
- RNC radio network controller
- multiple network nodes collaborate to assist in achieving wireless access, and different network nodes respectively implement part of the functions of a base station.
- a network node may be a central unit (CU), a distributed unit (DU), a CU-control plane (CP), a CU-user plane (UP), or a radio unit (RADIO).
- CU and DU may be set separately, or may be included in the same network element, such as a baseband unit (BBU).
- BBU baseband unit
- RU may be included in a radio frequency device or a radio frequency unit, such as a remote radio unit (RRU), an active antenna unit (AAU), or a remote radio head (RRH).
- RRU remote radio unit
- AAU active antenna unit
- RRH remote radio head
- CU or CU-CP and CU-UP
- DU or RU may also have different names, but those skilled in the art may understand their meanings.
- CU may also be referred to as an open CU (O-CU)
- DU may also be referred to as an open DU (O-DU)
- CU-CP may also be referred to as an open CU-CP (O-CU-CP)
- CU-UP may also be referred to as an open CU-UP (O-CU-UP)
- RU may also be referred to as an open RU (O-RU).
- any unit among CU (or CU-CP, CU-UP), DU and RU may be implemented by a software module, a hardware module, or a combination of a software module and a hardware module.
- the embodiment of the present application does not limit the specific device form of the network device 101.
- the mobile station 102 refers to a wireless terminal device that can receive scheduling information and indication information sent by the network device 101.
- the mobile station 102 can be a handheld device with wireless communication function, a vehicle-mounted device, a wearable device, a computing device, or other processing device connected to a wireless modem.
- the mobile station 102 can communicate with one or more core networks or the Internet via a wireless access network (RAN).
- RAN wireless access network
- the mobile station 102 can be a portable, pocket-sized, handheld, computer-built-in, or vehicle-mounted mobile device that exchanges voice and/or data with the wireless access network.
- the mobile station 102 can be a user agent, a cellular phone, a smart phone, a personal digital assistant (PDA), a tablet computer (Tablet Personal Computer, Tablet PC), a wireless modem, a handheld device (handset), a laptop computer, a personal communication service (PCS) phone, a remote station (remote station), an access point (access point, AP), a remote terminal equipment (remote terminal), an access terminal equipment (access terminal), a customer premises equipment (customer premises equipment, CPE), a terminal (terminal), a user equipment (user equipment, UE) or a mobile terminal (mobile terminal, MT), etc.
- PDA personal digital assistant
- Tablet PC Tablet PC
- PCS personal communication service
- the mobile station 102 may also be a wearable device, which is a general term for wearable devices that are intelligently designed and developed using wearable technology for daily wear, such as glasses, gloves, watches, clothing, and shoes.
- a wearable device is a portable device that is worn directly on the body or integrated into the user's clothes or accessories. Wearable devices are not just hardware devices, but also achieve powerful functions through software support, data interaction, and cloud interaction.
- wearable smart devices include those that are fully functional, large in size, and can achieve complete or partial functions without relying on smartphones, such as smart watches or smart glasses, as well as those that only focus on a certain type of application function and need to be used in conjunction with other devices such as smartphones, such as various types of smart bracelets, smart helmets, and smart jewelry for vital sign monitoring.
- the mobile station 102 may also be a drone, a robot, a terminal device in device-to-device (D2D) communication, a terminal device in vehicle to everything (V2X), a virtual reality (VR) device, an augmented reality (AR) device, a wireless terminal in industrial control, a terminal device in self driving, a terminal device in remote medical, a terminal device in a smart grid, a wireless terminal in a smart city, a terminal device in a smart home, etc.
- D2D device-to-device
- V2X vehicle to everything
- VR virtual reality
- AR augmented reality
- the mobile station 102 may also be a communication system after the 5G communication system (for example, the sixth generation (6th generation, The embodiment of the present application does not limit the device form of the mobile station 102, such as a terminal device in a future-developed public land mobile network (PLMN), or a terminal device in a future-developed public land mobile network (PLMN).
- a communication system after the 5G communication system for example, the sixth generation (6th generation.
- PLMN public land mobile network
- PLMN public land mobile network
- the network device 101 can send downlink data to each terminal device, or each terminal device can also send uplink data to the network device 101; the network device 101 or each terminal device may use a machine learning model in the process of sending data, and the data processing method provided in this application can be adopted.
- each terminal device can also send data to each other.
- Each terminal device may use a machine learning model in the process of sending data, so the data processing method provided in this application can be adopted.
- FIG 2 is another architectural diagram of the wireless communication system provided in the embodiment of the present application.
- various smart home products are connected through a wireless network to enable data to be transmitted between smart home products.
- these smart home products are all connected to the same wireless network through a wireless router, thereby enabling data interaction between various smart home products.
- other types of smart home products may also be included in actual applications, such as smart refrigerators, smart range hoods, smart curtains, and other smart home products. This embodiment does not limit the types of smart home products.
- smart home products can also be directly connected wirelessly without being connected to the same wireless network through a wireless router.
- the smart home products can be connected wirelessly through Bluetooth.
- the method provided in the embodiments of the present application can also be applied to other communication system scenarios.
- different devices such as intelligent robots, lathes, handling vehicles and other equipment
- a wireless network and transmit data to each other via the wireless network.
- the embodiments of the present application do not limit the specific scenarios in which the data processing method is applied.
- the wireless communication systems mentioned in the embodiments of the present application include but are not limited to: fifth generation mobile communication technology (5th Generation Mobile Communication Technology, 5G) communication system, 6G communication system, satellite communication system, short-range communication system, narrowband Internet of Things system (Narrow Band-Internet of Things, NB-IoT), Global System for Mobile Communications (Global System for Mobile Communications, GSM), Enhanced Data rate for GSM Evolution (Enhanced Data rate for GSM Evolution, EDGE), Wideband Code Division Multiple Access system (Wideband Code Division Multiple Access, WCDMA), Code Division Multiple Access 2000 system (Code Division Multiple Access, CDMA2000), Time Division-Synchronization Code Division Multiple Access system (Time Division-Synchronization Code Division Multiple Access, TD-SCDMA) and Long Term Evolution (LTE) system and other communication systems.
- 5G Fifth Generation Mobile Communication Technology
- 6G communication system 6G communication system
- satellite communication system short-range communication system
- narrowband Internet of Things system Narrow Band-Internet of Things, NB-IoT
- GSM Global System for
- Figure 3 is a flow chart of the data processing method provided by an embodiment of the present application. As shown in Figure 3, 301.
- the first device obtains the value of T, where T is an integer greater than or equal to 1, and T represents the number of sub-data included in the output data of the first machine learning model. 302.
- the first device inputs the first data into the first machine learning model to obtain the second data generated by the first machine learning model, and the second data includes T sub-data; wherein the first machine learning model includes one or more modules, and each time a module in the first machine learning model is called at least once, one of the T sub-data is obtained; exemplarily, each time a module in the first machine learning model is called, the first device can be called.
- a module in a machine learning model can also call multiple modules.
- the first device may be any device that needs to send data in the above-mentioned multiple application scenarios; for example, the first device may be the network device 101 or the mobile station 102 in Figure 1; for another example, the first device may be the smart home or wireless router in Figure 2; or, the first device may also be other devices that need to send data, etc.
- the form of the first device is not limited in the embodiments of the present application.
- the function of the first machine learning model includes any one or more of the following combinations: encoding, modulation, generating reference signals or other functions.
- the first data is the data to be encoded
- the second data is the encoded data.
- the function of the first machine learning model is modulation
- the first data is the data to be modulated
- the second data is the modulated data.
- the first data may be the index number of multiple reference signals
- the second data may be a reference signal.
- the first data is the data to be encoded and modulated
- the second data is the encoded and modulated data, etc.
- the first data and the second data may be expressed as other types of data, etc., which is not limited in the embodiments of the present application.
- the number of calls of the module in the first machine learning model can be flexibly adjusted according to the value of T to generate T sub-data, so that the first machine learning model can be compatible with multiple values of T, and there is no need to store multiple machine learning models, thereby reducing the storage space overhead.
- the detailed implementation process of the reasoning phase of the above-mentioned first machine learning model is first introduced below, and then the detailed implementation process of the training phase of the first machine learning model is introduced.
- the "reasoning phase of the first machine learning model” is the process of using the first machine learning model to process data
- the "training phase of the first machine learning model” is the process of iteratively training the first machine learning model using training data.
- the process of iteratively training the first machine learning model is also the process of iteratively updating the parameters adopted by the first machine learning model.
- one or more sets of trained parameters corresponding to the first machine learning model can be obtained, and the aforementioned parameters obtained in the training phase will be used in the reasoning phase.
- FIG. 4 is another schematic diagram of a data processing method provided in an embodiment of the present application. As shown in FIG. 4 , the data processing method includes steps 401 to 411 .
- the first device obtains a set of parameters adopted by the first machine learning model.
- the first device before the first device uses the first machine learning model to process data, it is necessary to first determine a set of trained parameters adopted by the first machine learning model; exemplarily, the aforementioned set of trained parameters includes the parameters required by all modules in the first machine learning model.
- the first device can obtain a set of parameters used by the first machine learning model in a variety of ways.
- multiple sets of trained parameters of the first machine learning model and identification information of each set of trained parameters can be predefined.
- identification information of each set of trained parameters can also be referred to as the index number of each set of trained parameters.
- a group of parameters used in the first machine learning model includes the following multiple parameters as an example: U, W, ⁇ s , V, and ⁇ 0 .
- the parameters used in the process of extracting features from the input data of the first machine learning model to obtain feature information of the input data include U and ⁇ s
- the parameters used in the process of updating the feature information of the input data include W and ⁇ s
- the parameters used in the process of generating multiple sub-data in the output data of the first machine learning model according to the feature information of the input data include V and ⁇ 0 .
- Matrix-U0 represents a value of parameter U
- Matrix-W0 represents a value of parameter W
- Vector-s0 represents a value of parameter ⁇ s
- Matrix-V0 represents a value of parameter V
- Vector-o0 represents a value of parameter ⁇ 0
- Matrix-U0, Matrix-W0, Vector-s0, Matrix-V0 and Vector-o0 represent a set of parameters of the first machine learning model
- the index number "0" in the second row of Table 1 represents the identification information of the aforementioned set of parameters.
- the third and fourth rows in Table 1 can be understood by referring to the above explanation of the first row in Table 1. It should be noted that the examples in Table 1 are only for the convenience of understanding, and the correspondence between each set of parameters in the multiple sets of parameters of the first machine learning model and the identification information is not used to limit this solution.
- the at least one first preset indicator may include any one or more of the following indicators: the moving speed of the terminal device, the maximum value of the multipath delay spread, the peak to average power ratio (PAPR) or other indicators, etc.
- PAPR peak to average power ratio
- the specific indicators used are not limited in the embodiments of the present application.
- a set of parameters corresponding to when the moving speed of the terminal device is greater than or equal to the speed threshold is different from a set of parameters corresponding to when the moving speed of the terminal device is less than the speed threshold.
- a set of parameters corresponding to when the maximum value of the multipath delay spread is greater than threshold 1 is different from a set of parameters corresponding to when the maximum value of the multipath delay spread is less than threshold 1.
- a set of parameters corresponding to when the PAPR value is within range 1 is different from a set of parameters corresponding to when the PAPR value is within range 2.
- the high-speed mobile scenario represents that the moving speed of the terminal device is greater than or equal to the speed threshold
- the low-speed mobile scenario represents that the moving speed of the terminal device is less than the speed threshold
- the large multipath delay spread represents that the maximum value of the multipath delay spread is greater than the threshold 1
- the small multipath delay spread represents that the maximum value of the multipath delay spread is less than the threshold 1.
- Table 2 takes the 8 groups of parameters of the pre-defined first machine learning model as an example.
- the 8 groups of parameters are parameter 1, parameter 2, parameter 3, parameter 4, parameter 5, parameter 6, parameter 7 and parameter 8. Different groups of parameters in the 8 groups of parameters of the first machine learning model correspond to different scenarios and different indicator ranges of the first preset indicator.
- Table 2 are only for the convenience of understanding the relationship between different groups of parameters in the multiple groups of trained parameters of the first machine learning model.
- the specific number of pre-defined parameter groups and the specific usage method can be flexibly set in combination with the actual scenario, and are not limited here.
- the first device is a terminal device, in an implementation method (hereinafter referred to as implementation method one for the convenience of description), before executing step 401, the first device (i.e., the terminal device) is deployed with multiple identification information corresponding to the parameters of the first machine learning model and a first rule, and the first rule indicates the correspondence between different indicator ranges of at least one first preset indicator of different identification information in the multiple identification information corresponding to the parameters of the first machine learning model.
- the meaning of the first rule can be understood by referring to the "correspondence between different groups of parameters in the multiple groups of trained parameters and different indicator ranges of at least one first preset indicator" disclosed in the above description, and will not be elaborated here; the first device can determine a second identification information from the multiple identification information corresponding to the parameters of the first machine learning model based on the value of at least one first preset indicator and the first rule, and the second identification information is the identification information of a group of parameters adopted by the first machine learning model in the first device.
- first identification information will be used in subsequent descriptions, and the meaning of the first identification information will also be explained in the subsequent descriptions, so it will not be repeated here; for the various forms of base stations and terminal devices, please refer to the above description, so it will not be repeated here.
- the first device may send the second identification information to the base station.
- the base station may send a set of trained parameters pointed to by the second identification information to the first device.
- the aforementioned set of trained parameters sent by the base station is a set of trained parameters adopted by the first machine learning model in the first device.
- Step 401 may include: the first device is able to obtain the aforementioned set of trained parameters sent by the base station, thereby determining the aforementioned set of parameters obtained as a set of parameters adopted by the first machine learning model.
- the base station may send the second identification information and a set of parameters pointed to by the second identification information to the first device.
- the first device can obtain the second identification information and a set of parameters pointed to by the second identification information, and the first device determines the obtained set of parameters as a set of parameters adopted by the first machine learning model.
- the second device that communicates data with the first device is the above-mentioned terminal device
- the first device that is, the above-mentioned terminal device
- the second device has also determined which set of parameters is adopted by the first machine learning model in the first device.
- the first device can also send a set of parameters adopted by the first machine learning model (hereinafter referred to as "a set of target parameters" for the convenience of description) to the second device, so that the second device can determine which set of parameters is adopted by the first machine learning model in the first device.
- a set of target parameters for the convenience of description
- At least one third identification information corresponding to the parameters of the first machine learning model may be configured in the first device, and each third identification information in the aforementioned at least one third identification information is identification information of a set of parameters that can be adopted by the first machine learning model, and the set of parameters pointed to by each third identification information in the aforementioned at least one third identification information all conforms to the hardware capabilities of the first device, that is, the hardware capabilities of the first device can support the execution of a set of parameters pointed to by each third identification information.
- the first device may send all the third identification information configured above to the base station, and correspondingly, after receiving the at least one third identification information sent by the first device, the base station may obtain a set of trained parameters pointed to by each third identification information (that is, a set of parameters that can be adopted by the first machine learning model).
- the base station sends the aforementioned at least one set of trained parameters of the first machine learning model corresponding to the at least one third identification information to the first device, and the first device receives the aforementioned at least one set of parameters corresponding to the at least one third identification information.
- Step 401 may include: the first device may select a set of target parameters adopted by the first machine learning model from at least one set of parameters corresponding one-to-one to at least one third identification information.
- different third identification information in at least one third identification information can correspond to different indicator ranges of at least one second preset indicator.
- the first device can determine a second identification information from multiple third identification information corresponding to the parameters of the first machine learning model based on the value of the at least one second preset indicator, and then select a group of target parameters corresponding to a second identification information from at least one group of parameters corresponding one-to-one to the at least one third identification information.
- the specific implementation method of "the first device can determine a second identification information from multiple third identification information corresponding to the parameters of the first machine learning model according to the value of at least one second preset indicator” can refer to the above description of the specific implementation method of "the first device can determine a second identification information from multiple identification information corresponding to the parameters of the first machine learning model according to the value of at least one first preset indicator", which will not be repeated here.
- At least one second preset indicator and the category of “at least one first preset indicator” may be the same or different.
- the specific category of “at least one second preset indicator” may be flexibly set according to actual conditions and is not limited here.
- the first device may also send to the base station (i.e., an example of the second device) a second identification information determined by the above-mentioned first device from at least one third identification.
- the base station receives the second identification information sent by the first device, so that the second device can determine which set of parameters is adopted by the first machine learning model in the first device.
- the first device can also send a set of target parameters adopted by the first machine learning model to the second device, so that the second device can determine which set of parameters is adopted by the first machine learning model in the first device.
- step 401 may include: the base station (that is, an example of the first device) can determine a set of target parameters adopted by the first machine learning model from the multiple groups of trained parameters of the first machine learning model based on the value of at least one first preset indicator.
- the first device i.e., the base station
- the first device i.e., the base station
- the second device i.e., the terminal device communicating with the base station
- the second device may also send a set of target parameters adopted by the aforementioned first machine learning model to the second device (i.e., the terminal device communicating with the base station), so that the second device can determine which set of parameters is adopted by the first machine learning model in the first device.
- both the first device and the second device already have multiple sets of trained parameters of the first machine learning model and identification information of each set of trained parameters.
- the first device can send multiple sets of trained parameters of the first machine learning model and identification information of each set of trained parameters to the second device.
- the base station (an example of the first device) can send multiple sets of trained parameters of the first machine learning model and identification information of each set of trained parameters to the terminal device (an example of the second device); for another example, the terminal device (an example of the first device) can send multiple sets of trained parameters of the first machine learning model and identification information of each set of trained parameters to the base station (an example of the second device); for another example, the first terminal device (an example of the first device) can send multiple sets of trained parameters of the first machine learning model and identification information of each set of trained parameters to the second terminal device (an example of the second device)
- the network equipment and mobile station in the wireless communication system have been pre-configured with multiple groups of trained parameters of the first machine learning model and identification information of each group of trained parameters, and the network equipment and mobile station in the wireless communication system include a first device and a second device.
- step 401 may include: the first device may obtain a set of target parameters from multiple sets of trained parameters of the first machine learning model.
- the first device may determine a second identification information from multiple identification information corresponding to the parameters of the first machine learning model based on the value of at least one first preset indicator, and then determine a set of parameters pointed to by the second identification information, thereby determining a set of parameters used by the first machine learning model.
- the specific implementation method of the aforementioned steps can be found in the above description and will not be repeated here.
- the first device may also send the aforementioned second identification information to the second device, and correspondingly, the second device receives the second identification information sent by the first device.
- the second device may determine which set of parameters is used by the first machine learning model in the first device based on the received second identification information and third data, and the third data includes multiple sets of parameters corresponding to the first machine learning model in the second device and identification information of each set of parameters.
- the specific forms of the first device and the second device can be flexibly determined in combination with the actual application scenario, and are not limited here.
- FIG. 5 is a flow chart of a first device and a second device determining a set of parameters used by a first machine learning model according to an embodiment of the present application.
- the first device sends multiple sets of parameters of the first machine learning model and identification information of each set of parameters to the second device.
- the first device Send the second identification information to the second device.
- the second device determines a set of parameters used by the first machine learning model in the first device according to the second identification and the third data, wherein the third data includes multiple sets of parameters of the first machine learning model and identification information of each set of parameters.
- the example in FIG. 5 is only for facilitating the understanding of the present solution and is not used to limit the present solution.
- the first device only needs to send the second identification information to the second device, and the second device can know which group of parameters is used by the first machine learning model in the first device.
- the communication resources occupied by transmitting the second identification information are relatively small, which is beneficial to reducing the consumed communication resources.
- the embodiment of the present application does not limit the execution order between the above-mentioned operation of "the second device determines which set of parameters is adopted by the first machine learning model in the first device” and the subsequent steps 402 to 411, and the operation of "the second device determines which set of parameters is adopted by the first machine learning model in the first device” can be performed before or after any step of steps 402 to 411.
- only a set of trained parameters of the first machine learning model may be defined in advance, and the set of trained parameters of the aforementioned first machine learning model may be pre-configured in the first device. Then, when the first device needs to use the first machine learning model, the set of parameters adopted by the first machine learning model may be directly obtained locally.
- the first device in order to enable the first device to obtain a set of trained parameters adopted by the first machine learning model, and in order to enable the second device that communicates data with the first device to determine what parameters are adopted by the first machine learning model in the first device, it may be necessary to send the trained parameters corresponding to the first machine learning model and/or identification information of the aforementioned parameters between different devices.
- the terminal device (that is, an example of the first device) can send second identification information (that is, identification information of the trained parameters corresponding to the first machine learning model) to the base station.
- the base station can send a set of trained parameters pointed to by the second identification information to the terminal device, that is, a set of target parameters adopted by the first machine learning model.
- the terminal device (that is, an example of the first device) can send a set of target parameters adopted by the first machine learning model to another terminal device (that is, an example of the second device).
- the terminal device (also an example of the first device) can send at least one third identification information (also the identification information of the trained parameters corresponding to the first machine learning model) to the base station.
- the base station sends at least one set of trained parameters of the first machine learning model corresponding to the at least one third identification information to the terminal device (also an example of the first device).
- the first device can also send the above-mentioned second identification information to the base station (also an example of the second device).
- the first device can also send a set of target parameters adopted by the first machine learning model to the second device, and so on.
- the situations in the above-mentioned implementation methods 3 and 4 are not listed one by one here. For details, please refer to the descriptions in the above-mentioned various implementation methods.
- the trained parameters corresponding to the first machine learning model and/or identification information of the aforementioned parameters may be carried in the signaling, that is, in the various implementations described above, the trained parameters corresponding to the first machine learning model and/or identification information of the aforementioned parameters are sent between different devices by sending signaling.
- each signaling may carry at least one set of trained parameters of the first machine learning model.
- the trained parameters corresponding to the learning model can be carried in one or more of the following information: downlink control information (DCI), uplink control information (UCI), sidelink control information (SCI), radio resource control (RRC) signaling, media access control control element (MAC CE) or other types of signaling, which are not exhaustive here.
- DCI downlink control information
- UCI uplink control information
- SCI sidelink control information
- RRC radio resource control
- MAC CE media access control control element
- the base station can send a DCI, RRC or MAC CE carrying a set of trained parameters pointed to by the second identification information to the terminal device (that is, an example of the first device); correspondingly, the first device can obtain the set of trained parameters pointed to by the second identification information from the aforementioned DCI, RRC or MAC CE.
- the terminal device (that is, an example of the first device) can carry a set of target parameters adopted by the first machine learning model in SCI, RRC or MAC CE and send it to another terminal device (that is, an example of the second device); correspondingly, the second device can obtain the set of target parameters adopted by the aforementioned first machine learning model from SCI, RRC or MAC CE.
- the base station may carry each third identification information and a set of trained parameters pointed to by each third identification information in DCI, RRC or MAC CE and send them to the terminal device (that is, an example of the first device); correspondingly, the first device may obtain each third identification information and a set of trained parameters pointed to by each third identification information from DCI, RRC or MAC CE.
- the terminal device (an example of the first device) can carry each group of trained parameters of the first machine learning model and the identification information of each group of trained parameters in UCI, RRC or MAC CE and send them to the base station (an example of the second device); correspondingly, the base station can obtain each group of trained parameters of the first machine learning model and the identification information of each group of trained parameters from UCI, RRC or MAC CE, etc.
- the first signaling may carry at least one set of parameters adopted by the first machine learning model and indication information corresponding to each set of parameters, and the indication information corresponding to each set of parameters is used to indicate the position of multiple parameters included in each set of parameters in the first machine learning model.
- the first signaling is any of the following: DCI, UCI, SCI, RRC, MAC CE or other types of signaling.
- the base station when the base station needs to send a set of trained parameters pointed to by the second identification information to the first device, the base station can send a first signaling to the first device.
- the first device can receive the first signaling sent by the base station and obtain a set of parameters adopted by the first machine learning model from the first signaling.
- the base station when the base station needs to send a set of trained parameters pointed to by each third identification information to the first device, the base station may send one or more first signalings to the first device, each first signaling carrying a third identification information and a set of parameters pointed to by the third identification information, and so on.
- the base station may send one or more first signalings to the first device, each first signaling carrying a third identification information and a set of parameters pointed to by the third identification information, and so on.
- Other situations in the various implementation methods mentioned above are not listed one by one here.
- the signaling when at least one set of parameters of the first machine learning model is transmitted by signaling, the signaling not only carries the aforementioned at least one set of parameters, but also carries indication information corresponding to each set of parameters, which is used to indicate each set of parameters.
- the parameters include the positions of multiple parameters in the first machine learning module, so that after receiving the signaling, the first device can understand how to use the parameters carried in the signaling, and transmit at least one set of parameters of the first machine learning model by means of signaling, which is conducive to reducing the communication resources consumed in the parameter transmission process and improving the efficiency of the parameter transmission process.
- the indication information corresponding to each group of parameters may include the number of layers of each parameter in each group of parameters in the first machine learning model, and the parameter value used when operating in the layer.
- the name of each parameter in the first signaling may be consistent with the name of each parameter in the first machine learning model, and the information carried by the first signaling may include the following:
- the above information represents that the parameters used by the first neural network layer in the first machine learning model include matrix U1_1, matrix U1_2 and vector b1_1.
- the values of matrix U1_1 are ⁇ u11(1,1), u11(1,2), u11(2,1), u11(2,2),... ⁇
- the values of matrix U1_2 are ⁇ u12(1,1), u12(1,2), u12(1,3), u12(2,1),... ⁇
- the values of vector b1_1 are b11(1), b11(2),... ⁇ , etc.
- the meanings of the parameters used by the second to fifth neural network layers of the first machine learning model can be understood in combination with the above description and will not be elaborated here.
- the following takes the first signaling as MAC CE and RRC as an example to show the specific format of carrying a set of parameters of the first machine learning model in MAC CE and RRC.
- First refer to the following Table 3, which shows the format of a set of parameters of the first machine learning model in MAC CE.
- R in Table 3 represents a meaningless parameter, that is, R will not be used in the first machine learning model.
- the first machine learning model includes two neural network layers as an example.
- the parameters used by the first neural network layer of the first machine learning model include U 0 , W 0 , ⁇ s0 , V 0 and ⁇ o0 , and the values of the five parameters used by the first neural network layer of the first machine learning model are shown in the second to sixth rows, respectively.
- the seventh row of Table 3 (i.e., the seventh row of MAC CE) states that the parameters used by the second neural network layer of the first machine learning model include U 1 , W 1 , ⁇ s1 , V 1 and ⁇ o1 , and the values of the five parameters used by the second neural network layer of the first machine learning model are shown in the eighth to twelfth rows. Since the parameters used in the first machine learning model of the first device are also U 0 , W 0 , ⁇ s0 , V 0 , ⁇ o0 , U 1 , W 1 , ⁇ s1 , V 1 and ⁇ o1 , the first device can determine the value of each parameter in the first machine learning model after receiving the MAC CE. It should be understood that the examples in Table 3 are only for the convenience of understanding this solution and are not used to limit this solution.
- the content carried in RRC may be as follows:
- matrixU SEQUENCE ⁇ means that the values of the parameters in ⁇ are all matrices, Ui0j0 represents the parameter of the i0th row and j0th column of the parameter matrixU of the first machine learning model, and REAL in Ui0j0,REAL represents that the value type of the parameter Ui0j0 is a real number; similarly, Ui1j1 represents the parameter of the i1th row and j1th column of the parameter matrixU of the first machine learning model, and REAL in Ui1j1,REAL represents that the value type of the parameter Ui1j1 is a real number.
- vector SEQUENCE ⁇ means that the values of the parameters in ⁇ are all vectors, v0 is the 0th parameter of the parameter vector of the first machine learning model, and REAL in v0,REAL represents that the value type of the parameter v0 is a real number; v1 is the 1st parameter of the parameter vector of the first machine learning model, and REAL in v1,REAL represents that the value type of the parameter v1 is a real number.
- a set of parameters of the first machine learning model can be carried in the RRC, and after obtaining the RRC, the position of each parameter in the set of parameters in the first machine learning model can also be known. It should be understood that the above examples are only for the convenience of understanding of this solution and are not used to limit this solution.
- FIG. 6 is a flow chart of a first device obtaining a set of parameters adopted by a first machine learning model provided in an embodiment of the present application.
- the first device sends a UCI to a base station, and the UCI carries second identification information.
- the base station obtains a set of parameters pointed to by the second identification information, that is, a set of parameters adopted by the first machine learning model. 603.
- the base station obtains a DUI, and the DUI carries a set of parameters pointed to by the second identification information and indication information corresponding to the aforementioned set of parameters, and the indication information corresponding to the aforementioned set of parameters is used to indicate the aforementioned set of
- the parameters include the positions of multiple parameters in the first machine learning model. 604.
- the base station sends the DUI to the first device, and correspondingly, the first device receives the DUI.
- the UCI and DUI in Figure 6 can also be replaced by other types of signaling.
- the example in Figure 6 is only for the convenience of understanding this solution and is not used to limit this solution.
- the identification information of the parameters is carried in any one or more of the following information: DCI, UCI, SCI, RRC signaling, MAC CE, physical broadcast channel (PBCH), physical random access channel (PRACH) or other types of signaling, which are not exhaustive here.
- the terminal device (that is, an example of the first device) can carry the second identification information in UCI, MAC CE or PRACH and send it to the base station.
- the base station can carry the second identification information in DCI, MAC CE or PBCH and send it to the terminal device.
- the first terminal device can carry the second identification information in SCI or MAC CE and send it to the second terminal device, and so on; it should be noted that the above-mentioned various implementation methods in which the identification information of the trained parameters corresponding to the first machine learning model is carried by signaling are not described one by one here. Other situations in which the aforementioned identification information is carried by signaling in the aforementioned various implementation methods can be understood by referring to the aforementioned description.
- carrying the identification information of the aforementioned at least one set of parameters and/or each set of parameters in a signaling for transmission has higher transmission efficiency and consumes less computer resources; in addition, the present solution provides a variety of signaling that can be used to transmit the identification information of the aforementioned at least one set of parameters and/or each set of parameters, thereby improving the implementation flexibility of the present solution.
- the trained parameters corresponding to the first machine learning model and/or identification information of the aforementioned parameters may be carried in a data packet. That is, in the above implementations, the trained parameters corresponding to the first machine learning model and/or identification information of the aforementioned parameters are sent between different devices by sending data packets.
- the terminal device (also an example of the first device) can send a first data packet carrying the second identification information to the base station, and correspondingly, the base station can obtain the second identification information from the first data packet.
- the base station can send a second data packet to the terminal device (also an example of the first device), and the second data packet carries a set of trained parameters pointed to by the second identification information; the terminal device can obtain the aforementioned set of target parameters adopted by the first machine learning model from the second data packet.
- the terminal device (also an example of the first device) can send a third data packet to another terminal device (also an example of the second device), and the third data packet carries a set of target parameters adopted by the first machine learning model; the second device can obtain a set of target parameters adopted by the first machine learning model in the first device from the third data packet, and so on.
- implementation method 2 implementation method 3 and implementation method 4 are not described one by one here.
- the specific implementation methods in implementation method 2, implementation method 3 and implementation method 4 can be understood by referring to the above description of implementation method 1.
- the first machine learning model may be retrained to optimize the parameters used in the first machine learning model, and the first device obtains a set of updated parameters of the first machine learning model.
- the set of updated parameters of the first machine learning model may be sent to the second device.
- the base station retrains the first machine learning model to obtain a set of updated parameters of the first machine learning model.
- the first device may send a request to the base station, the aforementioned request is used to request the base station to retrain the first machine learning model, and the base station sends a set of updated parameters of the first machine learning model to the first device.
- the first device may also send the aforementioned set of updated parameters to the second device.
- the base station can also send the aforementioned set of updated parameters to the second device, so that the second device can determine the set of updated parameters adopted by the first machine learning model in the first device.
- the first device retrains the first machine learning model to obtain a set of updated parameters of the first machine learning model.
- the first device also sends the set of updated parameters of the first machine learning model to the second device.
- the first device is a terminal device and the second device is a base station
- the terminal device after the terminal device retrains the first machine learning model and obtains a set of updated parameters of the first machine learning model, it can send the aforementioned set of updated parameters to the base station.
- the first device is a first terminal device and the second device is a second terminal device
- after the first terminal device retrains the first machine learning model and obtains a set of updated parameters of the first machine learning model it can send the aforementioned set of updated parameters to the second terminal device, and so on.
- Various situations are not enumerated here.
- the first device obtains the value of T, where T is an integer greater than or equal to 1, and T represents the number of sub-data included in the output data of the first machine learning model.
- T represents the number of sub-data included in the output data of the first machine learning model.
- the output data of the first machine learning model is modulated data
- the modulated data may include T groups of modulated symbols.
- the output data of the first machine learning model is encoded data
- the encoded data may include T groups of encoded bit data.
- the task performed by the first machine learning model is to generate a reference signal
- the output data of the first machine learning model is a reference signal
- T may represent the length of the aforementioned reference signal.
- the length of the reference signal may indicate the number of symbols included in the reference signal.
- the first device obtains data to be processed.
- the first device determines whether the length of the data to be processed is less than H. If the determination result is yes, the process proceeds to step 405; if the determination result is no, the process proceeds to step 406, where H indicates the length of the first data.
- step 404 is an optional step.
- the first device can also obtain the length of the data to be processed and the value of H.
- the length of the data to be processed can be K, where K is an integer greater than or equal to 1, and H indicates the length of the first data, that is, H indicates the expected length of the input data of the first machine learning model, and H is an integer greater than or equal to 1.
- the first device may determine whether K is less than H; if the determination result is yes, proceed to step 405; if the determination result is no, proceed to step 406.
- the length of the data to be processed may be the number of bits of the data to be processed.
- the data to be processed is the data that needs to be encoded
- the length of the data to be processed may be the number of bits of the data that needs to be encoded.
- the data to be processed is the data that needs to be modulated
- the length of the data to be processed may be the number of bits of the data that needs to be modulated.
- the data to be processed is the data that needs to be encoded and modulated, and the length of the data to be processed is the number of bits of the data that needs to be encoded and modulated.
- the task performed by the first machine learning model is to generate a reference signal, the data to be processed includes the index numbers of multiple reference signals, and the length of the data to be processed is the number of bits of the index numbers of the aforementioned multiple reference signals, and so on.
- the data to be processed includes index numbers of multiple reference signals, and the length of the data to be processed can be the number of the aforementioned multiple reference signals, etc.
- the meaning of "the length of the data to be processed" can be flexibly determined based on actual conditions and is not limited here.
- the first device fills the data to be processed to obtain first data, and the length of the first data is H.
- step 405 is an optional step. If the first device determines that the length of the data to be processed is less than H, the first device can fill the data to be processed to obtain the first data.
- the length of the first data is H, and the first data may include the data to be processed and the filled data.
- the data to be processed when the length of the data to be processed is less than H, the data to be processed is filled to obtain the first data of length H, and then the first data of length H is input into the first machine learning model, so that no matter how long the data to be processed is, the first machine learning model processes the first data of length H, which is not only conducive to compatibility with data to be processed of any length, but also conducive to reducing the difficulty of the first machine learning model in data processing to obtain second data with better performance.
- the padding data may include first identification information, and the first identification information is used to identify the value of T and/or the value of K, that is, the first identification information can be used to identify the value of T and the value of K, and can also be used to identify the value of T, and can also be used to identify the value of K.
- the value of T is the number of sub-data included in the output data of the first machine learning model
- K is the length of the data to be processed
- T and K are both integers greater than or equal to 1.
- the first device may use the first function to process the value of T and/or the value of K to obtain the first identification information.
- the conditions that the first function needs to meet include: limiting the value of the first identification information within a preset range, and being able to map different values of T and/or K to different values, that is, the value generated by the first function can uniquely identify a certain value of T and/or value of K, or in other words, the value generated by the first function can distinguish different values of T and/or value of K.
- the first function may be a binary function, a linear function, or a nonlinear function.
- f(T, K) represents an example of the first function, and the example in formula (1) is only for the convenience of understanding the present solution and is not used to limit the present solution.
- the first data carries first identification information for identifying the value of T and/or the value of K.
- the first machine learning model can process the first data according to the value of T and/or the value of K, that is, according to the length of the output data of the first machine learning model and/or the length of the actual data to be processed, and then the second data output by the first machine learning model is conducive to obtaining second data with better performance.
- the padding The charging data may also carry identification information of the first device.
- the first device may be the identification information of the terminal device; if the first device is a base station, the first device may be the identification information of the base station; for example, the identification information of the first device may be a radio network temporary identity (RNTI), a cell identity (ID), a physical cell identity (PCI), or other types of identification information, etc., which are not exhaustive here.
- RNTI radio network temporary identity
- ID cell identity
- PCI physical cell identity
- the padding data may also carry invalid information.
- the remaining space in the padding data may be filled with 0, 1 or other values. The example here is only for the convenience of understanding the present solution and is not used to limit the present solution.
- i is taken from 0 to H-1 in sequence. If i is less than K, b i is obtained from the data to be processed, and b i is converted to c i and put into the first data, c i represents the i+1th data in the first data; if i is greater than or equal to K and less than H-1, 0 is filled into the first data; if i is equal to H-1, the first identification information is filled into the first data.
- the meaning of the first identification information can be found in the above description and will not be elaborated here. It should be understood that the example here is only for the convenience of understanding this scheme and is not used to limit this scheme.
- Figure 7 is a schematic diagram of the first data provided in the embodiment of the present application.
- the first data includes data to be processed, 0 for padding, and first identification information.
- the example in Figure 7 is only for the convenience of understanding this solution and is not used to limit this solution.
- the padding data may include identification information of the first device but not the first identification information. In another case, the padding data may carry only invalid information.
- the first device merges the first identification information and the data to be processed to obtain the first data, where the first identification information is used to identify the value of T and/or the value of K, where K is the length of the data to be processed and K is an integer greater than or equal to 1.
- step 406 is an optional step. If the first device determines that the length of the data to be processed is equal to H, the first device may also merge the first identification information and the data to be processed to obtain the first data. For the meaning of the first identification information, please refer to the above description and will not be repeated here.
- “merging the first identification information and the data to be processed” includes, but is not limited to: concatenating, adding or other merging methods of the first identification information and the data to be processed, etc., which are not limited here.
- steps 404 to 406 are all optional steps, and if steps 404 to 406 are not performed, the data to be processed can be directly determined as the first data. Alternatively, steps 404 and 405 can be omitted, and only step 406 can be performed.
- steps 404 and 405 may be performed, and step 406 may not be performed. Then, when the first device determines that the length of the data to be processed is equal to H, the data to be processed may be directly determined as the first data.
- the first device inputs the first data into the first machine learning model to obtain second data generated by the first machine learning model, where the second data includes T sub-data, wherein the first machine learning model includes one or more modules, and each time a module in the first machine learning model is called at least once, one sub-data is obtained.
- the first device may directly input the first data into the first machine learning model.
- the first device may also scramble the first data using the identification information of the first device, and input the scrambled first data into the first machine learning model.
- the meaning of the identification information of the first device can be referred to the above description, which will not be repeated here.
- One sub-data is obtained at least once each time a module in the first machine learning model is called means that one sub-data among T sub-data can be obtained each time a module in the first machine learning model is called at least once, or, one sub-data among T sub-data can be obtained each time multiple modules in the first machine learning model are called at least once.
- the function of the first machine learning model includes any one or more of the following combinations: encoding, modulation, or generating a reference signal.
- the T sub-data represent T groups of encoded bit data, and each group of encoded bit data may include one or more bit data.
- the T sub-data represent T groups of modulated symbols, and each group of modulated symbols may include one or more symbols.
- the T sub-data represent T groups of encoded and modulated symbols, and each group of encoded and modulated symbols may include one or more symbols.
- the T sub-data may represent T groups of symbols in the reference signal, etc. It should be understood that the examples given here are only for the convenience of understanding this scheme and are not used to limit this scheme.
- the first machine learning model may include a first module and at least one second module, and the difference between the first module and the second module includes: the initial input of the first module is the first data or the scrambled first data, and the initial input of the second module is the feature information of the first data (or the scrambled first data).
- step 407 may include: the first device inputs the first data (or the scrambled first data) into the first module to obtain the first sub-data generated by the first module, and the first sub-data is one of the T sub-data; the first device can obtain the characteristic information of the first data in the process of using the first module to generate the first sub-data.
- the first device inputs the first characteristic information into the second module to obtain the second sub-data generated by the second module, wherein the first characteristic information includes the characteristic information generated when the module in the first machine learning model was last called for data processing, and the second sub-data is one of the T sub-data.
- the first machine learning model includes a first module and at least one second module, the input of the first module of the first machine learning model is the entire first data, and the input of the second module is the last call to the first If the feature information is obtained when the module of the machine learning model is called, the feature information obtained when the entire first data is processed by the first module is input when the second module is called for the first time, so that the feature information input into the second module each time refers to the entire first data, that is, when generating each sub-data in the T sub-data, the information of the entire first data is referred to, which is conducive to obtaining second data with better performance; and each time the second module is called once, one second sub-data among the T sub-data can be obtained, which is conducive to quickly obtaining the T sub-data included in the second data.
- the first device may obtain the first sub-data generated by the first module by calling the first module once or multiple times; optionally, if the first device obtains the first sub-data generated by the first module by calling the first module multiple times, the first characteristic information input when the second module is called for the first time is the characteristic information generated when the first module is called for the last time.
- the first device inputs the first data (or the first data after scrambling) into the first module, processes the first data (or the first data after scrambling) through the first module, and then directly outputs the first sub-data.
- the first module may use a convolutional neural network, a recurrent neural network, a fully connected neural network, or other types of neural networks, etc., which are not limited here.
- the process of the first device using the first module to process the first data may include: the first device performs a linear transformation on the first data (or the first data after scrambling) through the first module, and processes it with a third activation function to obtain characteristic information of the first data (or the first data after scrambling); linearly transforms the characteristic information of the first data (or the first data after scrambling), and processes it with a fourth activation function to obtain the first sub-data generated by the first module.
- the activation function in the first machine learning module can be any of the following: tanh(x), max(min(a*x,+1),-1), sin(x) or other types of activation functions, etc.
- tanh(x) max(min(a*x,+1),-1), sin(x) or other types of activation functions, etc.
- the examples here are only used to prove the feasibility of this solution and are not used to limit this solution.
- the third activation function and the fourth activation function may use the same activation function or different activation functions, which may be flexibly set according to actual conditions and are not limited in the embodiments of the present application.
- the first device inputs the first data (or the first data after scrambling) into the first module, and processes the first data (or the first data after scrambling) through the first module.
- the first device obtains the second characteristic information of the first data (or the first data after scrambling) obtained in the aforementioned processing process, inputs the second characteristic information into the first module again, and processes the second characteristic information through the first module; the first device repeats the aforementioned step of "obtaining the second characteristic information obtained in the process of calling the first module for data processing last time, inputting the second characteristic information into the first module again, and processing the second characteristic information through the first module" at least once, and uses the processing result of the last call of the first module to process the second characteristic information as the first sub-data.
- the process of the first device using the first module to process the input data each time may include: the first device performs a linear transformation on the input data through the first module, and processes it using the third activation function to obtain the transformed input data; performs a linear transformation on the transformed input data, and processes it using the fourth activation function to obtain the processing result of the first module.
- the input data of the first module may be the first data (or the first data after scrambling), or the characteristic information of the first data (or the first data after scrambling).
- each second sub-data specifically, each time the first device calls the second module, it will The first feature information is input into the second module, and the first feature information is processed by the second module to obtain second sub-data generated by the second module, where the second sub-data is one of the T sub-data.
- the first feature information is the feature information generated when the module in the first machine learning model is called for data processing last time; exemplarily, the first feature information can be the feature information generated when the first module in the first machine learning model is called for data processing last time, or it can be the feature information generated when the second module in the first machine learning model is called for data processing last time.
- each second module can use a convolutional neural network, a recurrent neural network, a fully connected neural network, or other types of neural networks, etc., which are not limited here.
- the process in which the first device uses the second module to process the first feature information each time may include: the first device performs a linear transformation on the first feature information through the second module, and processes it using the first activation function to obtain the transformed feature information; performs a linear transformation on the transformed feature information, and processes it using the second activation function to obtain the second sub-data generated by the second module.
- the first activation function and the second activation function are both activation functions within the first machine learning model, and the specific activation function to be used can be flexibly set according to the actual situation.
- a specific implementation method for data processing by the second module is provided. Since the above method is simple and easy to implement, it is not only beneficial to reduce the computer resources consumed in the process of generating the second data; and the number of parameters used in the second module shown in the above method is relatively small, which is beneficial to reduce the communication resources consumed when transmitting the parameters of the first machine learning model.
- the first machine learning model includes a first module and a second module, and a first sub-data is obtained by calling the first module once, and T-1 second sub-data are obtained by calling the second module T-1 times:
- the parameters used in the first module include U, ⁇ s , V and ⁇ o
- the parameters used in the second module include W, ⁇ s , V and ⁇ o
- ⁇ ′ s represents the transposition of ⁇ s
- ⁇ ′ o represents the transposition of ⁇ o
- the values of the parameters used in the first module and the second module are the same, that is, the values of U in the first module and W in the second module are the same.
- c represents the first data
- c′ represents the transposition of the first data
- Uc′+ ⁇ ′ s represents the linear transformation of the transposed first data.
- o′ 0 Vs′ 0 + ⁇ ′ o represents that the feature information s′ 0 of the first data is subjected to linear transformation to obtain o′ 0
- o 0 represents the transposition of o′ 0
- exp(j2 ⁇ o 0 ) represents that o 0 is processed by the fourth activation function to obtain the first sub-data x 0 .
- the example here in which the first machine learning model only includes a first module and a second module, and the parameters used in the first module and the second module are consistent, is only an example for the convenience of understanding the present solution.
- the parameters used by the first module and the second module may also be inconsistent, and the first machine learning model may also include multiple second modules.
- FIG8 is a schematic diagram of generating T sub-data using the first machine learning model provided in an embodiment of the present application.
- the first data can be linearly transformed by the first module, and processed by the third activation function to obtain S0 (i.e., the feature information of the first data), S0 is linearly transformed to obtain O0 , and O0 is processed to obtain a first sub-data generated by the first module.
- S 0 generated in the process of calling the first module to process the first data is input into the second module of the first machine learning model, S 0 is linearly transformed by the second module, and is processed by the first activation function to obtain S 1 (that is, the updated feature information of the first data), S 1 is linearly transformed to obtain O 1 , and after processing O 1 , the first second sub-data generated by the second module is obtained.
- the feature information (i.e., St -1 ) generated when the module of the first machine learning model was called last time for processing is input into the second module. For example, if it is the first time that the second module of the first machine learning model is called, the feature information (an example of St-1 ) generated when the first module of the first machine learning model is called for processing is input into the second module; if it is the second to T-1th time that the second module is called, the feature information (another example of St -1 ) generated when the second module was called last time for processing is input into the second module called currently.
- St-1 is linearly transformed by the second module and processed using the first activation function to obtain St (i.e., the first data).
- the T - th sub-data generated by the second module is obtained. Then, the second module can be called T-1 times to obtain T-1 second sub-data.
- the T-1 second sub-data and 1 first sub-data can constitute T sub-data in the second data.
- the example in FIG8 is only for facilitating the understanding of the present solution and is not used to limit the present solution.
- the first machine learning model may include multiple second modules, wherein at least two of the multiple second modules use different parameters. That is, the first device can generate one second sub-data each time it calls the second module, but different second modules may be called in the process of generating T-1 second data.
- two second modules using different parameters may include any of the following differences: the same type of parameters are used in the two second modules, but the parameter values used in the two second modules are not exactly the same; or the types of parameters used in the two second modules are not exactly the same, etc., which are not exhaustive here.
- the same parameters are used in the two second modules means that not only the types of parameters used in the two second modules are exactly the same, but also the values of each parameter are exactly the same.
- the multiple second modules may include a second module 1, a second module 2 and a second module 3. If the value of T is 8, 7 second sub-data need to be generated.
- Each second module can be used to generate the same number (for example, 3) of second sub-data, that is, the second module 1 is used to generate the first three second data, the second module 2 is used to generate the fourth, fifth and sixth second sub-data, and the second module 3 is used to generate the seventh second sub-data.
- the multiple second modules may include a second module 1, a second module 2 and a second module 3, and the value of T is 8, so 7 second sub-data need to be generated.
- the number of second sub-data generated by each second module may also be different, that is, the second module 1 is used to generate the first 3 second data, the second module 2 is used to generate the 4th and 5th second sub-data, and the second module 3 is used to generate the 6th and 7th second sub-data. It should be noted that the examples here are only for the convenience of understanding the present scheme and are not used to limit the present scheme.
- x t represents the t+1th sub-data among the T sub-data included in the second data.
- Different second modules use the same ⁇ s and ⁇ o , and different second modules use different W and V.
- W t mod ⁇ and V t mod ⁇ represent the periodic calling of ⁇ second modules. There are ⁇ groups of different parameters in the ⁇ second modules.
- W t mod ⁇ can be specifically expressed as W 0 , W 1 ...W ⁇ -1
- V t mod ⁇ can be specifically expressed as V 0 , V 1 ...V ⁇ -1 .
- the ⁇ groups of different parameters include W 0 and V 0 , W 1 and V 1 ...W ⁇ -1 and V ⁇ -1 , respectively.
- ⁇ second sub-data are generated by ⁇ second modules in one cycle, and the ⁇ second modules are reused in the next cycle.
- the model may include three second modules, namely, the second module 1, the second module 2 and the second module 3. The three second modules can be called cyclically.
- the second module 1 After generating a second sub-data through the second module 1 (that is, using W 1 and V 1 ), and then generating a second sub-data through the second module 2 (that is, using W 2 and V 2 ), and then generating a second sub-data through the second module 3 (that is, using W 0 and V 0 ), the second module 1 can be called again, and so on. It should be understood that the examples here are only for the convenience of understanding the present solution and are not used to limit the present solution.
- multiple second modules may be used in the first machine learning model, and at least two of the multiple second modules use different parameters, that is, T-1 second sub-data are generated by different second modules, which is beneficial to the matching degree between the parameters of the second module and the generated second sub-data, and thus is beneficial to obtaining second data with better performance.
- the first machine learning model may include a first module and at least one third module, and the difference between the first module and the third module includes: the initial input of the first module is the first data or the scrambled first data, and the initial input of the third module is the feature information of the first data (or the scrambled first data).
- the meaning of the "third module” is similar to that of the "second module", which can be understood by referring to the above description and will not be repeated here.
- step 407 may include: the first device inputs the first data (or the first data after interference) into the first module, and generates the first sub-data through the first module, and the first sub-data is one of the T sub-data; the specific implementation of the aforementioned steps can refer to the above description, which is not repeated here.
- the first device calls the third module multiple times to obtain the third sub-data generated by the third module, and the third sub-data is one of the T sub-data, wherein the input of the third module includes the feature information of the first data, and the feature information of the first data (or the first data after interference) is updated multiple times in the process of calling the third module multiple times.
- the process of the third module processing the input data is similar to “the process of the second module processing the input data", the difference is that each time the first device calls the second module once, it will use the processing result generated by the second module as a second sub-data; while the first device needs to call a third module multiple times, and update the characteristic information of the first data (or the first data after interference) multiple times in the process of calling the third module multiple times, and then use the processing result obtained by the last call to the third module as a third sub-data.
- the third sub-data, the second sub-data and the first sub-data are all sub-data included in the second data. For the meaning of the sub-data, please refer to the above description, which will not be repeated here.
- the first device inputs the characteristic information of the first data (or the first data after interference) into the third module, and processes the first data through the third module, and the aforementioned processing process includes updating the characteristic information of the first data.
- the first device inputs the updated characteristic information of the first data (or the first data after interference) into the third module again, and processes the updated characteristic information of the first data (or the first data after interference) through the third module again, and the aforementioned processing process includes updating the characteristic information of the first data (or the first data after interference) again; the first device repeats the aforementioned operation at least once, and when the number of times the characteristic information of the first data (or the first data after interference) is processed by the third module reaches a preset number of times, a third sub-data generated by the third module is obtained.
- s′ tl-1 represents the characteristic information of the first data generated when the third module was called last time
- s′ tN represents the updated characteristic information of the first data generated after the characteristic information of the first data is updated N times
- x t represents a third sub-data generated after the third module is called N times
- N is an integer greater than or equal to 2.
- a third sub-data is generated based on the last updated characteristic information of the first data. After multiple updates to the first data, it is helpful to have a more thorough understanding of the first data, thereby generating sub-data with better performance.
- the size of the parameters in the first machine learning model is related to the value of H and the value of G, where H is the length of the first data and G is the length of each sub-data in the T sub-data.
- the T sub-data represent T groups of encoded bit data, and G represents the number of bits in each group of encoded bit data.
- the T sub-data represent T groups of modulated symbols, and G represents the number of symbols in each group of modulated symbols.
- the T sub-data represent T groups of modulated symbols, and G represents the number of symbols in each group of modulated symbols.
- the T sub-data may represent T groups of symbols in the reference signal, and G represents the number of symbols in each group of symbols, etc. It should be understood that the examples given here are only for the convenience of understanding this solution and are not used to limit this solution.
- the values of H is 12, the value of G is 12, and the parameters used in the first machine learning model may include U, W, ⁇ s , V, and ⁇ o , then That is, U is a 12 by 12 matrix, the size of U is 12 in length and width, the size of W and V is the same as the size of U, ⁇ s is a 1 by 12 vector, the size of ⁇ s is 1 in width and 12 in length, and the size of ⁇ o is the same as the size of ⁇ s .
- the value of H is 6, the value of G is 6, and the parameters used in the first machine learning model may include U, W, ⁇ s , V, and ⁇ o , then That is, U is a 6 by 6 matrix, the size of U is 6 in length and width, the size of W and V is the same as the size of U, ⁇ s is a 1 by 6 vector, the size of ⁇ s is 1 in width and 6 in length, and the size of ⁇ o is the same as the size of ⁇ s .
- the value of H is 12
- the value of G is 6, and the parameters used in the first machine learning model may include U, W, ⁇ s , V, and ⁇ o , then right
- the explanation of the dimensions of ⁇ s , W, ⁇ s , V and ⁇ o can be found in the above description and will not be repeated here. It should be noted that the examples given here are only for the convenience of understanding the present solution and are not used to limit the present solution.
- the size of the parameters in the first machine learning model is designed according to the length of the first data and the length of each sub-data in the T sub-data, which is beneficial to reducing the amount of parameters in the first machine learning model while meeting the output requirements, and is beneficial to further reducing the communication resources consumed by transmitting the parameters of the first machine learning model.
- the first device determines a signal to be sent according to the second data.
- the first device after the first device generates the second data through the first machine learning model, it can also determine the signal to be sent according to the second data. For example, if the function of the first machine learning model is encoding, the second data is the encoded data, and the first device also needs to modulate the second data to obtain the signal to be sent.
- the second data is modulated data
- the first device can also use truncation to rate match the second data to obtain the signal to be sent.
- the second data is the encoded and modulated data
- the first device can also use truncation to rate match the second data to obtain the signal to be sent.
- the function of the first machine learning model is to generate a reference signal
- the second data is the reference signal
- the first device can determine the reference signal as the signal to be sent.
- the first device may also perform other operations in the process of obtaining the second data and determining the signal to be sent according to the second data, which are not limited here.
- the first device sends the signal to be sent to the second device.
- the second device obtains second data, the second data includes T sub-data, T is an integer greater than or equal to 1, the second data is generated by the first machine learning model in the first device, the first machine learning model includes one or more modules, and each time a module in the first machine learning model is called at least once, one sub-data is obtained.
- the second device after the second device obtains the received signal corresponding to the signal to be sent, it can denoise the received signal and obtain the received second data (that is, the estimated second data) from the denoised received signal.
- the received second data that is, the estimated second data
- the received second data is obtained after demodulating the denoised received signal.
- the second data is modulated data, that is, the function of the first machine learning model is modulation, or the function of the first machine learning model is encoding and modulation, then the denoised received signal can be directly determined as the second data.
- the second device can determine the estimated channel information based on the received signal after acquiring the received signal (that is, the received second data) corresponding to the signal to be sent.
- the second device generates first data according to the second data.
- the second device may generate the estimated first data according to the received second data.
- the second device may acquire a set of parameters adopted by the first machine learning model in the first device.
- the specific implementation of the above steps can refer to the description in step 401.
- the second device may demodulate and/or decode the second data according to a set of parameters adopted by the first machine learning model in the first device to generate the estimated first data.
- the first data of the calculation can be performed by the received second data.
- the second device demodulates and/or decodes the second data according to a set of parameters adopted by the first machine learning model in the first device, which may include: after obtaining a set of parameters adopted by the first machine learning model, the second device can determine what operations the first device has performed on the first data using the first machine learning model, and then can use an estimation algorithm to perform an inverse operation on the second data to achieve demodulation and/or decoding of the second data, the aforementioned inverse operation being the inverse operation of the operation performed on the first data using the first machine learning model.
- the estimation algorithm may be any of the following: maximum likelihood estimation algorithm, maximum a posteriori probability estimation or other types of estimation algorithms, etc.
- maximum likelihood estimation algorithm maximum a posteriori probability estimation or other types of estimation algorithms, etc.
- the examples given here are only for facilitating the understanding of the present solution and are not used to limit the present solution.
- the second device demodulates and/or decodes the second data according to a set of parameters adopted by the first machine learning model in the first device, which may include: after the second device obtains a set of parameters adopted by the first machine learning model, it can obtain a second machine learning model corresponding to the first machine learning model, input the second data into the second machine learning model, and demodulate and/or decode the second data through the second machine learning model.
- the second device can also obtain the estimated first data in other ways. The examples here are only used to prove the feasibility of this solution and are not used to limit this solution.
- Figure 9 is a schematic diagram of a model training method provided in an embodiment of the present application.
- the model training method includes steps 901 to 903.
- Acquire training data from a training data set wherein the training data is used to obtain first data and a value of T, where T is an integer greater than or equal to 1, and at least two training data in the training data set include different values of T.
- each training data may include a value of T, where T represents the number of sub-data included in the output data of the first machine learning model, and at least two training data in the training data set include different values of T.
- each training data may also include data to be processed, and the training device may directly determine the aforementioned data to be processed as the first data; or, the first data may be obtained based on the aforementioned data to be processed using the method shown in steps 403 to 406 in the corresponding embodiment of Figure 4.
- the specific implementation method may refer to the description in the corresponding embodiment of Figure 4 above, and the meaning of the noun in step 901 may also be understood in conjunction with the description in the corresponding embodiment of Figure 4, which will not be elaborated here.
- the training device inputs the first data into the first machine learning model to obtain the second data generated by the first machine learning model.
- the specific implementation of step 902 and the meaning of the nouns in step 902 can refer to the description of step 407 in the embodiment corresponding to FIG4, which will not be repeated here.
- the training device in steps 901 and 902 can be a terminal device or a base station, which can be flexibly set according to actual conditions.
- the training device can perform a training operation based on the second data and the loss function.
- the first machine learning model is trained to obtain a trained first machine learning model. It should be noted that step 903 can be performed by the same device or by different devices.
- step 903 is performed by the same device, and the training device in step 903 can be a terminal device or a base station.
- the function of the first machine learning model includes coding and/or modulation
- step 903 includes: denoising a received signal corresponding to a signal to be sent to obtain a denoised received signal, demodulating and/or decoding the denoised received signal to obtain estimated data corresponding to the data to be processed; training the first machine learning model according to the estimated data and a first loss function, wherein the first loss function indicates the similarity between the estimated data and the data to be processed.
- the function of the first machine learning model is modulation.
- the training device can directly determine the second data as the signal to be sent; the training device obtains the received signal corresponding to the signal to be sent, denoises the received signal to obtain the denoised received signal, and demodulates the denoised received signal to obtain the estimated data corresponding to the data to be processed.
- the training device generates the similarity between the data to be processed and the estimated data, that is, obtains the function value of the first loss function, and uses the function value of the first loss function to update the weight parameters of the first machine learning model, thereby realizing one training of the first machine learning model.
- the first loss function can be a cross entropy loss function, an L1 loss function, or other types of loss functions, etc., which can be flexibly determined in combination with the actual application scenario, and is not limited in the embodiments of the present application.
- the training device obtaining a received signal corresponding to the signal to be sent may include: the training device multiplies the signal to be sent with a channel matrix, and adds the result of the multiplication to noise to obtain the received signal, and the aforementioned steps are for simulating the process of the signal to be sent being transmitted through the channel.
- the same channel matrix and noise can be used, or different channel matrices and/or different noises can be used. Different channel matrices and/or different noises are used to simulate channel environments with different signal-to-noise ratios.
- the function of the first machine learning model is encoding, and after obtaining the second data (i.e., the encoded data), the training device may also modulate the second data to obtain a signal to be sent; the training device obtains a received signal corresponding to the signal to be sent, denoises the received signal to obtain a denoised received signal, and demodulates and decodes the denoised received signal to obtain estimated data corresponding to the data to be processed in the training data.
- the training device trains the first machine learning model based on the data to be processed, the estimated data, and the first loss function; the aforementioned steps and the specific implementation of "the training device obtains a received signal corresponding to the signal to be sent" can be found in the above description and will not be repeated here.
- the function of the first machine learning model is modulation.
- the training device can rate match the second data by truncation to obtain the signal to be sent; the training device obtains a received signal corresponding to the signal to be sent, denoises the received signal to obtain a denoised received signal, and demodulates the denoised received signal to obtain estimated data corresponding to the data to be processed in the training data.
- the subsequent steps performed by the training device can refer to the above description and will not be repeated here.
- the function of the first machine learning model is encoding and modulation.
- the training device can use a truncation method to rate match the second data to obtain the signal to be sent; the training device obtains a received signal corresponding to the signal to be sent, and performs rate matching on the received signal.
- denoising a denoised received signal is obtained, and the denoised received signal is demodulated and decoded to obtain estimated data corresponding to the data to be processed in the training data.
- the subsequent steps performed by the training device can refer to the above description and will not be repeated here.
- the function of the first machine learning model is to generate a reference signal
- the second data is the reference signal.
- the training device can generate a received reference signal corresponding to the reference signal, and generate predicted channel information based on the received reference signal corresponding to the reference signal.
- the specific implementation method of “the training device generates a received reference signal corresponding to the reference signal” is similar to the specific implementation method of "the training device generates a received signal corresponding to the signal to be sent", the difference is that "the signal to be sent” is replaced by "reference signal”, and "the received signal” is replaced by "received reference signal", which will not be repeated here.
- the training device trains the first machine learning model according to the second loss function, and the second loss function indicates the similarity between the predicted channel information and the correct channel information.
- the training device calculates the similarity between the predicted channel information and the correct channel information to obtain the function value of the second loss function, and uses the function value of the second loss function to update the weight parameters of the first machine learning model, thereby realizing one training of the first machine learning model.
- step 903 is performed by two training devices (hereinafter referred to as the first training device and the second training device for ease of description).
- the first training device may be a base station
- the second training device may be a terminal device that requests the base station to retrain the first machine learning model
- the first training device may be a first device
- the second training device may be a second device.
- the function of the first machine learning model includes encoding and/or modulation.
- the difference of this implementation method mainly lies in the implementation method of "obtaining a received signal corresponding to the signal to be sent".
- the signal to be sent is sent to the second training device, and the second training device receives the received signal, and then the second training device obtains estimated data corresponding to the data to be processed; according to the estimated data and the first loss function, the first machine learning model is trained.
- a specific implementation method for training the first machine learning model is provided when the function of the first machine learning model includes encoding and/or modulation, which reduces the difficulty of implementing the present solution, and the loss function uses the similarity between the estimated data and the data to be processed, that is, the goal of the loss function is to obtain estimated data with better performance.
- the loss function is more in line with the actual needs when sending data between devices, and the second data output by the trained first machine learning model is more in line with actual needs.
- the function of the first machine learning model is to generate a reference signal.
- the difference of this implementation method mainly lies in the implementation method of "obtaining a received reference signal corresponding to the reference signal".
- the first training device sends the reference signal to the second training device; after the second training device obtains the received reference signal, it generates predicted channel information according to the received reference signal; the second training device trains the first machine learning model according to the loss function.
- a specific implementation method for training the first machine learning model when the function of the first machine learning model is to generate a reference signal is also provided, which expands the application scenarios of the present solution and improves the implementation flexibility of the present solution.
- the training device repeatedly executes steps 901 to 903 to iteratively train the first machine learning model until the condition is satisfied.
- the convergence condition of the first loss function obtains a set of trained parameters of the first machine learning model.
- the training device can also use multiple different training data sets to train the first machine learning model separately, so as to obtain multiple sets of trained parameters of the first machine learning model.
- Figure 10 is a schematic diagram of a data processing device provided in an embodiment of the present application.
- the data processing device 1000 can implement the function of the first device in the above method embodiment, and thus can also achieve the beneficial effects possessed by the above method embodiment.
- the data processing device 1000 may include a processing module 1001; wherein the processing module 1001 is used to obtain the value of T, T is an integer greater than or equal to 1, and T represents the number of sub-data included in the output data of the first machine learning model; the processing module 1001 is also used to input the first data into the first machine learning model to obtain the second data generated by the first machine learning model, the second data including T sub-data, wherein the first machine learning model includes one or more modules, and each module in the first machine learning model is called at least once to obtain one sub-data.
- the functionality of the first machine learning model includes any one or a combination of the following: encoding, modulation, and generating a reference signal.
- the multiple modules in the first machine learning model include a first module and at least one second module
- the processing module 1001 is specifically used to: input the first data into the first module to obtain first sub-data generated by the first module, where the first sub-data is one of T sub-data; input the first feature information into the second module to obtain second sub-data generated by the second module, wherein the first feature information includes the feature information generated when the module in the first machine learning model was last called for data processing, the second sub-data is one of the T sub-data, and the module in the first machine learning model that was last called is the first module or the second module.
- the multiple modules in the first machine learning model include a first module and at least one third module
- the processing module 1001 is specifically used to: input the first data into the first module, generate first sub-data through the first module, the first sub-data is one of T sub-data, and the process of generating the first sub-data through the first module includes extracting features of the first data; call the third module multiple times to obtain third sub-data generated by the third module, the third sub-data is one of T sub-data, wherein the input of the third module includes feature information of the first data, and the feature information of the first data is updated multiple times in the process of calling the third module multiple times.
- the processing module 1001 is specifically used to: perform a linear transformation on the first feature information through the second module, and process it with a first activation function to obtain the transformed feature information; perform a linear transformation on the transformed feature information, and process it with a second activation function to obtain second sub-data.
- the at least one second module includes multiple second modules, wherein parameters adopted by at least two second modules among the multiple second modules are different.
- the processing module 1001 is also used to obtain the value of the data to be processed and H, where H is an integer greater than or equal to 1, and H indicates the length of the first data; the processing module 1001 is also used to pad the data to be processed if the length of the data to be processed is less than H to obtain the first data, and the length of the first data is H.
- the first data includes data to be processed and filling data
- the filling data includes first identification information
- the first identification information is used to identify the value of T and/or the value of K, where K is the length of the data to be processed and K is an integer greater than or equal to 1.
- the size of the parameters in the first machine learning model is related to the value of H and the value of G, where G is the length of each sub-data.
- the parameters corresponding to the first machine learning model are carried in one or more of the following information: downlink control information DCI, uplink control information UCI, sidelink control information SCI, radio resource control RRC signaling, or media access control control element MAC CE; and/or, the identification information of the parameter is carried in any one or more of the following information: DCI, UCI, SCI, RRC signaling, MAC CE, physical broadcast channel PBCH, or physical random access channel PRACH.
- the data processing device 1000 is applied to the first device, and the second device is the receiving end of the second data.
- the second device has multiple groups of parameters corresponding to the first machine learning model and identification information of each group of parameters, please refer to Figure 10.
- the data processing device 1000 may also include: a transceiver module 1002, used to send second identification information to the second device, and the second identification information is used to indicate a set of parameters adopted by the first machine learning model in the first device.
- the data processing device 1100 can implement the function of the second device in the above method embodiment, and thus can also achieve the beneficial effects possessed by the above method embodiment.
- the data processing device 1100 may include a processing module 1101; wherein the processing module 1101 is used to obtain second data; and based on the second data, generate first data.
- the second data includes T sub-data, T is an integer greater than or equal to 1, and the second data is generated by a first machine learning model in the first device, and the first machine learning model includes one or more modules, and each time a module in the first machine learning model is called at least once, a sub-data is obtained.
- the parameters corresponding to the first machine learning model are carried in one or more of the following information: downlink control information DCI, uplink control information UCI, sidelink control information SCI, radio resource control RRC signaling, or media access control control element MAC CE; and/or, the identification information of the parameters is carried in any one or more of the following information: DCI, UCI, SCI, RRC signaling, MAC CE, physical broadcast channel PBCH, or physical random access channel PRACH.
- the data processing device is applied to a second device, and the second device has third data, and the third data includes multiple groups of parameters corresponding to the first machine learning model and identification information of each group of parameters.
- the data processing device 1100 may also include: a transceiver module 1102, which is used to receive the second identification information sent by the first device; a processing module 1101, which is also used to determine a set of parameters used by the first machine learning model in the first device according to the second identification information and the third data.
- the processing module 1101 is specifically used to generate the first data according to a set of parameters used by the first machine learning model in the first device and the second data.
- the model training device 1200 can implement the functions of the model training device in the above-mentioned method embodiment, and therefore can also achieve the beneficial effects possessed by the above-mentioned method embodiment.
- the model training device 1200 may include a processing module 1201; wherein the processing module 1201 is used to obtain training data from a training data set, wherein the training data is used to obtain the first data and the value of T, T is an integer greater than or equal to 1, and at least two training data in the training data set include different values of T; the processing module 1201 is also used to input the first data into the first machine learning model to obtain the first machine learning model generated by the first machine learning model.
- the second data includes T sub-data, wherein the first machine learning model includes multiple modules, and each time a module in the first machine learning model is called at least once, a sub-data generated by the module is obtained; the processing module 1201 is also used to train the first machine learning model based on the second data and the loss function to obtain the trained first machine learning model.
- the functionality of the first machine learning model includes any one or a combination of the following: encoding, modulating, or generating a reference signal.
- the multiple modules in the first machine learning model include a first module and at least one second module
- the processing module 1201 is specifically used to: input the first data into the first module to obtain first sub-data generated by the first module, where the first sub-data is one of T sub-data; input the first feature information into the second module to obtain second sub-data generated by the second module, wherein the first feature information includes the feature information generated when the module in the first machine learning model was last called for data processing, the second sub-data is one of the T sub-data, and the module in the first machine learning model that was last called is the first module or the second module.
- the processing module 1201 is also used to obtain the data to be processed from the training data; the processing module 1201 is also used to obtain the value of H, where H is an integer greater than or equal to 1, and H indicates the length of the first data; the processing module 1201 is also used to pad the data to be processed if the length of the data to be processed is less than H to obtain the first data, and the length of the first data is H.
- the second data is used to determine the signal to be sent
- the processing module 1201 is specifically used to: demodulate and/or decode the received signal corresponding to the signal to be sent to obtain estimated data corresponding to the data to be processed; train the first machine learning model according to the estimated data and the loss function, and the loss function indicates the similarity between the estimated data and the data to be processed.
- the processing module 1201 when the second data is a reference signal, based on the second data and the loss function, the processing module 1201 is specifically used to: generate predicted channel information according to a received reference signal corresponding to the reference signal; and train the first machine learning model according to the loss function, wherein the loss function indicates the similarity between the predicted channel information and the correct channel information.
- FIG 13 is a schematic diagram of a device provided in an embodiment of the present application.
- the communication device 1300 may be a device as a terminal device in the above embodiment, and the example shown in Figure 13 is implemented by a terminal device (or a component in the terminal device).
- the communication device 1300 may include but is not limited to at least one processor 1301 and a communication port 1302 .
- the device may further include at least one of a memory 1303 and a bus 1304 .
- the at least one processor 1301 is used to control and process the actions of the communication device 1300 .
- the processor 1301 may be a central processing unit, a general-purpose processor, a digital signal processor, an application-specific integrated circuit, a field programmable gate array or other programmable logic device, a transistor logic device, a hardware component, or any combination thereof. It may implement or execute various exemplary logic blocks, modules, and circuits described in conjunction with the disclosure of this application.
- the processor may also be a combination that implements computing functions, such as a combination of one or more microprocessors, a combination of a digital signal processor and a microprocessor, and the like.
- a person skilled in the art may clearly understand that the method described herein is not intended to be limiting. For convenience and simplicity, the specific working processes of the systems, devices and units described above can refer to the corresponding processes in the aforementioned method embodiments and will not be repeated here.
- the device 1300 shown in Figure 13 can be used to implement the steps implemented by the terminal device in the aforementioned method embodiment.
- the specific implementation methods of the device 1300 shown in Figure 13 to perform the aforementioned steps can all be referred to the description in the aforementioned method embodiment, and will not be repeated here one by one.
- the device 1400 can be specifically a device as a network device in the above embodiment, and the example shown in Figure 14 is that the network device is implemented by the network device (or a component in the network device); that is, when the first device, the second device or the training device involved in the above embodiment is specifically a network device, the device 1400 shown in Figure 14 can be implemented; illustratively, when the first device, the second device or the training device is a base station, the device 1400 shown in Figure 14 can be implemented.
- the structure of the communication device can refer to the structure shown in Figure 14.
- the device 1400 includes at least one processor 1411 and at least one network interface 1412. Further optionally, the communication device also includes at least one memory 1414, at least one transceiver 1413 and one or more antennas 1415.
- the processor 1411, the memory 1414, the transceiver 1413 and the network interface 1412 are connected, for example, through a bus. In an embodiment of the present application, the connection may include various interfaces, transmission lines or buses, etc., which are not limited in this embodiment.
- the antenna 1415 is connected to the transceiver 1413.
- the network interface 1412 is used to enable the communication device to communicate with other communication devices through a communication link.
- the network interface 1412 may include a network interface between the communication device and the core network device, such as an S1 interface, and the network interface may include a network interface between the communication device and other communication devices (such as other network devices or core network devices), such as an X2 or Xn interface.
- the processor 1411 is mainly used to process the communication protocol and communication data, and to control the entire communication device, execute the software program, and process the data of the software program, for example, to support the communication device to perform the actions described in the embodiment.
- the communication device may include a baseband processor and a central processor.
- the baseband processor is mainly used to process the communication protocol and communication data
- the central processor is mainly used to control the entire terminal device, execute the software program, and process the data of the software program.
- the processor 1411 in Figure 14 can integrate the functions of the baseband processor and the central processor. It can be understood by those skilled in the art that the baseband processor and the central processor can also be independent processors, interconnected by technologies such as buses.
- the terminal device can include multiple baseband processors to adapt to different network formats, and the terminal device can include multiple central processors to enhance its processing capabilities.
- the various components of the terminal device can be connected through various buses.
- the baseband processor can also be described as a baseband processing circuit or a baseband processing chip.
- the central processor can also be described as a central processing circuit or a central processing chip.
- the function of processing the communication protocol and communication data can be built into the processor, or it can be stored in the memory in the form of a software program, and the processor executes the software program to realize the baseband processing function.
- the memory is mainly used to store software programs and data.
- the memory 1414 can exist independently and be connected to the processor 1411.
- the memory 1414 can be integrated with the processor 1411, for example, integrated into a chip.
- the memory 1414 can store program codes for executing the technical solutions of the embodiments of the present application, and the execution is controlled by the processor 1411.
- the various types of computer program codes executed can also be regarded as drivers of the processor 1411.
- FIG14 shows only one memory and one processor.
- the memory may also be referred to as a storage medium or a storage device.
- the memory may be a processor in a
- the storage elements on the same chip, that is, on-chip storage elements, or independent storage elements are not limited in the embodiments of the present application.
- the transceiver 1413 can be used to support the reception or transmission of radio frequency signals between the communication device and the terminal, and the transceiver 1413 can be connected to the antenna 1415.
- the transceiver 1413 includes a transmitter Tx and a receiver Rx.
- one or more antennas 1415 can receive radio frequency signals
- the receiver Rx of the transceiver 1413 is used to receive the radio frequency signal from the antenna, and convert the radio frequency signal into a digital baseband signal or a digital intermediate frequency signal, and provide the digital baseband signal or the digital intermediate frequency signal to the processor 1411, so that the processor 1411 further processes the digital baseband signal or the digital intermediate frequency signal, such as demodulation and decoding.
- the transmitter Tx in the transceiver 1413 is also used to receive a modulated digital baseband signal or a digital intermediate frequency signal from the processor 1411, and convert the modulated digital baseband signal or the digital intermediate frequency signal into a radio frequency signal, and send the radio frequency signal through one or more antennas 1415.
- the receiver Rx can selectively perform one or more stages of down-mixing and analog-to-digital conversion processing on the RF signal to obtain a digital baseband signal or a digital intermediate frequency signal, and the order of the down-mixing and analog-to-digital conversion processing is adjustable.
- the transmitter Tx can selectively perform one or more stages of up-mixing and digital-to-analog conversion processing on the modulated digital baseband signal or digital intermediate frequency signal to obtain a RF signal, and the order of the up-mixing and digital-to-analog conversion processing is adjustable.
- the digital baseband signal and the digital intermediate frequency signal can be collectively referred to as a digital signal.
- the transceiver 1413 may also be referred to as a transceiver unit, a transceiver, a transceiver device, etc.
- a device in the transceiver unit for implementing a receiving function may be regarded as a receiving unit
- a device in the transceiver unit for implementing a sending function may be regarded as a sending unit, that is, the transceiver unit includes a receiving unit and a sending unit
- the receiving unit may also be referred to as a receiver, an input port, a receiving circuit, etc.
- the sending unit may be referred to as a transmitter, a transmitter, or a transmitting circuit, etc.
- the device 1400 shown in Figure 14 can be used to implement the steps implemented by the base station in the aforementioned method embodiment.
- the specific implementation methods of the device 1400 shown in Figure 14 to perform the aforementioned steps can all be referred to the description in the aforementioned method embodiment, and will not be repeated here one by one.
- a computer-readable storage medium is also provided in an embodiment of the present application, in which a program for performing signal processing is stored.
- the computer executes the steps executed by the first device in the method described in the embodiments shown in Figures 3 to 8 above, or the computer executes the steps executed by the second device in the method described in the embodiments shown in Figures 3 to 8 above, or the computer executes the steps executed by the training device in the method described in the embodiment shown in Figure 9 above.
- Also provided in an embodiment of the present application is a computer program product, which, when executed on a computer, enables the computer to execute the steps executed by the first device in the method described in the embodiments shown in the aforementioned Figures 3 to 8, or enables the computer to execute the steps executed by the second device in the method described in the embodiments shown in the aforementioned Figures 3 to 8, or enables the computer to execute the steps executed by the training device in the method described in the embodiment shown in the aforementioned Figure 9.
- the first device, the second device, the training device, the data processing device or the model training device provided in the embodiment of the present application may be a chip, and the chip includes: a processing unit and a communication unit, the processing unit may be, for example, a processor, and the communication unit may be, for example, an input/output interface, a pin or a circuit.
- the processing unit may execute the computer execution instructions stored in the storage unit so that the chip executes the data processing method described in the embodiments shown in Figures 3 to 8 above, or, So that the chip executes the training method of the model described in the embodiment shown in Figure 9.
- the storage unit is a storage unit in the chip, such as a register, a cache, etc.
- the storage unit can also be a storage unit located outside the chip in the wireless access device, such as a read-only memory (ROM) or other types of static storage devices that can store static information and instructions, a random access memory (RAM), etc.
- ROM read-only memory
- RAM random access memory
- FIG. 15 is a schematic diagram of a structure of a chip provided in an embodiment of the present application.
- the chip can be expressed as a neural network processor NPU 150.
- NPU 150 is mounted on the host CPU (Host CPU) as a coprocessor, and tasks are assigned by the Host CPU.
- the core part of the NPU is the operation circuit 150, which controls the operation circuit 1503 through the controller 1504 to extract matrix data in the memory and perform multiplication operations.
- the operation circuit 1503 includes multiple processing units (Process Engine, PE) inside.
- the operation circuit 1503 is a two-dimensional systolic array.
- the operation circuit 1503 can also be a one-dimensional systolic array or other electronic circuits capable of performing mathematical operations such as multiplication and addition.
- the operation circuit 1503 is a general-purpose matrix processor.
- the operation circuit takes the corresponding data of matrix B from the weight memory 1502 and caches it on each PE in the operation circuit.
- the operation circuit takes the matrix A data from the input memory 1501 and performs matrix operation with matrix B, and the partial result or final result of the matrix is stored in the accumulator 1508.
- Unified memory 1506 is used to store input data and output data. Weight data is directly transferred to weight memory 1502 through Direct Memory Access Controller (DMAC) 1505. Input data is also transferred to unified memory 1506 through DMAC.
- DMAC Direct Memory Access Controller
- BIU stands for Bus Interface Unit, that is, the bus interface unit 1510, which is used for the interaction between AXI bus and DMAC and instruction fetch buffer (IFB) 1509.
- IOB instruction fetch buffer
- the bus interface unit 1510 (Bus Interface Unit, BIU for short) is used for the instruction fetch memory 1509 to obtain instructions from the external memory, and is also used for the storage unit access controller 1505 to obtain the original data of the input matrix A or the weight matrix B from the external memory.
- BIU Bus Interface Unit
- DMAC is mainly used to transfer input data in the external memory DDR to the unified memory 1506 or to transfer weight data to the weight memory 1502 or to transfer input data to the input memory 1501.
- the vector calculation unit 1507 includes multiple operation processing units, which further process the output of the operation circuit when necessary, such as vector multiplication, vector addition, exponential operation, logarithmic operation, size comparison, etc. It is mainly used for non-convolutional/fully connected layer network calculations in neural networks, such as Batch Normalization, pixel-level summation, upsampling of feature planes, etc.
- the vector calculation unit 1507 can store the processed output vector to the unified memory 1506.
- the vector calculation unit 1507 can apply a linear function and/or a nonlinear function to the output of the operation circuit 1503, such as linear interpolation of the feature plane extracted by the convolution layer, and then, for example, a vector of accumulated values to generate an activation value.
- the vector calculation unit 1507 generates a normalized value, a pixel-level summed value, or both.
- the processed output vector can be used as an activation input to the operation circuit 1503, for example, for use in a subsequent layer in a neural network.
- the controller 1504 is connected to an instruction fetch buffer 1509 for storing the controller 1504 Instructions used;
- Unified memory 1506, input memory 1501, weight memory 1502 and instruction fetch memory 1509 are all on-chip memories. External memories are private to the NPU hardware architecture.
- the operations of each layer in the first machine learning model can be performed by the operation circuit 1503 or the vector calculation unit 1507.
- the processor mentioned in any of the above places may be a general-purpose central processing unit, a microprocessor, an ASIC, or one or more integrated circuits for controlling the execution of the program of the above-mentioned first aspect method.
- the device embodiments described above are merely schematic, wherein the units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical units, that is, they may be located in one place, or they may be distributed over multiple network units. Some or all of the modules may be selected according to actual needs to achieve the purpose of the scheme of this embodiment.
- the connection relationship between the modules indicates that there is a communication connection between them, which may be specifically implemented as one or more communication buses or signal lines.
- the technical solution of the present application is essentially or the part that contributes to the prior art can be embodied in the form of a software product, which is stored in a readable storage medium, such as a computer floppy disk, a U disk, a mobile hard disk, a ROM, a RAM, a magnetic disk or an optical disk, etc., including a number of instructions to enable a computer device (which can be a personal computer, a training device, or a network device, etc.) to execute the methods described in each embodiment of the present application.
- a computer device which can be a personal computer, a training device, or a network device, etc.
- all or part of the embodiments may be implemented by software, hardware, firmware or any combination thereof.
- all or part of the embodiments may be implemented in the form of a computer program product.
- the computer program product includes one or more computer instructions.
- the computer may be a general-purpose computer, a special-purpose computer, a computer network, or other programmable devices.
- the computer instructions may be stored in a computer-readable storage medium, or transmitted from one computer-readable storage medium to another computer-readable storage medium.
- the computer instructions may be transmitted from a website site, a computer, a training device, or a data center by wired (e.g., coaxial cable, optical fiber, digital subscriber line (DSL)) or wireless (e.g., infrared, wireless, microwave, etc.) mode to another website site, computer, training device, or data center.
- the computer-readable storage medium may be any available medium that a computer can store or a data storage device such as a training device, a data center, etc. that includes one or more available media integrations.
- the available medium may be a magnetic medium, (e.g., a floppy disk, a hard disk, a tape), an optical medium (e.g., a DVD), or a semiconductor medium (e.g., a solid-state drive (SSD)), etc.
- a magnetic medium e.g., a floppy disk, a hard disk, a tape
- an optical medium e.g., a DVD
- a semiconductor medium e.g., a solid-state drive (SSD)
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Evolutionary Computation (AREA)
- Software Systems (AREA)
- Artificial Intelligence (AREA)
- Computing Systems (AREA)
- Mathematical Physics (AREA)
- Data Mining & Analysis (AREA)
- Biophysics (AREA)
- General Health & Medical Sciences (AREA)
- Molecular Biology (AREA)
- Biomedical Technology (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Computer Networks & Wireless Communication (AREA)
- Signal Processing (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Databases & Information Systems (AREA)
- Medical Informatics (AREA)
- Mobile Radio Communication Systems (AREA)
- Machine Translation (AREA)
Abstract
Description
f(T,K)=1–2*((K-3)*11+(T-4))/98; (1)
Claims (28)
- 一种数据处理方法,其特征在于,所述方法包括:获取T的取值,所述T为大于或等于1的整数,所述T代表第一机器学习模型的输出数据包括的子数据的个数;将第一数据输入所述第一机器学习模型,得到所述第一机器学习模型生成的第二数据,所述第二数据包括所述T个子数据,其中,所述第一机器学习模型包括一个或多个模块,每调用所述第一机器学习模型中的模块至少一次得到一个所述子数据。
- 根据权利要求1所述的方法,其特征在于,所述第一机器学习模型的功能包括如下任一项或多项的组合:编码、调制、生成参考信号。
- 根据权利要求1或2所述的方法,其特征在于,所述第一机器学习模型中的多个模块包括第一模块和至少一个第二模块,所述将第一数据输入所述第一机器学习模型,得到所述第一机器学习模型输出的第二数据,包括:将所述第一数据输入所述第一模块,得到所述第一模块生成的第一子数据,所述第一子数据为所述T个子数据中的一个;将第一特征信息输入所述第二模块,得到所述第二模块生成的第二子数据,其中,所述第一特征信息包括上一次调用所述第一机器学习模型中的模块进行数据处理时生成的特征信息,所述第二子数据为所述T个子数据中的一个,所述上一次调用的所述第一机器学习模型中的模块为所述第一模块或者所述第二模块。
- 根据权利要求1或2所述的方法,其特征在于,所述第一机器学习模型中的多个模块包括第一模块和至少一个第三模块,所述将第一数据输入所述第一机器学习模型,得到所述第一机器学习模型输出的第二数据,包括:将所述第一数据输入所述第一模块,通过所述第一模块生成第一子数据,所述第一子数据为所述T个子数据中的一个,所述通过所述第一模块生成第一子数据的过程中包括对所述第一数据进行特征提取;调用所述第三模块多次,得到所述第三模块生成的第三子数据,所述第三子数据为所述T个子数据中的一个,其中,所述第三模块的输入包括所述第一数据的特征信息,在所述调用所述第三模块多次的过程中对所述第一数据的特征信息进行多次更新。
- 根据权利要求3所述的方法,其特征在于,所述将第一特征信息输入所述第二模块,得到所述第二模块生成的第二子数据,包括:通过所述第二模块对所述第一特征信息进行线性变换,并采用第一激活函数进行处理,得到变换后的特征信息;对所述变换后的特征信息进行线性变换,并采用第二激活函数进行处理,得到所述第二子数据。
- 根据权利要求3所述的方法,其特征在于,所述至少一个第二模块包括多个所述第二模块,其中,所述多个第二模块中至少两个第二模块采用的参数不同。
- 根据权利要求1或2所述的方法,其特征在于,所述将第一数据输入所述第一机器学习模型之前,所述方法还包括:获取待处理数据和H的取值,H为大于或等于1的整数,所述H指示所述第一数据的长度;若所述待处理数据的长度小于所述H,则对所述待处理数据进行填充,得到所述第一数据,所述第一数据的长度为所述H。
- 根据权利要求7所述的方法,其特征在于,所述第一数据包括所述待处理数据和填充数据,所述填充数据包括第一标识信息,所述第一标识信息用于标识所述T的取值和/或K的取值,所述K为所述待处理数据的长度,所述K为大于或等于1的整数。
- 根据权利要求7所述的方法,其特征在于,所述第一机器学习模型中的参数的尺寸与所述H的取值以及G的取值相关,所述G为每个所述子数据的长度。
- 根据权利要求1或2所述的方法,其特征在于,与所述第一机器学习模型对应的参数携带于如下一种或多种信息中:下行控制信息DCI、上行控制信息UCI、侧行链路控制信息SCI、无线资源控制RRC信令或者媒体访问控制的控制元素MAC CE;和/或,所述参数的标识信息携带于如下任一种或多种信息中:DCI、UCI、SCI、RRC信令、MAC CE、物理广播信道PBCH或者物理随机接入信道PRACH。
- 根据权利要求1或2所述的方法,其特征在于,所述方法应用于第一装置侧,第二装置为所述第二数据的接收端,所述第二装置中有与所述第一机器学习模型对应的多组参数以及每组所述参数的标识信息,所述方法还包括:向所述第二装置发送第二标识信息,所述第二标识信息用于指示所述第一装置中的所述第一机器学习模型采用的一组所述参数。
- 一种数据处理方法,其特征在于,所述方法包括:获取第二数据,其中,所述第二数据包括T个子数据,所述T为大于或等于1的整数,所述第二数据由第一装置中的第一机器学习模型生成,所述第一机器学习模型包括一个或多个模块,每调用所述第一机器学习模型中的模块至少一次得到一个所述子数据;根据所述第二数据,生成第一数据。
- 根据权利要求12所述的方法,其特征在于,与所述第一机器学习模型对应的参数携带于如下一种或多种信息中:下行控制信息DCI、上行控制信息UCI、侧行链路控制信息SCI、无线资源控制RRC信令或者媒体访问控制的控制元素MAC CE;和/或,所述参数的标识信息携带于如下任一种或多种信息中:DCI、UCI、SCI、RRC信令、MAC CE、物理广播信道PBCH或者物理随机接入信道PRACH。
- 根据权利要求12所述的方法,其特征在于,所述方法应用于第二装置侧,所述第二装置中有第三数据,所述第三数据包括与所述第一机器学习模型对应的多组参数以及每组所述参数的标识信息,所述方法还包括:接收所述第一装置发送的第二标识信息;根据所述第二标识信息和所述第三数据,确定所述第一装置中的所述第一机器学习模型采用的一组所述参数;所述根据所述第二数据,生成第一数据,包括:根据所述第一装置中的所述第一机器学习模型采用的一组所述参数和所述第二数据, 生成所述第一数据。
- 一种模型的训练方法,其特征在于,所述方法包括:从训练数据集合中获取训练数据,其中,所述训练数据用于得到第一数据和T的取值,所述T为大于或等于1的整数,所述训练数据集合中至少两个所述训练数据包括的所述T的取值不同;将所述第一数据输入所述第一机器学习模型,得到所述第一机器学习模型生成的第二数据,所述第二数据包括所述T个子数据,其中,所述第一机器学习模型包括多个模块,每调用所述第一机器学习模型中的模块至少一次,得到所述模块生成的一个所述子数据;基于所述第二数据和损失函数,对所述第一机器学习模型进行训练,得到训练后的所述第一机器学习模型。
- 根据权利要求15所述的方法,其特征在于,所述第一机器学习模型的功能包括如下任一项或多项的组合:编码、调制、生成参考信号。
- 根据权利要求15或16所述的方法,其特征在于,所述第一机器学习模型中的多个模块包括第一模块和至少一个第二模块,所述将第一数据输入所述第一机器学习模型,得到所述第一机器学习模型输出的第二数据,包括:将所述第一数据输入所述第一模块,得到所述第一模块生成的第一子数据,所述第一子数据为所述T个子数据中的一个;将第一特征信息输入所述第二模块,得到所述第二模块生成的第二子数据,其中,所述第一特征信息包括上一次调用所述第一机器学习模型中的模块进行数据处理时生成的特征信息,所述第二子数据为所述T个子数据中的一个,所述上一次调用的所述第一机器学习模型中的模块为所述第一模块或者所述第二模块。
- 根据权利要求15或16所述的方法,其特征在于,所述将第一数据输入所述第一机器学习模型之前,所述方法还包括:从所述训练数据中获取待处理数据;获取H的取值,H为大于或等于1的整数,所述H指示所述第一数据的长度;若所述待处理数据的长度小于所述H,则对所述待处理数据进行填充,得到所述第一数据,所述第一数据的长度为所述H。
- 根据权利要求18所述的方法,其特征在于,在所述第一机器学习模型的功能包括编码和/或调制的情况下,所述第二数据用于确定待发送的信号,所述基于所述第二数据和损失函数,对所述第一机器学习模型进行训练,包括:对与所述待发送的信号对应的接收信号进行解调制和/或解码以得到与所述待处理数据对应的估计数据;根据所述估计数据和所述损失函数,对所述第一机器学习模型进行训练,所述损失函数指示所述估计数据和所述待处理数据之间的相似度。
- 根据权利要求15或16所述的方法,其特征在于,在所述第二数据为参考信号的情况下,所述基于所述第二数据和损失函数,对所述第一机器学习模型进行训练,包括:根据与所述参考信号对应的接收的参考信号,生成预测的信道信息;根据所述损失函数,对所述第一机器学习模型进行训练,所述损失函数指示所述预测的信道信息和正确的信道信息之间的相似度。
- 一种数据处理装置,其特征在于,所述数据处理装置包括处理模块和收发模块;所述处理模块用于执行如权利要求1至11中任一项所述的处理操作,所述收发模块用于执行如权利要求1至11中任一项所述的收发操作。
- 一种数据处理装置,其特征在于,所述数据处理装置包括处理模块和收发模块;所述处理模块用于执行如权利要求12至14中任一项所述的处理操作,所述收发模块用于执行如权利要求12至14中任一项所述的收发操作。
- 一种模型的训练装置,其特征在于,所述数据处理装置包括处理模块,所述处理模块用于执行如权利要求15至20中任一项所述的处理操作。
- 一种通信系统,其特征在于,所述通信系统包括:如权利要求21所述的数据处理装置以及如权利要求22所述的数据处理装置。
- 根据权利要求24所述的系统,其特征在于,所述通信系统还包括:如权利要求23所述的模型的训练装置。
- 一种装置,其特征在于,所述装置包括至少一个处理器,所述至少一个处理器与存储器耦合,所述存储器用于存储程序或指令;所述至少一个处理器用于执行所述程序或指令,以使所述装置实现如权利要求1至11中任一项所述的方法;或者,实现如权利要求12至14中任一项所述的方法;或者,实现如权利要求15至20中任一项所述的方法。
- 一种计算机可读存储介质,其特征在于,所述可读存储介质存储有指令,当所述指令被计算机执行时,使得权利要求1至11中任一项所述的方法被执行;或者,使得权利要求12至14中任一项所述的方法被执行;或者,使得权利要求15至20中任一项所述的方法被执行。
- 一种计算机程序产品,其特征在于,所述计算机程序产品包括指令,当所述指令在计算机上运行时,使得权利要求1至11中任一项所述的方法被执行;或者,使得权利要求12至14中任一项所述的方法被执行;或者,使得权利要求15至20中任一项所述的方法被执行。
Priority Applications (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| EP23929396.2A EP4668175A4 (en) | 2023-03-31 | 2023-03-31 | DATA PROCESSING METHOD, MODEL TRAINING METHOD AND ASSOCIATED DEVICE |
| CN202380092774.9A CN120641917A (zh) | 2023-03-31 | 2023-03-31 | 一种数据处理方法、模型的训练方法以及相关设备 |
| PCT/CN2023/085467 WO2024197810A1 (zh) | 2023-03-31 | 2023-03-31 | 一种数据处理方法、模型的训练方法以及相关设备 |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| PCT/CN2023/085467 WO2024197810A1 (zh) | 2023-03-31 | 2023-03-31 | 一种数据处理方法、模型的训练方法以及相关设备 |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2024197810A1 true WO2024197810A1 (zh) | 2024-10-03 |
Family
ID=92907363
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/CN2023/085467 Ceased WO2024197810A1 (zh) | 2023-03-31 | 2023-03-31 | 一种数据处理方法、模型的训练方法以及相关设备 |
Country Status (3)
| Country | Link |
|---|---|
| EP (1) | EP4668175A4 (zh) |
| CN (1) | CN120641917A (zh) |
| WO (1) | WO2024197810A1 (zh) |
Cited By (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN119150706A (zh) * | 2024-11-18 | 2024-12-17 | 珠海市格努科技有限公司 | 弓网物理变量预测方法、装置、系统及存储介质 |
Citations (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20190095785A1 (en) * | 2017-09-26 | 2019-03-28 | Amazon Technologies, Inc. | Dynamic tuning of training parameters for machine learning algorithms |
| CN113408208A (zh) * | 2021-06-25 | 2021-09-17 | 成都欧珀通信科技有限公司 | 模型训练方法、信息提取方法、相关装置及存储介质 |
| CN114418129A (zh) * | 2022-03-30 | 2022-04-29 | 苏州浪潮智能科技有限公司 | 一种深度学习模型训练方法及相关装置 |
| WO2022127867A1 (en) * | 2020-12-17 | 2022-06-23 | Telefonaktiebolaget Lm Ericsson (Publ) | Method and apparatus for controlling training data |
| CN115470929A (zh) * | 2022-08-30 | 2022-12-13 | 三一重机有限公司 | 样本数据的生成方法、模型训练方法、装置、设备及介质 |
| WO2023036309A1 (zh) * | 2021-09-13 | 2023-03-16 | 维沃移动通信有限公司 | 参考信号序列生成方法、装置、设备及介质 |
Family Cites Families (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| EP3808024B1 (en) * | 2019-09-04 | 2022-03-16 | Google LLC | Neural network formation configuration feedback for wireless communications |
| US11689940B2 (en) * | 2019-12-13 | 2023-06-27 | Google Llc | Machine-learning architectures for simultaneous connection to multiple carriers |
-
2023
- 2023-03-31 WO PCT/CN2023/085467 patent/WO2024197810A1/zh not_active Ceased
- 2023-03-31 EP EP23929396.2A patent/EP4668175A4/en active Pending
- 2023-03-31 CN CN202380092774.9A patent/CN120641917A/zh active Pending
Patent Citations (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20190095785A1 (en) * | 2017-09-26 | 2019-03-28 | Amazon Technologies, Inc. | Dynamic tuning of training parameters for machine learning algorithms |
| WO2022127867A1 (en) * | 2020-12-17 | 2022-06-23 | Telefonaktiebolaget Lm Ericsson (Publ) | Method and apparatus for controlling training data |
| CN113408208A (zh) * | 2021-06-25 | 2021-09-17 | 成都欧珀通信科技有限公司 | 模型训练方法、信息提取方法、相关装置及存储介质 |
| WO2023036309A1 (zh) * | 2021-09-13 | 2023-03-16 | 维沃移动通信有限公司 | 参考信号序列生成方法、装置、设备及介质 |
| CN114418129A (zh) * | 2022-03-30 | 2022-04-29 | 苏州浪潮智能科技有限公司 | 一种深度学习模型训练方法及相关装置 |
| CN115470929A (zh) * | 2022-08-30 | 2022-12-13 | 三一重机有限公司 | 样本数据的生成方法、模型训练方法、装置、设备及介质 |
Non-Patent Citations (1)
| Title |
|---|
| See also references of EP4668175A4 * |
Cited By (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN119150706A (zh) * | 2024-11-18 | 2024-12-17 | 珠海市格努科技有限公司 | 弓网物理变量预测方法、装置、系统及存储介质 |
Also Published As
| Publication number | Publication date |
|---|---|
| CN120641917A (zh) | 2025-09-12 |
| EP4668175A4 (en) | 2026-03-18 |
| EP4668175A1 (en) | 2025-12-24 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US20260089071A1 (en) | Communication method and related device | |
| WO2024197810A1 (zh) | 一种数据处理方法、模型的训练方法以及相关设备 | |
| US20250200382A1 (en) | Data processing method, training method, and related apparatus | |
| US20250125854A1 (en) | Channel state information (csi) feedback method, terminal device and network device | |
| WO2022222116A1 (zh) | 信道恢复的方法及收端设备 | |
| WO2025139843A1 (zh) | 通信方法和通信装置 | |
| WO2023207783A1 (zh) | 一种通信方法、装置及系统 | |
| WO2025059855A1 (zh) | 一种数据处理方法以及相关设备 | |
| WO2024050789A1 (zh) | 通信方法及相关装置 | |
| CN121444505A (zh) | 无线通信的方法、终端设备和网络设备 | |
| WO2024245221A1 (zh) | 一种数据处理方法以及相关设备 | |
| CN115706615A (zh) | 一种通信方法及通信装置 | |
| CN119155007B (zh) | 数据处理方法、装置、存储介质及程序产品 | |
| WO2025161758A1 (zh) | 通信方法和通信装置 | |
| WO2025050289A1 (zh) | 一种通信方法及相关设备 | |
| WO2025050286A1 (zh) | 一种通信方法及相关设备 | |
| KR20260064717A (ko) | 통신 방법 및 관련 디바이스 | |
| WO2025086262A1 (zh) | 一种通信方法及相关设备 | |
| WO2025167443A1 (zh) | 一种通信方法及相关设备 | |
| WO2025060863A1 (zh) | 通信方法及相关装置 | |
| WO2025189831A1 (zh) | 一种通信方法及相关装置 | |
| WO2025124095A1 (zh) | 通信方法及通信装置 | |
| WO2025167816A1 (zh) | 通信方法和相关装置 | |
| WO2025218168A1 (zh) | 一种通信方法及相关装置 | |
| CN121750406A (zh) | 通信方法及通信装置 |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 23929396 Country of ref document: EP Kind code of ref document: A1 |
|
| WWE | Wipo information: entry into national phase |
Ref document number: 202380092774.9 Country of ref document: CN |
|
| WWP | Wipo information: published in national office |
Ref document number: 202380092774.9 Country of ref document: CN |
|
| WWE | Wipo information: entry into national phase |
Ref document number: 2023929396 Country of ref document: EP |
|
| NENP | Non-entry into the national phase |
Ref country code: DE |
|
| ENP | Entry into the national phase |
Ref document number: 2023929396 Country of ref document: EP Effective date: 20250916 |
|
| WWP | Wipo information: published in national office |
Ref document number: 2023929396 Country of ref document: EP |