WO2023142828A1 - 信息处理方法及装置 - Google Patents
信息处理方法及装置 Download PDFInfo
- Publication number
- WO2023142828A1 WO2023142828A1 PCT/CN2022/141926 CN2022141926W WO2023142828A1 WO 2023142828 A1 WO2023142828 A1 WO 2023142828A1 CN 2022141926 W CN2022141926 W CN 2022141926W WO 2023142828 A1 WO2023142828 A1 WO 2023142828A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- quantization
- groups
- indication information
- parameters
- group
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N20/00—Machine learning
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F21/00—Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
- G06F21/60—Protecting data
- G06F21/62—Protecting access to data via a platform, e.g. using keys or access control rules
- G06F21/6218—Protecting access to data via a platform, e.g. using keys or access control rules to a system of files or objects, e.g. local or distributed file system or database
- G06F21/6245—Protecting personal data, e.g. for financial or medical purposes
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/0464—Convolutional networks [CNN, ConvNet]
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
Definitions
- the present disclosure relates to the field of distributed learning, and more specifically, to an information processing method and device.
- Federated Learning is an artificial intelligence (AI) basic technology. It ensures information security during big data exchange, protects terminal data and personal data privacy, and ensures legal compliance. Efficient machine learning between parties or multiple computing nodes.
- the machine learning algorithms that can be used in federated learning are not limited to neural networks, but also include other algorithms such as random forests.
- Distributed learning such as federated learning involves data interaction between multiple parties, such as the interaction of model parameters and so on.
- the interactive data needs to be compressed before being sent.
- the current compression process uses compression schemes based on deep learning, such as model pruning, knowledge distillation, format conversion, etc. These schemes can effectively use transmission resources and ensure communication efficiency. However, these schemes set some smaller parameter values to zero or migrate larger models to smaller models to reduce the amount of transmitted data, resulting in loss of model performance.
- Embodiments of the present disclosure provide a data processing solution, which can be applied to distributed learning such as federated learning, and can ensure model performance while ensuring communication efficiency.
- a data processing method includes: the first device divides the parameters of the distributed training model into multiple groups, and at least two of the multiple groups have different numerical intervals; the first device is based on the quantization bits corresponding to each of the multiple groups number, determine a plurality of quantization groups corresponding to the plurality of groups, and the number of quantization bits is determined based on at least one of the following: the amount of transmission resources between the first device and the second device, the value of the loss function, or each group The value range of the parameter; and the first device sends multiple quantization groups to the second device.
- the first device can quantize the parameters of the distributed training model, thereby realizing parameter compression.
- This process not only considers factors such as transmission resources, but also considers factors such as the value of parameters, so as to ensure that the compression of parameters can not only ensure communication efficiency, but also ensure the performance of parameters. On the one hand, it avoids the performance loss caused by excessive compression, and on the other hand, it avoids the excessive delay caused by the low compression ratio.
- the method further includes: the first device determines the quantization boundary value corresponding to the first group based on the multiple first parameters in the first group of the multiple groups.
- the value range of the parameters in the divided groups can be specified by the quantization boundary value, so that the amount of data transmitted in the quantization process can be simplified.
- the quantization boundary value includes at least one of the following: a maximum value, or a minimum value, where the maximum value is the maximum value among the multiple first parameters or the maximum value is a multiple of the multiple first parameters.
- the maximum value of the absolute value, the minimum value is the minimum value among the multiple first parameters or the minimum value is the minimum value of the multiple absolute values of the multiple first parameters.
- the maximum value and/or the minimum value as the quantization boundary value, it can be used to specify the numerical interval or absolute value range of the parameter in the divided group.
- the quantization group corresponding to the first group includes quantization boundary values and a plurality of quantization parameters corresponding to the plurality of first parameters.
- the first device receives first indication information from the second device, the first indication information indicates whether to determine the quantization boundary value according to the absolute value; and the first device determines the same as the first
- the quantization boundary value corresponding to the group includes: the first device determines the quantization boundary value based on the first indication information.
- the first device can determine the quantization boundary value based on the indication of the second device, which can reduce the amount of calculation at the first device for determining the quantization mode.
- the method further includes: the first device sends second indication information to the second device, where the second indication information indicates whether the quantization boundary value is determined according to an absolute value.
- the first device can inform the second device of the quantization method through the indication information, so that the second device can correctly perform inverse quantization.
- the number of quantization bits for each group is equal. In this manner, each group uses the same number of quantization bits, which can simplify operations and improve processing efficiency.
- the plurality of groups includes a first group and a second group, and the number of quantization bits corresponding to the first group is not equal to the number of quantization bits corresponding to the second group. In this way, different groups use different numbers of quantization bits, which can fully consider the parameter differences between different groups, and avoid the impact of quantization on performance.
- the method further includes: the first device determining multiple quantization bit numbers corresponding to multiple groups.
- the first device determining the multiple quantization bit numbers corresponding to the multiple groups includes: the first device determining the multiple quantization bit numbers corresponding to the multiple groups based on the configuration of the quantization bit numbers from the second device The number of quantization bits, the configuration of the number of quantization bits includes the quantization threshold and the corresponding number of quantization bits.
- the first device can determine the number of quantization bits based on the configuration of the number of quantization bits from the second device, which can meet the actual needs of the first device and the second device at the same time, and the determined number of quantization bits can take into account communication efficiency and compression performance.
- the method further includes: the first device receives the quantization bit number configuration from the second device.
- the method further includes: the first device sends third indication information to the second device, where the third indication information indicates a plurality of quantization bit numbers.
- the third indication information may implicitly or indirectly indicate multiple quantization bit numbers through indexes in the quantization bit number configuration.
- the first device can send the determined number of quantization bits to the second device, so that the second device can determine the number of quantization bits of each group based on the index, and ensure correct inverse quantization of parameters by the second device.
- the number of quantization bits is determined by the second device, and the method further includes: the first device receives fourth indication information from the second device, and the fourth indication information indicates the The number of quantized bits corresponding to each group.
- the number of quantization bits is indicated by the second device, so that the first device can perform parameter quantization directly based on the indication.
- the operation of the first device is simplified, and the processing efficiency is improved.
- the dividing the parameters of the distributed training model into multiple groups by the first device includes: dividing the parameters of the distributed training model into multiple groups by the first device using a model parameter division manner.
- the first device determines the model parameter division method based on the values of the parameters of the distributed training model; and the first device sends fifth instruction information to the second device, and the fifth The instruction information indicates how the model parameters are divided.
- the first device can determine the parameter division method based on the parameter characteristics, so that different quantization processes can be performed subsequently for different groups, which can ensure compression performance.
- the method further includes: the first device receives sixth indication information from the second device, where the sixth indication information indicates a model parameter division manner.
- the second device can indicate the parameter division manner, so that the first device can perform parameter division directly based on the indication.
- the operation of the first device is simplified, and the processing efficiency is improved.
- the model parameter division mode indicates that the division is based on at least one of the following: the type of the parameter, the type of the network layer where the parameter is located, and the like.
- the division of parameters can be achieved by taking into account different characteristics of the parameters, so that parameters located in the same group can be quantified similarly.
- the first device further includes: the first device performs entropy encoding on multiple quantization groups; and sending the multiple quantization groups from the first device to the second device includes: sending the entropy encoded Encoded multiple quantization groups.
- the first device can further improve the compression rate through entropy coding, so that the communication efficiency can be further improved.
- the method further includes: the first device sends first entropy encoding indication information to the second device, the first entropy encoding indication information having entropy encoded multiple quantization groups.
- the second device can know whether entropy coding is performed, so that it can correctly decode the received parameters.
- the method further includes: the first device receives the second entropy coding indication information from the second device; and performing entropy coding on the multiple quantization groups by the first device includes: the first device based on the second entropy Encoding indication information, performing entropy encoding on multiple quantization groups.
- the first device can perform entropy encoding based on the instruction of the second device, which can further improve the compression rate.
- a data processing method includes: the second device receives multiple quantization groups from the first device; the second device determines multiple groups corresponding to the multiple quantization groups based on the number of quantization bits corresponding to each of the multiple quantization groups, The number of quantized bits is determined based on at least one of the following: the amount of transmission resources between the first device and the second device, the value of the loss function, or the value range of the parameters of each group; and the second device based on the model parameters The division method and multiple groups determine the parameters of the distributed training model.
- the first quantization group of the plurality of quantization groups includes a quantization boundary value and a plurality of quantization parameters.
- the quantization boundary value includes at least one of the following: a maximum value, or a minimum value, the maximum value being the maximum value among the plurality of first parameters in the first group corresponding to the first quantization group or The maximum value is the maximum value of the multiple absolute values of the multiple first parameters, and the minimum value is the minimum value among the multiple first parameters or the minimum value is the minimum value of the multiple absolute values of the multiple first parameters.
- the second device further includes: the second device sends first indication information to the first device, and the first indication information indicates whether to determine the quantization boundary value according to an absolute value; or, the second device receives the information from the first Second indication information of the device, where the second indication information indicates whether the quantization boundary value is determined according to an absolute value.
- each quantization group has the same number of quantization bits.
- the plurality of quantization groups includes a first quantization group and a second quantization group, and the number of quantization bits corresponding to the first quantization group is not equal to the number of quantization bits corresponding to the second quantization group.
- the method further includes: the second device receives third indication information from the first device, where the third indication information indicates a plurality of quantization bit numbers.
- the second device sends the quantization bit number configuration to the first device, where the quantization bit number configuration includes a quantization threshold and a corresponding quantization bit number.
- the method further includes: the second device sending fourth indication information to the first device, where the fourth indication information indicates the number of quantization bits corresponding to each of the multiple groups.
- the method further includes: the second device receives fifth indication information from the first device, where the fifth indication information indicates a model parameter division manner.
- the method further includes: the second device sending sixth indication information to the first device, where the sixth indication information indicates a model parameter division manner.
- the model parameter division mode indicates that the division is based on at least one of the following: the type of the parameter, the type of the network layer where the parameter is located, and the like.
- the second device receiving the plurality of quantization groups from the first device includes: the second device receiving the plurality of entropy encoded quantization groups.
- the method further includes: the second device receives first entropy encoding indication information from the first device, the first entropy encoding indication information having entropy encoded multiple quantization groups.
- the method further includes: the second device sends second entropy encoding indication information to the first device, so as to instruct the first device to perform entropy encoding on multiple quantization groups.
- a communication device configured to include: a division module, configured to divide the parameters of the distributed training model into multiple groups, at least two of the multiple groups have different numerical intervals; a determination module, configured to be based on the parameters of the multiple groups The number of quantized bits corresponding to each group determines multiple quantized groups corresponding to multiple groups, and the number of quantized bits is determined based on at least one of the following: the amount of transmission resources between the communication device and the second device, and the loss function value, or a range of values of parameters of each group; and a sending module configured to send multiple quantization groups to the second device.
- the determining module is configured to determine the quantization boundary value corresponding to the first group based on the plurality of first parameters in the first group of the plurality of groups.
- the quantization boundary value includes at least one of the following: a maximum value, or a minimum value, where the maximum value is the maximum value of the multiple first parameters or the maximum value is a multiple of the multiple first parameters.
- the maximum value of the absolute value, the minimum value is the minimum value among the multiple first parameters or the minimum value is the minimum value of the multiple absolute values of the multiple first parameters.
- the quantization group corresponding to the first group includes quantization boundary values and a plurality of quantization parameters corresponding to the plurality of first parameters.
- it further includes a receiving module configured to receive first indication information from the second device, where the first indication information indicates whether to determine the quantization boundary value according to an absolute value. And the determination module is configured to determine the quantization boundary value based on the first indication information.
- the sending module is configured to send second indication information to the second device, where the second indication information indicates whether the quantization boundary value is determined according to an absolute value.
- the number of quantization bits for each group is equal.
- the plurality of groups includes a first group and a second group, and the number of quantization bits corresponding to the first group is not equal to the number of quantization bits corresponding to the second group.
- the determining module is configured to determine a plurality of quantization bit numbers corresponding to a plurality of groups.
- the determination module is configured to determine a plurality of quantization bit numbers corresponding to a plurality of groups based on the quantization bit number configuration from the second device, the quantization bit number configuration including a quantization threshold and a corresponding quantization bit number number of bits.
- the receiving module is configured to receive the quantization bit number configuration from the second device.
- the sending module is configured to send third indication information to the second device, where the third indication information indicates a plurality of quantization bit numbers.
- the number of quantized bits is determined by the second device, and the receiving module is configured to receive fourth indication information from the second device, where the fourth indication information indicates that each group in the plurality of groups The corresponding number of quantized bits.
- the division module is configured to divide the parameters of the distributed training model into multiple groups using a model parameter division manner.
- the determining module is configured to determine the model parameter division method based on the values of the parameters of the distributed training model; and the sending module is configured to send fifth indication information to the second device, the first The five indication information indicates the division mode of the model parameters.
- the receiving module is configured to receive sixth indication information from the second device, where the sixth indication information indicates a model parameter division manner.
- the model parameter division mode indicates that the division is based on at least one of the following: the type of the parameter, the type of the network layer where the parameter is located, and the like.
- an entropy encoding module configured to entropy encode the plurality of quantization groups; and the sending module configured to send the entropy encoded plurality of quantization groups to the second device.
- the sending module is configured to send first entropy encoding indication information to the second device, where the first entropy encoding indication information has entropy encoded a plurality of quantization groups.
- the receiving module is configured to receive second entropy encoding indication information from the second device; and the entropy encoding module is configured to perform entropy entropy on multiple quantization groups based on the second entropy encoding indication information coding.
- a communication device configured to include: a receiving module configured to receive a plurality of quantization groups from the first device; a first determination module configured to determine the number of quantization bits corresponding to each quantization group in the plurality of quantization groups multiple quantization groups corresponding to multiple groups, the number of quantization bits is determined based on at least one of the following: the amount of transmission resources between the first device and the communication device, the value of the loss function, or the value range of the parameters of each group ; and a second determination module configured to determine parameters of the distributed training model based on the model parameter division method and multiple groups.
- the first quantization group of the plurality of quantization groups includes a quantization boundary value and a plurality of quantization parameters.
- the quantization boundary value includes at least one of the following: a maximum value, or a minimum value, and the maximum value is the maximum value of a plurality of first parameters in the first group corresponding to the first quantization group or The maximum value is the maximum value of the multiple absolute values of the multiple first parameters, and the minimum value is the minimum value among the multiple first parameters or the minimum value is the minimum value of the multiple absolute values of the multiple first parameters.
- it further includes a sending module configured to send first indication information to the first device, where the first indication information indicates whether to determine the quantization boundary value according to an absolute value.
- the receiving module is configured to receive second indication information from the first device, the second indication information indicating whether the quantization boundary value is determined according to an absolute value.
- the number of quantization bits of each quantization group is equal.
- the plurality of quantization groups includes a first quantization group and a second quantization group, and the number of quantization bits corresponding to the first quantization group is not equal to the number of quantization bits corresponding to the second quantization group.
- the receiving module is configured to receive third indication information from the first device, where the third indication information indicates a plurality of quantization bit numbers.
- the sending module is configured to send the quantization bit number configuration to the first device, where the quantization bit number configuration includes a quantization threshold and a corresponding quantization bit number.
- the sending module is configured to send fourth indication information to the first device, where the fourth indication information indicates the number of quantized bits corresponding to each of the multiple groups.
- the receiving module is configured to receive fifth indication information from the first device, where the fifth indication information indicates a model parameter division manner.
- the sending module is configured to send sixth indication information to the first device, where the sixth indication information indicates a model parameter division manner.
- the model parameter division mode indicates that the division is based on at least one of the following: the type of the parameter, the type of the network layer where the parameter is located, and the like.
- the receiving module is configured to receive the entropy encoded plurality of quantization groups.
- the receiving module is configured to receive first entropy encoding indication information from the first device, the first entropy encoding indication information having entropy encoded a plurality of quantization groups.
- the sending module is configured to send the second entropy encoding indication information to the first device, so as to instruct the first device to perform entropy encoding on the multiple quantization groups.
- a communication device in a fifth aspect of the present disclosure, includes a processor and a memory, and the memory stores instructions executed by the processor, and when the instructions are executed by the processor, the communication device implements the operation of the method in the aforementioned first aspect or any embodiment thereof , or implement the operations according to the method in the above second aspect or any embodiment thereof.
- the communication device further includes a transceiver for receiving information from another device, sending information to another device, and the like.
- a computer-readable storage medium on which computer-executable instructions are stored.
- the above-mentioned first aspect or its The operation of the method in any embodiment, or the operation of the method according to the above second aspect or any embodiment thereof.
- a chip or a chip system in a seventh aspect of the present disclosure, includes a processing circuit configured to perform operations according to the method in the above first aspect or any embodiment thereof, or perform operations according to the method in the above second aspect or any embodiment thereof.
- a computer program or computer program product is provided.
- the computer program or computer program product is tangibly stored on a computer-readable medium and comprises computer-executable instructions which, when executed, implement the operations according to the method in the above-mentioned first aspect or any of its embodiments, Or implement the operations according to the method in the above second aspect or any embodiment thereof.
- FIG. 1 shows a schematic diagram of an example scenario where an embodiment of the present disclosure can be applied
- Fig. 2 shows a signaling interaction diagram of an information processing process according to some embodiments of the present disclosure
- Fig. 3 shows a schematic diagram of parameter division according to some embodiments of the present disclosure
- Fig. 4 shows a schematic diagram of parameter division according to some embodiments of the present disclosure
- Fig. 5 shows a schematic diagram of parameter division according to some embodiments of the present disclosure
- Fig. 6 shows a signaling interaction diagram of an information transmission process according to some embodiments of the present disclosure
- FIG. 7 shows a schematic diagram of a compressed configuration message according to some embodiments of the present disclosure
- Fig. 8 shows a schematic diagram of compressed data according to some embodiments of the present disclosure
- FIG. 9 shows a signaling interaction diagram of an information transmission process according to some embodiments of the present disclosure.
- Fig. 10 shows a schematic diagram of a quantization indication message according to some embodiments of the present disclosure
- Fig. 11 shows a schematic diagram of compressed data according to some embodiments of the present disclosure
- Fig. 12 shows a schematic block diagram of a communication device according to some embodiments of the present disclosure
- Figure 13 shows a schematic block diagram of a communication device according to some embodiments of the present disclosure.
- Fig. 14 shows a schematic block diagram of an example device that may be used to implement embodiments of the present disclosure.
- the term “comprising” and its similar expressions should be interpreted as an open inclusion, that is, “including but not limited to”.
- the term “based on” should be understood as “based at least in part on”.
- the term “one embodiment” or “the embodiment” should be read as “at least one embodiment”.
- the terms “first”, “second”, etc. may refer to different or the same object.
- the term “plurality” means two or more. Other definitions, both express and implied, may also be included below.
- Embodiments of the present disclosure may be implemented according to appropriate communication protocols, including but not limited to, third generation (3rd Generation, 3G), fourth generation (4G), fifth generation (5G), sixth generation (6G), etc.
- Cellular communication protocols including but not limited to, third generation (3rd Generation, 3G), fourth generation (4G), fifth generation (5G), sixth generation (6G), etc.
- Cellular communication protocols wireless local area network communication protocols such as Institute of Electrical and Electronics Engineers (Institute of Electrical and Electronics Engineers, IEEE) 802.11, future evolved mobile communication systems such as 6G communication system protocols, and/or currently known or future developed other agreements.
- the technical solution of the embodiment of the present disclosure is applied to a communication system following any appropriate communication protocol, such as: Universal Mobile Telecommunications System (Universal Mobile Telecommunications Service, UMTS), Long Term Evolution (Long Term Evolution, LTE) system, wideband code division multiple access System (Wideband Code Division Multiple Access, WCDMA), Code Division Multiple Access 2000 system (Code Division Multiple Access, CDMA2000), Time Division Synchronization Code Division Multiple Access system (Time Division-Synchronization Code Division Multiple Access, TD-SCDMA), frequency division Duplex (Frequency Division Duplex, FDD) system, Time Division Duplex (Time Division Duplex, TDD), fifth generation (5G) system or new radio (New Radio, NR), future evolution of 6G, 7G communication systems, etc. .
- Universal Mobile Telecommunications System Universal Mobile Telecommunications Service, UMTS
- Long Term Evolution Long Term Evolution, LTE
- WCDMA Wideband Code Division Multiple Access
- CDMA2000 Code Division Multiple Access 2000 system
- WLAN wireless local area network
- wired communication system any communication system with similar problems, such as a Wi-Fi network, a Wi-Fi network, or other communication systems developed in the future.
- terminal device refers to any terminal device capable of wired or wireless communication with network devices or with each other.
- the terminal equipment may sometimes be called user equipment (User Equipment, UE).
- a terminal device may be any type of mobile terminal, stationary terminal or portable terminal.
- terminal equipment may include a mobile handset, station, unit, device, mobile terminal (Mobile Terminal, MT), subscription station, portable subscription station, Internet node, communicator, desktop computer, laptop computer, notebook computer, tablet Computers, personal communication system devices, personal navigation devices, personal digital assistants (Personal Digital Assistant, PDA), positioning devices, radio broadcast receivers, e-book devices, game devices, Internet of Things (IoT) devices, vehicle-mounted devices , aircraft, virtual reality (Virtual Reality, VR) devices, augmented reality (Augmented Reality, AR) devices, wearable devices, terminal devices in 5G networks or evolved public land mobile networks (Public Land Mobile Network, PLMN) Any terminal device, other device that can be used for communication, or any combination of the above. Embodiments of the present disclosure do not limit this.
- the term "network device” used in this disclosure is an entity or node that can be used to communicate with a terminal device, such as an access network device.
- the access network device may be a device deployed in the radio access network to provide a wireless communication function for the mobile terminal, for example, it may be a radio access network (Radio Access Network, RAN) network device.
- Access network equipment may include various types of base stations.
- the access network equipment may include various forms of macro base stations, micro base stations, pico base stations, femto base stations, relay stations, access points, satellites, remote radio units (Remote Radio Unit, RRU), radio head (Radio Head , RH), remote radio head (Remote Radio Head, RRH), etc., can also be a device that undertakes the base station function in device-to-device (D2D) communication and machine-to-machine (M2M) communication.
- D2D device-to-device
- M2M machine-to-machine
- the names of access network equipment may be different, for example, in a Long Term Evolution (LTE) network, it is called an evolved NodeB (evolved NodeB, eNB or eNodeB), which is called Node B (NodeB, NB) in 3G network, can be called gNode B (gNB) or NR Node B (NR NB) in 5G network, and so on.
- the access network device may include a central unit (Central Unit, CU) and/or a distributed unit (Distributed Unit, DU).
- CU and DU can be placed in different places, for example: DU is remote and placed in a high-traffic area, and CU is placed in the central computer room.
- the CU and DU can also be placed in the same equipment room.
- the CU and DU can also be different components under one rack.
- the above-mentioned apparatuses for providing wireless communication functions for mobile terminals are collectively referred to as network devices, which are not specifically limited in the embodiments of the present disclosure.
- Federated learning was proposed in 2016. It is essentially a distributed learning framework, which aims to realize the joint construction of a global model on the basis of ensuring data privacy and security. Federated learning allows users to form a federation on the premise that the data is kept locally and not shared, and the global model is obtained through joint training, thereby avoiding data privacy and security issues.
- the embodiments of the present disclosure provide a parameter compression scheme, which can divide the parameters into multiple groups and quantize each group to compress the parameters, so that the performance of the data can be guaranteed, and the parameters can be compressed. Improve communication efficiency.
- FIG. 1 shows a schematic diagram of an example scenario 100 in which embodiments of the present disclosure can be applied.
- the scene 100 includes a first device 110 - 1 to a first device 110 -N and a second device 120 .
- the first device 110 - 1 to the first device 110 -N may be collectively referred to as the first device 110 .
- the first device 110 may be implemented as a client device, and the second device 120 may be implemented as a server device. In some examples, the first device 110 may be implemented as a terminal device, and the second device 120 may be implemented as an access network device. In some examples, the first device 110 may be called an execution node in distributed machine learning, and the second device 120 may be called an execution anchor in distributed machine learning. This disclosure is not limited thereto.
- the second device 120 can send the initial model parameters of the neural network to N1 first devices 110, for example, in the form of broadcast or multicast .
- the N1 first devices 110 may perform learning, such as training or iteration, on the basis of initial model parameters based on the local data set, so as to obtain processed parameters, which may include, for example, local model parameters or gradient parameters.
- Each of the N1 first devices 110 compresses the obtained processed parameters and sends them to the second device 120 .
- the second device 120 decompresses the processed parameters from each of the N1 first devices 110 , and obtains updated model parameters through aggregation. In this way, the model parameters of the neural network at the second device 120 are changed from the initial model parameters to the updated model parameters.
- the next round of training or learning can be performed to obtain further updated model parameters. That is to say, the second device 120 may send the updated model parameters to the N2 first devices 110 , and the N2 first devices 110 obtain reprocessed parameters and send them back to the second device 120 . This is repeated until the parameters converge, or until the training fails, such as reaching the maximum number of iterations.
- the parameters sent by the second device 120 to the first device 110 may be collectively referred to as the parameters to be trained, such as the initial model parameters of the neural network, or the aggregated and updated parameters obtained after the last round of training. Model parameters.
- the same or different first devices 110 may be equal or unequal, for example, the above-mentioned N1 first devices 110 and N2 first devices 110 They may be completely different, partly the same, or all of them may be the same.
- the N1 first devices 110 include the first device 110-1, and the N2 first devices 110 include or do not include the first device 110-1.
- the second device 120 receives compressed parameters from one or more first devices 110 in each round.
- the compression scheme used by each first device 110 is fixed. In some other embodiments, in multiple rounds of interaction, the compression scheme used by each first device 110 may be dynamically adjusted, for example, in an intermediate round of interaction, the compression scheme may be updated based on the attribute of the parameter. In order to simplify the description, in the following embodiment, one compression by the first device 110 is taken as an example for illustration, but it can be understood that the processes for each compression in multiple rounds of interaction are similar.
- Fig. 2 shows a signaling interaction diagram of an information processing procedure 200 according to some embodiments of the present disclosure.
- the process 200 involves a first device 110 and a second device 120, and the first device 110 in FIG. 3 refers to any one of the first device 110-1 to the first device 110-N in FIG. 1 .
- the first device 110 divides 210 the parameters of the distributed training model into a plurality of groups.
- Each group includes multiple parameters, and the number of parameters in different groups may be the same or different.
- the first group includes M1 parameters
- the second group includes M2 parameters, and M1 and M2 may or may not be equal.
- the values of multiple parameters in different groups can be in the same interval or different intervals. Taking the first group among multiple groups as an example, the first group includes M1 parameters, then the value interval of the first group can be determined based on the minimum value of the values of the M1 parameters and the maximum value of the values of the M1 parameters , for example, it can be assumed to be a closed interval [a1,b1].
- the value range of the second group can be determined, for example, it can be assumed to be [a2, b2].
- a1 ⁇ a2 and/or b1 ⁇ b2, that is, the value range of the first group is different from the value range of the second group.
- the first group here may be any one of the multiple groups, and the second group may be any one of the multiple groups that is different from the first group. Exemplarily, at least two of the multiple groups have different value intervals.
- the closed interval [a1, b1] of this application can also be an open interval (a1, b1), an open and closed interval (a1, b1], or a closed and open interval [a1, b1), which does not affect the essence of this application.
- [a1, b1] is taken as an example in this application, and [a2, b2] is also similar.
- any two groups in the multiple groups have the following differences: the value interval of the value of the parameter, the type of the parameter, or the type of the network layer to which the parameter belongs, where the type of the parameter is, for example, Weight, bias, variance, mean, etc., where the type of network layer to which the parameter belongs is, for example, a convolutional layer, a fully connected layer, etc.
- the parameters of the distributed training model may be that the first device 110 utilizes a local (that is, stored in the memory of the first device 110) data set, based on the model parameters from the last round of the second device 120, after learning or training owned.
- the first device 110 may divide the parameters of the distributed training model into multiple groups based on the parameter division manner.
- the parameter division method may indicate that the parameter division is based on at least one of the following: the type of the parameter, the type of the network layer where the parameter is located, and the like.
- Types of network layers include, for example, convolutional layers, fully connected layers, and the like.
- Types of parameters include, for example, weight parameters, bias parameters, mean parameters, and the like.
- FIG. 3 shows a schematic diagram of parameter division 300 according to some embodiments of the present disclosure.
- parameters representing weights in each convolutional layer may be divided into a group, such as group 310 .
- the parameters representing the bias in each convolutional layer are divided into a group, such as group 320 .
- Parameters representing weights in each fully connected layer are divided into a group, such as group 330 .
- the parameter division manner described with reference to FIG. 3 may be called a type-based division manner.
- FIG. 4 shows a schematic diagram of parameter division 400 according to some embodiments of the present disclosure.
- parameters representing weights in the convolutional layer 401 can be divided into a group, such as group 410 .
- Parameters representing biases in the convolutional layer 401 are divided into groups, such as group 420 .
- Parameters representing weights in the convolutional layer 402 are divided into groups, such as group 430 .
- the parameters representing biases in the convolutional layer 402 are divided into groups, such as group 440 .
- the parameter division manner described with reference to FIG. 4 may be referred to as a matrix-based division manner.
- FIG. 5 shows a schematic diagram of parameter division 500 according to some embodiments of the present disclosure.
- the parameters in the convolutional layer 501 can be divided into a group, such as group 510 .
- the parameters in batch normalization layer 502 are divided into groups, such as group 520 .
- the parameter division manner described with reference to FIG. 5 may be referred to as a network layer-based division manner.
- the parameter division method may also indicate that the parameters are divided according to other characteristics of the parameters, which will not be listed here.
- the parameter division method may be determined by the second device 120 and then indicated to the first device 110, or the parameter division method may be determined by the first device 110, as follows in conjunction with the embodiments of FIG. 6 to FIG. 11 Elaborate.
- the first device 110 determines 220 a plurality of quantization groups corresponding to the plurality of groups based on the number of quantization bits corresponding to each of the plurality of groups.
- the number of quantization bits may be used to indicate the length of each quantization parameter in the corresponding quantization group, that is, the number of bits occupied.
- the storage space occupied by multiple quantization groups is smaller than the storage space occupied by multiple quantization groups, for example, it may be 50% or other values, so as to realize the compression of multiple groups.
- Each quantization group may include a plurality of quantization parameters, wherein the length of each quantization parameter in the plurality of quantization parameters is equal to or smaller than the corresponding number of quantization bits.
- the first group among multiple groups as an example, assuming that the first group includes M1 parameters, and the number of quantization bits corresponding to the first group is n1, then the first quantization group corresponding to the first group includes M1 quantization bits parameter, and the number of bits of each quantization parameter in the M1 quantization parameters may be equal to n1.
- Each quantization group may include a quantization boundary value and a plurality of quantization parameters, wherein the quantization boundary value may be determined based on the value range of a plurality of parameters in the corresponding group, wherein each quantization parameter in the plurality of quantization parameters The length is equal to or less than the corresponding number of quantization bits.
- the first device 110 may determine a quantization boundary value determination manner for each of the multiple groups. In one example, the first device 110 may determine a quantization boundary value determination manner, and the quantization boundary value determination manner is applied to each group. In another example, for each of the multiple groups, the first device 110 may respectively determine a corresponding quantization boundary value determination manner.
- the determination method of the quantization boundary value may indicate whether to determine the quantization boundary value according to an absolute value.
- determining the quantization boundary value not according to the absolute value includes: determining the maximum and minimum values of the multiple parameters in the group as the quantization boundary value. Taking the first group among the multiple groups as an example, the first group includes M1 parameters, and the value interval of the M1 parameters is a closed interval [a1, b1], then the quantization boundary value can be determined correspondingly including a1 and b1 .
- determining the quantization boundary value according to the absolute value includes: determining the maximum and minimum values of the absolute values of the multiple parameters in the group as the quantization boundary value; or determining the maximum value of the absolute values of the multiple parameters in the group as Quantize boundary values.
- the first group includes M1 parameters, and the value range of M1 parameters is a closed interval [a1, b1]. If a1 and b1 are both positive values, then optional It can be determined that the quantization boundary value includes a1 and b1; if a1 is a negative value and b1 is a positive value, then optionally the quantization boundary value can be determined to include max(-a1, b1), where max means taking the maximum value.
- the quantization boundary value determination method that does not determine the quantization boundary value according to the absolute value can be called the traditional quantization method, and the quantization boundary value determination method that determines the quantization boundary value according to the absolute value can be called the symbol quantization method, but should It is understood that this designation is only indicative and should not be construed as limiting the embodiments of the present disclosure.
- the method of determining the quantization boundary value may also indicate to determine according to other value characteristics of the parameters, which will not be listed here.
- the quantization boundary value determination method may be determined by the second device 120 and then indicated to the first device 110, or the quantization boundary value determination method may be determined by the first device 110, or may be determined by the first device 110 is determined based on the configuration of the second device 120 , which will be described in detail below in conjunction with the embodiments of FIG. 6 to FIG. 11 .
- the number of quantization bits for each group may be equal.
- the first device 110 or the second device 120 may determine the number of quantization bits and use the number of quantization bits for each group. In this way, the lengths of the individual quantization parameters in each quantization group are equal.
- at least two of the plurality of groups have different numbers of quantization bits.
- the first device 110 or the second device 120 may respectively determine the corresponding number of quantization bits for each group. In this way, the lengths of quantization parameters located in different quantization groups may or may not be equal.
- the first group includes M1 parameters, and the number of quantization bits corresponding to the first group is n1
- the first quantization group corresponding to the first group includes M1 quantization parameters, and each of the M1 quantization parameters is quantized
- the number of bits of the parameter may be equal to n1.
- the second group includes M2 parameters, and the number of quantization bits corresponding to the second group is n2, then the second quantization group corresponding to the second group includes M2 quantization parameters, and the bit of each quantization parameter in the M2 quantization parameters
- the number can be equal to n2.
- each quantization parameter may include at least one bit for indicating the sign of the corresponding parameter.
- the quantization boundary value determination method of a certain quantization group is the symbol quantization method (that is, the quantization boundary value is determined according to the absolute value)
- the number of bits of the quantization parameter in the quantization group is equal to n1
- n1 bits Any bit in (for example, the first bit) indicates its positive or negative, and the remaining n1-1 bits indicate the parameter size.
- the number of quantized bits may be determined by the second device 120 and then indicated to the first device 110, or the number of quantized bits may be determined by the first device 110, or may be determined by the first device 110 It is determined based on the configuration of the second device 120 , which is described in detail below in conjunction with the embodiments of FIG. 6 to FIG. 11 .
- the first device 110 may perform entropy encoding 222 on multiple quantization groups.
- entropy encoding may be performed based on an indication from the second device 120 .
- the first device 110 may decide to perform entropy encoding by itself and notify the second device 120 .
- the first device 110 sends 230 quantization groups to the second device 120 .
- the multiple quantization groups that are sent are entropy coded multiple quantization groups.
- the second device 120 can receive multiple quantization groups from the first device 110 . It can be understood that, in some examples, the second device 120 previously sent the model parameters of the last round to multiple first devices 110, then correspondingly, the second device 120 can receive data from each of the multiple first devices 110. 110 multiple quantized groups sent.
- the second device 120 determines 240 a plurality of groups corresponding to the plurality of quantization groups.
- the second device 120 may perform inverse quantization based on the quantization manner and the number of quantization bits, so as to obtain multiple groups.
- the second device 120 determines 250 model parameters.
- the second device 120 may determine a position in the network model where each parameter in the dequantized multiple groups is located based on a parameter division manner.
- the second device 120 can obtain the model parameters obtained through the current round of training.
- the second device 120 may obtain the parameters to be trained in the next round by aggregation based on model parameters from multiple first devices.
- the subsequent process is similar to the above and will not be repeated in this article.
- FIG. 6 shows a signaling interaction diagram of an information transmission process 600 according to some embodiments of the present disclosure.
- the process 600 involves the first device 110 and the second device 120, and the first device 110 in FIG. 6 may include any one of the first device 110-1 to the first device 110-N in FIG. 1 .
- the second device 120 determines 610 to compress the configuration message.
- the compressed configuration message includes first indication information, and the first indication information indicates a parameter division manner.
- the second device 120 may determine a parameter division manner.
- the second device 120 may determine the parameter division method based on at least one of the following factors: parameter attributes of the parameters to be trained, network structure of the model to be trained, communication efficiency, and model performance.
- the parameters to be trained may be initial model parameters, or may be updated model parameters obtained based on a previous round of training.
- Parameter attributes may include: the value range of the parameter, the type of the parameter, the network layer where the parameter is located, etc., where the type of the parameter includes, for example, weight, bias, variance, mean, etc.
- the network structure may include the type of the network layer in the model, the number of the network layer, etc., where the type of the network layer includes, for example, a convolutional layer, a fully connected layer, and the like.
- the parameter division manner determined by the second device 120 may be a type-based division manner as shown in FIG. 3 or a matrix-based division manner as shown in FIG. 4 or a network layer-based division manner as shown in FIG. Divide the way.
- the first indication information may be indicated by 1 bit, for example, "0” indicates a matrix-based division manner, and "1" indicates a type-based division manner. In some examples, the first indication information may be indicated by 2 bits, for example, "00” indicates a matrix-based division manner, “01” indicates a type-based division manner, and "10” indicates a network layer-based division manner. In some other examples, the first indication information may also be indicated by using more bits, which is not limited in the present disclosure.
- the signaling overhead of the type-based partitioning method is smaller than that of the matrix-based partitioning method, but the type-based partitioning method has The model performance is not as good as that of the matrix-based partitioning approach. In practice, a better compromise can be made based on communication efficiency and model performance to determine the appropriate parameter division method.
- the number of parameters to be trained is s
- it can be divided into r1 groups according to the type-based division method, and can be divided into r2 groups according to the matrix-based division method. Then if (r2-r1) ⁇ 32 ⁇ 2 ⁇ s ⁇ d is satisfied, then the parameter division method is determined to be the matrix-based division method. On the contrary, if (r2-r1) ⁇ 32 ⁇ 2 ⁇ s ⁇ d is satisfied, it is determined that the parameter division method is a type-based division method.
- s, r1 and r2 are all positive integers, 32 represents the number of bits occupied by floating point numbers in the computer system, d is a predetermined value, for example, d can be the number of bits of two different parameter division methods determined by the user based on empirical information difference.
- the number of parameters to be trained is s
- s/r1 is smaller than the preset threshold, it is determined that the parameter division method is the type-based division method.
- s/r1 is greater than or equal to the preset threshold, it is determined that the parameter division method is a matrix-based division method.
- the preset threshold may be predefined by the user, or the preset threshold may be determined based on a historical division manner, for example equal to the maximum value of the number of average parameters of each group in the historically trained model.
- a threshold value of the number of parameters in each group after division can also be set
- the number of parameters in each group is between a first number threshold and a second number threshold, and the first number threshold is smaller than the second number threshold.
- group 1 may be further divided into at least two groups. In this way, the performance loss caused by an excessive number of parameters in a certain group can be avoided.
- group 2 may be merged with other groups. In this way, compression efficiency can be further improved, and excessive signaling overhead can be avoided.
- the compression configuration message may include second indication information. That is to say, the compression configuration message includes first indication information and second indication information.
- the second indication information may indicate a quantization boundary value determination mode (or quantization mode for short).
- the second device 120 may determine a quantization boundary value determination manner, and the quantization boundary value determination manner will be used for each group, that is, the quantization boundary value determination manner of each group is the same.
- the second device 120 may use the quantization manner most frequently used by other models trained in history as the quantization boundary value determination manner.
- the second device 120 may determine the manner of determining the quantization boundary value based on the value distribution of the parameter. For example, if the value distribution of the parameter is uneven, it can be determined to use the traditional quantization method, that is, to consider the value of positive and negative numbers, which can reduce the error. For example, multiple parameters in a certain group are all positive or negative, for example, the difference between the maximum value (positive) and the minimum value (negative) in a certain group is large.
- the parameter is the gradient of the reverse update or the residual parameter of the model, the sign of the parameter is more important at this time, then it can be determined to use the symbol quantization method, that is, quantization based on the absolute value, so as to ensure that each parameter remains Correct update direction to avoid inaccurate gradient direction of parameters near 0.
- the symbol quantization method that is, quantization based on the absolute value
- the minimum value of the parameter value may be 0 or close to 0.
- the quantization boundary value may include the maximum value of the absolute value but not the minimum value of the absolute value, so that Save signaling overhead.
- the second indication information may be indicated by 1 bit, for example, "0" indicates a traditional quantization mode, and "1" indicates a symbol quantization mode.
- the second indication information may also be indicated by more bits, for example, it may be indicated by 2 bits, wherein one bit indicates that multiple groups use the same quantization method, and the other bit indicates the quantization method used by each group way, which is not limited in the present disclosure.
- the second indication information may indicate the quantization mode configuration, and the second indication information may be used by the first device 110 to determine the quantization mode.
- the second indication information may be indicated by 1 bit, for example, "0" indicates that multiple groups use the same quantization mode, and "1" indicates that a quantization mode is determined for each group.
- the second indication information may be indicated by m1 bits, for example, one bit (such as the first bit) is used to indicate whether each group uses the same quantization method, and the other m1-1 bits indicate whether to use Used to determine the threshold of the quantization method.
- the second indication information may be indicated by m1-1 bits, indicating a threshold for determining a quantization manner. Exemplarily, for a specific embodiment of determining the quantization mode based on the configuration of the quantization mode, reference may be made to the description in conjunction with 650 below.
- the compression configuration message further includes third indication information.
- the compression configuration message may include first indication information, second indication information, and third indication information.
- the third indication information may indicate the number of quantization bits.
- the second device 120 may determine the number of quantization bits, and the number of quantization bits will be used for each group, that is, the number of quantization bits for each group is the same.
- the number of quantization bits used for all groups may be referred to as the number of round quantization bits or the number of coarse-grained quantization bits.
- the second device 120 may determine the number of quantization bits based on transmission resources between the first device 110 and the second device 120 and/or the number of network layers of the distributed neural network.
- the second device 120 may allocate transmission resources from the first device 110 to the second device 120 for the first device 110, and determine the number of quantization bits based on the allocated transmission resources.
- the third indication information may be indicated by m2 bits, which may indicate that the maximum number of quantization bits is 2 m2 . In some other examples, the third indication information may also be indicated by more or less bits, which is not limited in the present disclosure.
- the third indication information may indicate the configuration of the number of quantization bits, and the third indication information may be used by the first device 110 to determine the number of quantization bits.
- the quantization bit number configuration may indicate quantization granularity and a quantization threshold and quantization bit number list for determining the quantization bit number, wherein the quantization granularity may indicate whether multiple groups use the same quantization bit number.
- the quantization granularity may indicate that multiple groups use the same quantization bit number, for example, the round quantization bit number or the coarse-grained quantization bit number.
- the quantization granularity may indicate a coarse granularity.
- the quantization granularity may indicate that multiple groups use different quantization bit numbers, for example, a fine-grained quantization bit number, and optionally, the quantization granularity may indicate fine granularity.
- quantization granularity may indicate a combination of coarse and fine granularity.
- the quantization bit number configuration may include a quantization threshold number and a quantization threshold list, where the quantization threshold number is used to indicate the number of quantization thresholds in the quantization threshold list, and the quantization threshold list may include multiple quantization thresholds.
- the number of quantization thresholds is K, and K is a positive integer, and it is assumed that the list of quantization thresholds includes (T 1 , T 2 , . . . , T K ).
- the configuration of quantization bit numbers may also include a list continuity indication and a quantization list, wherein the list continuity indication is used to indicate the representation form of the quantization list, and the quantization list includes one or more quantization bit numbers.
- the list continuity indication may be represented by 1 bit.
- the quantization list may include (b 0 , b 1 , b 2 , . . . , b K ). If continuous is indicated, the quantization list may include b 0 and/or b K .
- quantization granularity may indicate a combination of coarse and fine granularity.
- the quantization bit number configuration may include the first quantization threshold number corresponding to the coarse grain, the first quantization threshold list, the first list continuity indication and the first quantization list, and the quantization bit number configuration further includes the second quantization threshold corresponding to the fine grain Quantity, Second Quantization Threshold List, Second List Continuity Indicator, and Second Quantization List.
- the configuration of the number of quantization bits may not include the second list continuity indication and the second quantization list, and in actual scenarios, the first list continuity indication and the first quantization list corresponding to the coarse grain may be used to determine the second list corresponding to the coarse grain. Two lists of continuity indication and a second quantization list.
- the configuration of the number of quantization bits may indicate a preset condition, so that when the first device 110 determines that the preset condition is satisfied, it may consider that the number of quantization bits is 0, or it may be understood that the first device 110 is After the current round of iterative training is completed, the processed parameters are not sent to the second device 120 .
- the second device 120 can determine the processed parameters of the current round based on the parameters of the previous round, so the second device 120 can prevent the first device 110 from sending the processed parameters of the current round through this configuration, which can save signaling overhead.
- the compression configuration message further includes entropy coding indication information.
- the compression configuration message may include first indication information, second indication information, and entropy coding indication information.
- the compression configuration message may include first indication information, second indication information, third indication information, and entropy coding indication information.
- the entropy coding indication information may indicate whether to perform entropy coding. For example, it may indicate whether the first device 110 performs arithmetic coding after quantizing the processed parameters.
- the entropy encoding indication information may be indicated by 1 bit, for example, "0" indicates that entropy encoding is performed, and "1" indicates that entropy encoding is not performed.
- the entropy coding indication information may also be indicated by more bits, which is not limited in the present disclosure.
- FIG. 7 shows a schematic diagram of a compressed configuration message 700 according to some embodiments of the present disclosure.
- the compression configuration message 700 includes first indication information 710 and second indication information 720 , and may optionally further include third indication information 730 and/or entropy coding indication information 740 .
- the second indication information 720 may indicate a quantization mode 722 or may indicate a quantization mode configuration 724 .
- the third indication information 730 may indicate a quantization bit number 732 or may indicate a quantization bit number configuration 734 , wherein the quantization bit number configuration 734 may include a quantization granularity 7342 and a quantization threshold and a quantization bit number list 7344 .
- the second device 120 sends 620 a compressed configuration message to the first device 110 .
- the second device 120 may send the compressed configuration message through signaling, for example, the second device 120 is an access network device, and the first device 110 is a terminal device, then the signaling may optionally be a physical layer signal
- the signaling may be radio resource control layer (Radio Resource Control, RRC) signaling.
- RRC Radio Resource Control
- the second device 120 sends 630 the parameters to be trained to the first device 110 .
- the parameters to be trained may be the initial model parameters of the above-mentioned neural network, or may be updated model parameters obtained after a previous round of training.
- the parameters to be trained may include parameters associated with the network structure of the neural network, such as weight parameters and the like.
- the parameters to be trained may include parameters used for iteration of the neural network, such as gradient parameters and the like.
- the second device 120 may send the compression configuration message together with the parameters to be trained to the first device 110 . In some other examples, the second device 120 may send the compressed configuration message and the parameters to be trained to the first device 110 through different signaling, which is not limited in the present disclosure.
- the second device 120 sends a compression configuration message and parameters to be trained to multiple first devices 110, for example, N1 first devices 110 or N2 first devices 110 described in conjunction with FIG. 1 .
- first devices 110 for example, N1 first devices 110 or N2 first devices 110 described in conjunction with FIG. 1 .
- the embodiment of the present disclosure is described with one first device 110 , and the operations performed by other first devices 110 are similar, and will not be repeated herein.
- the first device 110 may receive the compressed configuration message and the parameters to be trained from the second device 120 .
- the first device 110 may obtain 640 parameters of the distributed training model based on the parameters to be trained.
- the first device 110 may use the local data set to train the network model on the basis of the parameters to be trained, so as to obtain the parameters after the current round of training, that is, the parameters of the distributed training model.
- the first device 110 may determine 650 a parameter division mode, a quantization mode, and a number of quantization bits based on the compression configuration message. Corresponding embodiments in which the first device 110 determines the parameter division mode, the quantization mode, and the number of quantization bits will be described below through different compression configuration messages.
- the compression configuration message includes first indication information, second indication information and third indication information, wherein the first indication information indicates the parameter division method, the second indication information indicates the quantization mode, and the third indication information Indicates the number of quantization bits. Then, the first device 110 can read the parameter division mode, quantization mode and quantization bit number from the compression configuration message. It can be understood that the quantization method is used for a plurality of groups divided using the parameter division method, and the quantization bit number is used for a plurality of groups.
- the compression configuration message includes first indication information and second indication information, wherein the first indication information indicates a parameter division manner, and the second indication information indicates a quantization manner.
- the compressed configuration message does not include the third indication information. Then the first device 110 can read the parameter division mode and quantization mode from the compression configuration message, and the first device 110 can determine the number of quantization bits.
- the first device 110 may determine the number of quantization bits based on the value of the parameter, the distribution of the value of the parameter, and the like.
- the first device 110 may determine the number of quantization bits based on one or more of the following: transmission resources from the first device 110 to the second device 120, the number of network layers of the distributed neural network, and the training loss function.
- the first device 110 may determine the number of quantization bits and use it for each group, that is, the number of quantization bits in a coarse grain. Alternatively, the first device 110 may separately determine the number of quantization bits for each group, that is, the number of quantization bits in a fine-grained manner.
- the compression configuration message includes first indication information and second indication information, wherein the first indication information indicates a parameter division mode, and the second indication information indicates a quantization mode configuration.
- the compressed configuration message does not include the third indication information. Then the first device 110 can read the parameter division manner from the compressed configuration message.
- the first device 110 may determine the quantization scheme based on the quantization scheme configuration.
- the quantization mode configuration indicates that multiple groups use the same quantization mode, then the first device 110 may determine the quantization mode, and the quantization mode is used for each group.
- the configuration of the quantization mode indicates that the quantization modes are respectively determined for multiple groups, then the first device 110 may determine the quantization mode of the corresponding group based on the threshold in the configuration of the quantization mode.
- the maximum value of multiple parameters in the i-th group is max i
- the minimum value is min i
- ) ⁇ thres it can be determined that the quantization mode of the i-th group is a symbol quantization mode.
- ) ⁇ thres it may be determined that the quantization mode of the i-th group is a symbol quantization mode.
- the first device 110 may determine the number of quantization bits.
- the first device 110 may determine the number of quantization bits based on the value of the parameter, the distribution of the value of the parameter, and the like.
- the first device 110 may determine the number of quantization bits based on one or more of the following: transmission resources from the first device 110 to the second device 120, the number of network layers of the distributed neural network, and the training loss function.
- the first device 110 may determine the number of quantization bits and use it for each group, that is, the number of quantization bits in a coarse grain. Alternatively, the first device 110 may separately determine the number of quantization bits for each group, that is, the number of quantization bits in a fine-grained manner.
- the compression configuration message includes first indication information, second indication information and third indication information, wherein the first indication information indicates the parameter division method, the second indication information indicates the quantization mode configuration, and the third indication information The information indicates quantization bit number configuration. Then the first device 110 can read the parameter division manner from the compressed configuration message.
- the first device 110 may determine the quantization scheme based on the quantization scheme configuration.
- the quantization mode configuration indicates that multiple groups use the same quantization mode, then the first device 110 may determine the quantization mode, and the quantization mode is used for each group.
- the configuration of the quantization mode indicates that the quantization modes are respectively determined for multiple groups, then the first device 110 may determine the quantization mode of the corresponding group based on the threshold in the configuration of the quantization mode.
- the first device 110 may determine the quantization bit number based on the quantization bit number configuration, where the quantization bit number configuration includes a quantization granularity, a quantization threshold list, and a quantization list. For example, the first device 110 may determine the value to be compared (denoted as d), and compare the value to be compared with two thresholds in the quantization threshold list to determine the quantization threshold interval in which the value to be compared is located, and then determine The number of quantization bits in the quantization list corresponding to the quantization threshold interval.
- the quantization threshold list includes (T 1 , T 2 , ..., T K )
- the quantization list may include (b 0 , b 1 , b 2 , ..., b K ). Denote the value to be compared as d, and denote the determined number of quantization bits as quan bit , then the following formula is satisfied:
- the value to be compared may be determined based on a loss function or based on the values of multiple parameters in the divided groups.
- the quantization bit configuration includes quantization granularity, and the quantization granularity indicates coarse granularity, that is to say, multiple groups use the same quantization bit number.
- the first device 110 may compare the loss function obtained in the current round of training with the quantization threshold in the quantization bit number configuration, and determine the quantization bit number based on the comparison result.
- the quantization threshold list includes (T 1 , T 2 , . . . , T K )
- the quantization list may include (b 0 , b 1 , b 2 , . . . , b K ). If T 1 ⁇ abs(yx) ⁇ T 2 is satisfied, the number of quantization bits can be determined to be b 1 .
- the first device 110 may also determine an index corresponding to the number of quantization bits, where the index may indicate a position of the determined number of quantization bits in the quantization list, for example, the index corresponding to b2 may be 2.
- the quantization bit configuration includes quantization granularity, and the quantization granularity indicates fine granularity, that is to say, the quantization bit numbers need to be determined separately for different groups.
- the first device 110 may compare the value ranges of the multiple parameters in the group with the quantization threshold in the configuration of the number of quantization bits, and determine the number of quantization bits based on the comparison result.
- the quantization threshold list includes (T 1 , T 2 , . . . , T K )
- the quantization list may include (b 0 , b 1 , b 2 , . . . , b K ). If T 1 ⁇ abs(max i ⁇ min i ) ⁇ T 2 is satisfied, then the number of quantization bits of the i-th group can be determined to be b 1 .
- the first device 110 may determine a plurality of quantization bit numbers respectively corresponding to a plurality of groups.
- the first device 110 may also determine an index corresponding to each quantization bit number, and the index may indicate the position of the quantization bit number used by the group in the quantization list, for example, the quantization bit number b2 of the i-th group corresponds to The index of can be 2.
- the values to be compared in the embodiments of the present disclosure may also be determined based on other factors, such as the number of multiple parameters in a group, such as the size of transmission resources, etc., which are not listed in the present disclosure. It can be understood that in some examples, T 1 in the quantization threshold can be set close to 0, such as 10 ⁇ 3 or other values, and b 0 in the quantization list can be set to 0 accordingly. If the number of quantized bits is determined to be b 0 for the jth group among multiple groups, it can be considered that there is no need to send data corresponding to the number of quantized bits to the second device 120. For example, the quantized parameters for the jth group may only include quantization Boundary values such as max j and min j .
- the first device 110 determines 660 a plurality of quantization groups based on parameters of the distributed training model.
- each quantization group may include quantization boundary values and multiple quantization parameters.
- the first device 110 may use a parameter division manner to divide the parameters of the distributed training model into multiple groups.
- the first device 110 may use a quantization manner to determine a quantization boundary value for each of the multiple groups.
- the first device 110 may determine a plurality of quantization parameters for each of the plurality of groups based on the number of quantization bits.
- the first device 110 may use the quantization method of the group to determine the quantization boundary value of the group, and determine the multiple quantization parameters.
- the process 660 of determining multiple quantization groups may also be referred to as a quantization process or a compression process. It should be noted that the process 660 and the above-mentioned process 650 can be interleaved.
- the parameter division method can be used to divide the parameters into multiple groups, and then the quantization method can be determined based on the values of multiple parameters in each group. and quantization bits.
- the quantization process may include: using a parameter division method to divide the parameters into multiple groups; for each group, determining a quantization boundary value based on the quantization method, and determining multiple quantization parameters corresponding to multiple parameters in the group based on the number of quantization bits.
- a parameter division method to divide the parameters into multiple groups; for each group, determining a quantization boundary value based on the quantization method, and determining multiple quantization parameters corresponding to multiple parameters in the group based on the number of quantization bits.
- the first device 110 may perform entropy encoding 670 .
- the compression configuration message may include entropy encoding indication information, and the entropy encoding indication information indicates entropy encoding. Then the first device 110 may also perform entropy coding on the quantization parameter.
- the first device 110 may determine to perform entropy coding by itself. entropy coding, then the first device 110 may also perform entropy coding on the quantization parameter. For example, if the first device 110 determines that the compression rate of the quantization parameter is lower than the compression rate threshold, it may decide to perform entropy encoding by itself.
- the first device 110 sends more than 680 quantization groups to the second device 120 .
- the multiple quantization groups may be entropy encoded multiple quantization groups.
- the second device 120 may receive multiple quantization groups from the first device 110 .
- FIG. 8 shows a schematic diagram of compressed data 800 according to some embodiments of the present disclosure.
- the compressed data 800 includes a quantization boundary value data field 810 and a quantization parameter data field 820 .
- Quantization boundary value data field 810 may include quantization boundary values in multiple quantization groups. As an example, (min1, max1), (min2, max2), . . . may be included. As another example, (min1, min2, . . . ) and (max1, max2, . . . ) may be included. As yet another example, (max1, max2, . . . ) may be included.
- the quantization parameter data field 820 may include quantization parameters in multiple quantization groups, or may include entropy-encoded quantization parameters.
- the first device 110 may send 6822 a quantization bit feedback message to the second device 120 .
- the compression configuration message may include first indication information, second indication information and third indication information, wherein the first indication information indicates the parameter division mode, the second indication information indicates the quantization mode, and the third indication information indicates the number of quantization bits .
- the first device 110 may determine another number of quantization bits based on parameters of the distributed training model.
- the quantization bit feedback message may indicate another number of quantization bits, for example, another minimum number of quantization bits determined by the first device 110 to quantize the parameters of the distributed training model.
- the quantization bit feedback message may be sent to the second device 120 together with multiple quantization groups via the same signaling.
- the quantization bit feedback message and the multiple quantization groups may be sent to the second device 120 via different signaling.
- the second device 120 may determine whether to accept another number of quantization bits included in the quantization bit feedback message based on the amount of transmission resources or the like.
- the second device 120 may send 6824 a quantization bit response message to the first device 110, where the quantization bit response message may indicate acceptance or rejection of another quantization bit number. If the second device 120 accepts it, the second device 120 may then send a compression configuration message for the next round of training to the first device 110, and the third indication information in the compression configuration message may indicate the other quantization bit number .
- the first device 110 may send 6842 indication information of the number of quantization bits to the second device 120 .
- the compression configuration message may include first indication information and second indication information, wherein the first indication information indicates a parameter division method, the second indication information indicates a quantization mode, and the compression configuration message does not include the third indication information.
- the first device 110 may send quantization bit numbers for obtaining multiple quantization groups to the second device 120, so that the second device 120 performs subsequent inverse quantization based on the quantization bit numbers.
- the number of quantization bits and multiple quantization groups may be sent to the second device 120 via the same signaling.
- the number of quantization bits and the multiple quantization groups may be sent to the second device 120 via different signaling.
- the first device 110 may send 6862 a quantization indication message to the second device 120, where the quantization indication message may indicate the quantization method and the number of quantization bits used by the first device 110.
- the compression configuration message includes the first indication information, the second indication information and the third indication information, wherein the first indication information indicates the parameter division mode, the second indication information indicates the quantization mode configuration, and the third indication information indicates the quantization bit number configuration.
- the first device 110 may determine the quantization mode based on the configuration of the quantization mode, and may determine the number of quantization bits based on the configuration of the number of quantization bits.
- the second indication information indicates a quantization scheme configuration, and the quantization scheme configuration indicates that multiple groups use the same quantization scheme. Then, the quantization indication message may use 1 bit to indicate the quantization mode used.
- the second indication information indicates a quantization scheme configuration, and the quantization scheme configuration indicates that different groups use different quantization schemes. Then, the quantization indication message may respectively indicate the quantization modes of multiple groups through multiple bits.
- the third indication information indicates the configuration of the number of quantization bits, and the configuration of the number of quantization bits includes a quantization granularity, a quantization threshold list, and a quantization list.
- the quantization indication message may indicate the number of quantization bits used by using an index. It can be understood that for a coarse-grained situation, the number of indexes is one. For fine-grained situations, the number of indexes is multiple, corresponding to the number of multiple groups. For the description about the index, refer to the above-mentioned corresponding embodiment in the process 650 .
- the first device 110 may also send 6882 the entropy encoding indication information to the second device 120 .
- the first device 110 may use the entropy coding indication information to inform the second device 120 whether entropy coding is performed, for example, 1 bit may be used for indication. Therefore, the second device 120 can successfully decompress the received data based on this.
- the second device 120 dequantizes 690 the plurality of quantization groups. Specifically, the second device 120 may perform inverse quantization on multiple quantization groups based on the parameter division manner, the quantization manner, and the number of quantization bits.
- the description about inverse quantization can refer to the above corresponding embodiment in conjunction with FIG. 2 , and for the sake of brevity, it will not be repeated here.
- the second device 120 obtains the parameters to be trained in the next round through inverse quantization and aggregation based on multiple quantization groups from multiple first devices 110 .
- the second device 120 may send the compressed configuration message again, as shown in processes 610 and 620 .
- the second device 120 may omit sending the compression configuration message for the next round.
- the compression configuration message of the previous round may be used. In this way, repeated transmission of the same content can be avoided, and signaling overhead can be reduced.
- the first device 110 can quantize the parameters of the distributed training model based on the compression configuration message, thereby realizing the compression of the parameters. This process not only considers factors such as transmission resources, but also considers factors such as the value of parameters, so as to ensure that the compression of parameters can not only ensure communication efficiency, but also ensure the performance of parameters. On the one hand, it avoids the performance loss caused by excessive compression, and on the other hand, it avoids the excessive delay caused by the low compression rate.
- the transmission resources for the first device 110 to send a plurality of quantized bits to the second device 120 may be allocated by the second device 120, or may be allocated by the second device 120 after the first device 110 requests a resource. distributed.
- the first device 110 is a terminal device
- the second device 120 is an access network device.
- the second device 120 may use the resource indication information to indicate the transmission resource for the first device 110 to transmit to the second device 120, and may optionally indicate a coding and modulation scheme and the like.
- the first device 110 may send a resource request to the second device 120 , and the first device 110 may receive a resource allocation message from the second device 120 .
- the first device 110 may then send the plurality of quantization groups to the second device 120 on the allocated resources.
- the resource request may be a buffer state report (Buffer State Report, BSR) of a medium access control (Medium Access Control, MAC) layer.
- BSR buffer State Report
- MAC medium access control
- FIG. 9 shows a signaling interaction diagram of an information transmission process 900 according to some embodiments of the present disclosure.
- the process 900 involves the first device 110 and the second device 120, and the first device 110 in FIG. 9 may include any one of the first device 110-1 to the first device 110-N in FIG. 1 .
- the second device 120 sends 910 the parameters to be trained to the first device 110 .
- the parameters to be trained may be the initial model parameters of the above-mentioned neural network, or may be updated model parameters obtained after a previous round of training.
- the parameters to be trained may include parameters associated with the network structure of the neural network, such as weight parameters and the like.
- the parameters to be trained may include parameters used for iteration of the neural network, such as gradient parameters and the like.
- the second device 120 sends the parameters to be trained to multiple first devices 110 , for example, N1 first devices 110 or N2 first devices 110 described in conjunction with FIG. 1 .
- the embodiment of the present disclosure is described with one first device 110 , and the operations performed by other first devices 110 are similar, and will not be repeated herein.
- the first device 110 may receive parameters to be trained from the second device 120 .
- the first device 110 may obtain 920 parameters of the distributed training model based on the parameters to be trained.
- the first device 110 may use the local data set to train the network model on the basis of the parameters to be trained, so as to obtain the parameters after the current round of training, that is, the parameters of the distributed training model.
- the first device 110 determines 930 parameters based on the division method, the quantization method and the number of quantization bits. Specifically, the first device 110 may determine the parameter division method, the quantization method, and the number of quantization bits based on one or more of the following factors: the transmission resources of the first device 110 to the second device 120, the distributed neural network The number of network layers, and the loss function for training.
- the first device 110 may similarly determine the parameter division mode, which is not repeated here for brevity.
- the parameter division manner is a type-based division manner, a matrix-based division manner, or a network layer-based division manner.
- the first device 110 may use the determined parameter division manner to divide the parameters of the distributed training model into multiple groups.
- the first device 110 may similarly determine the quantization mode, and details are not repeated here for brevity.
- the first device 110 may determine a quantization scheme, and the quantization scheme is used for multiple groups.
- the first device 110 may respectively determine quantization modes for multiple groups.
- the quantization method is a conventional quantization method or a symbolic quantization method.
- the first device 110 may use the determined quantization manner to determine a quantization boundary value for each of the multiple groups.
- the first device 110 may determine the number of quantization bits and apply the number of quantization bits to multiple groups, in other words, the number of quantization bits may be a coarse-grained number of quantization bits. In some other examples, the first device 110 may separately determine the number of quantization bits for each group, in other words, the number of quantization bits may be a fine-grained number of quantization bits.
- the first device 110 may use the determined number of quantization bits to determine multiple quantization parameters for each of the multiple groups.
- the first device 110 determines 940 a plurality of quantization groups based on parameters of the distributed training model.
- each quantization group may include quantization boundary values and multiple quantization parameters.
- the first device 110 may perform entropy encoding 950 .
- the first device 110 may determine whether to perform entropy encoding based on compression ratios of multiple quantization groups and the like.
- the first device 110 sends 962 a quantization indication message to the second device 120 .
- the quantization indication message may include first indication information and second indication information.
- the quantization indication message may include third indication information.
- the first indication information may indicate a parameter division manner determined by the first device 110, for example, a type-based division manner, a matrix-based division manner, or a network layer-based division manner, and the like.
- the second indication information may indicate the quantization manner determined by the first device 110 .
- multiple groups are quantized in the same manner.
- the second indication information may be indicated by 1 bit, for example, "0" indicates a traditional quantization mode, and "1" indicates a symbol quantization mode.
- the second indication information may be indicated by 2 bits, wherein one bit indicates that each group uses the same quantization mode, and the other bit indicates the quantization mode.
- multiple groups are quantized differently. Assume that the parameters are divided into m3 groups using the determined parameter division method.
- the second indication information can be indicated by m3+1 bits, for example, one bit (such as the first bit) is used to indicate whether each group uses the same quantization method, and the other m3 bits represent the quantization of m3 groups respectively Way.
- the second indication information may be indicated by m3 bits, respectively representing the quantization manners of the m3 groups.
- the quantization indication message may not include the second indication information.
- the second device 120 can determine the quantization manner. For example, the second device 120 can calculate the total length of quantization parameters of all quantization groups, and then calculate the number of quantization boundary values. If the number is equal to the parameter set, it means that the quantization boundary value includes the maximum value of the absolute value, and the quantization mode can be determined as the symbol quantization mode. If the number is equal to twice the parameter group, it means that the quantization boundary value includes the maximum value and the minimum value, and the quantization method can be determined as the traditional quantization method.
- the third indication information may indicate the number of quantization bits determined by the first device 110 .
- multiple groups have the same number of quantization bits.
- the third indication information may be indicated by m2 bits, for example, the number of coarse-grained quantization bits is less than 2 m2 .
- the quantization bit numbers of the multiple groups are different, the third indication information may indicate the quantization granularity and the quantization list, and the third indication information further indicates the index of the quantization bit numbers of the multiple groups. So that the second device 120 can determine the number of quantization bits of each group based on the index and the quantization list.
- the quantization granularity indicates whether multiple groups use the same number of quantization bits . b K ).
- indexes corresponding to different groups may be the same. For example, there are two groups with the same number of quantization bits in multiple groups.
- the quantization indication message may not include the third indication information, and in this case, the second device 120 may determine the number of quantization bits based on multiple quantization groups.
- each quantization group includes a quantization boundary value and a plurality of quantization parameters. For example, when the overhead (number of bits) occupied by quantization boundary values is less than the number of multiple quantization parameters, assuming that the total number of bits occupied by multiple quantization groups is p1, and the number of multiple quantization parameters is n3, then able to pass To determine the number of quantized bits, where Indicates rounding down. In this manner, the second device 120 can determine the number of quantization bits, so that there is no need to indicate through the third indication information, which can save signaling overhead.
- the quantization indication message may not include the third indication information.
- the second device 120 can calculate the total length of quantization parameters of all quantization groups, and then calculate the number of quantization boundary values. If the number is equal to the parameter set, it means that the quantization boundary value includes the maximum value of the absolute value, and the quantization mode can be determined as the symbol quantization mode. If the number is equal to twice the parameter group, it means that the quantization boundary value includes the maximum value and the minimum value, and the quantization method can be determined as the traditional quantization method. The second device 120 can further determine the quantized parameter length, and divide it by the number of parameters to obtain the common number of quantized bits of each parameter group.
- the first device 110 may determine not to send the quantization indication message for the current round.
- the quantization indication message of the previous round may be used to dequantize the multiple quantization groups of the current round . In this way, repeated transmission of the same content can be avoided, and signaling overhead can be reduced.
- the first device 110 may also send 964 the entropy coding indication information to the second device 120 .
- the first device 110 may use the entropy coding indication information to inform the second device 120 whether entropy coding is performed, for example, 1 bit may be used to indicate. Therefore, the second device 120 can successfully decompress the received data based on this.
- the quantization indication message and the multiple quantization groups may be carried in the same signaling, or the quantization indication message and the multiple quantization groups may be carried in different signalings.
- the entropy coding indication information and the multiple quantization groups may be carried in the same signaling, or the entropy coding indication information and the multiple quantization groups may be carried in different signalings. This disclosure is not limited thereto.
- FIG. 10 shows a schematic diagram of a quantization indication message 1000 according to some embodiments of the present disclosure.
- the quantization indication message 1000 includes first indication information 1010 , second indication information 1020 and third indication information 1030 , and may also include entropy coding indication information 1040 optionally.
- the third indication information 1030 may indicate the number of quantization bits 1032 , or the third indication information 1030 may indicate a quantization granularity 1034 and a quantization list 1036 .
- the quantization indication message may not include the third indication information.
- the quantization indication message may not include the first indication information.
- the first device 110 and the second device 120 are preset with a method for determining the division method, then the first device 110 and the second device 120 can determine the division method based on the preset determination method, so that the first device 110 and the second device 120 do not need to The indication information is included in the quantization indication message, which can reduce signaling overhead.
- the preset determination method includes, for example: assuming that the parameters can be divided into r1 groups according to the type-based division method, and assuming that the parameters can be divided into r2 groups according to the matrix-based division method, then based on r2-r1 and the predetermined The relationship between the values (such as s and d) to determine the division method, as described in conjunction with the process 610 above.
- the first device 110 sends more than 970 quantization groups to the second device 120 .
- the multiple quantization groups may be entropy encoded multiple quantization groups.
- the second device 120 may receive multiple quantization groups from the first device 110 .
- the third indication information in the quantization indication message indicates the quantization granularity and the quantization list
- the corresponding index may also be sent to the second device 120 .
- the index may indicate the position of the corresponding quantization bit number in the quantization bit number list as shown in FIG. 7 .
- the first device 110 may determine the compressed data based on the plurality of quantization groups, and send the compressed data to the second device 120 .
- FIG. 11 shows a schematic diagram of compressed data 1100 according to some embodiments of the present disclosure.
- the compressed data 1100 includes a quantization boundary value data field 1110 and a quantization parameter data field 1120 , and may also include a list of indexes 1130 optionally.
- Quantization boundary value data field 1110 may include quantization boundary values in multiple quantization groups. As an example, (min1, max1), (min2, max2), . . . may be included. As another example, (min1, min2, . . . ) and (max1, max2, . . . ) may be included.
- the quantization parameter data field 1120 may include quantization parameters in a plurality of quantization groups, or may include entropy-encoded quantization parameters.
- the index list 1130 may include indexes corresponding to quantization bit numbers of each group.
- the index list 1130 may be independent from the quantization boundary value data field 1110 and the quantization parameter data field 1120 .
- a list of indexes may be included in the quantization boundary value data field 1110, for example may include (min1, max1, index1), (min2, max2, index2), ..., or for example include (min1, min2, %), (max1, max2, ...) and (index1, index2, 7), where index1 represents the index of the number of quantization bits used by the group whose quantization boundary value is min1 and max1, and index2 represents the group whose quantization boundary value is min2 and max2 Index of the number of quantization bits used by the group.
- the second device 120 dequantizes 980 the plurality of quantization groups. Specifically, the second device 120 may perform inverse quantization on multiple quantization groups based on the quantization indication information from the first device 110 .
- the description about inverse quantization can refer to the above corresponding embodiment in conjunction with FIG. 2 , and for the sake of brevity, it will not be repeated here.
- the transmission resources for the first device 110 to send a plurality of quantized bits to the second device 120 may be allocated by the second device 120, or may be allocated by the second device 120 after the first device 110 requests a resource. distributed. For example as described in the above embodiments of the present disclosure.
- the first device 110 can quantize the parameters of the distributed training model, thereby realizing parameter compression. This process not only considers factors such as transmission resources, but also considers factors such as the value of parameters, so as to ensure that the compression of parameters can not only ensure communication efficiency, but also ensure the performance of parameters. On the one hand, it avoids the performance loss caused by excessive compression, and on the other hand, it avoids the excessive delay caused by the low compression ratio.
- Fig. 12 shows a schematic block diagram of a communication device 1200 according to some embodiments of the present disclosure.
- the apparatus 1200 may be implemented as the first device 110 shown in FIG. 1 or as a part of the first device 110 (such as a chip), etc., which is not limited in the present disclosure.
- the apparatus 1200 may include a dividing module 1210 , a determining module 1220 and a sending module 1230 , and optionally, a receiving module 1240 and/or an entropy encoding module 1250 .
- the dividing module 1210 is configured to divide the parameters of the distributed training model into multiple groups, and at least two of the multiple groups have different value intervals.
- the determination module 1220 is configured to determine a plurality of quantization groups corresponding to the plurality of groups based on the number of quantization bits corresponding to each group in the plurality of groups, the number of quantization bits is determined based on at least one of the following: the communication device and The amount of transmission resources between the second devices, the value of the loss function, or the value range of the parameters of each group.
- the sending module 1230 is configured to send multiple quantization groups to the second device.
- the determining module 1220 may be configured to determine a quantization boundary value corresponding to the first group based on the plurality of first parameters in the first group of the plurality of groups.
- the quantization boundary value includes a maximum value and/or a minimum value, the maximum value is the maximum value of the multiple first parameters or the maximum value is the maximum value of multiple absolute values of the multiple first parameters, and the minimum value is the multiple first parameters
- the minimum value or the minimum value in is the minimum value of the absolute values of the plurality of first parameters.
- the quantization group corresponding to the first group includes quantization boundary values and a plurality of quantization parameters corresponding to the plurality of first parameters.
- the receiving module 1240 may be configured to receive first indication information from the second device, where the first indication information indicates whether to determine the quantization boundary value according to an absolute value.
- the determination module 1220 may be configured to determine a quantization boundary value based on the first indication information.
- the sending module 1230 may be configured to send second indication information to the second device, where the second indication information indicates whether the quantization boundary value is determined according to an absolute value.
- the number of quantization bits for each group is equal. In some embodiments, the plurality of groups includes a first group and a second group, and the number of quantization bits corresponding to the first group is not equal to the number of quantization bits corresponding to the second group.
- the determining module 1220 may be configured to determine a plurality of quantization bit numbers corresponding to a plurality of groups. In some examples, the determining module 1220 may be configured to determine a plurality of quantization bit numbers corresponding to multiple groups based on a quantization bit number configuration from the second device, the quantization bit number configuration including a quantization threshold and a corresponding quantization bit number. Correspondingly, the receiving module 1240 may be configured to receive the quantization bit number configuration from the second device.
- the sending module 1230 is configured to send third indication information to the second device, where the third indication information indicates a plurality of quantization bit numbers.
- the number of quantization bits is determined by the second device, and the receiving module 1240 is configured to receive fourth indication information from the second device, where the fourth indication information indicates the quantization bits corresponding to each of the multiple groups number.
- the dividing module 1210 may be configured to divide the parameters of the distributed training model into multiple groups using a model parameter dividing manner.
- the determination module 1220 is configured to determine the model parameter division method based on the values of the parameters of the distributed training model.
- the sending module 1230 may be configured to send fifth indication information to the second device, where the fifth indication information indicates a model parameter division manner.
- the receiving module 1240 is configured to receive sixth indication information from the second device, where the sixth indication information indicates a model parameter division manner.
- the division mode of the model parameters indicates that the division is based on at least one of the following items: the type of the parameter, the type of the network layer where the parameter is located, and the like.
- the entropy encoding module 1250 is configured to entropy encode the multiple quantization groups; and the sending module is configured to send the entropy encoded multiple quantization groups to the second device.
- the sending module 1230 is configured to send first entropy encoding indication information to the second device, where the first entropy encoding indication information has entropy encoded multiple quantization groups.
- the receiving module is configured to receive second entropy encoding indication information from the second device; and the entropy encoding module is configured to perform entropy encoding on the multiple quantization groups based on the second entropy encoding indication information.
- each function in the disclosed embodiments Units can be integrated into one unit, or physically exist separately, or two or more units can be integrated into one unit.
- the above-mentioned integrated units can be implemented in the form of hardware or in the form of software functional units.
- the apparatus 1200 in FIG. 12 can be used to implement each process described by the first device 110 in the foregoing embodiments, and for the sake of brevity, details are not repeated here.
- Fig. 13 shows a schematic block diagram of a communication device 1300 according to some embodiments of the present disclosure.
- the apparatus 1300 may be implemented as the second device 120 shown in FIG. 1 or as a part of the second device 120 (such as a chip), etc., which is not limited in the present disclosure.
- the apparatus 1300 may include a receiving module 1310 , a first determining module 1320 and a second determining module 1330 , and optionally further includes a sending module 1340 .
- the receiving module 1310 is configured to receive multiple quantization groups from the first device.
- the first determination module 1320 is configured to determine a plurality of groups corresponding to the plurality of quantization groups based on the number of quantization bits corresponding to each of the plurality of quantization groups, the number of quantization bits is determined based on at least one of the following : the amount of transmission resources between the first device and the communication device, the value of the loss function, or the value range of the parameters of each group.
- the second determination module 1330 is configured to determine parameters of the distributed training model based on the model parameter division method and multiple groups.
- the first quantization group of the plurality of quantization groups includes quantization boundary values and a plurality of quantization parameters.
- the quantization boundary value includes a maximum value and/or a minimum value, and the maximum value is the maximum value among the multiple first parameters in the first group corresponding to the first quantization group or the maximum value is the maximum value of the multiple absolute values of the multiple first parameters.
- the maximum value and the minimum value are the minimum value among the multiple first parameters or the minimum value is the minimum value of the multiple absolute values of the multiple first parameters.
- the sending module 1340 is configured to send first indication information to the first device, where the first indication information indicates whether to determine the quantization boundary value according to an absolute value.
- the receiving module 1310 is configured to receive second indication information from the first device, where the second indication information indicates whether the quantization boundary value is determined according to an absolute value.
- the number of quantization bits for each quantization group is equal. In some embodiments, the multiple quantization groups include a first quantization group and a second quantization group, and the number of quantization bits corresponding to the first quantization group is not equal to the number of quantization bits corresponding to the second quantization group.
- the receiving module 1310 is configured to receive third indication information from the first device, where the third indication information indicates a plurality of quantization bit numbers.
- the sending module 1340 is configured to send the quantization bit number configuration to the first device, where the quantization bit number configuration includes a quantization threshold and a corresponding quantization bit number.
- the sending module 1340 is configured to send fourth indication information to the first device, where the fourth indication information indicates the number of quantization bits corresponding to each of the multiple groups.
- the receiving module 1310 is configured to receive fifth indication information from the first device, where the fifth indication information indicates a model parameter division manner.
- the sending module 1340 is configured to send sixth indication information to the first device, where the sixth indication information indicates a model parameter division manner.
- the model parameter division method indicates that the division is based on at least one of the following: the type of the parameter, the type of the network layer where the parameter is located, and the like.
- the receiving module 1310 is configured to receive the entropy encoded plurality of quantization groups.
- the receiving module 1310 is configured to receive first entropy encoding indication information from the first device, where the first entropy encoding indication information has entropy encoded a plurality of quantization groups.
- the sending module 1340 is configured to send second entropy encoding indication information to the first device, so as to instruct the first device to perform entropy encoding on multiple quantization groups.
- each function in the disclosed embodiments Units can be integrated into one unit, or physically exist separately, or two or more units can be integrated into one unit.
- the above-mentioned integrated units can be implemented in the form of hardware or in the form of software functional units.
- the apparatus 1300 in FIG. 13 can be used to implement the various processes described by the second device 120 in the foregoing embodiments, and details are not repeated here for brevity.
- FIG. 14 shows a schematic block diagram of an example device 1400 that may be used to implement embodiments of the present disclosure.
- the device 1400 may be implemented as or included in the first device 110 of FIG. 1 , or the device 1400 may be implemented as or included in the second device 120 of FIG. 1 .
- device 1400 includes one or more processors 1410 , one or more memories 1420 coupled to processors 1410 , and communication module 1440 coupled to processors 1410 .
- the communication module 1440 can be used for two-way communication.
- the communication module 1440 may have at least one communication interface for communication.
- Communication interfaces may include any interface necessary to communicate with other devices.
- the processor 1410 may be of any type suitable for the local technology network, and may include, but is not limited to, at least one of the following: a general purpose computer, a special purpose computer, a microcontroller, a digital signal processor (Digital Signal Processor, DSP), or a control-based One or more of the multi-core controller architectures of the processor.
- Device 1400 may have multiple processors, such as application specific integrated circuit chips, that are time slaved to a clock that is synchronized to a main processor.
- Memory 1420 may include one or more non-volatile memories and one or more volatile memories.
- non-volatile memory include but are not limited to at least one of the following: read-only memory (Read-Only Memory, ROM) 1424, erasable programmable read-only memory (Erasable Programmable Read Only Memory, EPROM), flash memory, hard disk , Compact Disc (CD), Digital Video Disk (Digital Versatile Disc, DVD) or other magnetic and/or optical storage.
- Examples of volatile memory include, but are not limited to, at least one of: Random Access Memory (RAM) 1422, or other volatile memory that does not persist for the duration of a power outage.
- RAM Random Access Memory
- the memory 1420 may also be integrated with the processor 1410 .
- the computer program 1430 includes computer-executable instructions executed by the associated processor 1410 .
- Program 1430 may be stored in ROM 1424.
- Processor 1410 may perform any suitable actions and processes by loading program 1430 into RAM 1422.
- Embodiments of the present disclosure may be implemented by means of a program 1430 such that the device 1400 may perform any process as discussed with reference to FIGS. 2 to 11 .
- Embodiments of the present disclosure can also be realized by hardware or by a combination of software and hardware.
- Program 1430 may be tangibly embodied on a computer-readable medium, which may be included in device 1400 , such as in memory 1420 , or other storage device accessible by device 1400 .
- Program 1430 may be loaded from a computer readable medium into RAM 1422 for execution.
- the computer readable medium may include any type of tangible nonvolatile memory such as ROM, EPROM, flash memory, hard disk, CD, DVD, and the like.
- the communication module 1440 in the device 1400 can be implemented as a transmitter and receiver (or transceiver), which can be configured to send/receive information such as compression configuration messages, parameters to be trained, multiple quantization groups, etc. .
- the device 1400 may further include one or more of a scheduler, a controller, and a radio frequency/antenna, which will not be described in detail in this disclosure.
- the device 1400 in FIG. 14 may be implemented as a communication device, or may be implemented as a chip or chip system in the communication device, which is not limited by the embodiments of the present disclosure.
- Embodiments of the present disclosure also provide a chip, which may include an input interface, an output interface, and a processing circuit.
- a chip which may include an input interface, an output interface, and a processing circuit.
- the interaction of signaling or data may be completed by the input interface and the output interface, and the generation and processing of signaling or data information may be completed by the processing circuit.
- Embodiments of the present disclosure also provide a chip system, including a processor, configured to support a device to implement the functions involved in any of the foregoing embodiments.
- the system-on-a-chip may further include a memory for storing necessary program instructions and data, and when the processor runs the program instructions, the device installed with the system-on-a-chip can implement the program described in any of the above-mentioned embodiments.
- the chip system may consist of one or more chips, and may also include chips and other discrete devices.
- the memory and processor can be integrated.
- Embodiments of the present disclosure further provide a processor, configured to be coupled with a memory, where instructions are stored in the memory, and when the processor executes the instructions, the processor executes the methods and functions involved in any of the foregoing embodiments.
- Embodiments of the present disclosure also provide a computer program product containing instructions, which, when run on a computer, cause the computer to execute the methods and functions involved in any of the above embodiments.
- Embodiments of the present disclosure also provide a computer-readable storage medium on which computer instructions are stored, and when a processor executes the instructions, the processor is made to execute the methods and functions involved in any of the above embodiments.
- the various embodiments of the present disclosure may be implemented in hardware or special purpose circuits, software, logic or any combination thereof. Some aspects may be implemented in hardware, while other aspects may be implemented in firmware or software, which may be executed by a controller, microprocessor or other device. While various aspects of the embodiments of the present disclosure are shown and described as block diagrams, flowcharts, or using some other pictorial representation, it should be understood that the blocks, devices, systems, techniques or methods described herein can be implemented as, without limitation, hardware, software, firmware, special purpose circuits or logic, general purpose hardware or controllers or other devices, or some combination thereof.
- the present disclosure also provides at least one computer program product tangibly stored on a non-transitory computer-readable storage medium.
- the computer program product comprises computer-executable instructions, eg included in program modules, which are executed in a device on a real or virtual processor of a target to perform the process/method as above with reference to the accompanying drawings.
- program modules include routines, programs, libraries, objects, classes, components, data structures, etc. that perform particular tasks or implement particular abstract data types.
- the functionality of the program modules may be combined or divided as desired among the program modules.
- Machine-executable instructions for program modules may be executed within local or distributed devices. In a distributed device, program modules may be located in both local and remote storage media.
- Computer program codes for implementing the methods of the present disclosure may be written in one or more programming languages. These computer program codes can be provided to processors of general-purpose computers, special-purpose computers, or other programmable data processing devices, so that when the program codes are executed by the computer or other programmable data processing devices, The functions/operations specified in are implemented.
- the program code may execute entirely on the computer, partly on the computer, as a stand-alone software package, partly on the computer and partly on a remote computer or entirely on the remote computer or server.
- computer program code or related data may be carried by any suitable carrier to enable a device, apparatus or processor to perform the various processes and operations described above.
- carriers include signals, computer readable media, and the like.
- signals may include electrical, optical, radio, sound, or other forms of propagated signals, such as carrier waves, infrared signals, and the like.
- a computer readable medium may be any tangible medium that contains or stores a program for or related to an instruction execution system, apparatus, or device.
- the computer readable medium may be a computer readable signal medium or a computer readable storage medium.
- a computer readable medium may include, but is not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any suitable combination thereof. More detailed examples of computer-readable storage media include electrical connections with one or more wires, portable computer diskettes, hard disks, random storage access memory (RAM), read-only memory (ROM), erasable programmable read-only memory (EPROM or flash), optical storage, magnetic storage, or any suitable combination thereof.
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Software Systems (AREA)
- Health & Medical Sciences (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- General Physics & Mathematics (AREA)
- Artificial Intelligence (AREA)
- Mathematical Physics (AREA)
- Data Mining & Analysis (AREA)
- Evolutionary Computation (AREA)
- Computing Systems (AREA)
- Biophysics (AREA)
- Molecular Biology (AREA)
- Computational Linguistics (AREA)
- Biomedical Technology (AREA)
- Life Sciences & Earth Sciences (AREA)
- Bioethics (AREA)
- Medical Informatics (AREA)
- Databases & Information Systems (AREA)
- Computer Hardware Design (AREA)
- Computer Security & Cryptography (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Abstract
Description
Claims (40)
- 一种数据处理方法,包括:第一设备将分布式训练模型的参数划分为多个组,所述多个组中的至少两个组的数值区间不同;所述第一设备基于与所述多个组中每个组对应的量化比特数,确定与所述多个组对应的多个量化组,所述量化比特数是基于以下至少一项而被确定的:所述第一设备与第二设备之间的传输资源量、损失函数值、或所述每个组的参数的取值范围;以及所述第一设备向所述第二设备发送所述多个量化组。
- 根据权利要求1所述的方法,还包括:所述第一设备基于所述多个组中的第一组中的多个第一参数,确定与所述第一组对应的量化边界值。
- 根据权利要求2所述的方法,其中所述量化边界值包括以下至少一项:最大值、或最小值,所述最大值为所述多个第一参数中的最大值或者所述最大值为所述多个第一参数的多个绝对值的最大值,所述最小值为所述多个第一参数中的最小值或者所述最小值为所述多个第一参数的多个绝对值的最小值。
- 根据权利要求2或3所述的方法,其中与所述第一组对应的量化组包括所述量化边界值以及与所述多个第一参数对应的多个量化参数。
- 根据权利要求2至4中任一项所述的方法,还包括:所述第一设备接收来自所述第二设备的第一指示信息,所述第一指示信息指示是否按照绝对值来确定所述量化边界值;并且其中所述第一设备确定与所述第一组对应的量化边界值包括:所述第一设备基于所述第一指示信息,确定所述量化边界值。
- 根据权利要求2至4中任一项所述的方法,还包括:所述第一设备向所述第二设备发送第二指示信息,所述第二指示信息指示所述量化边界值是否按照绝对值而被确定。
- 根据权利要求1至6中任一项所述的方法,其中所述每个组的量化比特数都是相等的。
- 根据权利要求1至6中任一项所述的方法,其中所述多个组包括第一组和第二组,并且与所述第一组对应的量化比特数不等于与所述第二组对应的量化比特数。
- 根据权利要求8所述的方法,所述方法还包括:所述第一设备确定与所述多个组对应的多个量化比特数。
- 根据权利要求9所述的方法,其中所述第一设备确定与所述多个组对应的多个量化比特数包括:所述第一设备基于来自所述第二设备的量化比特数配置,确定与所述多个组对应的所述多个量化比特数,所述量化比特数配置包括量化阈值和对应的量化比特数。
- 根据权利要求10所述的方法,还包括:所述第一设备接收来自所述第二设备的所述量化比特数配置。
- 根据权利要求9至11中任一项所述的方法,还包括:所述第一设备向所述第二设备发送第三指示信息,所述第三指示信息指示所述多个量化比特数。
- 根据权利要求1至8中任一项所述的方法,其中所述量化比特数是由所述第二设备确定的,所述方法还包括:所述第一设备接收来自所述第二设备的第四指示信息,所述第四指示信息指示与所述多个组中每个组对应的量化比特数。
- 根据权利要求1至11中任一项所述的方法,其中所述第一设备将分布式训练模型的参数划分为多个组包括:所述第一设备使用模型参数划分方式,将所述分布式训练模型的参数划分为所述多个组。
- 根据权利要求14所述的方法,还包括:所述第一设备基于所述分布式训练模型的参数的取值大小,确定所述模型参数划分方式;以及所述第一设备向所述第二设备发送第五指示信息,所述第五指示信息指示所述模型参数划分方式。
- 根据权利要求14所述的方法,还包括:所述第一设备接收来自所述第二设备的第六指示信息,所述第六指示信息指示所述模型参数划分方式。
- 根据权利要求14至16中任一项所述的方法,其中所述模型参数划分方式指示基于如下至少一项进行划分:参数的类型、参数所在的网络层的类型等。
- 根据权利要求1至17中任一项所述的方法,还包括:所述第一设备对所述多个量化组进行熵编码;并且其中所述第一设备向所述第二设备发送所述多个量化组包括:所述第一设备向所述第二设备发送经熵编码的所述多个量化组。
- 根据权利要求18所述的方法,还包括:所述第一设备向所述第二设备发送第一熵编码指示信息,所述第一熵编码指示信息所述多个量化组已被熵编码。
- 根据权利要求18所述的方法,还包括:所述第一设备接收来自所述第二设备的第二熵编码指示信息;并且其中所述第一设备对所述多个量化组进行熵编码包括:所述第一设备基于所述第二熵编码指示信息,对所述多个量化组进行熵编码。
- 一种数据处理方法,包括:第二设备接收来自第一设备的多个量化组;所述第二设备基于与所述多个量化组中每个量化组对应的量化比特数,确定与所述多个量化组对应的多个组,所述量化比特数是基于以下至少一项而被确定的:所述第一设备与所述第二设备之间的传输资源量、损失函数值、或所述每个组的参数的取值范围;以及所述第二设备基于模型参数划分方式和所述多个组,确定分布式训练模型的参数。
- 根据权利要求21所述的方法,其中所述多个量化组中第一量化组包括量化边界值以及多个量化参数。
- 根据权利要求22所述的方法,其中量化边界值包括以下至少一项:最大值、或最小值,所述最大值为与所述第一量化组对应的第一组中多个第一参数中的最大值或者所述最大值为所述多个第一参数的多个绝对值的最大值,所述最小值为所述多个第一参数中的最小值 或者所述最小值为所述多个第一参数的多个绝对值的最小值。
- 根据权利要求22或23所述的方法,还包括:所述第二设备向所述第一设备发送第一指示信息,所述第一指示信息指示是否按照绝对值来确定所述量化边界值;或者,所述第二设备接收来自所述第一设备的第二指示信息,所述第二指示信息指示所述量化边界值是否按照绝对值而被确定。
- 根据权利要求21至24中任一项所述的方法,其中所述每个量化组的量化比特数都是相等的。
- 根据权利要求21至24中任一项所述的方法,其中所述多个量化组包括第一量化组和第二量化组,并且与所述第一量化组对应的量化比特数不等于与所述第二量化组对应的量化比特数。
- 根据权利要求26所述的方法,还包括:所述第二设备接收来自所述第一设备的第三指示信息,所述第三指示信息指示所述多个量化比特数。
- 根据权利要求21至26中任一项所述的方法,还包括:所述第二设备向所述第一设备发送量化比特数配置,所述量化比特数配置包括量化阈值和对应的量化比特数。
- 根据权利要求21至26中任一项所述的方法,还包括:所述第二设备向所述第一设备发送第四指示信息,所述第四指示信息指示与所述多个组中每个组对应的量化比特数。
- 根据权利要求21至29中任一项所述的方法,还包括:所述第二设备接收来自所述第一设备的第五指示信息,所述第五指示信息指示所述模型参数划分方式;或者,所述第二设备向所述第一设备发送第六指示信息,所述第六指示信息指示所述模型参数划分方式。
- 根据权利要求21至30中任一项所述的方法,其中所述模型参数划分方式指示基于如下至少一项进行划分:参数的类型、参数所在的网络层的类型等。
- 根据权利要求21至31中任一项所述的方法,其中所述第二设备接收来自第一设备的多个量化组包括:所述第二设备接收经熵编码的所述多个量化组。
- 根据权利要求32所述的方法,还包括:所述第二设备接收来自所述第一设备的第一熵编码指示信息,所述第一熵编码指示信息所述多个量化组已被熵编码;或者,所述第二设备向所述第一设备发送第二熵编码指示信息,以指示所述第一设备对所述多个量化组进行熵编码。
- 一种通信装置,包括:划分模块,被配置为将分布式训练模型的参数划分为多个组,所述多个组中的至少两个组的数值区间不同;确定模块,被配置为基于与所述多个组中每个组对应的量化比特数,确定与所述多个组对应的多个量化组,所述量化比特数是基于以下至少一项而被确定的:所述通信装置与第二设备之间的传输资源量、损失函数值、或所述每个组的参数的取值范围;以及发送模块,被配置为向所述第二设备发送所述多个量化组。
- 一种通信装置,包括:接收模块,被配置为接收来自第一设备的多个量化组;第一确定模块,被配置为基于与所述多个量化组中每个量化组对应的量化比特数,确定与所述多个量化组对应的多个组,所述量化比特数是基于以下至少一项而被确定的:所述第一设备与所述通信装置之间的传输资源量、损失函数值、或所述每个组的参数的取值范围;以及第二确定模块,被配置为基于模型参数划分方式和所述多个组,确定分布式训练模型的参数。
- 一种通信设备,包括存储器和处理器,所述存储器存储有计算机指令,当所述计算机指令被所述处理器执行时,使得所述通信设备实现根据权利要求1至33中任一项所述的方法。
- 一种计算机可读存储介质,所述计算机可读存储介质存储有计算机可执行指令,所述计算机可执行指令被处理器执行时实现根据权利要求1至33中任一项所述的方法。
- 一种芯片,包括处理电路,被配置为执行根据权利要求1至33中任一项所述的方法。
- 一种计算机程序产品,所述计算机程序产品上包含计算机可执行指令,所述计算机可执行指令在被执行时实现根据权利要求1至33中任一项所述的方法。
- 一种通信系统,包括第一设备和第二设备,其中所述第一设备用于实现根据权利要求1至20中任一项所述的方法,所述第二设备用于实现根据权利要求21至33中任一项所述的方法。
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| EP22923614.6A EP4465213B1 (en) | 2022-01-29 | 2022-12-26 | Information processing method and apparatus |
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202210112944.2 | 2022-01-29 | ||
| CN202210112944.2A CN116579438A (zh) | 2022-01-29 | 2022-01-29 | 信息处理方法及装置 |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2023142828A1 true WO2023142828A1 (zh) | 2023-08-03 |
Family
ID=87470407
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/CN2022/141926 Ceased WO2023142828A1 (zh) | 2022-01-29 | 2022-12-26 | 信息处理方法及装置 |
Country Status (3)
| Country | Link |
|---|---|
| EP (1) | EP4465213B1 (zh) |
| CN (1) | CN116579438A (zh) |
| WO (1) | WO2023142828A1 (zh) |
Families Citing this family (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN117764132B (zh) * | 2023-12-22 | 2025-12-16 | 浪潮(北京)电子信息产业有限公司 | 一种数据传输方法、装置、系统及电子设备和存储介质 |
Citations (7)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN110135465A (zh) * | 2019-04-22 | 2019-08-16 | 腾讯科技(深圳)有限公司 | 模型参数表示空间大小估计方法及装置、推荐方法 |
| US20200364603A1 (en) * | 2019-05-15 | 2020-11-19 | Google Llc | Compression of Machine-Learned Models via Entropy Penalized Weight Reparameterization |
| CN113139663A (zh) * | 2021-04-23 | 2021-07-20 | 深圳市大数据研究院 | 联邦边缘学习配置信息获取方法、装置、设备及介质 |
| CN113222179A (zh) * | 2021-03-18 | 2021-08-06 | 北京邮电大学 | 一种基于模型稀疏化与权重量化的联邦学习模型压缩方法 |
| CN113258935A (zh) * | 2021-05-25 | 2021-08-13 | 山东大学 | 一种联邦学习中基于模型权值分布的通信压缩方法 |
| WO2021189225A1 (zh) * | 2020-03-24 | 2021-09-30 | Oppo广东移动通信有限公司 | 一种机器学习模型训练方法、电子设备及存储介质 |
| CN114611705A (zh) * | 2020-11-23 | 2022-06-10 | 华为技术有限公司 | 数据处理方法、机器学习的训练方法及相关装置、设备 |
-
2022
- 2022-01-29 CN CN202210112944.2A patent/CN116579438A/zh active Pending
- 2022-12-26 WO PCT/CN2022/141926 patent/WO2023142828A1/zh not_active Ceased
- 2022-12-26 EP EP22923614.6A patent/EP4465213B1/en active Active
Patent Citations (7)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN110135465A (zh) * | 2019-04-22 | 2019-08-16 | 腾讯科技(深圳)有限公司 | 模型参数表示空间大小估计方法及装置、推荐方法 |
| US20200364603A1 (en) * | 2019-05-15 | 2020-11-19 | Google Llc | Compression of Machine-Learned Models via Entropy Penalized Weight Reparameterization |
| WO2021189225A1 (zh) * | 2020-03-24 | 2021-09-30 | Oppo广东移动通信有限公司 | 一种机器学习模型训练方法、电子设备及存储介质 |
| CN114611705A (zh) * | 2020-11-23 | 2022-06-10 | 华为技术有限公司 | 数据处理方法、机器学习的训练方法及相关装置、设备 |
| CN113222179A (zh) * | 2021-03-18 | 2021-08-06 | 北京邮电大学 | 一种基于模型稀疏化与权重量化的联邦学习模型压缩方法 |
| CN113139663A (zh) * | 2021-04-23 | 2021-07-20 | 深圳市大数据研究院 | 联邦边缘学习配置信息获取方法、装置、设备及介质 |
| CN113258935A (zh) * | 2021-05-25 | 2021-08-13 | 山东大学 | 一种联邦学习中基于模型权值分布的通信压缩方法 |
Non-Patent Citations (1)
| Title |
|---|
| See also references of EP4465213A4 |
Also Published As
| Publication number | Publication date |
|---|---|
| CN116579438A (zh) | 2023-08-11 |
| EP4465213A4 (en) | 2025-04-02 |
| EP4465213A1 (en) | 2024-11-20 |
| EP4465213B1 (en) | 2026-04-22 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US10568146B2 (en) | Facilitating signaling and transmission protocols for enhanced beam management for initial access | |
| US20200267579A1 (en) | Facilitating enhanced beam management in a wireless communication system | |
| CN111819872A (zh) | 信息传输方法、装置、通信设备及存储介质 | |
| US20250373298A1 (en) | Information transmission method and apparatus, and communication device and storage medium | |
| WO2021057461A1 (zh) | 一种极化码分段编码的方法及通信设备 | |
| US11201712B2 (en) | Facilitating incremental downlink control information design to support downlink control information scheduling | |
| WO2019095783A1 (zh) | 一种同步块与寻呼调度信令关联方法、指示方法及装置 | |
| WO2022016466A1 (zh) | 资源请求信息处理方法及装置、通信设备及存储介质 | |
| CN110690944A (zh) | 信道状态信息的优先级发送、确定方法及装置、存储介质、用户设备 | |
| WO2020199552A1 (zh) | 一种通信方法及设备 | |
| CN115379508A (zh) | 载波管理方法、资源分配方法及相关设备 | |
| WO2024046419A1 (zh) | 一种通信方法及装置 | |
| CN110139371B (zh) | 一种资源分配方法、终端和网络侧设备 | |
| WO2023142828A1 (zh) | 信息处理方法及装置 | |
| WO2018133802A1 (zh) | 一种资源调度方法和基站及终端 | |
| CN110351003B (zh) | 确定传输块大小的方法和通信装置 | |
| WO2024036526A1 (zh) | 一种模型调度方法和装置 | |
| WO2024065804A1 (zh) | 数据压缩传输方法、装置、设备以及存储介质 | |
| WO2022127702A1 (zh) | 信息确定方法、装置及通信设备 | |
| WO2023061601A1 (en) | Beam Management | |
| WO2020134610A1 (zh) | 数据调度方法及装置 | |
| WO2020088517A1 (zh) | 数据传输方法和设备 | |
| WO2026001530A1 (zh) | 能力上报方法、能力确定方法及装置 | |
| WO2019052441A1 (zh) | 一种用于生成扩展符号的方法及装置 | |
| WO2018121220A1 (zh) | 一种系统信息传输方法、用户终端和传输节点 |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 22923614 Country of ref document: EP Kind code of ref document: A1 |
|
| WWE | Wipo information: entry into national phase |
Ref document number: 2022923614 Country of ref document: EP |
|
| ENP | Entry into the national phase |
Ref document number: 2022923614 Country of ref document: EP Effective date: 20240813 |
|
| NENP | Non-entry into the national phase |
Ref country code: DE |
|
| WWG | Wipo information: grant in national office |
Ref document number: 2022923614 Country of ref document: EP |
