WO2021004161A1 - 一种异常检测方法和装置 - Google Patents
一种异常检测方法和装置 Download PDFInfo
- Publication number
- WO2021004161A1 WO2021004161A1 PCT/CN2020/090936 CN2020090936W WO2021004161A1 WO 2021004161 A1 WO2021004161 A1 WO 2021004161A1 CN 2020090936 W CN2020090936 W CN 2020090936W WO 2021004161 A1 WO2021004161 A1 WO 2021004161A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- data
- abnormal
- detection
- result
- reliability
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04W—WIRELESS COMMUNICATION NETWORKS
- H04W24/00—Supervisory, monitoring or testing arrangements
- H04W24/04—Arrangements for maintaining operational condition
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L43/00—Arrangements for monitoring or testing data switching networks
- H04L43/08—Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters
- H04L43/0805—Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters by checking availability
- H04L43/0817—Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters by checking availability by checking functioning
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
- G06F18/214—Generating training patterns; Bootstrap methods, e.g. bagging or boosting
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
- G06F18/217—Validation; Performance evaluation; Active pattern learning techniques
- G06F18/2178—Validation; Performance evaluation; Active pattern learning techniques based on feedback of a supervisor
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L41/00—Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
- H04L41/14—Network analysis or design
- H04L41/142—Network analysis or design using statistical or mathematical methods
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L41/00—Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
- H04L41/16—Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks using machine learning or artificial intelligence
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L63/00—Network architectures or network communication protocols for network security
- H04L63/14—Network architectures or network communication protocols for network security for detecting or protecting against malicious traffic
- H04L63/1408—Network architectures or network communication protocols for network security for detecting or protecting against malicious traffic by monitoring network traffic
- H04L63/1425—Traffic logging, e.g. anomaly detection
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04W—WIRELESS COMMUNICATION NETWORKS
- H04W24/00—Supervisory, monitoring or testing arrangements
- H04W24/08—Testing, supervising or monitoring using real traffic
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04W—WIRELESS COMMUNICATION NETWORKS
- H04W76/00—Connection management
- H04W76/20—Manipulation of established connections
- H04W76/27—Transitions between radio resource control [RRC] states
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04W—WIRELESS COMMUNICATION NETWORKS
- H04W16/00—Network planning, e.g. coverage or traffic planning tools; Network deployment, e.g. resource partitioning or cells structures
- H04W16/22—Traffic simulation tools or models
Definitions
- the embodiments of the present application relate to, but are not limited to, the field of communication operation and maintenance, in particular to an abnormality detection method and device.
- KPI Key Performance Index
- the embodiment of the present application provides an anomaly detection method, including: for each anomaly detection algorithm in N types of anomaly detection algorithms, using the anomaly detection algorithm to detect the first characteristic data of the collected first data to obtain the first feature data A detection result; where N is an integer greater than or equal to 1; when the first detection result of one or more anomaly detection algorithms is abnormal, the reliability of the first characteristic data of the first data is checked Obtain the first verification result; determine the second detection result according to whether the reliability of the abnormality detection algorithm whose first detection result is abnormal is greater than or equal to the first preset threshold and the first verification result.
- the embodiment of the present application provides an abnormality detection device, which includes a processor and a computer-readable storage medium.
- the computer-readable storage medium stores instructions. When the instructions are executed by the processor, any one of the foregoing An anomaly detection method.
- the embodiment of the present application provides a computer-readable storage medium on which a computer program is stored, and when the computer program is executed by a processor, the steps of any of the above-mentioned abnormal detection methods are realized.
- the embodiment of the present application provides an abnormality detection device, including: a detection module, configured to use the abnormality detection algorithm for each of the N abnormality detection algorithms to determine the first feature of the collected first data Data is detected to obtain a first detection result; where N is an integer greater than or equal to 1; when the first detection result of one or more anomaly detection algorithms is abnormal, the first characteristic data of the first data Perform reliability verification to obtain the first verification result; the determining module is used to determine the second detection according to whether the reliability of the abnormality detection algorithm whose first detection result is abnormal is greater than or equal to the first preset threshold and the first verification result result.
- FIG. 1 is a flowchart of an abnormality detection method proposed in an embodiment of the application
- FIG. 2 is a schematic diagram of the structural composition of an abnormality detection device proposed in another embodiment of the application.
- an embodiment of the present application proposes an anomaly detection method, including: step 100, for each anomaly detection algorithm in N types of anomaly detection algorithms, use the anomaly detection algorithm to analyze the first data collected The first feature data is detected to obtain the first detection result; where N is an integer greater than or equal to 1.
- the first data may be any data that needs to be detected for anomaly, such as KPI data.
- the first data includes the time series data data1 and related data data2.
- data1 is recorded as ⁇ (t 1 ,x 1 ),...,(t w ,x w ) ⁇ , t w is the time to be detected, and x w is the value of the success rate of RRC establishment connection corresponding to t w ,
- data2 is denoted as [x eff , x ref , pre_res, obj_id, KPI_id]
- x eff is the number of RRC connection establishment requests corresponding to t w , x ref It is the number of RRC connection establishment failures
- pre_res is whether it is abnormal at t w-1
- obj_id is the cell id
- KPI_id is the id of the KPI to be detected.
- all the first feature data required by the N types of anomaly detection algorithms can be extracted from the first data at one time, and each anomaly detection algorithm is used to compare the collected first feature data.
- the first feature data of the data is detected to obtain the first detection result, the first feature data required by the anomaly detection algorithm is selected from the extracted feature data;
- the first feature data required by the anomaly detection algorithm is extracted from the collected first data .
- the first data need to be feature extraction, and for the first data that does not need feature extraction, the first data can be directly used as the first feature data.
- the first data is cleaned first, and then the feature data is extracted from the cleaned first data.
- the first data is cleaned first, and then the feature data is extracted from the cleaned first data.
- the first data is cleaned first, and part of the first data may be cleaned, which is not limited in the embodiment of the present application.
- data1 is cleaned, but data2 is not cleaned.
- data1 has missing values
- use methods such as linear interpolation and mean to fill in the missing data.
- extract the first feature data from the cleaned data1 for example, perform feature statistics on the cleaned data1 (such as maximum x max , minimum x min , average x mean , median x median , Standard deviation x std, etc.), periodic judgment, classification feature construction (for example, one-hot encoding of t w , including hour, day of the week, etc.), calculation of the year-on-year ring ratio of x w , first-order difference, two Order difference etc.
- feature statistics such as maximum x max , minimum x min , average x mean , median x median , Standard deviation x std, etc.
- classification feature construction for example, one-hot encoding of t w , including hour, day of the week, etc.
- calculation of the year-on-year ring ratio of x w for example, one-hot encoding of t w , including hour, day of the week, etc.
- the anomaly detection algorithm may be based on a statistical learning algorithm (for example, 3-sigma, Exponentially Weighted Moving-Average (EWMA), and differential integrated moving average autoregressive model (ARIMA, Autoregressive Integrated Moving-Average). Average Model, etc.), unsupervised classification algorithms (such as Isolation Forest, Support Vector Machine (SVM) (one-class SVM), Variational Auto-Encode, etc.) , Supervised learning algorithms (such as logistic regression, extreme gradient boosting (XGBoost, eXtreme Gradient Boosting), deep neural network (DNN, Deep Neural Network), etc.).
- a statistical learning algorithm for example, 3-sigma, Exponentially Weighted Moving-Average (EWMA), and differential integrated moving average autoregressive model (ARIMA, Autoregressive Integrated Moving-Average). Average Model, etc.
- unsupervised classification algorithms such as Isolation Forest, Support Vector Machine (SVM) (one-class SVM), Variational Auto-Encode,
- the embodiment of the present application uses a variety of anomaly detection algorithms to perform anomaly detection, with the purpose of detecting all possible abnormal data.
- Step 101 When the first detection result of one or more abnormal detection algorithms is abnormal, perform reliability verification on the first characteristic data of the first data to obtain the first verification result.
- the user can set the verification conditions for reliability verification, select the reliability evaluation method, and set related parameters through the interface, or import the verification conditions, reliability evaluation methods, and related parameters in the form of writing configuration files. parameter.
- the specific implementation manner is not limited in the embodiment of this application.
- performing reliability verification on the first characteristic data of the first data includes: performing at least one of the following on the first characteristic data of the first data: data validity verification, safety interval verification, and complete Network contribution test and abnormal continuity test.
- the data validity check on the first feature data is to detect whether the number of RRC establishment connection requests x eff is greater than the second preset threshold x eff _threshold, when RRC is established When the number of connection requests is greater than the second preset threshold, the first data is considered valid, and the security interval check is continued on the first characteristic data; when the number of RRC establishment connection requests is less than or equal to the second preset threshold, the first data is considered invalid, Then the first data is normal (that is, the first check result is normal), and the check ends.
- x eff _threshold is given by experts based on business experience and domain knowledge, which can be specifically set by the user on the page or import a configurable file.
- Performing a safe interval check on the first characteristic data is to check whether the RRC connection establishment success rate x w is within the safe interval [x min , x max ].
- the first data is considered normal ( That is, the first check result is normal), and the check ends; when the RRC connection establishment success rate is not within the safe interval, the whole network contribution degree check is continued on the first characteristic data.
- the safety interval [x min , x max ] is given by experts based on business experience and domain knowledge, which can be specifically set by the user on the page or import a configurable file.
- x ref _threshold is given by experts based on business experience and domain knowledge, which can be specifically set by the user on the page or import a configurable file.
- the continuity test of the first feature is to check whether the test result pre_res for a period of time before t w is abnormal, and whether there is a continuous deterioration trend through the value of the second-order difference, when the test for a period of time before t w
- the first verification result is determined to be abnormal; when at least one of the test results for a period of time before t w is normal, or there is a tendency to change for the better, it is determined
- the first check result is normal and the check ends.
- Step 102 Determine a second detection result according to whether the reliability of the abnormality detection algorithm whose first detection result is abnormal is greater than or equal to the first preset threshold and the first verification result.
- the second detection result includes any one or more of the following: When the reliability of one or more of the abnormality detection algorithms in the abnormality detection algorithms whose first detection result is abnormal is greater than or equal to the first preset threshold, it is determined that the second detection result is abnormal; when When the reliability of the abnormality detection algorithm whose first detection result is abnormal is less than the first preset threshold, the first verification result is used as the second detection result.
- the second detection result is determined to be abnormal; when the reliability of the abnormal detection algorithms XGBoost and 3-sigma are both less than the first preset threshold, the first verification result is used as the second detection result.
- the reliability of the anomaly detection algorithm includes: similarity comparison of anomaly detection algorithms, statistical testing, and the like. For example, the coincidence rate of the anomaly detection algorithm.
- the method before determining the second detection result according to whether the reliability of the abnormality detection algorithm whose first detection result is abnormal is greater than or equal to the first preset threshold and the first verification result, the method further includes: For each of the abnormality detection algorithms, the coincidence rate of the abnormality detection algorithms is calculated.
- calculating the coincidence rate of the anomaly detection algorithm includes: determining the ratio of the first number to the second number as the coincidence rate; wherein, the first number is all second numbers collected within a preset time.
- a third detection result obtained by detecting the second characteristic data of the second data using the abnormality detection algorithm and a second verification result obtained by performing a reliability check on the second characteristic data of the second data The same number of second data; the second number is the third detection result obtained by using the abnormal detection algorithm to detect the second characteristic data of the second data among all the second data collected within the preset time Is the number of abnormal second data.
- the test results of the RRC connection success rate index for all cells are abnormal samples.
- the detection results obtained by the anomaly detection algorithm are abnormal samples.
- the method further includes any one or more of the following: when the second detection result is abnormal and the user’s abnormal cancellation information is received, marking the first data as normal When the second detection result is abnormal, and the user’s abnormal confirmation information is received, the first data is added to the labeled sample library as an abnormal sample; wherein, the label The samples marked as normal or abnormal in the sample library are used to train the supervised learning model of the abnormality detection algorithm, and the abnormality detection algorithm detects the first feature data of the first data based on the supervised learning model to obtain the first feature data One test result. Add the first data as labeled samples to the labeled sample library.
- the anomaly detection algorithm based on supervised learning or semi-supervised learning apply new labeled samples Incremental learning or periodic retraining of the supervised learning model improves the reliability of the supervised learning model, thereby improving the detection accuracy of the anomaly detection algorithm.
- the abnormal information when the second detection result is abnormal, the abnormal information is displayed.
- the abnormal information includes the object name "cell 1" corresponding to the abnormal data (ie the first data), the occurrence time t w , and the KPI name "RRC connection establishment Success rate", KPI value x w, etc.
- the user can choose to enter the exception confirmation information to confirm the exception, or enter the exception cancellation information to cancel the exception; or do nothing.
- the abnormality confirmation information the first data is added as an abnormal sample to the labeled sample library; when the user inputs abnormal cancellation information, the first data is added to the sample library as a normal sample.
- the embodiment of the present application determines the final detection result based on the detection results of various anomaly detection algorithms and the reliability verification results. Since the reliability verification is based on the reliability verification of expert business experience and domain knowledge, the detection is improved. Reliability of results.
- an abnormality detection device which includes a processor and a computer-readable storage medium.
- the computer-readable storage medium stores instructions. When the instructions are executed by the processor, the foregoing Any anomaly detection method.
- Another embodiment of the present application provides a computer-readable storage medium on which a computer program is stored, and when the computer program is executed by a processor, the steps of any of the above-mentioned abnormal detection methods are realized.
- an anomaly detection device including: a detection module 201, for each anomaly detection algorithm in N types of anomaly detection algorithms, using the anomaly detection algorithm to collect The first feature data of the first data is detected to obtain the first detection result; where N is an integer greater than or equal to 1; when the first detection result of one or more abnormal detection algorithms is abnormal, the first detection result Perform reliability verification on the first characteristic data of a data to obtain a first verification result; the determining module 202 is configured to determine whether the reliability of the abnormality detection algorithm for which the first detection result is abnormal is greater than or equal to the first preset threshold and the first A verification result determines the second detection result.
- the first data may be any data that needs to be detected for anomaly, such as KPI data.
- the first data includes the time series data data1 and related data data2.
- data1 is recorded as ⁇ (t 1 ,x 1 ),...,(t w ,x w ) ⁇ , t w is the time to be detected, and x w is the value of the success rate of RRC establishment connection corresponding to t w ,
- data2 is denoted as [x eff , x ref , pre_res, obj_id, KPI_id]
- x eff is the number of RRC connection establishment requests corresponding to t w , x ref It is the number of RRC connection establishment failures
- pre_res is whether it is abnormal at t w-1
- obj_id is the cell id
- KPI_id is the id of the KPI to be detected.
- the detection module 201 may extract all the first characteristic data required by the N types of anomaly detection algorithms from the first data at one time after collecting the first data, and then use each anomaly detection algorithm to collect When the first feature data of the first data is detected to obtain the first detection result, the first feature data required by the abnormal detection algorithm is selected from the extracted feature data;
- the detection module 201 uses each anomaly detection algorithm to detect the first characteristic data of the collected first data to obtain the first detection result, it extracts from the collected first data the first feature required by the anomaly detection algorithm. One feature data.
- the first data need to be feature extraction, and for the first data that does not need feature extraction, the first data can be directly used as the first feature data.
- the detection module 201 before extracting the characteristic data of the first data, the detection module 201 first cleans the first data, and then extracts the characteristic data from the cleaned first data.
- the first data needs to be cleaned, and part of the first data may be cleaned, which is not limited in the embodiment of the present application.
- data1 is cleaned, but data2 is not cleaned.
- data1 has missing values
- use methods such as linear interpolation and mean to fill in the missing data.
- extract the first feature data from the cleaned data1 for example, perform feature statistics on the cleaned data1 (such as maximum x max , minimum x min , average x mean , median x median , Standard deviation x std, etc.), periodic judgment, classification feature construction (for example, one-hot encoding of t w , including hour, day of the week, etc.), calculation of the year-on-year ring ratio of x w , first-order difference, two Order difference etc.
- feature statistics such as maximum x max , minimum x min , average x mean , median x median , Standard deviation x std, etc.
- classification feature construction for example, one-hot encoding of t w , including hour, day of the week, etc.
- calculation of the year-on-year ring ratio of x w for example, one-hot encoding of t w , including hour, day of the week, etc.
- the anomaly detection algorithm may be based on a statistical learning algorithm (for example, 3-sigma, Exponentially Weighted Moving-Average (EWMA), and differential integrated moving average autoregressive model (ARIMA, Autoregressive Integrated Moving-Average). Average Model, etc.), unsupervised classification algorithms (such as Isolation Forest, Support Vector Machine (SVM) (one-class SVM), Variational Auto-Encode, etc.) , Supervised learning algorithms (such as logistic regression, extreme gradient boosting (XGBoost, eXtreme Gradient Boosting), deep neural network (DNN, Deep Neural Network), etc.).
- a statistical learning algorithm for example, 3-sigma, Exponentially Weighted Moving-Average (EWMA), and differential integrated moving average autoregressive model (ARIMA, Autoregressive Integrated Moving-Average). Average Model, etc.
- unsupervised classification algorithms such as Isolation Forest, Support Vector Machine (SVM) (one-class SVM), Variational Auto-Encode,
- the embodiment of the present application uses a variety of anomaly detection algorithms to perform anomaly detection, with the purpose of detecting all possible abnormal data.
- the user can set the verification conditions for reliability verification, select the reliability evaluation method, and set related parameters through the interface, or import the verification conditions, reliability evaluation methods, and related parameters in the form of writing configuration files. parameter.
- the specific implementation manner is not limited in the embodiment of this application.
- the detection module 201 is specifically configured to implement the reliability check on the first characteristic data of the first data in the following manner: perform at least one of the following on the first characteristic data of the first data: data Validity test, safety interval test, network-wide contribution test, abnormal continuity test.
- the data validity check on the first feature data is to detect whether the number of RRC establishment connection requests x eff is greater than the second preset threshold x eff _threshold, when RRC is established When the number of connection requests is greater than the second preset threshold, the first data is considered valid, and the security interval check is continued on the first characteristic data; when the number of RRC establishment connection requests is less than or equal to the second preset threshold, the first data is considered invalid, Then the first data is normal (that is, the first check result is normal), and the check ends.
- x eff _threshold is given by experts based on business experience and domain knowledge, which can be specifically set by the user on the page or import a configurable file.
- Performing a safe interval check on the first characteristic data is to check whether the RRC connection establishment success rate x w is within the safe interval [x min , x max ].
- the first data is considered normal ( That is, the first check result is normal), and the check ends; when the RRC connection establishment success rate is not within the safe interval, the first characteristic data is continuously checked for the contribution of the whole network.
- the safety interval [x min , x max ] is given by experts based on business experience and domain knowledge, which can be specifically set by the user on the page or import a configurable file.
- x ref _threshold is given by experts based on business experience and domain knowledge, which can be specifically set by the user on the page or import a configurable file.
- the continuity test of the first feature is to check whether the test result pre_res for a period of time before t w is abnormal, and whether there is a continuous deterioration trend through the value of the second-order difference, when the test for a period of time before t w
- the first verification result is determined to be abnormal; when at least one of the test results for a period of time before t w is normal, or there is a tendency to change for the better, it is determined
- the first check result is normal and the check ends.
- the determining module 202 is specifically configured to execute any one or more of the following: when the first detection result is abnormal, the reliability of one or more abnormal detection algorithms is greater than Or equal to the first preset threshold, it is determined that the second detection result is abnormal; when the reliability of the abnormality detection algorithm for which the first detection result is abnormal is less than the first preset threshold, all The first verification result is used as the second detection result.
- the second detection result is determined to be abnormal; when the reliability of the abnormal detection algorithms XGBoost and 3-sigma are both less than the first preset threshold, the first verification result is used as the second detection result.
- the reliability of the anomaly detection algorithm includes: similarity comparison of anomaly detection algorithms, statistical testing, and the like. For example, the coincidence rate of the anomaly detection algorithm.
- the determining module 202 is further configured to: for each of the abnormality detection algorithms, calculate the coincidence rate of the abnormality detection algorithms.
- the determining module 202 is specifically configured to calculate the coincidence rate of the anomaly detection algorithm in the following manner: determine that the ratio of the first number to the second number is the coincidence rate; wherein, the first number is Among all the second data collected within the preset time, the third detection result obtained by using the abnormality detection algorithm to detect the second characteristic data of the second data and the reliability of the second characteristic data of the second data.
- the number of second data with the same second verification result obtained by the verification; the second number is the second feature of the second data using the abnormality detection algorithm among all the second data collected within the preset time
- the third detection result obtained by data detection is the number of abnormal second data.
- the test results of the RRC connection success rate index for all cells are abnormal samples.
- the detection results obtained by the anomaly detection algorithm are abnormal samples.
- the determining module 202 is further configured to perform any one or more of the following: when the second detection result is abnormal, and the user's abnormal cancellation information is received, the first data is used as The samples marked as normal are added to the marked sample library; when the second detection result is abnormal and the user's abnormal confirmation information is received, the first data is added to the marked sample library as the samples marked as abnormal; wherein, The samples labeled as normal or abnormal in the labeled sample library are used to train the supervised learning model of the abnormality detection algorithm, and the abnormality detection algorithm performs the first feature data of the first data based on the supervised learning model. The first test result is obtained. Add the first data as labeled samples to the labeled sample library.
- the anomaly detection algorithm based on supervised learning or semi-supervised learning apply new labeled samples Incremental learning or periodic retraining of the supervised learning model improves the reliability of the supervised learning model, thereby improving the detection accuracy of the anomaly detection algorithm.
- the abnormal information when the second detection result is abnormal, the abnormal information is displayed.
- the abnormal information includes the object name "cell 1" corresponding to the abnormal data (ie the first data), the occurrence time t w , and the KPI name "RRC connection establishment Success rate", KPI value x w, etc.
- the user can choose to enter the exception confirmation information to confirm the exception, or enter the exception cancellation information to cancel the exception; or do nothing.
- the user inputs abnormality confirmation information the first data is added to the labeled sample library as a sample labeled as abnormal; when the user inputs abnormality cancellation information, the first data is added to the sample library as a labeled normal sample.
- the embodiment of the present application determines the final detection result based on the detection results of various anomaly detection algorithms and the reliability verification results. Since the reliability verification is based on the reliability verification of expert business experience and domain knowledge, the detection is improved. Reliability of results.
- Such software may be distributed on a computer-readable medium
- the computer-readable medium may include a computer storage medium (or non-transitory medium) and a communication medium (or transitory medium).
- the term computer storage medium includes volatile and non-volatile memory implemented in any method or technology for storing information (such as computer-readable instructions, data structures, program modules, or other data).
- Computer storage media include but are not limited to RAM, ROM, EEPROM, flash memory or other memory technology, CD-ROM, digital versatile disk (DVD) or other optical disk storage, magnetic cassette, tape, magnetic disk storage or other magnetic storage device, or Any other medium used to store desired information and that can be accessed by a computer.
- communication media usually contain computer-readable instructions, data structures, program modules, or other data in a modulated data signal such as carrier waves or other transmission mechanisms, and may include any information delivery media .
Landscapes
- Engineering & Computer Science (AREA)
- Signal Processing (AREA)
- Computer Networks & Wireless Communication (AREA)
- Data Mining & Analysis (AREA)
- Theoretical Computer Science (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Artificial Intelligence (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Evolutionary Computation (AREA)
- Physics & Mathematics (AREA)
- Evolutionary Biology (AREA)
- Bioinformatics & Computational Biology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Computer Security & Cryptography (AREA)
- Environmental & Geological Engineering (AREA)
- Computing Systems (AREA)
- Computer Hardware Design (AREA)
- Mathematical Analysis (AREA)
- Algebra (AREA)
- Mathematical Optimization (AREA)
- Mathematical Physics (AREA)
- Probability & Statistics with Applications (AREA)
- Pure & Applied Mathematics (AREA)
- Software Systems (AREA)
- Medical Informatics (AREA)
- Databases & Information Systems (AREA)
- Debugging And Monitoring (AREA)
Abstract
Description
Claims (10)
- 一种异常检测方法,包括:对于N种异常检测算法中的每一种异常检测算法,采用所述异常检测算法对采集的第一数据的第一特征数据进行检测得到第一检测结果;其中,N为大于或等于1的整数;当一种或一种以上异常检测算法的第一检测结果为异常时,对所述第一数据的第一特征数据进行可靠性校验得到第一校验结果;根据第一检测结果为异常的异常检测算法的可靠性是否大于或等于第一预设阈值和第一校验结果确定第二检测结果。
- 根据权利要求1所述的方法,其中,该方法还包括以下任意一个或多个:当所述第二检测结果为异常,且接收到用户的异常取消信息时,将所述第一数据作为标注为正常的样本加入标注样本库;当所述第二检测结果为异常,且接收到用户的异常确认信息时,将所述第一数据作为标注为异常的样本加入标注样本库中;其中,所述标注样本库中的标注为正常或异常的样本用于训练所述异常检测算法的监督学习模型,所述异常检测算法基于所述监督学习模型对所述第一数据的第一特征数据进行检测得到所述第一检测结果。
- 根据权利要求1或2所述的方法,其中,所述对第一数据的第一特征数据进行可靠性校验包括:对所述第一数据的第一特征数据进行以下至少之一:数据有效性检验、安全区间检验、全网贡献度检验、异常持续性检验。
- 根据权利要求1或2所述的方法,其中,所述根据第一检测结果为异常的异常检测算法的可靠性是否大于或等于第一预设阈值和第一校验结果确定第二检测结果包括以下任意一种或多种:当所述第一检测结果为异常的异常检测算法中的一种或一种以上异常检 测算法的可靠性大于或等于所述第一预设阈值时,确定所述第二检测结果为异常;当所述第一检测结果为异常的异常检测算法的可靠性均小于所述第一预设阈值时,将所述第一校验结果作为所述第二检测结果。
- 根据权利要求1或2所述的方法,,其中,所述异常检测算法的可靠性包括:所述异常检测算法的重合率。
- 根据权利要求5所述的方法,其中,所述根据第一检测结果为异常的异常检测算法的可靠性是否大于或等于第一预设阈值和第一校验结果确定第二检测结果之前,该方法还包括:对于每一种所述异常检测算法,计算所述异常检测算法的重合率。
- 根据权利要求6所述的方法,其中,所述计算异常检测算法的重合率包括:确定第一数量和第二数量的比值为所述重合率;其中,所述第一数量为在预设时间内采集的所有第二数据中,采用所述异常检测算法对所述第二数据的第二特征数据进行检测得到的第三检测结果和对所述第二数据的第二特征数据进行可靠性校验得到的第二校验结果相同的第二数据的数量;所述第二数量为在预设时间内采集的所有第二数据中,采用所述异常检测算法对所述第二数据的第二特征进行检测得到的第三检测结果为异常的第二数据的数量。
- 一种异常检测装置,包括处理器和计算机可读存储介质,所述计算机可读存储介质中存储有指令,其中,当所述指令被所述处理器执行时,实现如权利要求1~7任一项所述的异常检测方法。
- 一种计算机可读存储介质,其上存储有计算机程序,其中,所述计算机程序被处理器执行时实现如权利要求1~7任一项所述的异常检测方法的步骤。
- 一种异常检测装置,包括:检测模块,用于对于N种异常检测算法中的每一种异常检测算法,采用所述异常检测算法对采集的第一数据的第一特征数据进行检测得到第一检测结果;其中,N为大于或等于1的整数;当一种或一种以上异常检测算法的第一检测结果为异常时,对所述第一数据的第一特征数据进行可靠性校验得到第一校验结果;确定模块,用于根据第一检测结果为异常的异常检测算法的可靠性是否大于或等于第一预设阈值和第一校验结果确定第二检测结果。
Priority Applications (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US17/625,078 US11777824B2 (en) | 2019-07-05 | 2020-05-19 | Anomaly detection method and apparatus |
| EP20837813.3A EP3979416B1 (en) | 2019-07-05 | 2020-05-19 | Anomaly detection method and apparatus |
| JP2021578200A JP2022539578A (ja) | 2019-07-05 | 2020-05-19 | 異常検出方法および装置 |
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201910605053.9 | 2019-07-05 | ||
| CN201910605053.9A CN112188534B (zh) | 2019-07-05 | 2019-07-05 | 一种异常检测方法和装置 |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2021004161A1 true WO2021004161A1 (zh) | 2021-01-14 |
Family
ID=73915346
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/CN2020/090936 Ceased WO2021004161A1 (zh) | 2019-07-05 | 2020-05-19 | 一种异常检测方法和装置 |
Country Status (5)
| Country | Link |
|---|---|
| US (1) | US11777824B2 (zh) |
| EP (1) | EP3979416B1 (zh) |
| JP (1) | JP2022539578A (zh) |
| CN (1) | CN112188534B (zh) |
| WO (1) | WO2021004161A1 (zh) |
Cited By (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN113761025A (zh) * | 2021-08-30 | 2021-12-07 | 上海电气国轩新能源科技有限公司 | 储能电站电池数据异常的检测方法、系统、设备及介质 |
Families Citing this family (8)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN113535445B (zh) * | 2021-01-06 | 2026-04-24 | 腾讯科技(深圳)有限公司 | 一种异常检测方法、设备及计算机可读存储介质 |
| CN115242600A (zh) * | 2021-04-23 | 2022-10-25 | 北京华为数字技术有限公司 | 一种网络异常检测方法及装置 |
| US12386721B2 (en) * | 2021-08-04 | 2025-08-12 | Verizon Patent And Licensing Inc. | Anomaly detection using user behavioral biometrics profiling method and apparatus |
| US11832119B2 (en) * | 2021-08-31 | 2023-11-28 | Verizon Patent And Licensing Inc. | Identification of anomalous telecommunication service |
| CN115374851A (zh) * | 2022-08-19 | 2022-11-22 | 北京市燃气集团有限责任公司 | 一种燃气数据异常检测方法及装置 |
| KR102593981B1 (ko) * | 2022-11-10 | 2023-10-25 | 주식회사 이노와이어리스 | 네트워크 로그 데이터의 결측치 처리 및 이를 통한 통신 결함 근원 분류 방법 |
| CN117851907B (zh) * | 2024-01-10 | 2024-06-11 | 山东省水利勘测设计院有限公司 | 一种基于物联网技术的水闸渗流监测方法 |
| CN119829914B (zh) * | 2024-01-24 | 2025-12-02 | 乌鲁木齐大数据产业发展投资有限公司 | 一种基于多维数据的数据处理方法 |
Citations (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN1996888A (zh) * | 2006-12-15 | 2007-07-11 | 华为技术有限公司 | 一种网络流量异常的检测方法及检测装置 |
| CN105630885A (zh) * | 2015-12-18 | 2016-06-01 | 国网福建省电力有限公司泉州供电公司 | 一种用电异常检测方法及系统 |
| CN106357622A (zh) * | 2016-08-29 | 2017-01-25 | 北京工业大学 | 基于软件定义网络的网络异常流量检测防御系统 |
| CN107276851A (zh) * | 2017-06-26 | 2017-10-20 | 中国信息安全测评中心 | 一种节点的异常检测方法、装置、网络节点及控制台 |
| CN108322347A (zh) * | 2018-02-09 | 2018-07-24 | 腾讯科技(深圳)有限公司 | 数据检测方法、装置、检测服务器及存储介质 |
| US20180292806A1 (en) * | 2017-04-06 | 2018-10-11 | Renesas Electronics Corporation | Abnormality detection system, semiconductor device manufacturing system and semiconductor device manufacturing method |
Family Cites Families (10)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| AU2002235516A1 (en) * | 2001-01-08 | 2002-07-16 | Vextec Corporation | Method and apparatus for predicting failure in a system |
| US20060063521A1 (en) * | 2004-09-21 | 2006-03-23 | Benjamin Cheung | Method of monitoring wireless network performance |
| JP5431235B2 (ja) * | 2009-08-28 | 2014-03-05 | 株式会社日立製作所 | 設備状態監視方法およびその装置 |
| US10592093B2 (en) * | 2014-10-09 | 2020-03-17 | Splunk Inc. | Anomaly detection |
| US9872188B2 (en) * | 2015-07-28 | 2018-01-16 | Futurewei Technologies, Inc. | Adaptive filtering based network anomaly detection |
| CN105471854B (zh) | 2015-11-18 | 2019-06-28 | 国网智能电网研究院 | 一种基于多级策略的自适应边界异常检测方法 |
| US10164991B2 (en) * | 2016-03-25 | 2018-12-25 | Cisco Technology, Inc. | Hierarchical models using self organizing learning topologies |
| JP6805554B2 (ja) * | 2016-06-02 | 2020-12-23 | 富士電機株式会社 | 監視装置及び監視方法 |
| US10904276B2 (en) | 2017-09-14 | 2021-01-26 | Nokia Technologies Oy | Systems and methods for anomaly detection |
| CN108762993A (zh) | 2018-06-06 | 2018-11-06 | 山东超越数控电子股份有限公司 | 一种基于人工智能的虚拟机故障迁移方法与装置 |
-
2019
- 2019-07-05 CN CN201910605053.9A patent/CN112188534B/zh active Active
-
2020
- 2020-05-19 JP JP2021578200A patent/JP2022539578A/ja active Pending
- 2020-05-19 US US17/625,078 patent/US11777824B2/en active Active
- 2020-05-19 WO PCT/CN2020/090936 patent/WO2021004161A1/zh not_active Ceased
- 2020-05-19 EP EP20837813.3A patent/EP3979416B1/en active Active
Patent Citations (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN1996888A (zh) * | 2006-12-15 | 2007-07-11 | 华为技术有限公司 | 一种网络流量异常的检测方法及检测装置 |
| CN105630885A (zh) * | 2015-12-18 | 2016-06-01 | 国网福建省电力有限公司泉州供电公司 | 一种用电异常检测方法及系统 |
| CN106357622A (zh) * | 2016-08-29 | 2017-01-25 | 北京工业大学 | 基于软件定义网络的网络异常流量检测防御系统 |
| US20180292806A1 (en) * | 2017-04-06 | 2018-10-11 | Renesas Electronics Corporation | Abnormality detection system, semiconductor device manufacturing system and semiconductor device manufacturing method |
| CN107276851A (zh) * | 2017-06-26 | 2017-10-20 | 中国信息安全测评中心 | 一种节点的异常检测方法、装置、网络节点及控制台 |
| CN108322347A (zh) * | 2018-02-09 | 2018-07-24 | 腾讯科技(深圳)有限公司 | 数据检测方法、装置、检测服务器及存储介质 |
Cited By (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN113761025A (zh) * | 2021-08-30 | 2021-12-07 | 上海电气国轩新能源科技有限公司 | 储能电站电池数据异常的检测方法、系统、设备及介质 |
Also Published As
| Publication number | Publication date |
|---|---|
| CN112188534A (zh) | 2021-01-05 |
| EP3979416A1 (en) | 2022-04-06 |
| US20220278914A1 (en) | 2022-09-01 |
| EP3979416B1 (en) | 2026-02-11 |
| EP3979416A4 (en) | 2022-08-03 |
| JP2022539578A (ja) | 2022-09-12 |
| CN112188534B (zh) | 2023-11-03 |
| US11777824B2 (en) | 2023-10-03 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| WO2021004161A1 (zh) | 一种异常检测方法和装置 | |
| US12147893B2 (en) | Anomaly detection in network topology | |
| WO2022048168A1 (zh) | 一种用于故障预测神经网络模型的训练方法与设备 | |
| US20150331063A1 (en) | Dynamic Modeling and Resilience for Power Distribution | |
| US8918345B2 (en) | Network analysis system | |
| AU2019275633B2 (en) | System and method of automated fault correction in a network environment | |
| US10158522B2 (en) | Network monitor and method for event based prediction of radio network outages and their root cause | |
| CN114169604A (zh) | 性能指标的异常检测方法、异常检测装置、电子设备和存储介质 | |
| CN114138610B (zh) | 一种故障处理方法及装置 | |
| CN110933115B (zh) | 基于动态session的分析对象行为异常检测方法及装置 | |
| US20220046039A1 (en) | Method, device, and computer program product for abnormality detection | |
| CN104899657A (zh) | 社团融合事件的预测方法 | |
| CN103634158B (zh) | 一种snmp管理进程的压力测试方法和装置 | |
| CN115130616A (zh) | 一种告警预测方法、装置、设备及介质 | |
| CN118041608A (zh) | 一种基于攻击知识与多属性嵌入的apt网络攻击检测方法 | |
| CN117456417A (zh) | 交通异常事件感知方法、系统、电子设备及存储介质 | |
| CN118555187A (zh) | 一种告警关联分析方法、装置、电子设备及存储介质 | |
| CN110738414B (zh) | 一种风险预测方法和装置及计算机可读存储介质 | |
| CN115509802A (zh) | 网络设备的备份决策方法、装置、电子设备及存储介质 | |
| US20240144075A1 (en) | Updating label probability distributions of data points | |
| CN120803785A (zh) | 业务性能故障诊断方法、装置、设备、介质和程序产品 | |
| CN121509211A (zh) | 基于边缘ai驱动的实时链路异常检测与采样方法及装置 | |
| CN120508427A (zh) | 智能计算中心的算力资源运行故障智能诊断方法及装置 | |
| CN121579257A (zh) | 云原生环境故障预测方法、计算机设备、计算机可读存储介质 | |
| CN121396754A (zh) | 一种分组传输网络故障识别定位方法、设备及存储介质 |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 20837813 Country of ref document: EP Kind code of ref document: A1 |
|
| ENP | Entry into the national phase |
Ref document number: 2021578200 Country of ref document: JP Kind code of ref document: A |
|
| ENP | Entry into the national phase |
Ref document number: 2020837813 Country of ref document: EP Effective date: 20211226 |
|
| WWG | Wipo information: grant in national office |
Ref document number: 2020837813 Country of ref document: EP |