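WO2017084521A1 - Order clustering method and apparatus, and anti-malicious-information method and apparatus - Google Patents
Order clustering method and apparatus, and anti-malicious-information method and apparatus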
- Publication number
- WO2017084521A1 (PCT/CN2016/105137)
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- order
- cluster
- content
- information
- clustering
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q30/00—Commerce
- G06Q30/06—Buying, selling or leasing transactions
- G06Q30/0601—Electronic shopping [e-shopping]
- G06Q30/0633—Managing shopping lists, e.g. compiling or processing purchase lists
- G06Q30/0635—Managing shopping lists, e.g. compiling or processing purchase lists replenishment orders; recurring orders
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/35—Clustering; Classification
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/3331—Query processing
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/907—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/955—Retrieval from the web using information identifiers, e.g. uniform resource locators [URL]
- G06F16/9566—URL specific, e.g. using aliases, detecting broken or misspelled links
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/23—Clustering techniques
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q30/00—Commerce
- G06Q30/018—Certifying business or products
- G06Q30/0185—Product, service or business identity fraud
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L63/00—Network architectures or network communication protocols for network security
- H04L63/14—Network architectures or network communication protocols for network security for detecting or protecting against malicious traffic
- H04L63/1441—Countermeasures against malicious traffic
- H04L63/1483—Countermeasures against malicious traffic service impersonation, e.g. phishing, pharming or web spoofing
Definitions
- The present application relates to the field of network communication technologies, and in particular to an order clustering method and apparatus and to an anti-malicious-information method and apparatus.
- Each transaction is regarded as an “order”.
- For example, each use of the “I want to collect payment” service in the Alipay application forms an order, and each remittance also constitutes an order.
- Orders such as those created by the “I want to collect payment” service in the Alipay application may carry malicious information.
- For example, a malicious user sends “I want to collect payment” requests to a large number of Alipay users and modifies the consumption record corresponding to each order.
- The phishing websites involved mostly use slogans such as “Taobao Seventh Anniversary”. Because the user arrives from Alipay's official website, the user tends to trust the link and is deceived. This causes large financial losses for customers and damages Alipay's reputation.
- Keywords are generally used to filter and identify malicious information, but this approach has certain deficiencies.
- A keyword filtering scheme works only when a keyword is already known to indicate spam, and a malicious user can adapt by modifying the keywords in the malicious information.
- The keyword filtering scheme then becomes ineffective, and it cannot actively discover malicious information.
- Against an endless stream of malicious information in varied means and forms, filtering can only be improved by having human monitors keep adding keywords; this passive response makes identifying such malicious information costly.
- An order clustering method and device and an anti-malicious-information method and device are proposed, in which a clustering operation is performed on the orders.
- The clustering operation classifies orders more accurately and greatly reduces the degree of human participation, and the corresponding anti-malicious-information method and device can respond quickly to all orders in a cluster classified as malicious information.
- An embodiment of the present application provides a method for order clustering in an electronic transaction, including:
- generating an order unique identifier and an order content according to the order information;
- converting the order content into an order vector;
- performing a clustering operation according to the order unique identifier and the order vector to obtain the cluster to which the order belongs.
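- The embodiment of the present application further provides an anti-malicious-information method based on order clustering, including:
- generating an order unique identifier and an order content according to the order information;
- converting the order content into an order vector;
- performing a clustering operation according to the order unique identifier and the order vector to obtain the cluster to which the order belongs;
- analyzing the representative order content in each cluster and, if the representative order content is malicious information, processing all the orders in that cluster accordingly.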
- the embodiment of the present application further provides an order clustering device in an electronic transaction, including
- a generating unit configured to generate an order unique identifier and an order content according to the order information
- a vector conversion unit configured to convert the order content into an order vector
- the clustering operation unit is configured to perform a clustering operation according to the order unique identifier and the order vector to obtain a result of the cluster to which the order belongs.
- the embodiment of the present application further provides an anti-malicious information device based on order clustering, including
- a generating unit configured to generate an order unique identifier and an order content according to the order information
- a vector conversion unit configured to convert the order content into an order vector
- a clustering operation unit configured to perform a clustering operation according to the order unique identifier and the order vector, to obtain a result of the classification cluster to which the order belongs;
- the processing unit is configured to analyze the representative order content in all the clusters, and if the representative order content is malicious information, all the orders in the cluster are processed accordingly.
- Orders in electronic transactions are classified automatically by the clustering algorithm, new classes can be obtained from real-time analysis of order content, and clusters identified as malicious information are processed accordingly. This reduces the degree of human involvement and allows new clusters to be handled automatically.
- FIG. 1 is a flowchart of a method for order clustering in an electronic transaction according to an embodiment of the present application
- FIG. 2 is a schematic structural diagram of an order clustering apparatus in an electronic transaction according to an embodiment of the present application
- FIG. 3 is a flowchart of an anti-malicious information method based on order clustering according to an embodiment of the present application
- FIG. 4 is a schematic structural diagram of an anti-malicious information device based on order clustering according to an embodiment of the present application
- FIG. 5 is a specific flowchart of an anti-malicious information method based on order clustering in an electronic transaction according to an embodiment of the present application
- FIG. 6 is a flowchart of a method for quickly classifying an order after clustering processing according to an embodiment of the present application.
- the embodiment of the present application provides an order clustering method and device, and a method and device for anti-malicious information.
- FIG. 1 is a flowchart of a method for order clustering in an electronic transaction according to an embodiment of the present application.
- Some promotion and dissemination information is normal information, and some is malicious information.
- Because promotion information spread by users is forwarded by the server of the electronic transaction system, the method in this embodiment runs on that server, and all promotion and dissemination information can be classified automatically by a clustering algorithm such as the meanshift clustering algorithm.
- The method specifically includes Step 101: Generate an order unique identifier and an order content based on the order information.
- Step 102: Convert the order content into an order vector.
- Step 103: Perform a clustering operation according to the order unique identifier and the order vector to obtain the cluster to which the order belongs.
- For example, a meanshift clustering operation may be performed on the orders to obtain the cluster to which each order belongs.
- The meanshift clustering operation is a nonparametric, multimodal segmentation method. Its basic computational module follows a traditional pattern recognition procedure, achieving segmentation by analyzing the feature space (of an image, in its original use) and clustering within it. It finds the density modes of unknown categories by directly estimating the local maxima of the probability density function over the feature space, determines the location of each mode, and then assigns each point to the cluster associated with that mode.
- In practice, another suitable clustering algorithm can be selected as needed.
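The mode-seeking behaviour described above can be illustrated with a minimal sketch. It is not the implementation used in the application: the flat (uniform) kernel, the `bandwidth` value, the rule for merging nearby modes, and the sample points are all assumptions chosen for illustration.

```python
import math

def mean_shift(points, bandwidth, max_iter=100, tol=1e-6):
    """Flat-kernel mean shift; returns (labels, centres)."""
    modes = []
    for start in points:
        current = tuple(start)
        for _ in range(max_iter):
            # Points inside the window centred on the current estimate.
            window = [p for p in points if math.dist(current, p) <= bandwidth]
            if not window:
                break
            # Shift the estimate to the mean of the window (towards higher density).
            shifted = tuple(sum(axis) / len(window) for axis in zip(*window))
            moved = math.dist(current, shifted)
            current = shifted
            if moved < tol:
                break
        modes.append(current)

    # Merge modes that lie within one bandwidth; the cluster count is not preset.
    centres, labels = [], []
    for mode in modes:
        for index, centre in enumerate(centres):
            if math.dist(mode, centre) <= bandwidth:
                labels.append(index)
                break
        else:
            centres.append(mode)
            labels.append(len(centres) - 1)
    return labels, centres


# Hypothetical 2-D order vectors forming two dense groups.
points = [(0.0, 0.0), (0.1, 0.0), (0.0, 0.1), (5.0, 5.0), (5.1, 5.0)]
labels, centres = mean_shift(points, bandwidth=1.0)
```

Because a new centre is created whenever a mode does not fall near an existing one, the number of clusters emerges from the data rather than being fixed in advance.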
- The method further includes: mapping the order content in a cluster to a feature string; when the mapping string of the order content of an order to be analyzed is the same as the feature string, the order to be analyzed is assigned directly to that cluster.
- Before generating the order unique identifier and the order content according to the order information, the method further includes: importing the order into hdfs (a distributed file system).
- The subsequent steps of the present application can then be performed in hdfs, which improves processing efficiency.
- Performing the other steps in the distributed file system is optional; they can also be performed in other distributed systems, which likewise improves efficiency, and this is not described further here.
- Generating the order unique identifier and the order content according to the order information further comprises: generating the order unique identifier according to the user ID, the order ID, and the service type in the order information, and generating the order content according to the order title or order note in the order information.
- The user ID, the order ID, and the service type may be combined arbitrarily to generate the order unique identifier, and the contents of other fields (e.g., an external order number) may also be included.
- The order title may include a description of the item and of the order; for example, the order title of a payment service may include a business name, a payee, a message, and so on. The order note may include promotion information from the user who initiated the order, for example a URL address, promotional text, or a promotional image.
- Converting the order content into an order vector comprises converting the order content into a tf-idf (term frequency – inverse document frequency) vector.
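As a rough illustration of the tf-idf conversion, the sketch below builds one vector per order content. The whitespace tokenisation, the unsmoothed formula tf × log(N / df), and the sample texts are assumptions; the application does not specify them, and Chinese text would need a word segmenter instead of `split()`.

```python
import math
from collections import Counter

def tfidf_vectors(documents):
    """Return (vocabulary, vectors); each vector holds one tf-idf weight per term."""
    tokenised = [doc.lower().split() for doc in documents]
    vocabulary = sorted({term for tokens in tokenised for term in tokens})
    n_docs = len(tokenised)
    # Document frequency: the number of documents in which each term appears.
    doc_freq = Counter(term for tokens in tokenised for term in set(tokens))
    vectors = []
    for tokens in tokenised:
        counts = Counter(tokens)
        total = len(tokens) or 1  # avoid division by zero for empty content
        vectors.append([
            (counts[term] / total) * math.log(n_docs / doc_freq[term])
            for term in vocabulary
        ])
    return vocabulary, vectors


# Hypothetical order notes.
vocabulary, vectors = tfidf_vectors([
    "pay rent march",
    "pay lunch",
    "pay anniversary prize",
])
```

A term that occurs in every order, such as "pay" here, receives weight zero, so the vectors emphasise the terms that distinguish one order from another.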
- Orders in electronic transactions can be classified automatically by a clustering algorithm such as meanshift, and new classes can be obtained from real-time analysis of order content. When an order to be analyzed is processed, it can be transcoded and the transcoded order compared with the feature string of the order content in a cluster, which further improves the efficiency of order classification. Importing the orders into the hdfs distributed file system exploits the characteristics of a distributed system to improve the efficiency of order classification further.
- FIG. 2 is a schematic structural diagram of an order clustering apparatus in an electronic transaction according to an embodiment of the present application.
- The apparatus may be deployed in a server of an electronic transaction system, or a separate high-performance server may be provided for it.
- The apparatus of the present embodiment can be implemented by a dedicated CPU, a single-chip microcomputer, or an FPGA, or by a general-purpose processor executing the control logic of each functional module.
- The functional modules of the apparatus perform a clustering operation such as meanshift, which classifies orders effectively and can create new clusters according to the order content, so the developer does not need to preset the number of clusters. This makes anti-malicious-information processing based on order clustering more autonomous, improves classification accuracy, reduces the degree of human participation, and improves work efficiency.
- the apparatus of this embodiment includes a generating unit 201 for generating an order unique identifier and an order content based on the order information.
- the vector conversion unit 202 is configured to convert the order content into an order vector.
- the clustering operation unit 203 is configured to perform a clustering operation according to the order unique identifier and the order vector to obtain a result of the cluster to which the order belongs.
- A conversion unit 204 is further connected between the generating unit and the clustering operation unit and is configured to map the order content in a cluster into a feature string; when the mapping string of the order content of an order to be analyzed is the same as the feature string, the order to be analyzed is assigned directly to the cluster.
- an import unit 205 is further included for importing an order into hdfs (distributed file system).
- The generating unit is further configured to generate the order unique identifier according to a user ID, an order ID, and a service type in the order information, and to generate the order content according to an order title or an order remark in the order information.
- the vector conversion unit is further configured to convert the order content into a tf-idf vector.
- Orders in electronic transactions can be classified automatically by a clustering algorithm such as meanshift, and new classes can be obtained from real-time analysis of order content. When an order to be analyzed is processed, it can be transcoded and the transcoded order compared with the feature string of the order content in a cluster, which further improves the efficiency of order classification. Importing the orders into the hdfs distributed file system exploits the characteristics of a distributed system to improve the efficiency of order classification further.
- FIG. 3 is a flowchart of an anti-malicious information method based on order clustering according to an embodiment of the present application.
- The generated clusters are judged to determine which of them contain malicious information (such as garbage orders), and all the orders in such a cluster can then be processed to reduce the risk to electronic transactions.
- The method includes Step 301: Generate an order unique identifier and an order content according to the order information.
- Step 302: Convert the order content into an order vector.
- Step 303: Perform a clustering operation according to the order unique identifier and the order vector to obtain the cluster to which the order belongs.
- Step 304: Analyze the representative order content in each cluster. If the representative order content is malicious information, all the orders in that cluster are processed accordingly.
- To judge the representative order content, a prior-art method may be used, for example determining by keyword whether the order content of an order in the cluster is malicious information, or screening the order content manually.
- The corresponding processing may include recording the user IDs in the cluster, sending the user IDs to an administrator for handling, or issuing a warning message to those user IDs.
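The idea of judging one representative order and then handling the whole cluster can be sketched as follows. The keyword check stands in for whichever prior-art judgment is used; the function names, the data layout, and the sample orders are hypothetical.

```python
def is_malicious(content, keywords):
    """Keyword check on a single order content (a stand-in for any prior-art judge)."""
    text = content.lower()
    return any(keyword in text for keyword in keywords)

def flag_malicious_clusters(clusters, keywords):
    """clusters maps cluster id -> list of (user_id, order_content) pairs.

    Returns cluster id -> sorted user IDs for every cluster whose
    representative order content is judged malicious.
    """
    flagged_users = {}
    for cluster_id, orders in clusters.items():
        if not orders:
            continue
        # Assumption: any member can represent the cluster, since members are similar.
        _, representative = orders[0]
        if is_malicious(representative, keywords):
            flagged_users[cluster_id] = sorted({user_id for user_id, _ in orders})
    return flagged_users


clusters = {
    "c1": [("u1", "Taobao anniversary prize http://phish.example"),
           ("u2", "Taobao anniversary prize http://phish.example/x")],
    "c2": [("u3", "rent for March")],
}
flagged = flag_malicious_clusters(clusters, keywords=["anniversary prize"])
```

Only one content per cluster is inspected, yet every user ID in a flagged cluster is recorded, which is what removes the need to compare and judge each order individually.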
- The method further includes: mapping the order content in a cluster to a feature string; when the mapping string of the order content of an order to be analyzed is the same as the feature string, the order to be analyzed is assigned directly to that cluster.
- When the mapping string of the order to be analyzed is identical to the feature string, the method further comprises: if the cluster has already been marked as a garbage class, directly applying automated malicious-information processing to the order to be analyzed, such as freezing the transaction or blocking the account.
- The representative order content refers to the order content of any one order selected from the cluster.
- An order to be analyzed can be classified directly using the result of the clustering operation, because most of the promotion and dissemination information (orders) initiated by a given user is identical. Once the clusters of promotion information have been obtained by the above algorithm, a feature sample of each cluster (i.e., its most representative promotion information) is converted into an MD5 code. When promotion information to be analyzed is received, it only needs to be converted into an MD5 code as well, and comparing the two codes determines whether it belongs to the cluster. Those skilled in the art will appreciate that other types of transcoding are also feasible under this technical solution, and they are not described here.
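A minimal sketch of the MD5 comparison described above, using Python's standard `hashlib`. The function names, the cluster id, and the sample content are hypothetical, and UTF-8 encoding of the content is an assumption.

```python
import hashlib

def feature_string(order_content):
    """MD5 hex digest of the order content, used as an exact-match fingerprint."""
    return hashlib.md5(order_content.encode("utf-8")).hexdigest()

def build_feature_index(representatives):
    """representatives maps cluster id -> one sample order content from that cluster."""
    return {feature_string(content): cluster_id
            for cluster_id, content in representatives.items()}

def find_cluster(order_content, feature_index):
    """Return the cluster whose feature string equals the order's mapping string, if any."""
    return feature_index.get(feature_string(order_content))


feature_index = build_feature_index(
    {"cluster-7": "Taobao anniversary prize http://phish.example/a"}
)
```

Since a digest matches only byte-identical input, this shortcut catches orders whose content repeats a known sample exactly; any order that differs, even slightly, yields no match and has to go through clustering.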
- Orders in electronic transactions can be classified automatically by a clustering algorithm such as meanshift (in practical applications, another appropriate clustering algorithm can be selected as needed), and new classes can be obtained from real-time analysis of order content. When an order to be analyzed is processed, it can be transcoded and the transcoded order compared with the feature string of the order content in a cluster, which further improves the efficiency of order classification. Importing orders into the hdfs distributed file system exploits the characteristics of a distributed system to improve efficiency further. Because only similar orders are clustered together, analyzing one representative order content in a cluster reveals whether the whole cluster is malicious information, which avoids comparing and judging every order individually.
- FIG. 4 is a schematic structural diagram of an anti-malicious information device based on order clustering according to an embodiment of the present application.
- The device of this embodiment may be deployed in a server of an electronic transaction system, or a separate high-performance server may be provided to implement it.
- The functional units and modules of the apparatus of this embodiment may be implemented by a dedicated CPU, a single-chip microcomputer, or an FPGA, or by a general-purpose processor executing the control logic of each functional module.
- The functional modules of the apparatus perform a clustering operation such as meanshift, which classifies orders effectively and can create new clusters according to the order content, so the developer does not need to preset the number of clusters. This makes anti-malicious-information processing based on order clustering more autonomous, improves classification accuracy, reduces the degree of human participation, and improves work efficiency.
- the embodiment of the figure specifically includes a generating unit 401 for generating an order unique identifier and an order content according to the order information.
- the vector conversion unit 402 is configured to convert the order content into an order vector.
- the clustering operation unit 403 is configured to perform a clustering operation according to the order unique identifier and the order vector to obtain the cluster to which the order belongs.
- the processing unit 404 is configured to analyze the representative order content in all the clusters, and if the representative order content is malicious information, all the orders in the cluster are processed accordingly.
- A conversion unit 405 is further included, connected between the generating unit and the clustering operation unit, and is configured to map the order content in a cluster into a feature string; when the mapping string of the order content of an order to be analyzed is the same as the feature string, the order to be analyzed is assigned directly to the cluster.
- The conversion unit is further configured to: when the cluster has already been marked as a garbage class, submit the order to be analyzed directly to the processing unit for automated malicious-information processing (e.g., freezing the transaction, banning the account, etc.).
- Orders in electronic transactions can be classified automatically by a clustering algorithm such as meanshift (in practical applications, another appropriate clustering algorithm can be selected as needed), and new classes can be obtained from real-time analysis of order content. When an order to be analyzed is processed, it can be transcoded and the transcoded order compared with the feature string of the order content in a cluster, which further improves the efficiency of order classification. Importing orders into the hdfs distributed file system exploits the characteristics of a distributed system to improve efficiency further. Because only similar orders are clustered together, analyzing one representative order content in a cluster reveals whether the whole cluster is malicious information, which avoids comparing and judging every order individually.
- FIG. 5 is a specific flowchart of an anti-malicious information method based on order clustering in an electronic transaction according to an embodiment of the present application.
- In this embodiment, the entire method is performed on a Hadoop distributed file system, which improves overall work efficiency, and the meanshift clustering method is taken as the example for the clustering process.
- The payment service is used as the example.
- An order includes the user ID, the order ID, the service type, and information such as the payee's ID, the date, and the amount. It also includes remark information, in which the URL (Uniform Resource Locator) of a phishing website is embedded, and the malicious user sends this promotion and dissemination information to many users.
- The method includes Step 501: Import the orders acquired from a database into a distributed file system (hdfs).
- Step 502: The user ID, the order ID, and the service type in the order are combined into the unique identifier of the order.
- The combination in this step may consist of splicing the user ID, the order ID, and the service type to form the order unique identifier, or of taking the last two digits of the user ID, the last two digits of the order ID, and the service type to form it.
- The unique identifier may also be generated in other ways that uniquely identify the order, such as using the order ID alone.
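The two combinations mentioned in step 502 can be sketched as below. The `|` separator, the function names, and the sample IDs are assumptions made for illustration; the application does not prescribe them.

```python
def spliced_identifier(user_id, order_id, service_type, sep="|"):
    """Splice the full user ID, order ID, and service type together."""
    return sep.join([user_id, order_id, service_type])

def short_identifier(user_id, order_id, service_type, sep="|"):
    """Use only the last two digits of the user ID and of the order ID."""
    return sep.join([user_id[-2:], order_id[-2:], service_type])


# Hypothetical field values.
full_key = spliced_identifier("2088001", "20161108001", "collect")
short_key = short_identifier("2088001", "20161108001", "collect")
```

The shortened form is more compact, but different orders can then share the same key, so the spliced form (or the order ID alone) is the safer choice wherever the key must map back to exactly one order.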
- Step 503: The remark information in the order is taken as the content of the order.
- The remark information of the order is a part that describes the content of the order; for example, the URL address of a phishing website is included in the remark information.
- The order also includes content such as a consumption record title, which may also be used as the order content. The consumption record title may include a description of the item in the order and, for the collection application, may include information such as the business name, recipient, and amount.
- The unique identifier of each order corresponds to the order content of that order.
- Step 504: Convert the order unique identifier and the order content into the sequencefile format.
- The sequencefile format is a data format used by the hdfs distributed file system mentioned above. After conversion to this format, the processing methods of the hdfs system can be applied, which improves processing efficiency.
- Step 505: Convert the order content into a tf-idf vector.
- The converted tf-idf vector forms a mapping <key, vector> with the order unique identifier, where key is the order unique identifier and vector is the vector formed from the order content, that is, the mathematical expression of the order content.
- Step 506: Perform a meanshift clustering operation on the vectors to obtain the cluster to which each order belongs.
- The order content is compared with the clusters iteratively, and each order is finally assigned either to an existing cluster or to a newly established one. Because each vector is mapped to a key (i.e., the order unique identifier), it can be determined which specific order belongs to which cluster. The iteration and convergence parameters of the meanshift clustering operation can be set by the developer according to the actual situation, and the output is in the sequencefile format.
- Step 507: Convert the clustering result of the orders into a text format.
- Because the clustering result from the preceding step is in the sequencefile format, which is not convenient for subsequent judgment and processing, the clustering result needs to be converted into a text format.
- Step 508: Determine whether the order content in a cluster is malicious information. If it is malicious information, proceed to step 509; otherwise proceed to step 510.
- Whether the order content is malicious information may be judged by a prior-art method, such as keywords or manual review. There may be many orders in a cluster, and only one of them needs to be selected for judgment.
- Because the volume of promotion and dissemination information is large, and the volume of malicious promotion information may be especially large, each cluster contains many orders when the clustering operation ends.
- The clusters containing the most orders are filtered out (for example, the top ten clusters) and provided to monitoring personnel, who can examine the order content in those clusters to determine which clusters are more likely to be malicious information.
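Selecting the clusters with the most orders for review, as described above, can be sketched with a counter. The function name, the shape of `assignments`, and the sample data are assumptions for illustration.

```python
from collections import Counter

def largest_clusters(assignments, top_n=10):
    """assignments maps order unique identifier -> cluster id.

    Returns up to top_n (cluster id, order count) pairs, largest first.
    """
    return Counter(assignments.values()).most_common(top_n)


# Hypothetical clustering result in text form, already parsed into a dict.
assignments = {"o1": "a", "o2": "a", "o3": "b", "o4": "a", "o5": "c", "o6": "b"}
review_queue = largest_clusters(assignments, top_n=2)
```

The default of ten follows the example in the text; in practice the cut-off would depend on how many clusters the monitoring personnel can review.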
- Step 509: The user who initiated the order is marked, and the monitor is prompted.
- Step 510: If it is not malicious information, no processing is performed.
- Orders in the electronic transaction system can be classified by the meanshift clustering operation, and new classes can be generated automatically, which reduces manual participation and allows the classification to adapt in real time to ever-changing malicious information.
- FIG. 6 is a flowchart of a method for quickly classifying an order after clustering processing according to an embodiment of the present application. The method builds on the foregoing embodiment of FIG. 5: once each classification cluster contains a large number of orders, a new order to be analyzed that the system receives is processed as follows.
- The method of FIG. 6 includes step 601: extracting order content from each cluster.
- The extraction in this step is random. Since the order content within each cluster should be similar, whichever order content is extracted can serve as the feature of the cluster.
- Step 602: Perform an MD5 transcoding operation on the extracted order content to obtain a feature string.
- In this step, the feature string of each cluster is obtained.
- The MD5 transcoding operation is only one possible transcoding operation. Other transcoding methods in the prior art may be used, or a transcoding method may be designed by the developer as needed, which is not limited herein.
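Steps 601 and 602 can be illustrated with Python's standard `hashlib` module. This is a sketch only: the dictionary layout of the clusters and the function names are assumptions, and UTF-8 is assumed as the text encoding before hashing.

```python
import hashlib
import random


def md5_string(order_content):
    """Map order content to a 32-character hexadecimal MD5 digest."""
    return hashlib.md5(order_content.encode("utf-8")).hexdigest()


def cluster_feature_strings(clusters, rng=None):
    """clusters: dict mapping cluster id -> list of order contents.

    Step 601: randomly extract one order content per cluster.
    Step 602: transcode it with MD5 to obtain the cluster's feature string.
    """
    rng = rng or random.Random()
    return {
        cluster_id: md5_string(rng.choice(contents))
        for cluster_id, contents in clusters.items()
    }
```

Note that MD5 digests of texts that differ by even one character are unrelated, so a feature string built this way only matches orders whose content is identical to the sampled one.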
- Step 603: Acquire an order to be analyzed.
- The order in this step can be obtained, for example, from step 503 in the embodiment shown in FIG. 5. The embodiment of FIG. 5 is generally used for after-the-fact analysis of orders, whereas the embodiment of FIG. 6 can be applied to new orders on top of the embodiment of FIG. 5, which gives better real-time performance and omits the relatively complicated clustering operation.
- This step can also directly obtain the promotion and dissemination information (i.e., the order) to be forwarded and perform real-time comparison analysis on the order to determine the classification cluster to which it belongs.
- Step 604: Perform the MD5 transcoding operation on the order content of the order to be analyzed to obtain a mapping string.
- Step 605: Compare the feature string with the mapping string. If they are the same, go to step 606; otherwise, go to step 607.
- Step 606: The order to be analyzed is directly included in the classification cluster of that feature string.
- The classification cluster can then be processed by returning to step 508. If it is malicious information, forwarding of the promotion information can be stopped.
- Alternatively, if the mapping string of the order to be analyzed is the same as the feature string of a classification cluster already marked as the garbage class, the process proceeds directly to step 509 without step 606, and penalty processing, such as freezing transactions or blocking accounts, is applied to the order to be analyzed automatically.
- Step 607: Return to step 504 to continue the clustering process.
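The branching in steps 604 to 607 can be sketched as a single routing function. The action names (`"penalize"`, `"assign"`, `"recluster"`), the dictionary of feature strings, and the set of garbage-class cluster ids are hypothetical conveniences for illustration; the description does not prescribe these data structures.

```python
import hashlib


def md5_string(order_content):
    """Map order content to a hexadecimal MD5 digest (UTF-8 assumed)."""
    return hashlib.md5(order_content.encode("utf-8")).hexdigest()


def classify_order(order_content, feature_strings, garbage_clusters):
    """Route one order to be analyzed through steps 604 to 607.

    feature_strings: dict mapping cluster id -> feature string (step 602).
    garbage_clusters: set of cluster ids already marked as the garbage class.
    Returns an (action, cluster_id) pair; the action names are hypothetical.
    """
    mapping_string = md5_string(order_content)  # step 604
    for cluster_id, feature in feature_strings.items():  # step 605
        if feature == mapping_string:
            if cluster_id in garbage_clusters:
                # Known garbage class: skip step 606 and go to step 509.
                return ("penalize", cluster_id)
            # Step 606: include the order in the cluster, then step 508.
            return ("assign", cluster_id)
    # Step 607: no match, so fall back to clustering at step 504.
    return ("recluster", None)
```

For example, with `features = {"spam": md5_string("cheap pills")}` and `garbage_clusters = {"spam"}`, an incoming order whose content is exactly `"cheap pills"` is routed to penalty processing without any clustering work.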
- With the method and apparatus of the embodiments of the present application, orders in an electronic transaction can be automatically classified by a clustering algorithm such as meanshift, and new classifications can be obtained from real-time analysis of order content. The order to be analyzed is transcoded, and the transcoded result is compared with the feature string of the order content in a cluster, which further improves the efficiency of order classification. By importing the orders into the HDFS distributed file system, the characteristics of a distributed system can be used to improve the efficiency of order classification further.
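To make the meanshift idea concrete, the following is a small pure-Python sketch of mean shift with a flat kernel. It is illustrative only: the kernel choice, the `bandwidth` parameter, the merge radius of half the bandwidth, and the use of plain numeric tuples as order vectors are assumptions, not details taken from this application. A production system would more likely use a library or distributed implementation.

```python
import math


def mean_shift(points, bandwidth, max_iter=100, tol=1e-9):
    """Flat-kernel mean shift over a list of equal-length numeric tuples.

    Returns (labels, centers): labels[i] is the index in centers of the
    cluster that points[i] converged to.
    """
    modes = []
    for start in points:
        current = tuple(start)
        for _ in range(max_iter):
            # All points inside the window centered on the current estimate.
            window = [p for p in points if math.dist(current, p) <= bandwidth]
            if not window:  # guard against floating-point edge cases
                break
            shifted = tuple(sum(axis) / len(window) for axis in zip(*window))
            moved = math.dist(current, shifted)
            current = shifted
            if moved < tol:
                break
        modes.append(current)

    # Merge modes that ended up close together into one cluster center.
    centers, labels = [], []
    for mode in modes:
        for index, center in enumerate(centers):
            if math.dist(mode, center) <= bandwidth / 2:
                labels.append(index)
                break
        else:
            centers.append(mode)
            labels.append(len(centers) - 1)
    return labels, centers
```

Because the number of clusters is not fixed in advance, a vector that lies far from every existing group simply becomes a new center, which matches the description's point that new classifications can appear without manual definition.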
- Improvements to a technology can be clearly distinguished as hardware improvements (e.g., improvements to circuit structures such as diodes, transistors, and switches) or software improvements (improvements to a process flow).
- PLD Programmable Logic Device
- FPGA Field Programmable Gate Array
- HDL Hardware Description Language
- The controller can be implemented in any suitable manner. For example, the controller can take the form of a microprocessor or processor and a computer-readable medium storing computer-readable program code (e.g., software or firmware) executable by the (micro)processor.
- Examples of controllers include, but are not limited to, the following microcontrollers: ARC 625D, Atmel AT91SAM, Microchip PIC18F26K20, and Silicon Labs C8051F320. A memory controller can also be implemented as part of the memory's control logic.
- The controller can also be logically programmed to perform the same functions in the form of logic gates, switches, ASICs, programmable logic controllers, and embedded microcontrollers.
- Such a controller can therefore be considered a hardware component, and the means for implementing various functions included therein can also be considered as a structure within the hardware component.
- Alternatively, the means for implementing various functions can be considered both software modules that implement a method and structures within a hardware component.
- The system, device, module, or unit illustrated in the above embodiments may be implemented by a computer chip or an entity, or by a product having a certain function.
- The present application can be implemented by means of software plus a necessary general hardware platform. Based on this understanding, the technical solution of the present application may in essence be embodied in the form of a software product, which may be stored in a storage medium such as a ROM/RAM, a magnetic disk, or an optical disk, and which includes instructions for causing a computer device (which may be a personal computer, a server, a network device, etc.) to perform the methods described in the various embodiments of the present application or in portions of the embodiments.
- This application can be used in a variety of general purpose or special purpose computer system environments or configurations.
- The application can be described in the general context of computer-executable instructions executed by a computer, such as program modules.
- Program modules include routines, programs, objects, components, data structures, and the like that perform particular tasks or implement particular abstract data types.
- The present application can also be practiced in distributed computing environments where tasks are performed by remote processing devices that are connected through a communication network.
- In such environments, program modules can be located in both local and remote computer storage media, including storage devices.
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Physics & Mathematics (AREA)
- Business, Economics & Management (AREA)
- Databases & Information Systems (AREA)
- Data Mining & Analysis (AREA)
- General Engineering & Computer Science (AREA)
- Accounting & Taxation (AREA)
- Finance (AREA)
- General Business, Economics & Management (AREA)
- Development Economics (AREA)
- Economics (AREA)
- Marketing (AREA)
- Strategic Management (AREA)
- Computer Security & Cryptography (AREA)
- Entrepreneurship & Innovation (AREA)
- Library & Information Science (AREA)
- Computer Hardware Design (AREA)
- Signal Processing (AREA)
- Computer Networks & Wireless Communication (AREA)
- Computing Systems (AREA)
- Computational Linguistics (AREA)
- Evolutionary Biology (AREA)
- Artificial Intelligence (AREA)
- Life Sciences & Earth Sciences (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Evolutionary Computation (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Claims (12)
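- A method for clustering orders in electronic transactions, characterized by comprising: generating an order unique identifier and order content according to order information; converting the order content into an order vector; and performing a clustering operation according to the order unique identifier and the order vector to obtain the classification cluster to which the order belongs.
- The method according to claim 1, characterized in that the order content in the classification cluster is mapped to a feature string, and when the mapping string of the order content of an order to be analyzed is the same as the feature string, the order to be analyzed is directly included in the classification cluster.
- The method according to claim 1, characterized in that generating the order unique identifier and the order content according to the order information further comprises: generating the order unique identifier according to the user ID, the order ID, and the business type in the order information, and generating the order content according to the order title or the order remark in the order information.
- An anti-malicious-information method based on order clustering, characterized by comprising: generating an order unique identifier and order content according to order information; converting the order content into an order vector; performing a clustering operation according to the order unique identifier and the order vector to obtain the classification cluster to which the order belongs; and analyzing the representative order content in all classification clusters, and if the representative order content is malicious information, performing corresponding processing on all orders in that classification cluster.
- The method according to claim 4, characterized in that the order content in the classification cluster is mapped to a feature string, and when the mapping string of the order content of an order to be analyzed is the same as the feature string, the order to be analyzed is directly included in the classification cluster.
- The method according to claim 5, characterized in that, after the mapping string of the order content of the order to be analyzed is found to be the same as the feature string, the method further comprises: when the classification cluster has already been marked as the garbage class, directly handling the order to be analyzed as malicious information.
- An apparatus for clustering orders in electronic transactions, characterized by comprising: a generating unit, configured to generate an order unique identifier and order content according to order information; a vector conversion unit, configured to convert the order content into an order vector; and a clustering operation unit, configured to perform a clustering operation according to the order unique identifier and the order vector to obtain the classification cluster to which the order belongs.
- The apparatus according to claim 7, characterized by further comprising a conversion unit, connected between the generating unit and the clustering operation unit, configured to map the order content in the classification cluster to a feature string, and, when the mapping string of the order content of an order to be analyzed is the same as the feature string, to directly include the order to be analyzed in the classification cluster.
- The apparatus according to claim 7, characterized in that the generating unit is further configured to generate the order unique identifier according to the user ID, the order ID, and the business type in the order information, and to generate the order content according to the order title or the order remark in the order information.
- An anti-malicious-information apparatus based on order clustering, characterized by comprising: a generating unit, configured to generate an order unique identifier and order content according to order information; a vector conversion unit, configured to convert the order content into an order vector; a clustering operation unit, configured to perform a clustering operation according to the order unique identifier and the order vector to obtain the classification cluster to which the order belongs; and a processing unit, configured to analyze the representative order content in all classification clusters and, if the representative order content is malicious information, to perform corresponding processing on all orders in that classification cluster.
- The apparatus according to claim 10, characterized by further comprising a conversion unit, connected between the generating unit and the clustering operation unit, configured to map the order content in the classification cluster to a feature string, and, when the mapping string of the order content of an order to be analyzed is the same as the feature string, to directly include the order to be analyzed in the classification cluster.
- The apparatus according to claim 11, characterized in that the conversion unit is further configured to, when the classification cluster has already been marked as the garbage class, directly send the order to be analyzed to the processing unit for malicious information processing.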
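The generating and vector conversion steps recited in the claims can be sketched as below. The claims name the inputs (user ID, order ID, business type, order title or remark) but not the concrete encoding, so everything specific here is an assumption made for illustration: the dictionary field names, the colon-joined identifier, the whitespace tokenization, and the hashed bag-of-words vector of fixed size.

```python
import hashlib


def make_order_key(order):
    """Build an order unique identifier from user ID, order ID, and business type.

    The field names and the ':' separator are hypothetical.
    """
    return "{}:{}:{}".format(order["user_id"], order["order_id"], order["biz_type"])


def make_order_content(order):
    """Use the order title as the order content, falling back to the remark."""
    return order.get("title") or order.get("remark") or ""


def order_vector(content, dims=8):
    """Convert order content to a fixed-length count vector.

    Assumption: a hashed bag-of-words over whitespace-separated tokens; the
    claims do not specify how the order vector is produced.
    """
    vector = [0] * dims
    for token in content.lower().split():
        digest = hashlib.md5(token.encode("utf-8")).hexdigest()
        vector[int(digest, 16) % dims] += 1
    return vector
```

Text without whitespace between words, such as Chinese order titles, would need a word segmenter in place of `split()` before a vector like this becomes meaningful.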
Priority Applications (9)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
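| KR1020187017252A KR102151328B1 (ko) | 2015-11-18 | 2016-11-09 | Order clustering and malicious information combating method and apparatus |
| JP2018526215A JP6594543B2 (ja) | 2015-11-18 | 2016-11-09 | Order clustering method and apparatus, and method and apparatus for countering malicious information |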
| AU2016355767A AU2016355767A1 (en) | 2015-11-18 | 2016-11-09 | Order clustering method and device, and malicious information rejecting method and device |
| SG11201804174TA SG11201804174TA (en) | 2015-11-18 | 2016-11-09 | Order clustering and malicious information combating method and apparatus |
| PH1/2018/501058A PH12018501058B1 (en) | 2015-11-18 | 2016-11-09 | Order clustering and malicious information combating method and apparatus |
| EP16865698.1A EP3379427A4 (en) | 2015-11-18 | 2016-11-09 | Order clustering method and device, and malicious information rejecting method and device |
| MYPI2018701895A MY186818A (en) | 2015-11-18 | 2016-11-09 | Order clustering and malicious information combating method and apparatus |
| US15/980,171 US11200615B2 (en) | 2015-11-18 | 2018-05-15 | Order clustering and malicious information combating method and apparatus |
| US16/721,824 US11100567B2 (en) | 2015-11-18 | 2019-12-19 | Order clustering and malicious information combating method and apparatus |
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201510795161.9A CN106709777A (zh) | 2015-11-18 | 2015-11-18 | Order clustering method and apparatus, and anti-malicious-information method and apparatus |
| CN201510795161.9 | 2015-11-18 |
Related Child Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US15/980,171 Continuation US11200615B2 (en) | 2015-11-18 | 2018-05-15 | Order clustering and malicious information combating method and apparatus |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2017084521A1 (zh) | 2017-05-26 |
Family
ID=58717371
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/CN2016/105137 Ceased WO2017084521A1 (zh) | 2015-11-18 | 2016-11-09 | Order clustering method and apparatus, and anti-malicious-information method and apparatus |
Country Status (10)
| Country | Link |
|---|---|
| US (2) | US11200615B2 (zh) |
| EP (1) | EP3379427A4 (zh) |
| JP (1) | JP6594543B2 (zh) |
| KR (1) | KR102151328B1 (zh) |
| CN (1) | CN106709777A (zh) |
| AU (1) | AU2016355767A1 (zh) |
| MY (1) | MY186818A (zh) |
| PH (1) | PH12018501058B1 (zh) |
| SG (1) | SG11201804174TA (zh) |
| WO (1) | WO2017084521A1 (zh) |
Cited By (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN107562853A (zh) * | 2017-08-28 | 2018-01-09 | 武汉烽火普天信息技术有限公司 | 一种面向海量互联网文本数据的流式聚类及展现的方法 |
| CN111126629A (zh) * | 2019-12-25 | 2020-05-08 | 上海携程国际旅行社有限公司 | 模型的生成方法、刷单行为识别方法、系统、设备和介质 |
| CN115641177A (zh) * | 2022-10-20 | 2023-01-24 | 北京力尊信通科技股份有限公司 | 一种基于机器学习的防秒杀预判系统 |
Families Citing this family (41)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN106709777A (zh) | 2015-11-18 | 2017-05-24 | 阿里巴巴集团控股有限公司 | 一种订单聚类方法及装置,以及反恶意信息的方法及装置 |
| RU2649793C2 (ru) | 2016-08-03 | 2018-04-04 | ООО "Группа АйБи" | Способ и система выявления удаленного подключения при работе на страницах веб-ресурса |
| RU2671991C2 (ru) | 2016-12-29 | 2018-11-08 | Общество с ограниченной ответственностью "Траст" | Система и способ сбора информации для обнаружения фишинга |
| RU2637477C1 (ru) * | 2016-12-29 | 2017-12-04 | Общество с ограниченной ответственностью "Траст" | Система и способ обнаружения фишинговых веб-страниц |
| RU2689816C2 (ru) | 2017-11-21 | 2019-05-29 | ООО "Группа АйБи" | Способ для классифицирования последовательности действий пользователя (варианты) |
| RU2676247C1 (ru) | 2018-01-17 | 2018-12-26 | Общество С Ограниченной Ответственностью "Группа Айби" | Способ и компьютерное устройство для кластеризации веб-ресурсов |
| RU2677361C1 (ru) | 2018-01-17 | 2019-01-16 | Общество с ограниченной ответственностью "Траст" | Способ и система децентрализованной идентификации вредоносных программ |
| RU2677368C1 (ru) | 2018-01-17 | 2019-01-16 | Общество С Ограниченной Ответственностью "Группа Айби" | Способ и система для автоматического определения нечетких дубликатов видеоконтента |
| RU2668710C1 (ru) | 2018-01-17 | 2018-10-02 | Общество с ограниченной ответственностью "Группа АйБи ТДС" | Вычислительное устройство и способ для обнаружения вредоносных доменных имен в сетевом трафике |
| RU2680736C1 (ru) | 2018-01-17 | 2019-02-26 | Общество с ограниченной ответственностью "Группа АйБи ТДС" | Сервер и способ для определения вредоносных файлов в сетевом трафике |
| RU2681699C1 (ru) | 2018-02-13 | 2019-03-12 | Общество с ограниченной ответственностью "Траст" | Способ и сервер для поиска связанных сетевых ресурсов |
| CN109189928B (zh) * | 2018-08-30 | 2022-05-17 | 天津做票君机器人科技有限公司 | 一种汇票交易机器人的授信信息识别方法 |
| CN109271483B (zh) * | 2018-09-06 | 2022-03-15 | 中山大学 | 基于递进式多判别器的问题生成方法 |
| CN110955890B (zh) * | 2018-09-26 | 2021-08-17 | 瑞数信息技术(上海)有限公司 | 恶意批量访问行为的检测方法、装置和计算机存储介质 |
| RU2708508C1 (ru) | 2018-12-17 | 2019-12-09 | Общество с ограниченной ответственностью "Траст" | Способ и вычислительное устройство для выявления подозрительных пользователей в системах обмена сообщениями |
| RU2701040C1 (ru) | 2018-12-28 | 2019-09-24 | Общество с ограниченной ответственностью "Траст" | Способ и вычислительное устройство для информирования о вредоносных веб-ресурсах |
| EP3842968B1 (en) | 2019-02-27 | 2024-04-24 | "Group IB" Ltd. | Method and system for identifying a user according to keystroke dynamics |
| EP3842966B8 (en) | 2019-04-10 | 2026-03-04 | Group-Ib Global Private Limited | Identifying a user from the sequence of windows opened on a user interface |
| CN111768258B (zh) * | 2019-06-05 | 2024-09-24 | 北京京东尚科信息技术有限公司 | 识别异常订单的方法、装置、电子设备和介质 |
| CN110347888B (zh) * | 2019-07-16 | 2022-02-15 | 中国联合网络通信集团有限公司 | 订单数据的处理方法、装置及存储介质 |
| CN110532394B (zh) * | 2019-09-11 | 2023-04-07 | 携程计算机技术(上海)有限公司 | 订单备注文本的处理方法及系统 |
| CN111147441A (zh) * | 2019-11-12 | 2020-05-12 | 恒大智慧科技有限公司 | 网络购票的舞弊行为自动检测方法、设备及可读存储介质 |
| RU2728498C1 (ru) | 2019-12-05 | 2020-07-29 | Общество с ограниченной ответственностью "Группа АйБи ТДС" | Способ и система определения принадлежности программного обеспечения по его исходному коду |
| RU2728497C1 (ru) | 2019-12-05 | 2020-07-29 | Общество с ограниченной ответственностью "Группа АйБи ТДС" | Способ и система определения принадлежности программного обеспечения по его машинному коду |
| RU2743974C1 (ru) | 2019-12-19 | 2021-03-01 | Общество с ограниченной ответственностью "Группа АйБи ТДС" | Система и способ сканирования защищенности элементов сетевой архитектуры |
| RU2722693C1 (ru) | 2020-01-27 | 2020-06-03 | Общество с ограниченной ответственностью «Группа АйБи ТДС» | Способ и система выявления инфраструктуры вредоносной программы или киберзлоумышленника |
| RU2722692C1 (ru) | 2020-02-21 | 2020-06-03 | Общество с ограниченной ответственностью «Группа АйБи ТДС» | Способ и система выявления вредоносных файлов в неизолированной среде |
| SG10202001963TA (en) | 2020-03-04 | 2021-10-28 | Group Ib Global Private Ltd | System and method for brand protection based on the search results |
| RU2738344C1 (ru) | 2020-03-10 | 2020-12-11 | Общество с ограниченной ответственностью «Группа АйБи ТДС» | Способ и система поиска схожих вредоносных программ по результатам их динамического анализа |
| US11475090B2 (en) | 2020-07-15 | 2022-10-18 | Group-Ib Global Private Limited | Method and system for identifying clusters of affiliated web resources |
| CN113763077B (zh) * | 2020-07-24 | 2024-07-19 | 北京沃东天骏信息技术有限公司 | 用于检测虚假交易订单的方法和装置 |
| RU2743619C1 (ru) | 2020-08-06 | 2021-02-20 | Общество с ограниченной ответственностью "Группа АйБи ТДС" | Способ и система генерации списка индикаторов компрометации |
| CN113240480A (zh) * | 2021-01-25 | 2021-08-10 | 天津五八到家货运服务有限公司 | 订单处理方法、装置、电子终端及存储介质 |
| US11947572B2 (en) | 2021-03-29 | 2024-04-02 | Group IB TDS, Ltd | Method and system for clustering executable files |
| CN113781156B (zh) * | 2021-05-13 | 2024-08-20 | 北京沃东天骏信息技术有限公司 | 恶意订单的识别方法、模型的训练方法、设备及存储介质 |
| NL2030861B1 (en) | 2021-06-01 | 2023-03-14 | Trust Ltd | System and method for external monitoring a cyberattack surface |
| RU2769075C1 (ru) | 2021-06-10 | 2022-03-28 | Общество с ограниченной ответственностью "Группа АйБи ТДС" | Система и способ активного обнаружения вредоносных сетевых ресурсов |
| CN113724037A (zh) * | 2021-08-02 | 2021-11-30 | 深圳依时货拉拉科技有限公司 | 非正常订单处理方法、装置、存储介质和计算机设备 |
| NL2031253B1 (en) * | 2021-08-19 | 2023-03-24 | Group Ib Tds Ltd | Computing device and method of detecting compromised network devices based on dns tunneling detection |
| CN114220111B (zh) * | 2021-12-22 | 2022-09-16 | 深圳市伊登软件有限公司 | 基于云平台的图文批量识别方法及系统 |
| CN116955613B (zh) * | 2023-06-12 | 2024-02-27 | 广州数说故事信息科技有限公司 | 一种基于研报数据和大语言模型生成产品概念的方法 |
Citations (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN101860822A (zh) * | 2010-06-11 | 2010-10-13 | 中兴通讯股份有限公司 | 垃圾短信监控方法和系统 |
| US8291024B1 (en) * | 2008-07-31 | 2012-10-16 | Trend Micro Incorporated | Statistical spamming behavior analysis on mail clusters |
| CN103279868A (zh) * | 2013-05-22 | 2013-09-04 | 兰亭集势有限公司 | 一种自动识别欺诈订单的方法和装置 |
| CN104182539A (zh) * | 2014-09-02 | 2014-12-03 | 五八同城信息技术有限公司 | 异常信息批量处理的方法及系统 |
Family Cites Families (26)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US6094643A (en) * | 1996-06-14 | 2000-07-25 | Card Alert Services, Inc. | System for detecting counterfeit financial card fraud |
| US7376618B1 (en) * | 2000-06-30 | 2008-05-20 | Fair Isaac Corporation | Detecting and measuring risk with predictive models using content mining |
| US7242999B2 (en) * | 2001-05-11 | 2007-07-10 | Kenneth Kuk-Kei Wang | Method and apparatus for identifying virtual body profiles |
| JP4744737B2 (ja) * | 2001-07-23 | 2011-08-10 | 株式会社ブロードリーフ | 電子メールによる発注方法 |
| US7725544B2 (en) * | 2003-01-24 | 2010-05-25 | Aol Inc. | Group based spam classification |
| CN1798204A (zh) * | 2004-12-24 | 2006-07-05 | 华为技术有限公司 | 一种支付系统及其实现方法 |
| US20070027932A1 (en) * | 2005-07-29 | 2007-02-01 | Q2 Labs, Llc | System and method of creating a single source rss document from multiple content sources |
| JP4769983B2 (ja) * | 2007-05-17 | 2011-09-07 | 独立行政法人産業技術総合研究所 | 異常検出装置および異常検出方法 |
| JP5040718B2 (ja) * | 2008-02-21 | 2012-10-03 | 日本電気株式会社 | スパム・イベント検出装置及び方法並びにプログラム |
| US8170966B1 (en) * | 2008-11-04 | 2012-05-01 | Bitdefender IPR Management Ltd. | Dynamic streaming message clustering for rapid spam-wave detection |
| KR101048159B1 (ko) * | 2009-02-27 | 2011-07-08 | (주)다우기술 | 봇넷 탐지 및 차단 시스템 및 방법 |
| US10394754B2 (en) * | 2010-03-08 | 2019-08-27 | International Business Machines Corporation | Indexing multiple types of data to facilitate rapid re-indexing of one or more types of data |
| US9116995B2 (en) * | 2011-03-30 | 2015-08-25 | Vcvc Iii Llc | Cluster-based identification of news stories |
| US8955133B2 (en) * | 2011-06-09 | 2015-02-10 | Microsoft Corporation | Applying antimalware logic without revealing the antimalware logic to adversaries |
| JP5740228B2 (ja) * | 2011-07-01 | 2015-06-24 | Kddi株式会社 | 代表的なコメント抽出方法およびプログラム |
| US20130232045A1 (en) * | 2012-03-04 | 2013-09-05 | Oracle International Corporation | Automatic Detection Of Fraud And Error Using A Vector-Cluster Model |
| US8935783B2 (en) * | 2013-03-08 | 2015-01-13 | Bitdefender IPR Management Ltd. | Document classification using multiscale text fingerprints |
| US8788405B1 (en) * | 2013-03-15 | 2014-07-22 | Palantir Technologies, Inc. | Generating data clusters with customizable analysis strategies |
| US20150095247A1 (en) * | 2013-10-02 | 2015-04-02 | Eventbrite, Inc. | Classifying Fraud on Event Management Systems |
| US10078861B1 (en) * | 2013-10-15 | 2018-09-18 | Dd Ip Holder Llc | Methods and apparatus for a centralized customer order processing system with automatic detection of customer arrival |
| CN103632250A (zh) * | 2013-12-06 | 2014-03-12 | 湖南御家汇化妆品有限公司 | 一种销售订单快速排序分组筛选方法 |
| US10038703B2 (en) * | 2014-07-18 | 2018-07-31 | The Regents Of The University Of Michigan | Rating network security posture and comparing network maliciousness |
| US20160188733A1 (en) * | 2014-12-29 | 2016-06-30 | Quixey, Inc. | Generating Recommendations Based on Clustered Application States |
| US9979748B2 (en) * | 2015-05-27 | 2018-05-22 | Cisco Technology, Inc. | Domain classification and routing using lexical and semantic processing |
| US20180268015A1 (en) * | 2015-09-02 | 2018-09-20 | Sasha Sugaberry | Method and apparatus for locating errors in documents via database queries, similarity-based information retrieval and modeling the errors for error resolution |
| CN106709777A (zh) | 2015-11-18 | 2017-05-24 | 阿里巴巴集团控股有限公司 | 一种订单聚类方法及装置,以及反恶意信息的方法及装置 |
2015
- 2015-11-18 CN CN201510795161.9A patent/CN106709777A/zh active Pending
2016
- 2016-11-09 MY MYPI2018701895A patent/MY186818A/en unknown
- 2016-11-09 PH PH1/2018/501058A patent/PH12018501058B1/en unknown
- 2016-11-09 AU AU2016355767A patent/AU2016355767A1/en not_active Abandoned
- 2016-11-09 SG SG11201804174TA patent/SG11201804174TA/en unknown
- 2016-11-09 WO PCT/CN2016/105137 patent/WO2017084521A1/zh not_active Ceased
- 2016-11-09 JP JP2018526215A patent/JP6594543B2/ja active Active
- 2016-11-09 EP EP16865698.1A patent/EP3379427A4/en not_active Withdrawn
- 2016-11-09 KR KR1020187017252A patent/KR102151328B1/ko active Active
2018
- 2018-05-15 US US15/980,171 patent/US11200615B2/en active Active
2019
- 2019-12-19 US US16/721,824 patent/US11100567B2/en active Active
Patent Citations (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US8291024B1 (en) * | 2008-07-31 | 2012-10-16 | Trend Micro Incorporated | Statistical spamming behavior analysis on mail clusters |
| CN101860822A (zh) * | 2010-06-11 | 2010-10-13 | 中兴通讯股份有限公司 | 垃圾短信监控方法和系统 |
| CN103279868A (zh) * | 2013-05-22 | 2013-09-04 | 兰亭集势有限公司 | 一种自动识别欺诈订单的方法和装置 |
| CN104182539A (zh) * | 2014-09-02 | 2014-12-03 | 五八同城信息技术有限公司 | 异常信息批量处理的方法及系统 |
Non-Patent Citations (1)
| Title |
|---|
| See also references of EP3379427A4 * |
Cited By (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN107562853A (zh) * | 2017-08-28 | 2018-01-09 | 武汉烽火普天信息技术有限公司 | 一种面向海量互联网文本数据的流式聚类及展现的方法 |
| CN107562853B (zh) * | 2017-08-28 | 2021-02-23 | 武汉烽火普天信息技术有限公司 | 一种面向海量互联网文本数据的流式聚类及展现的方法 |
| CN111126629A (zh) * | 2019-12-25 | 2020-05-08 | 上海携程国际旅行社有限公司 | 模型的生成方法、刷单行为识别方法、系统、设备和介质 |
| CN111126629B (zh) * | 2019-12-25 | 2023-09-19 | 上海携程国际旅行社有限公司 | 模型的生成方法、刷单行为识别方法、系统、设备和介质 |
| CN115641177A (zh) * | 2022-10-20 | 2023-01-24 | 北京力尊信通科技股份有限公司 | 一种基于机器学习的防秒杀预判系统 |
Also Published As
| Publication number | Publication date |
|---|---|
| EP3379427A4 (en) | 2018-11-07 |
| US20180268464A1 (en) | 2018-09-20 |
| KR20180085756A (ko) | 2018-07-27 |
| JP6594543B2 (ja) | 2019-10-23 |
| CN106709777A (zh) | 2017-05-24 |
| JP2019504380A (ja) | 2019-02-14 |
| SG11201804174TA (en) | 2018-06-28 |
| PH12018501058B1 (en) | 2023-03-10 |
| KR102151328B1 (ko) | 2020-09-03 |
| EP3379427A1 (en) | 2018-09-26 |
| MY186818A (en) | 2021-08-23 |
| AU2016355767A1 (en) | 2018-07-05 |
| US11200615B2 (en) | 2021-12-14 |
| PH12018501058A1 (en) | 2019-01-28 |
| US11100567B2 (en) | 2021-08-24 |
| US20200134702A1 (en) | 2020-04-30 |
Similar Documents
| Publication | Title | Publication Date |
|---|---|---|
| WO2017084521A1 (zh) | Order clustering method and apparatus, and anti-malicious-information method and apparatus | |
| CN107436875B (zh) | 文本分类方法及装置 | |
| CN104598631B (zh) | 分布式数据处理平台 | |
| WO2020037917A1 (zh) | 一种用户行为数据推荐方法、服务器及计算机可读介质 | |
| CN107704601A (zh) | 大数据检索方法与系统、计算机存储介质及电子设备 | |
| WO2019001429A1 (zh) | 一种多源数据融合方法和装置 | |
| WO2016069065A1 (en) | Similarity search and malware prioritization | |
| CN104615715A (zh) | 基于地理位置的社交网络事件分析方法及系统 | |
| CN108334513A (zh) | 一种识别相似文本的处理方法、装置及系统 | |
| CN110175281A (zh) | 一种用户数据处理、交互方法、装置及系统 | |
| CN107368500B (zh) | 数据抽取方法及系统 | |
| CN115033688A (zh) | 识别告警事件类型的方法、装置、设备及存储介质 | |
| CN115169468A (zh) | 数据处理方法、装置、电子设备及计算机可读存储介质 | |
| CN120296754A (zh) | 使用人工智能的多模态数据丢失保护 | |
| CN107844867A (zh) | 数据处理方法、装置及设备 | |
| US10262061B2 (en) | Hierarchical data classification using frequency analysis | |
| CN111061763A (zh) | 用于生成规则引擎的规则执行计划的方法及装置 | |
| US20130230248A1 (en) | Ensuring validity of the bookmark reference in a collaborative bookmarking system | |
| CN112148979B (zh) | 事件关联用户的识别方法、装置、电子设备和存储介质 | |
| CN116127106A (zh) | 基于区块链的互联网图像大数据智能存储方法与系统 | |
| US10296990B2 (en) | Verifying compliance of a land parcel to an approved usage | |
| Li et al. | The Application of Association Analysis in Mobile Phone Forensics System | |
| US20250328548A1 (en) | Inline Nested Data Loss Protection (DLP) | |
| CN105160014A (zh) | 数据处理方法及装置 | |
| US12443792B2 (en) | Reference driven NLP-based topic categorization |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| | 121 | Ep: the epo has been informed by wipo that ep was designated in this application | Ref document number: 16865698; Country of ref document: EP; Kind code of ref document: A1 |
| | WWE | Wipo information: entry into national phase | Ref document number: 11201804174T; Country of ref document: SG. Ref document number: 12018501058; Country of ref document: PH |
| | WWE | Wipo information: entry into national phase | Ref document number: 2018526215; Country of ref document: JP |
| | NENP | Non-entry into the national phase | Ref country code: DE |
| | ENP | Entry into the national phase | Ref document number: 20187017252; Country of ref document: KR; Kind code of ref document: A |
| | WWE | Wipo information: entry into national phase | Ref document number: 1020187017252; Country of ref document: KR. Ref document number: 2016865698; Country of ref document: EP |
| | ENP | Entry into the national phase | Ref document number: 2016355767; Country of ref document: AU; Date of ref document: 20161109; Kind code of ref document: A |