CA2956627C - Systeme et moteur servant au regroupement cible d'evenements d'informations - Google Patents

Systeme et moteur servant au regroupement cible d'evenements d'informations

Info

Publication number
CA2956627C
CA2956627C CA2956627A CA2956627A CA2956627C CA 2956627 C CA2956627 C CA 2956627C CA 2956627 A CA2956627 A CA 2956627A CA 2956627 A CA2956627 A CA 2956627A CA 2956627 C CA2956627 C CA 2956627C
Authority
CA
Canada
Prior art keywords
documents
cluster
event
document
clusters
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CA2956627A
Other languages
English (en)
Other versions
CA2956627A1 (fr
Inventor
Jack G. Conrad
Michael J. Bender
Original Assignee
Thomson Reuters Enterprise Centre GmbH
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from US15/418,763 external-priority patent/US11663254B2/en
Application filed by Thomson Reuters Enterprise Centre GmbH filed Critical Thomson Reuters Enterprise Centre GmbH
Publication of CA2956627A1 publication Critical patent/CA2956627A1/fr
Application granted granted Critical
Publication of CA2956627C publication Critical patent/CA2956627C/fr
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

La présente invention concerne un système de regroupement et de recherche d’événements d’actualité configuré pour créer dans un premier temps un ensemble de données candidates de documents, dans un deuxième temps un ensemble de clusters initiaux basés sur la proximité ou le statut de similitude des doublons, et dans un troisième temps un cluster agrégé en fusionnant les clusters initiaux avec les documents sources. L’invention génère des clusters de niveau supérieur pour les événements d’actualité sur la base d’une étiquette thématique ou d’un composant « source » fourni par la rédaction, et génère des clusters axés sur des sous-thèmes à l’aide d’un algorithme. Le système utilise un algorithme de regroupement agglomératif pour rassembler et structurer les documents en ensembles de résultats distincts. Les décisions concernant la fusion de documents ou de clusters connexes sont prises en fonction de la similitude des preuves provenant de deux sources distinctes, l’une s’appuyant sur une signature numérique basée sur le texte non structuré du document, l’autre basée sur la présence de balises d’entités nommées qui ont été attribuées au document par un étiqueteur d’événements ou d’entités nommées tel que le moteur/service Web Thomson Reuters Calais.
CA2956627A 2016-01-29 2017-01-30 Systeme et moteur servant au regroupement cible d'evenements d'informations Active CA2956627C (fr)

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US201662288543P 2016-01-29 2016-01-29
US62/288543 2016-01-29
US15/418763 2017-01-29
US15/418,763 US11663254B2 (en) 2016-01-29 2017-01-29 System and engine for seeded clustering of news events

Publications (2)

Publication Number Publication Date
CA2956627A1 CA2956627A1 (fr) 2017-07-29
CA2956627C true CA2956627C (fr) 2025-09-16

Family

ID=59385117

Family Applications (1)

Application Number Title Priority Date Filing Date
CA2956627A Active CA2956627C (fr) 2016-01-29 2017-01-30 Systeme et moteur servant au regroupement cible d'evenements d'informations

Country Status (1)

Country Link
CA (1) CA2956627C (fr)

Families Citing this family (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP3447663A1 (fr) * 2017-08-23 2019-02-27 Tata Consultancy Services Limited Système et procédé pour le profilage d'événements
CN107992474B (zh) * 2017-11-24 2021-04-27 国家计算机网络与信息安全管理中心 一种流式数据主题挖掘方法及其系统
CN109299720B (zh) * 2018-07-13 2022-02-22 沈阳理工大学 一种基于轮廓片段空间关系的目标识别方法
GB202002192D0 (en) * 2020-02-18 2020-04-01 Echobox Ltd Topic clustering and Event Detection
CN111401775A (zh) * 2020-03-27 2020-07-10 深圳壹账通智能科技有限公司 复杂关系网络的信息分析方法、装置、设备及存储介质
CN112488894A (zh) * 2020-12-03 2021-03-12 南理工泰兴智能制造研究院有限公司 一种基于环保大数据的分类归档方法
CN112668836B (zh) * 2020-12-07 2024-04-05 数据地平线(广州)科技有限公司 一种面向风险图谱的关联风险证据高效挖掘与监控方法和装置

Also Published As

Publication number Publication date
CA2956627A1 (fr) 2017-07-29

Similar Documents

Publication Publication Date Title
US11663254B2 (en) System and engine for seeded clustering of news events
US7912816B2 (en) Adaptive archive data management
US10565234B1 (en) Ticket classification systems and methods
CA2956627C (fr) Systeme et moteur servant au regroupement cible d'evenements d'informations
US8266148B2 (en) Method and system for business intelligence analytics on unstructured data
Inzalkar et al. A survey on text mining-techniques and application
US8140515B2 (en) Personalization engine for building a user profile
Weng et al. Using text classification and multiple concepts to answer e-mails
CN110888990A (zh) 文本推荐方法、装置、设备及介质
WO2012129149A2 (fr) Regroupement de résultats de recherche basé sur l'association d'instances de données à des entités de bases de connaissances
WO2010144618A1 (fr) Procédés, appareil et logiciels pour analyser le contenu de messages de microblogues
US10002187B2 (en) Method and system for performing topic creation for social data
US12093222B2 (en) Data tagging and synchronisation system
Kaur Web content classification: A survey
EP2384476A1 (fr) Moteur de personnalisation pour la création d'un profil utilisateur
Uskenbayeva et al. Creation of data classification system for local administration
Huang et al. An intelligent mechanism to automatically discover emerging technology trends: Exploring regulatory technology
KR102041915B1 (ko) 인공지능을 활용한 데이터베이스 모듈 및 이를 이용하는 경제데이터 제공 시스템 및 방법
tong et al. Mining and analyzing user feedback from app reviews: An econometric approach
Pérez et al. Towards a data warehouse contextualized with web opinions
Xiao et al. Querying specific message from chat logs of suspects based on keywords expansion
Bhopale et al. Temporal topic modeling of scholarly publications for future trend forecasting
AU2021103329A4 (en) The investigation technique of object using machine learning and system.
Felden Integrating Structured and Unstructured Data in a Business Intelligence System
Feng et al. Key information retrieval for power system data based on data mining and improved decision tree algorithm

Legal Events

Date Code Title Description
EEER Examination request

Effective date: 20211101

EEER Examination request

Effective date: 20211101

EEER Examination request

Effective date: 20211101

EEER Examination request

Effective date: 20211101

EEER Examination request

Effective date: 20211101

EEER Examination request

Effective date: 20211101

MFA Maintenance fee for application paid

Free format text: FEE DESCRIPTION TEXT: MF (APPLICATION, 8TH ANNIV.) - STANDARD

Year of fee payment: 8

U00 Fee paid

Free format text: ST27 STATUS EVENT CODE: A-2-2-U10-U00-U101 (AS PROVIDED BY THE NATIONAL OFFICE); EVENT TEXT: MAINTENANCE REQUEST RECEIVED

Effective date: 20241219

U11 Full renewal or maintenance fee paid

Free format text: ST27 STATUS EVENT CODE: A-2-2-U10-U11-U102 (AS PROVIDED BY THE NATIONAL OFFICE); EVENT TEXT: MAINTENANCE FEE PAYMENT DETERMINED COMPLIANT

Effective date: 20241219

Free format text: ST27 STATUS EVENT CODE: A-2-2-U10-U11-U102 (AS PROVIDED BY THE NATIONAL OFFICE); EVENT TEXT: MAINTENANCE FEE PAYMENT PAID IN FULL

Effective date: 20241219

D22 Grant of ip right intended

Free format text: ST27 STATUS EVENT CODE: A-2-2-D10-D22-D128 (AS PROVIDED BY THE NATIONAL OFFICE); EVENT TEXT: NOTICE OF ALLOWANCE IS ISSUED

Effective date: 20250314

Free format text: ST27 STATUS EVENT CODE: A-2-2-D10-D22-D128 (AS PROVIDED BY THE NATIONAL OFFICE); EVENT TEXT: ALLOWANCE REQUIREMENTS DETERMINED COMPLIANT

Effective date: 20250314

W00 Other event occurred

Free format text: ST27 STATUS EVENT CODE: A-2-2-W10-W00-W100 (AS PROVIDED BY THE NATIONAL OFFICE); EVENT TEXT: LETTER SENT

Effective date: 20250314

D00 Search and/or examination requested or commenced

Free format text: ST27 STATUS EVENT CODE: A-4-4-D10-D00-D164 (AS PROVIDED BY THE NATIONAL OFFICE); EVENT TEXT: RESPONSE TO NOTICE OF ALLOWANCE

Effective date: 20250704

D22 Grant of ip right intended

Free format text: ST27 STATUS EVENT CODE: A-2-4-D10-D22-D143 (AS PROVIDED BY THE NATIONAL OFFICE); EVENT TEXT: PRE-GRANT

Effective date: 20250722

W00 Other event occurred

Free format text: ST27 STATUS EVENT CODE: A-4-4-W10-W00-W111 (AS PROVIDED BY THE NATIONAL OFFICE); EVENT TEXT: CORRESPONDENT DETERMINED COMPLIANT

Effective date: 20250722