BRPI0913815A2 - "equipamento de computador, método e programa de computador para extração de termos a partir de dados de documentos incluindo segmentos de texto" - Google Patents

"equipamento de computador, método e programa de computador para extração de termos a partir de dados de documentos incluindo segmentos de texto"

Info

Publication number
BRPI0913815A2
BRPI0913815A2 BRPI0913815A BRPI0913815A BRPI0913815A2 BR PI0913815 A2 BRPI0913815 A2 BR PI0913815A2 BR PI0913815 A BRPI0913815 A BR PI0913815A BR PI0913815 A BRPI0913815 A BR PI0913815A BR PI0913815 A2 BRPI0913815 A2 BR PI0913815A2
Authority
BR
Brazil
Prior art keywords
data including
document data
including text
text segments
computer program
Prior art date
Application number
BRPI0913815A
Other languages
English (en)
Inventor
Hironori Takeuchi
Shiho Negishi
Yohei Ikawa
Original Assignee
Ibm
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Ibm filed Critical Ibm
Publication of BRPI0913815A2 publication Critical patent/BRPI0913815A2/pt
Publication of BRPI0913815B1 publication Critical patent/BRPI0913815B1/pt

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/40Processing or translation of natural language
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/34Browsing; Visualisation therefor
    • G06F16/345Summarisation for human users
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • G06F40/284Lexical analysis, e.g. tokenisation or collocates

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Databases & Information Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Machine Translation (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
BRPI0913815-3A 2008-10-02 2009-07-30 equipamento de computador e método para extração de termos a partir de dados de documentos incluindo segmentos de texto BRPI0913815B1 (pt)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2008257388 2008-10-02
PCT/JP2009/063584 WO2010038540A1 (ja) 2008-10-02 2009-07-30 テキストセグメントを有する文書から用語を抽出するためのシステム

Publications (2)

Publication Number Publication Date
BRPI0913815A2 true BRPI0913815A2 (pt) 2015-10-20
BRPI0913815B1 BRPI0913815B1 (pt) 2019-11-12

Family

ID=42073317

Family Applications (1)

Application Number Title Priority Date Filing Date
BRPI0913815-3A BRPI0913815B1 (pt) 2008-10-02 2009-07-30 equipamento de computador e método para extração de termos a partir de dados de documentos incluindo segmentos de texto

Country Status (7)

Country Link
US (2) US8463794B2 (pt)
EP (1) EP2315129A4 (pt)
JP (1) JP5106636B2 (pt)
KR (1) KR101498331B1 (pt)
CN (1) CN102144229B (pt)
BR (1) BRPI0913815B1 (pt)
WO (1) WO2010038540A1 (pt)

Families Citing this family (21)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8719692B2 (en) 2011-03-11 2014-05-06 Microsoft Corporation Validation, rejection, and modification of automatically generated document annotations
US9223859B2 (en) * 2011-05-11 2015-12-29 Here Global B.V. Method and apparatus for summarizing communications
JP5670490B2 (ja) * 2012-02-15 2015-02-18 楽天株式会社 カテゴリ判定装置、検索装置、カテゴリ判定方法、カテゴリ判定プログラム、及びそのプログラムを記憶するコンピュータ読取可能な記録媒体
JP5863537B2 (ja) * 2012-03-30 2016-02-16 インターナショナル・ビジネス・マシーンズ・コーポレーションInternational Business Machines Corporation 電子文書に含まれる非自己記述的用語を特定するためのコンピュータ実装方法、プログラムおよびシステム
US9436891B2 (en) 2013-07-30 2016-09-06 GlobalFoundries, Inc. Discriminating synonymous expressions using images
JP6277921B2 (ja) * 2014-09-25 2018-02-14 京セラドキュメントソリューションズ株式会社 用語集管理装置および用語集管理プログラム
US20160117386A1 (en) 2014-10-22 2016-04-28 International Business Machines Corporation Discovering terms using statistical corpus analysis
CN105159892B (zh) * 2015-08-28 2018-04-03 长安大学 一种语料提取器及提取语料的方法
CN105677640A (zh) * 2016-01-08 2016-06-15 中国科学院计算技术研究所 一种面向开放文本的领域概念抽取方法
WO2017163346A1 (ja) * 2016-03-23 2017-09-28 株式会社野村総合研究所 文章解析システム及びプログラム
US20200201917A1 (en) * 2017-09-11 2020-06-25 Shimadzu Corporation Sample category identification device, analysis system, and analysis network system
CN110020140B (zh) * 2017-11-15 2023-02-21 腾讯科技(深圳)有限公司 推荐内容显示方法、装置及系统
CN107918606B (zh) * 2017-11-29 2021-02-09 北京小米移动软件有限公司 具象名词识别方法、装置及计算机可读存储介质
US10394955B2 (en) 2017-12-21 2019-08-27 International Business Machines Corporation Relation extraction from a corpus using an information retrieval based procedure
US10929106B1 (en) * 2018-08-13 2021-02-23 Zoho Coroporation Private Limited Semantic analyzer with grammatical-number enforcement within a namespace
US11151175B2 (en) 2018-09-24 2021-10-19 International Business Machines Corporation On-demand relation extraction from text
CN111291167B (zh) * 2018-12-07 2023-05-05 宁波方太厨具有限公司 基于图像识别的产品纸质说明书自动查检方法
CN114207604A (zh) 2019-07-05 2022-03-18 爱思唯尔有限公司 使用针对性问题回答来提取科学测量背景的系统和方法
CN113971401B (zh) * 2020-07-23 2025-03-18 金风科技股份有限公司 风电故障信息抽取方法和装置
KR102318674B1 (ko) * 2020-10-27 2021-10-28 (주)메디아이플러스 임상 시험 주요 키워드 예측 방법 및 이를 실행하는 서버
CN114841755B (zh) * 2022-05-30 2025-04-18 北京百度网讯科技有限公司 文案的生成方法、装置、电子设备和存储介质

Family Cites Families (21)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2583386B2 (ja) * 1993-03-29 1997-02-19 日本電気株式会社 キーワード自動抽出装置
US5715468A (en) * 1994-09-30 1998-02-03 Budzinski; Robert Lucius Memory system for storing and retrieving experience and knowledge with natural language
JPH09190438A (ja) 1996-01-12 1997-07-22 Canon Inc 情報処理装置及びその方法
JPH10177575A (ja) 1996-10-15 1998-06-30 Ricoh Co Ltd 語句抽出装置および方法、情報記憶媒体
JP3579204B2 (ja) * 1997-01-17 2004-10-20 富士通株式会社 文書要約装置およびその方法
US6253202B1 (en) * 1998-09-18 2001-06-26 Tacit Knowledge Systems, Inc. Method, system and apparatus for authorizing access by a first user to a knowledge profile of a second user responsive to an access request from the first user
JP4253152B2 (ja) * 2000-01-05 2009-04-08 三菱電機株式会社 キーワード抽出装置
US6999963B1 (en) * 2000-05-03 2006-02-14 Microsoft Corporation Methods, apparatus, and data structures for annotating a database design schema and/or indexing annotations
GB2390704A (en) * 2002-07-09 2004-01-14 Canon Kk Automatic summary generation and display
JP2004151882A (ja) * 2002-10-29 2004-05-27 Fuji Xerox Co Ltd 情報出力制御方法、情報出力処理システム、プログラム
US20050004806A1 (en) * 2003-06-20 2005-01-06 Dah-Chih Lin Automatic patent claim reader and computer-aided claim reading method
JP4249038B2 (ja) 2004-01-08 2009-04-02 株式会社ジャストシステム 文書表示装置、文書表示方法、および文書表示プログラム
CN100336056C (zh) * 2005-01-07 2007-09-05 清华大学 基于成熟工艺文档的工艺术语提取、规律分析和重用方法
US8135728B2 (en) * 2005-03-24 2012-03-13 Microsoft Corporation Web document keyword and phrase extraction
US20070016863A1 (en) * 2005-07-08 2007-01-18 Yan Qu Method and apparatus for extracting and structuring domain terms
US7870117B1 (en) * 2006-06-01 2011-01-11 Monster Worldwide, Inc. Constructing a search query to execute a contextual personalized search of a knowledge base
WO2007143223A2 (en) * 2006-06-09 2007-12-13 Tamale Software, Inc. System and method for entity based information categorization
CN101122909B (zh) * 2006-08-10 2010-06-16 株式会社日立制作所 文本信息检索装置以及文本信息检索方法
US8166045B1 (en) * 2007-03-30 2012-04-24 Google Inc. Phrase extraction using subphrase scoring
US8290946B2 (en) * 2008-06-24 2012-10-16 Microsoft Corporation Consistent phrase relevance measures
US8214346B2 (en) * 2008-06-27 2012-07-03 Cbs Interactive Inc. Personalization engine for classifying unstructured documents

Also Published As

Publication number Publication date
US20110208728A1 (en) 2011-08-25
JP5106636B2 (ja) 2012-12-26
US20130253916A1 (en) 2013-09-26
US8463794B2 (en) 2013-06-11
BRPI0913815B1 (pt) 2019-11-12
EP2315129A1 (en) 2011-04-27
EP2315129A4 (en) 2016-06-15
CN102144229B (zh) 2013-09-04
JPWO2010038540A1 (ja) 2012-03-01
KR101498331B1 (ko) 2015-03-03
KR20110081194A (ko) 2011-07-13
US9043339B2 (en) 2015-05-26
CN102144229A (zh) 2011-08-03
WO2010038540A1 (ja) 2010-04-08

Similar Documents

Publication Publication Date Title
BRPI0913815A2 (pt) "equipamento de computador, método e programa de computador para extração de termos a partir de dados de documentos incluindo segmentos de texto"
BRPI0913820A2 (pt) "método para operar uma rede, dispositivo de gerenciamento de sistema, rede e programa de computador"
BRPI0923419A2 (pt) aparelho e método para analizar a condição de uma máquina, e, programa de computador.
BRPI0820949A2 (pt) Método para extração das características de baixa intensidade de um conjunto de dados de imagens, produtos de programa de computador, sistema para a exibição das características de baixa intensidade de um conjunto de dados de imagem
BRPI0820830A2 (pt) Método para modelar em um computador para região física, e, produto de programa de computador.
BR112012005252A2 (pt) aparelho de processamento de informação, método de gerenciamento de dados, e, programa
BRPI0813771A2 (pt) Método, produto de programa de computador, telefone, e, métodos para usar um telefone para facilitar um processo de autorização.
DE602008005063D1 (de) Informationsverarbeitungsvorrichtung, Informationsverarbeitungsverfahren und Computerprogramm
BRPI0919572A2 (pt) método implementado por computador em múltipla escala, sistema implementado por computador, e, método para operar um reservatório de subsuperfície
EP2352103A4 (en) INFORMATION PROCESSING APPARATUS, DOCUMENT EXTRACTING SYSTEM AND METHOD, AND PROGRAM
BRPI0810134A2 (pt) Dispersão de dado acústicos, método de extração de dispersão para dados acústicos, e aparelho para extração de dispersão para dados acústicos.
BRPI0810453A2 (pt) Método para projetar um sistema de localização wireless esparso a partir de um projeto de rede inicial e meio para leitura por computador contendo o mesmo
BRPI0917120A2 (pt) método, e, meio legível por computador.
BRPI0910893A2 (pt) método para analisar dados de deformação
BRPI0820488A2 (pt) método e equipamento para processar um sinal
FR2926375B1 (fr) Procede d'execution d'une application informatique, kit et aeronef associes
BRPI1006971A2 (pt) "dispositivo e método de processamento de informação, e, programa."
GB0802989D0 (en) System, method and computer program for selecting an information provider
BRPI0905920A2 (pt) Método para criação de um banco de dados, produto de banco de dados do computador, meio de educação e método de negócio
BRPI1014842A2 (pt) "método para geração de livro de códigos, método e aparelho para transmissão de dados"
EP2407844A4 (en) OPERATIONAL SUPPORT DEVICE, OPERATIONAL SUPPORT PROCESS AND COMPUTER PROGRAM
BRPI0812652A2 (pt) "método para avaliar automaticamente um diálogo, produto de programa de computador e sistema para avaliar automaticamente um diálogo"
BRPI0907937A2 (pt) Equipamento, método, e programa de computador incorporado em um meio legível por computador
BRPI0906911A2 (pt) Processo e aparelhagem para recuperar dicloridrinas a partir de uma mistura compreendendo dicloridrinas.
BRPI0719477A2 (pt) Método para classificar páginas da web e organizar os conteúdos correspondentes.

Legal Events

Date Code Title Description
B06F Objections, documents and/or translations needed after an examination request according art. 34 industrial property law
B15K Others concerning applications: alteration of classification

Free format text: AS CLASSIFICACOES ANTERIORES ERAM: G06F 17/28 , G06F 17/21 , G06F 17/30

Ipc: G06F 17/27 (1995.01), G06F 16/34 (2019.01), G06F 1

B06T Formal requirements before examination
B16A Patent or certificate of addition of invention granted

Free format text: PRAZO DE VALIDADE: 10 (DEZ) ANOS CONTADOS A PARTIR DE 12/11/2019, OBSERVADAS AS CONDICOES LEGAIS. (CO) 10 (DEZ) ANOS CONTADOS A PARTIR DE 12/11/2019, OBSERVADAS AS CONDICOES LEGAIS