NO20045285L - Method for synthesizing a self-learning system for extracting knowledge from textual documents for use in search systems

Method for synthesizing a self-learning system for extracting knowledge from textual documents for use in search systems

Info

Publication number
NO20045285L
Authority
NO
Norway
Prior art keywords
text
stochastically indexed
request
self
knowledge
Prior art date
Application number
NO20045285A
Other languages
English (en)
Inventor
Vladimir Vladimirovich Nasypny
Galina Antolievna Nasypnaya
Original Assignee
Galina Antolievna Nasypnaya
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Galina Antolievna Nasypnaya
Publication of NO20045285L

Classifications

    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30 Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33 Querying
    • G06F16/3331 Query processing
    • G06F16/334 Query execution
    • G06F16/3344 Query execution using natural language analysis
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30 Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/31 Indexing; Data structures therefor; Storage structures
    • G06F16/313 Selection or weighting of terms for indexing

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Artificial Intelligence (AREA)
  • Software Systems (AREA)
  • Machine Translation (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Pharmaceuticals Containing Other Organic And Inorganic Compounds (AREA)

Abstract

The invention can be used to develop Internet-based data retrieval systems. It makes it possible to form knowledge automatically and extract it from electronically presented text documents in different languages, and to process text data and user requests intelligently. The inventive method consists of providing a self-learning mechanism for a system that includes rules for grammatical and semantic analysis in the form of a stochastically indexed intelligence system, forming a database of stochastically indexed dictionaries and an index table for linguistic texts, carrying out the analyses and the stochastic indexing of the text documents, and forming a corresponding knowledge base. A stochastically indexed user request is transformed into a plurality of new requests, and the fragments of the text documents that contain the word groups of the transformed requests are selected. The fragments are used to form a stochastically indexed semantic structure and, based on it, the system's short response. The relevance of the short response to the request is checked by forming an interrogative sentence based on the response and comparing that sentence with the request.
NO20045285A 2002-05-28 2004-12-02 Method for synthesizing a self-learning system for extracting knowledge from textual documents for use in search systems NO20045285L (no)
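The request-handling steps named in the abstract (indexing words, transforming a request into several requests, selecting matching text fragments) can be illustrated with a minimal sketch. It is not the patented method: the abstract does not define how a stochastic index is computed, so a truncated SHA-256 hash stands in for it here, and the SYNONYMS table, the function names, and the sentence-level fragment split are all assumptions made for illustration. The semantic-structure and relevance-check steps are not modelled.

```python
import hashlib
import re
from itertools import product

# Hypothetical synonym dictionary; the abstract only says that dictionaries exist.
SYNONYMS = {"car": ["car", "automobile"], "quick": ["quick", "fast"]}


def stochastic_index(word):
    """Map a word to a short pseudo-random code (assumed stand-in for a stochastic index)."""
    return hashlib.sha256(word.lower().encode("utf-8")).hexdigest()[:8]


def index_text(text):
    """Return the set of index codes for every word in a piece of text."""
    return frozenset(stochastic_index(w) for w in re.findall(r"\w+", text))


def transform_request(request):
    """Expand one request into several indexed requests, one per synonym combination."""
    words = re.findall(r"\w+", request.lower())
    alternatives = [SYNONYMS.get(w, [w]) for w in words]
    return [frozenset(stochastic_index(w) for w in combo)
            for combo in product(*alternatives)]


def select_fragments(document, request):
    """Keep the sentences whose index set contains every code of some transformed request."""
    fragments = [s.strip() for s in re.split(r"(?<=[.!?])\s+", document) if s.strip()]
    queries = transform_request(request)
    return [f for f in fragments if any(q <= index_text(f) for q in queries)]


if __name__ == "__main__":
    doc = "The automobile is fast. The bicycle is slow."
    print(select_fragments(doc, "quick car"))
```

Matching is done on index codes rather than on the words themselves, which mirrors the abstract's emphasis on indexed requests and indexed documents; a real system would presumably add the grammatical and semantic analysis that this sketch leaves out.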

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/RU2002/000258 WO2003100659A1 (fr) 2002-05-28 2002-05-28 Method for synthesizing a self-learning system for extracting knowledge from textual documents for search engines

Publications (1)

Publication Number Publication Date
NO20045285L (no) 2005-02-16

Family

ID=29580128

Family Applications (1)

Application Number Title Priority Date Filing Date
NO20045285A NO20045285L (no) 2002-05-28 2004-12-02 Method for synthesizing a self-learning system for extracting knowledge from textual documents for use in search systems

Country Status (9)

Country Link
US (1) US20050071150A1 (no)
EP (1) EP1508861A1 (no)
JP (1) JP2005535007A (no)
KR (1) KR20040111715A (no)
CN (1) CN100392644C (no)
AU (1) AU2002323853A1 (no)
CA (1) CA2487739A1 (no)
NO (1) NO20045285L (no)
WO (1) WO2003100659A1 (no)

Families Citing this family (95)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6741990B2 (en) * 2001-05-23 2004-05-25 Intel Corporation System and method for efficient and adaptive web accesses filtering
US7127520B2 (en) 2002-06-28 2006-10-24 Streamserve Method and system for transforming input data streams
US7047226B2 (en) * 2002-07-24 2006-05-16 The United States Of America As Represented By The Secretary Of The Navy System and method for knowledge amplification employing structured expert randomization
US7296260B2 (en) * 2003-02-26 2007-11-13 Inventec Corporation System and method for composing a multi-lingual instructional software
US7328156B2 (en) * 2003-07-17 2008-02-05 International Business Machines Corporation Computational linguistic statements for providing an autonomic computing environment
US8869061B1 (en) 2003-08-29 2014-10-21 Microsoft Corporation User interface for searching an electronic document
TWI290687B (en) * 2003-09-19 2007-12-01 Hon Hai Prec Ind Co Ltd System and method for search information based on classifications of synonymous words
US7590936B1 (en) * 2003-09-30 2009-09-15 Microsoft Corporation Method for extracting information associated with a search term
US20050120009A1 (en) * 2003-11-21 2005-06-02 Aker J. B. System, method and computer program application for transforming unstructured text
US7689412B2 (en) * 2003-12-05 2010-03-30 Microsoft Corporation Synonymous collocation extraction using translation information
EP1697300A4 (en) * 2003-12-24 2007-10-03 Univ Louisville Res Found BONE-RELATED COMPOUNDS FOR THE ADMINISTRATION OF BODY AGENTS FOR THE INTERACTION THEREOF
US7562008B2 (en) * 2004-06-23 2009-07-14 Ning-Ping Chan Machine translation method and system that decomposes complex sentences into two or more sentences
JP2006091994A (ja) * 2004-09-21 2006-04-06 Toshiba Corp Document information processing apparatus and method, and document information processing program
US9104779B2 (en) * 2005-03-30 2015-08-11 Primal Fusion Inc. Systems and methods for analyzing and synthesizing complex knowledge representations
US7548849B2 (en) * 2005-04-29 2009-06-16 Research In Motion Limited Method for generating text that meets specified characteristics in a handheld electronic device and a handheld electronic device incorporating the same
US7912701B1 (en) 2005-05-04 2011-03-22 IgniteIP Capital IA Special Management LLC Method and apparatus for semiotic correlation
KR100614762B1 (ko) * 2005-06-03 2006-08-22 주식회사 우량정보기술 Method for providing content through integration of KMS and LMS, and recording medium storing a program for executing the same
US20060282255A1 (en) * 2005-06-14 2006-12-14 Microsoft Corporation Collocation translation from monolingual and available bilingual corpora
US20070005679A1 (en) * 2005-06-21 2007-01-04 Bui Richard T Server-client hybrid search systems, methods, and apparatuses
US20070016397A1 (en) * 2005-07-18 2007-01-18 Microsoft Corporation Collocation translation using monolingual corpora
US7788263B2 (en) 2005-08-10 2010-08-31 Microsoft Corporation Probabilistic retrospective event detection
US8572088B2 (en) * 2005-10-21 2013-10-29 Microsoft Corporation Automated rich presentation of a semantic topic
US7644048B2 (en) * 2005-10-28 2010-01-05 General Dynamics Advanced Information Systems, Inc. System, method and software for cognitive automation
US8180625B2 (en) * 2005-11-14 2012-05-15 Fumitaka Noda Multi language exchange system
US7930319B2 (en) * 2008-01-10 2011-04-19 Qin Zhang Search method and system using thinking system
US8019714B2 (en) * 2005-12-12 2011-09-13 Qin Zhang Thinking system and method
US7962328B2 (en) * 2006-03-13 2011-06-14 Lexikos Corporation Method and apparatus for generating a compact data structure to identify the meaning of a symbol
US20070260450A1 (en) * 2006-05-05 2007-11-08 Yudong Sun Indexing parsed natural language texts for advanced search
JP4256416B2 (ja) * 2006-09-29 2009-04-22 株式会社東芝 Data structure conversion system and program
US7984032B2 (en) * 2007-08-31 2011-07-19 Microsoft Corporation Iterators for applying term occurrence-level constraints in natural language searching
US8639708B2 (en) * 2007-08-31 2014-01-28 Microsoft Corporation Fact-based indexing for natural language search
US8463593B2 (en) * 2007-08-31 2013-06-11 Microsoft Corporation Natural language hypernym weighting for word sense disambiguation
US8346756B2 (en) * 2007-08-31 2013-01-01 Microsoft Corporation Calculating valence of expressions within documents for searching a document index
US8229730B2 (en) * 2007-08-31 2012-07-24 Microsoft Corporation Indexing role hierarchies for words in a search index
US8868562B2 (en) * 2007-08-31 2014-10-21 Microsoft Corporation Identification of semantic relationships within reported speech
MX2010002350A (es) * 2007-08-31 2010-07-30 Microsoft Corp Identification of semantic relationships within reported speech.
US8041697B2 (en) * 2007-08-31 2011-10-18 Microsoft Corporation Semi-automatic example-based induction of semantic translation rules to support natural language search
US8316036B2 (en) * 2007-08-31 2012-11-20 Microsoft Corporation Checkpointing iterators during search
US8712758B2 (en) * 2007-08-31 2014-04-29 Microsoft Corporation Coreference resolution in an ambiguity-sensitive natural language processing system
US8229970B2 (en) * 2007-08-31 2012-07-24 Microsoft Corporation Efficient storage and retrieval of posting lists
US8280721B2 (en) * 2007-08-31 2012-10-02 Microsoft Corporation Efficiently representing word sense probabilities
US20090070322A1 (en) * 2007-08-31 2009-03-12 Powerset, Inc. Browsing knowledge on the basis of semantic relations
US8996433B2 (en) * 2007-10-11 2015-03-31 Steven Ginzberg Automated natural language formula translator and data evaluator
US20090198488A1 (en) * 2008-02-05 2009-08-06 Eric Arno Vigen System and method for analyzing communications using multi-placement hierarchical structures
US8370128B2 (en) * 2008-09-30 2013-02-05 Xerox Corporation Semantically-driven extraction of relations between named entities
CN101876981B (zh) * 2009-04-29 2015-09-23 阿里巴巴集团控股有限公司 Method and apparatus for building a knowledge base
TW201118619A (en) * 2009-11-30 2011-06-01 Inst Information Industry An opinion term mining method and apparatus thereof
US8457948B2 (en) * 2010-05-13 2013-06-04 Expedia, Inc. Systems and methods for automated content generation
US9317595B2 (en) 2010-12-06 2016-04-19 Yahoo! Inc. Fast title/summary extraction from long descriptions
US20120215712A1 (en) * 2011-02-17 2012-08-23 Tariq Malki System and database for education
SG194709A1 (en) * 2011-05-10 2013-12-30 Nec Corp Device, method and program for assessing synonymous expressions
JP2013003663A (ja) * 2011-06-13 2013-01-07 Sony Corp Information processing apparatus, information processing method, and program
US9201868B1 (en) * 2011-12-09 2015-12-01 Guangsheng Zhang System, methods and user interface for identifying and presenting sentiment information
US9037452B2 (en) * 2012-03-16 2015-05-19 Afrl/Rij Relation topic construction and its application in semantic relation extraction
CN102651014B (zh) * 2012-03-29 2014-10-22 华侨大学 Retrieval method for domain data semantics based on concept relationships
GB2513537A (en) 2012-12-20 2014-11-05 Ibm Natural language processing
US9201860B1 (en) 2013-03-12 2015-12-01 Guangsheng Zhang System and methods for determining sentiment based on context
KR20150026305A (ko) * 2013-09-02 2015-03-11 최승철 Language learning program and computer-readable recording medium storing the same
US9547640B2 (en) * 2013-10-16 2017-01-17 International Business Machines Corporation Ontology-driven annotation confidence levels for natural language processing
US9916284B2 (en) 2013-12-10 2018-03-13 International Business Machines Corporation Analyzing document content and generating an appendix
KR101590908B1 (ko) * 2013-12-24 2016-02-03 서강대학교산학협력단 Method for learning and serving chat data, and system therefor
CN104850554B (zh) * 2014-02-14 2020-05-19 北京搜狗科技发展有限公司 Search method and system
US20160110394A1 (en) * 2014-10-15 2016-04-21 Bart Boxwell Obituary Alerting System and Method of Use
US9886665B2 (en) 2014-12-08 2018-02-06 International Business Machines Corporation Event detection using roles and relationships of entities
US10079785B2 (en) 2015-02-12 2018-09-18 Google Llc Determining reply content for a reply to an electronic communication
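CN105468663A (zh) * 2015-02-12 2016-04-06 国网山东省电力公司潍坊供电公司 Method for building an intelligent decision-making power grid knowledge base based on a cloud model
CN106155999A (zh) * 2015-04-09 2016-11-23 科大讯飞股份有限公司 Natural language semantic understanding method and system
KR101686919B1 (ko) * 2016-01-07 2016-12-16 주식회사 엑셈 Method and apparatus for managing an inference engine based on big data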
US10534843B2 (en) 2016-05-27 2020-01-14 Open Text Sa Ulc Document architecture with efficient storage
US10671928B2 (en) 2016-08-30 2020-06-02 International Business Machines Corporation Adaptive analytical modeling tool
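CN106469214B (zh) * 2016-09-06 2019-10-15 北京百度网讯科技有限公司 Artificial intelligence-based information presentation method and apparatus
WO2018083804A1 (ja) 2016-11-07 2018-05-11 富士通株式会社 Analysis program, information processing apparatus, and analysis method
KR101970294B1 (ko) * 2017-03-06 2019-04-18 네이버 주식회사 Item recommendation apparatus, method, and computer program
CN111279331B (zh) * 2017-11-06 2023-11-10 株式会社力森诺科 Causal sentence analysis device, causal sentence analysis system, program, and causal sentence analysis method
CN107977415B (zh) * 2017-11-22 2019-02-05 北京寻领科技有限公司 Automatic question answering method and apparatus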
US11625533B2 (en) * 2018-02-28 2023-04-11 Charles Northrup System and method for a thing machine to perform models
US11120059B2 (en) * 2018-06-27 2021-09-14 Adobe Inc. Conversational query answering system
US10740381B2 (en) * 2018-07-18 2020-08-11 International Business Machines Corporation Dictionary editing system integrated with text mining
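CN110390049B (zh) * 2019-07-10 2022-01-28 北京航空航天大学 Automatic answer generation method for software development questions
CN111444399B (zh) * 2020-03-30 2022-10-25 腾讯科技(深圳)有限公司 Reply content generation method, apparatus, device, and readable storage medium
CN111737572B (zh) * 2020-06-17 2024-01-30 北京字节跳动网络技术有限公司 Search statement generation method, apparatus, and electronic device
CN111950646A (zh) * 2020-08-20 2020-11-17 北京环境特性研究所 Method for constructing a hierarchical knowledge model of electromagnetic images and target recognition method
CN112651226B (zh) * 2020-09-21 2022-03-29 深圳前海黑顿科技有限公司 Knowledge parsing system and method based on dependency syntax trees
CN113641778B (zh) * 2020-10-30 2024-07-12 浙江华云信息科技有限公司 Topic identification method for dialogue text
CN113569539B (zh) * 2021-02-05 2025-04-04 中国科学院计算技术研究所 Text content derivation method, apparatus, computer-readable medium, and electronic device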
EP4327329A1 (en) * 2021-04-22 2024-02-28 Smart Reporting GmbH Methods and systems for structuring medical report texts
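CN114064855B (zh) * 2021-11-10 2024-05-17 国电南瑞南京控制系统有限公司 Information retrieval method and system based on a transformer knowledge base
CN114281945B (zh) * 2021-12-28 2024-02-27 合肥工业大学 Method for constructing a carbon reduction strategy knowledge base based on a green product case library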
US11888793B2 (en) 2022-02-22 2024-01-30 Open Text Holdings, Inc. Systems and methods for intelligent delivery of communications
US12273310B2 (en) 2022-02-22 2025-04-08 Open Text Holdings, Inc. Systems and methods for intelligent delivery of communications
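CN114706941B (zh) * 2022-03-03 2023-04-18 广州万辉信息科技有限公司 Patent monitoring platform and method
CN114742041B (zh) * 2022-04-21 2026-01-27 中国航空无线电电子研究所 Auxiliary checking system for ambiguity in English requirements
CN116778906A (zh) * 2023-07-14 2023-09-19 上海蜜度信息技术有限公司 Audio generation method, device, and computer-readable medium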
US12265836B1 (en) * 2024-02-29 2025-04-01 Crowdstrike, Inc. Localization middleware
CN119540958B (zh) * 2025-01-20 2025-04-25 北京星震同源数字系统股份有限公司 Self-learning method and apparatus for a corpus knowledge base based on comparison learning with a gradient descent method

Family Cites Families (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5454106A (en) * 1993-05-17 1995-09-26 International Business Machines Corporation Database retrieval system using natural language for presenting understood components of an ambiguous query on a user interface
CA2193803C (en) * 1994-06-22 2004-12-07 Bruce G. Molloy A system and method for representing and retrieving knowledge in an adaptive cognitive network
US5642502A (en) * 1994-12-06 1997-06-24 University Of Central Florida Method and system for searching for relevant documents from a text database collection, using statistical ranking, relevancy feedback and small pieces of text
US5717913A (en) * 1995-01-03 1998-02-10 University Of Central Florida Method for detecting and extracting text data using database schemas
US6498921B1 (en) * 1999-09-01 2002-12-24 Chi Fai Ho Method and system to answer a natural-language question
US6269368B1 (en) * 1997-10-17 2001-07-31 Textwise Llc Information retrieval using dynamic evidence combination
KR980004126A (ko) * 1997-12-16 1998-03-30 양승택 Query conversion apparatus and method for multilingual web document retrieval
US6101492A (en) * 1998-07-02 2000-08-08 Lucent Technologies Inc. Methods and apparatus for information indexing and retrieval as well as query expansion using morpho-syntactic analysis
RU2166208C2 (ru) * 1999-04-29 2001-04-27 Халин Евгений Васильевич Method for automated acquisition of knowledge on industrial safety
US6446064B1 (en) * 1999-06-08 2002-09-03 Albert Holding Sa System and method for enhancing e-commerce using natural language interface for searching database
US6601026B2 (en) * 1999-09-17 2003-07-29 Discern Communications, Inc. Information retrieval by natural language querying
US6963863B1 (en) * 1999-09-28 2005-11-08 Thomas Bannon Network query and matching system and method
DE19952769B4 (de) * 1999-11-02 2008-07-17 Sap Ag Search engine and method for retrieving information using natural language queries
US20030074353A1 (en) * 1999-12-20 2003-04-17 Berkan Riza C. Answer retrieval technique
US6829603B1 (en) * 2000-02-02 2004-12-07 International Business Machines Corp. System, method and program product for interactive natural dialog
US6757646B2 (en) * 2000-03-22 2004-06-29 Insightful Corporation Extended functionality for an inverse inference engine based web search
US6701309B1 (en) * 2000-04-21 2004-03-02 Lycos, Inc. Method and system for collecting related queries
US6728728B2 (en) * 2000-07-24 2004-04-27 Israel Spiegler Unified binary model and methodology for knowledge representation and for data and information mining
US6778951B1 (en) * 2000-08-09 2004-08-17 Concerto Software, Inc. Information retrieval method with natural language interface
US6766316B2 (en) * 2001-01-18 2004-07-20 Science Applications International Corporation Method and system of ranking and clustering for document indexing and retrieval
US20020165860A1 (en) * 2001-05-07 2002-11-07 Nec Research Insititute, Inc. Selective retrieval metasearch engine
US6654740B2 (en) * 2001-05-08 2003-11-25 Sunflare Co., Ltd. Probabilistic information retrieval based on differential latent semantic space
US6778979B2 (en) * 2001-08-13 2004-08-17 Xerox Corporation System for automatically generating queries

Also Published As

Publication number Publication date
HK1077380A1 (en) 2006-02-10
KR20040111715A (ko) 2004-12-31
CA2487739A1 (en) 2003-12-04
US20050071150A1 (en) 2005-03-31
AU2002323853A1 (en) 2003-12-12
JP2005535007A (ja) 2005-11-17
WO2003100659A1 (fr) 2003-12-04
CN1628298A (zh) 2005-06-15
EP1508861A1 (en) 2005-02-23
CN100392644C (zh) 2008-06-04

Similar Documents

Publication Publication Date Title
NO20045285L (no) Method for synthesizing a self-learning system for extracting knowledge from textual documents for use in search systems
Zhang et al. Entity linking leveraging automatically generated annotation
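CN108304375B (zh) Information identification method and device, storage medium, and terminal
JP6813591B2 (ja) Model creation device, text search device, model creation method, text search method, and program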
Ravichandran et al. Learning surface text patterns for a question answering system
US10503828B2 (en) System and method for answering natural language question
KR101726667B1 (ko) Grammar compiling method, semantic parsing method, device, computer storage medium, and apparatus
Almeman et al. Automatic building of arabic multi dialect text corpora by bootstrapping dialect words
US20100332217A1 (en) Method for text improvement via linguistic abstractions
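JP2005520251A (ja) Translation of named entities
KR20080084803A (ko) System and method for cross-language knowledge searching
JP2010519655A (ja) Name indexing for a name matching system
CN114064861B (zh) Method and apparatus for generating a query statement
WO2012159558A1 (zh) Natural language processing method, apparatus, and system based on semantic recognition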
Neale et al. Leveraging lexical resources and constraint grammar for rule-based part-of-speech tagging in Welsh
KR100481598B1 (ko) Apparatus and method for compound morpheme analysis
Alhasan et al. POS tagging for arabic text using bee colony algorithm
KR101333485B1 (ko) Method for constructing a named entity dictionary using an online dictionary, and apparatus for executing the same
Craig et al. Scaling address parsing sequence models through active learning
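JP2003150624A (ja) Information extraction device and information extraction method
JP4005343B2 (ja) Information retrieval system
KR20030006201A (ko) Integrated natural language question-answering system for automatic homepage search
JP5688754B2 (ja) Information retrieval apparatus and computer program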
Maarif et al. Complexity algorithm analysis for edit distance
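JP4153843B2 (ja) Natural sentence search device, natural sentence search method, natural sentence search program, and natural sentence search program storage medium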

Legal Events

Date Code Title Description
FC2A Withdrawal, rejection or dismissal of laid open patent application