WO2015192567A1 - Procédé et dispositif d'extraction d'un temps lunaire chinois à partir d'un texte et support de stockage informatique - Google Patents
Procédé et dispositif d'extraction d'un temps lunaire chinois à partir d'un texte et support de stockage informatique Download PDFInfo
- Publication number
- WO2015192567A1 WO2015192567A1 PCT/CN2014/089181 CN2014089181W WO2015192567A1 WO 2015192567 A1 WO2015192567 A1 WO 2015192567A1 CN 2014089181 W CN2014089181 W CN 2014089181W WO 2015192567 A1 WO2015192567 A1 WO 2015192567A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- time
- lunar
- lunar time
- text
- model file
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
Definitions
- the present invention relates to data processing technologies in the field of terminals, and in particular, to a method, device and computer storage medium for extracting lunar time from text.
- the lunar calendar In the process of describing time, there is a certain difference between the lunar calendar and the solar calendar. For example, when describing the date, from the 1st to the 10th, the lunar calendar is expressed by adding the word “beginning” in front of the number, so when the first few months appear It can be determined by the lunar calendar; from the eleventh, the description of the lunar calendar and the solar calendar is the same, which requires the user to judge according to other factors such as living environment and expression habits.
- embodiments of the present invention are directed to providing a method, apparatus, and computer storage medium for extracting lunar time from text, which can automatically extract lunar time from text, which is convenient. The user performs schedule management.
- the embodiment of the invention provides a method for extracting lunar time from text, the method comprising:
- the lunar time component in the text is extracted according to the model file of the lunar time; the lunar time component is converted into the lunar time value.
- the method further includes: generating a model file of the lunar time;
- the generating the model file of the lunar time includes: expressing an expression rule of the lunar time; generating a regular expression of the lunar time according to the rule; and encapsulating the regular expression into a model file of the lunar time.
- the generating a regular expression of the lunar time includes: setting an exact time indicating the lunar time before the relative time indicating the lunar time.
- the extracting the lunar time component in the text according to the model file of the lunar time includes: traversing and matching the content in the text with the regular expression in the model file; and performing the matching lunar time component extract.
- the method further includes: constructing a time object; converting the lunar time component into a lunar time value, comprising: assigning the extracted lunar time component to a control field of the time object, and obtaining a lunar time value
- the control field of the time object includes: year, month, day, hour, minute, and second.
- the method further includes: performing relative time calculation on the acquired lunar time value when the relative time is included in the lunar time component.
- An embodiment of the present invention further provides an apparatus for extracting lunar time from text, the apparatus comprising: an extracting module, and a converting module; wherein the extracting module is configured to extract a lunar calendar in the text according to a model file of the lunar time a time component; the conversion module configured to convert the lunar time component into a lunar time value.
- the device further includes a generating module, and the generating module is configured to generate a model file of the lunar time, including: a statistical expression rule of the lunar time, and according to the rule A regular expression into a lunar time that encapsulates the regular expression into a model file of the lunar time.
- the generating module is configured to set the exact time indicating the lunar time before the relative time indicating the lunar time.
- the extracting module is configured to traverse and match the content in the text with the regular expression in the model file; and extract the lunar time component obtained by the matching.
- the device further includes a building module, the building module is configured to construct a time object, and the converting module is configured to assign the extracted lunar time component to a control field of the time object to obtain a lunar calendar Time value.
- the conversion module is further configured to perform a relative time calculation on the acquired lunar time value when the lunar time component contains a relative time.
- the embodiment of the invention further provides a computer storage medium, the storage medium comprising a set of computer executable instructions, the instructions for performing the method for extracting the lunar time from the text according to the embodiment of the invention.
- the method, device and computer storage medium for extracting lunar time from text extracts a lunar time component in a text according to a model file of a lunar time; and converts the lunar time component into a lunar time value.
- the lunar time component can be automatically extracted from the text based on the model file of the lunar time, and the extracted lunar time component can be converted into a specific lunar time value, thereby realizing automatic extraction and conversion of the lunar time, which can be effective. It is convenient for users to manage their life and work schedule.
- FIG. 1 is a schematic flowchart of a method for extracting a lunar time from a text according to an embodiment of the present invention
- FIG. 2 is a schematic diagram of an implementation process of generating a model file of a lunar time according to an embodiment of the present invention
- FIG. 3 is a schematic diagram of an implementation process for extracting and converting a lunar time component according to an embodiment of the present invention
- FIG. 4 is a schematic structural diagram of a device for extracting lunar time from text according to an embodiment of the present invention.
- the lunar time component in the text is extracted according to the model file of the lunar time; the lunar time component is converted into the lunar time value.
- FIG. 1 is a schematic flowchart of a method for extracting lunar time from text according to an embodiment of the present invention, where the method includes the following steps:
- Step 101 Extract a lunar time component in the text according to a model file of the lunar time
- the method for extracting the lunar time from the text according to the embodiment of the present invention can be divided into two parts: generation of a lunar time model file, and extraction and conversion of lunar time components.
- the model file of the lunar time is first generated, that is, the first part of the method according to the embodiment of the present invention: the generation of the lunar time model file;
- FIG. 2 is generated according to the embodiment of the present invention.
- a schematic diagram of the implementation process of the model file of the lunar time, as shown in FIG. 2, the model file for generating the lunar time includes the following steps:
- Step 201 Statistics expression rules of the lunar time
- the statistical expression rule of the lunar time that is, the time expression of the lunar calendar in daily life, is a type of expression about the lunar time in daily life, rather than a specific description of the lunar time.
- the lunar calendar time in the daily life can be twenty-four solar terms, traditional festivals, heavenly branches, etc., in addition to the dates we often use.
- Step 202 Generate a regular expression of the lunar time according to the rule.
- a series of regular expressions of the lunar time are generated for the expression rules of the lunar time in daily life, and different regular expressions are divided by the or operator.
- the types of regular expressions of the commonly used lunar time can be including but not limited to:
- Step 203 Encapsulate the regular expression into a model file of a lunar time
- the series of regular expressions relating to the lunar time generated above is encapsulated into a model file of the lunar time, the model file being equivalent to a database of the lunar time model.
- Step 102 Convert the lunar time component into a lunar time value
- FIG. 3 is a schematic diagram of the implementation process of extracting and converting the lunar time component according to the embodiment of the present invention.
- the extracting and converting the lunar time component may include the following steps:
- Step 301 traverse the content in the text and the regular expression in the model file to determine whether there is a match, and if yes, execute step 302, and if not, end the process;
- step 302 is performed, and if there is no match, the process ends.
- the time description needs to be matched as a whole; for example, if There is a continuous time description in the text "8:00 in the New Year's Eve", the "New Year's Eve” can be described as a time, and the "8 o'clock in the evening” can also be described as a time, but because of the two What is expressed together is a complete description of time, so it is necessary to match "8 o'clock in the New Year's Eve" as a whole.
- Step 302 Extract the lunar time components obtained by the matching
- the extracted lunar time component has no meaning if it is only a simple description about the lunar time. Therefore, the lunar time component may include but not only the description about the lunar time. Not limited to: location, people, events, etc.
- Step 303 Assign the extracted lunar time component to a control field of the time object to obtain a lunar time value.
- the method according to the embodiment of the present invention needs to construct a time object, and the control field of the time object may include: year, month, day, hour, minute, and second; the extracted lunar time component is according to year, month, and day.
- the order of time, minute, and second is sequentially assigned to the control field of the time object, and the lunar time value can be obtained;
- the extracted lunar time component has a time description of "New Year's Eve, six o'clock, six o'clock, and dinner", and the "big year” indicates the lunar calendar or the first month, which needs to be dated according to the date.
- the number depends on the number. If the number after it is more than twenty, it means that the “big year” means the lunar calendar. If the number after it is the first few, it means that the “big year” means The first month, therefore, the “big year” mentioned in the above “New Year’s Eve” represents December; at this time, the “big year” is first converted into “12” and assigned to the “time object control field”.
- Step 304 When the relative time is included in the lunar time component, perform relative time calculation on the acquired lunar time value;
- FIG. 4 is a schematic structural diagram of a device for extracting lunar time from text according to an embodiment of the present invention, where the apparatus includes: an extraction module 42. And a conversion module 43; wherein
- the extraction module 42 is configured to extract a lunar time component in the text according to the model file of the lunar time;
- the extracting module 42 is configured to traverse the content in the text and the regular expression in the model file, and extract the matched lunar time components.
- the device may further include a generating module 41 configured to generate a model file of the lunar time;
- the model file for generating the lunar time includes: an expression rule for counting the lunar time, and generating a regular expression of the lunar time according to the rule, The regular expression is encapsulated into a model file of the lunar time;
- the generating module 41 is configured to set an exact time indicating the lunar time before a relative time indicating the lunar time.
- the conversion module 43 is configured to convert the lunar time component into a lunar time value
- the apparatus can also include a building block 44 configured to build a time object
- the conversion module 43 is configured to assign the extracted lunar time component to a control field of the time object to obtain a lunar time value.
- the conversion module 43 is further configured to: when the lunar time component contains relative time, Calculate the relative time of the obtained lunar time value.
- the generating module 41, the extracting module 42, the converting module 43, and the building module 44 may be implemented by a central processing unit (CPU), a microprocessor (MPU), a digital signal processor (DSP), or a field of the device.
- a central processing unit CPU
- MPU microprocessor
- DSP digital signal processor
- FPGA Programmable Gate Array
- the embodiment of the invention further provides a computer storage medium, the storage medium comprising a set of computer executable instructions, the instructions for performing the method for extracting the lunar time from the text according to the embodiment of the invention.
- the method for extracting the lunar time from the text described in the embodiment of the present invention may also be stored in a computer readable storage medium if it is implemented in the form of a software function module and sold or used as a separate product.
- a computer readable storage medium if it is implemented in the form of a software function module and sold or used as a separate product.
- embodiments of the present application can be provided as a method, system, or computer program product.
- the present application can take the form of an entirely hardware embodiment, an entirely software embodiment, or an embodiment in combination of software and hardware.
- the application can take the form of a computer program product embodied on one or more computer-usable storage media containing computer usable program code, including but not limited to a USB flash drive, a mobile hard drive, a read only memory (ROM, Read-Only Memory), disk storage, CD-ROM, optical storage, etc.
- a USB flash drive a mobile hard drive
- a read only memory ROM, Read-Only Memory
- disk storage CD-ROM, optical storage, etc.
- the computer program instructions can also be stored in a computer readable memory that can direct a computer or other programmable data processing device to operate in a particular manner, such that the computer is readable and stored
- the instructions in the reservoir produce an article of manufacture comprising an instruction device that implements the functions specified in one or more blocks of the flow or in a flow or block diagram of the flowchart.
- These computer program instructions can also be loaded onto a computer or other programmable data processing device such that a series of operational steps are performed on a computer or other programmable device to produce computer-implemented processing for execution on a computer or other programmable device.
- the instructions provide steps for implementing the functions specified in one or more of the flow or in a block or blocks of a flow diagram.
- the embodiment of the present invention further provides a computer storage medium, wherein a computer program is stored, and the computer program is used to execute the method for extracting the lunar time from the text as described in the method embodiment of the present invention.
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Artificial Intelligence (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Computational Linguistics (AREA)
- General Health & Medical Sciences (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Machine Translation (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
Un mode de réalisation de la présente invention concerne un procédé d'extraction d'un temps lunaire chinois à partir d'un texte. Ledit procédé comprend les étapes consistant à : extraire un élément de temps lunaire dans un texte d'après un fichier de modèle de temps lunaires (101); et convertir l'élément de temps lunaire en valeur de temps lunaire (102). Le mode de réalisation de la présente invention concerne également un dispositif d'extraction du temps lunaire à partir du texte et un support de stockage informatique.
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201410270720.X | 2014-06-17 | ||
| CN201410270720.XA CN105183704A (zh) | 2014-06-17 | 2014-06-17 | 一种从文本中提取农历时间的方法及装置 |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2015192567A1 true WO2015192567A1 (fr) | 2015-12-23 |
Family
ID=54905793
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/CN2014/089181 Ceased WO2015192567A1 (fr) | 2014-06-17 | 2014-10-22 | Procédé et dispositif d'extraction d'un temps lunaire chinois à partir d'un texte et support de stockage informatique |
Country Status (2)
| Country | Link |
|---|---|
| CN (1) | CN105183704A (fr) |
| WO (1) | WO2015192567A1 (fr) |
Families Citing this family (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN107729310A (zh) * | 2016-08-11 | 2018-02-23 | 中兴通讯股份有限公司 | 一种文本信息的提取方法、装置和移动终端 |
Citations (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20090192956A1 (en) * | 2008-01-28 | 2009-07-30 | Xerox Corporation | Method and apparatus for structuring documents utilizing recognition of an ordered sequence of identifiers |
| CN101727438A (zh) * | 2008-10-30 | 2010-06-09 | 北大方正集团有限公司 | 一种数字报刊版面信息的自动提取方法 |
Family Cites Families (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN101083635B (zh) * | 2007-04-04 | 2010-07-21 | 腾讯科技(深圳)有限公司 | 在日历中更换历法时间以管理事件的方法、系统及设备 |
| CN102193951A (zh) * | 2010-03-19 | 2011-09-21 | 华为技术有限公司 | 信息抽取的方法及系统 |
| CN103093334A (zh) * | 2011-11-04 | 2013-05-08 | 周超然 | 活动通知文本识别并自动转换为日历项的方法 |
| CN103593360A (zh) * | 2012-08-16 | 2014-02-19 | 江苏金鸽网络科技有限公司 | 基于页面分析的互联网信息发表时间提取方法 |
-
2014
- 2014-06-17 CN CN201410270720.XA patent/CN105183704A/zh not_active Withdrawn
- 2014-10-22 WO PCT/CN2014/089181 patent/WO2015192567A1/fr not_active Ceased
Patent Citations (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20090192956A1 (en) * | 2008-01-28 | 2009-07-30 | Xerox Corporation | Method and apparatus for structuring documents utilizing recognition of an ordered sequence of identifiers |
| CN101727438A (zh) * | 2008-10-30 | 2010-06-09 | 北大方正集团有限公司 | 一种数字报刊版面信息的自动提取方法 |
Non-Patent Citations (1)
| Title |
|---|
| LIN, JING ET AL.: "Automatic TIME X2 Tagging of Chinese Temporal Information", JOURNAL OF TSINGHUA UNIVERSITY ( SCIENCE AND TECHNOLOGY, vol. 48, no. 1, 15 January 2008 (2008-01-15) * |
Also Published As
| Publication number | Publication date |
|---|---|
| CN105183704A (zh) | 2015-12-23 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| JP6667931B2 (ja) | 音声情報から時間情報を認識するための方法およびデバイス | |
| US9524071B2 (en) | Threshold view | |
| TW201602932A (zh) | 在具有時間線的日曆上搜尋及定位事件 | |
| CN110019200B (zh) | 一种索引的建立、使用方法及装置 | |
| CN107451832A (zh) | 推送信息的方法和装置 | |
| US20170103012A1 (en) | Live data fabrication | |
| EP3483746A1 (fr) | Procédés et dispositifs pour générer des données afin de former un composant de compréhension du langage naturel | |
| US9754015B2 (en) | Feature rich view of an entity subgraph | |
| CN104809126B (zh) | 业务流程引擎系统 | |
| SG10201702888XA (en) | Platform for the integration of operational bim, operational intelligence, and user journeys for the simplified and unified management of smart cities | |
| CN111078701A (zh) | 基于关系型数据库的数据抽取方法及装置 | |
| US10726034B2 (en) | Modular electronic data analysis computing system | |
| WO2017041377A1 (fr) | Méthode et dispositif de génération d'informations de rappel de calendrier | |
| CN103902728A (zh) | 一种智能手表的语音信号存储方法及装置 | |
| US10210165B2 (en) | File management for eDiscovery | |
| WO2015192567A1 (fr) | Procédé et dispositif d'extraction d'un temps lunaire chinois à partir d'un texte et support de stockage informatique | |
| JP2016524733A (ja) | カレンダアプリケーションの入力ストリームを処理するシステムおよび方法 | |
| GB2534481A (en) | Data analytics for oilfield data repositories | |
| CN110533398B (zh) | 智慧手表及其提示管理方法、计算机可读存储介质 | |
| US9824076B2 (en) | Big file editor | |
| CN105282341B (zh) | 一种日程建立方法和装置 | |
| US20160092824A1 (en) | System and method for case management information entry | |
| JP2012118616A (ja) | 電子文書比較装置、電子文書比較方法、プログラムおよび記録媒体 | |
| Sagar | BabyX | |
| KR20180109143A (ko) | 복수의 일정을 포함하는 태스크 기반의 일정 관리 및 추천 시스템 |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 14895131 Country of ref document: EP Kind code of ref document: A1 |
|
| NENP | Non-entry into the national phase |
Ref country code: DE |
|
| 122 | Ep: pct application non-entry in european phase |
Ref document number: 14895131 Country of ref document: EP Kind code of ref document: A1 |