WO2014145999A3 - Searching text by optical character recognition - Google Patents
Searching text by optical character recognition Download PDFInfo
- Publication number
- WO2014145999A3 WO2014145999A3 PCT/US2014/030867 US2014030867W WO2014145999A3 WO 2014145999 A3 WO2014145999 A3 WO 2014145999A3 US 2014030867 W US2014030867 W US 2014030867W WO 2014145999 A3 WO2014145999 A3 WO 2014145999A3
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- character
- character recognition
- optical character
- ocr
- searching text
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/98—Detection or correction of errors, e.g. by rescanning the pattern or by human intervention; Evaluation of the quality of the acquired patterns
Landscapes
- Engineering & Computer Science (AREA)
- Quality & Reliability (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Multimedia (AREA)
- Theoretical Computer Science (AREA)
- Character Discrimination (AREA)
Abstract
A method for generating a character-by-character substitution in an optical character recognition (OCR) text output of a document including at least one character, includes: executing on a processor instructions for substituting an OCR key for the at least one character. The instructions include: identifying a class corresponding to the at least one character, wherein the class includes a character shape corresponding to at least a portion of the at least one character; substituting the OCR key including to the character shape for the at least one character; and generating a searchable substituted document including the OCR key.
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US201361798223P | 2013-03-15 | 2013-03-15 | |
| US61/798,223 | 2013-03-15 |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| WO2014145999A2 WO2014145999A2 (en) | 2014-09-18 |
| WO2014145999A3 true WO2014145999A3 (en) | 2014-11-06 |
Family
ID=51538590
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/US2014/030867 Ceased WO2014145999A2 (en) | 2013-03-15 | 2014-03-17 | System and method for searching through text transcribed from an image processed by optical character recognition |
Country Status (1)
| Country | Link |
|---|---|
| WO (1) | WO2014145999A2 (en) |
Families Citing this family (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN114648002B (en) * | 2020-12-17 | 2024-12-24 | 永中软件股份有限公司 | Method for outputting multiple Office document content images through multiple processes |
Citations (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US7415171B2 (en) * | 2005-03-28 | 2008-08-19 | United States Postal Service | Multigraph optical character reader enhancement systems and methods |
| US20100246963A1 (en) * | 2009-03-26 | 2010-09-30 | Al-Muhtaseb Husni A | Automatic arabic text image optical character recognition method |
-
2014
- 2014-03-17 WO PCT/US2014/030867 patent/WO2014145999A2/en not_active Ceased
Patent Citations (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US7415171B2 (en) * | 2005-03-28 | 2008-08-19 | United States Postal Service | Multigraph optical character reader enhancement systems and methods |
| US20100246963A1 (en) * | 2009-03-26 | 2010-09-30 | Al-Muhtaseb Husni A | Automatic arabic text image optical character recognition method |
Also Published As
| Publication number | Publication date |
|---|---|
| WO2014145999A2 (en) | 2014-09-18 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| WO2015200110A3 (en) | Techniques for machine language translation of text from an image based on non-textual context information from the image | |
| CA2879417A1 (en) | Structured search queries based on social-graph information | |
| WO2016109307A3 (en) | Discriminating ambiguous expressions to enhance user experience | |
| WO2013009578A3 (en) | Systems and methods for speech command processing | |
| EP3136257A3 (en) | Document-specific gazetteers for named entity recognition | |
| EP2757487A3 (en) | Machine translation-driven authoring system and method | |
| WO2011159460A3 (en) | Identifying establishments in images | |
| Matthewson et al. | Inchoativity meets the perfect time span: The Niuean perfect | |
| PH12015000372A1 (en) | Conversion of documents of different types to a uniform and an editable or a searchable format | |
| EP4428742A3 (en) | Enhancing reading accuracy, efficiency and retention | |
| EP2811414A3 (en) | Confidence-driven rewriting of source texts for improved translation | |
| MX2016016289A (en) | Learning and using contextual content retrieval rules for query disambiguation. | |
| TW201612773A (en) | Multi-command single utterance input method | |
| WO2014209810A3 (en) | Methods and apparatuses for mining synonymous phrases, and for searching related content | |
| WO2016035072A3 (en) | Sentiment rating system and method | |
| WO2013163644A3 (en) | Updating a search index used to facilitate application searches | |
| WO2012134972A3 (en) | Systems and methods for paragraph-based document searching | |
| MX387895B (en) | METHOD FOR TEXT RECOGNITION AND COMPUTER PROGRAM PRODUCT. | |
| BR112014026626A2 (en) | creation of social networking groups | |
| WO2014210387A3 (en) | Concept extraction | |
| GB2542053A (en) | Automatically generating a semantic mapping for a relational database | |
| WO2015038408A3 (en) | Creating inforgraphics from text data in electronic documents | |
| CL2016000984A1 (en) | System and method for implementing multi-faceted search queries | |
| WO2013025624A3 (en) | Searching encrypted electronic books | |
| MY194297A (en) | A method and device for providing search engine label |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 14763293 Country of ref document: EP Kind code of ref document: A2 |
|
| 122 | Ep: pct application non-entry in european phase |
Ref document number: 14763293 Country of ref document: EP Kind code of ref document: A2 |