WO2014145999A3 - Searching text by optical character recognition - Google Patents

Searching text by optical character recognition

Info

Publication number
WO2014145999A3
Authority
WO
WIPO (PCT)
Prior art keywords
character
character recognition
optical character
ocr
searching text
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/US2014/030867
Other languages
French (fr)
Other versions
WO2014145999A2 (en)
Inventor
Sergio David SUAREZ Jr.
Joshua Daniel MESKE
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2013-03-15
Filing date
2014-03-17
Publication date
2014-11-06
Application filed by Individual
Publication of WO2014145999A2
Publication of WO2014145999A3

Classifications

    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 Arrangements for image or video recognition or understanding
    • G06V10/98 Detection or correction of errors, e.g. by rescanning the pattern or by human intervention; Evaluation of the quality of the acquired patterns

Landscapes

  • Engineering & Computer Science (AREA)
  • Quality & Reliability (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Theoretical Computer Science (AREA)
  • Character Discrimination (AREA)

Abstract

A method for generating a character-by-character substitution in an optical character recognition (OCR) text output of a document including at least one character includes executing, on a processor, instructions for substituting an OCR key for the at least one character. The instructions include: identifying a class corresponding to the at least one character, wherein the class includes a character shape corresponding to at least a portion of the at least one character; substituting the OCR key corresponding to the character shape for the at least one character; and generating a searchable substituted document including the OCR key.
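
The three steps in the abstract (identify a shape class, substitute its OCR key, produce a searchable substituted document) can be sketched roughly as follows. This is a minimal illustration only, not the applicants' implementation: the shape classes, the choice of keys, and the names SHAPE_CLASSES, CHAR_TO_KEY, substitute and search are all hypothetical and do not come from the patent text.

```python
# Hypothetical shape classes: each key stands for a group of characters
# that look alike and that OCR engines often confuse. The groupings are
# assumptions made for this sketch, not taken from the patent.
SHAPE_CLASSES = {
    "1": "lI1|",  # single vertical stroke
    "0": "oO0",   # closed round shape
    "5": "sS5",   # S-like curve
    "8": "B8",    # two stacked loops
}

# Invert the table so each character maps directly to its OCR key.
CHAR_TO_KEY = {
    ch: key for key, members in SHAPE_CLASSES.items() for ch in members
}


def substitute(text: str) -> str:
    """Replace every character that belongs to a shape class with the
    class key; characters outside every class pass through unchanged."""
    return "".join(CHAR_TO_KEY.get(ch, ch) for ch in text)


def search(query: str, ocr_text: str) -> bool:
    """Search in the substituted space: both the query and the OCR output
    are rewritten with the same keys before comparison."""
    return substitute(query) in substitute(ocr_text)


if __name__ == "__main__":
    ocr_output = "Chicago, I11inois 60601"  # OCR misread "ll" as "11"
    print(substitute("Illinois"))           # 111in0i5
    print(search("Illinois", ocr_output))   # True
```

Because the query and the OCR output are both reduced to the same keys, a misrecognition that stays inside one shape class no longer blocks a match. The cost, under these assumed classes, is lower precision: distinct strings that differ only within a class become indistinguishable.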

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US201361798223P 2013-03-15 2013-03-15
US61/798,223 2013-03-15

Publications (2)

Publication Number Publication Date
WO2014145999A2 (en) 2014-09-18
WO2014145999A3 (en) 2014-11-06

Family

ID=51538590

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2014/030867 Ceased WO2014145999A2 (en) 2013-03-15 2014-03-17 System and method for searching through text transcribed from an image processed by optical character recognition

Country Status (1)

Country Link
WO (1) WO2014145999A2 (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114648002B (en) * 2020-12-17 2024-12-24 永中软件股份有限公司 Method for outputting multiple Office document content images through multiple processes

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7415171B2 (en) * 2005-03-28 2008-08-19 United States Postal Service Multigraph optical character reader enhancement systems and methods
US20100246963A1 (en) * 2009-03-26 2010-09-30 Al-Muhtaseb Husni A Automatic arabic text image optical character recognition method

Also Published As

Publication number Publication date
WO2014145999A2 (en) 2014-09-18

Similar Documents

Publication Publication Date Title
WO2015200110A3 (en) Techniques for machine language translation of text from an image based on non-textual context information from the image
CA2879417A1 (en) Structured search queries based on social-graph information
WO2016109307A3 (en) Discriminating ambiguous expressions to enhance user experience
WO2013009578A3 (en) Systems and methods for speech command processing
EP3136257A3 (en) Document-specific gazetteers for named entity recognition
EP2757487A3 (en) Machine translation-driven authoring system and method
WO2011159460A3 (en) Identifying establishments in images
Matthewson et al. Inchoativity meets the perfect time span: The Niuean perfect
PH12015000372A1 (en) Conversion of documents of different types to a uniform and an editable or a searchable format
EP4428742A3 (en) Enhancing reading accuracy, efficiency and retention
EP2811414A3 (en) Confidence-driven rewriting of source texts for improved translation
MX2016016289A (en) Learning and using contextual content retrieval rules for query disambiguation.
TW201612773A (en) Multi-command single utterance input method
WO2014209810A3 (en) Methods and apparatuses for mining synonymous phrases, and for searching related content
WO2016035072A3 (en) Sentiment rating system and method
WO2013163644A3 (en) Updating a search index used to facilitate application searches
WO2012134972A3 (en) Systems and methods for paragraph-based document searching
MX387895B (en) METHOD FOR TEXT RECOGNITION AND COMPUTER PROGRAM PRODUCT.
BR112014026626A2 (en) creation of social networking groups
WO2014210387A3 (en) Concept extraction
GB2542053A (en) Automatically generating a semantic mapping for a relational database
WO2015038408A3 (en) Creating inforgraphics from text data in electronic documents
CL2016000984A1 (en) System and method for implementing multi-faceted search queries
WO2013025624A3 (en) Searching encrypted electronic books
MY194297A (en) A method and device for providing search engine label

Legal Events

Date Code Title Description
121 EP: The EPO has been informed by WIPO that EP was designated in this application (Ref document number: 14763293; Country of ref document: EP; Kind code of ref document: A2)
122 EP: PCT application non-entry in European phase (Ref document number: 14763293; Country of ref document: EP; Kind code of ref document: A2)