PATENTSCOPE will be unavailable a few hours for maintenance reason on Tuesday 19.11.2019 at 4:00 PM CET
Search International and National Patent Collections
Some content of this application is unavailable at the moment.
If this situation persists, please contact us atFeedback&Contact
1. (MXMX/a/2008/000180) GRAMMATICAL PARSING OF DOCUMENT VISUAL STRUCTURES

Office : Mexico
Application Number: MX/a/2008/000180 Application Date: 07.01.2008
Publication Number: MX/a/2008/000180 Publication Date: 04.07.2008
Publication Kind : A
Prior PCT appl.: Application Number:US2006026140 ; Publication Number:07005937 Click to see the data
IPC:
G06K 9/72
G PHYSICS
06
COMPUTING; CALCULATING; COUNTING
K
RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
9
Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
62
Methods or arrangements for recognition using electronic means
72
using context analysis based on the provisionally recognised identity of a number of successive patterns, e.g. a word
Applicants: MICROSOFT CORPORATION.*
Inventors: Paul A. Viola
Michael Shilman
Agents: CESAR RAMOS DE MIGUEL*
Priority Data: 11173280 01.07.2005 US
Title: (EN) GRAMMATICAL PARSING OF DOCUMENT VISUAL STRUCTURES
(ES) ANALISIS GRAMATICAL DE ESTRUCTURAS VISUALES DE DOCUMENTOS
Abstract:
(EN) A two-dimensional representation of a document is leveraged to extract a hierarchical structure that facilitates recognition of the document. The visual structure is grammatically parsed utilizing two-dimensional adaptations of statistical parsing algorithms. This allows recognition of layout structures (e.g., columns, authors, titles, footnotes, etc.) and the like such that structural components of the document can be accurately interpreted. Additional techniques can also be employed to facilitate document layout recognition. For example, grammatical parsing techniques that utilize machine learning, parse scoring based on image representations, boosting techniques, and/or"fast features"and the like can be employed to facilitate in document recognition.
(ES) Una representación bidimensional de u documento es apalancado para extraer una estructura jerárquica que facilita el reconocimiento del documento. La estructura visual es gramáticamente analizada utilizando adaptaciones bidimensionales de algoritmos de análisis estadístico. Esto permite el reconocimiento de estructuras de presentación (por ejemplo, columnas, autores, títulos, nota de pie de página, etc.) y similares de manera que los componentes estructurales del documento puede ser interpretados con exactitud. También se pueden emplear técnicas adicionales para facilitar el reconocimiento de presentación del como documento. Por ejemplo, para facilitar el reconocimiento de documento, se pueden emplear técnicas de análisis gramatical que utilizan aprendizaje de máquina, clasificación de análisis a base de representaciones de imágenes técnicas de intensificación io complementarias, y/o"características rápidas", y similares.
Also published as:
NO20080090NZ565147KR1020080026128EP1894144ZA2008/00041JP2009500755
RU0002421810CN101253514CA2614177IN40/DELNP/2008WO/2007/005937