Traitement en cours

Veuillez attendre...

Paramétrages

Paramétrages

Aller à Demande

1. US20110222773 - Paragraph recognition in an optical character recognition (OCR) process

Office
États-Unis d'Amérique
Numéro de la demande 12720992
Date de la demande 10.03.2010
Numéro de publication 20110222773
Date de publication 15.09.2011
Numéro de délivrance 08565474
Date de délivrance 22.10.2013
Type de publication B2
CIB
G06K 9/00
GPHYSIQUE
06CALCUL; COMPTAGE
KRECONNAISSANCE DES DONNÉES; PRÉSENTATION DES DONNÉES; SUPPORTS D'ENREGISTREMENT; MANIPULATION DES SUPPORTS D'ENREGISTREMENT
9Méthodes ou dispositions pour la lecture ou la reconnaissance de caractères imprimés ou écrits ou pour la reconnaissance de formes, p.ex. d'empreintes digitales
Déposants Radakovic Bogdan
Microsoft Corporation
Galic Sasa
Uzelac Aleksandar
Inventeurs Radakovic Bogdan
Galic Sasa
Uzelac Aleksandar
Mandataires Mayer & Williams, PC
Titre
(EN) Paragraph recognition in an optical character recognition (OCR) process
Abrégé
(EN)

An image processing apparatus for detecting paragraphs in a textual image includes an input component for receiving an input image in which textual lines and words have been identified and a page classification component for classifying the input image as a first or second page type. The apparatus also includes a paragraph detection component for classifying all textual lines on the input image as a beginning paragraph line or a continuation paragraph line. The apparatus is also provided with a paragraph creation component for creating paragraphs that include textual lines between two successive beginning paragraph lines, including a first of the two successive beginning paragraph lines. The paragraphs that have been identified may be classified by the type of alignment they exhibit. For instance, paragraphs may be classified according to whether they are left aligned, right aligned, center aligned or justified.


Documents de brevet associés