Processing

Please wait...

PATENTSCOPE will be unavailable a few hours for maintenance reason on Saturday 31.10.2020 at 7:00 AM CET
Settings

Settings

Goto Application

1. WO2016154466 - METHOD AND APPARATUS FOR GENERATING TEXT LINE CLASSIFIER

Publication Number WO/2016/154466
Publication Date 29.09.2016
International Application No. PCT/US2016/024069
International Filing Date 24.03.2016
IPC
G06F 17/27 2006.01
GPHYSICS
06COMPUTING; CALCULATING OR COUNTING
FELECTRIC DIGITAL DATA PROCESSING
17Digital computing or data processing equipment or methods, specially adapted for specific functions
20Handling natural language data
27Automatic analysis, e.g. parsing, orthograph correction
G06K 9/00 2006.01
GPHYSICS
06COMPUTING; CALCULATING OR COUNTING
KRECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
9Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
G06K 9/72 2006.01
GPHYSICS
06COMPUTING; CALCULATING OR COUNTING
KRECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
9Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
62Methods or arrangements for recognition using electronic means
72using context analysis based on the provisionally recognised identity of a number of successive patterns, e.g. a word
G06K 9/18 2006.01
GPHYSICS
06COMPUTING; CALCULATING OR COUNTING
KRECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
9Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
18using printed characters having additional code marks or containing code marks, e.g. the character being composed of individual strokes of different shape, each representing a different code value
G06K 9/34 2006.01
GPHYSICS
06COMPUTING; CALCULATING OR COUNTING
KRECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
9Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
20Image acquisition
34Segmentation of touching or overlapping patterns in the image field
G06K 9/46 2006.01
GPHYSICS
06COMPUTING; CALCULATING OR COUNTING
KRECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
9Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
36Image preprocessing, i.e. processing the image information without deciding about the identity of the image
46Extraction of features or characteristics of the image
CPC
G06K 2209/013
GPHYSICS
06COMPUTING; CALCULATING; COUNTING
KRECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
2209Indexing scheme relating to methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
01Character recognition
013of non-latin characters other than Kanji, Hiragana or Katakana characters
G06K 9/00
GPHYSICS
06COMPUTING; CALCULATING; COUNTING
KRECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
9Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
G06K 9/00456
GPHYSICS
06COMPUTING; CALCULATING; COUNTING
KRECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
9Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
00442Document analysis and understanding; Document recognition
00456Classification of image contents, e.g. text, photographs, tables
G06K 9/00865
GPHYSICS
06COMPUTING; CALCULATING; COUNTING
KRECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
9Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
00852Recognising whole cursive words
00865using stroke segmentation
G06K 9/6255
GPHYSICS
06COMPUTING; CALCULATING; COUNTING
KRECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
9Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
62Methods or arrangements for recognition using electronic means
6217Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
6255Determining representative reference patterns, e.g. averaging or distorting patterns; Generating dictionaries, e.g. user dictionaries
G06K 9/6821
GPHYSICS
06COMPUTING; CALCULATING; COUNTING
KRECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
9Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
62Methods or arrangements for recognition using electronic means
68using sequential comparisons of the image signals with a plurality of references ; in which the sequence of the image signals or the references is relevant; , e.g. addressable memory
6807Dividing the references in groups prior to recognition, the recognition taking place in steps; Selecting relevant dictionaries
6814according to the graphical properties
6821Alphabet recognition, e.g. Latin, Kanji, Katakana
Applicants
  • ALIBABA GROUP HOLDING LIMITED
Inventors
  • JIN, Xuan
  • WANG, Tianzhou
  • XUE, Qin
Agents
  • MURABITO, Anthony, C.
Priority Data
201510133507.925.03.2015CN
Publication Language English (EN)
Filing Language English (EN)
Designated States
Title
(EN) METHOD AND APPARATUS FOR GENERATING TEXT LINE CLASSIFIER
(FR) PROCÉDÉ ET APPAREIL DE GÉNÉRATION D'UN CLASSIFICATEUR DE LIGNES DE TEXTE
Abstract
(EN)
A method of generating a text line classifier including generating text line samples by use of a present terminal system font reservoir. The method also includes extracting features from the text line samples and pre-stored marked-up samples. The method further includes training models by use of the extracted features to generate a text line classifier for recognizing text regions. With the system font reservoir being utilized for generating text line samples, the generated text line classifiers can target different scenes or different requirements for text region recognition with a high degree of applicability and wide application in addition to ease of implementation. Together with the combinational use of the marked up samples for extracting features from the text line samples, the generated text line classifiers provide for enhanced classification efficiency and accuracy.
(FR)
La présente invention concerne un procédé de génération d'un classificateur de lignes de texte. Le procédé comprend les étapes consistant à : générer des échantillons de lignes de texte en utilisant une réserve actuelle de polices de caractères d'un système de terminal; extraire des caractéristiques à partir des échantillons de lignes de texte et d'échantillons marqués préstockés; et former des modèles en utilisant les caractéristiques extraites de façon à générer un classificateur de lignes de texte permettant de reconnaître des zones de texte. Grâce à la réserve de polices de caractères du système utilisée pour générer des échantillons de lignes de texte, les classificateurs de lignes de texte générés peuvent cibler différentes scènes ou différentes exigences de reconnaissance de zones de texte avec un haut degré d'applicabilité et une large application qui s'ajoutent à la facilité d'implémentation. Associés à l'utilisation combinée des échantillons marqués pour extraire les caractéristiques à partir des échantillons de lignes de texte, les classificateurs de lignes de texte générés assurent une efficacité et une précision accrues de la classification.
Latest bibliographic data on file with the International Bureau