WO2020117649 - METHODS AND SYSTEMS FOR AUTOMATED TABLE DETECTION WITHIN DOCUMENTS

Publication Number WO/2020/117649
Publication Date 11.06.2020
International Application No. PCT/US2019/063954
International Filing Date 02.12.2019
IPC
G06K 9/00 2006.01
G PHYSICS
06 COMPUTING; CALCULATING OR COUNTING
K RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
9 Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
G06K 9/36 2006.01
G PHYSICS
06 COMPUTING; CALCULATING OR COUNTING
K RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
9 Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
36 Image preprocessing, i.e. processing the image information without deciding about the identity of the image
G06K 9/46 2006.01
G PHYSICS
06 COMPUTING; CALCULATING OR COUNTING
K RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
9 Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
36 Image preprocessing, i.e. processing the image information without deciding about the identity of the image
46 Extraction of features or characteristics of the image
G06N 3/02 2006.01
G PHYSICS
06 COMPUTING; CALCULATING OR COUNTING
N COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
3 Computer systems based on biological models
02 using neural network models
CPC
G06K 2209/01
G PHYSICS
06 COMPUTING; CALCULATING; COUNTING
K RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
2209 Indexing scheme relating to methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
01 Character recognition
G06K 9/00449
G PHYSICS
06 COMPUTING; CALCULATING; COUNTING
K RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
9 Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
00442 Document analysis and understanding; Document recognition
00449 Layout structured with printed lines or input boxes, e.g. business forms, tables
G06K 9/00456
G PHYSICS
06 COMPUTING; CALCULATING; COUNTING
K RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
9 Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
00442 Document analysis and understanding; Document recognition
00456 Classification of image contents, e.g. text, photographs, tables
G06K 9/00463
G PHYSICS
06 COMPUTING; CALCULATING; COUNTING
K RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
9 Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
00442 Document analysis and understanding; Document recognition
00463 Document analysis by extracting the geometrical structure, e.g. layout tree; Block segmentation, e.g. bounding boxes for graphics, paragraphs, words or letters
G06K 9/4628
G PHYSICS
06 COMPUTING; CALCULATING; COUNTING
K RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
9 Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
36 Image preprocessing, i.e. processing the image information without deciding about the identity of the image
46 Extraction of features or characteristics of the image
4604 Detecting partial patterns, e.g. edges or contours, or configurations, e.g. loops, corners, strokes, intersections
4609 by matching or filtering
4619 Biologically-inspired filters, e.g. receptive fields
4623 with interaction between the responses of different filters
4628 Integrating the filters into a hierarchical structure
G06K 9/6218
G PHYSICS
06 COMPUTING; CALCULATING; COUNTING
K RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
9 Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
62 Methods or arrangements for recognition using electronic means
6217 Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
6218 Clustering techniques
Applicants
  • LEVERTON HOLDING LLC [US]/[US]
Inventors
  • SCHÄFER, Christian
  • KIEWEG, Michael
Agents
  • REIBMAN, Andrew, L.
  • PLUMMER, Kelly, A.
  • WEBER, Brett, J.
  • HUBBARD, Nolan
  • MAJEWSKI, Dennis
Priority Data
62/775,062 04.12.2018 US
Publication Language English (EN)
Filing Language English (EN)
Designated States
Title
(EN) METHODS AND SYSTEMS FOR AUTOMATED TABLE DETECTION WITHIN DOCUMENTS
(FR) PROCÉDÉS ET SYSTÈMES DE DÉTECTION AUTOMATIQUE DE TABLE DANS DES DOCUMENTS
Abstract
(EN)
Methods and systems for detecting tables within documents are provided. The methods and systems may include receiving a text of the document that includes a plurality of words depicted in the document image. Feature sets may be calculated for the words and may contain one or more features of a corresponding word of the text. Candidate table words may then be identified based on the feature sets, and may then be used to identify a table location within the document image. In some cases, the candidate table words may be identified using a machine learning model.
(FR)
L'invention concerne des procédés et des systèmes de détection de tables dans des documents. Les procédés et les systèmes peuvent consister à recevoir un texte du document qui contient une pluralité de mots représentés dans l'image de document. Des ensembles de caractéristiques peuvent être calculés pour les mots et peuvent contenir une ou plusieurs caractéristiques d'un mot correspondant du texte. Des mots de table candidats peuvent alors être identifiés en fonction des vecteurs caractéristiques, et peuvent ensuite être utilisés pour identifier un emplacement de table dans l'image de document. Dans certains cas, les mots de table candidats peuvent être identifiés en utilisant un modèle d'apprentissage automatique.
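The pipeline outlined in the abstract (per-word feature sets, candidate table words, table location) can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration and is not taken from the application: the `Word` record, the two features (digit ratio and horizontal gap to the nearest word in the same row), the thresholds, and the rule-based `is_candidate` function, which merely stands in for the machine learning model the abstract mentions.

```python
from dataclasses import dataclass


@dataclass
class Word:
    """Hypothetical OCR word with its bounding box in image coordinates."""
    text: str
    x0: float
    y0: float
    x1: float
    y1: float


def feature_set(word, words, row_tol=2.0):
    """Compute an illustrative feature set for one word (assumed features)."""
    digits = sum(ch.isdigit() for ch in word.text)
    digit_ratio = digits / max(len(word.text), 1)
    # Words whose top edge is within row_tol of this word count as the same row.
    row = [w for w in words if w is not word and abs(w.y0 - word.y0) < row_tol]
    left = [word.x0 - w.x1 for w in row if w.x1 <= word.x0]
    right = [w.x0 - word.x1 for w in row if w.x0 >= word.x1]
    # Gap to the nearest neighbour on each side; 0.0 when there is none.
    max_gap = max(min(left, default=0.0), min(right, default=0.0))
    return {"digit_ratio": digit_ratio, "max_gap": max_gap}


def is_candidate(features, min_digit_ratio=0.5, min_gap=20.0):
    """Rule-based stand-in for a trained classifier (thresholds are assumed)."""
    return (features["digit_ratio"] >= min_digit_ratio
            or features["max_gap"] >= min_gap)


def table_location(words):
    """Return the bounding box enclosing all candidate table words, or None."""
    candidates = [w for w in words if is_candidate(feature_set(w, words))]
    if not candidates:
        return None
    return (min(w.x0 for w in candidates), min(w.y0 for w in candidates),
            max(w.x1 for w in candidates), max(w.y1 for w in candidates))
```

In this sketch a word becomes a candidate when it is mostly numeric or is separated from its row neighbour by a wide gap, and the table location is simply the box enclosing all candidates. A real system would likely replace `is_candidate` with a trained model and group candidates into separate tables rather than a single box.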