Processing

Please wait...

Settings

Settings

Goto Application

1. US10325149 - Systems and methods for automatically identifying document information

Office United States of America
Application Number 16122819
Application Date 05.09.2018
Publication Number 10325149
Publication Date 18.06.2019
Grant Number 10325149
Grant Date 18.06.2019
Publication Kind B1
IPC
G06F 17/22
GPHYSICS
06COMPUTING; CALCULATING OR COUNTING
FELECTRIC DIGITAL DATA PROCESSING
17Digital computing or data processing equipment or methods, specially adapted for specific functions
20Handling natural language data
21Text processing
22Manipulating or registering by use of codes, e.g. in sequence of text characters
G06K 9/00
GPHYSICS
06COMPUTING; CALCULATING OR COUNTING
KRECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
9Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
G06F 16/435
GPHYSICS
06COMPUTING; CALCULATING OR COUNTING
FELECTRIC DIGITAL DATA PROCESSING
16Information retrieval; Database structures therefor; File system structures therefor
40of multimedia data, e.g. slideshows comprising image and additional audio data
43Querying
435Filtering based on additional data, e.g. user or group profiles
CPC
G06F 16/435
GPHYSICS
06COMPUTING; CALCULATING; COUNTING
FELECTRIC DIGITAL DATA PROCESSING
16Information retrieval; Database structures therefor; File system structures therefor
40of multimedia data, e.g. slideshows comprising image and additional audio data
43Querying
435Filtering based on additional data, e.g. user or group profiles
G06K 9/00483
GPHYSICS
06COMPUTING; CALCULATING; COUNTING
KRECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
9Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
00442Document analysis and understanding; Document recognition
00483Document matching
G06K 9/00463
GPHYSICS
06COMPUTING; CALCULATING; COUNTING
KRECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
9Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
00442Document analysis and understanding; Document recognition
00463Document analysis by extracting the geometrical structure, e.g. layout tree; Block segmentation, e.g. bounding boxes for graphics, paragraphs, words or letters
G06K 9/00469
GPHYSICS
06COMPUTING; CALCULATING; COUNTING
KRECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
9Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
00442Document analysis and understanding; Document recognition
00469Document understanding by extracting the logical structure, e.g. chapters, sections, columns, titles, paragraphs, captions, page number, and identifying its elements, e.g. author, keywords, ZIP code, money amount
Applicants Coupa Software Incorporated
Inventors Mark Oliver Burch
Hanieh Borhanazad
Agents Hickman Palermo Becker Bingham LLP
Title
(EN) Systems and methods for automatically identifying document information
Abstract
(EN)

A computer-implemented method comprises defining a set of canonical features for a document type and a plurality of attributes for a canonical feature; identifying a set of text rectangles from an electronic document; obtaining a comparison set of reference document codifications, one of which comprising a plurality of canonical feature codifications, one of which comprising one or more attribute values for one or more of the plurality of attributes of one of the set of canonical features as the one canonical feature appears in the one reference document; for each current canonical feature of the set of canonical features: selecting a set of canonical feature codifications from the comparison set and identifying a match between one of the set of text rectangles and one of the set of canonical feature codifications; for each of the set of text rectangles, selecting one of the matching canonical feature codifications.