1. (US20180165272) Automatic locale determination for electronic documents

Office : United States of America
Application Number: 15858980 Application Date: 29.12.2017
Publication Number: 20180165272 Publication Date: 14.06.2018
Grant Number: 10346538 Grant Date: 09.07.2019
Publication Kind : B2
IPC:
G06F 16/81
G06F 17/22
G06F 17/27
G06F 16/182
G06F 16/9535
G06F 17/22: G PHYSICS > 06 COMPUTING; CALCULATING; COUNTING > F ELECTRIC DIGITAL DATA PROCESSING > 17 Digital computing or data processing equipment or methods, specially adapted for specific functions > 20 Handling natural language data > 21 Text processing > 22 Manipulating or registering by use of codes, e.g. in sequence of text characters
G06F 17/27: G PHYSICS > 06 COMPUTING; CALCULATING; COUNTING > F ELECTRIC DIGITAL DATA PROCESSING > 17 Digital computing or data processing equipment or methods, specially adapted for specific functions > 20 Handling natural language data > 27 Automatic analysis, e.g. parsing, orthograph correction
CPC:
G06F 16/182
G06F 17/275
G06F 16/81
G06F 16/9535
G06F 17/2247
G06F 17/2252
G06F 17/2765
Applicants: Coupa Software Incorporated
Inventors: Matthew Pasquini
Agents: Hickman Palermo Becker Bingham LLP
Priority Data:
Title: (EN) Automatic locale determination for electronic documents
Abstract:
(EN)

Automatic locale determination for documents is described. In an embodiment, a computer server receives an electronic document comprising a plurality of unknown-language data elements, each associated with one or more types. Based on a document schema of the document, the computer system selects one or more unknown-language data elements from the plurality of unknown-language data elements and assigns to each of the one or more unknown-language data elements a corresponding weight value based on a respective type of the unknown-language data element. The computer system compares the one or more unknown-language data elements with a plurality of known-language data elements that are associated with the document schema and, based on the comparing, determines a number of unknown-language data elements in the one or more unknown-language data elements that matched any in a subset of the plurality of known-language data elements, wherein the subset of known-language data elements corresponds to a particular language. Based on the number of data elements that matched the subset of known-language data elements and on the corresponding weight assigned to each unknown-language data element in the number of unknown-language data elements, the computer system determines a language confidence level value specifying a level of machine confidence that the document is expressed in the particular language and, based on the language confidence level value for the particular language exceeding a language threshold value, automatically processes the document using the particular language.
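
The flow described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. The element types, the weight values, the vocabularies, the 0.6 threshold, and the names Element, TYPE_WEIGHTS, KNOWN_TERMS, language_confidence and determine_language are all hypothetical. The abstract does not say how matches and weights are combined, so the sketch assumes the confidence is the weighted share of selected elements that match a language's known terms.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Element:
    """One unknown-language data element and its type (hypothetical model)."""
    text: str
    type: str


# Assumed weights per element type; types absent here are not selected,
# which stands in for the schema-based selection step.
TYPE_WEIGHTS = {"label": 2.0, "unit": 1.0, "note": 0.5}

# Assumed known-language data elements, grouped by language.
KNOWN_TERMS = {
    "en": {"invoice", "amount", "supplier", "total"},
    "fr": {"facture", "montant", "fournisseur", "total"},
}


def language_confidence(elements, known_terms, weights=TYPE_WEIGHTS):
    """Weighted share of selected elements that match one language's terms."""
    selected = [e for e in elements if e.type in weights]
    total = sum(weights[e.type] for e in selected)
    if total == 0:
        return 0.0
    matched = sum(
        weights[e.type] for e in selected if e.text.casefold() in known_terms
    )
    return matched / total


def determine_language(elements, known=KNOWN_TERMS, threshold=0.6):
    """Return (language or None, per-language scores).

    A language is returned only when its confidence exceeds the threshold.
    """
    scores = {
        lang: language_confidence(elements, terms)
        for lang, terms in known.items()
    }
    best = max(scores, key=scores.get)
    return (best if scores[best] > threshold else None), scores


document = [
    Element("Facture", "label"),
    Element("Montant", "label"),
    Element("Total", "unit"),      # matches both English and French
    Element("merci", "note"),      # matches neither vocabulary
    Element("12345", "id"),        # type not in TYPE_WEIGHTS, so not selected
]
language, scores = determine_language(document)
```

In this sketch a document whose best score does not exceed the threshold yields None, leaving the caller to decide on a fallback such as manual review; the abstract itself only describes the case where the threshold is exceeded.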