Processing

Please wait...

Settings

Settings

Goto Application

1. WO2020201465 - METHOD AND SYSTEM FOR ADVANCED DOCUMENT REDACTION

Publication Number WO/2020/201465
Publication Date 08.10.2020
International Application No. PCT/EP2020/059470
International Filing Date 02.04.2020
IPC
G06F 40/205 2020.01
GPHYSICS
06COMPUTING; CALCULATING OR COUNTING
FELECTRIC DIGITAL DATA PROCESSING
40Handling natural language data
20Natural language analysis
205Parsing
CPC
G06F 16/3344
GPHYSICS
06COMPUTING; CALCULATING; COUNTING
FELECTRIC DIGITAL DATA PROCESSING
16Information retrieval; Database structures therefor; File system structures therefor
30of unstructured textual data
33Querying
3331Query processing
334Query execution
3344using natural language analysis
G06F 40/205
GPHYSICS
06COMPUTING; CALCULATING; COUNTING
FELECTRIC DIGITAL DATA PROCESSING
40Handling natural language data
20Natural language analysis
205Parsing
G06F 40/253
GPHYSICS
06COMPUTING; CALCULATING; COUNTING
FELECTRIC DIGITAL DATA PROCESSING
40Handling natural language data
20Natural language analysis
253Grammatical analysis; Style critique
G06F 40/289
GPHYSICS
06COMPUTING; CALCULATING; COUNTING
FELECTRIC DIGITAL DATA PROCESSING
40Handling natural language data
20Natural language analysis
279Recognition of textual entities
289Phrasal analysis, e.g. finite state techniques or chunking
G06F 40/40
GPHYSICS
06COMPUTING; CALCULATING; COUNTING
FELECTRIC DIGITAL DATA PROCESSING
40Handling natural language data
40Processing or translation of natural language
G06N 20/00
GPHYSICS
06COMPUTING; CALCULATING; COUNTING
NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
20Machine learning
Applicants
  • GENPACT LUXEMBOURG S.À R.L [LU]/[LU]
Inventors
  • MANE, Shishir
Agents
  • KEANE, Paul
Priority Data
16/373,21602.04.2019US
Publication Language English (EN)
Filing Language English (EN)
Designated States
Title
(EN) METHOD AND SYSTEM FOR ADVANCED DOCUMENT REDACTION
(FR) PROCÉDÉ ET SYSTÈME DE RÉDACTION DE DOCUMENT AVANCÉE
Abstract
(EN)
A system and method for advanced document redaction are disclosed. According to one embodiment, a system comprises a parser that analyzes documents to identify structured, semi-structured, and unstructured data from a document. A candidates generator generates a list of words for redaction from the structured, semi-structured, and unstructured data. A replacement engine replaces one or more words from the list of words with one or more of a replacement word, random characters, and random numbers.
(FR)
L'invention concerne un procédé et un système de rédaction de document avancée. Selon un mode de réalisation, un système comprend un analyseur qui analyse des documents pour identifier des données structurées, semi-structurées et non structurées à partir d'un document. Selon un mode de réalisation, un système comprend un analyseur qui analyse des documents afin d’identifier des données structurées, semi-structurées et non structurées à partir d'un document. Un moteur de remplacement remplace un ou plusieurs mots de la liste de mots par un ou plusieurs des éléments suivants : mot de remplacement, caractères aléatoires et nombres aléatoires.
Latest bibliographic data on file with the International Bureau