Search International and National Patent Collections

1. (WO2018135023) INFORMATION PROCESSING SYSTEM, INFORMATION PROCESSING METHOD, AND COMPUTER PROGRAM

Pub. No.:    WO/2018/135023    International Application No.:    PCT/JP2017/028632
Publication Date: Fri Jul 27 01:59:59 CEST 2018 International Filing Date: Tue Aug 08 01:59:59 CEST 2017
IPC: G06F 17/27
Applicants: NOMURA RESEARCH INSTITUTE, LTD.
株式会社野村総合研究所
Inventors: Mao Yuxiang
毛 羽翔
Title: INFORMATION PROCESSING SYSTEM, INFORMATION PROCESSING METHOD, AND COMPUTER PROGRAM
Abstract:
A dictionary creation device 14 records a dictionary used in natural language processing by a natural language processing device 16, the dictionary storing independent words, which are words that establish a meaning as a standalone word. The dictionary creation device 14 deems a character string that remains after at least independent words already stored in the dictionary are removed from a character string of a patent document held in a patent document database 12 to be a phrase, and extracts a plurality of phrases. If an identical character string is present at the beginning portion of at least a prescribed number of phrases from among the plurality of phrases extracted, the dictionary creation device 14 extracts the identical character string as an independent word. The dictionary creation device 14 stores the extracted independent word in the dictionary.