Processing

Please wait...

Settings

Settings

Goto Application

1. US20080195595 - Keyword Extracting Device

Office
United States of America
Application Number 11667097
Application Date 11.10.2005
Publication Number 20080195595
Publication Date 14.08.2008
Publication Kind A1
IPC
G06F 17/30
GPHYSICS
06COMPUTING; CALCULATING OR COUNTING
FELECTRIC DIGITAL DATA PROCESSING
17Digital computing or data processing equipment or methods, specially adapted for specific functions
30Information retrieval; Database structures therefor
Applicants INTELLECTUAL PROPERTY BANK CORP.
Inventors Masuyama Hiroaki
Sato Haru-Tada
Asada Makoto
Hasuko Kazumi
Hotta Hideaki
Agents WENDEROTH, LIND &; PONACK, L.L.P.
Priority Data 2004322924 05.11.2004 JP
Title
(EN) Keyword Extracting Device
Abstract
(EN)

A keyword extracting device includes high-frequency term extracting means (30) for extracting high-frequency terms which are index terms having a great weight among the index terms in a document group (E) including a plurality of documents (D), the weight including evaluation on the level of an appearance frequency of each index term, clustering means (50) for clustering the high-frequency terms on the basis of a co-occurrence degree C. which is based on the presence/absence of the co-occurrence of each document with the index terms (w) in the document group (E) in each document, score calculating means (70) for calculating a score key(w) of each index term (w) such that a high score is given to the index term among the index terms (w) that co-occurs with the high-frequency term belonging to more clusters (g) and that co-occurs with the high-frequency term in more documents (D), and keyword extracting means (90) for extracting keywords on the basis of the scores. Accordingly, the keywords indicating a feature of a document group including a plurality of documents can be automatically extracted.