WO2012057773 - GENERATING A TAXONOMY FROM UNSTRUCTURED INFORMATION

Publication Number WO/2012/057773
Publication Date 03.05.2012
International Application No. PCT/US2010/054611
International Filing Date 29.10.2010
IPC
G06F 17/27 2006.01
G PHYSICS
06 COMPUTING; CALCULATING OR COUNTING
F ELECTRIC DIGITAL DATA PROCESSING
17 Digital computing or data processing equipment or methods, specially adapted for specific functions
20 Handling natural language data
27 Automatic analysis, e.g. parsing, orthograph correction
G06F 17/25 2006.01
G PHYSICS
06 COMPUTING; CALCULATING OR COUNTING
F ELECTRIC DIGITAL DATA PROCESSING
17 Digital computing or data processing equipment or methods, specially adapted for specific functions
20 Handling natural language data
21 Text processing
25 Automatic justification
CPC
G06F 16/35
G PHYSICS
06 COMPUTING; CALCULATING; COUNTING
F ELECTRIC DIGITAL DATA PROCESSING
16 Information retrieval; Database structures therefor; File system structures therefor
30 of unstructured textual data
35 Clustering; Classification
G06F 16/36
G PHYSICS
06 COMPUTING; CALCULATING; COUNTING
F ELECTRIC DIGITAL DATA PROCESSING
16 Information retrieval; Database structures therefor; File system structures therefor
30 of unstructured textual data
36 Creation of semantic tools, e.g. ontology or thesauri
Applicants
  • HEWLETT-PACKARD DEVELOPMENT COMPANY, L.P. [US]/[US] (AllExceptUS)
  • MEHRA, Pankaj [US]/[US] (UsOnly)
  • ULANOV, Alexander [RU]/[RU] (UsOnly)
  • SIMANOVSKY, Andrey [RU]/[RU] (UsOnly)
Inventors
  • MEHRA, Pankaj
  • ULANOV, Alexander
  • SIMANOVSKY, Andrey
Agents
  • DAKIN, Lloyd E.
Priority Data
Publication Language English (EN)
Filing Language English (EN)
Designated States
Title
(EN) GENERATING A TAXONOMY FROM UNSTRUCTURED INFORMATION
(FR) GÉNÉRATION D'UNE TAXINOMIE À PARTIR D'INFORMATIONS NON STRUCTURÉES
Abstract
(EN)
At least one term is extracted [202] from unstructured information. The at least one term is validated [204]. Then, a sense of the at least one extracted and validated term is determined [206]. The at least one extracted and validated term is clustered [208] into at least one group of terms according to the determined sense. A taxonomy is generated [210] based on the clustering and a mining of accessible taxonomies.
(FR)
Selon la présente invention, au moins un terme est extrait [202] à partir d'informations non structurées. Le ou les termes sont ensuite validés [204]. Puis, une détection du ou des termes extraits et validés est déterminée [206]. Le ou les termes extraits et validés sont groupés [208] en au moins un groupe de termes sur la base de la détection déterminée. Une taxinomie est générée [210] sur la base du groupement et d'une recherche dans les taxinomies accessibles.
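Illustrative sketch (not part of the published record): the five steps named in the English abstract, [202] through [210], can be read as a simple pipeline. The Python below is a toy sketch under stated assumptions, not the method claimed in the application. Every function name, the reference vocabulary `known_terms`, the lookup table `sense_inventory`, and the mapping `existing_taxonomy` are hypothetical stand-ins chosen for illustration; the abstract does not say how extraction, validation, sense determination, or taxonomy mining are actually performed.

```python
import re
from collections import defaultdict


def extract_terms(text):
    """Step 202 (toy version): treat each distinct lowercase word as a candidate term."""
    return sorted(set(re.findall(r"[a-z]+", text.lower())))


def validate_terms(terms, known_terms):
    """Step 204 (assumption): keep only candidates found in a reference vocabulary."""
    return [t for t in terms if t in known_terms]


def determine_senses(terms, sense_inventory):
    """Step 206 (assumption): look up one sense label per validated term."""
    return {t: sense_inventory[t] for t in terms if t in sense_inventory}


def cluster_by_sense(term_senses):
    """Step 208: group terms that share the same determined sense."""
    clusters = defaultdict(list)
    for term, sense in sorted(term_senses.items()):
        clusters[sense].append(term)
    return dict(clusters)


def generate_taxonomy(clusters, existing_taxonomy):
    """Step 210 (assumption): hang each cluster under the parent that an
    already accessible taxonomy assigns to its sense, else under 'unclassified'."""
    taxonomy = defaultdict(dict)
    for sense, terms in clusters.items():
        parent = existing_taxonomy.get(sense, "unclassified")
        taxonomy[parent][sense] = terms
    return dict(taxonomy)


def build_taxonomy(text, known_terms, sense_inventory, existing_taxonomy):
    """Run the five steps in the order the abstract lists them."""
    terms = validate_terms(extract_terms(text), known_terms)
    clusters = cluster_by_sense(determine_senses(terms, sense_inventory))
    return generate_taxonomy(clusters, existing_taxonomy)


# Hypothetical sample data, invented for this sketch.
sample_text = "The laptop and the printer sat beside a sparrow and an eagle."
known_terms = {"laptop", "printer", "sparrow", "eagle"}
sense_inventory = {
    "laptop": "computer hardware",
    "printer": "computer hardware",
    "sparrow": "bird",
    "eagle": "bird",
}
existing_taxonomy = {"computer hardware": "technology", "bird": "animal"}

result = build_taxonomy(sample_text, known_terms, sense_inventory, existing_taxonomy)
print(result)
```

In this sketch the lookup tables carry all of the linguistic knowledge; a real system would presumably replace them with statistical or knowledge-based components, which the abstract leaves unspecified.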