Processing

Please wait...

Settings

Settings

Goto Application

1. CN112119394 - FINDING A RESOURCE IN RESPONSE TO A QUERY INCLUDING UNKNOWN WORDS

Office
China
Application Number 201980032253.8
Application Date 16.05.2019
Publication Number 112119394
Publication Date 22.12.2020
Publication Kind A
IPC
G06F 40/242
GPHYSICS
06COMPUTING; CALCULATING OR COUNTING
FELECTRIC DIGITAL DATA PROCESSING
40Handling natural language data
20Natural language analysis
237Lexical tools
242Dictionaries
CPC
G06F 40/268
GPHYSICS
06COMPUTING; CALCULATING; COUNTING
FELECTRIC DIGITAL DATA PROCESSING
40Handling natural language data
20Natural language analysis
268Morphological analysis
G06F 40/30
GPHYSICS
06COMPUTING; CALCULATING; COUNTING
FELECTRIC DIGITAL DATA PROCESSING
40Handling natural language data
30Semantic analysis
G06F 16/3334
GPHYSICS
06COMPUTING; CALCULATING; COUNTING
FELECTRIC DIGITAL DATA PROCESSING
16Information retrieval; Database structures therefor; File system structures therefor
30of unstructured textual data
33Querying
3331Query processing
3332Query translation
3334Selection or weighting of terms from queries, including natural language queries
G06F 16/3344
GPHYSICS
06COMPUTING; CALCULATING; COUNTING
FELECTRIC DIGITAL DATA PROCESSING
16Information retrieval; Database structures therefor; File system structures therefor
30of unstructured textual data
33Querying
3331Query processing
334Query execution
3344using natural language analysis
G06F 16/374
GPHYSICS
06COMPUTING; CALCULATING; COUNTING
FELECTRIC DIGITAL DATA PROCESSING
16Information retrieval; Database structures therefor; File system structures therefor
30of unstructured textual data
36Creation of semantic tools, e.g. ontology or thesauri
374Thesaurus
G10L 15/08
GPHYSICS
10MUSICAL INSTRUMENTS; ACOUSTICS
LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
15Speech recognition
08Speech classification or search
Applicants INTERNATIONAL BUSINESS MACHINES CORPORATION
国际商业机器公司
Inventors OYA HIROKI
大矢裕己
Agents 北京市金杜律师事务所 11256
Priority Data 15986813 23.05.2018 US
Title
(EN) FINDING A RESOURCE IN RESPONSE TO A QUERY INCLUDING UNKNOWN WORDS
(ZH) 响应于包括未知词的查询查找资源
Abstract
(EN) A computer receives a search query from a user for finding a resource. The computer extracts one or more words from the search query using morphological analysis. The computer assigns at least one first category to at least one first word of the one or more words using a dictionary. In response to identifying an unknown word not in the dictionary within the one or more words, the computer searchesfor the unknown word on a net. If the unknown word is found on the net, the computer obtains a description on a page on the net on which the unknown word is found, extracts one or more second words from the description using morphological analysis, assigns, using the dictionary, at least one second category to the one or more second words extracted from the description, finds, among the one or more second words, a particular word to which a predetermined category was assigned, extracts a correlation word from among the one or more second words having a high correlation with the found particular word, and finds, among the first words, a search word assigned the at least one first category that is the same as the predetermined category, finds, from a repository, resource data or worksite data using the correlation word and the search word, and lists the found resource data.
(ZH) 计算机从用户接收用于查找资源的搜索查询。所述计算机使用词法分析从所述搜索查询中提取一个或多个词。所述计算机使用词典将至少一个第一类别分配给所述一个或多个词中的至少一个第一词。响应于在所述一个或多个词中识别出不在所述词典中的未知词,所述计算机在网络上搜索所述未知词。如果在所述网络上找到了所述未知词,则所述计算机会在所述网络上查找到所述未知词的页面上获得描述,并使用词法分析从所述描述中提取一个或多个第二个词,使用所述词典将至少一个第二类别分配给从所述描述中提取的所述一个或多个第二词,在所述一个或多个第二词中找到已分配预定类别的特定词,从其中提取与所述一个或多个第二词与所找到的特定词具有高度相关性的相关词,并在所述第一词中找到已分配与所述预定类别相同的至少一个第一类别的搜索词,使用所述相关词和所述搜索词从存储库中找到资源数据或工作场所数据,并列出找到的资源数据。
Related patent documents
GB2018171.5This application is not viewable in PATENTSCOPE because the national phase entry has not been published yet or the national entry is issued from a country that does not share data with WIPO or there is a formatting issue or an unavailability of the application.