Processing

Please wait...

Settings

Settings

Goto Application

1. CN101460949 - Indexing documents for information retrieval based on additional feedback fields

Office China
Application Number 200780020322.0
Application Date 15.03.2007
Publication Number 101460949
Publication Date 17.06.2009
Grant Number 101460949
Grant Date 27.08.2014
Publication Kind B
IPC
G06F 17/30
GPHYSICS
06COMPUTING; CALCULATING OR COUNTING
FELECTRIC DIGITAL DATA PROCESSING
17Digital computing or data processing equipment or methods, specially adapted for specific functions
30Information retrieval; Database structures therefor
G06F 17/21
GPHYSICS
06COMPUTING; CALCULATING OR COUNTING
FELECTRIC DIGITAL DATA PROCESSING
17Digital computing or data processing equipment or methods, specially adapted for specific functions
20Handling natural language data
21Text processing
CPC
G06F 16/3326
GPHYSICS
06COMPUTING; CALCULATING; COUNTING
FELECTRIC DIGITAL DATA PROCESSING
16Information retrieval; Database structures therefor; File system structures therefor
30of unstructured textual data
33Querying
332Query formulation
3325Reformulation based on results of preceding query
3326using relevance feedback from the user, e.g. relevance feedback on documents, documents sets, document terms or passages
G06F 16/951
GPHYSICS
06COMPUTING; CALCULATING; COUNTING
FELECTRIC DIGITAL DATA PROCESSING
16Information retrieval; Database structures therefor; File system structures therefor
90Details of database functions independent of the retrieved data types
95Retrieval from the web
951Indexing; Web crawling techniques
Inventors H·E·威廉姆斯
Agents 上海专利商标事务所有限公司 31100
Priority Data 06114850 01.06.2006 EP
Title
(EN) Indexing documents for information retrieval based on additional feedback fields
(ZH) 索引文档以供信息检索
Abstract
(EN)
Information retrieval systems such as web search systems locate documents amongst millions and even billions of possible documents on the basis of query terms. In order to achieve this document indexes are created. We propose creating new fields in the documents to store feedback information. This information comprises query terms used in a particular search as well as information about whether a particular document retrieved is given positive or negative feedback for example. Indexes are created on the basis of this feedback information in addition to other available information. As a result relevance of search results is improved. Multiple fields of information are available for given documents (such as abstract fields, title fields, anchor text fields as well as our feedback fields). Any search algorithm which deals with multiple fields as well as multiple query terms and which provides for differential weighting of document fields is used.

(ZH)

诸如web搜索系统等信息检索系统在查询项的基础上在数百万甚至数十亿可能的文档中定位文档。为实现这点,创建了文档索引。建议在文档中创建新的字段以存储反馈信息。该信息包括在特定搜索中所使用的查询项以及关于是否对所检索到的特定文档给予例如肯定反馈或否定反馈的信息。在该反馈信息加上其它可用信息的基础上创建索引。结果,改进了搜索结果的相关性。对给定文档有多个信息字段(如摘要字段、标题字段、锚文本字段以及此处的反馈字段可用。使用了处理多个字段以及多个查询项并提供对文档字段的差异加权的任何搜索算法。