1. CN102129450 - Detecting spiking queries

Office China
Application Number 201110030893.0
Application Date 19.01.2011
Publication Number 102129450
Publication Date 20.07.2011
Grant Number 102129450
Grant Date 19.08.2015
Publication Kind B
IPC
G06F 17/30
G PHYSICS
06 COMPUTING; CALCULATING OR COUNTING
F ELECTRIC DIGITAL DATA PROCESSING
17 Digital computing or data processing equipment or methods, specially adapted for specific functions
30 Information retrieval; Database structures therefor
CPC
G06Q 30/0254
G PHYSICS
06 COMPUTING; CALCULATING; COUNTING
Q DATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES, NOT OTHERWISE PROVIDED FOR
30 Commerce, e.g. shopping or e-commerce
02 Marketing, e.g. market research and analysis, surveying, promotions, advertising, buyer profiling, customer management or rewards; Price estimation or determination
0241 Advertisement
0251 Targeted advertisement
0254 based on statistics
G06F 16/24534
G PHYSICS
06 COMPUTING; CALCULATING; COUNTING
F ELECTRIC DIGITAL DATA PROCESSING
16 Information retrieval; Database structures therefor; File system structures therefor
20 of structured data, e.g. relational data
24 Querying
245 Query processing
2453 Query optimisation
24534 Query rewriting; Transformation
Applicants Microsoft Technology Licensing, LLC (微软技术许可有限责任公司)
Inventors C·A·梅耶斯
G·P·戈帕尔
A·P·奥克利
N·阿格拉沃尔
N·E·克拉斯韦尔
M·邵库赫
D·L·康奈尔
S·阿哈里
N·B·沙曼
G·萨瑞恩
H·E·威廉姆斯
J·K·高亚尔
Agents 上海专利商标事务所有限公司 31100
Priority Data 12690184 20.01.2010 US
Title
(EN) Detecting spiking queries
(ZH) 检测尖峰查询
Abstract
(EN)
Methods, systems, and media are provided for identifying and clustering queries that are rising in popularity. Resultant clustered queries can be compared to other stored queries using textual and temporal correlations. Fresh indices containing information and results from recently crawled content sources are searched to obtain the most recent query activity. Historical indices are also searched to obtain temporally correlated information and results that match the clustered query stream. A weighted average acceleration of a spike can be calculated to distinguish between a legitimate spike and a non-legitimate spike. Legitimate clusters are combined with other stored clusters and presented as grouped content results to a user output device.
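
The abstract names a weighted average acceleration but this record does not give its formula. The following is a minimal sketch under stated assumptions, not the patented method: query volume is assumed to arrive as counts per equal-width time bucket, acceleration is approximated by the second difference of those counts, and more recent buckets receive exponentially larger weights. The function names, the `decay` parameter, and the `threshold` value are all hypothetical.

```python
from typing import Sequence


def weighted_average_acceleration(counts: Sequence[float], decay: float = 0.5) -> float:
    """Weighted mean of second differences of per-bucket query counts.

    counts: query volume per equal-width time bucket, oldest first (assumed input).
    decay:  hypothetical weight factor in (0, 1]; the newest value has weight 1,
            and each older value is multiplied by `decay` once more.
    """
    if len(counts) < 3:
        raise ValueError("need at least three buckets to estimate acceleration")
    if not 0 < decay <= 1:
        raise ValueError("decay must be in (0, 1]")

    # Second difference: a discrete stand-in for acceleration (an assumption).
    accel = [counts[i] - 2 * counts[i - 1] + counts[i - 2] for i in range(2, len(counts))]

    # Exponentially decaying weights, largest for the most recent value.
    n = len(accel)
    weights = [decay ** (n - 1 - i) for i in range(n)]

    return sum(w * a for w, a in zip(weights, accel)) / sum(weights)


def is_legitimate_spike(counts: Sequence[float], threshold: float = 10.0, decay: float = 0.5) -> bool:
    """Treat a query stream as a legitimate spike when its weighted average
    acceleration reaches a hypothetical threshold."""
    return weighted_average_acceleration(counts, decay) >= threshold
```

With these assumptions, a stream such as `[10, 12, 15, 40, 120]` is flagged as a spike, while a flat stream such as `[10, 10, 10, 10]` is not. Weighting recent buckets more heavily is one plausible way to favor sustained, still-growing interest over an old burst; the source does not say which weighting is used.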

(ZH)

本发明提供了一种检测尖峰查询的方法、系统和介质。提供了用于标识流行度不断提升的查询并对其进行聚类的方法、系统和介质。可以使用文本或时间相关性将所得的聚类查询与其他所存储的查询进行比较。搜索包含来自最近爬行的内容源的信息和结果的新鲜索引来获得最近查询活动。还搜索历史索引来获得匹配聚类查询流的、在时间上相关的信息和结果。可以计算尖峰的加权平均加速度来在合法尖峰和不合法尖峰之间进行区分。将合法聚类与其他所存储的聚类进行组合并作为分组的内容结果呈现给用户输出设备。

Also published as