Processing

Please wait...

Settings

Settings

Goto Application

1. WO1999005618 - APPARATUS AND METHODS FOR AN INFORMATION RETRIEVAL SYSTEM THAT EMPLOYS NATURAL LANGUAGE PROCESSING OF SEARCH RESULTS TO IMPROVE OVERALL PRECISION

Publication Number WO/1999/005618
Publication Date 04.02.1999
International Application No. PCT/US1998/009711
International Filing Date 13.05.1998
Chapter 2 Demand Filed 10.02.1999
IPC
G06F 17/30 2006.1
GPHYSICS
06COMPUTING; CALCULATING OR COUNTING
FELECTRIC DIGITAL DATA PROCESSING
17Digital computing or data processing equipment or methods, specially adapted for specific functions
30Information retrieval; Database structures therefor
CPC
G06F 16/3344
GPHYSICS
06COMPUTING; CALCULATING; COUNTING
FELECTRIC DIGITAL DATA PROCESSING
16Information retrieval; Database structures therefor; File system structures therefor
30of unstructured textual data
33Querying
3331Query processing
334Query execution
3344using natural language analysis
Y10S 707/99932
YSECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
10TECHNICAL SUBJECTS COVERED BY FORMER USPC
STECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
707Data processing: database and file management or data structures
99931Database or file accessing
99932Access augmentation or optimizing
Y10S 707/99933
YSECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
10TECHNICAL SUBJECTS COVERED BY FORMER USPC
STECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
707Data processing: database and file management or data structures
99931Database or file accessing
99933Query processing, i.e. searching
Y10S 707/99934
YSECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
10TECHNICAL SUBJECTS COVERED BY FORMER USPC
STECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
707Data processing: database and file management or data structures
99931Database or file accessing
99933Query processing, i.e. searching
99934Query formulation, input preparation, or translation
Y10S 707/99935
YSECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
10TECHNICAL SUBJECTS COVERED BY FORMER USPC
STECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
707Data processing: database and file management or data structures
99931Database or file accessing
99933Query processing, i.e. searching
99935Query augmenting and refining, e.g. inexact access
Y10S 707/99936
YSECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
10TECHNICAL SUBJECTS COVERED BY FORMER USPC
STECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
707Data processing: database and file management or data structures
99931Database or file accessing
99933Query processing, i.e. searching
99936Pattern matching access
Applicants
  • MICROSOFT CORPORATION [US]/[US]
Inventors
  • BRADEN-HARDER, Lisa
  • CORSTON, Simon, H.
  • DOLAN, William, B.
  • VANDERWENDE, Lucy, H.
Agents
  • MICHAELSON, Peter, L.
Priority Data
08/898,65222.07.1997US
Publication Language English (en)
Filing Language English (EN)
Designated States
Title
(EN) APPARATUS AND METHODS FOR AN INFORMATION RETRIEVAL SYSTEM THAT EMPLOYS NATURAL LANGUAGE PROCESSING OF SEARCH RESULTS TO IMPROVE OVERALL PRECISION
(FR) APPAREIL ET PROCEDES POUR SYSTEME D'EXTRACTION D'INFORMATION UTILISANT LE TRAITEMENT EN LANGAGE NATUREL DES RESULTATS DE RECHERCHE POUR AMELIORER LA PRECISION GLOBALE
Abstract
(EN) Apparatus and accompanying methods for an information retrieval system that utilizes natural language processing to process results retrieved by, for example, an information retrieval engine such as a conventional statistical-based search engine, in order to improve overall precision. Specifically, such a search ultimately yields a set of retrieved documents. Each such document is then subjected to natural language processing to produce a set of logical forms. Each such logical form encodes, in a word-relation-word manner, semantic relationships, particularly argument and adjunct structure, between words in a phrase. A user-supplied query is analyzed in the same manner to yield a set of corresponding logical forms therefor. Documents are ranked as a predefined function of the logical forms from the documents and the query. Specifically, the set of logical forms for the query is then compared against a set of logical forms for each of the retrieved documents in order to ascertain a match between any such logical forms in both sets. Each document that has at least one matching logical forms is heuristically scored, with each different relation for a matching logical forms being assigned a different corresponding predefined weight. The score of each such document is, e.g., a predefined function of the weights of its uniquely matching logical forms. Finally, the retained documents are ranked in order of descending score and then presented to a user in that order.
(FR) Appareils et procédés associés, pour un système de recherche d'information utilisant le traitement en langage naturel pour traiter les résultats extraits, par exemple, par un moteur d'extraction d'information comme un moteur de recherche à base statistique classique, afin d'améliorer la précision globale. Ladite recherche permet notamment de produire en final un ensemble de documents extraits. Chaque document est ensuite soumis à un traitement en langue naturelle de sorte qu'un ensemble de formes logiques soit produit. Chaque forme logique code, en mode mot-relation-mot, les relations sémantiques, notamment la structure d'argument et d'adjonction, entre les mots d'une phrase. Une demande formulée par l'utilisateur est analysée de la même manière de sorte qu'un ensemble de formes logiques correspondantes soit produit. Les documents sont classés en fonction, de manière prédéterminée, des formes logiques provenant des documents et de la demande. Spécifiquement, l'ensemble de formes logiques pour la demande est ensuite comparé à un ensemble de formes logiques pour chacun des documents extraits, de manière qu'un appariement soit établi entre chaque forme logique des deux ensembles. Chaque document qui présente au moins une forme logique appariée est évalué de manière heuristique, un poids prédéfini différent et correspondant différent étant attribué à chaque relation différente pour une forme logique appariée. L'évaluation de chaque document est fonction, par exemple, de manière prédéterminée, des poids de ses formes logiques appariées uniques. Les documents retenus sont ensuite classés dans l'ordre décroissant puis présentés à un utilisateur dans cet ordre.
Latest bibliographic data on file with the International Bureau