trec5@crd.ge.com
Principal Investigator: Tomek Strzalkowski
GE Research and Development Center, Bldg. K-1, Rm. 5C40, P.O. Box 8, Schenectady, NY 12301; phone (518) 387-6871; fax (518) 387-6845
Participation Category: A
Team participants: GE, NYU, Rutgers, Lockheed Martin
For previous TRECs, the GE/NYU group developed a prototype Natural Language Information Retrieval System that uses advanced linguistic processing techniques to enhance the effectiveness of traditional term-based document retrieval. The backbone of our system is a statistical information retrieval engine that indexes documents automatically, then searches and ranks them in response to user queries. This core architecture is augmented with robust natural language processing tools that process both the database documents and the user queries. These tools include a dictionary-assisted word stemmer, a part-of-speech tagger, a syntactic parser, a pattern matcher, and a statistical package for computing word and phrase correlations in a given text database. We believe that, used properly, automated natural language processing could become a significant factor in bringing about a new generation of text retrieval systems.
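To make the division of labor concrete, the following Python sketch shows how linguistically derived terms might feed a statistical index. It is only an illustration under loud assumptions, not the GE/NYU system: the crude suffix stripper stands in for the dictionary-assisted stemmer, adjacent-word pairs stand in for phrases that a syntactic parser would extract, and the tf-idf weighting and every function name (`stem`, `terms`, `build_index`, `search`) are hypothetical choices made for this example.

```python
import math
import re
from collections import Counter, defaultdict


def stem(word):
    # Assumption: crude suffix stripping, a stand-in for a dictionary-assisted stemmer.
    for suffix in ("ing", "ed", "es", "s"):
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def terms(text):
    # Single-word stems plus adjacent-word pairs.
    # Assumption: the pairs are a rough proxy for parser-derived phrases.
    words = [stem(w) for w in re.findall(r"[a-z]+", text.lower())]
    pairs = [f"{a}_{b}" for a, b in zip(words, words[1:])]
    return words + pairs


def build_index(docs):
    # Inverted index: term -> {doc_id: term frequency}.
    index = defaultdict(dict)
    for doc_id, text in docs.items():
        for term, count in Counter(terms(text)).items():
            index[term][doc_id] = count
    return index


def search(index, n_docs, query):
    # Rank documents by summed tf * idf over the query's terms.
    scores = defaultdict(float)
    for term in set(terms(query)):
        postings = index.get(term, {})
        if not postings:
            continue
        idf = math.log(1 + n_docs / len(postings))
        for doc_id, tf in postings.items():
            scores[doc_id] += tf * idf
    return sorted(scores, key=scores.get, reverse=True)


docs = {
    "d1": "Retrieving documents with parsed phrases",
    "d2": "Stock markets closed higher today",
    "d3": "Document retrieval engines rank documents",
}
index = build_index(docs)
ranking = search(index, len(docs), "document retrieval")
print(ranking)
```

In this toy collection, the phrase term `document_retrieval` matches only `d3`, so it adds evidence that single-word matching alone would not provide. That is the kind of contribution the linguistic tools are meant to make to the statistical engine.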
Page maintained by Jussi Karlgren. Comments: karlgren@cs.nyu.edu