Proteus Nomlex

Proteus Project

Department of Computer Science
New York University


Catherine Macleod, Ralph Grishman, Adam Meyers, Leslie Barrett and Ruth Reeves

Overview

NOMLEX (NOMinalization Lexicon) is a dictionary of English nominalizations developed by the Proteus Project at New York University under the direction of Catherine Macleod. NOMLEX seeks not only to describe the allowed complements for a nominalization, but also to relate the nominal complements to the arguments of the corresponding verb. The complements of the nominalization are described in terms of the COMLEX Syntax verb subcategorization patterns of its associated verb. See the COMLEX Syntax Manual for information on the verbal complements. We identify both the main verbal arguments (subject, direct object, and indirect object), which may map into a variety of nominal positions, and the oblique verbal complements, which map more directly into nominal complements. The argument correspondences are specified through a combination of explicit information in the lexical entries and general linguistic constraints on the correspondences. We have 1025 entries of several types of lexical nominalizations, including over 1000 distinct words. These words were selected from lists of frequently appearing nominalizations in our corpus (which includes Brown and the Wall Street Journal). There are multiple entries for  certain graphonyms. For example, "deduction" has two NOMLEX entries ("deduction1" and "deduction2"), one corresponding to the verb "deduct" and the other corresponding to the verb "deduce". We released the alpha version of NOMLEX on January 15, 1999, a small update to the alpha version (the Alpha 2 version) on March 12, 1999. Finally, the 2001 version was released on October 19, 2001.  The latter is downloadable from this website (see below) and freely available for use by all. We would appreciate feedback from this usage to macleod@cs.nyu.edu.

A sample entry follows:

(nom      :orth "promotion"
          :verb "promote"
          :nom-type((verb-nom))
          :verb-subj ((n-n-mod) (det-poss))
          :verb-subc ((nom-np :object ((det-poss)(n-n-mod)(pp-of)))
                      (nom-np-as-np :object ((det-poss) (pp-of)))
                      (nom-possing :nom-subc ((p-possing :pval ("of"))))
                      (nom-np-pp :object ((det-poss) (n-n-mod) (pp-of))
                                 :pval ("into" "from" "for" "to"))
                      (nom-np-pp-pp :object ((det-poss) (n-n-mod) (pp-of))
                                    :pval ("for" "into" "to") :pval2 ("from"))))


Many nominalization appear with support verbs: "launch an attack", "take a walk". We have designed a extended nominalization entry which captures information about these support verbs (Proteus Project Memorandum 02-005). We are now starting a project to annotate all the nominalizations in the Penn Tree Bank; this will allow us to extend and validate the entries in NOMLEX

Downloading NOMLEX

NOMLEX is available in two forms: This project is funded by the National Science Foundation under Grant No. IRI-9633286.

References

Adam Meyers, Catherine Macleod, Roman Yangarber, Ralph Grishman, Leslie Barrett, Ruth Reeves. Using NOMLEX to Produce Nominalization Patterns for Information Extraction. Coling-ACL98 workshop Proceedings: the Computational Treatment of Nominals Montreal, Canada, August, 1998.

Catherine Macleod, Ralph Grishman, Adam Meyers, Leslie Barrett, Ruth Reeves. NOMLEX: A Lexicon of Nominalizations. Proceedings of EURALEX'98, Liege, Belgium, August 1998.

Catherine Macleod, Adam Meyers, Ralph Grishman, Leslie Barrett, Ruth Reeves. Designing a Dictionary of Derived Nominals. Proceedings of Recent Advances in Natural Language Processing, Tzigov Chark, Bulgaria, September, 1997.