SIGLEX has currently identified 59 lexical resources as having special interest to SIGLEX members. You can search this list by resource name, type, language, or keywords. You can also suggest a lexical resource to be added to this list. Also, check the ACL List of Resources by Language. (A prior set of links to SIGLEX Online Reources is currently being integrated into the SIGLEX Lexical Resources database, but some older links may be of interest.)

Name or Keywords:
Resource type:
Other types:
Language:

show all hide all


Open Roget's

Primary resource type: Lexicons:Research resources; Other resource tags: Ontologies; Resource language: English; Availability: Public; Sponsor: School of Information Technology and Engineering, University of Ottawa, Canada

Open Rogetís is an interface, implemented in Java, to Rogetís Thesaurus, designed for use by the Natural Language Processing community. It is based on the publicly available 1911 data, available raw from Project Gutenberg. Rogetís Thesaurus is built around a hierarchy, nine levels deep, which organizes related words and phrases. Open Rogetís contains almost 100000 words and phrases. The resource is useful in many NLP tasks, for example in identifying semantic relatedness, and it is very effective at quickly calculating the semantic distance between pairs of words. It can also be applied in the creation of lexical chains, and has been put to use in tasks such as text summarization and word-sense disambiguation. (40)