Clicking on the links below will take you through a registration process, intended only to provide statistics on the number of downloads and to provide an email address should you have any questions. We welcome your comments.
DIMAP-4 for Windows This demo version contains all the dictionary creation, maintenance, and analysis functionality of the full version, including regular expression searches over all fields, subdictionary creation based on matches, uploading dictionary data, dictionary downloading to user specifications, automatic dictionary creation from texts, and integrated WordNet lookup. It does not contain the parsing capability used in parsing definitions (including consistency checks with WordNet), parsing text, question-answering, discourse analysis, and text summarization. DIMAP dictionaries for WordNet 2.0 (see below) and for the 120,000 words and 250,000 definitions of Webster's Revised International Dictionary (in the DIMAP dictionaries directory) are available for free. Other dictionaries (the New Oxford Dictionary of English and The Macquarie Dictionary) are available under academic or commercial research license agreements. Download and unzip, then run SETUP.EXE; sample dictionaries are provided (including WordNet 1.7 for the letter X and the dictionary used in Senseval-1).
CL Research - Proximity Technology Parser This demo is a fully operational version of the parser that underlies CL Research technologies for definition parsing, word-sense disambiguation, question-answering, discourse analysis, information extraction, text mining, and text summarization. In the demo, you can type or paste and then parse any sentence to see the parse tree results upon which further analysis is performed. The demo files include an HTML file describing the non-terminal and terminal (part-of-speech) nodes in the parse tree. Download and unzip the file and run the executable.
XML Analyzer The XML Analyzer was designed principally to investigate texts processed into XML format by the DIMAP text processor for question answering in TREC 2002. The full set of XML renderings for documents answering the TREC 2002 questions are available from NIST (further details upon request). A sample XML file is provided, along with instructions for examining particular aspects of the text.
Alphabetic WordNet 2.0 An alphabetic version of WordNet was created to assist the construction of customized dictionaries using the WordNet distribution. See the full description of how the alphabetic version of WordNet 2.0 was created, then follow the link here to register your download.
UMLS Specialist Lexicon The Specialist Lexicon of the Unified Medical Language System is designed for the specialized lexical needs of medical community. This lexicon contains over 220,000 terms and was developed to provide the lexical information needed for the SPECIALIST Natural Language Processing System. It is intended to be a general English lexicon that includes many biomedical terms. Coverage includes commonly occurring English words and biomedical vocabulary. The data elements in the lexicon describe syntactic characteristics of each entry, including inflection codes, case, gender, syntactic category, complements for verbs and nouns, modification types for adverbs, and more. This is lexicon was developed as a free, publicly available resource, with only moderate restrictions (e.g., you can't claim it as your own).
FrameNet Explorer for Windows (Last updated: March 9, 2005) FrameNet Explorer allows (1) examination of the FrameNet frames, frame elements, and lexical units, (2) extracting samples of frames and annotations, (3) selecting preposition corpus instances, and (4) identifying other syntactic realizations of frame elements associated with preposition instances. This program operates only in Windows (developed in Windows XP, but likely to work on earlier versions of Windows). Running the program assumes you have obtained the FrameNet distribution and have unzipped the files to the default directories; you will need to make some modifications to the FrameNet data files to reflect local paths. See Instructions for Using CL Research FrameNet Explorer for more details. These instructions are included in the distribution as well.
Alphabetic FrameNet Dictionary The FrameNet 1.3 data have been converted into an alphabetic dictionary. This dictionary contains 9471 entries, with 7575 entries for lexical items (many having multiple senses with different parts of speech) and 1896 entries that encode the frames and frame relations. Details of these items can be found through the main FrameNet site. A more detailed description of the DIMAP dictionary and how it was used in SemEval-2007 can be found in the paper "CLR: Integration of FrameNet in a Text Representation System".
MCCALite for Windows A light version of MCCA, without printing, addition of reference groups, more sophisticated statistical analyses, or ability to modify the MCCA dictionary. Suitable for analyses and comparisons of sets of texts (from sentences to books) and multi-person transcripts, including plays, focus groups, interviews, hearings, and TV scripts. Download and unzip, then run SETUP.EXE, for immediate analysis of Hamlet, or examine the text on which McTavish & Pirro was based.)
Maintained by Ken Litkowski .
Copyright © 2008 CL Research