Tools Installed at OSU and their Reference Documentation
This list of resources was collected by Markus Dickinson and Detmar Meurers (OSU), February 2002. Funding for this project provided by OSU College of Humanities Seed Grant.
Go to the corpus resources page or
view the locally installed corpora.
TOKENIZATION / SEGMENTATION
LT TTT (Text Tokenisation Tool)
found at: /opt/compling/src/TTT_v1.0/
SATZ
found at: /opt/compling/src/satz-1.0/ (README)
MXTERMINATOR
found at: /opt/compling/src/Ratnaparkhi/
TAGGERS
Decision Tree Tagger
found at: /opt/compling/lib/tree-tagger/ (README)
Xerox Tagger
found at: /opt/compling/src/XeroxTagger/
TnT
found at: /opt/compling/src/tnt/
Brill Taggers
MXPOST
found at: /opt/compling/src/Ratnaparkhi/
MuTBL
found at: /opt/compling/src/Mutbl/
- The User's Manual: html
- The Programmer's Manual: html
SNoW-based Tagger
found at: /opt/compling/src/roth-tagger (README.POS)
- The Snow User's Guide: ps
MORPHOLOGICAL ANALYZERS
Morphix
found at: /opt/compling/src/morphix/
- A description: html
- New features: txt
morpha and morphg
found at: /opt/compling/src/english-morph/ (README)
PC-KIMMO
found at: /opt/compling/pckimmo/
PARSERS
Link Grammar Parser
found at: /opt/compling/src/link-grammar-system-4.1/(README)
SNoW-based Shallow Parser
found at: /opt/compling/src/shallow-parser
(README)
- The Snow User's Guide: ps
Collins Parser
found at: /opt/compling/src/Collins/parser (README, README.models)
nlparser (Charniak)
found at: /opt/compling/src/charniak (README)
LoPar
found at: /opt/compling/src/LoPar-2.8
VARIOUS TOOLS (ANNOTATE, SEARCH, TRANSCRIBE)
TIGERSearch
found at: /opt/compling/TIGERSearch
CQP (Corpus Query Processor) and Xkwic
found at: /opt/compling/ims-cwb/cwb-3.0
- The User's Manual (ps)
- A tutorial for CQP (pdf, ps)
- The Technical Manual (ps)
- The Xkwic Manual (ps)
The CLaRK System
found at: /home/compling/src/ClarkSystem/
AGTK (Annotation Graph ToolKit)
found at: /opt/compling/src/agtk/
- AGTK Application Developer's Manual: pdf, ps
- LDC MultiTrans transcription tool User Manual: html,
txt
- LDC TableTrans annotation tool User Manual (currently unavailable)
tgrep
found at: /home/corpora/EN/penn_treebank_2/tools/ (README.T)
To use, set TGREP_CORPUS variable to
/home/corpora/EN/penn_treebank_2/tgrepabl/wsj_mrg.crp
Questions or comments? Contact Markus Dickinson.
|