| EN_PENN_TREEBANK_TOKENIZER |
= |
"edu.stanford.nlp.process.PTBTokenizer" |
|
The default sentence segmenter and tokenizer. This is an English-language
tokenizer with support for Penn Treebank markup.
|
| ENGLISH_PCFG_MODEL |
= |
"$(ROOT)/englishPCFG.ser.gz" |
|
Path to the English PCFG model that ships with the Stanford Parser. The
location is relative to the parser root directory (ROOT). This is a valid
value for the grammar parameter of the LexicalizedParser constructor.
|
| ROOT |
= |
initialize_on_load |
|
The root directory of the Stanford Parser installation.
|
| StandoffToken |
= |
Struct.new(:current, :word, :before, :after, :begin_position, :end_position) |
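StandoffToken carries no description above, so the following is only a minimal sketch of how a Struct with these members behaves in plain Ruby. The struct is redefined locally so the example runs without the library. The sample text and field values are hypothetical, and the reading of begin_position and end_position as character offsets with an exclusive end is an assumption based on the member names, not something this reference states.

```ruby
# Local stand-in with the same members as the constant above,
# so the sketch runs without loading the library.
StandoffToken = Struct.new(:current, :word, :before, :after,
                           :begin_position, :end_position)

text = "The parser works."

# Hypothetical values: how the library fills these fields is an assumption.
token = StandoffToken.new("parser", "parser", " ", " ", 4, 10)

# Struct members are readable (and writable) by name.
word  = token.word
start = token.begin_position

# Assuming end_position is exclusive, the offsets recover the token
# from the original text.
span = text[token.begin_position...token.end_position]
puts span
```

Because a Struct also supports to_a and ==, tokens built this way can be compared or destructured without extra code.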