com.aliasi.tokenizer
Interface TokenizerFactory
- All Known Implementing Classes:
- CharacterTokenizerFactory, IndoEuropeanTokenizerFactory, LineTokenizerFactory, NGramTokenizerFactory, RegExTokenizerFactory
public interface TokenizerFactory
A TokenizerFactory constructs tokenizers from
subsequences of character arrays. Factories are typically supplied
with no-argument constructors so that they can be
instantiated through reflection.
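The reflection pattern described above can be sketched roughly as follows. This is an illustrative sketch, not LingPipe code: the nested Tokenizer and TokenizerFactory types are hypothetical stand-ins that copy only the tokenizer(char[], int, int) signature documented on this page, the null-at-end behavior of nextToken() is an assumption, and WholeSliceTokenizerFactory and instantiate are invented names.

```java
public class ReflectiveFactorySketch {

    /** Hypothetical stand-in for the library's Tokenizer type. */
    public interface Tokenizer {
        /** Returns the next token, or null when none remain (assumed contract). */
        String nextToken();
    }

    /** Same single-method shape as the interface documented on this page. */
    public interface TokenizerFactory {
        Tokenizer tokenizer(char[] ch, int start, int length);
    }

    /** Hypothetical factory: returns the whole subsequence as a single token. */
    public static class WholeSliceTokenizerFactory implements TokenizerFactory {

        // Public no-argument constructor, so the class can be created reflectively.
        public WholeSliceTokenizerFactory() { }

        @Override
        public Tokenizer tokenizer(char[] ch, int start, int length) {
            final String slice = new String(ch, start, length);
            return new Tokenizer() {
                private boolean done = false;

                @Override
                public String nextToken() {
                    if (done) return null;
                    done = true;
                    return slice;
                }
            };
        }
    }

    /** Creates a factory from its class object using the no-argument constructor. */
    public static TokenizerFactory instantiate(Class<? extends TokenizerFactory> factoryClass) {
        try {
            return factoryClass.getDeclaredConstructor().newInstance();
        } catch (ReflectiveOperationException e) {
            throw new IllegalStateException("Cannot instantiate " + factoryClass.getName(), e);
        }
    }

    public static void main(String[] args) {
        TokenizerFactory factory = instantiate(WholeSliceTokenizerFactory.class);
        char[] text = "hello world".toCharArray();
        // Tokenize only the last five characters of the array.
        System.out.println(factory.tokenizer(text, 6, 5).nextToken());
    }
}
```

A configuration-driven system would usually obtain the class object from a class name string (for example with Class.forName) before calling a helper like instantiate; the no-argument constructor is what makes that possible without knowing the concrete type at compile time.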
- Since:
- LingPipe 1.0
- Version:
- 1.0
- Author:
- Bob Carpenter
Method Summary
Tokenizer tokenizer(char[] ch, int start, int length)
Returns a tokenizer for the specified subsequence of characters.
tokenizer
Tokenizer tokenizer(char[] ch,
int start,
int length)
- Returns a tokenizer for the specified subsequence
of characters.
- Parameters:
ch - Characters to tokenize.
start - Index of first character to tokenize.
length - Number of characters to tokenize.
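To make the start and length parameters concrete, here is a minimal sketch of a factory whose tokenizers read only the requested slice of the array. It is not one of the implementing classes listed above: the nested Tokenizer and TokenizerFactory types are hypothetical stand-ins that reuse the documented signature, the whitespace-splitting rule and the null-at-end behavior of nextToken() are assumptions, and WhitespaceTokenizerFactory and tokenize are invented names.

```java
import java.util.ArrayList;
import java.util.List;

public class SubsequenceTokenizerSketch {

    /** Hypothetical stand-in for the library's Tokenizer type. */
    public interface Tokenizer {
        /** Returns the next token, or null when none remain (assumed contract). */
        String nextToken();
    }

    /** Same single-method shape as the interface documented on this page. */
    public interface TokenizerFactory {
        Tokenizer tokenizer(char[] ch, int start, int length);
    }

    /** Hypothetical factory whose tokenizers split the slice on whitespace. */
    public static class WhitespaceTokenizerFactory implements TokenizerFactory {
        @Override
        public Tokenizer tokenizer(final char[] ch, final int start, final int length) {
            final int end = start + length; // exclusive upper bound of the slice
            return new Tokenizer() {
                private int pos = start;

                @Override
                public String nextToken() {
                    // Skip whitespace, staying inside [start, end).
                    while (pos < end && Character.isWhitespace(ch[pos])) pos++;
                    if (pos >= end) return null;
                    int tokenStart = pos;
                    // Consume characters up to the next whitespace or the slice end.
                    while (pos < end && !Character.isWhitespace(ch[pos])) pos++;
                    return new String(ch, tokenStart, pos - tokenStart);
                }
            };
        }
    }

    /** Drains a tokenizer for the given slice into a list. */
    public static List<String> tokenize(TokenizerFactory factory, char[] ch, int start, int length) {
        List<String> tokens = new ArrayList<>();
        Tokenizer tokenizer = factory.tokenizer(ch, start, length);
        for (String token = tokenizer.nextToken(); token != null; token = tokenizer.nextToken()) {
            tokens.add(token);
        }
        return tokens;
    }

    public static void main(String[] args) {
        char[] buffer = "xx hello big world yy".toCharArray();
        // start = 3 and length = 15 select "hello big world"; "xx" and "yy" are never read.
        System.out.println(tokenize(new WhitespaceTokenizerFactory(), buffer, 3, 15));
    }
}
```

Passing an offset and a length rather than a String lets a caller tokenize part of a larger buffer without copying it first.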