public interface Tokenizer
TokenLists.

| Modifier and Type | Field and Description |
|---|---|
| static String | TOKENIZER_DEFAULT: The name of the default system tokenizer. |

| Modifier and Type | Method and Description |
|---|---|
| Tokenizer | getIngestTokenizer(SchemaField field, Locale locale): Get the underlying tokenizer to use for tokenizing fields in the ingest workflow. |
| Tokenizer | getQueryTokenizer(SchemaField field, Locale locale): Get the underlying tokenizer to use for tokenizing fields in the query workflow. |
| Phrase | tokenize(SchemaField field, Locale locale, SearchTerm term): Tokenizes term into a Phrase for query processing. |
| TokenList | tokenize(SchemaField field, Locale locale, String value): Tokenizes value into a TokenList. |
| Phrase | tokenize(SchemaField field, Locale locale, TermRange range): Tokenizes range into a Phrase for query processing. |
| void | tokenize(SchemaField field, Locale locale, TokenList tokens): Tokenizes all tokens in tokens. |
| Phrase | tokenize(SchemaField field, Locale locale, WildcardTerm term): Tokenizes a wildcard term into a Phrase for query processing. |

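
The difference between the String overload (returns a new TokenList) and the TokenList overload (returns void) can be sketched as follows. This is only an illustration under stated assumptions: `SketchTokenizer` and `WhitespaceSketchTokenizer` are hypothetical stand-ins, a field is reduced to a plain name, a token list to a `List<String>`, and the void overload is assumed to rewrite its argument in place. The real SchemaField, TokenList, and AttivioException types are not used, and the whitespace splitting is not the behavior of any actual system tokenizer.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

// Hypothetical stand-in for the interface above: not an Attivio type.
interface SketchTokenizer {
    // Mirrors tokenize(field, locale, String value): builds a new token list.
    List<String> tokenize(String field, Locale locale, String value);

    // Mirrors tokenize(field, locale, TokenList tokens): assumed to rewrite the list in place.
    void tokenize(String field, Locale locale, List<String> tokens);
}

// Illustrative implementation: splits on whitespace and lower-cases each piece.
final class WhitespaceSketchTokenizer implements SketchTokenizer {
    @Override
    public List<String> tokenize(String field, Locale locale, String value) {
        // field and locale may be null, as the parameter notes allow.
        Locale effective = (locale != null) ? locale : Locale.ROOT;
        List<String> tokens = new ArrayList<>();
        for (String piece : value.trim().split("\\s+")) {
            if (!piece.isEmpty()) {
                tokens.add(piece.toLowerCase(effective));
            }
        }
        return tokens;
    }

    @Override
    public void tokenize(String field, Locale locale, List<String> tokens) {
        // Re-tokenize every existing token, then replace the list contents.
        List<String> result = new ArrayList<>();
        for (String token : tokens) {
            result.addAll(tokenize(field, locale, token));
        }
        tokens.clear();
        tokens.addAll(result);
    }
}

final class TokenizeOverloadsDemo {
    public static void main(String[] args) {
        SketchTokenizer tokenizer = new WhitespaceSketchTokenizer();

        // String overload: the caller receives a fresh list.
        System.out.println(tokenizer.tokenize("title", Locale.ENGLISH, "Quick Brown Fox"));
        // prints [quick, brown, fox]

        // List overload: the caller's list is modified, nothing is returned.
        List<String> tokens = new ArrayList<>(List.of("New York", "Boston"));
        tokenizer.tokenize("city", null, tokens);
        System.out.println(tokens);
        // prints [new, york, boston]
    }
}
```

In this sketch the in-place overload needs a mutable list, which is why the demo wraps `List.of(...)` in an `ArrayList`.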
static final String TOKENIZER_DEFAULT

Tokenizer getIngestTokenizer(SchemaField field, Locale locale) throws AttivioException

In general, this method should return this. Tokenizers that route to
sub-tokenizers for handling different fields/locales should return the actual
tokenizer that will be used.

Throws: AttivioException

Tokenizer getQueryTokenizer(SchemaField field, Locale locale) throws AttivioException

In general, this method should return this. Tokenizers that route to
sub-tokenizers for handling different fields/locales should return the actual
tokenizer that will be used.
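
The "return this unless you route" rule can be sketched as below. The types are hypothetical stand-ins, not the real API: `RoutableTokenizer` reduces the field to a plain name and declares no checked exception, and routing purely by locale language is an assumption chosen for brevity. Only the ingest lookup is shown; a query-side lookup would follow the same pattern.

```java
import java.util.HashMap;
import java.util.Locale;
import java.util.Map;

// Hypothetical stand-in: the real method takes a SchemaField and may throw AttivioException.
interface RoutableTokenizer {
    RoutableTokenizer getIngestTokenizer(String field, Locale locale);
}

// A tokenizer that does its own work simply returns this.
final class LeafTokenizer implements RoutableTokenizer {
    private final String name;

    LeafTokenizer(String name) {
        this.name = name;
    }

    String name() {
        return name;
    }

    @Override
    public RoutableTokenizer getIngestTokenizer(String field, Locale locale) {
        return this;
    }
}

// A router returns the sub-tokenizer that will actually be used, never itself.
final class LocaleRoutingTokenizer implements RoutableTokenizer {
    private final Map<String, RoutableTokenizer> byLanguage = new HashMap<>();
    private final RoutableTokenizer fallback;

    LocaleRoutingTokenizer(RoutableTokenizer fallback) {
        this.fallback = fallback;
    }

    void register(String language, RoutableTokenizer tokenizer) {
        byLanguage.put(language, tokenizer);
    }

    @Override
    public RoutableTokenizer getIngestTokenizer(String field, Locale locale) {
        // A null locale is allowed, so it goes to the fallback.
        RoutableTokenizer target = (locale == null)
                ? fallback
                : byLanguage.getOrDefault(locale.getLanguage(), fallback);
        // Ask the target too, in case it is itself a router.
        return target.getIngestTokenizer(field, locale);
    }
}

final class RoutingDemo {
    public static void main(String[] args) {
        LeafTokenizer standard = new LeafTokenizer("standard");
        LeafTokenizer cjk = new LeafTokenizer("cjk");
        LocaleRoutingTokenizer router = new LocaleRoutingTokenizer(standard);
        router.register("ja", cjk);

        LeafTokenizer chosen = (LeafTokenizer) router.getIngestTokenizer("body", Locale.JAPANESE);
        System.out.println(chosen.name()); // prints cjk
    }
}
```

Callers that need to know which tokenizer will really process a field can then compare or inspect the returned instance instead of the router.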

Throws: AttivioException

void tokenize(SchemaField field, Locale locale, TokenList tokens) throws AttivioException

Tokenizes all tokens in tokens.

Parameters:
- field - the schema field being tokenized (may be null)
- locale - the Locale of the tokens (may be null)
- tokens - the token list

Throws:
- AttivioException - on an unrecoverable error

TokenList tokenize(SchemaField field, Locale locale, String value) throws AttivioException

Tokenizes value into a TokenList.

Parameters:
- field - the schema field being tokenized (may be null)
- locale - the Locale of the tokens (may be null)
- value - the string to tokenize

Throws:
- AttivioException - on an unrecoverable error

Phrase tokenize(SchemaField field, Locale locale, SearchTerm term) throws AttivioException

Tokenizes term into a Phrase for query processing.

Parameters:
- field - the schema field being tokenized (may be null)
- locale - the Locale of the tokens (may be null)
- term - the SearchTerm to tokenize

Throws:
- AttivioException - on an unrecoverable error

Phrase tokenize(SchemaField field, Locale locale, WildcardTerm term) throws AttivioException

Tokenizes a wildcard term into a Phrase for query processing.

Parameters:
- field - the schema field being tokenized (may be null)
- locale - the Locale of the tokens (may be null)
- term - the WildcardTerm to tokenize

Throws:
- AttivioException - on an unrecoverable error

Phrase tokenize(SchemaField field, Locale locale, TermRange range) throws AttivioException

Tokenizes range into a Phrase for query processing.

Parameters:
- field - the schema field being tokenized (may be null)
- locale - the Locale of the tokens (may be null)
- range - the TermRange to tokenize

Throws:
- AttivioException - on an unrecoverable error

Copyright © 2018 Attivio, Inc. All Rights Reserved.