public interface Tokenizer
TokenList
s.Modifier and Type | Field and Description |
---|---|
static String |
TOKENIZER_DEFAULT
The name of the default system tokenizer.
|
Modifier and Type | Method and Description |
---|---|
Tokenizer |
getIngestTokenizer(SchemaField field,
Locale locale)
Get the underlying tokenizer to use for tokenizing fields in the ingest workflow.
|
Tokenizer |
getQueryTokenizer(SchemaField field,
Locale locale)
Get the underlaying tokenizer to use for tokenizing fields in the query workflow.
|
Phrase |
tokenize(SchemaField field,
Locale locale,
SearchTerm term)
Tokenizes
term into a Phrase for query processing. |
TokenList |
tokenize(SchemaField field,
Locale locale,
String value)
Tokenizes
value into a TokenList. |
Phrase |
tokenize(SchemaField field,
Locale locale,
TermRange range)
Tokenizes
range into a Phrase for query processing. |
void |
tokenize(SchemaField field,
Locale locale,
TokenList tokens)
Tokenizes all tokens in
tokens . |
Phrase |
tokenize(SchemaField field,
Locale locale,
WildcardTerm term)
Tokenizes a wildcard
term into a Phrase for query processing. |
static final String TOKENIZER_DEFAULT
Tokenizer getIngestTokenizer(SchemaField field, Locale locale) throws AttivioException
In general, this method should return this
. Tokenizers that route to
sub-tokenizers for handling different fields/locales should return the actual
tokenizer that will be used.
AttivioException
Tokenizer getQueryTokenizer(SchemaField field, Locale locale) throws AttivioException
In general, this method should return this
. Tokenizers that route to
sub-tokenizers for handling different fields/locales should return the actual
tokenizer that will be used.
AttivioException
void tokenize(SchemaField field, Locale locale, TokenList tokens) throws AttivioException
tokens
.field
- the schema field being tokenized (may be null)locale
- the Locale of the tokens (may be null)tokens
- the token listAttivioException
- on an unrecoverable errorTokenList tokenize(SchemaField field, Locale locale, String value) throws AttivioException
value
into a TokenList.field
- the schema field being tokenized (may be null)locale
- the Locale of the tokens (may be null)value
- the string to tokenizeAttivioException
- on an unrecoverable errorPhrase tokenize(SchemaField field, Locale locale, SearchTerm term) throws AttivioException
term
into a Phrase for query processing.field
- the schema field being tokenized (may be null)locale
- the Locale of the tokens (may be null)term
- the SearchTerm to tokenizeAttivioException
- on an unrecoverable errorPhrase tokenize(SchemaField field, Locale locale, WildcardTerm term) throws AttivioException
term
into a Phrase for query processing.field
- the schema field being tokenized (may be null)locale
- the Locale of the tokens (may be null)term
- the WildcardTerm to tokenizeAttivioException
- on an unrecoverable errorPhrase tokenize(SchemaField field, Locale locale, TermRange range) throws AttivioException
range
into a Phrase for query processing.field
- the schema field being tokenized (may be null)locale
- the Locale of the tokens (may be null)range
- the TermRange to tokenizeAttivioException
- on an unrecoverable errorCopyright © 2018 Attivio, Inc. All Rights Reserved.
PATENT NOTICE: Attivio, Inc. Software Related Patents. With respect to the Attivio software product(s) being used, the following patents apply: Querying Joined Data Within A Search Engine Index: United States Patent No.(s): 8,073,840. Ordered Processing of Groups of Messages: U.S. Patent No.(s) 8,495,656. Signal processing approach to sentiment analysis for entities in documents: U.S. Patent No.(s) 8,725,494. Other U.S. and International Patents Pending.