Module datarush.analytics
Package com.pervasive.datarush.analytics.text
Provides various unstructured text processing operators.
Class descriptions:
Calculates the n-gram frequencies for a tokenized text field.
Calculates the word frequencies for a tokenized text field.
Converts the case on a TokenizedText field.
Counts the number of tokens in a tokenized text field.
Filters a tokenized text field using a dictionary.
Expands text frequency field.
Expands a TokenizedText field.
Filters a tokenized text field.
Calculates the bag of words for a tokenized text field.
Implementation of an n-gram.
Implementation of an n-gram model.
A word break iterator that allows its default behavior for the Locale to be overridden by supplied regular expression rules.
A tree node that can hold information on text elements.
Definition of a text element.
Enumeration of the possible text and character groupings.
Filters a frequency map field.
Interface for a node in a TextContainer tree.
Stems a TokenizedText field.
Tokenizes a string field as a TokenizedText object.
Utility methods for operating on TextContainer objects.
A TextContainer that can store a tokenized paragraph.
A TextContainer that can store a tokenized sentence.
A TextContainer that can store a tokenized document.
A TextContainer that can store a tokenized word.
Implementation of a word frequency model.
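
The tokenizing and frequency-counting operators summarized above can be illustrated with a minimal plain-JDK sketch. It does not use the DataRush API: the class name TextFrequencySketch and the methods tokenize and ngramFrequencies are hypothetical names chosen for illustration, and the operators in this package may tokenize and count differently. The sketch only shows the general idea of splitting text with a locale-aware word break iterator and counting n-grams over the resulting tokens (n = 1 gives word frequencies).

```java
import java.text.BreakIterator;
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;

// Hypothetical illustration only; not part of com.pervasive.datarush.analytics.text.
public class TextFrequencySketch {

    // Split text into lower-cased word tokens using the JDK's
    // locale-aware word BreakIterator.
    static List<String> tokenize(String text, Locale locale) {
        BreakIterator words = BreakIterator.getWordInstance(locale);
        words.setText(text);
        List<String> tokens = new ArrayList<>();
        int start = words.first();
        for (int end = words.next(); end != BreakIterator.DONE;
                start = end, end = words.next()) {
            String candidate = text.substring(start, end);
            // Keep only segments that begin with a letter or digit,
            // which skips whitespace and punctuation segments.
            if (Character.isLetterOrDigit(candidate.codePointAt(0))) {
                tokens.add(candidate.toLowerCase(locale));
            }
        }
        return tokens;
    }

    // Count every run of n consecutive tokens; insertion order is preserved.
    static Map<String, Integer> ngramFrequencies(List<String> tokens, int n) {
        Map<String, Integer> counts = new LinkedHashMap<>();
        for (int i = 0; i + n <= tokens.size(); i++) {
            String gram = String.join(" ", tokens.subList(i, i + n));
            counts.merge(gram, 1, Integer::sum);
        }
        return counts;
    }

    public static void main(String[] args) {
        List<String> tokens = tokenize("The cat sat. The cat ran.", Locale.ENGLISH);
        System.out.println(ngramFrequencies(tokens, 1)); // word frequencies
        System.out.println(ngramFrequencies(tokens, 2)); // bigram frequencies
    }
}
```

Because this sketch flattens the text into a single token list, bigrams can span sentence boundaries. A tree of paragraph, sentence, and word containers, such as the one the descriptions above suggest, would make it possible to count within each sentence instead.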