Uses of Class
com.pervasive.datarush.analytics.text.TextContainer
-
Packages that use TextContainer Package Description com.pervasive.datarush.analytics.text Provides various unstructured text processing operators.com.pervasive.datarush.analytics.text.filter -
-
Uses of TextContainer in com.pervasive.datarush.analytics.text
Subclasses of TextContainer in com.pervasive.datarush.analytics.text Modifier and Type Class Description class
TokenizedParagraph
A TextContainer that can store a tokenized paragraph.class
TokenizedSentence
A TextContainer that can store a tokenized sentence.class
TokenizedText
A TextContainer that can store a tokenized document.class
TokenizedWord
A TextContainer that can store a tokenized word.Fields in com.pervasive.datarush.analytics.text declared as TextContainer Modifier and Type Field Description protected TextContainer
TextContainer. nextSibling
protected TextContainer
TextContainer. parent
protected TextContainer
TextContainer. prevSibling
Fields in com.pervasive.datarush.analytics.text with type parameters of type TextContainer Modifier and Type Field Description protected List<TextContainer>
TextContainer. children
Methods in com.pervasive.datarush.analytics.text that return TextContainer Modifier and Type Method Description static TextContainer
TextTokenUtil. createTreeFromList(List<TextContainer> nodes)
Creates a TextContainer from a list of TextContainer nodes.static TextContainer
TextTokenUtil. createTreeFromString(String textTokens)
Methods in com.pervasive.datarush.analytics.text that return types with arguments of type TextContainer Modifier and Type Method Description List<TextContainer>
TextContainer. getChildren()
List<TextContainer>
TextNode. getChildren()
Get the ordered list of direct children of this node.ListIterator<TextContainer>
TextContainer. getIterator()
ListIterator<TextContainer>
TextContainer. getIterator(TextElementType type)
ListIterator<TextContainer>
TextNode. getIterator()
Get an iterator over all descendants of this node including this node.ListIterator<TextContainer>
TextNode. getIterator(TextElementType type)
Get an iterator over the node and all descendants of this node of the specified type.ListIterator<TextContainer>
TextContainer. getPostIterator()
ListIterator<TextContainer>
TextContainer. getPostIterator(TextElementType type)
Methods in com.pervasive.datarush.analytics.text with parameters of type TextContainer Modifier and Type Method Description static NGramMap
TextTokenUtil. calcNGramFreq(TextContainer text, int n)
Creates an n-gram frequency model based on the contents of the TextContainer.static NGramMap
TextTokenUtil. calcNGramFreq(TextContainer text, int n, Set<NGram> nGramSet)
Creates an n-gram frequency model containing the specified set of terms based on the contents of the TextContainer.static WordMap
TextTokenUtil. calcWordFreq(TextContainer text)
Creates a term frequency model based on the contents of the TextContainer.static WordMap
TextTokenUtil. calcWordFreq(TextContainer text, Set<String> wordSet)
Creates a term frequency model containing the specified set of terms based on the contents of the TextContainer.static int
TextTokenUtil. countElementType(TextContainer text, TextElementType type)
Counts the number of elements of a specific type in the TextContainer.static Set<String>
TextTokenUtil. genBagOfWords(TextContainer text)
Creates a bag of words based on the contents of the TextContainer.static List<NGram>
TextTokenUtil. generateNGramList(TextContainer text, int n)
Lists the unique n-grams contained in the TextContainer.static List<String>
TextTokenUtil. generateWordList(TextContainer text)
Lists the unique words contained in the TextContainer.Method parameters in com.pervasive.datarush.analytics.text with type arguments of type TextContainer Modifier and Type Method Description static TextContainer
TextTokenUtil. createTreeFromList(List<TextContainer> nodes)
Creates a TextContainer from a list of TextContainer nodes.Constructor parameters in com.pervasive.datarush.analytics.text with type arguments of type TextContainer Constructor Description TextContainer(TextElementType type, List<? extends TextContainer> children)
Construct a container of element type with the specified children.TokenizedParagraph(List<? extends TextContainer> tokens)
Create a tokenized paragraph container.TokenizedSentence(List<? extends TextContainer> tokens)
Create a tokenized sentence container.TokenizedText(List<? extends TextContainer> tokens)
Create a tokenized document container. -
Uses of TextContainer in com.pervasive.datarush.analytics.text.filter
Methods in com.pervasive.datarush.analytics.text.filter that return TextContainer Modifier and Type Method Description TextContainer
AbstractTextFilter. filterText(TextContainer text)
TextContainer
LengthFilter. filterText(TextContainer text)
TextContainer
PunctuationFilter. filterText(TextContainer text)
TextContainer
RegexFilter. filterText(TextContainer text)
TextContainer
TextElementFilter. filterText(TextContainer text)
TextContainer
TextFilter. filterText(TextContainer text)
Returns the tokenized text with the filtered tokens removed.TextContainer
WordFilter. filterText(TextContainer text)
Methods in com.pervasive.datarush.analytics.text.filter with parameters of type TextContainer Modifier and Type Method Description TextContainer
AbstractTextFilter. filterText(TextContainer text)
TextContainer
LengthFilter. filterText(TextContainer text)
TextContainer
PunctuationFilter. filterText(TextContainer text)
TextContainer
RegexFilter. filterText(TextContainer text)
TextContainer
TextElementFilter. filterText(TextContainer text)
TextContainer
TextFilter. filterText(TextContainer text)
Returns the tokenized text with the filtered tokens removed.TextContainer
WordFilter. filterText(TextContainer text)
-