Uses of Class
com.pervasive.datarush.annotations.OperatorDescription
Packages that use OperatorDescription
Package | Description
com.actian.dataflow.operators.io.orc |
com.actian.dataflow.operators.io.parquet |
com.pervasive.datarush.analytics.arm | Provides common classes for Association Rule Mining (ARM).
com.pervasive.datarush.analytics.arm.fpgrowth | Provides the operator to perform the FP-growth ARM algorithm.
com.pervasive.datarush.analytics.cleansing | Provides operators related to data cleansing.
com.pervasive.datarush.analytics.cluster.kmeans | Provides the KMeans algorithm.
com.pervasive.datarush.analytics.decisiontree.learner | Provides the PMML learner operator and associated classes.
com.pervasive.datarush.analytics.decisiontree.predictor | Provides the decision tree predictor operator and associated classes.
com.pervasive.datarush.analytics.decisiontree.pruner | Provides the decision tree pruner operator and associated classes.
com.pervasive.datarush.analytics.knn | Provides an implementation of the KNN algorithm using DataRush's sparse data API.
com.pervasive.datarush.analytics.naivebayes.learner | Provides an implementation of the Naive Bayes learner.
com.pervasive.datarush.analytics.naivebayes.predictor | Provides an implementation of a Naive Bayes predictor.
com.pervasive.datarush.analytics.r |
com.pervasive.datarush.analytics.regression | Provides utility, PMML and other classes for shared use by regression related entities.
com.pervasive.datarush.analytics.stats | Provides various statistics, Data Summarizer, and Data Quality Analyzer.
com.pervasive.datarush.analytics.svm.predictor | Provides an implementation of an SVM predictor.
com.pervasive.datarush.analytics.text | Provides various unstructured text processing operators.
com.pervasive.datarush.analytics.viz | Provides operators for classifier performance visualization.
com.pervasive.datarush.hbase |
com.pervasive.datarush.matching | Provides operators for discovering duplicates or links between records.
com.pervasive.datarush.matching.cluster | Provides operators for clustering the results of duplicate or linkage discovery.
com.pervasive.datarush.operators.assertion | Provides operators for making assertions on flows and files.
com.pervasive.datarush.operators.group | Provides data aggregation components.
com.pervasive.datarush.operators.io | Provides base file I/O components including encoders and decoders.
com.pervasive.datarush.operators.io.avro | Provides operators for reading and writing files in Avro format.
com.pervasive.datarush.operators.io.binary |
com.pervasive.datarush.operators.io.jdbc | Provides operators for reading from JDBC sources and writing to JDBC targets.
com.pervasive.datarush.operators.io.mdf |
com.pervasive.datarush.operators.io.staging | Provides operators for reading and writing DataRush staging datasets.
com.pervasive.datarush.operators.io.textfile | Provides operators for reading and writing text data.
com.pervasive.datarush.operators.io.vectorwise |
com.pervasive.datarush.operators.io.vectorwise.dl |
com.pervasive.datarush.operators.join | Provides operators for joining together two data sets into a single one.
com.pervasive.datarush.operators.model | Provides operators for handling models.
com.pervasive.datarush.operators.partition | Provides operators for partitioning and unpartitioning flows of data.
com.pervasive.datarush.operators.record | Provides operators for manipulating record structure.
com.pervasive.datarush.operators.scripting | Provides the RunScript operator for running user-defined scripts on the rows of an input record flow.
com.pervasive.datarush.operators.select | Provides operators for selecting a subset of the data set.
com.pervasive.datarush.operators.sink | Provides the LogRows operator for writing debugging information about a flow to the logging API.
com.pervasive.datarush.operators.sort | Provides operators for sorting and manipulating sorted flows.
com.pervasive.datarush.operators.source | Provides operators for generating data tokens in various ways.
com.pervasive.datarush.operators.string | Provides operators for operating on string values in records.
Uses of OperatorDescription in com.actian.dataflow.operators.io.orc
Classes in com.actian.dataflow.operators.io.orc with annotations of type OperatorDescription
Modifier and Type | Class | Description
class | ReadORC |
class | WriteORC | Write data in the Apache Hive ORC format.
Uses of OperatorDescription in com.actian.dataflow.operators.io.parquet
Classes in com.actian.dataflow.operators.io.parquet with annotations of type OperatorDescription
Modifier and Type | Class | Description
class | ReadParquet | Reads data previously written using Apache Parquet format by Apache Hive.
Uses of OperatorDescription in com.pervasive.datarush.analytics.arm
Classes in com.pervasive.datarush.analytics.arm with annotations of type OperatorDescription
Modifier and Type | Class | Description
class | ConvertARMModel | An operator that converts an association model in PMML into a target format.
class | FrequentItems | Compute the frequent items within the given transactions.
Uses of OperatorDescription in com.pervasive.datarush.analytics.arm.fpgrowth
Classes in com.pervasive.datarush.analytics.arm.fpgrowth with annotations of type OperatorDescription
Modifier and Type | Class | Description
class | FPGrowth | An operator that implements the FP-growth algorithm, outputting a PMML model containing generated item sets and association rules.
Uses of OperatorDescription in com.pervasive.datarush.analytics.cleansing
Classes in com.pervasive.datarush.analytics.cleansing with annotations of type OperatorDescription
Modifier and Type | Class | Description
class | ReplaceMissingValues | Replace missing values in the input data according to the given replacement specifications.
Uses of OperatorDescription in com.pervasive.datarush.analytics.cluster.kmeans
Classes in com.pervasive.datarush.analytics.cluster.kmeans with annotations of type OperatorDescription
Modifier and Type | Class | Description
class | KMeans | Computes a clustering model for the given input based on the k-Means algorithm.
Uses of OperatorDescription in com.pervasive.datarush.analytics.decisiontree.learner
Classes in com.pervasive.datarush.analytics.decisiontree.learner with annotations of type OperatorDescription
Modifier and Type | Class | Description
class | DecisionTreeLearner | Operator responsible for constructing a Decision Tree.
Uses of OperatorDescription in com.pervasive.datarush.analytics.decisiontree.predictor
Classes in com.pervasive.datarush.analytics.decisiontree.predictor with annotations of type OperatorDescription
Modifier and Type | Class | Description
class | DecisionTreePredictor | Operator responsible for predicting outcomes based on a Decision Tree PMML model.
Uses of OperatorDescription in com.pervasive.datarush.analytics.decisiontree.pruner
Classes in com.pervasive.datarush.analytics.decisiontree.pruner with annotations of type OperatorDescription
Modifier and Type | Class | Description
class | DecisionTreePruner | Performs pruning of the provided input model.
Uses of OperatorDescription in com.pervasive.datarush.analytics.knn
Classes in com.pervasive.datarush.analytics.knn with annotations of type OperatorDescription
Modifier and Type | Class | Description
class | KNNClassifier | Applies the K-nearest neighbor algorithm to classify input data against an already classified set of example data.
Uses of OperatorDescription in com.pervasive.datarush.analytics.naivebayes.learner
Classes in com.pervasive.datarush.analytics.naivebayes.learner with annotations of type OperatorDescription
Modifier and Type | Class | Description
class | NaiveBayesLearner | Operator responsible for building a Naive Bayes PMML model from input data.
Uses of OperatorDescription in com.pervasive.datarush.analytics.naivebayes.predictor
Classes in com.pervasive.datarush.analytics.naivebayes.predictor with annotations of type OperatorDescription
Modifier and Type | Class | Description
class | NaiveBayesPredictor | Operator responsible for predicting outcomes based on a Naive Bayes PMML model.
Uses of OperatorDescription in com.pervasive.datarush.analytics.r
Classes in com.pervasive.datarush.analytics.r with annotations of type OperatorDescription
Modifier and Type | Class | Description
class | RunRScript | Execute an R script in flow.
Uses of OperatorDescription in com.pervasive.datarush.analytics.regression
Classes in com.pervasive.datarush.analytics.regression with annotations of type OperatorDescription
Modifier and Type | Class | Description
class | LinearRegressionLearner | Performs a multivariate linear regression on the given training data.
class | SumOfSquares | Compute the sum of squares for the given fields of the input data.
Uses of OperatorDescription in com.pervasive.datarush.analytics.stats
Classes in com.pervasive.datarush.analytics.stats with annotations of type OperatorDescription
Modifier and Type | Class | Description
class | CountRanges | Determines which range each value in a field falls within and counts the totals.
class | DataQualityAnalyzer | Evaluates a set of quality tests on an input dataset.
class | DistinctValues | Calculates distinct values of the given input field.
class | MostFrequentValues | Compute the most frequent values within the given fields.
class | NormalizeValues | Apply normalization methods to fields within an input data flow.
class | Rank | Rank data using the given rank mode.
class | SummaryStatistics | Discovers various metrics of an input dataset, based on the configured detail level.
Uses of OperatorDescription in com.pervasive.datarush.analytics.svm.predictor
Classes in com.pervasive.datarush.analytics.svm.predictor with annotations of type OperatorDescription
Modifier and Type | Class | Description
class | SVMPredictor | Operator responsible for classification based on an SVM PMML model.
Uses of OperatorDescription in com.pervasive.datarush.analytics.text
Classes in com.pervasive.datarush.analytics.text with annotations of type OperatorDescription
Modifier and Type | Class | Description
class | CalculateNGramFrequency | Calculates the n-gram frequencies for a tokenized text field.
class | CalculateWordFrequency | Calculates the word frequencies for a tokenized text field.
class | ConvertTextCase | Converts the case on a TokenizedText field.
class | CountTokens | Counts the number of tokens in a tokenized text field.
class | DictionaryFilter | Filters a tokenized text field using a dictionary.
class | ExpandTextFrequency | Expands a text frequency field.
class | ExpandTextTokens | Expands a TokenizedText field.
class | FilterText | Filters a tokenized text field.
class | GenerateBagOfWords | Calculates the bag of words for a tokenized text field.
class | TextFrequencyFilter | Filters a frequency map field.
class | TextStemmer | Stems a TokenizedText field.
class | TextTokenizer | Tokenizes a string field as a TokenizedText object.
Uses of OperatorDescription in com.pervasive.datarush.analytics.viz
Classes in com.pervasive.datarush.analytics.viz with annotations of type OperatorDescription
Modifier and Type | Class | Description
class | DrawDiagnosticsChart | This operator takes the output of one or multiple predictors and uses the confidence values produced by these predictors along with the actual target values ("true class") to produce diagnostic charts.
Uses of OperatorDescription in com.pervasive.datarush.hbase
Classes in com.pervasive.datarush.hbase with annotations of type OperatorDescription
Modifier and Type | Class | Description
class | DeleteHBase | Write delete markers to HBase.
class | ReadHBase | Read a result set from HBase.
class | WriteHBase | Write a result set to HBase.
Uses of OperatorDescription in com.pervasive.datarush.matching
Classes in com.pervasive.datarush.matching with annotations of type OperatorDescription
Modifier and Type | Class | Description
class | DiscoverDuplicates | Discover duplicate records within a single source using fuzzy matching operators.
class | DiscoverLinks | Use fuzzy matching operators to discover linked records from two data sources.
Uses of OperatorDescription in com.pervasive.datarush.matching.cluster
Classes in com.pervasive.datarush.matching.cluster with annotations of type OperatorDescription
Modifier and Type | Class | Description
class | ClusterDuplicates | Transform record pairs into clusters of like records, where the two sides of the pair are from the same source.
class | ClusterLinks | Transform record pairs into clusters of like records.
Uses of OperatorDescription in com.pervasive.datarush.operators.assertion
Classes in com.pervasive.datarush.operators.assertion with annotations of type OperatorDescription
Modifier and Type | Class | Description
class | AssertEqual | Verifies that actual rows are equal to expected rows.
class | AssertEqualHash | Verifies that actual rows are equal to expected rows without regard to order.
class | AssertEqualTypes | Asserts that two input flows have identical types.
class | AssertMetadata | Assert that the metadata on the input port is set correctly.
class | AssertPredicate | Assert that the given predicate is true for all input values.
class | AssertRowCount | Verifies that the input flow contains the specified row count.
class | AssertSorted | Verifies that the input data is sorted by the given set of keys.
Uses of OperatorDescription in com.pervasive.datarush.operators.group
Classes in com.pervasive.datarush.operators.group with annotations of type OperatorDescription
Modifier and Type | Class | Description
class | Group | Performs grouping (aggregation) of sorted input data.
class | RemoveDuplicates | Removes duplicate rows based on a specified set of group keys.
Uses of OperatorDescription in com.pervasive.datarush.operators.io
Classes in com.pervasive.datarush.operators.io with annotations of type OperatorDescription
Modifier and Type | Class | Description
class | ReadSource | Reads a data source as a stream of records.
class | WriteSink | Writes a stream of records to a data sink.
Uses of OperatorDescription in com.pervasive.datarush.operators.io.avro
Classes in com.pervasive.datarush.operators.io.avro with annotations of type OperatorDescription
Modifier and Type | Class | Description
class | ReadAvro | Reads data previously written using Apache Avro format.
class | WriteAvro | Writes data using Apache Avro format.
Uses of OperatorDescription in com.pervasive.datarush.operators.io.binary
Classes in com.pervasive.datarush.operators.io.binary with annotations of type OperatorDescription
Modifier and Type | Class | Description
class | BinaryWriter | Writes raw binary data to a filesystem.
Uses of OperatorDescription in com.pervasive.datarush.operators.io.jdbc
Classes in com.pervasive.datarush.operators.io.jdbc with annotations of type OperatorDescription
Modifier and Type | Class | Description
class | DeleteFromJDBC | This operator deletes data in the target table in a database by applying SQL delete statements.
class | ReadFromJDBC | The ReadFromJDBC operator is used to access relational database systems using a supplied JDBC driver.
class | UpdateInJDBC | This operator updates the target table in a database by applying SQL update statements.
class | WriteToJDBC | In its simplest form, writes records from an input port to a JDBC target table using insert statements.
Uses of OperatorDescription in com.pervasive.datarush.operators.io.mdf
Classes in com.pervasive.datarush.operators.io.mdf with annotations of type OperatorDescription
Modifier and Type | Class | Description
class | ReadMDF | Reads data previously written using MDF format.
Uses of OperatorDescription in com.pervasive.datarush.operators.io.staging
Classes in com.pervasive.datarush.operators.io.staging with annotations of type OperatorDescription
Modifier and Type | Class | Description
class | ForceRecordStaging | Forces staging of record ports.
class | ReadStagingDataset | Reads a sequence of records previously staged to disk.
class | WriteStagingDataset | Writes a sequence of records to disk in an internal format for staged data.
Uses of OperatorDescription in com.pervasive.datarush.operators.io.textfile
Classes in com.pervasive.datarush.operators.io.textfile with annotations of type OperatorDescription
Modifier and Type | Class | Description
class | ParseTextFields | Parses input text records according to a specified text schema.
class | ReadARFF | Read files in the Attribute-Relation File Format (ARFF).
class | ReadDelimitedText | Reads a text file of delimited records as record tokens.
class | ReadFixedText | Reads a text file of fixed-width records as record tokens.
class | ReadJSON | The ReadJSON operator reads a JSON file of key-value pairs or array of objects as record tokens.
class | ReadLog | Reads a log file as record tokens.
class | WriteARFF | Write files using the Attribute-Relation File Format (ARFF).
class | WriteDelimitedText | Writes a stream of records as delimited text.
class | WriteFixedText | Writes a record dataflow as a text file of fixed-width records.
Uses of OperatorDescription in com.pervasive.datarush.operators.io.vectorwise
Classes in com.pervasive.datarush.operators.io.vectorwise with annotations of type OperatorDescription
Modifier and Type | Class | Description
class | LoadActianVector | Bulk load data into the Actian Vector database.
class | LoadVectorOnHadoop | Deprecated. This operator has been replaced with LoadActianVector; use that operator instead.
Uses of OperatorDescription in com.pervasive.datarush.operators.io.vectorwise.dl
Classes in com.pervasive.datarush.operators.io.vectorwise.dl with annotations of type OperatorDescription
Modifier and Type | Class | Description
class | LoadVectorOnHadoopDirect | Deprecated. This operator has been replaced with LoadActianVector; use that operator instead.
Uses of OperatorDescription in com.pervasive.datarush.operators.join
Classes in com.pervasive.datarush.operators.join with annotations of type OperatorDescription
Modifier and Type | Class | Description
class | CrossJoin | Produce the cartesian product of two sets of records.
class | FilterExistingRows | Filters records on the left based on the presence of matching records on the right.
class | Join | Performs a relational equi-join on two input datasets by a specified set of keys.
class | SemiJoin | Deprecated. This operator has been replaced with FilterExistingRows; use that operator instead, linking to the appropriate output port.
Uses of OperatorDescription in com.pervasive.datarush.operators.model
Classes in com.pervasive.datarush.operators.model with annotations of type OperatorDescription
Modifier and Type | Class | Description
class | GetModel<T> | Provides a way to update an in-memory reference to a model object.
class | PutModel<T> | Provides a way to inject an in-memory reference to a model object into a graph.
Uses of OperatorDescription in com.pervasive.datarush.operators.partition
Classes in com.pervasive.datarush.operators.partition with annotations of type OperatorDescription
Modifier and Type | Class | Description
class | GatherHint | Forces parallel streams of data to be gathered into a single non-parallel stream.
class | PartitionHint | Forces the input data to be partitioned into parallel streams of data for subsequent parallel operations.
Uses of OperatorDescription in com.pervasive.datarush.operators.record
Classes in com.pervasive.datarush.operators.record with annotations of type OperatorDescription
Modifier and Type | Class | Description
class | ColumnsToRows | Normalize records by transposing values from row columns into multiple rows.
class | DeriveFields | Applies one or more functions to the input record data.
class | MergeFields | Merges two streams of data with an equivalent number of rows into one.
class | RemapFields | Rearranges and renames fields in a record.
class | RemoveFields | Removes a subset of fields from the input records.
class | RetainFields | Preserves a subset of fields from the input records.
class | RowsToColumns | The RowsToColumns operator is used to pivot data from a narrow representation (rows) into a wider representation (columns).
class | SelectFields | Preserves a subset of fields from the input records.
Uses of OperatorDescription in com.pervasive.datarush.operators.scripting
Classes in com.pervasive.datarush.operators.scripting with annotations of type OperatorDescription
Modifier and Type | Class | Description
class | RunScript | Processes rows using user-defined scripts.
Uses of OperatorDescription in com.pervasive.datarush.operators.select
Classes in com.pervasive.datarush.operators.select with annotations of type OperatorDescription
Modifier and Type | Class | Description
class | FilterRows | Filters records based on a specified predicate.
class | LimitRows | Truncates a flow to a fixed number of records.
class | SampleRandomRows | Apply random sampling to the input data.
Uses of OperatorDescription in com.pervasive.datarush.operators.sink
Classes in com.pervasive.datarush.operators.sink with annotations of type OperatorDescription
Modifier and Type | Class | Description
class | CollectRecords | Collects input data into an in-memory token list.
class | LogRows | Log information about the input data from a flow.
Uses of OperatorDescription in com.pervasive.datarush.operators.sort
Classes in com.pervasive.datarush.operators.sort with annotations of type OperatorDescription
Modifier and Type | Class | Description
class | Sort | Sorts the input data.
Uses of OperatorDescription in com.pervasive.datarush.operators.source
Classes in com.pervasive.datarush.operators.source with annotations of type OperatorDescription
Modifier and Type | Class | Description
class | EmitRecords | Emits an in-memory token list as output.
class | GenerateArithmeticSequence | Generates a sequence of numerical values, with a constant difference between consecutive values.
class | GenerateConstant | Generates copies of a constant value.
class | GenerateRandom | Generates rows of random data.
Uses of OperatorDescription in com.pervasive.datarush.operators.string
Classes in com.pervasive.datarush.operators.string with annotations of type OperatorDescription
Modifier and Type | Class | Description
class | SplitField | Splits a string field into multiple fields, based on a specified pattern.
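A listing like this one can in principle be reproduced at runtime by checking candidate classes for an annotation through reflection. The sketch below is illustrative only: `Described`, `DemoSort`, and `DemoHelper` are hypothetical stand-ins, because this page does not show the attributes or retention policy of `OperatorDescription`. The approach works only for annotations declared with runtime retention.

```java
import java.lang.annotation.ElementType;
import java.lang.annotation.Retention;
import java.lang.annotation.RetentionPolicy;
import java.lang.annotation.Target;
import java.util.ArrayList;
import java.util.List;

public class AnnotatedOperatorListing {

    // Hypothetical stand-in for OperatorDescription. The real annotation's
    // attributes and retention policy are not documented on this page.
    @Retention(RetentionPolicy.RUNTIME)
    @Target(ElementType.TYPE)
    @interface Described {
        String value() default "";
    }

    // Hypothetical annotated class, playing the role of an operator.
    @Described("Sorts the input data.")
    static class DemoSort {}

    // Hypothetical class without the annotation.
    static class DemoHelper {}

    // Returns the simple names of the candidates that carry @Described.
    static List<String> describedClassNames(Class<?>... candidates) {
        List<String> names = new ArrayList<>();
        for (Class<?> candidate : candidates) {
            if (candidate.isAnnotationPresent(Described.class)) {
                names.add(candidate.getSimpleName());
            }
        }
        return names;
    }

    public static void main(String[] args) {
        // Only DemoSort is annotated, so only its name is printed.
        System.out.println(describedClassNames(DemoSort.class, DemoHelper.class));
    }
}
```

Against the real library, the same check would take `OperatorDescription.class` as the argument to `isAnnotationPresent`, assuming the annotation is retained at runtime.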