Uses of Class
com.pervasive.datarush.annotations.OperatorDescription
-
Packages that use OperatorDescription Package Description com.actian.dataflow.operators.io.orc com.actian.dataflow.operators.io.parquet com.pervasive.datarush.analytics.arm Provides common classes for Association Rule Mining (ARM).com.pervasive.datarush.analytics.arm.fpgrowth Provides the operator to perform the FP-growth ARM algorithm.com.pervasive.datarush.analytics.cleansing Provides operators related to data cleansing.com.pervasive.datarush.analytics.cluster.kmeans Provides the KMeans algorithm.com.pervasive.datarush.analytics.decisiontree.learner Provides the PMML learner operator and associated classes.com.pervasive.datarush.analytics.decisiontree.predictor Provides the decision tree predictor operator and associated classes.com.pervasive.datarush.analytics.decisiontree.pruner Provides the decision tree pruner operator and associated classes.com.pervasive.datarush.analytics.knn Provides an implementation of the KNN algorithm using DataRush's sparse data API.com.pervasive.datarush.analytics.naivebayes.learner Provides an implementation of the Naive Bayes learner.com.pervasive.datarush.analytics.naivebayes.predictor Provides an implementation of a Naive Bayes predictor.com.pervasive.datarush.analytics.r com.pervasive.datarush.analytics.regression Provides utility, PMML and other classes for shared use by regression related entities.com.pervasive.datarush.analytics.stats Provides various statistics, Data Summarizer, and Data Quality Analyzer.com.pervasive.datarush.analytics.svm.predictor Provides an implementation of an SVM predictor.com.pervasive.datarush.analytics.text Provides various unstructured text processing operators.com.pervasive.datarush.analytics.viz Provides operators for classifier performance visualization.com.pervasive.datarush.hbase com.pervasive.datarush.matching Provides operators for performing discovering duplicates or links between records.com.pervasive.datarush.matching.cluster Provides operators for clustering the results of duplicate or linkage discovery.com.pervasive.datarush.operators.assertion Provides operators for making assertions on flows and files.com.pervasive.datarush.operators.group Provides data aggregation components.com.pervasive.datarush.operators.io Provides base file I/O components including encoders and decoders.com.pervasive.datarush.operators.io.avro Provides operators for reading and writing files in Avro format.com.pervasive.datarush.operators.io.binary com.pervasive.datarush.operators.io.jdbc Provides operators for reading from JDBC sources and writing to JDBC targets.com.pervasive.datarush.operators.io.mdf com.pervasive.datarush.operators.io.staging Provides operators for reading and writing DataRush staging datasets.com.pervasive.datarush.operators.io.textfile Provides operators for reading and writing text data.com.pervasive.datarush.operators.io.vectorwise com.pervasive.datarush.operators.io.vectorwise.dl com.pervasive.datarush.operators.join Provides operators for joining together two data sets into a single one.com.pervasive.datarush.operators.model Provides operators for handling models.com.pervasive.datarush.operators.partition Provides operators for partitioning and unpartitioning flows of data.com.pervasive.datarush.operators.record Provides operators for manipulating record structure.com.pervasive.datarush.operators.scripting Provides theRunScriptoperator for running user-defined scripts on the rows of an input record flow.com.pervasive.datarush.operators.select Provides operators for selecting a subset of the data set.com.pervasive.datarush.operators.sink Provides theLogRowsoperator for writing debugging information about a flow to the logging API.com.pervasive.datarush.operators.sort Provides operators for sorting and manipulating sorted flows.com.pervasive.datarush.operators.source Provides operators for generating data tokens in various ways.com.pervasive.datarush.operators.string Provides operators for operating on string values in records. -
-
Uses of OperatorDescription in com.actian.dataflow.operators.io.orc
Classes in com.actian.dataflow.operators.io.orc with annotations of type OperatorDescription Modifier and Type Class Description classReadORCclassWriteORCWrite data in the Apache Hive ORC format. -
Uses of OperatorDescription in com.actian.dataflow.operators.io.parquet
Classes in com.actian.dataflow.operators.io.parquet with annotations of type OperatorDescription Modifier and Type Class Description classReadParquetReads data previously written using Apache Parquet format by Apache Hive. -
Uses of OperatorDescription in com.pervasive.datarush.analytics.arm
Classes in com.pervasive.datarush.analytics.arm with annotations of type OperatorDescription Modifier and Type Class Description classConvertARMModelAn operator that converts an association model in PMML into a target format.classFrequentItemsCompute the frequent items within the given transactions. -
Uses of OperatorDescription in com.pervasive.datarush.analytics.arm.fpgrowth
Classes in com.pervasive.datarush.analytics.arm.fpgrowth with annotations of type OperatorDescription Modifier and Type Class Description classFPGrowthAn operator that implements the FP-growth algorithm, outputting a PMML model containing generated items sets and association rules. -
Uses of OperatorDescription in com.pervasive.datarush.analytics.cleansing
Classes in com.pervasive.datarush.analytics.cleansing with annotations of type OperatorDescription Modifier and Type Class Description classReplaceMissingValuesReplace missing values in the input data according to the given replacement specifications. -
Uses of OperatorDescription in com.pervasive.datarush.analytics.cluster.kmeans
Classes in com.pervasive.datarush.analytics.cluster.kmeans with annotations of type OperatorDescription Modifier and Type Class Description classKMeansComputes clustering model for the given input based on the k-Means algorithm. -
Uses of OperatorDescription in com.pervasive.datarush.analytics.decisiontree.learner
Classes in com.pervasive.datarush.analytics.decisiontree.learner with annotations of type OperatorDescription Modifier and Type Class Description classDecisionTreeLearnerOperator responsible for constructing a Decision Tree. -
Uses of OperatorDescription in com.pervasive.datarush.analytics.decisiontree.predictor
Classes in com.pervasive.datarush.analytics.decisiontree.predictor with annotations of type OperatorDescription Modifier and Type Class Description classDecisionTreePredictorOperator responsible for predicting outcomes based on a Decision Tree PMML model. -
Uses of OperatorDescription in com.pervasive.datarush.analytics.decisiontree.pruner
Classes in com.pervasive.datarush.analytics.decisiontree.pruner with annotations of type OperatorDescription Modifier and Type Class Description classDecisionTreePrunerPerforms pruning of the provided input model. -
Uses of OperatorDescription in com.pervasive.datarush.analytics.knn
Classes in com.pervasive.datarush.analytics.knn with annotations of type OperatorDescription Modifier and Type Class Description classKNNClassifierApplies the K-nearest neighbor algorithm to classify input data against an already classified set of example data. -
Uses of OperatorDescription in com.pervasive.datarush.analytics.naivebayes.learner
Classes in com.pervasive.datarush.analytics.naivebayes.learner with annotations of type OperatorDescription Modifier and Type Class Description classNaiveBayesLearnerOperator responsible for building a Naive Bayes PMML model from input data. -
Uses of OperatorDescription in com.pervasive.datarush.analytics.naivebayes.predictor
Classes in com.pervasive.datarush.analytics.naivebayes.predictor with annotations of type OperatorDescription Modifier and Type Class Description classNaiveBayesPredictorOperator responsible for predicting outcomes based on a Naive Bayes PMML model. -
Uses of OperatorDescription in com.pervasive.datarush.analytics.r
Classes in com.pervasive.datarush.analytics.r with annotations of type OperatorDescription Modifier and Type Class Description classRunRScriptExecute an R script in flow. -
Uses of OperatorDescription in com.pervasive.datarush.analytics.regression
Classes in com.pervasive.datarush.analytics.regression with annotations of type OperatorDescription Modifier and Type Class Description classLinearRegressionLearnerPerforms a multivariate linear regression on the given training data.classSumOfSquaresCompute the sum of squares for the given fields of the input data. -
Uses of OperatorDescription in com.pervasive.datarush.analytics.stats
Classes in com.pervasive.datarush.analytics.stats with annotations of type OperatorDescription Modifier and Type Class Description classCountRangesDetermines which range each value in a field falls within and counts the totals.classDataQualityAnalyzerEvaluates a set of quality tests on an input dataset.classDistinctValuesCalculates distinct values of the given input field.classMostFrequentValuesCompute the most frequent values within the given fields.classNormalizeValuesApply normalization methods to fields within an input data flow.classRankRank data using the given rank mode.classSummaryStatisticsDiscovers various metrics of an input dataset, based on the configured detail level. -
Uses of OperatorDescription in com.pervasive.datarush.analytics.svm.predictor
Classes in com.pervasive.datarush.analytics.svm.predictor with annotations of type OperatorDescription Modifier and Type Class Description classSVMPredictorOperator responsible for classification based on a SVM PMML model. -
Uses of OperatorDescription in com.pervasive.datarush.analytics.text
Classes in com.pervasive.datarush.analytics.text with annotations of type OperatorDescription Modifier and Type Class Description classCalculateNGramFrequencyCalculates the n-gram frequencies for a tokenized text field.classCalculateWordFrequencyCalculates the word frequencies for a tokenized text field.classConvertTextCaseConverts the case on a TokenizedText field.classCountTokensCounts the number of tokens in a tokenized text field.classDictionaryFilterFilters a tokenized text field using a dictionary.classExpandTextFrequencyExpands text frequency field.classExpandTextTokensExpands a TokenizedText field.classFilterTextFilters a tokenized text field.classGenerateBagOfWordsCalculates the bag of words for a tokenized text field.classTextFrequencyFilterFilters a frequency map field.classTextStemmerStems a TokenizedText field.classTextTokenizerTokenizes a string field as a TokenizedText object. -
Uses of OperatorDescription in com.pervasive.datarush.analytics.viz
Classes in com.pervasive.datarush.analytics.viz with annotations of type OperatorDescription Modifier and Type Class Description classDrawDiagnosticsChartThis operator takes the output of one or multiple predictors and uses the confidence values produced by these predictors along with the actual target values ("true class") to produce diagnostic charts. -
Uses of OperatorDescription in com.pervasive.datarush.hbase
Classes in com.pervasive.datarush.hbase with annotations of type OperatorDescription Modifier and Type Class Description classDeleteHBaseWrite delete markers to HBaseclassReadHBaseRead a result set from HBase.classWriteHBaseWrite a result set to HBase. -
Uses of OperatorDescription in com.pervasive.datarush.matching
Classes in com.pervasive.datarush.matching with annotations of type OperatorDescription Modifier and Type Class Description classDiscoverDuplicatesDiscover duplicate records within a single source using fuzzy matching operators.classDiscoverLinksUse fuzzy matching operators to discover linked records from two data sources. -
Uses of OperatorDescription in com.pervasive.datarush.matching.cluster
Classes in com.pervasive.datarush.matching.cluster with annotations of type OperatorDescription Modifier and Type Class Description classClusterDuplicatesTransform record pairs into clusters of like records, where the two sides of the pair are from the same source.classClusterLinksTransform record pairs into clusters of like records. -
Uses of OperatorDescription in com.pervasive.datarush.operators.assertion
Classes in com.pervasive.datarush.operators.assertion with annotations of type OperatorDescription Modifier and Type Class Description classAssertEqualVerifies that actual rows are equal to expected rows.classAssertEqualHashVerifies that actual rows are equal to expected rows without regard to order.classAssertEqualTypesAsserts that two input flows have identical types.classAssertMetadataAssert that the metadata on the input port is set correctly.classAssertPredicateAssert that the given predicate is true for all input values.classAssertRowCountVerifies that the input flow contains the specified row count.classAssertSortedVerifies that the input data is sorted by the given set of keys. -
Uses of OperatorDescription in com.pervasive.datarush.operators.group
Classes in com.pervasive.datarush.operators.group with annotations of type OperatorDescription Modifier and Type Class Description classGroupPerforms grouping (aggregation) of sorted input data.classRemoveDuplicatesRemoves duplicate rows based on a specified set of group keys. -
Uses of OperatorDescription in com.pervasive.datarush.operators.io
Classes in com.pervasive.datarush.operators.io with annotations of type OperatorDescription Modifier and Type Class Description classReadSourceReads a data source as a stream of records.classWriteSinkWrites a stream of records to a data sink. -
Uses of OperatorDescription in com.pervasive.datarush.operators.io.avro
Classes in com.pervasive.datarush.operators.io.avro with annotations of type OperatorDescription Modifier and Type Class Description classReadAvroReads data previously written using Apache Avro format.classWriteAvroWrites data using Apache Avro format. -
Uses of OperatorDescription in com.pervasive.datarush.operators.io.binary
Classes in com.pervasive.datarush.operators.io.binary with annotations of type OperatorDescription Modifier and Type Class Description classBinaryWriterWrites raw binary data to a filesystem. -
Uses of OperatorDescription in com.pervasive.datarush.operators.io.jdbc
Classes in com.pervasive.datarush.operators.io.jdbc with annotations of type OperatorDescription Modifier and Type Class Description classDeleteFromJDBCThis operator deletes data in the target table in a database by applying SQL delete statements.classReadFromJDBCTheReadFromJDBCoperator is used to access relational database systems using a supplied JDBC driver.classUpdateInJDBCThis operator updates the target table in a database by applying SQL update statements.classWriteToJDBCIn its simplest form, writes records from an input port to a JDBC target table using insert statements. -
Uses of OperatorDescription in com.pervasive.datarush.operators.io.mdf
Classes in com.pervasive.datarush.operators.io.mdf with annotations of type OperatorDescription Modifier and Type Class Description classReadMDFReads data previously written using MDF format. -
Uses of OperatorDescription in com.pervasive.datarush.operators.io.staging
Classes in com.pervasive.datarush.operators.io.staging with annotations of type OperatorDescription Modifier and Type Class Description classForceRecordStagingForces staging of record ports.classReadStagingDatasetReads a sequence of records previously staged to disk.classWriteStagingDatasetWrites a sequence of records to disk in an internal format for staged data. -
Uses of OperatorDescription in com.pervasive.datarush.operators.io.textfile
Classes in com.pervasive.datarush.operators.io.textfile with annotations of type OperatorDescription Modifier and Type Class Description classParseTextFieldsParses input text records according to a specified text schema.classReadARFFRead files in the Attribute-Relation File Format (ARFF).classReadDelimitedTextReads a text file of delimited records as record tokens.classReadFixedTextReads a text file of fixed-width records as record tokens.classReadJSONThe ReadJSON operator reads a JSON file of key-value pairs or array of objects as record tokens.classReadLogReads a log file as record tokens.classWriteARFFWrite files using the Attribute-Relation File Format (ARFF).classWriteDelimitedTextWrites a stream of records as delimited text.classWriteFixedTextWrites a record dataflow as a text file of fixed-width records. -
Uses of OperatorDescription in com.pervasive.datarush.operators.io.vectorwise
Classes in com.pervasive.datarush.operators.io.vectorwise with annotations of type OperatorDescription Modifier and Type Class Description classLoadActianVectorBulk load data into the Actian Vector database.classLoadVectorOnHadoopDeprecated.this operator has been replaced withLoadActianVector; use that operator instead. -
Uses of OperatorDescription in com.pervasive.datarush.operators.io.vectorwise.dl
Classes in com.pervasive.datarush.operators.io.vectorwise.dl with annotations of type OperatorDescription Modifier and Type Class Description classLoadVectorOnHadoopDirectDeprecated.this operator has been replaced withLoadActianVector; use that operator instead. -
Uses of OperatorDescription in com.pervasive.datarush.operators.join
Classes in com.pervasive.datarush.operators.join with annotations of type OperatorDescription Modifier and Type Class Description classCrossJoinProduce the cartesian product of two sets of records.classFilterExistingRowsFilters records on the left based on the presence of matching records on the right.classJoinPerforms a relational equi-join on two input datasets by a specified set of keys.classSemiJoinDeprecated.this operator has been replaced withFilterExistingRows; use that operator instead, linking to the appropriate output port. -
Uses of OperatorDescription in com.pervasive.datarush.operators.model
Classes in com.pervasive.datarush.operators.model with annotations of type OperatorDescription Modifier and Type Class Description classGetModel<T>Provides a way to update an in-memory reference to a model object.classPutModel<T>Provides a way to inject an in-memory reference to a model object into a graph. -
Uses of OperatorDescription in com.pervasive.datarush.operators.partition
Classes in com.pervasive.datarush.operators.partition with annotations of type OperatorDescription Modifier and Type Class Description classGatherHintForces parallel streams of data to be gathered into a single non-parallel stream.classPartitionHintForces the input data to be partitioned into parallel streams of data for subsequent parallel operations. -
Uses of OperatorDescription in com.pervasive.datarush.operators.record
Classes in com.pervasive.datarush.operators.record with annotations of type OperatorDescription Modifier and Type Class Description classColumnsToRowsNormalize records by transposing values from row columns into multiple rows.classDeriveFieldsApplies one or more functions to the input record data.classMergeFieldsMerges two streams of data with an equivalent number of rows into one.classRemapFieldsRearranges and renames fields in a record.classRemoveFieldsRemoves a subset of fields from the input records.classRetainFieldsPreserves a subset of fields from the input records.classRowsToColumnsThe RowsToColumns operator is used to pivot data from a narrow representation (rows) into a wider representation (columns).classSelectFieldsPreserves a subset of fields from the input records. -
Uses of OperatorDescription in com.pervasive.datarush.operators.scripting
Classes in com.pervasive.datarush.operators.scripting with annotations of type OperatorDescription Modifier and Type Class Description classRunScriptProcesses rows using user-defined scripts. -
Uses of OperatorDescription in com.pervasive.datarush.operators.select
Classes in com.pervasive.datarush.operators.select with annotations of type OperatorDescription Modifier and Type Class Description classFilterRowsFilters records based on a specified predicate.classLimitRowsTruncates a flow to a fixed number of records.classSampleRandomRowsApply random sampling to the input data. -
Uses of OperatorDescription in com.pervasive.datarush.operators.sink
Classes in com.pervasive.datarush.operators.sink with annotations of type OperatorDescription Modifier and Type Class Description classCollectRecordsCollects input data into an in-memory token list.classLogRowsLog information about the input data from a flow. -
Uses of OperatorDescription in com.pervasive.datarush.operators.sort
Classes in com.pervasive.datarush.operators.sort with annotations of type OperatorDescription Modifier and Type Class Description classSortSorts the input data. -
Uses of OperatorDescription in com.pervasive.datarush.operators.source
Classes in com.pervasive.datarush.operators.source with annotations of type OperatorDescription Modifier and Type Class Description classEmitRecordsEmits an in-memory token list as output.classGenerateArithmeticSequenceGenerates a sequence of numerical values, with a constant difference between consecutive values.classGenerateConstantGenerates copies of a constant value.classGenerateRandomGenerates rows of random data. -
Uses of OperatorDescription in com.pervasive.datarush.operators.string
Classes in com.pervasive.datarush.operators.string with annotations of type OperatorDescription Modifier and Type Class Description classSplitFieldSplits a string field into multiple fields, based on a specified pattern.
-