Package | Description |
---|---|
com.actian.dataflow.operators.io.orc | |
com.actian.dataflow.operators.io.parquet | |
com.pervasive.datarush.analytics.arm | Provides common classes for Association Rule Mining (ARM). |
com.pervasive.datarush.analytics.arm.fpgrowth | Provides the operator to perform the FP-growth ARM algorithm. |
com.pervasive.datarush.analytics.cleansing | Provides operators related to data cleansing. |
com.pervasive.datarush.analytics.knn | Provides an implementation of the KNN algorithm using DataRush's sparse data API. |
com.pervasive.datarush.analytics.naivebayes.learner | Provides an implementation of the Naive Bayes learner. |
com.pervasive.datarush.analytics.pmml | Provides shared and base classes for PMML model representation of Analytics algorithms. |
com.pervasive.datarush.analytics.r | |
com.pervasive.datarush.analytics.regression | Provides utility, PMML and other classes for shared use by regression-related entities. |
com.pervasive.datarush.analytics.stats | Provides various statistics, Data Summarizer, and Data Quality Analyzer. |
com.pervasive.datarush.analytics.viz | Provides operators for classifier performance visualization. |
com.pervasive.datarush.hbase | |
com.pervasive.datarush.matching | Provides operators for discovering duplicates or links between records. |
com.pervasive.datarush.matching.block | Provides operators for generating possible candidate pairs. |
com.pervasive.datarush.matching.cluster | Provides operators for clustering the results of duplicate or linkage discovery. |
com.pervasive.datarush.matching.tune | Provides operators for analyzing data for approximate matching. |
com.pervasive.datarush.operators | Provides classes and interfaces for developing dataflow operators. |
com.pervasive.datarush.operators.assertion | Provides operators for making assertions on flows and files. |
com.pervasive.datarush.operators.group | Provides data aggregation components. |
com.pervasive.datarush.operators.io | Provides base file I/O components including encoders and decoders. |
com.pervasive.datarush.operators.io.avro | Provides operators for reading and writing files in Avro format. |
com.pervasive.datarush.operators.io.binary | |
com.pervasive.datarush.operators.io.jdbc | Provides operators for reading from JDBC sources and writing to JDBC targets. |
com.pervasive.datarush.operators.io.mdf | |
com.pervasive.datarush.operators.io.paraccel | |
com.pervasive.datarush.operators.io.staging | Provides operators for reading and writing DataRush staging datasets. |
com.pervasive.datarush.operators.io.textfile | Provides operators for reading and writing text data. |
com.pervasive.datarush.operators.io.vectorwise | |
com.pervasive.datarush.operators.io.vectorwise.dl | |
com.pervasive.datarush.operators.join | Provides operators for joining two data sets into a single one. |
com.pervasive.datarush.operators.partition | Provides operators for partitioning and unpartitioning flows of data. |
com.pervasive.datarush.operators.record | Provides operators for manipulating record structure. |
com.pervasive.datarush.operators.select | Provides operators for selecting a subset of the data set. |
Modifier and Type | Class and Description |
---|---|
class | ReadORC |
class | WriteORC: Write data in the Apache Hive ORC format. |
Modifier and Type | Class and Description |
---|---|
class | ReadParquet: Reads data previously written using Apache Parquet format by Apache Hive. |
Modifier and Type | Class and Description |
---|---|
class | FrequentItems: Compute the frequent items within the given transactions. |
Modifier and Type | Class and Description |
---|---|
class | FPGrowth: An operator that implements the FP-growth algorithm, outputting a PMML model containing generated item sets and association rules. |
Modifier and Type | Class and Description |
---|---|
class | ReplaceMissingValues: Replace missing values in the input data according to the given replacement specifications. |
Modifier and Type | Class and Description |
---|---|
class | KNNClassifier: Applies the K-nearest neighbor algorithm to classify input data against an already classified set of example data. |
Modifier and Type | Class and Description |
---|---|
class | NaiveBayesLearner: Operator responsible for building a Naive Bayes PMML model from input data. |
Modifier and Type | Class and Description |
---|---|
class | ReadPMML: Reads a PMML model object from a file. |
Modifier and Type | Class and Description |
---|---|
class | RunRScript: Execute an R script in flow. |
Modifier and Type | Class and Description |
---|---|
class | LogisticRegressionLearner: Fits a multinomial logistic regression model to the training data. |
Modifier and Type | Class and Description |
---|---|
class | DataQualityAnalyzer: Evaluates a set of quality tests on an input dataset. |
class | DistinctValues: Calculates distinct values of the given input field. |
class | NormalizeValues: Apply normalization methods to fields within an input data flow. |
class | SummaryStatistics: Discovers various metrics of an input dataset, based on the configured detail level. |
Modifier and Type | Class and Description |
---|---|
class | DrawDiagnosticsChart: This operator takes the output of one or multiple predictors and uses the confidence values produced by these predictors along with the actual target values ("true class") to produce diagnostic charts. |
Modifier and Type | Class and Description |
---|---|
class | DeleteHBase: Write delete markers to HBase. |
class | KeyOperator: Specifies key field mapping when accessing HBase. |
class | KeyValueOperator: Specifies data field mapping when accessing HBase. |
class | ReadHBase: Read a result set from HBase. |
class | WriteHBase: Write a result set to HBase. |
Modifier and Type | Class and Description |
---|---|
class | DiscoverDuplicates: Discover duplicate records within a single source using fuzzy matching operators. |
class | DiscoverLinks: Use fuzzy matching operators to discover linked records from two data sources. |
Modifier and Type | Class and Description |
---|---|
class | BlockCartesian: Block the input data by creating a cartesian product of the data. |
class | BlockRecords: Block records into groups of like records based on a set of key fields and generate record pairs from these groups. |
class | BlockSelf: Block records from a single source into groups of like records based on a set of key fields and generate record pairs from these groups. |
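
The blocking descriptions above (group like records by key fields, then generate record pairs within each group) can be illustrated with a minimal plain-Java sketch. It does not use the DataRush API; the class `BlockingSketch`, the method `selfBlockPairs`, and the single-key `String[]` record layout are hypothetical names chosen for illustration only.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical illustration; not part of the DataRush library.
public class BlockingSketch {

    /**
     * Groups records by the value found at keyIndex, then emits every
     * unordered pair of record indexes that fall in the same group.
     */
    public static List<int[]> selfBlockPairs(List<String[]> records, int keyIndex) {
        // Key value -> indexes of the records sharing it, in input order.
        Map<String, List<Integer>> blocks = new LinkedHashMap<>();
        for (int i = 0; i < records.size(); i++) {
            String key = records.get(i)[keyIndex];
            blocks.computeIfAbsent(key, k -> new ArrayList<>()).add(i);
        }

        // Pair up records only within a block, never across blocks.
        List<int[]> pairs = new ArrayList<>();
        for (List<Integer> block : blocks.values()) {
            for (int a = 0; a < block.size(); a++) {
                for (int b = a + 1; b < block.size(); b++) {
                    pairs.add(new int[] {block.get(a), block.get(b)});
                }
            }
        }
        return pairs;
    }
}
```

With four records of which three share a key, this sketch yields three candidate pairs, where a full cartesian pairing of the same source would yield six.
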
Modifier and Type | Class and Description |
---|---|
class | ClusterDuplicates: Transform record pairs into clusters of like records, where the two sides of the pair are from the same source. |
class | ClusterLinks: Transform record pairs into clusters of like records. |
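
One common way to turn record pairs into clusters is to treat each pair as an edge and collect connected components with a union-find structure. The sketch below assumes that approach; this page does not state how the clustering operators are actually implemented. `PairClusteringSketch` and `cluster` are hypothetical names, and records are identified by integer index for simplicity.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical illustration; not part of the DataRush library.
public class PairClusteringSketch {

    // Finds the representative of x, shortening the path as it goes.
    private static int find(int[] parent, int x) {
        while (parent[x] != x) {
            parent[x] = parent[parent[x]];
            x = parent[x];
        }
        return x;
    }

    /** Merges records linked by any pair and returns the resulting clusters. */
    public static List<List<Integer>> cluster(int recordCount, int[][] pairs) {
        int[] parent = new int[recordCount];
        for (int i = 0; i < recordCount; i++) {
            parent[i] = i;
        }
        for (int[] pair : pairs) {
            int a = find(parent, pair[0]);
            int b = find(parent, pair[1]);
            if (a != b) {
                // Keep the smaller index as the representative.
                parent[Math.max(a, b)] = Math.min(a, b);
            }
        }
        // Representative -> members, ordered by representative index.
        Map<Integer, List<Integer>> byRoot = new TreeMap<>();
        for (int i = 0; i < recordCount; i++) {
            byRoot.computeIfAbsent(find(parent, i), k -> new ArrayList<>()).add(i);
        }
        return new ArrayList<>(byRoot.values());
    }
}
```

For five records and the pairs (0, 1) and (1, 3), the sketch groups records 0, 1 and 3 together and leaves 2 and 4 as singleton clusters.
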
Modifier and Type | Class and Description |
---|---|
class | AnalyzeDuplicateKeys: Provides an analysis of the quality of a set of blocking keys over data to be deduplicated. |
class | AnalyzeLinkKeys: Provides an analysis of the quality of a set of blocking keys over two data sets to be linked. |
Modifier and Type | Class and Description |
---|---|
class | AbstractRecordCompositeOperator: Convenient base class for CompositeOperators that are also RecordPipelineOperators. |
class | OpenComposite: A composite operator that can be externally composed. |
class | OpenModelSink<T>: A model sink that can be externally composed. |
class | OpenModelSource<T>: A model source that can be externally composed. |
class | OpenMultiModelSink<T>: A model sink that can be externally composed. |
class | OpenMultiModelSource<T>: A model source that can be externally composed. |
class | OpenMultiRecordSink: A record sink that can be externally composed. |
class | OpenMultiRecordSource: A record source that can be externally composed. |
class | OpenRecordSink: A record sink that can be externally composed. |
class | OpenRecordSource: A record source that can be externally composed. |
Modifier and Type | Method and Description |
---|---|
CompositeOperator | OperatorProxies.CompositeOperatorProxy.getTarget() |
Modifier and Type | Method and Description |
---|---|
static OperatorProxies.CompositeOperatorProxy | OperatorProxies.proxy(CompositeOperator op): Public only as a matter of implementation; not intended for external use. Creates a proxy to the given operator that allows the framework to access its protected methods. |
Modifier and Type | Class and Description |
---|---|
class | AssertEqualHash: Verifies that actual rows are equal to expected rows without regard to order. |
class | AssertEqualTypes: Asserts that two input flows have identical types. |
class | AssertRowCount: Verifies that the input flow contains the specified row count. |
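
Comparing rows "without regard to order" can be done by combining per-row hashes with a commutative operation such as addition. The sketch below shows that idea in plain Java; it is an assumption about the general technique, not a description of how AssertEqualHash computes its result. `UnorderedRowCompareSketch`, `unorderedHash`, and `probablyEqual` are hypothetical names.

```java
import java.util.Arrays;
import java.util.List;

// Hypothetical illustration; not part of the DataRush library.
public class UnorderedRowCompareSketch {

    /** Sums per-row hashes; addition is commutative, so row order is irrelevant. */
    public static long unorderedHash(List<Object[]> rows) {
        long sum = 0;
        for (Object[] row : rows) {
            sum += Arrays.hashCode(row);
        }
        return sum;
    }

    /**
     * Returns true when both inputs have the same row count and the same
     * combined hash. Hash collisions are possible, so a true result is
     * strong evidence of equality rather than proof.
     */
    public static boolean probablyEqual(List<Object[]> expected, List<Object[]> actual) {
        return expected.size() == actual.size()
                && unorderedHash(expected) == unorderedHash(actual);
    }
}
```

The trade-off in this design is memory versus certainty: only a running sum is kept per input, at the cost of a small chance that different row sets hash to the same value.
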
Modifier and Type | Class and Description |
---|---|
class | Group: Performs grouping (aggregation) of sorted input data. |
class | RemoveDuplicates: Removes duplicate rows based on a specified set of group keys. |
Modifier and Type | Class and Description |
---|---|
class | AbstractReader: A generic reader of byte data representing a stream of records. |
class | AbstractWriter: A generic writer of byte data representing a stream of records. |
class | ReadSource: Reads a data source as a stream of records. |
Modifier and Type | Class and Description |
---|---|
class | ReadAvro: Reads data previously written using Apache Avro format. |
class | WriteAvro: Writes data using Apache Avro format. |
Modifier and Type | Class and Description |
---|---|
class | BinaryWriter: Writes raw binary data to a filesystem. |
Modifier and Type | Class and Description |
---|---|
class | AbstractWriteToJDBC: Abstract implementation of a JDBC writer. |
class | DeleteFromJDBC: This operator deletes data in the target table in a database by applying SQL delete statements. |
class | JDBCOperator: Common base class for ReadFromJDBC and AbstractWriteToJDBC. |
class | ReadFromJDBC: The ReadFromJDBC operator is used to access relational database systems using a supplied JDBC driver. |
class | UpdateInJDBC: This operator updates the target table in a database by applying SQL update statements. |
class | WriteToJDBC: In its simplest form, writes records from an input port to a JDBC target table using insert statements. |
Modifier and Type | Class and Description |
---|---|
class | ReadMDF: Reads data previously written using MDF format. |
Modifier and Type | Class and Description |
---|---|
class | LoadMatrix: Load data from the input data stream into an Actian Matrix database table. |
class | MatrixOperatorBase<ExecState> |
class | MatrixSink: Load data from the input data stream into Actian Matrix. |
Modifier and Type | Class and Description |
---|---|
class | ReadStagingDataset: Reads a sequence of records previously staged to disk. |
class | WriteStagingDataset: Writes a sequence of records to disk in an internal format for staged data. |
Modifier and Type | Class and Description |
---|---|
class | AbstractTextReader: A generic reader of text data representing a stream of records. |
class | AbstractTextWriter: A generic writer of text data representing a stream of records. |
class | ReadARFF: Read files in the Attribute-Relation File Format (ARFF). |
class | ReadDelimitedText: Reads a text file of delimited records as record tokens. |
class | ReadFixedText: Reads a text file of fixed-width records as record tokens. |
class | ReadJSON: The ReadJSON operator reads a JSON file of key-value pairs or array of objects as record tokens. |
class | ReadLog: Reads a log file as record tokens. |
class | WriteARFF: Write files using the Attribute-Relation File Format (ARFF). |
class | WriteDelimitedText: Writes a stream of records as delimited text. |
class | WriteFixedText: Writes a record dataflow as a text file of fixed-width records. |
Modifier and Type | Class and Description |
---|---|
class | LoadActianVector: Bulk load data into the Actian Vector database. |
class | LoadVectorOnHadoop: Deprecated. This operator has been replaced with LoadActianVector; use that operator instead. |
Modifier and Type | Class and Description |
---|---|
class | LoadVectorOnHadoopDirect: Deprecated. This operator has been replaced with LoadActianVector; use that operator instead. |
Modifier and Type | Class and Description |
---|---|
class | AbstractRelationalJoin: Common base class for the various types of join that we support. |
class | FilterExistingRows: Filters records on the left based on the presence of matching records on the right. |
class | Join: Performs a relational equi-join on two input datasets by a specified set of keys. |
class | SemiJoin: Deprecated. This operator has been replaced with FilterExistingRows; use that operator instead, linking to the appropriate output port. |
Modifier and Type | Class and Description |
---|---|
class | Randomize: Operator that re-orders its input in a random fashion. |
Modifier and Type | Class and Description |
---|---|
class | DiscoverDomain: Utility operator for discovering the domain of string fields. |
class | RowsToColumns: The RowsToColumns operator is used to pivot data from a narrow representation (rows) into a wider representation (columns). |
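
The narrow-to-wide pivot described for RowsToColumns can be illustrated with a minimal plain-Java sketch. It does not use the DataRush API and makes assumptions of its own: each narrow row is a hypothetical `{rowKey, columnName, value}` triple, and a later value silently overwrites an earlier one for the same key and column. `PivotSketch` and `pivot` are hypothetical names.

```java
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical illustration; not part of the DataRush library.
public class PivotSketch {

    /**
     * Turns narrow {rowKey, columnName, value} triples into one wide
     * entry per rowKey, mapping each column name to its value.
     */
    public static Map<String, Map<String, String>> pivot(List<String[]> narrowRows) {
        Map<String, Map<String, String>> wide = new LinkedHashMap<>();
        for (String[] row : narrowRows) {
            wide.computeIfAbsent(row[0], k -> new LinkedHashMap<>())
                .put(row[1], row[2]);
        }
        return wide;
    }
}
```

In this sketch, a row key that never appears with a given column name simply has no entry for that column in its wide record.
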
Modifier and Type | Class and Description |
---|---|
class | LimitRows: Truncates a flow to a fixed number of records. |
class | SampleRandomRows: Apply random sampling to the input data. |
Copyright © 2024 Actian Corporation. All rights reserved.