All Classes and Interfaces

Class
Description
Checks equivalence based on the absolute error between two floating point numbers.
 
 
 
 
Base class for DataRush operator and application unit tests.
Convenient base class for DeferredCompositeOperator's that are also RecordPipelineOperator's.
A base class for simple record-to-record operators, providing common functions.
Convenient base class that should be extended by all implementations of IntervalTimerSupplier
Base class for all LogicalOperators.
Common base class for all types of model ports.
Base class for model port factories.
Base class for model metadata.
 
Base implementation of EncoderFactory for customizing the encoding/decoding of ObjectTokenType.
 
A representation of features found in PMML regressionModel elements that do not depend on its attributes.
 
The possible values of the ModelType attribute of a PMML regressionModel.
The normalization methods used.
Capture information about numeric predictor values.
 
Common base class for "predictor" processes.
A generic reader of byte data representing a stream of records.
Convenient base class for CompositeOperator's that are also RecordPipelineOperator's.
Skeleton implementation of a formatter for writing records to a sink.
For internal use only.
Skeleton implementation of a regular expression based LogFormat class.
Common base class for the various types of join that we support.
Base class for scalar encoders.
A TokenSequence containing scalar typed tokens.
 
A generic reader of text data representing a stream of records.
A generic writer of text data representing a stream of records.
Base class for TokenEncoders
Base class for TokenEncoders.
A generic writer of byte data representing a stream of records.
Abstract implementation of a JDBC writer.
 
 
 
Basic mutable implementation of AccumulatedTimerSupplier.
Definition associated with a IntervalTimerSupplier.
A snapshot that reflects a point-in-time of a IntervalTimerSupplier.
Abstract statistic that represents "wall-clock" time of some runtime activity.
Provides implementations of action functions.
A representation of an aggregation on a specific field in a record.
Interface for aggregation functions computing distance/similarity of two vectors according to the PMML specification.
This enumeration contains implementations of AggregationFunctions that correspond to the aggregation functions for clustering models in the PMML specification.
Interface for evaluators of AggregationFunctions.
Interface for factories that create AggregationFunctionEvaluators for AggregationFunctions.
A provider of registered classes for the AggregationRegistry.
Stores a list of aggregations registered by the AggregationProvider, and provides a method for calling those aggregations.
Interface to be implemented by implementations of aggregations; consumers of the API should generally not need to implement this interface.
Class responsible for creating aggregators.
An alias set consists of a set of name aliases and optionally a token order.
 
Type registry of analytics.
Provides an analysis of the quality of a set of blocking keys over data to be deduplicated.
Provides an analysis of the quality of a set of blocking keys over two data sets to be linked.
An encoder factory that works with annotations placed on class methods that mark the methods for getting an encoder and decoder for the class.
An application specific resource that is required at run-time on the target execution cluster.
The resource type.
Visibility of the resource.
An analyzer for ARFF files.
Contains the results of analyzing an ARFF file.
Describes an instance of ARFF data.
Defines valid representation types for ARFF instance data.
An ArgumentConverter is used by the DefaultFunctionInvoker to convert ScalarValuedFunction arguments to the correct type.
Provides implementations of common arithmetic functions.
Thrown when two array elements differ.
Verifies that actual rows are equal to expected rows.
Verifies that actual rows are equal to expected rows without regard to order.
Assert that the given type and the type of the input port are equal.
Asserts that two input flows have identical types.
A collection of methods providing assertions useful in tests.
A filter used to determine whether line mismatches are significant.
Assert that the metadata on the input port is set correctly.
Assert that the given predicate is true for all input values.
Verifies that the input flow contains the specified row count.
Verifies that the input data is sorted by the given set of keys.
An association rule defining the strength of a relationship between two item sets.
A session qualifier to provide uniqueness of session attributes.
Describes the format of a file using Apache Avro format.
Represents metadata associated with an Avro formatted file.
Utilities for working with Avro schemas.
 
Public only as a matter of implementation; for internal use only.
 
 
 
 
Describes a range of bytes from an Azure Blob.
 
Provides services to access Azure Blob storage as a file system.
 
Used to redistribute the data in a roughly balanced fashion.
A data source identified by a Path.
A data source identified by a Path.
Basic mutable implementation of CounterSupplier.
 
 
Subclass of DistributedExecutionException that contains all the exceptions that occurred in one batch.
Object representation of the PMML ContStats element.
Miscellaneous BigInteger and BigDecimal utilities
Corresponds to the PMML NumericInfo element.
A buffer for building variable-length binary valued data.
Utilities for working with binary data.
 
A binary valued field of a record input port.
A binary valued field of a record output port.
Provides extended data access methods on binary data flows.
A mutable BinaryValued object.
A TokenSettable object containing a binary value.
An immutable BinaryValued object.
Primary implementation of MutableBinaryTokenSequence.
A TokenSequence containing binary valued data.
A TokenValued object containing a binary value.
A ScalarValuedIterator over binary values.
Writes raw binary data to a filesystem.
Listener for detecting the assignment of names to objects in a namespace.
Object representation of the PMML NumericInfo extension element.
Object representation of the PMML BivariateStats extension element.
Block the input data by creating a cartesian product of the data.
Block records into groups of like records based on a set of key fields and generate record pairs from these groups.
Block records from a single source into groups of like records based on a set of key fields and generate record pairs from these groups.
A formatter representing boolean token values as user-specified text values.
A boolean valued field of a record input port.
A boolean valued field of a record output port.
A parser producing boolean token values from a user-specified mapping of text values.
A mutable BooleanValued object.
A TokenSettable object containing a boolean value.
An immutable BooleanValued object.
Primary implementation of MutableBooleanTokenSequence.
A TokenSequence containing boolean valued data.
A TokenValued object containing a boolean value.
A ScalarValuedIterator over boolean values.
An error indicating premature termination of a data flow.
Provides access to built-in data streams.
An abstract sink for bytes.
An abstract source of bytes.
An input stream that decompresses from the BZip2 format to be read as any other stream.
 
Constants for both the compress and decompress BZip2 classes.
Defines support for the bzip2 compression algorithm.
Enumeration of possible caches for Element's.
Calculates the n-gram frequencies for a tokenized text field.
Calculates the word frequencies for a tokenized text field.
A specific date on the Gregorian calendar.
A formatter representing character token values as single-character strings.
A character valued field of a record input port.
A character valued field of a record output port.
A parser producing character token values from single-character strings.
A mutable CharValued object.
Describes the encoding format of character data.
A TokenSettable object containing a character value.
An immutable CharValued object.
Primary implementation of MutableCharTokenSequence.
A TokenSequence containing character valued data.
 
A TokenValued object containing a character value.
A ScalarValuedIterator over character values.
Determines how to classify a query record based on its nearest neighbors.
Resolves the class of a query record from a set of neighbors.
Representation of the distance between two points.
Describes which classification methodology to use.
Defines the types of classifiers supported by matching.
 
The classpath specifies for a job.
An element of a classpath.
Utility that should be used for all json-related class serialization.
An "non-encrypting" encryption provider.
Describes the format of a web server log in NCSA Common log format.
A record iterator which has resources which need to be released when being disposed.
A TokenAppendable which has resources which need to be released when being disposed.
Corresponds to the Cluster PMML element.
Builder for Cluster instances.
Transform record pairs into clusters of like records, where the two sides of the pair are from the same source.
Corresponds to the ClusteringField PMML element.
Builder for ClusteringField instances.
Transform record pairs into clusters of like records.
Assigns input data to clusters based on the provided PMML Clustering Model.
Extension point to register cluster providers for new schemes.
Facade to lookup providers from META-INF/services.
A "URL-like" entity that identifies a cluster.
 
 
Register coercers used by the DataRush library.
 
Miscellaneous collection utilities.
Collects input data into an in-memory token list.
Handler for name collisions within a namespace.
Utility class to select a number of column fields based on field name and type criteria.
Normalize records by transposing values from row columns into multiple rows.
Describes the format of a web server log in NCSA Combined log format.
Interface defining methods used to run an executable outside of the JVM.
A decoder for byte-wise comparable binary encodings.
Converts token values into a byte-wise comparable binary encoding.
Values of this enumeration correspond to the compare functions for clustering models in the PMML specification.
A configuration of a field comparison for matching.
Corresponds to the ComparisonMeasure PMML element.
Builder for ComparisonMeasure instances.
Values of this enumeration correspond to the comparison measure kinds for ComparisonMeasures in clustering models in the PMML specification.
The type of field comparisons supported by the matching operators.
Enumeration of compilation levels.
 
To be implemented by operators that can be defined by chaining together other operations.
A record sequence comprised of scalar sequences for fields.
A composite value comprised of scalar valued fields.
Composition context used by CompositeOperator's.
Interface to be implemented by iterative operators.
Corresponds to the CompoundPredicate PMML element.
Enumeration of allowed values for the "booleanOperator" attribute.
Describes a range of bytes from a compressed file.
Describes the compression scheme to use when writing data in Apache Avro format.
Provides support for a compression format.
A factory for locating registered compression formats.
 
A data source which concatenates multiple sources.
Provides implementations of common conditional evaluation functions.
An element of credential information.
 
Factory for creating connection objects.
 
Basic JDBC Connection settings.
An abstract argument converter for converting String constants to the correct object type.

Child classes should implement the convert(String) object to produce an object.
Corresponds to the True and False PMML elements.
A reference to a known value.
Build a container context based on the contents of the classpath specifier.
Service to gather run time statistics for the application master and worker containers.
Service implementation that collects run time stats of the application master and all of the containers launched by the application master.
PMML Input Mapping for continuous ( numerical ) data.
Object representation of the PMML ContStats element.
 
Provides implementations of common type conversion functions.
Supported target conversion types (from PMML).
An operator that converts an association model in PMML into a target format.
Converts the case on a TokenizedText field.
 
Registers core classes containing Aggregation methods.
Registers core classes containing ScalarValuedFunction methods.
Public only as a matter of implementation; for internal use only.
The count aggregator to use
Definition associated with a CounterSupplier.
A snapshot that reflects a point-in-time of a CounterSupplier.
Abstract statistic that represents a count of some runtime activity.
Keeps track of the number of bytes written to the underlying stream
Determines which range each value in a field falls within and counts the totals.
Counts the number of tokens in a tokenized text field.
Produce the cartesian product of two sets of records.
Defines the parameters for one instance of a parameterized query that is run in parallel.
DataDistribution is the component of RecordMetadata that describes how the data is distributed.
 
Describes the record format of external data, such as in a file.
A formatter for converting record data to binary or text format.
A parser for record data in binary or text format.
Unchecked exception thrown to indicate that an input flow contains rows that are out of order.
DataOrdering is the component of RecordMetadata that describes how the data is ordered.
Evaluates a set of quality tests on an input dataset.
A quality test consists of a test name (used to reference the test in the statistics) plus a boolean predicate.
A mutable TokenValued object.
Controls the representation of a dataset.
The default value that is assumed for unspecified values.
A utility class providing convenience methods for asserting that text files are identical.
Utility for writing a data-set to be read by ReadStagingDataset outside of an engine context.
Specifies serialization format versions of staging data sets.
Utility for reading a data-set written by WriteStagingDataset outside of an engine context.
The metadata describing a staging dataset.
Can be used to provide tuning parameters that control how iterative operators stage their data.
A description of the the storage format of a data set.
Describes a range of bytes from a data source.
An immutable TokenValued object.
Deprecated.
since 6.1; use DateTimeUtils instead.
 
A formatter representing date token values with a user-defined text format.
A date valued field of a record input port.
A date valued field of a record output port.
A parser producing date token values from a user-defined text format.
A mutable DateValued object.
A TokenSettable object containing a date value.
Provides implementations of common date/time functions.
Definitions of date and time values that can be extracted from date and timestamp types.
The desired granularity of the output.
Describes the formatting of a string representing a date/time value.
Utilities for working with date/time values.
Converts an argument to a DateTimeValue enum value.
An immutable DateValued object.
Deprecated.
since 6.1
Primary implementation of MutableDateTokenSequence.
A TokenSequence containing date valued data.
A TokenValued object containing a date value.
A ScalarValuedIterator over date values.
 
Operator responsible for constructing a Decision Tree.
Operator responsible for predicting outcomes based on a Decision Tree PMML model.
Performs pruning of the provided input model.
Controls verbose decision tree logging
An annotation for a static method of a class marking the method as the getter of a TokenDecoder for the class.
The default argument converter.
Default implementation of ConnectionFactory that uses the JDBC DriverManager for creating connections.
A formatter representing double token values with the default Java text format.
A parser producing double token values from the default Java text format.
 
A parser producing duration token values from duration strings expressed in ISO 8601 format.
The default encryption provider.
A formatter representing float token values with the default Java text format.
A parser producing float token values from the default Java text format.
A default function invoker, which passes arguments directly to the method using an ArgumentConverter.

Arguments may specify a converter using the FunctionArgument annotation.
A formatter representing int token values with the default Java text format.
A parser producing int token values from the default Java text format.
A formatter representing long token values with the default Java text format.
A parser producing long token values from the default Java text format.
Provides a change number sequence which is manually managed.
A formatter representing money token values with the default text format.
 
A formatter representing numeric token values with the default Java text format.
A parser producing numeric token values from the default Java text format.
 
A parser producing period token values from period strings expressed in ISO 8601 format.
For rare use cases; implementors should use CompositeOperator whenever possible!
This operator deletes data in the target table in a database by applying SQL delete statements.
Write delete markers to HBase
 
An analyzer for files containing delimited text.
Contains the results of an analysis of a delimited text file.
Describes the format of a delimited text file.
Applies one or more functions to the input record data.
Controls the detail level of the statistics calculated by SummaryStatistics.
Filters a tokenized text field using a dictionary.
A filter for selecting paths.
Utility operator for discovering the domain of string fields.
Discover duplicate records within a single source using fuzzy matching operators.
Use fuzzy matching operators to discover linked records from two data sources.
InputMappingModel for discrete ( categorical ) data.
Object representation of the PMML DiscrStats element.
Enum of distance measure that can be used to build a k-means model.
Calculates distinct values of the given input field.
A distributed equivalent of a Callable.
A distributed equivalent of a Callable.
Convenient base class for DistributedCallable's that do not support monitoring/partial results.
Convenient base class for DistributedCallableBatch's that do not support monitoring/partial results.
An ExecutionException in a distributed context.
Distributed equivalent of the ExecutorService concept.
Used to coordinate processing between operators that have a processing dependency but no explicit data dependency.
A formatter representing double token values with a user-defined text format.
A double valued field of a record input port.
A double valued field of a record output port.
A parser producing double token values from a user-defined text format.
A mutable DoubleValued object.
A TokenSettable object containing a double value.
An immutable DoubleValued object.
Primary implementation of MutableDoubleTokenSequence.
A TokenSequence containing double valued data.
A TokenValued object containing a double value.
A ScalarValuedIterator over double values.
This operator takes the output of one or multiple predictors and uses the confidence values produced by these predictors along with the actual target values ("true class") to produce diagnostic charts.
Base for exceptions thrown by DataRush.
Responsible for looking up database drivers by name
Wrapper for errors occurring during the execution of a dataflow graph.
 
For test purposes--sorts inputs and writes to a file
 
A duration valued field of a record input port.
A duration valued field of a record output port.
 
A mutable DurationValued object.
A TokenSettable object containing a duration value.
An immutable DurationValued object.
Primary implementation of MutableDurationTokenSequence.
A TokenSequence containing duration valued data.
A TokenValued object containing a Duration value.
A ScalarValuedIterator over duration values.
Provides enum-like semantics ( ordinal mapping ) for dynamically constructed data.
Builder for constructing new dynamic enums
A value of a given enumeration.
Text data type for converting to/from an integer ordinal mapping
A distribution where data is range-partitioned by a selected array of keys.
An index-based comparator for a TokenSequence.
A format analyzer for Extended Log Format files.
 
Describes the format of a web server log in W3C Extended log format.
Emits an in-memory token list as output.
An annotation for a static method of a class marking the method as the getter of a TokenEncoder for the class.
An interface defining a factory that provides an encoder and decoder for a particular class or set of classes.
A manager of EncryptionProvider's.
A provider of textual encryption.
Exception thrown when an error relating to reaching the end of a flow is encountered.
A collection of engine configuration settings.
Nested class containing settings specific to ports.
Settings in this object determine settings for remote monitoring
Settings in this object determine default tuning for the Sort operator.
Defined configuration properties for the DataRush engine.
Deprecated. 
Deprecated.
Replace with string+domain
An exception indicating end-of-file has been unexpectedly reached on a stream.
The EqualRangeBinning operator can be used to divide a set of numeric data into equal range bins.
The action to take if a SQL error occurs while reading data from the database.
 
A record source sink throws an error either at composition time or execution time.
A record source that throws an error either at composition time or execution time.
An object describing the context in which a subexpression of a larger expression is being evaluated.
A factory used to construct evaluators for ScalarValuedFunctions.
Check equivalence based on an exact match.
A one-to-one mapping of strings to boolean values which does case-sensitive comparisons.
ExecutableOperators are the most commonly used operators.
Miscellaneous execution options.
Miscellaneous utilities related to DistributedExecutorService and its subordinate classes.
Expands text frequency field.
Expands a TokenizedText field.
External data types are those DataRush interacts with in some way but does not support internally in dataflow queues (they are not PortTypes).
Defines an external sink of record data.
Defines an external source of record data.
An element of a RecordTokenType schema.
A utility class for copying a set of input values to output fields.
A container for structural information about a text file containing delimited records.
A collection of structural information about a text file consisting of delimited-field records.
Defines how a field's value in the output is derived from fields in the input.
Contains meta-information about the allowed values, lower, and upper bounds for a given field.
Exception indicating a field already exists in a field namespace such as a record.
Utility class for constructing field iterators associated with a given record type.
Class responsible for creating iterators associated with a given RecordValued object.
Various properties that control the behavior of a field iterator.
A reference to a field in a record schema.
A transformation on a record type.
 
A mapping directive to be applied to a source field.
Unchecked exception thrown to indicate a field in a record schema is of an invalid type for a given context.
Unchecked exception thrown when an attempt is made to compare two fields of incompatible types in a record schema.
An I/O exception indicating the file in question already exists.
Provides access to files and directories.
Generic configuration object which consists of a list of arbitrary named fields.
A FileMetaConfiguration is a grouping of all file configurations related to the client.
An entry in the meta-configuration consisting of a path prefix and those configurations that are to be used for paths starting with the given prefix.
The metadata associated with physical data files.
Describes a range of bytes from a file.
 
Describes the file system identified by a path scheme.
Provides basic operations on paths for a specific path scheme or schemes.
Utility methods for operating on files.
Configuration for a filter used during matching operations.
Filters records on the left based on the presence of matching records on the right.
 
Base class for operators which filter fields in the input records.
Filters records based on a specified predicate.
Filters a tokenized text field.
The types of filters supported for matching operations.
 
An TextDataType with a fixed-size text representation.
Describes the format of a fixed text file.
A typer for functions which return a fixed, known type.
A schema for a text file containing records with different fixed-width formats.
A definition of a fixed-width record in a text file.
A formatter representing float token values with a user-defined text format.
A filter for handling mismatches in floating point values.
Represents allowable error bounds for comparing the equality of two floating point numbers.
A float valued field of a record input port.
A float valued field of a record output port.
A parser producing float token values from a user-defined text format.
A mutable BinaryValued object.
A TokenSettable object containing a float value.
An immutable FloatValued object.
Primary implementation of MutableFloatTokenSequence.
A TokenSequence containing float valued data.
A TokenValued object containing a float value.
A ScalarValuedIterator over float values.
Forces staging of record ports.
Forces the operators on the input and output sides to execute sequentially, instead of concurrently.
A generic format analyzer for log files.
Contains the results of analyzing a file.
A specialized function invoker for format.
Provides implementations of common formatting functions.
A collection of parameters for configuring parsing.
An operator that implements the FP-growth algorithm, outputting a PMML model containing generated items sets and association rules.
An implementation of the FP-growth algorithm based on FPTree.
Compute the frequent items within the given transactions.
Provides access to FTP resources as a file system.
 
 
An operator may set this as their requiredDataDistribution in order to indicate that the data needs to be sent to all nodes in the cluster (or all threads in the case of pseudo-distributed operation).
Indicates that a method returns a ScalarValuedFunction and should be included in the FunctionRegistry or returns an Aggregation and should be included in the AggregationRegistry.
 
 
Sets properties for arguments of functions in the FunctionRegistry.
Information about the context in which a function is evaluated.
A description of a function, including its name, description, and argument names.
An executor of a function.
A FunctionInvoker is used by the FunctionRegistry to invoke a function method and by the AggregationInvoker to invoke an aggregation method.

The implementing class essentially translates an array of ScalarValuedFunction arguments into the proper format for a given function method.

A function method can specify its invoker using the Invoker annotation.
A provider of registered classes for the FunctionRegistry.
Stores a list of functions registered by the FunctionTypeResoultionProvider, and provides a method for calling those functions.
 
Determines the upper bound and result type of a ScalarValuedFunction.
A collection of useful utilities for implementing classes related to functions.
QualityMeasure that uses information gain as the splitting criteria.
QualityMeasure that uses information gain ratio as the splitting criteria.
Forces parallel streams of data to be gathered into a single non-parallel stream.
The gauge aggregator to use
Definition associated with a IntervalTimerSupplier.
A snapshot that reflects a point-in-time of a GaugeSupplier.
Abstract statistic that represents a measure of some runtime activity whose value may fluctuate over time.
 
 
 
Provides services to access Google Cloud Storage as a file system.
Generates a sequence of numerical values, with a constant difference between consecutive values.
Calculates the bag of words for a tokenized text field.
A combination generator capable of generating combinations of a given array of integers.
Generates copies of a constant value.
Generates rows of random data.
Generates a cycle of repeating values.
Provides implementations of Generator functions.
Describes the format of a generic log that can be parsed using a regular expression.
A formatter representing object token values as text values.
A pass-through PMML Model.
An object publishing buffered data.
General purpose TCP server that provides thread pooling, start and stop code.
Handles incomming connections
A description of the generic data type, the root of the type hierarchy.
Provides a way to update an in-memory reference to a model object.
Specialization of GetModel for PMMLPort's.
Describes the format of a Glassfish log file.
A data source representing the concatenation of all files matching a pattern.
Converts an argument to a Granularity enum value.
Thrown to indicate the cancellation of a running dataflow graph.
Provides notification whenever an operator is compiled.
A sequence of names typically used to identify elements within nested namespaces.
Factory for generating paths which provides for a standard separator and an escaping mechanism.
Describes the state of a LogicalGraphInstance.
Performs grouping (aggregation) of sorted input data.
Finds key groupings within the input key fields and, for each key group, generates all pairwise combinations of distinct rows in that group.
A globally unique identifier.
Defines support for the gzip compression algorithm.
Just a map of key/value properties
Implementation of a path for the Hadoop Distributed File System (HDFS).
Provides access to the Hadoop Distributed File System (HDFS).
Subclass of IOExceptions for Hadoop.
Public only as a matter of implementation; for internal use only.
A representation of a partially completed hash of a sequence of values.
A default hash partitioning function
Utility methods for hashing different data types.
A partitioning function that partitions records of data based on a HBase table.
A formatter representing binary token values as hexadecimal strings.
A parser producing binary token values from hexadecimal strings.
 
Configuration of the indexing phase for a matching operation.
Types of indexing operations supported.
 
A factory mapped by the widest type encompassing one or more input values.
Abstract class that represents a PMML input mapping.
A change sequence derived from an input port.
A runtime instance of a LogicalPort.
A snapshot of run-time statistics for an input port.
Live run-time statistics for an input port.
An abstract factory for input streams.
Contains various factory methods and utilities for creating InputStreamSupplier's.
Represents the pair (inputValue, targetValue).
A one-to-one mapping of strings to boolean values which does case-insensitive comparisons.
Interval<T extends Number & Comparable<? super T>>
Object representation of the PMML Interval element.
Enumeration of allowed values for the "closure" attribute of the Interval element.
Describes the formatting of a string representing TemporalAmounts such as Duration and Period.
A simple formatter/parser utility for TemporalAmounts such as Duration and Period.
Basic mutable implementation of AccumulatedTimerSupplier.
Definition associated with a IntervalTimerSupplier.
A snapshot that reflects a point-in-time of a IntervalTimerSupplier.
Abstract statistic that represents "wall-clock" time of some runtime activity.
A formatter representing int token values with a user-defined text format.
An integer valued field of a record input port.
An integer valued field of a record output port.
A parser producing int token values from a user-defined text format.
A mutable IntValued object.
A TokenSettable object containing an integer value.
An immutable IntValued object.
Primary implementation of MutableIntTokenSequence.
A TokenSequence containing integer valued data.
A TokenValued object containing an integer value.
A ScalarValuedIterator over a integer fields.
Thrown to indicate that an aggregator was constructed with an invalid number of arguments.
Exception indicating reference to an invalid field (or fields) in a record structure.
Exception thrown when an invalid argument is passed to a function within an expression.
An exception indicating the data type of a subexpression is not valid for use in the containing expression.
Thrown to indicate that either: the thrower attempted to parse a malformed or non-compliant PMML document, including one that used a different version of the PMML standard, or the thrower was required to build a PMML element that was non-compliant (e.g.
Exception that indicates a schema definition is invalid.
 
 
Valid operations on an I/O byte channel, such as a file or network socket.
Gathers statistics for an I/O channel.
Provides a context for instrumenting I/O operations.
A ip4 address valued field of a record input port.
A ip4 address valued field of a record output port.
A mutable Ip4AddressValued object.
A TokenSettable object containing a Inet4Address value.
An immutable Ip4AddressValued object.
Primary implementation of MutableIp4AddressTokenSequence.
A TokenSequence containing ip4address valued data.
A TokenValued object containing a ip4 address value.
A ScalarValuedIterator over ip4address values.
A ip6 address valued field of a record input port.
A ip6 address valued field of a record output port.
A mutable Ip6AddressValued object.
A TokenSettable object containing a Inet6Address value.
An immutable Ip6AddressValued object.
Primary implementation of MutableIp6AddressTokenSequence.
A TokenSequence containing ip6address valued data.
A TokenValued object containing a ip4 address value.
A ScalarValuedIterator over ip6address values.
Utilities for working with IpAddress data.
 
A formatter representing date token values as ISO 8601 date strings.
A parser producing date token values from ISO 8601 date and timestamp strings.
A formatter representing time token values using only the time portion, using ISO 8601 conventions.
A parser producing time token values from time-of-day expressed in ISO 8601 format.
A formatter representing timestamp token values as ISO 8601 timestamp strings.
A parser producing timestamp token values from ISO 8601 timestamp strings.
Definition of an item with the context of association rule mining.
A representation of a set of items with their support (frequency) within a data set.
The encoder factory for the ItemSet class.
To be implemented by operations that must make multiple passes over the input data.
Exception thrown if attempting to call TokenValuedIterator.stepNext() after it has already returned false.
 
Basic JDBC connector class.
Subclass of DRExceptions for JDBC.
 
Common base class for ReadFromJDBC and AbstractWriteToJDBC.
 
 
Context available to job.
Client-side API for submitting units of work to the job master.
Factory for creating new JobExecutorService's.
 
Basic job information that contains the job name, id, and the user that submitted the job.
 
 
 
Rollup of all parameters needed to launch a job
 
 
Performs a relational equi-join on two input datasets by a specified set of keys.
Combination of a left key and a right key.
Specifies how the Join operator should handle rows with unmatched keys.
The primary class that should be used for performing any JSON serialization/deserialization.
Subclass of DRExceptions for JSON.
 
A simple JSON helper utility
Discovers a schema for JSON text by checking values against the discovered JSON types and various regular expressions which map to data types.
A utility class used to read and parse JSON text records.
 
Utilities for custom JSON serializers.
 
An analyzer for files containing JSON text.
Contains the results of the analysis of a json text file.
Authenticates as a specified Kerberos principal using a keytab file.
Base class for the various SVM PMML kernel types.
DataDistribution based on a set of keys from the input data.
Specifies key field mapping when accessing HBase.
Specifies data field mapping when accessing HBase.
Computes clustering model for the given input based on the k-Means algorithm.
Applies the K-nearest neighbor algorithm to classify input data against an already classified set of example data.
An operator that issues warnings if a dataflow contains an unusually large number of distinct key values.
 
Truncates a flow to a fixed number of records.
Object representation of the LinearKernelType PMML element.
Performs a multivariate linear regression on the given training data.
A function invoker which passes all of its arguments as a single list to the method.
Provides implementations of common List functions.
Utility class to dump all registered operators to stdout or the file specified.
 
 
Service interface that controls which classes are auto-imported as a convenience.
Register standard classes to be available via live connect.
Bulk load data into the Actian Vector database.
Specifies the method to use for bulk loading data into Vector.
Run a command on the local machine using a locally started process.
 
Provides access to the local file system.
Describes the format of a log4j log file.
 
 
 
A simple formatter for use with DataRush applications.
Represents a logical graph which is intended to be executed.
Public factory for creating initially empty logical graphs.
The executable artifact of a logical graph.
Miscellaneous utilities for working with LogicalGraphInstance.
A read-only super-interface of LogicalGraphInstance
 
A logical operator is the fundamental unit of composition which may be added to a LogicalGraph.
A operator's port ( either input or output ).
Indicates whether the port is input or output
A port factory creates a port.
Marker interface for port options used by LogicalSubgraph.filtered
Statistic associated with a LogicalPort or a LogicalOperator.
A builder for statistics.
Miscellaneous utilities and constants associated with the LogicalStatistic class.
Extension of LogicalGraph used by IterativeOperator's.
Interface for creating subgraphs.
Fits a multinomial logistic regression model to the training data.
 
Log information about the input data from a flow.
A formatter representing long token values with a user-defined text format.
A long valued field of a record input port.
A long valued field of a record output port.
A parser producing long token values from a user-defined text format.
A mutable IntValued object.
A TokenSettable object containing a long value.
An immutable LongValued object.
Primary implementation of MutableLongTokenSequence.
A TokenSequence containing long valued data.
A TokenValued object containing a long value.
A ScalarValuedIterator over long values.
Simple LRUCache implementation based off of linked hash map
Provides implementations of common Map functions.
A base class for factories which choose an evaluator based on a token type.
Defines a mapping between data type and an evaluator class.
The class invoked as the Dataflow application master.
 
Type resolution provider for matching operators.
Provides implementations of common math functions found in the Math library.
 
 
 
 
 
 
 
 
Merges two streams of data with an equivalent number of rows into one.
This operator is uses to force a merge of a partitioned model into a single model.
Base interface for all contexts in which metadata is being computed.
Base interface for various contexts in which port metadata can be resolved.
Helper class for metadata needed by association models.
Provides utility methods for working with port metadata.
Corresponds to the MissingValueWeights PMML element.
Builder for MissingValueWeights instances.
Metadata helper utility for types that support a mixture of continuous and categorical fields.
A mockable sink that can be utilized with the SubJobExecutor to pass data back to the launcher.
A mockable source that can be utilized with the SubJobExecutor to insert parameter data.
Values of this enumeration correspond to the model classes of clustering models in the PMML specification.
Interface for a model converter that converts an association model in PMML into a target model type.
 
A model merge handle is responsible for merging partial models into a final model.
 
A model storage handler is responsible for handling the persistence of a model object.
A monotonically increasing sequence of change numbers.
 
 
 
 
 
 
 
Utilities for working with money data.
 
A money valued field of a record input port.
A money valued field of a record output port.
 
A mutable MoneyValued object.
A TokenSettable object containing a money value.
An immutable MoneyValued object.
Primary implementation of MutableMoneyTokenSequence.
A TokenSequence containing money valued data.
A TokenValued object containing a money value.
A ScalarValuedIterator over money values.
Extension of Callable that provides a way to get its current status prior to execution completion.
Extension of Future that allows the client to query for the partial status.
Compute the most frequent values within the given fields.
 
Mix-in interface to be implemented by some operators.
Mix-in interface to be implemented by some operators.
A MutableTokenSequence containing binary valued data.
A MutableTokenSequence containing boolean valued data.
A MutableTokenSequence containing character valued data.
A composite of settable scalar fields.
A MutableTokenSequence containing date valued data.
A MutableTokenSequence containing double valued data.
A MutableTokenSequence containing duration valued data.
A MutableTokenSequence containing float valued data.
A MutableTokenSequence containing integer valued data.
A MutableTokenSequence containing ip4address valued data.
A MutableTokenSequence containing ip6address valued data.
A MutableTokenSequence containing long valued data.
A MutableTokenSequence containing money valued data.
A MutableTokenSequence containing numeric valued data.
A MutableTokenSequence containing object valued data.
A MutableTokenSequence containing period valued data.
A MutableTokenSequence of composite tokens, representing record data.
A MutableTokenSequence containing scalar typed tokens.
A MutableTokenSequence containing string valued data.
A MutableTokenSequence containing timestamp valued data.
A MutableTokenSequence containing time valued data.
A mutable TokenSequence.
Operator responsible for building a Naive Bayes PMML model from input data.
Operator responsible for predicting outcomes based on a Naive Bayes PMML model.
Encapsulates the association of a name and entry in a namespace.
An object which is assigned a name.
A mapping of names to objects, similar to a Map.
A mutable implementation of a Namespace.
A collection of utility methods for working with Namespaces.
Generates names for namespace entries when none is provided.
Determines how to measure how near an example and query record are to each other.
A calculator of "nearness".
Specifies the network configuration to use.
 
Implementation of an n-gram.
Implementation of an n-gram model.
Represents the initial node allocation.
Assignment details for a particular node ID.
Reflects the current state of node allocation, which takes into account failed nodes.
Assignment details for a particular node ID.
Contains node and host information
Contains host information associated with the nodes in the cluster that the current job is running on.
 
 
Wraps an output stream in one that does not close the underlying stream
An executor of a function which is non-deterministic.
Describes a field to be normalized.
Apply normalization methods to fields within an input data flow.
OutputStream that ignores all calls.
An null token.
Converts an argument to a NumberFormat object.
A number valued field of a record output port.
A TokenSettable object containing a number value.
A formatter representing numeric token values with a user-defined text format.
Corresponds to the PMML NumericInfo element.
A numeric valued field of a record input port.
A numeric valued field of a record output port.
A parser producing numeric token values from a user-defined text format.
A mutable NumericValued object.
A TokenSettable object containing a numeric value.
An immutable NumericValued object.
Primary implementation of MutableNumericTokenSequence.
A TokenSequence containing numeric valued data.
A TokenValued object containing a numeric value.
A ScalarValuedIterator over numeric values.
An object valued field of a record input port.
A typer determining the output type of a function based on the types of one or more Object inputs.
An object valued field of a record output port.
A mutable ObjectValued object.
A TokenSettable object containing a Java object.
An immutable ObjectValued object.
Primary implementation of MutableObjectTokenSequence.
A TokenSequence containing object valued data.
A description of a scalar data type for a Java class.
Identifies a class which is associated with Java object data.
A TokenValued object containing a Java object.
A ScalarValuedIterator over object values.
A composite operator that can be externally composed.
A model sink that can be externally composed.
A model source that can be externally composed.
A model sink that can be externally composed.
A model source that can be externally composed.
A record sink that can be externally composed.
A record source that can be externally composed.
A record sink that can be externally composed.
A record source that can be externally composed.
FOR INTERNAL USE ONLY.
FOR INTERNAL USE ONLY.
Base interface for any context/graph that allows operators to be added and ports to be connected.
Provides a summary description of an operator that can be exposed in documentation.
Captures the definition of an operator.
A runtime instance of a LogicalOperator.
Public only as a matter of implementation; not intended for external use
Public only as a matter of implementation; not intended for external use
Public only as a matter of implementation; not intended for external use
Public only as a matter of implementation; not intended for external use
Public only as a matter of implementation; not intended for external use
Public only as a matter of implementation; not intended for external use
Public only as a matter of implementation; not intended for external use
An OperatorRegistrar is capable of registering a DataRush operator within a scripting environment.
Define the interface for a registration factory capable of registering DataRush operators with a scripting environment.
Register DataRush library operators within a scripting environment.
Advanced operator settings that can be specified for an operator.
The execution state of an operator within a graph.
 
Enumeration of possible operator types
Data format implementation for ORC files.
Meta data about an ORC data set shared externally.
Extra options for an OSGI classpath element
A callback interface for handling events occurring on an output port.
The SQL action to take when processing records from a flow.
A runtime instance of a LogicalPort.
A snapshot of run-time statistics for an output port.
Live run-time statistics for an input port.
A wrapper for creating FixedSizeTextType objects from arbitrary TextDataType objects.
Field justifications for padded values.
 
A ParallelismStrategy is responsible for determining parallelization of an operator based on configured parallelism and source parallelism.
Records the name of a parameter.
 
Data format implementation for Parquet files.
Meta data about a Parquet data set shared externally.
Describes the possible actions for handling record parsing errors.
Exception indicating a parsing error.
Exception indicating an error while parsing a value expression.
Parses input text records according to a specified text schema.
A collection of parameters for configuring parsing.
Base class for all distributions that represent a partitioning of the data.
Base class for all distributions whose partitioning parameters are based on the data.
Base class for all distributions whose partitioning parameters are specified statically.
Forces the input data to be partitioned into parallel streams of data for subsequent parallel operations.
Represents a pluggable partitioning function; for use by operators that require custom partitioning
Computes partition assignments for each row if input
Contextual information for PartitioningFunction's.
Information pertaining to an instance of a partitioning operation.
The class invoked as the Dataflow partition server/worker.
A pass-thru envelope for a json-serializable object.
Wraps a string into an object that is json-serialized in an encrypted format.
A ConfigurationField that consists of secret information.
An abstract identifier for a resource.
Describes a Path along with its metadata.
 
Exception thrown when the serialization of a path cannot be parsed.
A factory for creating Path objects.
Utility methods for dealing with Hadoop style paths.
Discovers a schema for delimited text by checking values against a mapping of regular expressions to data types.
 
A period valued field of a record input port.
A period valued field of a record output port.
 
A mutable PeriodValued object.
A TokenSettable object containing a period value.
An immutable PeriodValued object.
Primary implementation of MutablePeriodTokenSequence.
A TokenSequence containing period valued data.
A TokenValued object containing a Period value.
A ScalarValuedIterator over period values.
Configures a phase of field comparisons, classifiers and a filter to use during a matching operation.
Specifies which fields to remove from records after completion of a phase.
Marker interface for physical inputs.
Marker interface for physical outputs.
The common interface for all ports, both input and output.
Mix-in interface to be implemented by some operators.
Various utility methods for PMML-Dom manipulation
Utility methods for formatting and parsing PMML arrays
Enumeration of PMML array types
Captures the information required to build a PMML Association model.
PMML clustering model.
Builder for PMMLClusteringModel instances.
 
Base class for PMML object model objects.
Enumeration of values for the "functionName" attribute
Meta-information associated with a given PMMLModel.
PMML Naive Bayes Model.
Specialization of model ports for handling PMML models.
The port metadata associated with PMMLPorts.
Object representation of a PMML matrix
Implementation of a 'maximum likelihood' classifier on top of regression models.
PMML object model for Regression models.
PMML ModelStats.
PMML SupportVectorMachineModel.
Corresponds to the PMML SVM-REPRESENTATION enumeration.
PMML TreeModel.
Corresponds to the PMML MISSING-VALUE-STRATEGY enumeration.
Corresponds to the PMML NO-TRUE-CHILD-STRATEGY enumeration.
Corresponds to the PMML splitCharacteristic enumeration.
Encapsulates version-specific PMML information.
Object representation of the PolynomialKernelType PMML element.
Provides a summary description of a port that can be exposed in documentation.
Definition of an operator port.
A runtime instance of a LogicalPort.
Marker interface for metadata associated with a given port.
 
A representation of the setting for a port configuration property.
Utility class for working with port settings.
A snapshot of run-time statistics for a dataflow port.
Live run-time statistics for a dataflow port.
Possible states for a port.
 
Parent class for all of the PMML predicates.
Provides implementations of common boolean and logical functions.
A PMML object model for some of the metadata about a predictive (usually regression) model's quality.
Indicator for the phases of model-building during which a dataset may be used.
Utilities for acquiring process-level authentication credentials.
A collection of settings made available to ProcessSubjectProvider implementations.
Provides various utilities to use when working with the ProcessBuilder class.
Executes an application graph for each distinct key group of data within the input data set.
Simple container for context variables passed into the JavaScript environment for each group.
Provides a mechanism for acquiring process-level authentication credentials.
Utility methods to consume properties files.
 
A description of a comparison property.
Provides a summary description of an operator property that can be exposed in documentation.
 
Utility methods for property validation.
Shared interface implemented by LogicalGraph and OpenComposite.
Pruning configuration for the decision tree.
The C4.5 pruning strategy.
Subinterface of DistributedExecutorService for running in pseudo-distributed mode.
 
Provides a way to inject an in-memory reference to a model object into a graph.
Specialization of PutModel for PMMLPort's.
The quality measure to be used for evaluating splits.
Quantile<T extends Number & Comparable<? super T>>
Corresponds to the PMML Quantile element.
Object representation of the RadialBasisKernelType PMML element.
Operator that re-orders its input in a random fashion.
An aggregator for a range of values
A partitioning function that partitions records of data based on specified ranges.
Rank data using the given rank mode.
Definition of the supported rank modes.
Storage class by which a custom aggregator can load its internals.
Bulk read data from the Actian Vector database.
Read files in the Attribute-Relation File Format (ARFF).
Reads data previously written using Apache Avro format.
Reads a text file of delimited records as record tokens.
Reads a text file of fixed-width records as record tokens.
The ReadFromJDBC operator is used to access relational database systems using a supplied JDBC driver.
Read a result set from HBase.
The ReadJSON operator reads a JSON file of key-value pairs or array of objects as record tokens.
Reads a log file as record tokens.
Reads data previously written using MDF format.
 
 
Reads data previously written using Apache Parquet format by Apache Hive.
Reads a PMML model object from a file.
Reads a data source as a stream of records.
Reads a sequence of records previously staged to disk.
Gets the collected data.
A factory for creating RecordIdentifier objects.
A discriminator for identifying text record types in multi-record schemas.
An input port on a record data flow.
Provides a mechanism to wait for data on a set of RecordInput objects.
The actions supported when a record length error occurs.
Provides a mapping interface over records by defined key fields.
Factory for creating record maps.
The metadata associated with record ports.
 
 
 
 
 
 
 
An output port generating a flow of record tokens.
Refinement of PipelineOperator, specific to RecortPort's.
Defines a logical record port.
Provides a way for iterative operators to select a filtered view of one of their input ports.
A composite register of scalar registers; a mutable RecordValued object.
For internal purposes only.
 
For internal purposes only
For internal purposes only
 
A container for structural information about a file containing text records.
A collection of structural information about a text file consisting of records.
A TokenSettable object containing record data.
An adapter for converting the apparent type of a RecordSettable.
Refinement of SinkOperator, specific to RecortPort's.
Refinement of SourceOperator, specific to RecortPort's.
A schema for text files containing a single record type.
Provides a mapping between records and integer ids.
An accessor for performing lookup operations.
An accessor for building the map.
A composite token of scalar valued tokens; an immutable RecordValued object.
A sequence of composite tokens to which additional tokens can be appended.
An object for comparing composite token values.
Primary implementation of MutableRecordTokenSequence.
A TokenSequence of composite tokens, representing record data.
A description of a record data type.
An object for dynamically constructing a record type.
Identifies a class which is associated with record data.
A TokenValued object containing record data.
An adapter for converting the apparent type of a RecordValued.
Iterator over record values.
An indirect handle to an object.
Utility methods related to reflection.
A factory creating a fixed class of function evaluator.
 
A word break iterator that that allows its default behavior for the Locale to be overridden by supplied regular expression rules.
 
Apply a regression model to the input data.
Checks equivalence based on the relative error between two floating point numbers.
Rearranges and renames fields in a record.
Run a command on a remote server using the ssh protocol connecting to a sshd.
Removes duplicate rows based on a specified set of group keys.
Removes a subset of fields from the input records.
Replace missing values in the input data according to the given replacement specifications.
Specify per field or per group of fields how missing (null) values should be handled.
The action to take when a missing value is discovered within a field.
Describes the resource allocation for a job server process.
Resource specifier for the resources needed by a job.
A factory mapped by the expected type of the result.
 
Preserves a subset of fields from the input records.
Represents a position in some sequence.
Script factory for the Rhino engine that uses compiled scripts for performance.
Exception that indicates a remote communication failure.
Extension of InterruptedException thrown to indicate that the current thread was interrupted waiting for a remote request.
The RowsToColumns operator is used to pivot data from a narrow representation (rows) into a wider representation (columns).
Exception thrown when the number of characters in the first row of a delimited text file exceeds the maximum allowed.
Defines a rule that can be used to determine if a particular record discriminator value fits the definition of a record type.
Generate association rules from frequent item sets.
Processes rows using user-defined scripts written in JavaScript.
Execute an R script in flow.
Processes rows using user-defined scripts.
 
Provides services to access the Amazon S3 web service as a file system.
The mode to use when applying random sampling.
Apply random sampling to the input data.
Describes an implementation of a function.
Specifies a bound on subexpressions used as arguments to a function.
A field of a record input port.
A field of a record output port.
A mutable ScalarValued object.
A TokenSettable containing a scalar value.
An immutable ScalarValued object.
A sequence of scalar tokens to which additional tokens can be appended.
A TokenSequence containing scalar typed tokens.
A description of a scalar data type.
Identifies a class which is associated with scalar data.
The scalar types exposed to scripting.
A TokenValued object containing a scalar value.
The description of a function taking record data and producing a scalar result.
Iterator over scalar values.
Collection of static methods helpful in building schemas (TextRecord instances).
An internal class used to transform schemas to/from JSON.
Enumeration of schema field types
Defines the types of schemas supported.
Corresponds to the PMML ScoreDistribution element.
Provides an unchecked wrapper for exceptions thrown by scripts providing such information as line and column number of the error.
Container for options to the script environment that are exposed to operators that are scripting aware.
Interface implemented by operators that need to be scripting aware.
Exception that indicates a script attempted to assign a value of the wrong type to an output field.
General script runner.
Type registry of script module.
A repositionable iterator over record values.
A repositionable iterator over scalar values.
A TokenValuedIterator that is also seekable.
Preserves a subset of fields from the input records.
Deprecated.
this operator has been replaced with FilterExistingRows; use that operator instead, linking to the appropriate output port.
Specifies how the SemiJoin operator should behave.
Contains sequence functions.
ModelStorageHandler that supports persistence of any Java-serializable model object.
This class serves as a wrapper for an object that is json serializable such that it can be java-serialized.
Miscellaneous serialization utilities
Provides a means of looking up services of a particular type.
Provides a key for a session attribute and, optionally, an initial value
Provides access to SFTP resources as a file system.
Object representation of the SigmoidKernelType PMML element.
A collection of functions for computing similarity of strings.
Convenient base class for implementations of AggregationProvider.
Convenient base class for implementations of FunctionProvider.
A simple implementation for model ports.
Factory for creating SimpleModelPorts
Corresponds to the PMML SimplePredicate element.
Enumeration of allowed values for the "operator" attribute.
For internal use only; public as a matter of implementation.
Registry of services.
Server-side implementations may implement this interface to receive notifications of server start and stop
Corresponds to the PMML SimpleSetPredicate element.
Enumeration of allowed values for the "booleanOperator" attribute.
Convenient base class for implementations of TypeResolutionProvider.
For tests that wish to simulate a partitioned read.
A split iterator containing a single split.
Mix-in interface to be implemented by some operators.
Defines support for the Snappy compression algorithm.
Responsible for opening a socket
Given a NetworkConfiguration and a socket address, creates the appropriate socket builder
Sorts the input data.
Common base class for all processes that need to detect group boundaries
Combination of a left key, a right key, and an ordering.
A combination of a key field and an ordering.
Mix-in interface to be implemented by some operators.
An implementation of MutableBinaryTokenSequence which stores data sparsely.
An implementation of MutableBooleanTokenSequence which stores data sparsely.
An implementation of MutableCharTokenSequence which stores data sparsely.
An implementation of MutableDateTokenSequence which stores data sparsely.
An implementation of MutableDoubleTokenSequence which stores data sparsely.
An implementation of MutableDurationTokenSequence which stores data sparsely.
An implementation of MutableFloatTokenSequence which stores data sparsely.
An implementation of MutableIntTokenSequence which stores data sparsely.
An implementation of MutableIp4AddressTokenSequence which stores data sparsely.
An implementation of MutableIp6AddressTokenSequence which stores data sparsely.
An implementation of MutableLongTokenSequence which stores data sparsely.
An implementation of MutableMoneyTokenSequence which stores data sparsely.
An implementation of MutableNumericTokenSequence which stores data sparsely.
An implementation of MutableObjectTokenSequence which stores data sparsely.
An implementation of MutablePeriodTokenSequence which stores data sparsely.
The matching decoder for SparseRecordEncoder.
A composite register of scalar registers; a mutable RecordValued object.
An implementation of MutableRecordTokenSequence which stores data sparsely.
An implementation of MutableStringTokenSequence which stores data sparsely.
An implementation of MutableTimestampTokenSequence which stores data sparsely.
An implementation of MutableDateTokenSequence which stores data sparsely.
Splits a string field into multiple fields, based on a specified pattern.
Interface defining an input data stream that works within the boundaries of a defined split.
A wrapper for input streams providing windowing behavior.
A forward-only iterator over data splits with associated locality information.
Settings which control the generation of splits on files.
An object representing the context of a data split parsing operation.
Indicates a parsing error on a data split.
A character based reader for splits.
Represents a file split for a compression format that supports splitting.
 
A handle by which one can access the staged data associated with an iterative port.
Various options that can be passed to StagedSource.getSource(StagedSourceOptions).
Provides metadata for a statistic including including display name controls how the statistic is to be aggregated across partitions.
Utilities for formatting runtime statistics for presentation in the UI.
Provides implementations of common statistical functions.
Utility that provides a mapping from StatisticDefinition to StatisticSupplier.
For advanced use only; operator authors should not need to access this class.
The state associated with a StatisticSupplier.
Abstract supplier of a runtime statistic.
Miscellaneous utilities for working with StatisticSupplier's.
 
Useful statistics related functions.
Supported methods of value normalization.
Interface for a stemmer.
Enumeration of the possible snowball stemmer algorithms.
 
Convenient pairing of StorageBucketId and it's associated StorageManager.
A storage bucket represents one unit of work to be exchanged by nodes in the cluster.
A manager of intermediate storage.
Extension point to register a new StorageManager type.
Utilities for creating storage managers
Rollup of the various parameters needed to create a StorageManager.
An allocator of StorageBucketId's.
Context used when computing metadata via StreamingOperator.computeMetadata(StreamingMetadataContext).
Base class for LogicalOperators that are a potentially streaming operation.
Defines the strictness mode used when evaluating JavaScript source.
A collection of functions for phonetic encoding of string values.
Converts a string token value to a string.
A string valued field of a record input port.
A string valued field of a record output port.
Converts a string to a string token value.
A mutable StringValued object.
Provides implementations of common string functions.
Specifies behavior of null values for string concatenation.
A TokenSettable object containing a string value.
An immutable StringValued object.
Primary implementation of MutableStringTokenSequence.
A TokenSequence containing string valued data.
Utility methods for operating on strings.
A TokenValued object containing a string value.
A ScalarValuedIterator over string values.
Parser for text file schemas written in Pervasive DataIntegrator structured schema (.schema) format.
The SubJobExecutor operator can be used to execute JSON serialized subgraphs within the current workflow.
Discovers various metrics of an input dataset, based on the configured detail level.
Compute the sum of squares for the given fields of the input data.
Enumerated type that provides factory constructors for the supported log formats.
Object representation of the SupportVectorMachine PMML element.
Builds a PMMLSupportVectorMachineModel from an input dataset.
Operator responsible for classification based on a SVM PMML model.
Base class of SVM types associated with predicting a target variable.
Base class the represents the type of SVM model to build.
Corresponds to the C-SVC SVM model type.
Corresponds to a one-class SVM model type.
Describes the format of a syslog log file.
Public only as a matter of implementation; clients should not use this class.
PMML extension used to support numerical data.
A provider of useful services for unit tests.
A tree node that can hold information on text elements.
Contextual information needed by TokenParser's and TokenFormatter's.
Describes behaviors used during conversion of token values to and from text.
A mapping between a scalar token type and text representations of values of that type.
Definition of a text element.
 
Enumeration of the possible text and character groupings.
A ConfigurationField that consists of clear text information.
The text analog of Field.
Interface for a text filter usable by the FilterText operator.
Filters a frequency map field.
A formatter representing ip4address token values as text IPv4 strings.
A parser producing ip4address token values from text IPv4 strings.
A formatter representing ip6address token values as text IPv6 strings.
A parser producing ip6address token values from text IPv6 strings.
Interface for a node in a TextContainer tree.
A definition of a variable-width record type in a text file.
A generator of text record schemas.
A definition of the record schema of a text file.
Stems a TokenizedText field.
Tokenizes a string field as a TokenizedText object.
Utility methods for operating on TextContainer objects.
Common external text data type definitions.
Enumerates the possible conversions for string-valued text types.
A formatter representing time token values with a user-defined text format.
A time valued field of a record input port.
A specific moment of time on a 24-hour clock.
A representation of a point-in-time when a timeout period expires.
A time valued field of a record output port.
A parser producing time token values from a user-defined text format.
Duration timer for elapsed wall-clock time.
A mutable TimeValued object.
A TokenSettable object containing a time value.
Deprecated.
since 6.1; use DateTimeUtils instead.
 
A timestamp valued field of a record input port.
A timestamp valued field of a record output port.
A parser producing timestamp token values from a user-defined text format.
A mutable TimestampValued object.
A TokenSettable object containing a timestamp value.
An immutable TimestampValued object.
Deprecated.
since 6.1
Primary implementation of MutableTimestampTokenSequence.
A TokenSequence containing timestamp valued data.
A TokenValued object containing a timestamp value.
A ScalarValuedIterator over timestamp values.
An immutable TimeValued object.
Primary implementation of MutableTimeTokenSequence.
A TokenSequence containing time valued data.
A TokenValued object containing a time value.
A ScalarValuedIterator over time values.
Converts an argument to a TimeZone object.
A sequence of data tokens to which additional tokens can be appended.
An object for comparing tokens of compatible types.
A factory for creating TokenComparator objects.
An object capable of converting tokens into other formats.
A factory for creating TokenConverter objects.
Converts binary encodings of tokens into token values.
Converts token values into a binary encoding.
A factory for obtaining standard encoders/decoders for token values.
Converts token values into text values.
A TextContainer that can store a tokenized paragraph.
A TextContainer that can store a tokenized sentence.
A TextContainer that can store a tokenized document.
A TextContainer that can store a tokenized word.
An ordering of token values.
Converts text values into token values.
An object capable of transferring a token from a random-access source to a target.
A factory for creating DataRegister objects.
A random-access, read-only view of a sequence of data tokens of homogeneous type.
Utilities for working with arrays of tokens.
An object holding a nullable data value which is mutable.
An object capable of producing a sort order permutation of a TokenSequence.
A description of the properties of a token data type.
A collection of constant descriptors and utilities for the supported token types.
Identifies a class which handles data tokens.
An object performing (potentially) different processing based on token type.
Utilities for working with token values.
An representation of a nullable data value.
Iterator over token values.
 
Provides a transformation from a source data type to a target data type.
Provides a set of transforms and a means to find the wanted one.
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
Corresponds to the PMML Node element.
A mapping between boolean values and strings.
A base class for type-driven object factories.
Unchecked exception thrown to indicate a type is not valid in a given context.
Describes a pattern which identifies a text data type.
Facade that can be used to query the list of classes registered via TypeResolutionProvider's.
A provider of registered classes.
Unchecked exception thrown when an attempt is made to compare two typed objects of incompatible types.
Various utilities for manipulating token types.
 
Checks equivalence based on the distance between two floating point numbers in units of least precision (ULP).
Defines support for uncompressed data.
Provides a union of two data sources.
 
Optionally indicates a unit of measure of a statistic.
Object representation of the PMML UnivariateStats element.
Provides UNIX-style globbing over paths.
Provides information for performing globbing.
Specifies the behavior for handling data sources which are unreadable.
 
The default data distribution; if unspecified an operator will receive data in the same distribution as the upstream operator.
Thrown to indicate that a PMML element was encountered that is not currently supported by our predictor.
This operator updates the target table in a database by applying SQL update statements.
Provides generic access to URL resources.
A collection of utility methods for working with URLs.
Variable expander that knows how to expand simple macro expressions of the form "${variableName}".
Object representation of the VectorDictionary PMML element.
Internal factory class for composing direct load operations on Vector.
Helper class that provides information about the Vectorwise installation and various utilities.
 
 
 
Describes the version of the DataRush installation.
Identifies a "virtual node" in a cluster.
Enumeration of non-parallel locations
 
 
 
A specialized function invoker for wavg.
Configuration to a web application.
A handle that provides lifecycle methods to a web application server.
Event information.
A listener to various server events.
A provider for an embedded web application server.
Facade for finding and using WebApplicationServerProvider's.
A typer determining the output type of a function based on the types of one or more inputs.
 
Implementation of a word frequency model.
 
Storage class by which a custom aggregator can storage its internals.
Write files using the Attribute-Relation File Format (ARFF).
Writes data using Apache Avro format.
Writes a stream of records as delimited text.
Writes a record dataflow as a text file of fixed-width records.
Write a result set to HBase.
Enumerates the possible file dispositions for writing.
Write data in the Apache Hive ORC format.
Write a PMML object representation to a file at the given path.
Writes a stream of records to a data sink.
Writes a sequence of records to disk in an internal format for staged data.
In its simplest form, writes records from an input port to a JDBC target table using insert statements.
Utility methods for the manipulation of XML documents and elements.
A container of objects passed to the application master as arguments.
The executor responsible for starting Dataflow worker containers within YARN.
A job executor is created for each new job.
Implementation of the ClusterProvider
 
Container of arguments passed to each partition worker.