All Packages

Package Summary
Package
Description
 
 
 
 
 
 
 
Provides common classes for Analytics, such as operator registration.
Provides common classes for Association Rule Mining (ARM).
Provides the operator to perform the FP-growth ARM algorithm.
Provides operators related to data cleansing.
Provides base PMML for clustering models.
Provides the KMeans algorithm.
Provides PMML model classes for decision trees.
Provides the PMML learner operator and associated classes.
Provides the decision tree predictor operator and associated classes.
Provides the decision tree pruner operator and associated classes.
Provides various statistics functions.
Provides an implementation of the KNN algorithm using DataRush's sparse data API.
Provides PMML model classes for Naive Bayes.
Provides an implementation of the Naive Bayes learner.
Provides an implementation of a Naive Bayes predictor.
Provides shared and base classes for PMML model representation of Analytics algorithms.
 
Provides utility, PMML and other classes for shared use by regression related entities.
Provides various statistics, Data Summarizer, and Data Quality Analyzer.
Provides PMML model classes for SVM.
Provides an implementation of an SVM learner.
Provides an implementation of an SVM predictor.
Provides various unstructured text processing operators.
 
Provides various stemmer algorithms based on the snowball definitions.
Provides some (internal) utility classes for Analytics.
Provides operators for classifier performance visualization.
Provides annotations used to describe elements of a dataflow graph.
Provides classes and interfaces related to authentication.
 
 
Provides interfaces the define the "cluster abstraction layer".
 
Provides classes for dynamic coercion of data structures (for example, arrays and maps) to complex Java objects.
 
Provides common utilities
 
Provides basic interfaces used when converting tokens into various formats.
Implementations of encoders of tokens into binary formats.
Implementations of encoders of tokens into text formats.
Provides classes and interfaces related to encryption
 
Provides classes and interfaces related to defining functions on records.
 
Provides classes and interfaces for the construction of executable dataflow graphs.
 
 
 
Provides classes and interfaces performing file-like I/O operations.
Provides classes and interfaces for supporting data compression.
 
Provides classes and interfaces associated with filesystem configuration
 
 
Provides common utilities and registry for JSON parsing
Provides operators for performing discovering duplicates or links between records.
Provides operators for generating possible candidate pairs.
Provides operators for clustering the results of duplicate or linkage discovery.
Provides functions related to approximate matching.
Provides operators for analyzing data for approximate matching.
 
Provides classes and interfaces for reporting on commonly monitored resources such as memory, threads, and I/O.
Provides utilities for creating and managing collections of named objects.
 
Provides classes and interfaces for developing dataflow operators.
Provides operators for making assertions on flows and files.
 
Provides data aggregation components.
Provides base file I/O components including encoders and decoders.
Provides operators for reading and writing files in Avro format.
 
Provides operators for reading from JDBC sources and writing to JDBC targets.
 
Provides operators for reading and writing DataRush staging datasets.
Provides operators for reading and writing text data.
 
 
Provides operators for joining together two data sets into a single one.
Provides operators for handling models.
Provides operators for partitioning and unpartitioning flows of data.
Provides operators for manipulating record structure.
Provides the RunScript operator for running user-defined scripts on the rows of an input record flow.
 
Provides operators for selecting a subset of the data set.
Provides the LogRows operator for writing debugging information about a flow to the logging API.
Provides operators for sorting and manipulating sorted flows.
Provides operators for generating data tokens in various ways.
Provides operators for operating on string values in records.
 
Provides classes and interfaces related to receiving and sending data in a dataflow graph.
Provides implementations of port objects dealing with the flow of single objects between operators.
Provides classes and interfaces for accessing and producing the data flowing between operators in a dataflow graph.
Provides implementations of port objects related to the flow of record sets between operators.
Provides an object model for capturing schema information used primarily by the textfile package.
 
 
 
Provides classes and interfaces related to sequences of tokens.
Provides implementations of sequences of record valued tokens.
Provides implementations of sequences of scalar token values.
Provides tools useful for testing DataRush applications.
 
Provides classes and utilities for working with data tokens.
Provides implementations of and utilities for record valued tokens.
Provides implementations of and utilities for scalar valued tokens.
Provides classes and interfaces for the description of token data types.
Provides utilities useful in DataRush application development.