Interface | Description |
---|---|
DataSplit | Describes a range of bytes from a data source. |
DirectoryFilter | A filter for selecting paths. |
FileSystem | Describes the file system identified by a path scheme. |
FileSystemProvider | Provides basic operations on paths for a specific path scheme or schemes. |
InputStreamSupplier | An abstract factory for input streams. |
IOChannelStatsCollector | Gathers statistics for an I/O channel. |
IOMonitoringContext | Provides a context for instrumenting I/O operations. |
Path | An abstract identifier for a resource. |
PathDetails | Describes a Path along with its metadata. |
PathGlob | |
SplitInputStream | Interface defining an input data stream that works within the boundaries of a defined split. |
SplitIterator | A forward-only iterator over data splits with associated locality information. |

Class | Description |
---|---|
BasicPathDetails | |
BinaryBuilder | A buffer for building variable-length binary valued data. |
BinaryReader | Provides extended data access methods on binary data flows. |
BuiltinStreamProvider | Provides access to built-in data streams. |
CharsetEncoding | Describes the encoding format of character data. |
CompressedFileSplit | Describes a range of bytes from a compressed file. |
CompressionSplitIterator | |
FileClient | Provides access to files and directories. |
FileSplit | Describes a range of bytes from a file. |
FTPFileSystemProvider | Provides access to FTP resources as a file system. |
FTPPath | |
HadoopFilePath | Implementation of a path for the Hadoop Distributed File System (HDFS). |
HadoopFileSystemProvider | Provides access to the Hadoop Distributed File System (HDFS). |
InputStreamSuppliers | Contains various factory methods and utilities for creating InputStreamSupplier's. |
LocalFileSystemProvider | Provides access to the local file system. |
Paths | A factory for creating Path objects. |
PathUtility | Utility methods for dealing with Hadoop style paths. |
PortRange | |
SFTPFileSystemProvider | Provides access to SFTP resources as a file system. |
SingleSplitIterator | A split iterator containing a single split. |
SplitInputStreamImpl | A wrapper for input streams providing windowing behavior. |
SplitOptions | Settings which control the generation of splits on files. |
SplitReader | A character based reader for splits. |
SplittableCompressedFileSplit | Represents a file split for a compression format that supports splitting. |
UnixStyleGlobbing | Provides UNIX-style globbing over paths. |
UnixStyleGlobbing.GlobDefinition | Provides information for performing globbing. |
URLFileSystemProvider | Provides generic access to URL resources. |

Enum | Description |
---|---|
BasicPathDetails.ObjectType | |
FTPPath.FTPProtocol | |
IOChannelOperation | Valid operations on an I/O byte channel, such as a file or network socket. |
WriteMode | Enumerates the possible file dispositions for writing. |

Exception | Description |
---|---|
EOFException | An exception indicating end-of-file has been unexpectedly reached on a stream. |
FileAlreadyExistsException | An I/O exception indicating the file in question already exists. |

There are three main components to this model: paths (Path), file systems (FileSystem), and file system providers (FileSystemProvider).
Of these items, users only need to be aware of paths and the utility classes surrounding them. File systems and file system providers are implementation-specific and only required when developing support for a new type of path.
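To illustrate how a path scheme can select a provider, here is a minimal, self-contained sketch. It is not the DataRush API: the class `SchemeDispatchSketch`, the `Provider` interface, and the rule that a path without a scheme is treated as a local file are all assumptions made for this example. Only `java.net.URI` is a real library class.

```java
import java.net.URI;
import java.util.HashMap;
import java.util.Locale;
import java.util.Map;

/** Hypothetical illustration only; not part of the DataRush API. */
public class SchemeDispatchSketch {

    /** Hypothetical stand-in for a file system provider. */
    public interface Provider {
        String name();
    }

    // Maps a lower-cased path scheme (for example "hdfs") to the provider that handles it.
    private final Map<String, Provider> providersByScheme = new HashMap<>();

    /** Registers a provider for one scheme; call repeatedly for providers that serve several schemes. */
    public void register(String scheme, Provider provider) {
        providersByScheme.put(scheme.toLowerCase(Locale.ROOT), provider);
    }

    /** Looks up the provider responsible for the scheme of the given path. */
    public Provider providerFor(String path) {
        String scheme = URI.create(path).getScheme();
        // Assumption for this sketch: a path without a scheme refers to the local file system.
        String key = (scheme == null) ? "file" : scheme.toLowerCase(Locale.ROOT);
        Provider provider = providersByScheme.get(key);
        if (provider == null) {
            throw new IllegalArgumentException("No provider registered for scheme: " + key);
        }
        return provider;
    }

    public static void main(String[] args) {
        SchemeDispatchSketch registry = new SchemeDispatchSketch();
        registry.register("file", () -> "local");
        registry.register("hdfs", () -> "hadoop");

        // The scheme of each path decides which provider is used.
        System.out.println(registry.providerFor("hdfs://namenode:8020/data/input.txt").name());
        System.out.println(registry.providerFor("/tmp/input.txt").name());
    }
}
```

Code that only consumes paths never touches the registry. That matches the point above: providers matter mainly when adding support for a new type of path.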
Additionally, the DataRush model has the concept of a file split, similar to that found in Hadoop. Splits are used to parallelize processing of files: each split describes a range of bytes that can be read on its own. File system providers should support splitting files whenever possible so that files can be read in parallel.
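The idea of a split can be sketched as dividing a file into consecutive byte ranges. The example below is an illustration under stated assumptions, not how DataRush generates splits: `SplitSketch`, `ByteRange`, and `computeSplits` are hypothetical names, and real split generation may also account for record boundaries, compression, and data locality.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical illustration only; not part of the DataRush API. */
public class SplitSketch {

    /** A half-open byte range [offset, offset + length) within a file. */
    public static final class ByteRange {
        public final long offset;
        public final long length;

        public ByteRange(long offset, long length) {
            this.offset = offset;
            this.length = length;
        }

        @Override
        public String toString() {
            return "[" + offset + ", " + (offset + length) + ")";
        }
    }

    /** Divides a file of fileLength bytes into consecutive ranges of at most maxSplitSize bytes. */
    public static List<ByteRange> computeSplits(long fileLength, long maxSplitSize) {
        if (fileLength < 0 || maxSplitSize <= 0) {
            throw new IllegalArgumentException("fileLength must be >= 0 and maxSplitSize must be > 0");
        }
        List<ByteRange> splits = new ArrayList<>();
        long offset = 0;
        while (offset < fileLength) {
            // The final range may be shorter than maxSplitSize.
            long length = Math.min(maxSplitSize, fileLength - offset);
            splits.add(new ByteRange(offset, length));
            offset += length;
        }
        return splits;
    }

    public static void main(String[] args) {
        // Each range could be handed to a separate reader to process the file in parallel.
        for (ByteRange range : computeSplits(10, 4)) {
            System.out.println(range);
        }
    }
}
```

The ranges are contiguous and do not overlap, so together they cover every byte of the file exactly once. An empty file yields no ranges.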
Copyright © 2020 Actian Corporation. All rights reserved.