Module datarush.library
Package com.pervasive.datarush.operators.io.textfile
Provides operators for reading and writing text data. A number
of text formats are supported, including:
- Delimited text, where fields are separated by a known character, as in .csv and .tsv files.
- Fixed text, where fields are located at fixed offsets in every record.
- ARFF, Attribute-Relation File Format, a format commonly used in data analytics.
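The difference between the delimited and fixed layouts described above can be illustrated with a minimal plain-Java sketch. It does not use the operators in this package; the class `TextFieldSketch` and its methods `splitDelimited` and `splitFixed` are hypothetical names introduced only for illustration, and the delimited variant deliberately ignores quoting and escaping.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration only; not part of com.pervasive.datarush.operators.io.textfile.
final class TextFieldSketch {

    /** Splits one record on a single delimiter character (no quoting or escaping). */
    static List<String> splitDelimited(String record, char delimiter) {
        List<String> fields = new ArrayList<>();
        int start = 0;
        for (int i = 0; i < record.length(); i++) {
            if (record.charAt(i) == delimiter) {
                fields.add(record.substring(start, i));
                start = i + 1;
            }
        }
        // The last field has no trailing delimiter.
        fields.add(record.substring(start));
        return fields;
    }

    /** Cuts one record into consecutive fields of the given widths, trimming padding. */
    static List<String> splitFixed(String record, int[] widths) {
        List<String> fields = new ArrayList<>();
        int offset = 0;
        for (int width : widths) {
            // Clamp both ends so a short record yields empty trailing fields.
            int begin = Math.min(offset, record.length());
            int end = Math.min(offset + width, record.length());
            fields.add(record.substring(begin, end).trim());
            offset += width;
        }
        return fields;
    }

    public static void main(String[] args) {
        System.out.println(splitDelimited("1,Ada,London", ','));
        // Build a fixed-width record with fields padded to 5, 10 and 6 characters.
        String fixed = String.format("%-5s%-10s%-6s", "1", "Ada", "London");
        System.out.println(splitFixed(fixed, new int[] {5, 10, 6}));
    }
}
```

In the delimited case the field boundaries are found by scanning the data; in the fixed case they are known in advance from the widths, so no separator character is needed.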
Interface Summary
- FieldDelimiterSpecifier: A collection of structural information about a text file consisting of delimited-field records.
- LogFormat
- RecordSeparatorSpecifier: A collection of structural information about a text file consisting of records.
Class Summary
- AbstractRegexLogFormat: Skeleton implementation of a regular expression based LogFormat class.
- AbstractTextReader: A generic reader of text data representing a stream of records.
- AbstractTextWriter: A generic writer of text data representing a stream of records.
- ARFFAnalyzer: An analyzer for ARFF files.
- ARFFAnalyzer.Analysis: Contains the results of analyzing an ARFF file.
- ARFFDataFormat: Describes an instance of ARFF data.
- CLFLogFormat: Describes the format of a web server log in NCSA Common log format.
- CombinedLogFormat: Describes the format of a web server log in NCSA Combined log format.
- DelimitedTextAnalyzer: An analyzer for files containing delimited text.
- DelimitedTextAnalyzer.Analysis: Contains the results of an analysis of a delimited text file.
- DelimitedTextFormat: Describes the format of a delimited text file.
- ELFAnalyzer: A format analyzer for Extended Log Format files.
- ELFAnalyzer.ELFAnalysis
- ELFLogFormat: Describes the format of a web server log in W3C Extended log format.
- FieldDelimiterSettings: A container for structural information about a text file containing delimited records.
- FixedTextFormat: Describes the format of a fixed text file.
- FormatAnalyzer: A generic format analyzer for log files.
- FormatAnalyzer.FormatAnalysis: Contains the results of analyzing a file.
- GenericLogFormat: Describes the format of a generic log that can be parsed using a regular expression.
- GlassfishLogFormat: Describes the format of a Glassfish log file.
- JSONFormat
- JsonHelper: A simple JSON helper utility.
- JsonRecordReader: A utility class used to read and parse JSON text records.
- JsonRecordReader.JsonFieldReader
- JsonTextAnalyzer: An analyzer for files containing JSON text.
- JsonTextAnalyzer.JsonAnalysis: Contains the results of the analysis of a JSON text file.
- Log4jLogFormat: Describes the format of a log4j log file.
- Log4jPatternParser
- LogDataFormat
- ParseTextFields: Parses input text records according to a specified text schema.
- ReadARFF: Read files in the Attribute-Relation File Format (ARFF).
- ReadDelimitedText: Reads a text file of delimited records as record tokens.
- ReadFixedText: Reads a text file of fixed-width records as record tokens.
- ReadJSON: The ReadJSON operator reads a JSON file of key-value pairs or array of objects as record tokens.
- ReadLog: Reads a log file as record tokens.
- RecordSeparatorSettings: A container for structural information about a file containing text records.
- SyslogLogFormat: Describes the format of a syslog log file.
- WriteARFF: Write files using the Attribute-Relation File Format (ARFF).
- WriteDelimitedText: Writes a stream of records as delimited text.
- WriteFixedText: Writes a record dataflow as a text file of fixed-width records.
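Several of the classes above describe log formats that are parsed with regular expressions, including the NCSA Common log format. As a rough illustration of that idea, the plain-Java sketch below matches one Common log format line with `java.util.regex`. It is not the implementation of CLFLogFormat or AbstractRegexLogFormat; the class `CommonLogSketch`, its pattern, and its `parse` method are assumptions made for this example.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical illustration only; not the library's CLFLogFormat.
final class CommonLogSketch {

    // Field order: host ident authuser [date] "request" status bytes
    private static final Pattern CLF = Pattern.compile(
        "^(\\S+) (\\S+) (\\S+) \\[([^\\]]+)\\] \"([^\"]*)\" (\\d{3}) (\\d+|-)$");

    /** Returns the seven captured fields, or null if the line does not match. */
    static String[] parse(String line) {
        Matcher m = CLF.matcher(line);
        if (!m.matches()) {
            return null;
        }
        String[] fields = new String[m.groupCount()];
        for (int i = 0; i < fields.length; i++) {
            fields[i] = m.group(i + 1); // capture groups are 1-based
        }
        return fields;
    }

    public static void main(String[] args) {
        String line = "127.0.0.1 - frank [10/Oct/2000:13:55:36 -0700] "
            + "\"GET /apache_pb.gif HTTP/1.0\" 200 2326";
        String[] fields = parse(line);
        System.out.println(fields == null ? "no match" : String.join(" | ", fields));
    }
}
```

Each capture group corresponds to one output field, which is why a regular expression is a convenient way to describe a line-oriented log format.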
Enum Summary
- ARFFMode: Defines valid representation types for ARFF instance data.
- RecordLengthErrorAction: The actions supported when a record length error occurs.
- SupportedLogType: Enumerated type that provides factory constructors for the supported log formats.
Exception Summary
- InvalidSchemaException: Exception that indicates a schema definition is invalid.
- ParseException: Exception indicating a parsing error.
- RowTooLongException: Exception thrown when the number of characters in the first row of a delimited text file exceeds the maximum allowed.