Module datarush.library
Class CombinedLogFormat
java.lang.Object
com.pervasive.datarush.operators.io.textfile.AbstractRegexLogFormat
com.pervasive.datarush.operators.io.textfile.CombinedLogFormat
All Implemented Interfaces:
LogFormat
Describes the format of a web server log in
NCSA Combined log format. The format pattern
specifies whether or not the cookie field is
present in the data.
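As an illustration of what this format looks like on disk, the following is a minimal, standalone sketch that parses one NCSA Combined log line with `java.util.regex`, with a flag for the optional trailing cookie field. It is not the DataRush implementation: the class name `CombinedLogSketch`, the method `parse`, the regular expression, and the field names are all assumptions made for this example, and the schema produced by `CombinedLogFormat` may name or type its fields differently.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper, not part of the DataRush library.
class CombinedLogSketch {
    // host ident authuser [timestamp] "request" status bytes "referer" "user-agent"
    private static final String BASE =
        "^(\\S+) (\\S+) (\\S+) \\[([^\\]]+)\\] \"([^\"]*)\" (\\d{3}) (\\d+|-)"
        + " \"([^\"]*)\" \"([^\"]*)\"";

    private static final Pattern WITHOUT_COOKIE = Pattern.compile(BASE + "$");
    // Same layout with one extra quoted field at the end for the cookie.
    private static final Pattern WITH_COOKIE = Pattern.compile(BASE + " \"([^\"]*)\"$");

    // Field names chosen for this sketch only.
    private static final String[] FIELDS = {
        "host", "ident", "authuser", "timestamp", "request",
        "status", "bytes", "referer", "userAgent", "cookie"
    };

    /** Parses one log line; cookiePresent selects which pattern is applied. */
    static Map<String, String> parse(String line, boolean cookiePresent) {
        Matcher m = (cookiePresent ? WITH_COOKIE : WITHOUT_COOKIE).matcher(line);
        if (!m.matches()) {
            throw new IllegalArgumentException("Not a combined log line: " + line);
        }
        Map<String, String> record = new LinkedHashMap<>();
        for (int i = 1; i <= m.groupCount(); i++) {
            record.put(FIELDS[i - 1], m.group(i));
        }
        return record;
    }

    public static void main(String[] args) {
        String line = "127.0.0.1 - frank [10/Oct/2000:13:55:36 -0700] "
            + "\"GET /apache_pb.gif HTTP/1.0\" 200 2326 "
            + "\"http://www.example.com/start.html\" \"Mozilla/4.08 [en] (Win98; I ;Nav)\"";
        System.out.println(parse(line, false));
        System.out.println(parse(line + " \"session=abc\"", true));
    }
}
```

Keeping the cookie decision as a single switch mirrors the role the description above assigns to the format pattern: the same line layout is used in both cases, and only the presence of the final field changes.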
Nested Class Summary
Nested classes/interfaces inherited from class com.pervasive.datarush.operators.io.textfile.AbstractRegexLogFormat
AbstractRegexLogFormat.RegexParser
Field Summary
Fields inherited from class com.pervasive.datarush.operators.io.textfile.AbstractRegexLogFormat
formatPattern, logType
Constructor Summary
CombinedLogFormat(): Create a log format for accessing combined log format data.
CombinedLogFormat(String formatPattern): Create a log format for accessing combined log format data.
Method Summary
DataFormat.DataParser createParser(ParsingOptions options, CharsetEncoding charEncoding, String newline)
getSchema(): Gets the record schema of the source.
getType(): Gets the record type associated with the format.
boolean isSplittable(): Indicates if the format supports parsing of subsections of a file.
protected void refreshSchema(): Refresh and recalculate the schema.
Methods inherited from class com.pervasive.datarush.operators.io.textfile.AbstractRegexLogFormat
analyzeFormat, getFormatPattern, getLogType, setAnalysis, setFormatPattern
Field Details
schema
Constructor Details
CombinedLogFormat
public CombinedLogFormat()
Create a log format for accessing combined log format data.
CombinedLogFormat
Create a log format for accessing combined log format data.
Parameters:
formatPattern - boolean string that determines if cookies are present
Method Details
getType
Description copied from interface: LogFormat
Gets the record type associated with the format. Records produced by the associated parser or consumed by the associated formatter will be of this type.
For many formats, this may be derived from a schema object describing the format layout.
Returns: the format's record type
getSchema
Description copied from class: AbstractRegexLogFormat
Gets the record schema of the source.
Specified by: getSchema in class AbstractRegexLogFormat
Returns: the record schema of the source
refreshSchema
protected void refreshSchema()
Description copied from class: AbstractRegexLogFormat
Refresh and recalculate the schema. This is usually done after changing a setting.
Specified by: refreshSchema in class AbstractRegexLogFormat
isSplittable
public boolean isSplittable()
Description copied from interface: LogFormat
Indicates if the format supports parsing of subsections of a file.
A format should only return true if it can, at least in some situations, support this sort of parsing. If a format requires reading the entire file, it must return false.
If a format is not splittable, a file in the format cannot be parsed in parallel; however, individual files can still be parsed independently in parallel, as when reading the contents of a directory or using a file globbing pattern.
Specified by: isSplittable in interface LogFormat
Overrides: isSplittable in class AbstractRegexLogFormat
Returns: true if the format supports parsing only a portion of the file, false otherwise
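To make "parsing of subsections of a file" concrete, here is a minimal sketch of one common way a line-oriented format can be read in independent byte ranges: each range owns exactly the lines that start inside it. This illustrates the general technique only and is not taken from the DataRush source; the class `SplitReadSketch` and the method `readSplit` are hypothetical names, and the sketch assumes records are terminated by a single `\n` byte.

```java
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of the DataRush library.
class SplitReadSketch {
    /** Returns the lines whose first byte lies in the range [start, end) of data. */
    static List<String> readSplit(byte[] data, int start, int end) {
        int pos = start;
        // A range that does not begin at offset 0 may start mid-line. That partial
        // line belongs to the previous range, so advance to the next line start.
        if (start > 0) {
            while (pos < data.length && data[pos - 1] != '\n') {
                pos++;
            }
        }
        List<String> lines = new ArrayList<>();
        // Read every line that starts before 'end', even if it finishes after it.
        while (pos < end && pos < data.length) {
            int lineEnd = pos;
            while (lineEnd < data.length && data[lineEnd] != '\n') {
                lineEnd++;
            }
            lines.add(new String(data, pos, lineEnd - pos, StandardCharsets.UTF_8));
            pos = lineEnd + 1; // step over the newline
        }
        return lines;
    }

    public static void main(String[] args) {
        byte[] data = "a1\nb22\nc333\n".getBytes(StandardCharsets.UTF_8);
        // Two ranges that cut the second line in half.
        System.out.println(readSplit(data, 0, 5));
        System.out.println(readSplit(data, 5, data.length));
    }
}
```

Because every line is claimed by exactly one range, the ranges can be handed to separate workers without coordination. A format whose records cannot be located without reading from the beginning of the file has no equivalent of the "advance to the next line start" step, which is the situation in which a format must report that it is not splittable.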
createParser
public DataFormat.DataParser createParser(ParsingOptions options, CharsetEncoding charEncoding, String newline)