public class ExpandTextTokens extends ExecutableOperator implements RecordPipelineOperator
Constructor and Description |
---|
ExpandTextTokens()
Default constructor.
|
ExpandTextTokens(String textField)
Constructor specifying the tokenized text field to expand.
|
ExpandTextTokens(String textField,
TextElementType tokenType)
Constructor specifying the tokenized text field to expand and
the type of token to expand.
|
Modifier and Type | Method and Description |
---|---|
protected void |
computeMetadata(StreamingMetadataContext ctx)
Implementations must adhere to the following contracts
|
protected void |
execute(ExecutionContext ctx)
Executes the operator.
|
RecordPort |
getInput()
Get the input port of this operator.
|
String |
getInputField()
Get the tokenized text field to expand.
|
RecordPort |
getOutput()
Get the output port of this operator.
|
String |
getOutputField()
Get the string output field.
|
TextElementType |
getTokenType()
Get the type of text token to expand.
|
void |
setInputField(String textField)
Set the tokenized text field to expand.
|
void |
setOutputField(String tokenField)
Set the string output field.
|
void |
setTokenType(TextElementType tokenType)
Set the type of text token to expand.
|
cloneForExecution, getNumInputCopies, getPortSettings, handleInactiveOutput
disableParallelism, getInputPorts, getOutputPorts, newInput, newInput, newOutput, newRecordInput, newRecordInput, newRecordOutput, notifyError
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
disableParallelism, getInputPorts, getOutputPorts
public ExpandTextTokens()
setInputField(String)
} and
setOutputField(String)
to set the name
of the text field to expand and its output field.public ExpandTextTokens(String textField)
textField
- name of the field to expandpublic ExpandTextTokens(String textField, TextElementType tokenType)
textField
- name of the field to expandtokenType
- type of token to expandpublic void setInputField(String textField)
If this field does not exist in the input, or is not of type TokenizedText, an exception will be thrown at composition time.
textField
- name of the field to expandpublic String getInputField()
public void setOutputField(String tokenField)
tokenField
- The name of the string output fieldpublic String getOutputField()
public void setTokenType(TextElementType tokenType)
tokenType
- type of token to expandpublic TextElementType getTokenType()
public RecordPort getInput()
getInput
in interface PipelineOperator<RecordPort>
public RecordPort getOutput()
getOutput
in interface PipelineOperator<RecordPort>
protected void computeMetadata(StreamingMetadataContext ctx)
StreamingOperator
StreamingMetadataContext.parallelize(ParallelismStrategy)
.
RecordPort#setRequiredDataOrdering
, otherwise data may arrive in any order.
RecordPort#setRequiredDataDistribution
, otherwise data will arrive in an unspecified partial distribution
.
RecordPort#getSourceDataDistribution
and RecordPort#getSourceDataOrdering
. These should be
viewed as a hints to help chose a more efficient algorithm. In such cases, though, operators must
still declare data ordering and data distribution requirements; otherwise there is no guarantee that
data will arrive sorted/distributed as required.
RecordPort#setType
.RecordPort#setOutputDataOrdering
RecordPort#setOutputDataDistribution
AbstractModelPort#setMergeHandler
.MergeModel
is a convenient, re-usable model reducer, parameterized with
a merge-handler.
SimpleModelPort
's have no associated metadata and therefore there is
never any output metadata to declare. PMMLPort
's, on the other hand,
do have associated metadata. For all PMMLPorts, implementations must declare
the following:
PMMLPort.setPMMLModelSpec
.
computeMetadata
in class StreamingOperator
ctx
- the contextprotected void execute(ExecutionContext ctx)
ExecutableOperator
execute
in class ExecutableOperator
ctx
- context in which to lookup physical ports bound to logical portsCopyright © 2021 Actian Corporation. All rights reserved.