public final class NormalizeValues extends AbstractRecordCompositeOperator
setIncludeInputFields(boolean)
.
Normalization methods require certain statistics about the input data such as the
mean, standard deviation, minimum value, maximum value and so on. These statistics
are captured in a PMMLModel. The statistics can be gathered by an upstream operator such as
SummaryStatistics
and passed into this operator. If not, they will be
calculated with a first pass over the data and then applied in a second pass.
SummaryStatistics
input, output
Constructor and Description |
---|
NormalizeValues() |
Modifier and Type | Method and Description |
---|---|
protected void |
compose(CompositionContext ctx)
Compose the body of this operator.
|
boolean |
getIncludeInputFields()
Get the property that specifies whether or not to
include the input fields in the output data.
|
RecordPort |
getInput()
Returns the input port
|
StatsFunctions.NormalizeMethod |
getMethod()
Get the normalization method configured.
|
PMMLPort |
getModelInput()
Get the optional input port used to read the PMML model
containing field statistics needed by normalization methods.
|
RecordPort |
getOutput()
Returns the output port
|
List<String> |
getScoreFields()
Get the names of fields configured to be normalized.
|
void |
setIncludeInputFields(boolean includeInputFields)
Set the indicator of whether or not to include the input fields
in the output data.
|
void |
setMethod(StatsFunctions.NormalizeMethod method)
Set the normalization method to use.
|
void |
setScoreFields(List<String> scoreFields)
Set the names of the input fields to normalize.
|
void |
setScoreFields(String... scoreFields)
Set the names of the input fields to normalize.
|
disableParallelism, getInputPorts, getOutputPorts, newInput, newInput, newOutput, newRecordInput, newRecordInput, newRecordOutput, notifyError
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
disableParallelism, getInputPorts, getOutputPorts
public PMMLPort getModelInput()
This port is optional. If a statistics model is not provided
the needed statistics will be calculated using SummaryStatistics
.
public RecordPort getInput()
PipelineOperator
getInput
in interface PipelineOperator<RecordPort>
getInput
in class AbstractRecordCompositeOperator
public RecordPort getOutput()
PipelineOperator
getOutput
in interface PipelineOperator<RecordPort>
getOutput
in class AbstractRecordCompositeOperator
public void setScoreFields(List<String> scoreFields)
scoreFields
- names of fields to normalizepublic void setScoreFields(String... scoreFields)
scoreFields
- names of fields to normalizepublic List<String> getScoreFields()
public void setMethod(StatsFunctions.NormalizeMethod method)
method
- normalization methodpublic StatsFunctions.NormalizeMethod getMethod()
public boolean getIncludeInputFields()
public void setIncludeInputFields(boolean includeInputFields)
true
causes the input values to be transferred to the output. Otherwise
the input values are excluded leaving only the transformed fields
in the output data.
This value is true
by default.
includeInputFields
- true if input fields are included; false if excludedprotected void compose(CompositionContext ctx)
CompositeOperator
OperatorComposable.add(O)
OperatorComposable.connect(P, P)
. This includes
connections from the composite's input ports to sub-operators, connections between sub-operators, and
connections from sub-operators output ports to the composite's output portscompose
in class CompositeOperator
ctx
- the contextCopyright © 2016 Actian Corporation. All rights reserved.