Class DiscoverDuplicates

    • Constructor Detail

      • DiscoverDuplicates

        public DiscoverDuplicates()
        Discover duplicates using initial defaults.
      • DiscoverDuplicates

        public DiscoverDuplicates​(Index index,
                                  List<Phase> phases)
        Discover duplicates using multiple phases of comparison, classifying and filtering.
        Parameters:
        index - properties used to index the input data
        phases - definition of phases for field comparisons
    • Method Detail

      • getInput

        public RecordPort getInput()
        Gets the record port providing input to the operation.
        Returns:
        the input port for the operation
      • setPhases

        public void setPhases​(List<Phase> phases)
        Sets the phases of comparison, classifying and filtering used to determine matches.
        Parameters:
        phases - definition of phases for field comparisons
      • getOutput

        public RecordPort getOutput()
        Gets the record port providing the output from the operation.
        Returns:
        the output port for the operation
      • setIndex

        public void setIndex​(Index index)
        Sets the pair generation method for determining initial candidate matches.
        Parameters:
        index - properties used to index the input data
      • getIndex

        public Index getIndex()
        Gets the pair generation method for determining initial candidate matches.
        Returns:
        properties used to index the input data
      • getPhases

        public List<Phase> getPhases()
        Gets the phases of comparison, classifying and filtering used to determine matches.
        Returns:
        definition of phases for field comparisons
      • compose

        protected void compose​(CompositionContext ctx)
        Description copied from class: CompositeOperator
        Compose the body of this operator. Implementations should do the following:
        1. Perform any validation of configuration, input types, etc
        2. Instantiate and configure sub-operators, adding them to the provided context via the method OperatorComposable.add(O)
        3. Create necessary connections via the method OperatorComposable.connect(P, P). This includes connections from the composite's input ports to sub-operators, connections between sub-operators, and connections from sub-operators output ports to the composite's output ports
        Specified by:
        compose in class CompositeOperator
        Parameters:
        ctx - the context