Intraflow Match Report
Intraflow Match locates matches between similar data records within a single input stream. You can create hierarchical rules based on any fields that have been defined or created in other stages of the dataflow.
Reporting
The Intraflow Match job allows you to monitor the results of the job. The counters available are:- INPUT_RECORDS
- The number of records in the matching stage before the matching sort is performed.
- DUPLICATE_RECORDS
- The number of duplicate records within a match group, which can be either a suspect or a candidate record.
- UNIQUE_RECORDS
- The number of suspect or candidate records which do not match any other records in their
respective match group.
If it is the only record in a match group, a suspect is automatically unique.
- MATCH_GROUPS
- (Group By) Records grouped together by a match key.
- DUPLICATE_COLLECTIONS
- The number of duplicate collections, which consist of a suspect and its duplicate records grouped together by a CollectionNumber.
- EXPRESS_MATCHES
- The number of Express Matches made in a collection.
An Express Match is made when a suspect and candidate have an exact match on the contents of a designated field, usually an ExpressMatchKey provided by the Match Key Generator. If an Express Match is made, no further processing is done to determine if the suspect and candidate are duplicates.
- AVERAGE_SCORE
- The average match score of all duplicates.
The possible values are 0-100, with 0 indicating a poor match and 100 indicating an exact match.
- TOTAL_DUPLICATES
- The total number of duplicates found.
- TOTAL_SCORE
- The total match score of all duplicates.