This procedure shows how to create a dataflow that takes personal name data (for example "John P. Smith"), identifies common nicknames of the same name, and create a standard version of the name that can then be used to consolidate redundant records.
Note: Before beginning, make sure that your input data has a field named "Name" that contains the full name of the person.
-
If you have not already done so, load the following tables onto the Spectrumâ„¢ Technology Platform server:
- Open Parser Base
- Open Parser Enhanced Names
Use the Data Normalization Module's database load utility to load these tables. For instructions on loading tables, see the Installation Guide.
-
In Enterprise Designer, create a new dataflow.
-
Drag a source stage onto the canvas.
-
Double-click the source stage and configure it. See the Dataflow Designer's Guide for instructions on configuring source stages.
-
Drag an Open Name Parser stage onto the canvas and connect it to the source stage.
For example, if you are using a Read from File stage, your dataflow would look like this:
-
Drag a Table Lookup stage onto the canvas and connect it to the Open Name Parser stage.
Your dataflow should now look like this:
-
Double-click the Table Lookup stage on the canvas.
-
In the Source field, select FirstName.
-
In the Destination field, select FirstName.
By specifying the same field as both the source and destination, the field will be updated with the standardized version of the name.
-
In the Table field, select NickNames.xml.
-
Click OK.
-
Click OK again to close the Table Lookup Options window.
-
Drag a sink stage onto the canvas and connect it to the Table Lookup stage.
For example, if you were using a Write to File sink, your dataflow would now look like this:
-
Double-click the sink stage and configure it. See the Dataflow Designer's Guide for instructions on configuring source stages.
You now have a dataflow that takes personal names and standardizes the first name, replacing nicknames with the standard form of the name.