Components of the SDK Java API
The key components to use a Spectrum™ Data & Address Quality for Big Data SDK job using the Java API are:
- JAR Files
-
- Hadoop JAR files.
- The JAR files of the module to which the desired Spectrum™ Data & Address Quality for Big Data SDK
job belongs, as indicated in the table:
Module Job JAR File Advanced Matching Module All AMM jobs amm.core-12.2.jar Data Integration Module All DIM jobs dim.core-12.2.jar Data Normalization Module All DNM jobs dnm.core-12.2.jar Global Addressing Module Global Address Validation gam-global addressvalidation.core-12.2.jar Universal Addressing Module Validate Address uam-universaladdress.core-12.2.jar Universal Addressing Module Validate Address Global uam-global.core-12.2.jar Universal Addressing Module Validate Address Loqate uam-loqate.core-12.2.jar Universal Name Module All UNM jobs unm.core-12.2.jar
- Configuration Files
- Files in XML format containing all parameters and values required to run a job,
including match rules, input file details, output file details, and MapReduce or Spark
configuration details.
Sample configuration XML files are placed at the location <Big Data Quality bundle>\samples\configuration.
- Client Java Application
- Java application to use the API to create and run the required Spectrum™ Data & Address Quality for Big Data SDK job provided by its Java API.
- Hadoop Platform
- The created job accesses the configured Hadoop platform to access input data and dump the output data in a file.