Overview

Use the link in your welcome email to download the ZIP file. A typical installer ZIP file is downloaded, named like BigDataSDK120F0101.zip.

Extract the contents of the downloaded ZIP file on your machine to access the installer, and run the installer which guides you through the installation process. Once installed, the SDK tool is added in your system and placed at the defined location.

You can then import the Big Data Quality SDK JAR file into your project and start accessing the APIs from your machine.

Supported Modules

Big Data Quality SDK supports the modules.
  1. Advanced Matching Module
  2. Data Normalization Module
  3. Universal Name Module
  4. Universal Addressing Module
    Note: You must start the Acushare service before creating the first Validate Address job of the Universal Addressing Module. For more information, see Running Acushare Service.

SDK Usage

The SDK can currently be used through:

  1. Java APIs
    • MapReduce API
    • Spark API
  2. Hive User-Defined Functions