Supported job configurations

This table lists the supported Spark job configurations on Windows and Linux platforms:

Windows Linux
Stages Yarn+Client Yarn+Cluster Spark+Client Yarn+Client Yarn+Cluster Spark+Client
MatchKeyGenerator Supported Supported Supported Supported Supported Supported
Intraflow Match Supported Supported Supported Supported Supported Supported
Interflow Match Supported Supported Supported Supported Supported Supported
Transactional Match Supported Supported Supported Supported Supported Supported
Table Lookup Supported Supported Supported Supported Supported Supported
Advanced Transformer Supported Supported Supported Supported Supported Supported
Open Name Parser Supported Supported Supported Supported Supported Supported
Duplicate Synchronization Supported Supported Supported Supported Supported Supported
Filter Supported Supported Supported Supported Supported Supported
Best of Breed Supported Supported Supported Supported Supported Supported
Open Parser Supported Supported Supported Supported Supported Supported
Address Validation (GAV)
Groovy Supported Supported Supported Supported Supported Supported
Joiner Supported Supported Supported Supported Supported Supported
Validate Address Global (AD) Supported Supported Supported Supported Supported Supported
Validate Address (C1P) Supported Supported Supported
Note:
  • GAV and C1P stages are not supported on Windows platform where spectrum installation directory contains spaces.
  • GAV and C1P stages behavior need to verify again on windows platform where spectrum installation directory does not contain any spaces.
  • Reference data on HDFS strategy is not supported.
  • Addressing job with group by feature is not supported in case of Yarn+Cluster mode.