Configuration Files

Table 1. inputFileConfig
Parameter	Description
pb.bdq.input.type	Input file type. The values can be: `file`, `TEXT`, or `ORC`.
pb.bdq.inputfile.path	The path where you have placed the input file on HDFS. For example, /user/hduser/sampledata/groovy/input/groovy_Input.csv
textinputformat.record.delimiter	File record delimiter used in the text type input file. For example, `LINUX`, `MACINTOSH`, or `WINDOWS`.
pb.bdq.inputformat.field.delimiter	Field or column delimiter used in the input file, such as comma (`,`) or tab.
pb.bdq.inputformat.text.qualifier	Text qualifiers, if any, in the columns or fields of the input file.
pb.bdq.inputformat.file.header	Comma-separated value of the headers used in the input file.
pb.bdq.inputformat.skip.firstrow	Specifies whether the first row is skipped during processing. The values can be `True` or `False`, where `True` means the first row is skipped.
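
The parameters in Table 1 can be pictured as a small set of key-value pairs. The sketch below assumes the file uses properties-style `key=value` lines, which this section does not confirm. The header names and the `parse_properties` helper are hypothetical and only illustrate how the header list and the skip flag might be interpreted.

```python
# Hypothetical contents of an inputFileConfig file, ASSUMING key=value lines.
# The header names are illustrative, not taken from a real input file.
SAMPLE_INPUT_CONFIG = """\
pb.bdq.input.type=TEXT
pb.bdq.inputfile.path=/user/hduser/sampledata/groovy/input/groovy_Input.csv
textinputformat.record.delimiter=LINUX
pb.bdq.inputformat.field.delimiter=,
pb.bdq.inputformat.file.header=AddressLine1,AddressLine2
pb.bdq.inputformat.skip.firstrow=True
"""


def parse_properties(text):
    """Parse key=value lines into a dict, skipping blank lines and # comments."""
    props = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        # Split on the first "=" only, so values may contain "=".
        key, _, value = line.partition("=")
        props[key.strip()] = value.strip()
    return props


input_config = parse_properties(SAMPLE_INPUT_CONFIG)

# The header is a comma-separated list of column names.
header_fields = input_config["pb.bdq.inputformat.file.header"].split(",")

# `True` means the first row is skipped; compare case-insensitively.
skip_first_row = input_config["pb.bdq.inputformat.skip.firstrow"].lower() == "true"
```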

Table 2. scriptExecuterConfig
Parameter	Description
pb.bdq.job.type	This is a constant value that defines the job. The value for this job is: `CustomScript`.
pb.bdq.job.name	Name of the job. Default is `CustomScriptSample`.
pb.bdq.dim.date.pattern	Specifies the date pattern to be used in the job as: `M/d/yy`. Note: This is an optional property.
pb.bdq.dim.datetime.pattern	Specifies the date-time pattern to be used in the job as: `M/d/yy h:mm a`. Note: This is an optional property.
pb.bdq.dim.time.pattern	Specifies the time pattern to be used in the job as: `h:mm a`. Note: This is an optional property.
pb.bdq.dim.groovy.input.fields.0	Specifies the input fields and their data types in the format: `{"name":"<name of the field>","type":"<field type>"}`. For example, `{"name":"AddressLine2","type":"string"}`.
pb.bdq.dim.groovy.output.fields.0	Specifies the output fields and their data types in the format: `{"name":"<name of the field>","type":"<field type>"}`. For example, `{"name":"AddressLine2","type":"string"}`.
pb.bdq.dim.groovy.script.0	Path of the Groovy script to be executed. For example, /home/hduser/script/groovy.txt
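
The field descriptors used by `pb.bdq.dim.groovy.input.fields.0` and `pb.bdq.dim.groovy.output.fields.0` are JSON objects, so they can be checked before a job is submitted. The following is a minimal sketch using Python's standard `json` module. The `parse_field_descriptor` helper is hypothetical, and the rule that both `name` and `type` must be non-empty strings is an assumption drawn from the example in the table, not a documented requirement.

```python
import json


def parse_field_descriptor(value):
    """Parse one field descriptor and check its 'name' and 'type' entries.

    ASSUMPTION: both entries are required, non-empty strings.
    """
    field = json.loads(value)
    if not isinstance(field, dict):
        raise ValueError("field descriptor must be a JSON object")
    for key in ("name", "type"):
        if not isinstance(field.get(key), str) or not field[key]:
            raise ValueError(f"field descriptor needs a non-empty string '{key}'")
    return field


# The example descriptor from the table above.
descriptor = '{"name":"AddressLine2","type":"string"}'
field = parse_field_descriptor(descriptor)

# Serialize without extra whitespace, matching the compact form in the table.
compact = json.dumps(field, separators=(",", ":"))
```

Validating descriptors up front surfaces a malformed value, such as a missing `type`, as a local error instead of a job failure.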

Table 3. mapReduceConfig
Specifies the MapReduce configuration parameters.
Use this file to customize MapReduce parameters, such as mapreduce.map.memory.mb, mapreduce.reduce.memory.mb and mapreduce.map.speculative, as needed for your job.
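
As an illustration of the kind of overrides this file might carry, the sketch below collects the three parameters named above and applies basic sanity checks. The numeric values are placeholders rather than recommendations, and the `check_overrides` and `to_property_value` helpers are hypothetical. How the values are ultimately written to the file depends on the file format, which this section does not specify.

```python
# Illustrative overrides; the values are placeholders, not tuning advice.
mapreduce_overrides = {
    "mapreduce.map.memory.mb": 2048,
    "mapreduce.reduce.memory.mb": 4096,
    "mapreduce.map.speculative": False,
}


def to_property_value(value):
    """Render a Python value as a Hadoop-style property string."""
    if isinstance(value, bool):
        return "true" if value else "false"
    return str(value)


def check_overrides(overrides):
    """Validate memory sizes and speculative flags, then render them as strings."""
    for name, value in overrides.items():
        if name.endswith(".memory.mb"):
            # bool is a subclass of int in Python, so exclude it explicitly.
            if isinstance(value, bool) or not isinstance(value, int) or value <= 0:
                raise ValueError(f"{name} must be a positive integer (MB)")
        elif name.endswith(".speculative") and not isinstance(value, bool):
            raise ValueError(f"{name} must be a boolean")
    return {name: to_property_value(value) for name, value in overrides.items()}


rendered_overrides = check_overrides(mapreduce_overrides)
```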

Table 4. outputFileConfig
Parameter	Description
pb.bdq.output.type	Output file type. The values can be: `file`, `TEXT`, or `ORC`.
pb.bdq.outputfile.path	The path where you want the output file to be generated on HDFS. For example, /user/hduser/sampledata/groovy/output
pb.bdq.outputformat.field.delimiter	Field or column delimiter in the output file, such as comma (`,`) or tab.
pb.bdq.output.overwrite	If set to `true`, the output folder is overwritten every time the job is run.
pb.bdq.outputformat.headerfile.create	Specify `true` if the output file needs to have a header.
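
The output parameters in Table 4 could be assembled programmatically before a run. This sketch assumes the file uses properties-style `key=value` lines, which is not confirmed here. The `render_properties` helper is hypothetical and the chosen values are illustrative.

```python
# Illustrative output settings; the path is the example from Table 4.
output_config = {
    "pb.bdq.output.type": "TEXT",
    "pb.bdq.outputfile.path": "/user/hduser/sampledata/groovy/output",
    "pb.bdq.outputformat.field.delimiter": ",",
    "pb.bdq.output.overwrite": True,
    "pb.bdq.outputformat.headerfile.create": True,
}


def render_properties(config):
    """Render a dict as key=value lines (ASSUMED file format)."""
    lines = []
    for key, value in config.items():
        # Write booleans in lowercase, as the table uses `true`.
        if isinstance(value, bool):
            value = "true" if value else "false"
        lines.append(f"{key}={value}")
    return "\n".join(lines) + "\n"


rendered = render_properties(output_config)
```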