Write to Hadoop Sequence File
The Write to Hadoop Sequence File stage writes data to a sequence file as output from a dataflow. A sequence file is a flat file consisting of binary key/value pairs. For more information, go to wiki.apache.org/hadoop/SequenceFile.
File Properties Tab
Fields | Description |
---|---|
Server | Indicates the file you select in the File name field is located on the Hadoop system. You need to create a connection to the Hadoop file server in the Management Console before using it in the stage. If you select a file on the Hadoop system, the server name will be the name you specify in the Management Console while creating a file server. |
File name | Specifies the path to the file. Click the ellipses button (...) to browse to the file you want. |
Field separator |
Specifies the character used to separate fields in a delimited file. For example, this record uses a pipe (|) as a field separator:
These characters available to define as field separators are:
If the file uses a different character as a field separator, click the ellipses button to select another character as a delimiter. |
Text qualifier |
The character used to surround text values in a delimited file. For example, this record uses double quotes (") as a text qualifier.
The characters available to define as text qualifiers are:
If the file uses a different text qualifier, click the ellipses button to select another character as a text qualifier. |
Fields Tab
The Fields tab defines the names, positions, and types of fields in the file. For more information, see Defining Fields In an Output Sequence File