Sorting Input Records

In the Read from Hadoop Sequence File stage, the Sort Fields tab defines fields by which to sort the input records before they are sent into the dataflow. Sorting is optional.

  1. In Read from Hadoop Sequence File, click the Sort Fields tab.
  2. On the Sort Fields tab, click Add.
  3. Click the drop-down arrow in the Field Name column and select the field you want to sort by. The fields available for selection depend on the fields defined in this input file.
  4. In the Order column, select Ascending or Descending.
  5. Repeat until you have added all the input fields you wish to use for sorting. Change the order of the sort by highlighting the row for the field you wish to move and clicking Up or Down.