Backing up a Hadoop cluster

You can either schedule a backup job or run one manually. See the NetBackup Administrator's Guide, Volume I.

For an overview of the backup process, see Backing up Hadoop data.

The backup process comprises the following stages:

Pre-processing: The first backup host that you configured with the BigData policy triggers the discovery. At this stage, a snapshot of the complete backup selection is generated. The snapshot details are visible on the NameNode web interface.
Data transfer: One child job is created for each backup host.
Post-processing: NetBackup cleans up the snapshots on the NameNode.
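
The three stages above can be pictured with a small Python model. This is only an illustrative sketch, not NetBackup code: the names ChildJob, pre_process, plan_data_transfer, and post_process are hypothetical, the snapshot is stood in for by a plain list of paths, and the round-robin distribution of paths across backup hosts is an assumption made for the example. The only behavior taken from the description is that one child job exists per backup host and that the snapshot is removed at the end.

```python
from dataclasses import dataclass, field


@dataclass
class ChildJob:
    """Hypothetical record of the work assigned to one backup host."""
    backup_host: str
    paths: list = field(default_factory=list)


def pre_process(backup_selection):
    """Stand-in for discovery: return a sorted, de-duplicated listing
    of the backup selection, playing the role of the snapshot."""
    return sorted(set(backup_selection))


def plan_data_transfer(snapshot_paths, backup_hosts):
    """Create exactly one child job per backup host.
    Round-robin assignment is an assumption for illustration only."""
    if not backup_hosts:
        raise ValueError("at least one backup host is required")
    jobs = [ChildJob(host) for host in backup_hosts]
    for index, path in enumerate(snapshot_paths):
        jobs[index % len(jobs)].paths.append(path)
    return jobs


def post_process(active_snapshots, snapshot_name):
    """Stand-in for cleanup: remove the snapshot taken for this backup."""
    active_snapshots.discard(snapshot_name)
    return active_snapshots
```

For example, calling plan_data_transfer with the output of pre_process and two backup hosts returns two ChildJob objects, one per host, regardless of how many paths are in the selection.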